Joe Grund
@jgrund
Not at the moment, but we are looking at using containers to collocate iml with a storage-server.
I’ll update here if it works ok
Alex Talker
@AlexTalker
Hello! Can somebody help me understand how to deploy your software using Docker? I found the file https://github.com/whamcloud/integrated-manager-for-lustre/blob/master/docker/docker-compose.yml but docker-compose up tells me: ERROR: for setup Cannot create container for service setup: invalid mount config for type "bind": bind mount source path does not exist: /tmp/iml_pw. What is that file for?
Alex Talker
@AlexTalker
Okay, I figured that one out. Now I'm trying to rebuild the iml-node-libzfs package because it conflicts with my version of nodejs (and if I force its installation, it fails at runtime). But I'm stuck on the part where the libzfs-sys Rust package can't find libzfs_impl.h, just because it is in the libzfs folder in /usr/include. Does anybody know the full recipe for cooking this thing? I'm targeting CentOS 7.4.
LinuxLustre
@LinuxLustre
I am looking for some help on a project where I am trying to use the IML API to download Lustre performance metrics (bytes written to and read from the filesystem). I believe that this is possible because of this page: https://whamcloud.github.io/Online-Help/docs/api/rest_API.html. That document refers to downloading time series of data using the /metrics/ sub-URL, but I haven't been able to make this work yet. Would you be willing to help me see where I might be going wrong? I am running IML version 2.1.2, so that might be a complicating factor, but most things I am seeing with the API appear to be consistent with the current documentation. Thanks for any help you can give!
LinuxLustre
@LinuxLustre
Someone pointed me to this page:
whamcloud/integrated-manager-for-lustre#449
where @jgrund had already posted just what I needed. This URL got me the time-series data on read/write throughput:
https://url-to-manager-here/api/target/metric/?kind=OST&reduce_fn=sum&metrics=stats_read_bytes,stats_write_bytes&begin=2018-01-31T00:00:00.000Z
I wanted to say thanks and post it again in case this helps someone else in the future.
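For anyone scripting this, here is a minimal Python sketch that rebuilds the URL above from its parts. The helper names `build_metric_url` and `fetch_metrics` are made up for illustration and are not part of IML; the query parameters are only the ones visible in the URL quoted in this thread. Authentication, TLS verification, and the shape of the response are not covered in the chat, so `fetch_metrics` simply assumes the manager answers with JSON and leaves any headers to the caller.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_metric_url(manager, kind, metrics, begin, reduce_fn="sum"):
    """Build a /api/target/metric/ URL like the one quoted in the chat."""
    query = urlencode(
        {
            "kind": kind,
            "reduce_fn": reduce_fn,
            "metrics": ",".join(metrics),
            "begin": begin,
        },
        safe=",:",  # keep commas and colons unescaped, as in the quoted URL
    )
    return f"https://{manager}/api/target/metric/?{query}"


def fetch_metrics(url, headers=None):
    """GET the URL and decode the body.

    Assumption: the response is JSON. Any authentication headers
    your manager requires must be supplied by the caller.
    """
    with urlopen(Request(url, headers=headers or {})) as resp:
        return json.load(resp)


# Hostname is the same placeholder used in the chat message.
url = build_metric_url(
    "url-to-manager-here",
    kind="OST",
    metrics=["stats_read_bytes", "stats_write_bytes"],
    begin="2018-01-31T00:00:00.000Z",
)
print(url)
```

Calling `fetch_metrics(url)` against a real manager would then return the decoded time series, provided the assumptions above hold for your version.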
Joe Grund
@jgrund

@AlexTalker In regard to deploying with docker, we’ve intended it to be used with docker stack. Here is a doc for that:

https://whamcloud.github.io/Online-Help/docs/Install_Guide/ig_docker_stack.html

Alex Talker
@AlexTalker
@jgrund As far as I know, stack is good for deploying into a cluster, which seems to be possible with your architecture, but the project seems to be oriented toward a standalone installation (as if I'd used the rpm on, let's say, CentOS). Besides, debugging with compose is way easier. And I see no real difference between these approaches for your project.
@jgrund The production installation runs on a standalone server anyway, so I use this only as a temporary environment.
Joe Grund
@jgrund
Sure, no reason why you can’t use compose, just know stack is how we are using it for deployment
Alex Talker
@AlexTalker
@jgrund Also, can you tell me what you mean every time you write "test this please" in a PR? I get confused since I'm not part of your team and do not have access to the test infrastructure, though of course I test everything manually, otherwise there would be no PR.
@jgrund Regarding stack, do you share services between nodes or bind them all to one and the same?
Joe Grund
@jgrund
@AlexTalker sorry, not intended to say you haven’t tested :) It’s how we trigger jenkins runs for external contributions using this plugin: https://wiki.jenkins.io/display/JENKINS/GitHub+pull+request+builder+plugin
Alex Talker
@AlexTalker
@jgrund Wow, can't you mention the bot (which seems to exist) so it looks more targeted? Or won't it work that way?
Joe Grund
@jgrund
Yeah, the phrasing is unfortunate
I’ll check if I can have a custom trigger
Alex Talker
@AlexTalker
@jgrund Thanks. Also, the testing process quite often seems to fail due to dependency installation issues; you might need to pay more attention to such cases.
Joe Grund
@jgrund
Retriggered that run
Joe Grund
@jgrund

@jgrund Regarding stack, do you share services between nodes or bind them all to one and the same?

All on one node, for now anyway

Alex Talker
@AlexTalker
@jgrund Also, regarding whamcloud/integrated-manager-for-lustre#917: I have a suspicion that this fix somehow doesn't work on a production installation, and I think I saw the reverse situation there (the volume node associated with the target on the passive node was deleted after it failed back). But I could not reproduce that on Docker: I ran the scenario a few times and it always ended successfully, so I think this one is good to go whenever you decide it is appropriate.
But I'll look into it tomorrow I think, should be okay
@jgrund Also, regarding the device-scanner issue, if you remember: I dug into why multipath triggered events, and it seems an event is generated every time somebody opens a device-mapper device for writing and closes the file descriptor, even if nothing has been written. Since it requires deep kernel knowledge, I delegated this task, but you still might want to check whether the data you supply has actually changed and supply it only if it has.
@jgrund You can reproduce it by writing with dd to a multipath disk: the event is triggered every time dd finishes.
Alex Talker
@AlexTalker
@jgrund I do not know for sure, but I think this triggers device-aggregator on the server side, and since nothing changes, it is useless to go any further than the device-scanner process
Joe Grund
@jgrund
I’ll take a look at that.

In regards to filtering, I have an issue that I still need to get to for that:

whamcloud/device-scanner#193

Alex Talker
@AlexTalker
The issue seems to cover my problem
Unfortunately, we are required to interact with the devices that often by design. Yet another way to implement HA, you know
But if we figure out how to disable triggering this event, I'll let you know
Joe Grund
@jgrund
Are the UEvents emitted identical for each write?
Alex Talker
@AlexTalker
@jgrund The only thing that changes is the ID; there is no particular change in the data that I could notice via udevadm monitor -p. Can't remember what the ID is called right now :/
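The "only forward it if the data actually changed" idea from this exchange can be sketched roughly as follows. This is an illustration in Python, not device-scanner's actual code or the approach in whamcloud/device-scanner#193. The `ChangeFilter` class is hypothetical, and so is the set of ignored keys: the chat does not name the field that changes between events, so `SEQNUM` is used here purely as an assumed stand-in.

```python
import hashlib
import json

# Assumption: properties that differ on every event even when the
# device itself is unchanged. The chat does not name the field.
VOLATILE_KEYS = frozenset({"SEQNUM"})


class ChangeFilter:
    """Remember the last stable state per device and drop repeats."""

    def __init__(self, volatile_keys=VOLATILE_KEYS):
        self._volatile = frozenset(volatile_keys)
        self._last = {}  # device name -> fingerprint of last forwarded event

    def _fingerprint(self, props):
        # Hash every property except the volatile ones, in a stable order.
        stable = {k: v for k, v in props.items() if k not in self._volatile}
        blob = json.dumps(stable, sort_keys=True).encode("utf-8")
        return hashlib.sha256(blob).hexdigest()

    def should_forward(self, device, props):
        """Return True only if the non-volatile properties changed."""
        fingerprint = self._fingerprint(props)
        if self._last.get(device) == fingerprint:
            return False
        self._last[device] = fingerprint
        return True


# Made-up event properties for illustration.
events = ChangeFilter()
first = events.should_forward(
    "dm-0", {"ACTION": "change", "DM_NAME": "mpatha", "SEQNUM": "100"}
)
repeat = events.should_forward(
    "dm-0", {"ACTION": "change", "DM_NAME": "mpatha", "SEQNUM": "101"}
)
changed = events.should_forward(
    "dm-0", {"ACTION": "change", "DM_NAME": "mpathb", "SEQNUM": "102"}
)
print(first, repeat, changed)
```

Keeping one fingerprint per device means a repeated event costs a hash and a dictionary lookup, and nothing is sent downstream unless a non-volatile property differs.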
Amit Kumar
@ahkumar
@jgrund Hi Joe, I was able to get iml5 working after reprovisioning the OS to remove the previous version of postgres. I have IML up, and now I don't see the OSTs going offline
@jgrund this one looks great!!
Joe Grund
@jgrund
@ahkumar Ok, I can stop by and take a look at what’s happening next break
@ahkumar Thanks :)
Amit Kumar
@ahkumar
@jgrund thank you!!
Amit Kumar
@ahkumar
@jgrund Wondering if this doc https://whamcloud.github.io/Online-Help/docs/Contributor_Docs/cd_Installing_IML_On_Vagrant.html is available offline, so I can work without a connection?
Joe Grund
@jgrund

Not really. If you have IML installed, those docs are bundled in under the help link, but I suspect you want a standalone solution.

We have an open issue that describes a way to do so (which we plan to automate) here: whamcloud/Online-Help#142

But there are also plugins available that can download webpages for you in Chrome or Firefox if you want to go that route.

Amit Kumar
@ahkumar
@jgrund Cool, I'll try the other options you mentioned, thank you!!
Alex Talker
@AlexTalker
@jgrund whamcloud/device-scanner#270 We checked, and the udev event is triggered on closing a multipath device only on systems with IML. I've asked for someone to find out why systemd-udevd actually triggers the event.
Joe Grund
@jgrund
Ok, I’ll take a look
Amit Kumar
@ahkumar
@jgrund: I have a vagrant setup of the latest IML. Interestingly, after I resized my MDT to a bigger disk, I am not able to see the updated MDT size when I am selecting disks to create a file system. How can I get IML to redetect the targets and find the correct size? On the mds host itself I can see the updated disk size, but not in the web interface.
Joe Grund
@jgrund
@ahkumar Is this a ZFS or ldiskfs setup?
Amit Kumar
@ahkumar
@jgrund ldiskfs
Joe Grund
@jgrund
Using the mpath devices?
Amit Kumar
@ahkumar
yup
Joe Grund
@jgrund
How are you expanding the disk?
Amit Kumar
@ahkumar
I removed the original mdt created by the initial setup, then created a new fixed disk and attached it to the sata controller port where the original disk was connected on the iscsi host