Where communities thrive


  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
People
Activity
    Markus Döring
    @mdoering
    Hi @dimus !
    Dmitry Mozzherin
    @dimus
    :+1:
    Dmitry Mozzherin
    @dimus
    hey @mdoering , @olafbanki are we ready so start? Markus, remember we talked about protonym/basyonym refs file?
    Markus Döring
    @mdoering
    Hi Dima. I had been sidetracked but will work on the export for you when back home. End of the week looks reasonable
    is that good enough?
    Dmitry Mozzherin
    @dimus
    thanks @mdoering , sure, I will wait for your file. @olafbanki should we start with meetings next week then?
    olafbanki
    @olafbanki
    Yes that sounds good to me
    olafbanki
    @olafbanki
    @dimus Markus has a sick bay at home so has not been able to prepare the file
    I propose we only start with the standups if the starting file is there
    As a placeholder we could reserve this Friday 15:30 to 16:00, your 8:30 to 9:00
    Dmitry Mozzherin
    @dimus
    Good time for me @olafbanki , hope @mdoering will get better soon
    Markus Döring
    @mdoering
    hi guys, just exported all names to here: http://rs.gbif.org/datasets/backbone/col-names.txt.gz
    Dmitry Mozzherin
    @dimus
    thanks @mdoering!
    looks good
    olafbanki
    @olafbanki
    @mdoering @dimus can you confirm we are calling tomorrow at 15:30 to 16:00, illinois 8:30 to 9:00?
    Dmitry Mozzherin
    @dimus
    good for me @olafbanki
    Markus Döring
    @mdoering
    hm, friday afternoon? I wont make it tomorrow, still have a doc appointment then. Didnt we say tue/thu before the regular col call?
    olafbanki
    @olafbanki
    We can do that
    Dmitry Mozzherin
    @dimus
    it works for me well, unless we want BHL guys' participation
    tue or thu 7am illinois or so?
    olafbanki
    @olafbanki
    Next week Tuesday 14:00 will not be an option for me. But also happy if you Markus and Dima take the first call.
    Markus Döring
    @mdoering
    friday afternoon is just impossible for me. 7:30-8am?
    olafbanki
    @olafbanki
    I might make that
    Markus Döring
    @mdoering
    this monday maybe, Olaf?
    just before our col+ call
    olafbanki
    @olafbanki
    Yes Monday could be an option
    Dmitry Mozzherin
    @dimus
    7 illinois?
    Markus Döring
    @mdoering
    7:30 Illinois
    olafbanki
    @olafbanki
    7:30
    Dmitry Mozzherin
    @dimus
    ok sounds good
    Markus Döring
    @mdoering
    :+1:
    olafbanki
    @olafbanki
    great thanks
    Dmitry Mozzherin
    @dimus
    Mon 7:30 illinois
    olafbanki
    @olafbanki
    perfect
    Should we get Joel and Mike hooked up?
    Dmitry Mozzherin
    @dimus
    we can try, might be a bit too early for them
    olafbanki
    @olafbanki
    It will be 8:30 for DC maybe that is feasible
    Dmitry Mozzherin
    @dimus
    Right, might be OK for Joel. Mike is the same as us. Lets see what they say
    Dmitry Mozzherin
    @dimus
    I put together updated data from main sources including CoL from 2019 annual edition, GBIF from september etc. and ran name finding with updated algorithms. So now I have names data to work with, and we have public gRPC connection pointing to it https://bhlrpc.globalnames.org:80
    Today I tried to run gRPC client to get all texts. It went through in 3h30m for whole corpus downloading 57mil pages from the stream. So I think plumbing now works as it should. I will run it again now searching for 'sp. nov. and friends'
    Markus Döring
    @mdoering
    :+1:
    Dmitry Mozzherin
    @dimus
    Fast and dirty grep against whole corpus gave 1459064 lines with /sp\.\s?(nov|n)\./ and they are located in 25618 journals, which is ~10% of BHL so makes the job ~10x smaller