Dmitry Mozzherin
@dimus
He is not a computer scientist, but he is a very good taxonomist, and I found while working on the sci name parser that a taxonomist's help is invaluable
Also he is one of the first people who started to think about biodiversity from the point of view of informatics, and he was leading GlobalNames for almost 10 years
don't worry about cute applications -- that's not our task :)
Wencan Luo
@wencanluo
Good point. Domain knowledge will help a lot for this project
I will be offline for a few hours. You can leave me a message anytime.
Dmitry Mozzherin
@dimus
I'll be off for today soon too. We just finished the migration of EOL to the Smithsonian and I am tired
Wencan Luo
@wencanluo
Take a break! Talk to you later
Dmitry Mozzherin
@dimus
How is 10AM tomorrow for you for our first meeting?
Dmitry Mozzherin
@dimus
for managing tickets there is an interesting new development, ZenHub -- it has less overhead and is more integrated into GitHub
here is how it looks for the current NetiNeti; however, I wonder if we would be better off starting a new NetiNeti project, as you will not share a code base with the old one. Then I will rename the old NetiNeti to neti_neti_py
Wencan Luo
@wencanluo
Do you prefer to implement it with a new programming language?
Dmitry Mozzherin
@dimus
I like your suggestion to have it in Java
Wencan Luo
@wencanluo
Do you prefer to keep working with Python until a stable version? And then move to Java?
Because I'm more comfortable with Python and there are more NLP toolkits in Python
Wencan Luo
@wencanluo
OK. Let's start with python and rewrite the project in Java if needed in the end
Dmitry Mozzherin
@dimus
I need to add you to NetiNeti project
Wencan Luo
@wencanluo
my user name: wencanluo
Dmitry Mozzherin
@dimus
Python is fine, as speed is not super important at this stage
Wencan Luo
@wencanluo
do you want to merge my NetiNeti first? since currently there is a bug in the test code in your master
I can send a pull request
Dmitry Mozzherin
@dimus
I sent you an invitation
Wencan Luo
@wencanluo
done
the synchronization of Gitter has some problems. The messages are not ordered by time
Good! In the right panel, Gitter shows notifications of the recent activities.
Dmitry Mozzherin
@dimus
Looks like we did start our meeting :D
Let's decide on the organization of the code first
Wencan Luo
@wencanluo
ok
Dmitry Mozzherin
@dimus
What people often do -- they create a GSOC branch
I think in our case we can make GSOC-2015 and make it your 'master'
Wencan Luo
@wencanluo
Good idea
Dmitry Mozzherin
@dimus
can you access zenhub?
Wencan Luo
@wencanluo
yes
Dmitry Mozzherin
@dimus
it is right on the github page
awesome
:ok_hand:
let me set a branch
I cleaned up junk and set gsoc-2015
so let's keep this branch as your realm
Wencan Luo
@wencanluo
hold on. I'm figuring out how to fork the branch instead of the master
Dmitry Mozzherin
@dimus
and let me start Skype again -- besides logistics I would like us to have an 'idealized design' session and to abstract from implementation at first
One way to do it is to set up upstream and origin on your local repository
we can talk about that too
Wencan Luo
@wencanluo
good
Wencan Luo
@wencanluo
Let's continue. I will figure it out later
Dmitry Mozzherin
@dimus
ok, I'll get to my skype machine
Dmitry Mozzherin
@dimus
Here is an issue that reflects a use case for setting a status of a name -- https://github.com/jhpoelen/eol-globi-data/issues/132#issuecomment-94572336
Dmitry Mozzherin
@dimus
Wencan asked -- What kind of resources are available to use? Currently, is there a data set that has gold standard labels for such a task? Even a small one will be very helpful.
The task is actually not NetiNeti, but rather marking the names that we have in our database for GNI/resolver and figuring out their 'status'