These are chat archives for cltk/cltk

11th
Mar 2016
yhg0112
@yhg0112
Mar 11 2016 05:12
Hello, I'm yhg, currently a graduate student at SNU, South Korea. I'm working on neural machine translation (NMT), and I found you through Google Summer of Code. Your project looks very interesting from the perspective of NMT. After finishing a few projects, I've realized that one critical point in NMT is how much data is available. Would you let me know how much data you have, and what kind of data it is?
tlkononova
@tlkononova
Mar 11 2016 08:19
Hi, my name is Tatiana, and I'm a GSoC 2016 aspirant. I'm currently doing an MA in NLP in Moscow, and I have a BA in oriental studies. I would be happy to contribute to this exciting project. I can work with Modern (Classical) Persian or Ottoman Turkish (though data for the latter is much harder to get). My work would include collecting a corpus and re-implementing basic CLTK functionality for it.
Rajarshee Mitra
@rajarsheem
Mar 11 2016 16:44
Hi, is there any small task I could do for my application?
Kyle P. Johnson
@kylepjohnson
Mar 11 2016 22:31
@tlkononova You are the first to approach us with knowledge of Persian and Turkish. Would you please email me at kyle@kyle-p-johnson.com? Adding support, even partial, for these would be incredible.
@yhg0112 Here are our corpora: https://github.com/cltk
Are you familiar with any Classical languages? For the MT project, this would be important. Please email me for more.