These are chat archives for cltk/cltk

12th
Mar 2017
Shreyaa Sridhar
@shreyaasridhar
Mar 12 2017 04:10
I have a small problem.
I use python 2.7. But cltk requires py3. Is there anyway I can work with both python 2&3 installed in the same computer?
Ashir Borah
@EagleShot
Mar 12 2017 04:12
Yeah. Just install Python and run on the command line using the proper command
Parminder Singh
@Trion129
Mar 12 2017 04:17
@shreyaasridhar you can use python 3 using python3 and python 2.7 using python2 or python
Shreyaa Sridhar
@shreyaasridhar
Mar 12 2017 05:42
Yo
You mean directly install both?
ykl7
@ykl7
Mar 12 2017 05:44
@shreyaasridhar another way, which i find better, is to use virtual environments.
Kyle P. Johnson
@kylepjohnson
Mar 12 2017 05:47
@shreyaasridhar Please read the installation docs. This is covered very carefully there
Gautam Sabhahit
@lazycoder1
Mar 12 2017 07:04
Hi Kyle ,i had emailed asking about the possibility of implementing a POS tagger and a stemmer for sanskrit for the GSoC project. From what i have seen these have not been implemented yet. Has anyone taken up this or can i start researching on these topics and any resource you can recommend for me to implement it would be good too. Thank you
erzaliator
@erzaliator
Mar 12 2017 11:43
Hey @kylepjohnson ! I had mailed you earlier and posted on the mailing list again. So, here's the repo to the basic implemetation of word2vec for sanskrit: https://github.com/erzaliator/SanskritWord2Vec
a) I need to complete it for the entire ramayana corpus and then the rest of the sanskrit corpus. I am on it already.
b) I need to implement get_sims() as well; just as the latin word2vec.py
c) Other tasks are commented in the code itself. This code needs a lot of optimization.
d) All suggestions are open for the modifications.
e) I hope I'm going in the right direction. I shall be awaiting further instructions.
P.S. I am simultaneously going through the beginner's exercises.
Samriddhi Sinha
@djokester
Mar 12 2017 16:51
@kylepjohnson I DM'd you my project idea. Please do give me your feedback.
priyaraistar
@priyaraistar
Mar 12 2017 18:06
Guys, just need to inform you of something. There is just one week left for GSoC. Please utilize your time elsewhere. Kyle is known for accepting proposals that come from his own students. There are several members listed under the CLTK organisation. Ask any one of them and they will tell you the same thing. Please do not try to make it to this organisation. Your efforts will be wasted.
@djokester @erzaliator he won't reply to your mailed proposals. He didnt reply to mine either last year. He will merge all your PRs, but he will not accept or review your proposals.
Ashir Borah
@EagleShot
Mar 12 2017 18:09
Sorry to be that guy but he did reply to my emails
Samriddhi Sinha
@djokester
Mar 12 2017 18:26
Even to mine. He is taking time to review the proposals. Under GSoC guidelines he is expected to reply within 36 hours. And he does. So be a bit patient.
Manvendra Singh
@manu-chroma
Mar 12 2017 18:27
and he did reply to my emails last year. and so did the other mentor @lukehollis. please stop making false accusations here.
Aakash Dm
@konemshad
Mar 12 2017 18:31
same here , he did reply .
priyaraistar
@priyaraistar
Mar 12 2017 18:40
@all, I have no complaints against luke. Kyle is the problem here. You won't get selected no matter how good your proposal is. @manu-chroma trust me when I say three of your co-contributors have said this. They might not accept it in public.
@/all
Ashir Borah
@EagleShot
Mar 12 2017 18:46
@priyaraistar your git account is just an hour old. Did you make it just so that you can come here and flame?
Manvendra Singh
@manu-chroma
Mar 12 2017 18:47
CLTK is relatively new organisation in GSoC. it's the 2nd time CLTK is participating. the fact that it was granted only two slots last year and those were taken up be deserving candidates makes all your accusations baseless. I think you've a bias here. try to understand there are very limited slots here and this makes selection quite competitive. I myself made a proposal last year and didn't get accepted. I still kept contributing to CLTK and Kyle was really helpful before and after the results. I suggest you apply to different organisation where you think you might have a better chance of getting selected.
Samriddhi Sinha
@djokester
Mar 12 2017 18:52
@EagleShot peace out please. Don't lose your temper unnecessarily. Don't get pulled into a war of words unnecessarily. Don't make personal attacks. Let @manu-chroma do the talking.
Shreyaa Sridhar
@shreyaasridhar
Mar 12 2017 18:55
Lets just try to learn. It isn't about getting selected into GSOC. This project is very useful. @all
Luke Hollis
@lukehollis
Mar 12 2017 18:59
Hi folks, and thanks to everyone for their patience as we reply to emails and direct messages on Gitter! @manu-chroma is correct that we received a lot of applications last year, the majority of which were very strong, and it was difficult to only pick two. Selection is based on strength of proposal and how it specifically addresses project ideas: https://github.com/cltk/cltk/wiki/Project-ideas#gsoc-projects For the Meteor app and CLTK api project, please send proposal drafts to Kyle and I for more feedback as early as possible.
Also, on another topic, http://archive.cltk.org/ is up to 64 million words served in classical languages, but this only represents a small portion of the corpora managed by the cltk. I can’t wait to see how many more texts we can serve this year!
Luke Hollis
@lukehollis
Mar 12 2017 19:29
Specifically for converting corpora, I’m open to all ideas, but I’ve just been adding converter.py files to each corpus repo, which converts the repo texts from their original format to the cltk_json format detailed here: https://github.com/cltk/cltk_api/wiki/JSON-data-format-specifications
^ that format is the minimum possible amount of data used for rendering on the frontend interface, and I detailed more of the process of conversion in the cltk_api gitter channel: https://gitter.im/cltk/cltk_api
Luke Hollis
@lukehollis
Mar 12 2017 19:35
Also, @ashiful-haque, the very talented UX designer, will be developing frontend design templates for our archive.cltk.org project and will post them in https://gitter.im/cltk/ux