These are chat archives for cltk/cltk

15th
Mar 2016
Nathan D. Smith
@nathans
Mar 15 2016 00:01
sorry for the necromancy, but I don't think #63 is really fixed :-)
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 00:06
Ha, no it wasn't at all fixed. I just could find any code to reproduce it :blush:
Nathan D. Smith
@nathans
Mar 15 2016 00:08
ah, ok, like I said, I'll work up a PR
good to include a new unittest for this?
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 00:10
Unittest would be terrific for this, yes. I'd say you could probably just add your dieresis Beta Code to the current test: https://github.com/cltk/cltk/blob/master/cltk/tests/test_corpus.py#L47
Tip: This test requires a fair amount of downloads, so if you can't test the entire module locally, that's fine. Just push and I can adjust if need be
Nathan D. Smith
@nathans
Mar 15 2016 00:13
ok
something I've seen that I've not encountered before, when importing cltk, FileNotFoundError: [Errno 2] No such file or directory: 'LICENSE'
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 00:18
Let me check
Nathan D. Smith
@nathans
Mar 15 2016 00:18
workaround was just to symlink a LICENSE file in place :-) happens when important cltk from any folder lacking that
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 00:21
I'm not getting it. Do you mind posting the error anyways?
Nathan D. Smith
@nathans
Mar 15 2016 00:23
as an issue?
or in here?
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 00:41
just here. I'm curious
Wentao Lu
@wugui2020
Mar 15 2016 01:01
Hi Kyle I know there might be overwhelming emails flooding your inbox but could you please check my last reply? Thank you! @kylepjohnson
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 01:02
@wugui2020 Will do
Wentao Lu
@wugui2020
Mar 15 2016 01:11
My email is momi2020@uw.edu btw @kylepjohnson
Nathan D. Smith
@nathans
Mar 15 2016 02:58
@kylepjohnson in cltk/cltk/tests:
python3 test_corpus.py
Traceback (most recent call last):
File "test_corpus.py", line 9, in <module>
from cltk.corpus.greek.beta_to_unicode import Replacer
File "<frozen importlib._bootstrap>", line 2237, in _find_and_load
File "<frozen importlib._bootstrap>", line 2226, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 1191, in _load_unlocked
File "<frozen importlib._bootstrap>", line 1161, in _load_backward_compatible
File "/home/nathan/software/cltk/lib/python3.4/site-packages/cltk-0.1.33-py3.4.egg/cltk/init.py", line 18, in <module>
FileNotFoundError: [Errno 2] No such file or directory: 'LICENSE'
Sourav Singh
@souravsingh
Mar 15 2016 11:11
Hi @kylepjohnson I want to know if a dictionary qualifies as a corpus for a language.
PengFoo
@PengFoo
Mar 15 2016 14:32
Hi, @kylepjohnson i send a work plan for GSoC 2016 and my email is fupeng@hotmail.com
Kyle P. Johnson
@kylepjohnson
Mar 15 2016 15:17
@souravsingh, I usually call a dictionary a "data set". Yes, it is definitely the kind of thing we want to host in the GitHub organization.
Nathan D. Smith
@nathans
Mar 15 2016 18:05
@kylepjohnson last betacode question I have:
I see combining a vowell with apostraphe is used to get breve accents, e.g. I' -> ῐ
however this creates a collision in places where the same vowel is followed by an apostrophe to indicate elision, e.g. δι’ αὐτοῦ
I don't see that use of the apostrophe in the TLG pdf you linked (nor in CATSS betacode description), but I do see it on the wikipedia page
I'm not really sure how it should be handled, because it seems like an honest ambiguity