These are chat archives for cltk/cltk
Easyand also follow up https://github.com/cltk/cltk/wiki/Quickstart-for-contributors
Hey @chetanya-shrimali and welcome! Now, I am new myself so take my advice with a grain of salt.
As the mentors have pointed out, you will probably need to start by going through the easier issues and checking out the beginner's exercises (https://github.com/cltk/cltk/wiki/Beginners'-exercises). Meanwhile, it would probably help to experiment with the software itself and familiarize yourself with the documentation.
I want to add some corpora to cltk core in either hindi or punjabi language to get to know about the cltk codebase better. But I don't know about the copyright issues. Can anyone please help, which corpus is eligible to be added to cltk? Thanks.
@vikrant97 , I would suggest you to look through the corpora where ancient Punjabi is available. Ancient punjabi here refers to the 10th century or before as the language heavily borrowed words from Persian and Arabic after the Arab invasions in India. :smile: