These are chat archives for FreeCodeCamp/DataScience
discussion on how we can use statistical methods to measure and improve the efficacy of http://freeCodeCamp.com
That is another thing. No per se. You can use python to connect to translation API's but depending on the length it might take long or cost money and they are far for being reliable translation (still...). I don't know.
Then you can find more about vector modelling (the simplest approach) and cluster over that. k-means (again, the simplest) is the most used.
If clustering by language, you might not even need to apply any clustering algorithm at all. Just find a way to identify they are in different languages.
ankitnau25 sends brownie points to @evaristoc :sparkles: :thumbsup: :sparkles: