These are chat archives for FreeCodeCamp/DataScience
discussion on how we can use statistical methods to measure and improve the efficacy of http://freeCodeCamp.com
That is an important observation. I am evaluating it as an option for hosting datasets, compared with dedicated dataset-storage platforms like datadotworld or Kaggle. It might be a good place for the things you mentioned, but I am still not sure it is the best place to host fCC datasets.
However, BitTorrent has other advantages that we really like.
The following is a list of the main attributes we are expecting to get from the selected data-storage host(s):
Whatever your level of experience is, I would like to hear your opinion. You might be surprised how valuable that can be.
@timjavins would you do me a favour?
Can you test the current dataset on BitTorrent and try to download it?
I need at least 4 tests apart from mine. I will ask you where you downloaded it from and what pros and cons you found.
If you run into problems, I won't give you many clues: you will have to troubleshoot them yourself, simulating someone who has no more help than what is available on the Internet. Only then will we know whether there is enough information to solve them and where to find it. Not finding a solution is still information worth sharing.
Hope you can help!
@/all Kaggle just published a survey of 16,000 data scientists and made the full dataset open! I’m planning to publish a summary of their findings on Monday, like I've written about Stack Overflow's and O'Reilly's datasets in the past.
Please take a look at the dataset, and if you find any interesting insights in it, let me know. I’ll credit you in my article. Here’s their announcement and links to their datasets: https://www.kaggle.com/surveys/2017