These are chat archives for FreeCodeCamp/DataScience
discussion on how we can use statistical methods to measure and improve the efficacy of http://freeCodeCamp.com
@huyingjie there are 3 files - all of them combined have data from 31-Dec-2014 to around 9-Dec-2017.
I made some notes in the description. Be careful with some posts that are duplicated between files (exact dates of collection overlapped) AND time - the time, particularly the hours, showed by the
sent variable is local time. Mine is Western Europe. Additionally, I collected the files in different times of the year and there is a file (the last one) that is in Winter Time (-1 hour to summer time).
If your project consider times, that will be an added data manipulation challenge ;) .
@huyingjie This data is more real than much of the things you will find in Kaggle. Be careful with conclusions too. Remember there are other fellow students involved. Data is open but be professional.
@GoldbergData immediately! :)
I will contact you privately.
Back to the emoji project, it appears that the results look VERY different if I analyse the keywords (those between
: ) than the unicodes (probably copy/paste of the emoji from a source)...
The top10, in order of popularity?
This is a real deviation of what I was expecting to be honest... Now I am not sure of the reliability of this result...
The unicode form is one interpreted by your machine. The alias seems to call an existing image in Github / Gitter.
:joy:renders an emoji but it is NOT in the list... No good...
Well... getting close...
I do another analysis to show you and then I might be done for today with this one.
evaristoc sends brownie points to @timjavins :sparkles: :thumbsup: :sparkles:
@timjavins Ok. What is the boring stuff?
What makes CompTIA so boring? First time I see the certification by the way. Not in USA myself. And what are you really after, if I might know? And what are you expecting to get from learning NNs?
Anyway I take the opportunity to share a reflection with those asking questions about DS.
It is my humble opinion based on the poor experience I have that:
MOST OF THE DS TASKS are VERY but VERY VERY but VEEEEERYYYYYY BORING, for the disappointment of many in this channel.
Not that I am the best example but look at me: I am just stuck in a relatively simple project right now. So.... What do you expect when working with BIG DATA? Confetti? Unless you are brilliant, I think it will be a lot of... you know...
This might not relate to you, @timjavins, but I think many people coming to this channel are really but really really really really really really really really really underestimating the kind of things Data Analysis and DS entails....
So, I really congratulate you for going through the boring stuff now. Really: Well Done!!!!!! I am sure you might really need that for the sake of having fun later :) :) .
Final update of the emoji project and I go. The top10 of those aliases only with lowercases (which is might include majority but not all the aliases):
Definitively different to what I got in the other results, but this is a more likely trend reflecting the activities in the Casual chatroom. Hmmmm... See you all !!
:)(its emoticon) is one simple alias too.