These are chat archives for FreeCodeCamp/DataScience

12th
Dec 2015
Kevin Quoc Truong
@wwwfreedom
Dec 12 2015 00:45
Hey all, I’m reading ‘Data Smart’ by John Foreman and I want to recommend it to you guys. It’s a good introduction into data science explaining the general concept in plain english. Check it out :)
Check this website out as well for roadmaps getting into data science. I found it via ycombinator https://www.metacademy.org/roadmaps/
Angela Manuel
@Smangeee
Dec 12 2015 01:57
thanks @wwwfreedom
CamperBot
@camperbot
Dec 12 2015 01:57
smangeee sends brownie points to @wwwfreedom :sparkles: :thumbsup: :sparkles:
:star: 166 | @wwwfreedom | http://www.freecodecamp.com/wwwfreedom
Quincy Larson
@QuincyLarson
Dec 12 2015 02:37
@wwwfreedom great book!
evaristoc
@evaristoc
Dec 12 2015 11:26
@wwwfreedom thanks for sharing!
CamperBot
@camperbot
Dec 12 2015 11:26
evaristoc sends brownie points to @wwwfreedom :sparkles: :thumbsup: :sparkles:
:star: 167 | @wwwfreedom | http://www.freecodecamp.com/wwwfreedom
Vikesh Tiwari
@vicky002
Dec 12 2015 11:29
Hi consider me a newbie in Data Science. How do I start learning ? what is the first step?
evaristoc
@evaristoc
Dec 12 2015 11:37

Hi @vicky002: there are lots to learn through other MOOC's. I have used coursera, edX, udacity and ivercity. Here a more comprehensive list.

There is more material online. Some campers have been giving references in this room. Scroll up a bit and see if you find something, or just try a search using the gitter search feature (in my screen at the top right; don't get panic if it takes you to a different room :): you might accidentally pressed a wrong link).

Vikesh Tiwari
@vicky002
Dec 12 2015 11:39
ohh ok thank you. I'm currently a student and have worked and contributed to many open source projects. Now I'm thinking I should try my hands on Data Science. Thank you. I'm checking right now :D
thank you @evaristoc :+1:
CamperBot
@camperbot
Dec 12 2015 11:40
vicky002 sends brownie points to @evaristoc :sparkles: :thumbsup: :sparkles:
:star: 185 | @evaristoc | http://www.freecodecamp.com/evaristoc
evaristoc
@evaristoc
Dec 12 2015 11:42
@vicky002 I found your repos very interesting! I am keeping a watch at some...
Vikesh Tiwari
@vicky002
Dec 12 2015 11:42
ohh yeah! :shipit:
ohh sir you work at Yahoo? :o
evaristoc
@evaristoc
Dec 12 2015 11:44
@vicky002! No, sorry!
Vikesh Tiwari
@vicky002
Dec 12 2015 11:44
thank you. and one of my repo is trending on Github today : https://github.com/trending?l=html see AlgoWiki
ohh ok :D
evaristoc
@evaristoc
Dec 12 2015 12:13
@vicky002 what is your level? You said you are a student... of what?
Vikesh Tiwari
@vicky002
Dec 12 2015 12:13
CS student, 3rd year
evaristoc
@evaristoc
Dec 12 2015 12:58

Ok...

I see you and others have prepared an outstanding collection of references in the repo AlgoWiki...

I imagine it can be hard to maintain... I don't have formal education as CS but for what I have seen so far, maybe things you would like to consider including/exploring in the future?

  • R
  • heterogeneous computing (I just know a bit of it, but it is being used a lot in advanced analytics settings)
  • perhaps something about architectures (eg HDFS)
  • I would make a space for julia
  • No hadoop? cassandra? spark? kaftka? etc on your list?
  • For basic knowledge, statistics and linear algebra, perhaps automatas
  • Make a room for discrete optimisation...
  • Make a room for computational statistics...

Check the following too:

evaristoc
@evaristoc
Dec 12 2015 13:07
@vicky002 and don't forget kaggle, of course... and similar sites...
Vikesh Tiwari
@vicky002
Dec 12 2015 13:09
ohh thank you so much @evaristoc . I'll definitely check all these. big help!
CamperBot
@camperbot
Dec 12 2015 13:09
vicky002 sends brownie points to @evaristoc :sparkles: :thumbsup: :sparkles:
:star: 186 | @evaristoc | http://www.freecodecamp.com/evaristoc
Roel Verbunt
@roelver
Dec 12 2015 19:55
Damn. Since a couple of days Gitter has set a hard limit on the forum user list size. Only the first 125 are listed now. Now the FCC leaderboard lost its source for adding new users. This was the URL I used: https://gitter.im/api/v1/rooms/546fd572db8155e6700d6eaf/users?access_token=f1670594b8b9cd40d03f724d989f7d1840530219
I tried query parms limit= and skip= but they are ignored.
Ademola Adegbuyi
@ooade
Dec 12 2015 20:26
Please can someone explain what contiguous integration means and perhaps its functions? I'm watching the December summit and i don't seem to get it.
Darwin RC
@darwinrc
Dec 12 2015 21:02
@marhyorh No one better to explain it than Martin Fowler: http://martinfowler.com/articles/continuousIntegration.html
Ademola Adegbuyi
@ooade
Dec 12 2015 21:28
Ok thanks @darwinrc
CamperBot
@camperbot
Dec 12 2015 21:28
marhyorh sends brownie points to @darwinrc :sparkles: :thumbsup: :sparkles:
:star: 240 | @darwinrc | http://www.freecodecamp.com/darwinrc
evaristoc
@evaristoc
Dec 12 2015 22:25
@roelver: have you tried to talk with people at gitter (gitter developers room)? I think I found something similar with another project I did with @andela-bfowotade but I ignored as a defect to be solved later...
Roel Verbunt
@roelver
Dec 12 2015 22:27
I reported it to Gitter support, and asked for a 'skip' feature. No response yet.
evaristoc
@evaristoc
Dec 12 2015 22:29
@roelver yes... maybe this one too? https://gitter.im/gitterHQ/developers
Carl Parrish
@carl-parrish
Dec 12 2015 22:37
I just realize one thing Slack does better than gitter (I tend to think gitter is better at showing code btw) In slack you can add a reaction to a post and it doesn’t seem like you can do that here (am I missing something?)
evaristoc
@evaristoc
Dec 12 2015 22:39
@carl-parrish I think you aren't... not seen that reaction post feature here...
Roel Verbunt
@roelver
Dec 12 2015 22:40
@evaristoc OK. So the have a support room as well. Makes sense. I expect it to be intentionally, so maybe they limited the response just because of me. I usually update the user list every day. They may have detected that pattern.
evaristoc
@evaristoc
Dec 12 2015 22:54

@roelver check with them... I am not aware of any change in policy but it could be...

It could be not only you: we have another project that also downloads a lot of data through the API; we left the project unattended for a while until I checked it a few weeks ago: I found it was not capturing all data as before...

Yes... @roelver and I was just to say that it could impact the bot too... and then I found the messages in the developer room... 0_0
Roel Verbunt
@roelver
Dec 12 2015 22:59
@evaristoc I read on the Gitter/Dev room that @abhisekp already reported this Gitter limit and they confirmed.
@evaristoc thanks for your help
CamperBot
@camperbot
Dec 12 2015 22:59
roelver sends brownie points to @evaristoc :sparkles: :thumbsup: :sparkles:
:star: 187 | @evaristoc | http://www.freecodecamp.com/evaristoc
evaristoc
@evaristoc
Dec 12 2015 23:00
@roelver positive... I also found that too... wruack! that is no nice...
Roel Verbunt
@roelver
Dec 12 2015 23:00
The easiest way to resolve this is a data dump from FCC directly of course ;-)
evaristoc
@evaristoc
Dec 12 2015 23:00
They wrote to you, @roelver
@roelver we can try to work an API to get data from FCC... I need a bit more training though... but I will keep you informed
Roel Verbunt
@roelver
Dec 12 2015 23:03
@evaristoc I already wrote a node script to export what I need: https://github.com/roelver/fccexport Berkeley asked me for that.
But a more generic data dump is fine too of course.
Carl Parrish
@carl-parrish
Dec 12 2015 23:06
Have you guys considered getting data from coderbits,wakatime, and codeivate to help round out your data?
evaristoc
@evaristoc
Dec 12 2015 23:08
@carl-parrish sounds like fun
Abhisek Pattnaik
@abhisekp
Dec 12 2015 23:09
but they're unreliable and many don't use them :(
evaristoc
@evaristoc
Dec 12 2015 23:11
@roelver speaking on behalf of @BerkeleyTrue: very busy at the moment with the next deployments...
Carl Parrish
@carl-parrish
Dec 12 2015 23:13
@abhisekp I’m thinking of a way in our profile to say that we’ll allow shared use of data (sort of like we do with github) then allow that as a datapoint for how long it takes to get a zipline done for instance. Perhaps also add rescueTime to that list.
evaristoc
@evaristoc
Dec 12 2015 23:21
@roelver maybe @abhisekp can give you more insights about the current situation of the API as it is at the moment... they are busy with the bot so they might have similar needs... I think it is a different level of data requirement though: bot is more about only usernames of few users required per events occurring in parallel at several rooms.
@carl-parrish interesting...
@roelver keep us informed about the progress of the skip measure, please?
Roel Verbunt
@roelver
Dec 12 2015 23:29
@evaristoc @abhisekp I registered issue #1022 for this. gitterHQ/gitter#1022