These are chat archives for FreeCodeCamp/DataScience

5th
May 2017
Leonardo Raduy Lemos
@PunkDado
May 05 2017 01:27
Hi guys! My name is Leonardo. I live in Curitiba, Brazil, and I'm really a fan of Data Science.
Looking foreward to meet all you!
vidushi2806
@vidushi2806
May 05 2017 04:07
Heyy all! I am Vidushi. I am new to Data Science. Would love to learn a lot. I am from India :)
Ghost
@ghost~580ed9c0d73408ce4f309ef0
May 05 2017 05:01
Hello World! , I am Vipul Aggarwal. I live in New Delhi,India and I'love to learn and explore it.
Welcome To the world of Data Science freeCodeCamp
How are you??
Will Worth
@willworth
May 05 2017 06:55
Hi everyone! I'm going to a big data meet up tomorrow in Alicante. Very exciting. What do you wish you knew when you started? Is csv just the standard for this stuff? My first impulse is to switch it to JSON!
Eric Leung
@erictleung
May 05 2017 07:24

@XpandGroup thanks for sharing!

@PunkDado welcome! Let us know if there's anything in particular you're interested in learning about :smile:

@vidushi2806 hello and welcome!

@willworth hello! Sounds like an awesome meet up. CSV is fairly standard, and so is JSON. They are generally interchangeable (i.e. you can put the same data into CSV and JSON), however, CSV is designed for tabular data (it is rectangle/square), versus JSON, which doesn't have to be and can be more flexible than CSV in terms of its schema or how it is structured.

CamperBot
@camperbot
May 05 2017 07:24
erictleung sends brownie points to @xpandgroup and @punkdado and @vidushi2806 and @willworth :sparkles: :thumbsup: :sparkles:
:warning: @vidushi2806's account is not linked with freeCodeCamp. Please visit the settings and link your GitHub account.
:warning: @xpandgroup's account is not linked with freeCodeCamp. Please visit the settings and link your GitHub account.
:cookie: 264 | @willworth |http://www.freecodecamp.com/willworth
:cookie: 3 | @punkdado |http://www.freecodecamp.com/punkdado
kumar8k
@kumar8k
May 05 2017 07:25
hello
Eric Leung
@erictleung
May 05 2017 08:19
@kumar8k hi there! And welcome!
satenndrra
@satenndrra
May 05 2017 09:27
Hello Everyone! I am an ex-mainframe programmer turned fullstack javascript developer (still learning)... Just read Quincy's FCC coders' survey article on medium https://medium.freecodecamp.com/we-asked-20-000-people-who-they-are-and-how-theyre-learning-to-code-fff5d668969
Interesting data, would love to build some visualization for this.... anyone here willing to collaborate? (I have decent html, css, js skills...some node & meteor)
Anshika Mehndiratta
@anshikam
May 05 2017 09:43
Just read this article from Quincy Larson : https://medium.freecodecamp.com/we-asked-20-000-people-who-they-are-and-how-theyre-learning-to-code-fff5d668969 and would be interested in building data driven visualizations for the same. I am a Ruby on Rails developer with good understanding of JS, CSS and use D3.js and Polymer web components in everyday life
evaristoc
@evaristoc
May 05 2017 09:47

People

Just a link about a recent discussion about stackoverflow at codinghorror. Interesting points of views:
https://blog.codinghorror.com/code-its-trivial/
@erictleung:
Can you open an issue as we did last year to discuss the possible contributions that people could make? I can help you to manage it if needed.
Vivek
@vivekmittal7
May 05 2017 10:07
Hello everyone
satenndrra
@satenndrra
May 05 2017 10:11
@anshikam I could participate with the front-end (D3) and we could keep the backend in ruby/rails if you want... Anyone else interested? I will wait for any responses, then we can take this offline, if that's OK.
Leonardo Raduy Lemos
@PunkDado
May 05 2017 11:22
@erictleung, I learned R 2 years ago, but didnt practice much. Now Im studying Python/ Django, and have great interest in noSQL. Actually I have a Computer Eng degree (graduated in 1998), but never seriously worked on the field - I was marketing executive for 14 years, and business consultant for the last 5 years. I spent most of my days looking at spreadsheets and crunching numbers in Oder to issue recommendations to companies. Also have taught Stat at university. Maybe I could collaborate with FCC using my professional marketing skills.
@satenndrra, @anshikam, I'm willing to collaborate as well. Don't know Ruby, but can use R or just help to ask and answer some questions about the data.
satenndrra
@satenndrra
May 05 2017 12:15
Sure @PunkDado .... We could brainstorm together what is it that we want to visualize. Or maybe do this https://www.google.co.in/amp/s/www.ae.be/blog-en/combining-the-power-of-r-and-d3-js/%3Fhs_amp%3Dtrue
Sorry, new to FCC... So how do you guys usually collaborate on projects? Continue the discussions in the main group or start a private chat for it?
evaristoc
@evaristoc
May 05 2017 13:38

@all,

Survey 2017

Great there are lots of people willing to contribute! We could possibly enjoy the same shared success at last year.

Please feel free to suggest ideas either here on in this channel or in the issue page of the current project:

https://github.com/freeCodeCamp/2017-new-coder-survey


How we did last year?

In 2016 we discussed advances of the project on this channel or through opened issues in the Github account where the project resided.

That year, @QuincyLarson suggested several questions for people to go, tagged as Question as a guidance for what he thought were interesting questions to look at. This was not a compulsory list though: people could work on what they wanted, in teams or alone.

If you want to know how the issue list worked, check the issues list of last year.

We have been inviting FCC users to work those ideas also in Kaggle: the data will be likely the same cleaned file. If you have plans to work on Kaggle, we will be happy to help you here or in Kaggle to deliver a Top Quality and Informative chart; just let us know.

Examples of some initiatives in 2016

Between the most active users last year you could find @SamAI-Software and @krisgesling. They both and specially Sam worked a nice final product about the project that you can see here. This projected received support from Quincy and people at CodeNewbie.

There were other initiatives around the data. An important one to mention is this one, carried out by an external senior analyst that found the data in Kaggle.

Please let us know how to help you. I advice to have a look at the data now and see if there is something that @erictleung should also take in consideration.

Success!

evaristoc
@evaristoc
May 05 2017 13:44

@PunkDado :

Maybe I could collaborate with FCC using my professional marketing skills.

sounds great! I will contact you soon?

@XpandGroup Thanks for the invite! Success with the meeting!
CamperBot
@camperbot
May 05 2017 13:46
evaristoc sends brownie points to @xpandgroup :sparkles: :thumbsup: :sparkles:
:warning: @xpandgroup's account is not linked with freeCodeCamp. Please visit the settings and link your GitHub account.
Eric Leung
@erictleung
May 05 2017 16:56

@evaristoc make an issue to discuss possible contributions? Like analyses or such? Or are you talking about contributions to cleaning?

I've done the best I could with the time and have made a PR to add a single data set to the repository freeCodeCamp/2017-new-coder-survey#1

People interested in how I cleaned the data using the programming language R can be found here https://github.com/erictleung/2017-new-coder-survey. So if you have any questions or concerns with parts of the data, let me know.

Siddhartha Sahu
@padf00t
May 05 2017 17:42
I am new to data science and am looking for an opportunity to work on some data project
I would need some guidance to begin with, so any suggestions are welcome
Eric Leung
@erictleung
May 05 2017 17:50

@padf00t welcome! Two places I'd look at are R for Data Science (it goes through the whole process of data science from data viz to exploration) and the UCI Machine Learning Repository to find data sets to explore.

To get going with data science, you need to be curious and ask questions of the data set and see if you can answer those questions. Let us know if you have any other questions. Good luck! :+1:

Siddhartha Sahu
@padf00t
May 05 2017 17:52
Hey @erictleung ! Thanks for the tip. I will be sure to check these
CamperBot
@camperbot
May 05 2017 17:52
padf00t sends brownie points to @erictleung :sparkles: :thumbsup: :sparkles:
:cookie: 493 | @erictleung |http://www.freecodecamp.com/erictleung
evaristoc
@evaristoc
May 05 2017 17:54
@erictleung: I already opened an issue for contributions. I think we could eventually follow a similar approach like that by Sam last year. I was almost opening an issue for cleaning but I think issues regarding the cleaning of the dataset should be in separated posts? Whatever you think it is more convenient.
@padf00t I would suggest to take a dataset, probably the survey one, and try to answer a very simple question first? Let us know how it goes! Success!
Siddhartha Sahu
@padf00t
May 05 2017 18:14
Hey @evaristoc I have shortlisted a few datasets. Will be finalisisng on one soon and start
Eric Leung
@erictleung
May 05 2017 20:16
The survey data set has been merged into the survey repository! This is a single data set combining the survey data parts. If you have any questions about it, I can do my best to answer them as I wrote the scripts to put it together and to clean it up. Have fun! https://github.com/freeCodeCamp/2017-new-coder-survey/tree/master/clean-data