These are chat archives for FreeCodeCamp/DataScience

7th
Oct 2017
chessybo
@chessybo
Oct 07 2017 00:00
Can y'all please look at my first data science project? I haven't gotten any feedback. My roommate told me I shouldn't have used the net difference, but rather just the percentage of those with a job after bootcamp out of the total bootcamp attendance for that age group. https://www.kaggle.com/chessybo/bootcamp-success-vs-age
Arun Kumar
@arunkumar413
Oct 07 2017 03:42
@Shobhit1610 clicks and conversions
evaristoc
@evaristoc
Oct 07 2017 08:12

@chessybo I think your analysis is clear, and the different graphs show that attendance differs by demographics.

@arunkumar413 A usual first step is segmentation using descriptive statistics and possibly unsupervised learning techniques. Although not inherently predictive, it could become predictive if the description holds for every case.

If you want to apply supervised ML for that, a simple recommended approach that I saw applied in a now-closed Kaggle competition was a regularized logistic regression with hash encoding: https://www.kaggle.com/c/avazu-ctr-prediction/discussion/10927#58054. Several participants posted variations and improvements of this same method. You can go through the whole discussion of that competition to see if you find something that meets your expectations.
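
To make the idea concrete, here is a minimal pure-Python sketch of a regularized logistic regression with hash encoding. It is not the code from the linked thread: the field names (`device`, `ad_type`), the toy data, the bucket count, and the hyperparameters are all made up for illustration, and plain SGD with an L2 penalty stands in for whatever optimizer the competitors actually used.

```python
import math
import zlib

D = 2 ** 16  # number of hash buckets (assumed size, tune for real data)

def hash_features(row, d=D):
    """Hashing trick: map each 'field=value' pair to a bucket index."""
    return [zlib.crc32(f"{key}={value}".encode()) % d for key, value in row.items()]

def predict(weights, indices):
    """Logistic prediction: sigmoid of the summed weights of the active buckets."""
    z = sum(weights[i] for i in indices)
    z = max(min(z, 35.0), -35.0)  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def train(rows, labels, epochs=20, lr=0.1, l2=1e-4, d=D):
    """SGD on log loss with an L2 penalty applied to the active weights."""
    weights = [0.0] * d
    for _ in range(epochs):
        for row, y in zip(rows, labels):
            idx = hash_features(row, d)
            error = predict(weights, idx) - y
            for i in idx:
                weights[i] -= lr * (error + l2 * weights[i])
    return weights

# Made-up toy data: in this sample, mobile impressions were clicked, desktop ones were not.
rows = [
    {"device": "mobile", "ad_type": "banner"},
    {"device": "mobile", "ad_type": "video"},
    {"device": "desktop", "ad_type": "banner"},
    {"device": "desktop", "ad_type": "video"},
] * 5
labels = [1, 1, 0, 0] * 5

weights = train(rows, labels)
p_mobile = predict(weights, hash_features({"device": "mobile", "ad_type": "banner"}))
p_desktop = predict(weights, hash_features({"device": "desktop", "ad_type": "banner"}))
print(p_mobile, p_desktop)
```

The point of the hashing step is that categorical fields such as city or postal code never need an explicit dictionary: each value lands in one of `D` buckets, so memory stays fixed no matter how many distinct values show up.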

evaristoc
@evaristoc
Oct 07 2017 11:38

@QuincyLarson

On a different topic, I have been contacted by an NGO here in Amsterdam to help them with their digitization policy as a volunteer.

The NGO is veeery behind, basically run by people who are either older or have poor IT literacy, so there could be a lot of work to do. However, there are a few young members looking to modernise the organisation. From what I have seen, their IT needs are many, and they might need some new applications, possibly a lot of hard, extensive work.

I have thought of fCC as a possible provider. Question for you: is fCC moving away from serving individual NGOs toward developing modules of more general use and impact, or is fCC still working closely with NGOs on partially/fully customised solutions?

Hope to hear from you.

Arun Kumar
@arunkumar413
Oct 07 2017 13:43
How to do multiple-variable analysis and optimisation?
Arun Kumar
@arunkumar413
Oct 07 2017 16:00
Generally CTR = clicks/impressions and conversion rate = conversions/clicks. Based on this data it is possible to create a model. But how to factor in the other variables such as city, mobile device, ad type, country, postal code, etc.?
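
One simple way to start factoring in another variable is to compute the two rates per segment. The sketch below uses the usual definitions, CTR = clicks / impressions and conversion rate = conversions / clicks; the record layout, the `device` field, and the numbers are made up for illustration.

```python
from collections import defaultdict

def rates_by_segment(records, key):
    """Aggregate raw counts per segment, then compute CTR and conversion rate."""
    totals = defaultdict(lambda: {"impressions": 0, "clicks": 0, "conversions": 0})
    for record in records:
        segment = totals[record[key]]
        for metric in ("impressions", "clicks", "conversions"):
            segment[metric] += record[metric]

    rates = {}
    for name, t in totals.items():
        # Guard against division by zero for segments with no traffic.
        ctr = t["clicks"] / t["impressions"] if t["impressions"] else 0.0
        cvr = t["conversions"] / t["clicks"] if t["clicks"] else 0.0
        rates[name] = {"ctr": ctr, "conversion_rate": cvr}
    return rates

# Made-up example records, one per campaign and device.
records = [
    {"device": "mobile", "impressions": 1000, "clicks": 50, "conversions": 5},
    {"device": "mobile", "impressions": 1000, "clicks": 30, "conversions": 3},
    {"device": "desktop", "impressions": 500, "clicks": 10, "conversions": 2},
]

rates = rates_by_segment(records, "device")
for device, r in rates.items():
    print(device, r)
```

Summing the counts before dividing matters: averaging per-row rates would give small campaigns the same weight as large ones. The same function can be called with `"city"` or `"ad_type"` as the key if the records carry those fields.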
Quincy Larson
@QuincyLarson
Oct 07 2017 21:02
@evaristoc We are now focused on building more general purpose tools for nonprofits, like Mail for Good (https://medium.freecodecamp.org/our-nonprofit-needed-a-cheaper-way-to-send-email-blasts-so-we-engineered-one-167322e3f28e) rather than trying to build one-off solutions for specific nonprofits.