These are chat archives for FreeCodeCamp/DataScience

14th
Oct 2016
Xavier Sumba
@cuent
Oct 14 2016 14:54
Hey guys, take a look to this video. I really enjoyed. It was delighted how she plays with data. I am considering in learning F#.
https://www.youtube.com/watch?v=qlKZKN7il7c&feature=youtu.be
Also, anyone knows a good tutorial about T-SNE. I'd would like to go deep because it could be useful for my actual project.
Alice Jiang
@becausealice2
Oct 14 2016 16:29
Free R eBook (account registration is free and keeps track of your ebooks)
evaristoc
@evaristoc
Oct 14 2016 16:54

@cuent : some people here started to talk about F# in this channel last week and I checked some exercises. It is gaining my interest more and more. In fact recently I am finding python bit too small and constraining...

T-SNE: a Dutch person seems to be behind the development of the technique? Reading about... Is that a sort of clustering technique? Or mainly representational, like Multidimensional Scaling? Hmmm... seems more like the last one... Looks VERY simple conceptually too!
kNN again!, with apparently MC + binary search + back to t-Student and concepts in information theory to measure the loss function...
Amazing how some simple stuff could be more effective than too complex implementations in the general cases... we were realising that in a discussion during the last meetup...
@cuent: for what I am reading, the key here is finding the "probability distribution between pairs of high-dimensional objects" and then the probability distribution of the similarity distance. Looks like a (double) Bayesian approach to me unless your approach is parametric... Also it is here where the sampling takes place: it could be a numerical approximation? It seems that involves a bit of tweaking if you get fancy.
Those are my first impressions, not sure if I am correct...
Not sure if you are interested just in the practice of the technique or also in the concepts? Don't know a course but for basic concepts I would suggest something in Computational Statistics if that is not your field: it could be valuable to understand the implementation.

Nice approach!


People

IoT in Amsterdam: turning into an Smart City - the Beacon Mile

(particularly @darwinrc but maybe @cuent and @luishendrix92? @Lightwaves?)

Who want to check and play with this? Maybe a very small project here on our own? Something perhaps for Latinamerican users?

I can share with you some advances already done by some companies here, trying to get involved.

Also... Skale project

Sorry about my lack of involvement in the skale project the last months. I am still interested but stuck with some basic stuff to be honest. I still suggest to keep an eye on that project, who knows what comes. Keep you updated.

Alice Jiang
@becausealice2
Oct 14 2016 16:56
I finally started reading the R book I posted a bit ago, it's super basic. Definitely for people starting from point 0
evaristoc
@evaristoc
Oct 14 2016 16:57
:+1: Ask questions here, @alicejiang1; some of us can answer your questions, but our known expert is definitively @erictleung.
Alice Jiang
@becausealice2
Oct 14 2016 16:59
It was more a recommendation than a statement that I'm trying to learn R. Packt is worth signing up for, though. Lots of deals and free books... Last one I got was this one.
I've been focusing on relearning all the maths behind DS, but I've spent a bit of time recently slowly learning Scala :)
Eric Leung
@erictleung
Oct 14 2016 17:02
@alicejiang1 another form of learning, in case books get boring to read, is to interactively learn R from within R! You can checkout http://swirlstats.com/. It's an R package to teach you R stuffs. Funny thing, I'm also trying to learn Scala slowly as well :smile:
Alice Jiang
@becausealice2
Oct 14 2016 17:07
I've used Swirl before :) it was kinda fun :D
evaristoc
@evaristoc
Oct 14 2016 17:15
@cuent: just watching the video. Good! And for what I see in the video T-SNE is more like Multidimensional Scaling...
... and the presentation is also in D3.js??? Great...
I am changing to F# too...
Xavier Sumba
@cuent
Oct 14 2016 20:46

@evaristoc Yes, I am interested a lot in clustering algorithms.

I didn't know that was possible to build presentations on D3.js. Is there some tool to create presentations? or should we write code from start?

Eric Leung
@erictleung
Oct 14 2016 23:45
@evaristoc you brought up t-SNE earlier. I just ran into this interactive blog post talking about how to interpret t-SNE plots http://distill.pub/2016/misread-tsne/