Hey guys, take a look at this video. I really enjoyed it. It was delightful how she plays with data. I am considering learning F#.

https://www.youtube.com/watch?v=qlKZKN7il7c&feature=youtu.be

Also, does anyone know a good tutorial about t-SNE? I'd like to go deep because it could be useful for my current project.

@cuent: some people here started to talk about F# in this channel last week and I checked some exercises. It is gaining my interest more and more. In fact, recently I am finding Python a bit too small and constraining...

t-SNE: a Dutch person seems to be behind the development of the technique? Reading about it... Is that a sort of clustering technique? Or mainly representational, like Multidimensional Scaling? Hmmm... seems more like the latter... Looks VERY simple conceptually too!

kNN *again!*, apparently with MC + binary search + back to Student's t and concepts from information theory to measure the loss function...
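
For anyone following along, here is a rough sketch of where the binary search comes in, as far as I understand it: for each point, t-SNE picks a Gaussian bandwidth so that the perplexity of that point's neighbour distribution matches a user-chosen target. The function names (`conditional_probs`, `perplexity`, `find_sigma`), the search bounds and the toy distances are all made up for illustration; this is not the reference implementation.

```python
import math

def conditional_probs(sq_dists, sigma):
    """Gaussian similarities p_(j|i) from one point's squared distances to the others."""
    # Shift by the smallest distance so at least one weight is exactly 1.0
    # (avoids dividing by zero when sigma is tiny); the shift cancels on normalising.
    d_min = min(sq_dists)
    weights = [math.exp(-(d - d_min) / (2.0 * sigma ** 2)) for d in sq_dists]
    total = sum(weights)
    return [w / total for w in weights]

def perplexity(probs):
    """Perplexity = 2 ** (Shannon entropy in bits)."""
    entropy = -sum(p * math.log2(p) for p in probs if p > 0.0)
    return 2.0 ** entropy

def find_sigma(sq_dists, target, lo=1e-3, hi=1e3, iters=100):
    """Binary search for the bandwidth whose perplexity matches `target`.

    Assumes the answer lies between the (hypothetical) bounds lo and hi.
    Perplexity grows with sigma, so the interval is halved each step.
    """
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if perplexity(conditional_probs(sq_dists, mid)) > target:
            hi = mid  # distribution too flat: shrink the bandwidth
        else:
            lo = mid  # distribution too peaked: widen the bandwidth
    return (lo + hi) / 2.0

# Toy example: squared distances from one point to four neighbours.
sq_dists = [1.0, 4.0, 9.0, 16.0]
sigma = find_sigma(sq_dists, target=2.0)
probs = conditional_probs(sq_dists, sigma)
```

Roughly speaking, the target perplexity acts like a soft "number of effective neighbours", which is probably why it feels like kNN again.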

Amazing how simple stuff can be more effective than overly complex implementations in the general case... we realised that in a discussion during the last meetup...

@cuent: from what I am reading, the key here is finding the "probability distribution between pairs of high-dimensional objects" and then the probability distribution of the similarity distance. Looks like a (double) Bayesian approach to me unless your approach is parametric... Also, this is where the sampling takes place: could it be a numerical approximation? It seems that it involves a bit of tweaking if you get fancy.
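
A minimal sketch of the second half of that idea, as I understand the method: the low-dimensional points get pairwise similarities from a Student's t kernel with one degree of freedom, and the loss is the Kullback-Leibler divergence between the high-dimensional distribution P and the low-dimensional distribution Q. The names `student_t_similarities` and `kl_divergence` and the toy points are invented here for illustration, and P is simply built from another toy layout rather than from real high-dimensional data.

```python
import math

def student_t_similarities(points):
    """Joint distribution q_ij over pairs i != j, using a Student's t kernel (1 d.o.f.)."""
    n = len(points)
    weights = {}
    for i in range(n):
        for j in range(n):
            if i != j:
                sq = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                weights[(i, j)] = 1.0 / (1.0 + sq)  # heavy-tailed kernel
    total = sum(weights.values())
    return {pair: w / total for pair, w in weights.items()}

def kl_divergence(p, q):
    """KL(P || Q) = sum over pairs of p_ij * log(p_ij / q_ij)."""
    return sum(p_ij * math.log(p_ij / q[pair])
               for pair, p_ij in p.items() if p_ij > 0.0)

# Toy 2-D layouts (made-up numbers). In real t-SNE, P comes from the
# high-dimensional data and only the layout behind Q is moved around.
layout_a = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
layout_b = [(0.0, 0.0), (4.0, 0.0), (0.5, 0.5)]
p = student_t_similarities(layout_a)
q = student_t_similarities(layout_b)
loss = kl_divergence(p, q)
```

The loss is zero only when the two distributions agree, so the optimiser (gradient descent in the original method) keeps nudging the low-dimensional points until Q looks as much like P as it can.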

Those are my first impressions, not sure if I am correct...

Not sure if you are interested just in the practice of the technique or also in the concepts? Don't know a course but for basic concepts I would suggest something in Computational Statistics if that is not your field: it could be valuable to understand the implementation.

Nice approach!

(particularly @darwinrc but maybe @cuent and @luishendrix92? @Lightwaves?)

Who wants to check and play with this? Maybe a very small project here on our own? Something perhaps for Latin American users?

I can share with you some advances already made by some companies here, trying to get involved.

Sorry about my lack of involvement in the skale project over the last few months. I am still interested but stuck with some basic stuff, to be honest. I still suggest keeping an eye on that project; who knows what will come. I'll keep you updated.

I finally started reading the R book I posted a bit ago, it's super basic. Definitely for people starting from point 0

:+1: Ask questions here, @alicejiang1; some of us can answer your questions, but our known expert is definitely @erictleung.

It was more a recommendation than a statement that I'm trying to learn R. Packt is worth signing up for, though. Lots of deals and free books... Last one I got was this one.

I've been focusing on relearning all the maths behind DS, but I've spent a bit of time recently slowly learning Scala :)

@alicejiang1 another form of learning, in case books get boring to read, is to interactively learn R from *within* R! You can check out http://swirlstats.com/. It's an R package to teach you R stuff. Funny thing, I'm also trying to learn Scala slowly as well :smile:

@cuent: just watching the video. Good! And from what I see in the video, t-SNE is more like Multidimensional Scaling...

... and the presentation is also in D3.js??? Great...

I am changing to F# too...

@evaristoc Yes, I am very interested in clustering algorithms.

I didn't know it was possible to build presentations with D3.js. Is there some tool to create presentations, or should we write the code from scratch?

@evaristoc you brought up t-SNE earlier. I just ran into this interactive blog post talking about how to interpret t-SNE plots http://distill.pub/2016/misread-tsne/