The Home of Community

Spark with Scala / Lobby

A place to discuss and ask questions about using Scala for Spark programming.

scala spark

datastax / spark-cassandra-connector

https://academy.datastax.com/slack #Spark-connector #Dse-Analytics

spark cassandra datastax scala python java

CommBank / grimlock

A library for data preparation, aggregation, manipulation, and querying tasks related to data science and machine learning

data-science machine learning scalding spark data preparation scala

derrickburns / generalized-kmeans-clustering

This project generalizes the Spark MLlib K-Means clusterer to support clustering of dense or sparse, low- or high-dimensional data using distance functions defined by Bregman divergences (e.g. squared Euclidean distance, Kullback-Leibler divergence). Several variants of standard K-Means are easily implemented atop this package, including bisecting K-Means and Anytime K-Means.

scala project generalizes spark mllib k-means

OryxProject / oryx

Oryx 2: Lambda architecture on Spark and Kafka for real-time, large-scale machine learning

java oryx lambda architecture spark kafka

GELOG / adam-ibs

Ports the IBS/MDS/IBD functionality of Plink to Spark / ADAM

ports functionality plink spark adam

ypg-data / sparrow

Scala library for converting Spark rows to case classes

scala library converting spark rows case

jove-sh / jove-notebook

Deprecated project(s), see https://github.com/alexarchambault/jupyter-scala

scala full-fledged notebooks spark

jbtv / sparkling

A Clojure library for Apache Spark: fast, fully featured, and developer-friendly

clojure library apache spark fast fully-features

spark-windows-installer / Lobby

Mainly support for installing Spark on Windows, and anything related

spark windows installer