These are the slides for the Big Data Romandie meetup: http://www.meetup.com/Big-Data-Romandie/events/230345605/ Most of the interesting bits are in the attached notebooks, though: https://gist.github.com/huitseeker/a868af0dd8064cfe9806f4974a955386 and https://gist.github.com/huitseeker/3c6d1246178eea56d958a4757a8cadbd The notebooks are meant to be used with spark-notebook: https://github.com/andypetrella/spark-notebook