Dr. Francesco Bongiovanni has expertise in scalable distributed systems and algorithms, cloud computing, applied formal methods, and distributed optimizations. He holds a B.Sc. in Computer Systems, an M.Sc. in Software Engineering of Distributed Systems, and a Ph.D. in Computer Science, and has worked at INRIA and the Verimag Laboratory.

This presentation gives an overview of the big data frameworks and tools that can be run on the eScience cluster to process large datasets in a scalable, fault-tolerant manner: HDFS, Mesos, Spark, Spark Streaming, Spark SQL, GraphX, MLlib, Chapel, ZooKeeper, and SparkR. The examples demonstrate operations such as averaging 1 billion elements.
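
The averaging example mentioned above can be illustrated without a cluster. What follows is a minimal sketch in plain Python, not the code from the presentation and not Spark code: it only mimics the usual pattern in which each partition computes a partial (sum, count) pair and the pairs are then merged. All function names are hypothetical, and the input is assumed to be the integers 0 to n-1, scaled down from 1 billion so it runs quickly on one machine.

```python
from functools import reduce


def partition_ranges(n, num_partitions):
    """Split the integers 0..n-1 into contiguous ranges, one per partition."""
    step, extra = divmod(n, num_partitions)
    ranges = []
    start = 0
    for i in range(num_partitions):
        # The first `extra` partitions each take one additional element.
        size = step + (1 if i < extra else 0)
        ranges.append(range(start, start + size))
        start += size
    return ranges


def partial_sum_count(chunk):
    """Per-partition step: reduce one chunk to a (sum, count) pair."""
    total = 0
    count = 0
    for x in chunk:
        total += x
        count += 1
    return total, count


def merge(a, b):
    """Combine two (sum, count) pairs; associative, so merge order is free."""
    return a[0] + b[0], a[1] + b[1]


def distributed_average(n, num_partitions):
    """Average 0..n-1 by merging per-partition partial results."""
    partials = map(partial_sum_count, partition_ranges(n, num_partitions))
    total, count = reduce(merge, partials, (0, 0))
    return total / count if count else float("nan")


if __name__ == "__main__":
    # Scaled-down stand-in for the 1-billion-element case (assumption).
    print(distributed_average(1_000_000, 8))
```

Carrying a (sum, count) pair rather than a running average is what lets the partitions be processed independently: averages of unequal-sized partitions cannot be averaged directly, but sums and counts can always be added.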