Low Latency Access of Big Data Using Spark and Shark
1. Low Latency Access of Big data using Spark and Shark
by Pradeep Kumar G.S
2. Objective
This presentation covers the technology that lets us
access big data with low latency.
It combines an in-memory datastore with massive data
crunching.
3. Spark
Spark is an open source cluster computing system that
aims to make data analytics fast — both fast to run and
fast to write.
To run programs faster, Spark provides primitives for in-
memory cluster computing: your job can load data into
memory and query it repeatedly much more quickly than
with disk-based systems like Hadoop MapReduce.
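
The difference between re-reading data for every query and loading it into memory once can be sketched in plain Python. This is only a conceptual illustration, not Spark's API: the class names `DiskDataset` and `CachedDataset`, the `demo` helper, and the sample log lines are all hypothetical, and the sketch counts disk reads rather than measuring real cluster performance.

```python
import os
import tempfile


class DiskDataset:
    """Re-reads the file from disk on every query (disk-based style)."""

    def __init__(self, path):
        self.path = path
        self.disk_reads = 0

    def count_matching(self, keyword):
        # Every query pays for a full pass over the file on disk.
        self.disk_reads += 1
        with open(self.path) as f:
            return sum(1 for line in f if keyword in line)


class CachedDataset:
    """Loads the file into memory once, then answers queries from memory."""

    def __init__(self, path):
        # The only disk read happens here, at load time.
        self.disk_reads = 1
        with open(path) as f:
            self.lines = f.readlines()

    def count_matching(self, keyword):
        # Queries scan the in-memory list and never touch the disk.
        return sum(1 for line in self.lines if keyword in line)


def demo():
    # Write a small, made-up log file to query.
    with tempfile.NamedTemporaryFile("w", suffix=".log", delete=False) as f:
        f.write("INFO start\nERROR disk full\nWARN slow\nERROR timeout\n")
        path = f.name
    try:
        disk = DiskDataset(path)
        cached = CachedDataset(path)
        keywords = ["ERROR", "WARN", "INFO"]
        disk_counts = [disk.count_matching(k) for k in keywords]
        cached_counts = [cached.count_matching(k) for k in keywords]
        return disk_counts, cached_counts, disk.disk_reads, cached.disk_reads
    finally:
        os.remove(path)
```

Calling `demo()` runs the same three queries against both variants. The answers match, but the disk-based variant reads the file once per query while the cached variant reads it only once in total. That gap is the intuition behind querying an in-memory dataset repeatedly.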
It is built to run on top of Apache Mesos, a cluster manager
that provides efficient resource isolation and sharing across
distributed applications.
4. Shark
Shark is Hive on Spark.
Shark supports most of Hive's features, such as:
● SerDe (serializer/deserializer)
● UDFs (user-defined functions)
We will also compare Shark, Hive, and Impala.