BIG DATA
- Afif Al Mamun
Contents
• Big Data
• Three Vs
• Handling Big Data
• Hadoop
• Hive
• NoSQL
• Why Big Data
Big Data
Big data is a term that describes the large volume of data, both structured and unstructured.
Three Vs
• Volume
• Velocity
• Variety
Three Vs
• Volume
  ◦ Terabytes/petabytes of data
  ◦ Transactions
  ◦ Social media
Three Vs
• Velocity
  ◦ Data arrives at a fast rate
  ◦ Real-time data from smart devices
  ◦ High-velocity data streams
Three Vs
• Variety
  ◦ Different formats of data
  ◦ Structured/unstructured/semi-structured
  ◦ SQL/NoSQL
Handling Big Data
• Cannot be handled by a conventional DBMS
• HDFS
• MapReduce algorithm
• Tools
  ◦ Apache Hadoop
  ◦ Hive
  ◦ NoSQL
HDFS
• Hadoop Distributed File System
MapReduce
• A word count example using the MapReduce algorithm
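The word count example can be sketched in plain Python to show the three phases (map, shuffle, reduce). This is a single-machine illustration only, not Hadoop code: the function names and the sample input are assumptions made for the sketch, and a real job would run the map and reduce steps in parallel across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted counts by their key (the word)."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(word, counts):
    """Reduce: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    # Hypothetical input: each string stands for one input split.
    sample = ["Deer Bear River", "Car Car River", "Deer Car Bear"]
    print(word_count(sample))
```

Each input line is mapped independently, which is what would let a cluster process splits in parallel; only the shuffle step needs to bring pairs with the same word together.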
Hadoop Distributed File System
• HDFS architecture
• Master-slave design
• NameNode (master)
• DataNodes (slaves)
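The master-slave split can be illustrated with a toy model: the NameNode keeps only metadata (which DataNodes hold which block), while the DataNodes store the blocks themselves. The sketch below is an assumption-heavy simplification. The function `place_blocks`, the 128 MB block size, the replication factor of 3, and the round-robin placement are all chosen for illustration; real HDFS uses a rack-aware placement policy.

```python
import math

BLOCK_SIZE_MB = 128  # assumed block size for this sketch
REPLICATION = 3      # assumed replication factor for this sketch


def place_blocks(file_size_mb, datanodes,
                 block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Return {block_index: [datanode, ...]}, the kind of map a NameNode keeps."""
    if replication > len(datanodes):
        raise ValueError("not enough DataNodes for the requested replication")
    num_blocks = math.ceil(file_size_mb / block_size_mb)
    placement = {}
    for block in range(num_blocks):
        # Simple round-robin placement; hypothetical, not the HDFS policy.
        placement[block] = [
            datanodes[(block + r) % len(datanodes)] for r in range(replication)
        ]
    return placement


if __name__ == "__main__":
    # Hypothetical cluster of four DataNodes storing one 300 MB file.
    for block, nodes in place_blocks(300, ["dn1", "dn2", "dn3", "dn4"]).items():
        print(f"block {block} -> {nodes}")
```

Because every block lives on several DataNodes, losing one node does not lose data; the NameNode's map is what lets a client find a surviving copy.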
Hive
• Built on Hadoop
• Supports SQL-like queries
• Used for analytics, reports, and query processing
• Integrated with JDBC
• Used by Facebook, Netflix, LinkedIn
NoSQL
• Key-value stores
• Document databases
• Graph databases
• Flexibility
• Scalability
• Popular NoSQL databases
  ◦ MongoDB
  ◦ Oracle NoSQL
  ◦ Cassandra
NoSQL
Hands-on MongoDB
A JSON document to insert data into a MongoDB database named students
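As a rough illustration of what such a document might look like, the sketch below builds a student record as a Python dictionary and serializes it to JSON. The field names and values are hypothetical and are not taken from the original slide. The commented lines show how the insert could look with the pymongo driver, assuming a MongoDB server is running locally and using a hypothetical collection name `records`.

```python
import json

# Hypothetical student record; every field here is illustrative only.
student = {
    "name": "Jane Doe",
    "roll": 101,
    "department": "CSE",
    "courses": ["Database", "Big Data"],
}

# Serialize to the JSON text that would be shown or sent to the database.
document = json.dumps(student, indent=2)
print(document)

# With a running MongoDB server and pymongo installed, the insert could be:
#   from pymongo import MongoClient
#   client = MongoClient()  # connects to localhost by default
#   client["students"]["records"].insert_one(student)
```

Unlike a table row, the document needs no predefined schema: another record in the same collection could add or omit fields such as `courses`.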
NoSQL vs RDBMS
Importance of Big Data Analytics
• Data science perspective
• Business perspective
• Real-time usability perspective
• Job market perspective
Big Data Analytics and Data Science
• Data scientists are responsible for dealing with big data in business.
Business and Big Data Analytics
Real-time Benefits of Big Data Analytics
• Banking
• Healthcare
• Energy
• Technology
• Consumer
• Manufacturing
Job Opportunities and Big Data Analytics
Knowledge of and experience with big data analytics can give you an edge over others.
References
• https://www.sas.com/en_us/insights/big-data/what-is-big-data.html
• https://hadoop.apache.org/docs/r1.2.1/images/hdfsarchitecture.gif
• https://www.oracle.com/big-data/guide/what-is-big-data.html
• https://blog.sqlauthority.com/2013/10/02/big-data-what-is-big-data-3-vs-of-big-data-volume-velocity-and-variety-day-2-of-21/
• https://bigdataldn.com/big-data-the-3-vs-explained/
• https://www.edureka.co/blog/mapreduce-tutorial/
• https://www.toptal.com/database/the-definitive-guide-to-nosql-databases
• https://www.whizlabs.com/blog/big-data-analytics-importance/
Introduction to Big Data