NAVEEN ABRAHAM
PRATEEK SABHARWAL
POOJA S KUMAR
RIYA ASEEF
NIKIHL JOHN ABRAHAM
BIG DATA
Next Big thing
in the IT world
Google, LinkedIn,
Facebook
Minimize timeCost reduction
Similar to ‘small data’ but bigger in size
Solve problems in a better way
Techniques, tools and architecture
Cannot be analyzed with traditional computing
techniques.
Eg: Walmart, Facebook
CHARACTERISTICS
VOLUME
Data quantity
VELOCITY
Data speed
VARIETY
Data types
Cost Saving
Time reduction
New product development
Understanding market conditions
Control online reputation
BENEFITS OF BIG DATA
STORING BIG DATA
Analyzing your data
characteristics
• Selecting data sources
for analysis
• Eliminating redundant
data
• Establishing the role of
NoSQL
Overview of Big
Data storage
• Data models: key value,
graph, document,
column-family
• Hadoop Distributed File
System
SELECTING BIG DATA STORES
Choosing the
correct data stores
Moving code to data
Implementing
polyglot data store
solutions
Aligning
business
goals
PROCESSING BIG DATA
Integrating disparate
data stores
Mapping data to the
programming framework
Connecting and extracting
data from storage
Transforming data for
processing
Processing
Requirments
Volume
Velocity
Variety
Ambiguity
Complexity
Eg: Hadoop
WHY BIG DATA
Growth of Big Data is needed
• Increase of storage capacities
• Increase of processing power
• Availability of data(different data types)
• Every day we create 2.5 quintillion bytes of data; 90% of the data in
the world today has been created in the last two years alone
WHY BIG DATA
FB generates 10TB
daily
Twitter generates 7TB
of data
Daily
IBM claims 90% of
today’s stored data
was generated in just
the last two years.
Orchestration
Data
Sources
Analytics
and
Reporting
Data Storage
Real- Time Message
Ingestion
Batch
Processing
Stream
Processing
Analytical
Data Store
ETL
WHY
Evolve?
TOOLS USED IN BIG
DATA ANALYSIS
DATA PROCESSING
DISTRIBUTED STORAGE
Amazon S3
Distributed Processing
TYPES OF TOOLS USED IN BIG DATA
APPLICATION OF BIG DATA
Banking –
University of florida
Media and
Entertainment
Healthcare- Taiwan
tracks corona using
Big data
Education –
University of
Tasmania
Manufacturing and
Natural Resources-
Predictive Model
Government- FDA
Retail and
Wholesale Industry
CONCLUSION
Big Data: A new competitive advantage for businesses
Crucial way to
outperform peers
Creation of new
service offerings
and the design of
future products
Create new growth
opportunities and
entirely new
categories of
companies
Various challenges
to overcome
THANK
YOU

Big data - Characteristics, types and Application

  • 1.
    NAVEEN ABRAHAM PRATEEK SABHARWAL POOJAS KUMAR RIYA ASEEF NIKIHL JOHN ABRAHAM
  • 2.
    BIG DATA Next Bigthing in the IT world Google, LinkedIn, Facebook Minimize timeCost reduction
  • 3.
    Similar to ‘smalldata’ but bigger in size Solve problems in a better way Techniques, tools and architecture Cannot be analyzed with traditional computing techniques. Eg: Walmart, Facebook
  • 4.
  • 5.
    Cost Saving Time reduction Newproduct development Understanding market conditions Control online reputation BENEFITS OF BIG DATA
  • 6.
    STORING BIG DATA Analyzingyour data characteristics • Selecting data sources for analysis • Eliminating redundant data • Establishing the role of NoSQL Overview of Big Data storage • Data models: key value, graph, document, column-family • Hadoop Distributed File System
  • 7.
    SELECTING BIG DATASTORES Choosing the correct data stores Moving code to data Implementing polyglot data store solutions Aligning business goals
  • 8.
    PROCESSING BIG DATA Integratingdisparate data stores Mapping data to the programming framework Connecting and extracting data from storage Transforming data for processing Processing Requirments Volume Velocity Variety Ambiguity Complexity Eg: Hadoop
  • 9.
    WHY BIG DATA Growthof Big Data is needed • Increase of storage capacities • Increase of processing power • Availability of data(different data types) • Every day we create 2.5 quintillion bytes of data; 90% of the data in the world today has been created in the last two years alone
  • 10.
    WHY BIG DATA FBgenerates 10TB daily Twitter generates 7TB of data Daily IBM claims 90% of today’s stored data was generated in just the last two years.
  • 12.
    Orchestration Data Sources Analytics and Reporting Data Storage Real- TimeMessage Ingestion Batch Processing Stream Processing Analytical Data Store
  • 13.
  • 14.
    TOOLS USED INBIG DATA ANALYSIS
  • 15.
  • 20.
  • 21.
  • 26.
  • 34.
    TYPES OF TOOLSUSED IN BIG DATA
  • 35.
    APPLICATION OF BIGDATA Banking – University of florida Media and Entertainment Healthcare- Taiwan tracks corona using Big data Education – University of Tasmania Manufacturing and Natural Resources- Predictive Model Government- FDA Retail and Wholesale Industry
  • 36.
    CONCLUSION Big Data: Anew competitive advantage for businesses Crucial way to outperform peers Creation of new service offerings and the design of future products Create new growth opportunities and entirely new categories of companies Various challenges to overcome
  • 37.