• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Future of Data - Big Data

Future of Data - Big Data






Total Views
Views on SlideShare
Embed Views



0 Embeds 0

No embeds



Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
Post Comment
Edit your comment

    Future of Data - Big Data Future of Data - Big Data Presentation Transcript

    • Future of Data : Big Data Shankar Radhakrishnan Cognizant
    • Topics How did we get here ? Data Explosion Big Data Big Data in an Enterprise Big Data Platform - Hadoop Big Data AdoptionQ&A
    • How did we get here?Familiar World Data Integration Problems  EDW  Datamarts Data Processing Problems  Familiar Problems Data warehouse Storage Management Performance Problems Limitations out of ComplexityNew World  Newer type of data to integrate  Increase in volume  Newer analytical requirements
    • Data Explosion
    • Newer Interests Social Intelligence  DBIM, Sentiment Analysis, Social Customer Care Predictive Analytics  Propensity, Price Elasticity, Anti-Fraud Analytics Segmentation Insights  Funnel Analysis, Behavioral Patterns, Cohort Analysis Mobile Analytics  Ad-Targeting, Geo-spatial Analytics
    • Categories Structured Data  Enterprise Data (CRM, ERP, Data Stores, Reference Data) Semi-structured Data  Machine Generated Data (Sensor Data, RFIDs) Unstructured Data  Social Data (Comments, Tweets), Blog posts
    • Big Data Volume Complexity Big Velocity Data Variety“Big Data” refers to high volume, velocity, variety and complex information assets thatdemand cost-effective, innovative forms of information processing for enhanced insightand decision making
    • Big Data Platforms• Data Integration o Informatica, Infosphere o talenD, Pentaho, Karmasphere, Apache Sqoop, Apache Flume• Database Framework o Hadoop (Distributions: Cloudera, Hortonworks, MapR) o Hbase o Hive• NoSQL Databases o MongoDB, CouchDB• Machine Data Processing o Splunk, Mahout• Text Analytics o Clarabridge, Lexanalytics
    • Big Data in an Enterprise Big Data Big Data ETL Sources Platform Datamarts ETL Analytical Datamarts Applications Datamarts Data ETL Data warehouse Sources
    • Hadoop - Ecosystem
    • Big Data : Adoption Drivers Cluster Distributed Platform Storage Scalable Process Availability Performance Data Augmented Integration Data Possibilities Processing TCO Ecosystem Actionable ROI Insights
    • Big Data – Adoption Scenarios Replatforming to Big Data (Hadoop, MapR) Archival Solution (Hadoop) Offloading Data warehouse, EDW (Hadoop, Hive) Social Media Integration Machine Data Analysis (Splunk, Mahout) Complex Analytical Requirements (Hbase)
    • Q&A