Future of Data - Big Data
Upcoming SlideShare
Loading in...5
×
 

Future of Data - Big Data

on

  • 1,663 views

 

Statistics

Views

Total Views
1,663
Views on SlideShare
1,663
Embed Views
0

Actions

Likes
1
Downloads
90
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Future of Data - Big Data Future of Data - Big Data Presentation Transcript

  • Future of Data : Big Data Shankar Radhakrishnan Cognizant
  • Topics How did we get here ? Data Explosion Big Data Big Data in an Enterprise Big Data Platform - Hadoop Big Data AdoptionQ&A
  • How did we get here?Familiar World Data Integration Problems  EDW  Datamarts Data Processing Problems  Familiar Problems Data warehouse Storage Management Performance Problems Limitations out of ComplexityNew World  Newer type of data to integrate  Increase in volume  Newer analytical requirements
  • Data Explosion
  • Newer Interests Social Intelligence  DBIM, Sentiment Analysis, Social Customer Care Predictive Analytics  Propensity, Price Elasticity, Anti-Fraud Analytics Segmentation Insights  Funnel Analysis, Behavioral Patterns, Cohort Analysis Mobile Analytics  Ad-Targeting, Geo-spatial Analytics
  • Categories Structured Data  Enterprise Data (CRM, ERP, Data Stores, Reference Data) Semi-structured Data  Machine Generated Data (Sensor Data, RFIDs) Unstructured Data  Social Data (Comments, Tweets), Blog posts
  • Big Data Volume Complexity Big Velocity Data Variety“Big Data” refers to high volume, velocity, variety and complex information assets thatdemand cost-effective, innovative forms of information processing for enhanced insightand decision making
  • Big Data Platforms• Data Integration o Informatica, Infosphere o talenD, Pentaho, Karmasphere, Apache Sqoop, Apache Flume• Database Framework o Hadoop (Distributions: Cloudera, Hortonworks, MapR) o Hbase o Hive• NoSQL Databases o MongoDB, CouchDB• Machine Data Processing o Splunk, Mahout• Text Analytics o Clarabridge, Lexanalytics
  • Big Data in an Enterprise Big Data Big Data ETL Sources Platform Datamarts ETL Analytical Datamarts Applications Datamarts Data ETL Data warehouse Sources
  • Hadoop - Ecosystem
  • Big Data : Adoption Drivers Cluster Distributed Platform Storage Scalable Process Availability Performance Data Augmented Integration Data Possibilities Processing TCO Ecosystem Actionable ROI Insights
  • Big Data – Adoption Scenarios Replatforming to Big Data (Hadoop, MapR) Archival Solution (Hadoop) Offloading Data warehouse, EDW (Hadoop, Hive) Social Media Integration Machine Data Analysis (Splunk, Mahout) Complex Analytical Requirements (Hbase)
  • Q&A