SlideShare a Scribd company logo
Big Data In Small Steps
By Devashish Khatwani
January 2015
The Three (or Four) Questions
1
2
3
What is Big Data and What does it have to do with my IT?
How can Big Data deliver value?
How do I implement it?
2Devashish Khatwani
3Devashish Khatwani
Question 1: What is Big Data and What does it have to do with my
IT?
VSize of the data being
processed. It can run up
to Petabytes of data for
companies such as
Google and Facebook
VSpeed at which the data
is generated. To give you
a perspective more than
90% of the data
generated in human
history was generated in
the last two years.
Banks can relate it with
the # of credit card
transactions happening
per minute
Types of data which
encompass the dataset
you need to process. It
could be structured data
coming from databases
or could be unstructured
data coming from tweets
about your company or
could be semi structured
such as the one coming
from an online feedback
form
Uncertainty of the data,
you can think of it has a
partially filled feedback
form or a tweet with
hashtags such as #YOLO
V VVOLUME VELOCITY VARIETY VERACITY
Huge amount of Data is not Big Data, Big Data is defined by
four key attributes
4Devashish Khatwani
The leaders in Big Data implementation are moving away
from the traditional technology stack
5
Monolithic Commodity
Hardware
Centralized Relational
Database
Queries (SQL)
Distributed Commodity
Hardware
Hadoop
Parallel
Relational
Database
No SQL
Database
Relational
Database
Monolithic Commodity
Hardware
Interactive
Query
Real Time
Query
Map Reduce
Data Visualization Tools
Traditional Technology
Stack
Big Data Technology
Stack
Data Storage and
Management
Data
Processing
Data
Analysis and Presentation
1 2 3
1
2
3
Devashish Khatwani
Migrating to a Big Data Technology Stack has to be gradual
so that regular reporting is not hindered
6
Monolithic Commodity
Hardware
Centralized Relational
Database
Queries (SQL)
Data Generation
Source
ETL
Process
Distributed Commodity
Hardware
Parallel
Relational
Database
No SQL
Database
Relational
Database
Monolithic Commodity
Hardware
Interactive
Query
Real Time
Query
Map Reduce
Data Visualization Tools
Hadoop
Data Generation
Source
Regular Reports
ETL
Process
Export to the existing Database
Regular Reports
Devashish Khatwani
7Devashish Khatwani
Question 2: How can Big Data deliver value?
Technology is just an enabler, in order to make money you
need to analyse your Data
8
Models to present History- This is similar to the
reporting that most companies have today with
an added layer of drill down queries and
advanced scorecard reporting
1
Models to explain the history – This would be
analysis such as segmentation analysis, sensitivity
analysis etc. which can be used to inform future
decisions or to analyse the efficacy of past
decisions
2
Models to predict the future – This maturity
level entails advanced statistical models such as
predictive analytics, simulations, optimization and
machine learning
3
• Use Big Data to build models
which may predict the future
• Use historical data and/or
experiments to validate the
models
• Make business decisions based
on these models to get your ROI
IncreasingComplexityandMaturity
Devashish Khatwani
Big Data Use Cases: Insurance Industry can use Big Data for increasing
customer loyalty, actuarial risk management and increase the efficiency of claims
function
9
Increasing Customer Loyalty: By running a real time sentiment analysis on various social media platforms, emails ,
chats, website etc. Insurance providers can develop custom response approaches to known behaviour patterns. For
instance by analyzing the website activity of the users and correlating it with the subsequent calls made to the call
centre the insurance provider can predict the nature of the incoming call and develop a custom response to the
situation.
1
Acturial Risk Management: Current auto insurance premiums are based on credit scores of individuals,
demographic variables and vehicle classifications. Insurance companies can extract driving patterns through the
use of telematics and use this data to offer better premiums to its customers. This will not only help the insurance
provider to finely segment the customer base but also alleviate the problem of moral hazard
2
Increasing the efficiency of Claims function: With the text analysis of previous claims, a claims officer is able to
cross reference similar claims more quickly and can speed up the claims process. Text analysis of various claims
can reveal patterns and combining them with demographic and behavioral data can generate cohorts which the
insurance provider can use to avoid frauds
3
Devashish Khatwani
Big Data Use Cases: Telecom Industry can use Big Data for better
segmentation, optimize capacity planning and optimizing promotional spend
10
Better Segmentation: Usually segmentation is based on demographic, geographic, behavioral and psychographic
attributes but with Big Data a telecom provider can start micro-segmenting with adding on a layer of:
• Activity based data( website tracking, purchase history, call centre data, mobile usage data, response to
incentives)
• Social Network Profile
• Social Influence and sentiment data
1
Optimize Capacity Planning: Network capacities, workforce capacity etc. can be optimized based on analysis of
historical data. For instance instead of using aggregated time series forecasting for the number of calls received by
the call centres , the telecom provider can use time series forecasting at customer segment level and then
aggregate the forecast to generate a more accurate prediction
2
Optimizing Promotional Spend: The ROI of each email campaign, social media campaign etc. can be calculated and
an optimized mix of promotion methods can be generated. Furthermore factors such as date, time, text of the
campaign can be analyzed for finding out the best combination through experiments
3
Devashish Khatwani
11Devashish Khatwani
Question 3: How do I implement it?
Establishing a Centre of Excellence for Big Data implementation is
the fastest way to Big Data Success
12
Corporate
Business UnitCOE
Big Data
Project
Option 1: Internal Consulting Option 2: Centralized Option 3: Centre of Excellence
Corporate
Business UnitCOE
Big Data
Project
Corporate
Business UnitCOE
Big Data
Project
Analytics team
This setup treats the COE as
internal team of experts which can
be called upon by the Business Unit
for projects. The onus of initiating a
Big Data project lies with the
Business Unit. The COE can be
treated either as a cost centre or a
profit centre under this structure
Under this setup COE identifies and
executes the Big Data initiative with
support from the business unit.
The onus of identifying a viable Big
Data initiative rests with the COE.
The resources in the COE must be
well versed with Business Unit’s
business for this structure to be
effective
Under this structure the COE is a
small organization with very
specialized Big Data skills and the
Business Unit itself is well versed
with basic analytics capabilities.
This structure works well for
organization who have historically
be analytically savvy and have taken
data driven decisions in the past
Devashish Khatwani
13
Big Data Initiatives need to be championed by CXOs in order for
them to have maximum impact
Source: LEAP Study 2014 by AT Kearney and Carnegie Melon University
Devashish Khatwani
You need seven types of people for
your Big Data Initiative
14
1
5 6
7
3
24
1
2
3
4
5
6
7
Executive Leader
Project Manager
Internal Trainer
External Liaison
Data Technologist
Data Scientist
Data Analyst
CORE
Devashish Khatwani
The Data Scientist is the person who
will have the statistical know how of
coding and performing statistical
analysis such as clustering, predictive
analytics, Sentiment Analysis, Machine
learning etc. He is the most important
link of converting data to actionable
insights. Some of the skills that a data
scientist should posses are:
1. Programming experience in
Python, Java, R and SQL
2. Knowledge of data mining,
machine learning and statistical
methods
3. Experience working with
relational databases
The Data Analyst is responsible for
brainstorming for different models
which need to be studied and
statistically validated by the Data
Scientist . He is also responsible for
calculating the dollar impact of actions
taken based on big data insights
Some of the skills that a data analyst
has to be familiar with is basic level
statistics and sound understanding of
product/function for which the Big
Data initiative is being run. For
instance if you are running a
sentiment analysis on Social Media
then the business analyst should be an
expert in social media marketing
The three people who form the core of your Big Data Initiative:
Data Technologist, Data Scientist and Data Analyst
15
The Data Technologist is responsible
for identifying the data sources of the
organization and should be able to
work on different aspects of data
management such as data
1. Data Governance
2. Data Architecture
3. Data Quality
4. Data Security
5. Data Warehousing
6. Data Availability
Data Technologist Data Scientist Data Analyst
Devashish Khatwani
16
Prototyping and then developing a repeatable solution is the best
way for extracting value from Big Data
Source: LEAP Study 2014 by AT Kearney and Carnegie Melon University
Devashish Khatwani
17
Getting Started on your Big Data journey
Source: Deloitte, Big Data An Insurance business imperative
Devashish Khatwani
Devashish Khatwani
Thank You
Devashish Khatwani
B Tech – Electrical Engineering, IIT Roorkee
MBA – Rotman School of Management
Devashish.khatwani@gmail.com
18

More Related Content

What's hot

Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
SSaudia
 
Introduction to Business Anlytics and Strategic Landscape
Introduction to Business Anlytics and Strategic LandscapeIntroduction to Business Anlytics and Strategic Landscape
Introduction to Business Anlytics and Strategic Landscape
Rani Channamma University, Sangolli Rayanna First Grade Constituent College, Belagavi
 
Predictive analytics 2025_br
Predictive analytics 2025_brPredictive analytics 2025_br
Predictive analytics 2025_br
Shubhashish Biswas
 
Simplify Your Analytics Strategy
Simplify Your Analytics StrategySimplify Your Analytics Strategy
Simplify Your Analytics Strategy
KASHISH MUKHEJA
 
Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure Cloud
Microsoft Canada
 
Risk mgmt-analysis-wp-326822
Risk mgmt-analysis-wp-326822Risk mgmt-analysis-wp-326822
Risk mgmt-analysis-wp-326822
Shubhashish Biswas
 
Introduction to Business Data Analytics
Introduction to Business Data AnalyticsIntroduction to Business Data Analytics
Introduction to Business Data Analytics
VadivelM9
 
Do's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven MarketingDo's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven Marketing
SparkPost
 
Digital analytics project
Digital analytics projectDigital analytics project
Digital analytics project
Julie George
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
Shikhar Gupta
 
Predictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advicePredictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advice
The Marketing Distillery
 
Elsevier
ElsevierElsevier
Elsevier
Christina Azzam
 
Does big data really mean better decisions?
Does big data really mean better decisions?Does big data really mean better decisions?
Does big data really mean better decisions?
Intelligent Software Solutions
 
What is business analytics
What is business analyticsWhat is business analytics
What is business analytics
Sherpa Consulting
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websites
Bhanu Prakash
 
1305 track 3 siegel
1305 track 3 siegel1305 track 3 siegel
1305 track 3 siegel
Rising Media, Inc.
 
Analytical thinking 9 - July 2012
Analytical thinking 9 - July 2012Analytical thinking 9 - July 2012
Analytical thinking 9 - July 2012
Charlotte Skornik
 
Talent Analytics
Talent AnalyticsTalent Analytics
Talent Analytics
Aniket Joshi
 
ForresterPredictiveWave
ForresterPredictiveWaveForresterPredictiveWave
ForresterPredictiveWave
Timothy M. Caffrey, MBA
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White Paper
Robertson Executive Search
 

What's hot (20)

Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Introduction to Business Anlytics and Strategic Landscape
Introduction to Business Anlytics and Strategic LandscapeIntroduction to Business Anlytics and Strategic Landscape
Introduction to Business Anlytics and Strategic Landscape
 
Predictive analytics 2025_br
Predictive analytics 2025_brPredictive analytics 2025_br
Predictive analytics 2025_br
 
Simplify Your Analytics Strategy
Simplify Your Analytics StrategySimplify Your Analytics Strategy
Simplify Your Analytics Strategy
 
Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure Cloud
 
Risk mgmt-analysis-wp-326822
Risk mgmt-analysis-wp-326822Risk mgmt-analysis-wp-326822
Risk mgmt-analysis-wp-326822
 
Introduction to Business Data Analytics
Introduction to Business Data AnalyticsIntroduction to Business Data Analytics
Introduction to Business Data Analytics
 
Do's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven MarketingDo's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven Marketing
 
Digital analytics project
Digital analytics projectDigital analytics project
Digital analytics project
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
 
Predictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advicePredictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advice
 
Elsevier
ElsevierElsevier
Elsevier
 
Does big data really mean better decisions?
Does big data really mean better decisions?Does big data really mean better decisions?
Does big data really mean better decisions?
 
What is business analytics
What is business analyticsWhat is business analytics
What is business analytics
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websites
 
1305 track 3 siegel
1305 track 3 siegel1305 track 3 siegel
1305 track 3 siegel
 
Analytical thinking 9 - July 2012
Analytical thinking 9 - July 2012Analytical thinking 9 - July 2012
Analytical thinking 9 - July 2012
 
Talent Analytics
Talent AnalyticsTalent Analytics
Talent Analytics
 
ForresterPredictiveWave
ForresterPredictiveWaveForresterPredictiveWave
ForresterPredictiveWave
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White Paper
 

Similar to Big Data

Barry Ooi; Big Data lookb4YouLeap
Barry Ooi; Big Data lookb4YouLeapBarry Ooi; Big Data lookb4YouLeap
Barry Ooi; Big Data lookb4YouLeap
Barry Ooi
 
Business Analytics Unit III: Developing analytical talent
Business Analytics Unit III: Developing analytical talentBusiness Analytics Unit III: Developing analytical talent
Business Analytics Unit III: Developing analytical talent
Rani Channamma University, Sangolli Rayanna First Grade Constituent College, Belagavi
 
Sit717 enterprise business intelligence 2019 t2 copy1
Sit717 enterprise business intelligence 2019 t2 copy1Sit717 enterprise business intelligence 2019 t2 copy1
Sit717 enterprise business intelligence 2019 t2 copy1
NellutlaKishore
 
The big data strategy using social media
The big data strategy using social mediaThe big data strategy using social media
The big data strategy using social media
Vaibhav Thombre
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
Soumyadeep Sengupta
 
Introduction to visualizing Big Data
Introduction to visualizing Big DataIntroduction to visualizing Big Data
Introduction to visualizing Big Data
Dawit Nida
 
BIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICSBIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICS
Vikram Joshi
 
Thebigdatastrategyusingsocialmedia 140126142538-phpapp01
Thebigdatastrategyusingsocialmedia 140126142538-phpapp01Thebigdatastrategyusingsocialmedia 140126142538-phpapp01
Thebigdatastrategyusingsocialmedia 140126142538-phpapp01
聪 徐
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
Is Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big DataIs Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big Data
himanshu13jun
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
Shaun Kollannur
 
Better Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data AnalyticsBetter Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data Analytics
IBM Software India
 
IRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum RevolutionIRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum Revolution
IRJET Journal
 
Lecture3 business intelligence
Lecture3 business intelligenceLecture3 business intelligence
Lecture3 business intelligence
hktripathy
 

Similar to Big Data (20)

Barry Ooi; Big Data lookb4YouLeap
Barry Ooi; Big Data lookb4YouLeapBarry Ooi; Big Data lookb4YouLeap
Barry Ooi; Big Data lookb4YouLeap
 
Business Analytics Unit III: Developing analytical talent
Business Analytics Unit III: Developing analytical talentBusiness Analytics Unit III: Developing analytical talent
Business Analytics Unit III: Developing analytical talent
 
Sit717 enterprise business intelligence 2019 t2 copy1
Sit717 enterprise business intelligence 2019 t2 copy1Sit717 enterprise business intelligence 2019 t2 copy1
Sit717 enterprise business intelligence 2019 t2 copy1
 
The big data strategy using social media
The big data strategy using social mediaThe big data strategy using social media
The big data strategy using social media
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
 
Introduction to visualizing Big Data
Introduction to visualizing Big DataIntroduction to visualizing Big Data
Introduction to visualizing Big Data
 
BIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICSBIG DATA & BUSINESS ANALYTICS
BIG DATA & BUSINESS ANALYTICS
 
Thebigdatastrategyusingsocialmedia 140126142538-phpapp01
Thebigdatastrategyusingsocialmedia 140126142538-phpapp01Thebigdatastrategyusingsocialmedia 140126142538-phpapp01
Thebigdatastrategyusingsocialmedia 140126142538-phpapp01
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
Is Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big DataIs Your Company Braced Up for handling Big Data
Is Your Company Braced Up for handling Big Data
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
 
Better Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data AnalyticsBetter Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data Analytics
 
IRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum RevolutionIRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum Revolution
 
Lecture3 business intelligence
Lecture3 business intelligenceLecture3 business intelligence
Lecture3 business intelligence
 

Big Data

  • 1. Big Data In Small Steps By Devashish Khatwani January 2015
  • 2. The Three (or Four) Questions 1 2 3 What is Big Data and What does it have to do with my IT? How can Big Data deliver value? How do I implement it? 2Devashish Khatwani
  • 3. 3Devashish Khatwani Question 1: What is Big Data and What does it have to do with my IT?
  • 4. VSize of the data being processed. It can run up to Petabytes of data for companies such as Google and Facebook VSpeed at which the data is generated. To give you a perspective more than 90% of the data generated in human history was generated in the last two years. Banks can relate it with the # of credit card transactions happening per minute Types of data which encompass the dataset you need to process. It could be structured data coming from databases or could be unstructured data coming from tweets about your company or could be semi structured such as the one coming from an online feedback form Uncertainty of the data, you can think of it has a partially filled feedback form or a tweet with hashtags such as #YOLO V VVOLUME VELOCITY VARIETY VERACITY Huge amount of Data is not Big Data, Big Data is defined by four key attributes 4Devashish Khatwani
  • 5. The leaders in Big Data implementation are moving away from the traditional technology stack 5 Monolithic Commodity Hardware Centralized Relational Database Queries (SQL) Distributed Commodity Hardware Hadoop Parallel Relational Database No SQL Database Relational Database Monolithic Commodity Hardware Interactive Query Real Time Query Map Reduce Data Visualization Tools Traditional Technology Stack Big Data Technology Stack Data Storage and Management Data Processing Data Analysis and Presentation 1 2 3 1 2 3 Devashish Khatwani
  • 6. Migrating to a Big Data Technology Stack has to be gradual so that regular reporting is not hindered 6 Monolithic Commodity Hardware Centralized Relational Database Queries (SQL) Data Generation Source ETL Process Distributed Commodity Hardware Parallel Relational Database No SQL Database Relational Database Monolithic Commodity Hardware Interactive Query Real Time Query Map Reduce Data Visualization Tools Hadoop Data Generation Source Regular Reports ETL Process Export to the existing Database Regular Reports Devashish Khatwani
  • 7. 7Devashish Khatwani Question 2: How can Big Data deliver value?
  • 8. Technology is just an enabler, in order to make money you need to analyse your Data 8 Models to present History- This is similar to the reporting that most companies have today with an added layer of drill down queries and advanced scorecard reporting 1 Models to explain the history – This would be analysis such as segmentation analysis, sensitivity analysis etc. which can be used to inform future decisions or to analyse the efficacy of past decisions 2 Models to predict the future – This maturity level entails advanced statistical models such as predictive analytics, simulations, optimization and machine learning 3 • Use Big Data to build models which may predict the future • Use historical data and/or experiments to validate the models • Make business decisions based on these models to get your ROI IncreasingComplexityandMaturity Devashish Khatwani
  • 9. Big Data Use Cases: Insurance Industry can use Big Data for increasing customer loyalty, actuarial risk management and increase the efficiency of claims function 9 Increasing Customer Loyalty: By running a real time sentiment analysis on various social media platforms, emails , chats, website etc. Insurance providers can develop custom response approaches to known behaviour patterns. For instance by analyzing the website activity of the users and correlating it with the subsequent calls made to the call centre the insurance provider can predict the nature of the incoming call and develop a custom response to the situation. 1 Acturial Risk Management: Current auto insurance premiums are based on credit scores of individuals, demographic variables and vehicle classifications. Insurance companies can extract driving patterns through the use of telematics and use this data to offer better premiums to its customers. This will not only help the insurance provider to finely segment the customer base but also alleviate the problem of moral hazard 2 Increasing the efficiency of Claims function: With the text analysis of previous claims, a claims officer is able to cross reference similar claims more quickly and can speed up the claims process. Text analysis of various claims can reveal patterns and combining them with demographic and behavioral data can generate cohorts which the insurance provider can use to avoid frauds 3 Devashish Khatwani
  • 10. Big Data Use Cases: Telecom Industry can use Big Data for better segmentation, optimize capacity planning and optimizing promotional spend 10 Better Segmentation: Usually segmentation is based on demographic, geographic, behavioral and psychographic attributes but with Big Data a telecom provider can start micro-segmenting with adding on a layer of: • Activity based data( website tracking, purchase history, call centre data, mobile usage data, response to incentives) • Social Network Profile • Social Influence and sentiment data 1 Optimize Capacity Planning: Network capacities, workforce capacity etc. can be optimized based on analysis of historical data. For instance instead of using aggregated time series forecasting for the number of calls received by the call centres , the telecom provider can use time series forecasting at customer segment level and then aggregate the forecast to generate a more accurate prediction 2 Optimizing Promotional Spend: The ROI of each email campaign, social media campaign etc. can be calculated and an optimized mix of promotion methods can be generated. Furthermore factors such as date, time, text of the campaign can be analyzed for finding out the best combination through experiments 3 Devashish Khatwani
  • 11. 11Devashish Khatwani Question 3: How do I implement it?
  • 12. Establishing a Centre of Excellence for Big Data implementation is the fastest way to Big Data Success 12 Corporate Business UnitCOE Big Data Project Option 1: Internal Consulting Option 2: Centralized Option 3: Centre of Excellence Corporate Business UnitCOE Big Data Project Corporate Business UnitCOE Big Data Project Analytics team This setup treats the COE as internal team of experts which can be called upon by the Business Unit for projects. The onus of initiating a Big Data project lies with the Business Unit. The COE can be treated either as a cost centre or a profit centre under this structure Under this setup COE identifies and executes the Big Data initiative with support from the business unit. The onus of identifying a viable Big Data initiative rests with the COE. The resources in the COE must be well versed with Business Unit’s business for this structure to be effective Under this structure the COE is a small organization with very specialized Big Data skills and the Business Unit itself is well versed with basic analytics capabilities. This structure works well for organization who have historically be analytically savvy and have taken data driven decisions in the past Devashish Khatwani
  • 13. 13 Big Data Initiatives need to be championed by CXOs in order for them to have maximum impact Source: LEAP Study 2014 by AT Kearney and Carnegie Melon University Devashish Khatwani
  • 14. You need seven types of people for your Big Data Initiative 14 1 5 6 7 3 24 1 2 3 4 5 6 7 Executive Leader Project Manager Internal Trainer External Liaison Data Technologist Data Scientist Data Analyst CORE Devashish Khatwani
  • 15. The Data Scientist is the person who will have the statistical know how of coding and performing statistical analysis such as clustering, predictive analytics, Sentiment Analysis, Machine learning etc. He is the most important link of converting data to actionable insights. Some of the skills that a data scientist should posses are: 1. Programming experience in Python, Java, R and SQL 2. Knowledge of data mining, machine learning and statistical methods 3. Experience working with relational databases The Data Analyst is responsible for brainstorming for different models which need to be studied and statistically validated by the Data Scientist . He is also responsible for calculating the dollar impact of actions taken based on big data insights Some of the skills that a data analyst has to be familiar with is basic level statistics and sound understanding of product/function for which the Big Data initiative is being run. For instance if you are running a sentiment analysis on Social Media then the business analyst should be an expert in social media marketing The three people who form the core of your Big Data Initiative: Data Technologist, Data Scientist and Data Analyst 15 The Data Technologist is responsible for identifying the data sources of the organization and should be able to work on different aspects of data management such as data 1. Data Governance 2. Data Architecture 3. Data Quality 4. Data Security 5. Data Warehousing 6. Data Availability Data Technologist Data Scientist Data Analyst Devashish Khatwani
  • 16. 16 Prototyping and then developing a repeatable solution is the best way for extracting value from Big Data Source: LEAP Study 2014 by AT Kearney and Carnegie Melon University Devashish Khatwani
  • 17. 17 Getting Started on your Big Data journey Source: Deloitte, Big Data An Insurance business imperative Devashish Khatwani
  • 18. Devashish Khatwani Thank You Devashish Khatwani B Tech – Electrical Engineering, IIT Roorkee MBA – Rotman School of Management Devashish.khatwani@gmail.com 18