S ANAND, CHIEF DATA SCIENTIST, GRAMENER
MONETISING DATA
REMOVING YOUR MENTAL HURDLES
DATA
ANALYSIS VISUALS EXPLORATION
IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE DATA
“ We have internal
information. Getting
information from outside is
our challenge. There’s no way
of doing that.
– Senior Editor
Leading Media Company
India’s religions
United Kingdom’s religions
UNCOVER YOUR DARK DATA
Source: http://www.patrickcheesman.com/dark-data-problems-and-solutions/
• INACCESSIBLE data (e.g. technology is outdated)
• FORGOTTEN data (e.g. collected, but not actively used)
• UNCOLLECTED data (e.g. information exists, not digitized)
• SINGLE PURPOSE data (e.g. used for a specific purpose)
We’ve used network diagrams to detect terrorism, corporate fraud, product
affinities and behavioural customer segmentation
AUGMENT YOUR
DATA
SOURCES
DATA IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE DATA
COMMON COMPLAINT #2
THE DATA ISN’T STRUCTURED
CRM DATA
SALES DATA
PRICING DATA
CALL RECORDS
WEB LOG DATA
VENDOR INVOICES
SOCIAL MEDIA DATA
CLICKTHROUGH DATA
COMPETITOR RESEARCH
CUSTOMER TRANSACTIONS
…
CENSUS DATA
E-COMMERCE PRICES
COMMODITY PRICES
STOCK MARKET DATA
FINANCIAL REPORTING
SOCIAL MEDIA DATA
MOBILE PENETRATION
AADHAR DATA
COURT CASE BRIEFS
SHAPE FILES
…
How does the Mahabharata, one of the largest epics at 1.8
million words, lend itself to text analytics?
Can this ‘unstructured data’ be processed to extract
analytical insights?
What does sentiment analysis of this tome convey?
Is there a better way to explore relations between
characters?
How can closeness of characters be analysed & visualized?
Visualising the Mahabharata
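One simple way to approximate the closeness of characters is to count how often two names appear in the same passage. The sketch below assumes that approach; it is not necessarily the method used for the Mahabharata visualisation, and the passages and the `cooccurrence` helper are made up for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence(passages, characters):
    """Count how often each pair of characters is named in the same passage."""
    pairs = Counter()
    for passage in passages:
        text = passage.lower()
        # Sort so that each pair is always stored in the same order.
        present = sorted(name for name in characters if name.lower() in text)
        pairs.update(combinations(present, 2))
    return pairs


# Toy passages written for this illustration, not quotations from the epic.
passages = [
    "Arjuna asked Krishna for counsel.",
    "Krishna answered Arjuna at length.",
    "Bhima challenged Duryodhana.",
]
characters = ["Arjuna", "Krishna", "Bhima", "Duryodhana"]
closeness = cooccurrence(passages, characters)
```

The resulting pair counts can serve as edge weights in a network diagram of characters.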
“ Can we help CFOs
understand what questions
are being asked by
investors and analysts
during earnings releases?
How is this different from
the competition?
– Product Head
Global Financial
Services Firm
WHAT DO FINANCIAL ANALYSTS ASK IBM VS
MSFT?
DATA IS
EVERYWHERE
EXTRACT THE
META DATA
AUGMENT YOUR
DATA
SOURCES
COMMON COMPLAINT #2
THE DATA ISN’T STRUCTURED
COMMON COMPLAINT #3
THE DATA ISN’T RICH / CLEAN
COMMON: WHO, WHAT, WHEN, WHERE
TEXT: TEXT KEYWORDS, SENTIMENT
IMAGE: VISUAL RECOGNITION
AUDIO / CALLS: TRANSCRIPTS, MOOD ANALYSIS
“ Can we get the results of
every single election in
history, and create a portal
to visualize these results?
– Rajdeep Sardesai
CNN-IBN
The PDF files have a reasonably clear structure
… that translates into text that can be parsed
Not every spelling error is easily identifiable by the first letter
… with several names spelt wrong
These are, in fact, two
different constituencies
But these are exactly
the same
... and so are these
I’ve no idea if these are
2, or 3, constituencies!
… with the ability for the system to correct errors automatically
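A minimal sketch of how misspelt names might be folded into one canonical spelling, assuming fuzzy string matching with the standard library's `difflib.get_close_matches`. The `canonicalise` helper, the 0.9 cutoff, and the sample names are illustrative assumptions, not the system described in the slides.

```python
from difflib import get_close_matches


def canonicalise(names, cutoff=0.9):
    """Map each name to the first earlier spelling it closely resembles."""
    canonical = []  # spellings accepted so far
    mapping = {}
    for name in names:
        key = name.strip().upper()
        match = get_close_matches(key, canonical, n=1, cutoff=cutoff)
        if match:
            mapping[name] = match[0]  # treat as a misspelling of a known name
        else:
            canonical.append(key)  # treat as a new, distinct name
            mapping[name] = key
    return mapping


# Illustrative names only; the middle one drops a letter.
names = ["BANGALORE NORTH", "BANGALORE NOTH", "BANGALORE SOUTH"]
mapping = canonicalise(names)
```

The cutoff matters: a looser threshold would start merging names such as NORTH and SOUTH, which are genuinely different constituencies, so borderline cases still need a human check.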
DATA IS
EVERYWHERE
TRANSFORM THE DATA &
ENRICH IT
EXTRACT THE
META DATA
AUGMENT YOUR
DATA
SOURCES
COMMON COMPLAINT #3
THE DATA ISN’T RICH / CLEAN
DATA
ANALYSIS VISUALS EXPLORATION
IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE THE TOOLS
This is a dataset (1975 – 1990) that has
been around for several years, and has
been studied extensively. Yet, a
visualization can reveal patterns that
are neither obvious nor well known.
For example,
• Are birthdays uniformly distributed?
• Do doctors or parents exercise the C-section option to move dates?
• Is there any day of the month that has unusually high or low births?
• Are there any months with relatively high or low births?
More births Fewer births … on average, for each day of the year (from 1975 to 1990)
LET’S LOOK AT 15 YEARS OF US BIRTH DATA
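The "on average, for each day of the year" view can be sketched as a simple aggregation: group daily birth counts by (month, day) and average across years. The function name and the counts below are hypothetical; real figures would come from the birth dataset.

```python
from collections import defaultdict
from datetime import date


def average_by_day_of_year(daily_counts):
    """Average a {date: count} mapping into {(month, day): mean count}."""
    totals = defaultdict(lambda: [0, 0])  # (month, day) -> [sum, years seen]
    for day, count in daily_counts.items():
        cell = totals[(day.month, day.day)]
        cell[0] += count
        cell[1] += 1
    return {key: total / years for key, (total, years) in totals.items()}


# Made-up counts for illustration.
daily_counts = {
    date(1975, 7, 4): 100,
    date(1976, 7, 4): 120,
    date(1975, 7, 5): 200,
}
averages = average_by_day_of_year(daily_counts)
```

Colouring a month-by-day grid with these averages gives the kind of "more births / fewer births" heatmap the slide refers to.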
THE PATTERN IN INDIA IS QUITE DIFFERENT
This is a birth date dataset that’s
obtained from school admission data
for over 10 million children. When we
compare this with births in the US, we
see none of the same patterns.
For example,
• Is there an aversion to the 13th or is there a local cultural nuance?
• Are holidays avoided for births?
• Which months have a higher propensity for births, and why?
• Are there any patterns not found in the US data?
More births Fewer births … on average, for each day of the year (from 2007 to 2013)
THIS ADVERSELY IMPACTS CHILDREN’S MARKS
It’s a well-established fact that older
children tend to do better at school in
most activities. Since many children
have had their birth dates brought
forward, these younger children suffer.
Children “born” on the 1st, 5th, 10th, 15th etc. of the month tend to
score lower marks on average.
• Are holidays avoided for births?
• Which months have a higher propensity for births, and why?
• Are there any patterns not found in the US data?
Higher marks Lower marks … on average, for children born on a given day of the year (from 2007 to 2013)
DEPLOY
MODERN
TOOLS
ANALYSIS IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE THE TOOLS
COMMON COMPLAINT #2
WE DON’T GET INSIGHTS
R
SAS
EXCEL
PYTHON
DATABASES
ML SERVICES
RESTAURANT FOUND AN UNUSUAL DIP IN
SALES
A restaurant chain had data for every
single transaction made over a few
years. Plotting this as a time series
showed them nothing unusual.
However, the same data on a calendar
map reveals a very different story.
Specifically, at the bottom-left point-of-sale terminal, sales dip every
Wednesday. At the bottom-right point-of-sale terminal, sales rise
every Wednesday (almost as if to compensate for the loss).
It turns out that the manager closes the bottom-left counter every
Wednesday afternoon due to shortage of staff, assuming that it results in
no loss of sales. There is, however, a net loss every Wednesday.
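A calendar map works because it lines up the same weekday across weeks. The same idea can be sketched without a chart by averaging daily sales per terminal and weekday. The terminal name and amounts below are invented for illustration, and `weekday_profile` is a hypothetical helper.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def weekday_profile(sales):
    """Average daily sales per (terminal, weekday) from (date, terminal, amount) rows."""
    daily = defaultdict(float)
    for day, terminal, amount in sales:
        daily[(terminal, day)] += amount  # total per terminal per day
    buckets = defaultdict(list)
    for (terminal, day), total in daily.items():
        buckets[(terminal, WEEKDAYS[day.weekday()])].append(total)
    return {key: mean(values) for key, values in buckets.items()}


# Invented transactions: 2 and 9 Jan 2024 are Tuesdays, 3 and 10 Jan are Wednesdays.
sales = [
    (date(2024, 1, 2), "left", 100), (date(2024, 1, 3), "left", 40),
    (date(2024, 1, 9), "left", 110), (date(2024, 1, 10), "left", 50),
]
profile = weekday_profile(sales)
```

A weekday that is consistently lower for one terminal, as Wednesday is here, stands out immediately, whereas a plain time series tends to hide it.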
DEPLOY
MODERN
TOOLS
ANALYSIS IS
EVERYWHERE
TEST DATASETS
ANONYMISATION
EVALUATION CRITERIA
IMPROVEMENT METRIC
DATA INFRASTRUCTURE
MODEL INFRASTRUCTURE
VISUALS INFRASTRUCTURE
SET UP AN ML PLATFORM
INFRASTRUCTURE FOR
RAPIDITY
COMMON COMPLAINT #2
MODELS ARE COMPLICATED
COMMON COMPLAINT #3
IMPLEMENTATIONS ARE SLOW
Nation-wide statistics on
behaviour and performance of students
Over 1,000 questions each administered to
several lakhs of students across the country
Having books improves reading ability
Having more books at home improves the performance of children when it
comes to reading. (But children typically have only 1-10 books at home.)
… but the impact in social is less
While having more books improves the reading % score by 8%, it only
increases the social % by 4%
Tuitions help very little
… but children of illiterate parents do
worse
Watching TV occasionally is good
Children who watch TV
every day don’t do as well
as children who watch TV
only once a week.
But children who never
watch TV fare the worst.
Watching TV every day
helps improve children’s
reading ability a little bit
more…
… but mathematical
abilities fall dramatically at
that point
Having educated parents helps most
This table shows the % improvement in score due to each factor
THIS TECHNIQUE CAN BE
APPLIED TO ANY DATASET
AUTOMATING ANALYSIS IN POULTRY FARMING
We group by every
input factor
… and calculate the
impact on every metric.
By moving from average to the best
group, what’s the improvement?
The actual performance
by each group is shown
0-3m 3-6m 6m-1yr 1-2 yrs > 2 yrs
11 12.3 12.7 15.3 16.1
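The group-means technique described above (group by every input factor, compute every metric, and compare the best group with the average) can be sketched as follows. This is an illustration under assumed column names and made-up rows, not the product's implementation.

```python
from collections import defaultdict
from statistics import mean


def best_group_gain(rows, factor, metric):
    """Return (best group, % gain of that group's mean over the overall mean)."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[factor]].append(row[metric])
    overall = mean(row[metric] for row in rows)
    best, best_mean = max(
        ((group, mean(values)) for group, values in groups.items()),
        key=lambda item: item[1],
    )
    return best, 100 * (best_mean - overall) / overall


def scan(rows, factors, metrics):
    """Apply best_group_gain to every (factor, metric) combination."""
    return {(f, m): best_group_gain(rows, f, m) for f in factors for m in metrics}


# Hypothetical survey rows; the column names are assumptions.
rows = [
    {"books": "few", "tv": "daily", "reading": 40},
    {"books": "few", "tv": "weekly", "reading": 50},
    {"books": "many", "tv": "daily", "reading": 60},
    {"books": "many", "tv": "weekly", "reading": 70},
]
impact = scan(rows, factors=["books", "tv"], metrics=["reading"])
```

Sorting the result by gain ranks the factors by how much moving from the average to the best group could improve each metric. A real analysis would also test whether each difference is statistically significant before showing it.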
Our product can create visualisations from data automatically, without any supervision.
Above is an example. Irrespective of the dataset, this visual shows which input parameters
have a significant impact on the output. Another such example is the cluster scatterplot.
Only significant results shown
68% correlation
between AUD & EUR
Plot of 6 month daily
AUD - EUR values
Block of correlated
currencies
… clustered
hierarchically
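The correlation figures behind such a plot can be sketched with a plain Pearson correlation over every pair of series. The series names and values below are invented, and the hierarchical clustering step that orders the blocks is not shown.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def correlation_pairs(series):
    """Correlation for every unordered pair of named series."""
    return {
        (a, b): pearson(series[a], series[b])
        for a, b in combinations(sorted(series), 2)
    }


# Invented daily values, not real exchange rates.
series = {
    "AUD": [1, 2, 3, 4],
    "EUR": [2, 4, 6, 8],
    "XYZ": [4, 3, 2, 1],
}
correlations = correlation_pairs(series)
```

Pairs with a correlation near +1 or -1 are the candidates that end up in the same block once the matrix is reordered by a clustering step.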
Restaurant: Product Sales Correlation
Restaurant: Product sales correlation
DEPLOY
MODERN
TOOLS
ANALYSIS IS
EVERYWHERE
CLUSTER PLOTS
CORRELATIONS
CROSS TABULATION
GROUP MEANS
KEYWORD EXTRACTION
NETWORK ANALYSIS
SANKEY DRILLDOWNS
SENTIMENT ANALYSIS
…
INFRASTRUCTURE FOR
RAPIDITY
COMMON COMPLAINT #3
IMPLEMENTATIONS ARE SLOW
BUILD AND USE
TEMPLATES
DATA
ANALYSIS VISUALS EXPLORATION
IS
EVERYWHERE
S ANAND, CHIEF DATA SCIENTIST, GRAMENER
THE CAPABILITIES ARE
IN YOUR REACH TODAY
EXPLORE THE ART OF DATA

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 

Recently uploaded (20)

Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 

Data monetization

  • 1. S ANAND, CHIEF DATA SCIENTIST, GRAMENER MONETISING DATA REMOVING YOUR MENTAL HURDLES
  • 4. “We have internal information. Getting information from outside is our challenge. There’s no way of doing that.” – Senior Editor, Leading Media Company
  • 8. UNCOVER YOUR DARK DATA Source: http://www.patrickcheesman.com/dark-data-problems-and-solutions/ • INACCESSIBLE data (e.g. technology is outdated) • FORGOTTEN data (e.g. collected, but not actively used) • UNCOLLECTED data (e.g. information exists, not digitized) • SINGLE PURPOSE data (e.g. used for a specific purpose)
  • 9. We’ve used network diagrams to detect terrorism, corporate fraud, product affinities and behavioural customer segmentation
  • 10. AUGMENT YOUR DATA SOURCES DATA IS EVERYWHERE COMMON COMPLAINT #1 WE DON’T HAVE DATA COMMON COMPLAINT #2 THE DATA ISN’T STRUCTURED CRM DATA SALES DATA PRICING DATA CALL RECORDS WEB LOG DATA VENDOR INVOICES SOCIAL MEDIA DATA CLICKTHROUGH DATA COMPETITOR RESEARCH CUSTOMER TRANSACTIONS … CENSUS DATA E-COMMERCE PRICES COMMODITY PRICES STOCK MARKET DATA FINANCIAL REPORTING SOCIAL MEDIA DATA MOBILE PENETRATION AADHAR DATA COURT CASE BRIEFS SHAPE FILES …
  • 11. How does Mahabharata, one of the largest epics with 1.8 million words lend itself to text analytics? Can this ‘unstructured data’ be processed to extract analytical insights? What does sentiment analysis of this tome convey? Is there a better way to explore relations between characters? How can closeness of characters be analysed & visualized? Visualising the Mahabharata
  • 12. “Can we help CFOs understand what questions are being asked by investors and analysts during earnings releases? How is this different from the competition?” – Product Head, Global Financial Services Firm
  • 13. WHAT DO FINANCIAL ANALYSTS ASK IBM VS MSFT?
  • 14. DATA IS EVERYWHERE EXTRACT THE META DATA AUGMENT YOUR DATA SOURCES COMMON COMPLAINT #2 THE DATA ISN’T STRUCTURED COMMON COMPLAINT #3 THE DATA ISN’T RICH / CLEAN COMMON WHO, WHAT, WHEN, WHERE TEXT TEXT KEYWORDS SENTIMENT IMAGE VISUAL RECOGNITION AUDIO / CALLS TRANSCRIPTS MOOD ANALYSIS
  • 15. “Can we get the results of every single election in history, and create a portal to visualize these results?” – Rajdeep Sardesai, CNN-IBN
  • 16. The PDF files have a reasonably clear structure
  • 17. … that translates into text that can be parsed
  • 18. Not every spelling error is easily identifiable by the first letter
  • 19. … with several names spelt wrong. These are, in fact, two different constituencies. But these are exactly the same ... and so are these. I’ve no idea if these are 2, or 3, constituencies!
  • 20. … with the ability for the system to correct errors automatically
  • 22. DATA IS EVERYWHERE TRANSFORM THE DATA & ENRICH IT EXTRACT THE META DATA AUGMENT YOUR DATA SOURCES COMMON COMPLAINT #3 THE DATA ISN’T RICH / CLEAN
  • 24. This is a dataset (1975 – 1990) that has been around for several years, and has been studied extensively. Yet, a visualization can reveal patterns that are neither obvious nor well known. For example, • Are birthdays uniformly distributed? • Do doctors or parents exercise the C-section option to move dates? • Is there any day of the month that has unusually high or low births? • Are there any months with relatively high or low births? More births Fewer births … on average, for each day of the year (from 1975 to 1990) LET’S LOOK AT 15 YEARS OF US BIRTH DATA
  • 25. THE PATTERN IN INDIA IS QUITE DIFFERENT This is a birth date dataset that’s obtained from school admission data for over 10 million children. When we compare this with births in the US, we see none of the same patterns. For example, • Is there an aversion to the 13th or is there a local cultural nuance? • Are holidays avoided for births? • Which months have a higher propensity for births, and why? • Are there any patterns not found in the US data? More births Fewer births … on average, for each day of the year (from 2007 to 2013)
  • 26. THIS ADVERSELY IMPACTS CHILDREN’S MARKS It’s well established that older children tend to do better at school in most activities. Since many children have had their birth dates brought forward, these younger children suffer. Children “born” on the 1st, 5th, 10th, 15th etc. of the month tend to score lower marks on average. • Are holidays avoided for births? • Which months have a higher propensity for births, and why? • Are there any patterns not found in the US data? Higher marks Lower marks … on average, for children born on a given day of the year (from 2007 to 2013)
  • 27. DEPLOY MODERN TOOLS ANALYSIS IS EVERYWHERE COMMON COMPLAINT #1 WE DON’T HAVE THE TOOLS COMMON COMPLAINT #2 WE DON’T GET INSIGHTS R SAS EXCEL PYTHON DATABASES ML SERVICES
  • 28. RESTAURANT FOUND AN UNUSUAL DIP IN SALES A restaurant chain had data for every single transaction made over a few years. Plotting this as a time series showed them nothing unusual. However, the same data on a calendar map reveals a very different story. Specifically, at the bottom-left point-of-sale terminal, sales dip every Wednesday. At the bottom-right point-of-sale terminal, sales rise every Wednesday (almost as if to compensate for the loss). It turns out that the manager closes the bottom-left counter every Wednesday afternoon due to a shortage of staff, assuming that it results in no loss of sales. There is, however, a net loss every Wednesday.
  • 29. DEPLOY MODERN TOOLS ANALYSIS IS EVERYWHERE TEST DATASETS ANONYMISATION EVALUATION CRITERIA IMPROVEMENT METRIC DATA INFRASTRUCTURE MODEL INFRASTRUCTURE VISUALS INFRASTRUCTURE SET UP AN ML PLATFORM INFRASTRUCTURE FOR RAPIDITY COMMON COMPLAINT #2 MODELS ARE COMPLICATED COMMON COMPLAINT #3 IMPLEMENTATIONS ARE SLOW
  • 30. Nation-wide statistics on behaviour and performance of students Over 1,000 questions each administered to several lakhs of students across the country
  • 31. Having books improves reading ability Having more books at home improves the performance of children when it comes to reading. (But children typically have only 1-10 books at home.) … but the impact in social is less While having more books improves the reading % score by 8%, it only increases the social % by 4%
  • 32. Tuitions help very little … but children of illiterate parents do worse
  • 33. Watching TV occasionally is good Children who watch TV every day don’t do as well as children who watch TV only once a week. But children who never watch TV fare the worst. Watching TV every day helps improve children’s reading ability a little bit more… … but mathematical abilities fall dramatically at that point
  • 34. Having educated parents helps most This table shows the % improvement in score due to each factor THIS TECHNIQUE CAN BE APPLIED TO ANY DATASET
  • 35. AUTOMATING ANALYSIS IN POULTRY FARMING We group by every input factor … and calculate the impact on every metric. By moving from average to the best group, what’s the improvement? The actual performance by each group is shown: 0-3m: 11; 3-6m: 12.3; 6m-1yr: 12.7; 1-2 yrs: 15.3; > 2 yrs: 16.1. Our product can create visualisations from data automatically, without any supervision. Above is an example. Irrespective of the dataset, this visual shows which input parameters have a significant impact on the output. Another such example is the cluster scatterplot. Only significant results shown
  • 36. 68% correlation between AUD & EUR Plot of 6 month daily AUD - EUR values Block of correlated currencies … clustered hierarchically
  • 39. DEPLOY MODERN TOOLS ANALYSIS IS EVERYWHERE CLUSTER PLOTS CORRELATIONS CROSS TABULATION GROUP MEANS KEYWORD EXTRACTION NETWORK ANALYSIS SANKEY DRILLDOWNS SENTIMENT ANALYSIS … INFRASTRUCTURE FOR RAPIDITY COMMON COMPLAINT #3 IMPLEMENTATIONS ARE SLOW BUILD AND USE TEMPLATES
  • 41. S ANAND, CHIEF DATA SCIENTIST, GRAMENER THE CAPABILITIES ARE IN YOUR REACH TODAY EXPLORE THE ART OF DATA
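
The group-means idea on slide 35 ("group by every input factor, calculate the impact on every metric, and ask what moving from the average to the best group would gain") can be sketched in a few lines. This is only an illustration under stated assumptions, not Gramener's product: the function names, the `experience`/`shed`/`weight` columns and the four sample rows are all hypothetical, a higher metric is assumed to be better, and no significance test is applied (the slide shows only significant results).

```python
from collections import defaultdict
from statistics import mean


def group_means(rows, factor, metric):
    """Return {group value: mean of the metric} for one input factor."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[factor]].append(row[metric])
    return {value: mean(values) for value, values in groups.items()}


def best_group_uplift(rows, factor, metric):
    """Return (best group, % gain from the overall average to that group)."""
    overall = mean(row[metric] for row in rows)
    means = group_means(rows, factor, metric)
    best = max(means, key=means.get)  # assumes higher is better
    return best, 100.0 * (means[best] - overall) / overall


def rank_factors(rows, factors, metric):
    """Rank input factors by the uplift their best group offers."""
    results = [(f, *best_group_uplift(rows, f, metric)) for f in factors]
    return sorted(results, key=lambda r: r[2], reverse=True)


# Hypothetical sample data, invented purely for illustration.
rows = [
    {"experience": "0-3m", "shed": "A", "weight": 10.0},
    {"experience": "0-3m", "shed": "B", "weight": 12.0},
    {"experience": "> 2 yrs", "shed": "A", "weight": 15.0},
    {"experience": "> 2 yrs", "shed": "B", "weight": 17.0},
]

for factor, best, uplift in rank_factors(rows, ["shed", "experience"], "weight"):
    print(f"{factor}: best group {best!r}, uplift {uplift:.1f}%")
```

Because the ranking loops over whatever factor names it is given, the same sketch applies unchanged to any tabular dataset, which is the point the slide makes about the technique.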

Editor's Notes

  1. https://flic.kr/p/aCqg7w
  2. For the same chain, we also looked at the daily sales across restaurants. Here are a series of calendar maps showing the daily sales for four different point-of-sale terminals at one restaurant. Each calendar map shows a calendar for 7 months. Each day is coloured based on the value of sales on that day. Red indicates low sales, green indicates high sales. For the two terminals at the front (i.e. the ones you see on top), sales were relatively low during the first two months, but picked up steadily thereafter. It’s easy to spot the exceptions among these. For example, the 30th and 31st of January were good days for both terminals. Interestingly, when you look at the terminal at the bottom left, there is a red bar indicating a consistent dip in sales every Wednesday. Almost as if to compensate, the terminal at the bottom right has an increase in sales every Wednesday – but not as significant as the dip. We did not have an explanation for this, though our client did a few weeks later. It turned out that the person manning the bottom-left counter takes a half-day off every Wednesday, and was not being replaced by the manager. The queue naturally shifts over to the other terminal, increasing the sales. But this restaurant is in an area where there are many other food outlets. Once the queue reaches a certain size, people drop off, resulting in a net loss in sales every Wednesday – a loss that had gone unobserved for at least 7 months.
  3. So, what we did was put a variant of this visual together. On the right, you have a series of currencies like the Australian dollar, the Euro, the British pound, etc; some commodities like silver and gold; and some stock indices like Sensex, FTSE, and S&P. The cells here have a number inside that indicates the pairwise correlation between a pair of securities. For example, the number 68 on the top left indicates a 68% correlation between the Australian dollar and the Euro. To the left of the Euro and just below the dollar (diagonally opposite to the 68), there’s a scatter plot that shows the daily prices of both these currencies. Each dot is one day’s data. The x-axis shows the Australian dollar value. The y-axis shows the Euro value. This helps identify what the pattern of movements of any two currencies is. From this, you can easily see visually that the Australian dollar and the Euro both tend to move together. Or, where there are strong correlations like the FTSE & S&P, the pattern is almost a straight line. In some cases there are negative correlations. For instance, if you take the Sensex against the Japanese Yen, the correlation is -79%. The cells are coloured based on their correlation values. Greens indicate strong positive correlation. Reds indicate strong negative correlation. These are also grouped hierarchically. On the left, we have a series of lines indicating clusters. The most similar securities are grouped together. So FTSE and S&P with a 98% correlation are very close. The ones that are less correlated are kept further away based on a tree-structure. This leads to clustering of securities. For example, there is a green block in the center which has SGD, JPY, XAU, CHF and CNY. All of these are fairly well correlated. When any one currency in this block goes up, all the others go up as well. When any one goes down, all others go down as well. 
Similarly, you have another block to its top left: S&P, FTSE, Sensex and to a certain extent, the Pakistani Rupee. These move together as a block as well. But when this block goes up, all the currencies in the other block go down, as indicated by the red negative correlations between these two blocks. This can be used very easily for decision making. For example, one client who was trading with Singapore and Japan looked at the strong correlation and decided to consolidate their holdings in Japanese Yen. They then moved up and down this column to find a good hedge. FTSE looked like a good hedge (it was the most negatively correlated with JPY at that time), and they decided to place a third of their portfolio in FTSE. A sheet like this improves people’s understanding of relatively complex data, and results in significantly increased trade volumes.
  4. We were working with a restaurant that had 7 months’ worth of sales data, and asked what we could do with this data. It was a fairly open-ended problem. Among other things, we looked at the various product categories they sold, such as starters, breads, desserts, etc. and the pairwise correlations between each of these. The number in each cell shows the pairwise correlation between any two products. The 17 on the top left, for example, indicates a 17% correlation between side dishes and meals. The scatter plots diagonally opposite show the correlations between these visually as well. These are colour coded based on the correlation. The redder it is, the more negative the correlation. The greener it is, the more positive the correlation. There are a few patterns that emerge. For example: desserts are positively correlated with every product. The row and column are green right through, indicating that it doesn’t matter what people eat – they usually have desserts at the end. Starters are an interesting category. They were introduced 4 years ago as a loss-leader, with the aim of increasing the restaurant’s menu variety and bringing in footfall. As a result, they were priced at cost. You can see from this that starters sell well with breads (rotis, naans, etc.). They sell well with desserts, but then, everything sells well with desserts. But they reduce the sales of every other product! What’s been happening is that since starters were so attractive, people were coming in, ordering starters and desserts, and leaving. As a result, this initiative had been a net loss for the profit margin, though it had not been spotted for nearly four years.
  5. When you look at the correlations at an individual item level, it turns out that there’s one product that is negatively correlated with almost every other product: the 1 litre mineral water bottle. This is a curious phenomenon, and our client explained this once they realised what was happening. Theirs is a low-end chain of restaurants and it’s mostly individuals (not families) that visit this restaurant. Their customers are rather price-conscious. When they buy 1 litre of water, they want to make sure that they do not waste it. And when an entire litre is consumed, there’s not much space in the stomach for other things. An obvious solution was to replace the 1 litre packaging with a smaller 200ml bottle. This ends up turning the entire row and column of reds into neutral yellows, resulting in an overall increase in sale of all products.
  6. https://flic.kr/p/aCqg7w
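
The pairwise correlation grid described in notes 3 to 5 can be approximated with a small sketch. It assumes each security (or product) is a list of daily values of equal length. The three series below are made-up numbers, not the data behind the slides, and `best_hedge` is a hypothetical helper that simply picks the most negatively correlated partner, as the client in note 3 did by scanning the JPY column. The hierarchical clustering and colouring steps are not reproduced here.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def correlation_matrix(series):
    """Return {name: {other name: correlation}} for every pair of series."""
    names = list(series)
    return {a: {b: pearson(series[a], series[b]) for b in names} for a in names}


def best_hedge(matrix, name):
    """Hypothetical helper: the series most negatively correlated with `name`."""
    others = {k: v for k, v in matrix[name].items() if k != name}
    return min(others, key=others.get)


# Made-up daily values, invented purely for illustration.
series = {
    "JPY": [1.0, 2.0, 3.0, 4.0, 5.0],
    "SGD": [2.0, 4.1, 6.0, 8.2, 10.0],
    "FTSE": [9.0, 7.0, 6.0, 3.0, 1.0],
}

matrix = correlation_matrix(series)
for name, row in matrix.items():
    # Print as whole percentages, matching the style of the numbers in the grid.
    print(name, {other: round(100 * r) for other, r in row.items()})
print("Most negatively correlated with JPY:", best_hedge(matrix, "JPY"))
```

The same matrix works for the restaurant examples in notes 4 and 5 if each series is the daily sales of one product category: a row that is negative against almost every other row is the pattern that flagged the 1 litre water bottle.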