SlideShare a Scribd company logo
1 of 27
Anand Ranganathan,
VP of Solutions
Aug 2017
DATA SCIENCE OUT OF THE BOX:
Case Studies In The
Telecommunications Industry
Telecommunications Service Providers have huge
amounts of data related to customer activity that come to
them in real-time
2
• Calling, SMS and data usage information
• Purchase and recharge data
• Plan information
• Browsing data (DPI)
• Location information from CDRs, probes or other sources
• Device Data logs
• Call Center logs
But, they face challenges in getting value from this data to
improve customer experience
3
Difficult to integrate data about
customers from multiple
sources into a single view
Difficult to integrate the insights
from the models with other tools
Difficult to build models Difficult to act upon the insights
Difficult to operationalize the
models
Difficult to gain business value
1
2
3
4
5
6
What telcos would
like to do …
4
5
Make every
interaction
with the brand….
6
We believe it’s the little
things..
… Targeted,
precise and
contextual
7
Predict which
customers will
need int’l
roaming in the
next day?
8
Provide personalized, real-time offers to
customers whose data pack is predicted to run out
in the next 12 hours??
9
Provide real-time
predictive issue
resolution to your
customers, before they
call the call center?
10
Harnessing Data in
Real-Time is key to
creating a great
customer
experience…
… Most
enterprises,
though, have
struggled to
deploy and get
value from analytics …
especially, real-time
analytics
Does it have to take years to deploy an advanced analytics
solution ?
Do you really need an army of Data Scientists to create new
models every year ?
Do you really have to stitch together 10 solutions for a ‘single
customer view’ that gets updated once a day?
Why is it still so difficult to create personalized and contextual
campaigns ?
Our Vision – Easy to use Real-time analytics in a box
12
Our initial target
domains are:
• Telecommunications
• Healthcare
• Banking
Allow rapid
deployment of
analytics and reduce
time to value
Through reusable
machine learning
pipelines that cover
common needs
in several industries.
Firstly, what is a machine learning pipeline?
13
Training
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Train
Model
Model
Test Data
Predictions
Parsing, Cleaning,
Transformations
Feature
Extraction
Score Model
Training Pipeline
Scoring
Pipeline
We have 40+ readily deployable ML Pipelines covering
common telco marketing requirements
14
Machine
Learning Real-
Time and
Offline
Predictive
Models
Wallet, Purchase & Journey Models
§ Predict subscriber’s next top-up amount
§ Predict when subscriber might top-up
§ Predict if subscriber will buy or renew package
§ Predict Package expiry
§ Predict if package will expire with high balance
§ Prepaid to Postpaid Conversion Propensity
§ Churn Propensity
§ Next Best Action Model
§ Customer Lifetime Value Prediction
Spatio-Temporal Models
§ Predict home location, work location, weekend travel locations
§ Predict where subscriber will be at given hour & day, e.g. on Fridays
at 7 PM
§ Determine frequently visited locations (malls, churches, office
buildings etc.)
§ Mobility Profiling, e.g. frequent traveler,
stay-at-home, regular commuter
§ Home / Work Location Based Segmentation e.g. Stay-at-home
housewife, Traveling Salesman etc.
Anomaly Detection
§ Detect anomalies in calling pattern within the network / Cell
Site / Location / Subscriber
§ Anomaly Detection in SMS/data usage at Network / Cell Site
/ Location or Subscriber level
§ Anomaly Detection in dropped calls / dropped data sessions
at Network / Cell Site / Location or Subscriber level
Device Models
§ Detect Call Drops & Poor Call Quality from device logs
§ Detect Poorly performing device battery
§ Detect Anomalous Apps based on GPS, wake-lock etc.
§ Determine interests based on App Usage
Communication
• Determine relative preference of SMS, Voice or Data
§ Predict best time of day, day of week or location to
reach subscriber with offers
§ Determine preferred channels of communication
Customer Experience
§ Customer Satisfaction Model, based on dropped calls,
failed data sessions, poor call quality and device issues
§ Predict if customer will call contact center
§ Predict why customer may call contact center
Clickstream and Interests
§ URL Categorization into rich topic hierarchy
§ Long term and short term Interest derivation based
on browsing data of communication
§ Interest prediction based on location & device type
Social Network
§ Determine influencers and social hubs
§ Discover close contacts
§ Identify common interest communities within the
subscriber base
… used to create dynamic profiles of customers, locations
and business or retail outlets
15
Historical:
Typical	home	/	work	locations?
Recharge	patterns
Calling	network
Real	Time:
Websites	visited	in	last	hour
Number	of	dropped	calls	in	past	day
Recharge	prediction	in	next	6	hours
Historical:
Typical	population	at	location
Spend	patterns	at	location
Typical	Mobility	profiles
Real	Time:
Anomalous	network	loads
Number	of	queries	for	weather	
Current	population
Historical:
Historical	Population	trends
Browsing	behaviors
Communication	patterns
Real	Time:
Number	of	customers	near	business	now.
Number	of	calls	to	business	in	last	1	hour
Number	of	visits	to	competing	business
Key principle behind data science out of the box
16
Build ML pipeline once & Operationalize repeatedly
Operationalizing The Pipelines
– The ENGINEERING
Building Pipelines
– The ART
• Repeated for every new deployment
• Create the transformations & features on
historical data
• Train initial version of the model &
generate initial scores
• Create the transformations on streaming
data and update features
• Update scores and models “frequently”
based on streaming data
• Done once on some static representative
datasets
• Explore different possible transformations
of the data
• Explore different kinds of features
• Explore different models
• Finalize on a certain pipeline for a given
problem
Machine Learning is not a one-off process taking place in
a static world
17
All model-building & scoring activities happen at a certain point in time
TIME
NOW
Historical	Data	that	has	
been	collected	so	far	
Streaming	Data	that	will	come	
in	the	future
Build	initial	versions	of	the	
model,	score	them	and	
create	initial	profiles	based	
on	this	data
Update	scores	in	the	
profiles	and	refresh	models	
based	on	this	data
Typical Enterprise Architecture
18
Separate processing pathways for real-time analytics and long-term historical
analytics
Telco Data Sources:
• CDRs
• DPI
• Location
• SMSC
• Billing
ETL
Real-time
Streaming
Data.
Historical Data
Problems with basic pipeline in streaming settings
19
Training
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Train
Model
Model
Test Data
Predictions
Parsing, Cleaning,
Transformations
Feature
Extraction
Score Model
• Doesn’t show feature creation &
updates on combination of historical &
streaming data
• Doesn't show scoring based on most
recent feature values
• Doesn’t show model refresh
Patterns for Machine Learning Pipelines
20
Update models and
predictions on every event.
E.g. time-series predictions
and anomaly detection for
fraud detection.
Refresh models periodically
and score on every event.
E.g. topup prediction with
models updated every
week.
Build model one-time or
infrequently and score on
every event. E.g. Real-time
churn prediction with static
model
Update models and
predictions periodically.
E.g. user interest models,
hangout predictions and
recommendation models.
Build model one-time or
infrequently and score on
every event. E.g. Real-time
churn prediction with static
model
Build models and
predictions one time or very
infrequently. E.g. offline
churn prediction scores.
Online Frequent/Periodic Batch
MODELBUILDING
Frequent/PeriodicOnline
SCORING
Typical Enterprise Architecture with Unscrambl Brain
21
Separate processing pathways for real-time analytics and long-term historical
analytics
Telco Data Sources:
• CDRs
• DPI
• Location
• SMSC
• Billing
ETL
Real-time
Streaming
Data
Historical Data
• Stream
Analytics
• Profile Store
• Aggregate
Store
Brain is powered by 3 specialized components
22
Leveldb based time-
series aggregate store
Recharges, Number of dropped calls,
Number of international calls,… in the
past 10 minutes, hour, day, week, month
or year
Redis-based
profile store
Last known location of
customers, predicted home
and work locations,,…
Python-based ML
pipeline framework
Call Center Call Prediction Model,
Preferred Channel Prediction
Model,
Social Network Models
Online Learning, Online Scoring
23
One-Time Initialization of features
from Historical Data
Online Model Building & Scoring on Streaming Data
Historical
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Model
Maintain
Features
Streaming
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Get Features
for one entity
Train & Score
Model
Write Predictions
Periodic Learning, Online Scoring
24
Historical
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Train
Model
Model
Maintain
Features
Get Features
for all entities
Streaming
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Get Features
for one entity
Score
Model
Write Predictions
One-Time Initialization of features
from Historical Data
Periodic Model Re-Training
Online Update of Features and Scoring on
Streaming Data
Periodic Learning & Periodic Scoring
25
One-Time Initialization of features
from Historical Data
Periodic Model Re-Training &
Re-Scoring of all Entities
Online Update of Features from Streaming
Data
Historical
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Train &
Score
Model
Model
Maintain
Features
Get Features
for all entities
Streaming
Data
Parsing, Cleaning,
Transformations
Feature
Extraction
Write Predictions
Case Study : Telco in SE Asia
26
60+ million subscribers
7+ million optin subscribers
10+ billion CDRs per day
100+ billion URL records per day
15 Machine Learning pipelines rapidly deployed on Spark and Brain to derive a
variety of profile attributes about subscribers
Able to update models and profiles as frequently as needed
Data Science Out of The Box : Case Studies in the Telecommunication by Anand Ranganathan

More Related Content

What's hot

BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data Discovery to...
BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data  Discovery to...BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data  Discovery to...
BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data Discovery to...BigID Inc
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks DeltaDatabricks
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Data Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureData Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureLorenzo Nicora
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceDATAVERSITY
 
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and moreBig Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and moreAmazon Web Services
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
 
Data Architecture for Solutions.pdf
Data Architecture for Solutions.pdfData Architecture for Solutions.pdf
Data Architecture for Solutions.pdfAlan McSweeney
 
Understanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityDevOps.com
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
The path to a Modern Data Architecture in Financial Services
The path to a Modern Data Architecture in Financial ServicesThe path to a Modern Data Architecture in Financial Services
The path to a Modern Data Architecture in Financial ServicesHortonworks
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture DesignKujambu Murugesan
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureDATAVERSITY
 
Mongo DB: Operational Big Data Database
Mongo DB: Operational Big Data DatabaseMongo DB: Operational Big Data Database
Mongo DB: Operational Big Data DatabaseXpand IT
 
A Reference Architecture for ETL 2.0
A Reference Architecture for ETL 2.0 A Reference Architecture for ETL 2.0
A Reference Architecture for ETL 2.0 DataWorks Summit
 
Data platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptxData platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptxCalvinSim10
 

What's hot (20)

BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data Discovery to...
BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data  Discovery to...BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data  Discovery to...
BigID & Collibra Joint Deck: Using BigID’s Privacy-centric Data Discovery to...
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Data Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureData Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and Future
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and moreBig Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Big Data
Big DataBig Data
Big Data
 
Data Architecture for Solutions.pdf
Data Architecture for Solutions.pdfData Architecture for Solutions.pdf
Data Architecture for Solutions.pdf
 
Understanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application Quality
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Hadoop and HBase @eBay
Hadoop and HBase @eBayHadoop and HBase @eBay
Hadoop and HBase @eBay
 
The path to a Modern Data Architecture in Financial Services
The path to a Modern Data Architecture in Financial ServicesThe path to a Modern Data Architecture in Financial Services
The path to a Modern Data Architecture in Financial Services
 
Apache Kylin
Apache KylinApache Kylin
Apache Kylin
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture Design
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
 
Mongo DB: Operational Big Data Database
Mongo DB: Operational Big Data DatabaseMongo DB: Operational Big Data Database
Mongo DB: Operational Big Data Database
 
A Reference Architecture for ETL 2.0
A Reference Architecture for ETL 2.0 A Reference Architecture for ETL 2.0
A Reference Architecture for ETL 2.0
 
Data platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptxData platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptx
 

Similar to Data Science Out of The Box : Case Studies in the Telecommunication by Anand Ranganathan

Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...
Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...
Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...Avanxo
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformArvind Sathi
 
Webinar: Analytics with NoSQL: Why, for What, and When?
Webinar: Analytics with NoSQL: Why, for What, and When?Webinar: Analytics with NoSQL: Why, for What, and When?
Webinar: Analytics with NoSQL: Why, for What, and When?MongoDB
 
Webinar: Making A Single View of the Customer Real with MongoDB
Webinar: Making A Single View of the Customer Real with MongoDBWebinar: Making A Single View of the Customer Real with MongoDB
Webinar: Making A Single View of the Customer Real with MongoDBMongoDB
 
Solving churn challenge in Big Data environment - Jelena Pekez
Solving churn challenge in Big Data environment  - Jelena PekezSolving churn challenge in Big Data environment  - Jelena Pekez
Solving churn challenge in Big Data environment - Jelena PekezInstitute of Contemporary Sciences
 
Use of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economyUse of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economyAmit Parija
 
Time-to-Event Models, presented by DataSong and Revolution Analytics
Time-to-Event Models, presented by DataSong and Revolution AnalyticsTime-to-Event Models, presented by DataSong and Revolution Analytics
Time-to-Event Models, presented by DataSong and Revolution AnalyticsRevolution Analytics
 
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...Precisely
 
Platforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringPlatforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringDATAVERSITY
 
Big data sharing at fintech academy oct19 (1)
Big data sharing at fintech academy oct19 (1)Big data sharing at fintech academy oct19 (1)
Big data sharing at fintech academy oct19 (1)sgfta2020
 
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...Grid Dynamics
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Turi, Inc.
 
Solutions Using WSO2 Analytics
Solutions Using WSO2 AnalyticsSolutions Using WSO2 Analytics
Solutions Using WSO2 AnalyticsWSO2
 
Big Data and the Next Best Offer
Big Data and the Next Best OfferBig Data and the Next Best Offer
Big Data and the Next Best OfferMichel Bruley
 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnCloudera, Inc.
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer AnalyticsCourse5i
 
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward
 
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...Neo4j
 

Similar to Data Science Out of The Box : Case Studies in the Telecommunication by Anand Ranganathan (20)

Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...
Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...
Cloud 2020: taking your customers into the future - Peter Schwartz Avanxo Clo...
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
 
Webinar: Analytics with NoSQL: Why, for What, and When?
Webinar: Analytics with NoSQL: Why, for What, and When?Webinar: Analytics with NoSQL: Why, for What, and When?
Webinar: Analytics with NoSQL: Why, for What, and When?
 
Webinar: Making A Single View of the Customer Real with MongoDB
Webinar: Making A Single View of the Customer Real with MongoDBWebinar: Making A Single View of the Customer Real with MongoDB
Webinar: Making A Single View of the Customer Real with MongoDB
 
Solving churn challenge in Big Data environment - Jelena Pekez
Solving churn challenge in Big Data environment  - Jelena PekezSolving churn challenge in Big Data environment  - Jelena Pekez
Solving churn challenge in Big Data environment - Jelena Pekez
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Use of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economyUse of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economy
 
Time-to-Event Models, presented by DataSong and Revolution Analytics
Time-to-Event Models, presented by DataSong and Revolution AnalyticsTime-to-Event Models, presented by DataSong and Revolution Analytics
Time-to-Event Models, presented by DataSong and Revolution Analytics
 
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...
 
Platforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringPlatforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern Engineering
 
Big data sharing at fintech academy oct19 (1)
Big data sharing at fintech academy oct19 (1)Big data sharing at fintech academy oct19 (1)
Big data sharing at fintech academy oct19 (1)
 
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 ...
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
 
Solutions Using WSO2 Analytics
Solutions Using WSO2 AnalyticsSolutions Using WSO2 Analytics
Solutions Using WSO2 Analytics
 
Big Data and the Next Best Offer
Big Data and the Next Best OfferBig Data and the Next Best Offer
Big Data and the Next Best Offer
 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in Churn
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
 
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
 

More from Data Con LA

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA
 

More from Data Con LA (20)

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup Showcase
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendations
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI Ethics
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learning
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWS
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
 

Recently uploaded

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Data Science Out of The Box : Case Studies in the Telecommunication by Anand Ranganathan

  • 1. Anand Ranganathan, VP of Solutions Aug 2017 DATA SCIENCE OUT OF THE BOX: Case Studies In The Telecommunications Industry
  • 2. Telecommunications Service Providers have huge amounts of data related to customer activity that come to them in real-time 2 • Calling, SMS and data usage information • Purchase and recharge data • Plan information • Browsing data (DPI) • Location information from CDRs, probes or other sources • Device Data logs • Call Center logs
  • 3. But, they face challenges in getting value from this data to improve customer experience 3 Difficult to integrate data about customers from multiple sources into a single view Difficult to integrate the insights from the models with other tools Difficult to build models Difficult to act upon the insights Difficult to operationalize the models Difficult to gain business value 1 2 3 4 5 6
  • 6. 6 We believe it’s the little things.. … Targeted, precise and contextual
  • 7. 7 Predict which customers will need int’l roaming in the next day?
  • 8. 8 Provide personalized, real-time offers to customers whose data pack is predicted to run out in the next 12 hours??
  • 9. 9 Provide real-time predictive issue resolution to your customers, before they call the call center?
  • 10. 10 Harnessing Data in Real-Time is key to creating a great customer experience… … Most enterprises, though, have struggled to deploy and get value from analytics … especially, real-time analytics
  • 11. Does it have to take years to deploy an advanced analytics solution ? Do you really need an army of Data Scientists to create new models every year ? Do you really have to stitch together 10 solutions for a ‘single customer view’ that gets updated once a day? Why is it still so difficult to create personalized and contextual campaigns ?
  • 12. Our Vision – Easy to use Real-time analytics in a box 12 Our initial target domains are: • Telecommunications • Healthcare • Banking Allow rapid deployment of analytics and reduce time to value Through reusable machine learning pipelines that cover common needs in several industries.
  • 13. Firstly, what is a machine learning pipeline? 13 Training Data Parsing, Cleaning, Transformations Feature Extraction Train Model Model Test Data Predictions Parsing, Cleaning, Transformations Feature Extraction Score Model Training Pipeline Scoring Pipeline
  • 14. We have 40+ readily deployable ML Pipelines covering common telco marketing requirements 14 Machine Learning Real- Time and Offline Predictive Models Wallet, Purchase & Journey Models § Predict subscriber’s next top-up amount § Predict when subscriber might top-up § Predict if subscriber will buy or renew package § Predict Package expiry § Predict if package will expire with high balance § Prepaid to Postpaid Conversion Propensity § Churn Propensity § Next Best Action Model § Customer Lifetime Value Prediction Spatio-Temporal Models § Predict home location, work location, weekend travel locations § Predict where subscriber will be at given hour & day, e.g. on Fridays at 7 PM § Determine frequently visited locations (malls, churches, office buildings etc.) § Mobility Profiling, e.g. frequent traveler, stay-at-home, regular commuter § Home / Work Location Based Segmentation e.g. Stay-at-home housewife, Traveling Salesman etc. Anomaly Detection § Detect anomalies in calling pattern within the network / Cell Site / Location / Subscriber § Anomaly Detection in SMS/data usage at Network / Cell Site / Location or Subscriber level § Anomaly Detection in dropped calls / dropped data sessions at Network / Cell Site / Location or Subscriber level Device Models § Detect Call Drops & Poor Call Quality from device logs § Detect Poorly performing device battery § Detect Anomalous Apps based on GPS, wake-lock etc. § Determine interests based on App Usage Communication • Determine relative preference of SMS, Voice or Data § Predict best time of day, day of week or location to reach subscriber with offers § Determine preferred channels of communication Customer Experience § Customer Satisfaction Model, based on dropped calls, failed data sessions, poor call quality and device issues § Predict if customer will call contact center § Predict why customer may call contact center Clickstream and Interests § URL Categorization into rich topic hierarchy § Long term and short term Interest derivation based on browsing data of communication § Interest prediction based on location & device type Social Network § Determine influencers and social hubs § Discover close contacts § Identify common interest communities within the subscriber base
  • 15. … used to create dynamic profiles of customers, locations and business or retail outlets 15 Historical: Typical home / work locations? Recharge patterns Calling network Real Time: Websites visited in last hour Number of dropped calls in past day Recharge prediction in next 6 hours Historical: Typical population at location Spend patterns at location Typical Mobility profiles Real Time: Anomalous network loads Number of queries for weather Current population Historical: Historical Population trends Browsing behaviors Communication patterns Real Time: Number of customers near business now. Number of calls to business in last 1 hour Number of visits to competing business
  • 16. Key principle behind data science out of the box 16 Build ML pipeline once & Operationalize repeatedly Operationalizing The Pipelines – The ENGINEERING Building Pipelines – The ART • Repeated for every new deployment • Create the transformations & features on historical data • Train initial version of the model & generate initial scores • Create the transformations on streaming data and update features • Update scores and models “frequently” based on streaming data • Done once on some static representative datasets • Explore different possible transformations of the data • Explore different kinds of features • Explore different models • Finalize on a certain pipeline for a given problem
  • 17. Machine Learning is not a one-off process taking place in a static world 17 All model-building & scoring activities happen at a certain point in time TIME NOW Historical Data that has been collected so far Streaming Data that will come in the future Build initial versions of the model, score them and create initial profiles based on this data Update scores in the profiles and refresh models based on this data
  • 18. Typical Enterprise Architecture 18 Separate processing pathways for real-time analytics and long-term historical analytics Telco Data Sources: • CDRs • DPI • Location • SMSC • Billing ETL Real-time Streaming Data. Historical Data
  • 19. Problems with basic pipeline in streaming settings 19 Training Data Parsing, Cleaning, Transformations Feature Extraction Train Model Model Test Data Predictions Parsing, Cleaning, Transformations Feature Extraction Score Model • Doesn’t show feature creation & updates on combination of historical & streaming data • Doesn't show scoring based on most recent feature values • Doesn’t show model refresh
  • 20. Patterns for Machine Learning Pipelines 20 Update models and predictions on every event. E.g. time-series predictions and anomaly detection for fraud detection. Refresh models periodically and score on every event. E.g. topup prediction with models updated every week. Build model one-time or infrequently and score on every event. E.g. Real-time churn prediction with static model Update models and predictions periodically. E.g. user interest models, hangout predictions and recommendation models. Build model one-time or infrequently and score on every event. E.g. Real-time churn prediction with static model Build models and predictions one time or very infrequently. E.g. offline churn prediction scores. Online Frequent/Periodic Batch MODELBUILDING Frequent/PeriodicOnline SCORING
  • 21. Typical Enterprise Architecture with Unscrambl Brain 21 Separate processing pathways for real-time analytics and long-term historical analytics Telco Data Sources: • CDRs • DPI • Location • SMSC • Billing ETL Real-time Streaming Data Historical Data • Stream Analytics • Profile Store • Aggregate Store
  • 22. Brain is powered by 3 specialized components 22 Leveldb based time- series aggregate store Recharges, Number of dropped calls, Number of international calls,… in the past 10 minutes, hour, day, week, month or year Redis-based profile store Last known location of customers, predicted home and work locations,,… Python-based ML pipeline framework Call Center Call Prediction Model, Preferred Channel Prediction Model, Social Network Models
  • 23. Online Learning, Online Scoring 23 One-Time Initialization of features from Historical Data Online Model Building & Scoring on Streaming Data Historical Data Parsing, Cleaning, Transformations Feature Extraction Model Maintain Features Streaming Data Parsing, Cleaning, Transformations Feature Extraction Get Features for one entity Train & Score Model Write Predictions
  • 24. Periodic Learning, Online Scoring 24 Historical Data Parsing, Cleaning, Transformations Feature Extraction Train Model Model Maintain Features Get Features for all entities Streaming Data Parsing, Cleaning, Transformations Feature Extraction Get Features for one entity Score Model Write Predictions One-Time Initialization of features from Historical Data Periodic Model Re-Training Online Update of Features and Scoring on Streaming Data
  • 25. Periodic Learning & Periodic Scoring 25 One-Time Initialization of features from Historical Data Periodic Model Re-Training & Re-Scoring of all Entities Online Update of Features from Streaming Data Historical Data Parsing, Cleaning, Transformations Feature Extraction Train & Score Model Model Maintain Features Get Features for all entities Streaming Data Parsing, Cleaning, Transformations Feature Extraction Write Predictions
  • 26. Case Study : Telco in SE Asia 26 60+ million subscribers 7+ million optin subscribers 10+ billion CDRs per day 100+ billion URL records per day 15 Machine Learning pipelines rapidly deployed on Spark and Brain to derive a variety of profile attributes about subscribers Able to update models and profiles as frequently as needed