SlideShare a Scribd company logo
1 of 31
Download to read offline
7th International Conference on Pattern Recognition and Machine
Intelligence
Dec 5-8th, ISI Kolkata
Incremental Learning of Non-Stationary Temporal
Causal Networks for Telecommunication Domain
Ram Mohan
R&D Department, Flytxt, Trivandrum, India
Santanu Chaudhury
CEERI, Pilani, India
Brejesh Lall
Department of Electrical Engineering, Indian Institute of Technology, Delhi, India
2Β© 2015 Flytxt. All rights reserved.
Table Of Contents
Why Causal Modelling?
β€’ Simpson’s Paradox
β€’ Decision Making
Problem Statement Definition
β€’ Volume of Data
β€’ Non-Stationary Distributions
β€’ Challenge Lies only in Structure Learning
Solution
β€’ Data and Model
β€’ Algorithm
β€’ Results
3Β© 2015 Flytxt. All rights reserved.
Why Causal Modelling?
Simpson’s Paradox
4Β© 2015 Flytxt. All rights reserved.
Why Causal Modelling?
Simpson’s Paradox
Cancer
Male/Female Smoker
5Β© 2015 Flytxt. All rights reserved.
Why Causal Modelling?
Simpson’s Paradox – Continuous Values
6Β© 2015 Flytxt. All rights reserved.
Why Causal Modelling?
Churn
Recharge
MRP
Outgoing
Call Minutes
7Β© 2015 Flytxt. All rights reserved.
Causal Inference
Judea Pearl in her work defines structural causal modelling.
DATA
Causal Inference
Algorithms(Structure
Learning Algorithms)
8Β© 2015 Flytxt. All rights reserved.
Causal Model
A causal diagram is a graphical tool that enables the visualization of causal relationships
between variables in a causal model. A typical causal diagram will comprise a set of
variables (or nodes) defined as being within the scope of the model being represented.
Any variable in the diagram should be connected by an arrow to another variable with
which it has a causal influence - the arrowhead delineates the direction of this causal
relationship, e.g., an arrow connecting variables A and B with the arrowhead at B
indicates a relationship whereby (all other factors being equal) a qualitative or
quantitative change in A may cause change in B.
Causal/Bayesian Network β€˜G’ is a Directed Acyclic Graph, where Probability Distribution
P factorizes according to P(X)=∏_(𝑖=1)^𝑛▒〖𝑃(π‘₯_π‘–β”‚π‘ƒπ‘Ž(π‘₯_𝑖 ) ),γ€— P is specified as a set of
CPDs associated with G and the I(G) βŠ† I(P)
9Β© 2015 Flytxt. All rights reserved.
Problem Statement Definition
Causal modelling on Large Volume Non-Stationary Data in a streaming setup.
β€’ Big Data
β€’ Volume
β€’ Velocity
β€’ Variety
β€’ Under lying Distribution
β€’ Non-Stationary
10Β© 2015 Flytxt. All rights reserved.
Non-Stationary Distributions
A time-series dataset can be classified as generated from Stationary Process or Non-
Stationary Process
In statistics, a stationary process is a stochastic process whose joint probability
distribution does not change when shifted in time. Consequently, parameters such as
mean and variance, if they are present, also do not change over time.
Non-stationary process joint probability distribution undergoes change when shifted in
time. Consequently parameters such as mean and variance, if they are present, do
change over time.
Why model Non-Stationary?
β€’ There is a shift from detection-diagnosis-mitigation of unfavorable events(anomalies) to prediction-
prognosis-prevention is seen across various engineering domains such as health care,
telecommunication, financial institutions.
β€’ To be able to forecast we have to model Non-Stationary time series data as a machine learning
model.
11Β© 2015 Flytxt. All rights reserved.
Concept Drift
In predictive analytics and machine learning, the concept drift is a phenomenon a
phenomenon associated with time instance when the underlying statistical properties
of target variable actually undergoes a change.
12Β© 2015 Flytxt. All rights reserved.
Learning in a Non Stationary domain
Adaptive learning refers to updating predictive models online during their operation to
react to concept drifts
Have the following tasks:
β€’ Detect concept drift
β€’ distinguish drifts from noise and be adaptive to changes, but robust to noise
β€’ operate in less than arrival time interval and use not more than a fixed amount of memory for any
storage
13Β© 2015 Flytxt. All rights reserved.
Solution :: Data
Data of approximately ~90,000 subscribers for a period of 8 months from August 2016
to March 2017.
The resultant dataset has a total of ~.7 million records and 32 features. One record
includes subscriber info from 2 successive time instance(As we are learning a Temporal
model under First Order Markovian Assumption).
14Β© 2015 Flytxt. All rights reserved.
Solution : Data – Feature Info
15Β© 2015 Flytxt. All rights reserved.
Solution: Ci-Sufficient Statistics
Huynh et al. in their "Streaming Clustering with Bayesian Nonparametric Models " have
described learning Bayesian mixture models from streaming data.
The proposed Streaming algorithm identifies Gaussian cluster in the data incrementally
without having to revisit older batches of data.
For every cluster if we maintain the CI-Sufficient Statistics ( 𝑋 𝑏
(𝑖)
𝑋 𝑏
(𝑗)
, 𝑋 𝑏
(𝑖)
, 𝑋 𝑏
(𝑗)
,
𝑋 𝑏
(𝑖)2
, 𝑋 𝑏
(𝑗)2
), we will be able to construct the Temporal Causal network.
16Β© 2015 Flytxt. All rights reserved.
Solution :: Data
X - Axis TotalOutGoingCallMinutes
Y – Axis TotalOutGoingCallRevenue
17Β© 2015 Flytxt. All rights reserved.
Solution :: Data
Gaussian Clusters in the data
β€’ X - Axis TotalOutGoingCallMinutes
β€’ Y – Axis TotalOutGoingCallRevenue
18Β© 2015 Flytxt. All rights reserved.
Solution : Non-Parametric Temporal Causal Network
Wu and Ghosal in their work β€œKullback Leibler property of kernel mixture priors in
Bayesian density estimation" show that the mixture of multivariate Gaussians can
approximate any density on 𝑅 𝑑
, provided that the number of components can get
arbitrarily large”.
Ickstadt et al. in their work "Nonparametric Bayesian Networks" use it as it overcomes
the two limitations of Gaussian Bayesian network models*:
β€’ We no longer make normality assumption for the underlying data, but assume a mixture of
multivariate normal distributions for the density.
β€’ We no longer assume that the variables X are in linear relationships.
Friedman et al. have shown that the learning of Dynamic Bayesian network ( which
under Markovian assumption is also a Temporal Causal Network) can be decomposed in
to Prior network and Transition Network
19Β© 2015 Flytxt. All rights reserved.
Solution : PC-Algorithm
20Β© 2015 Flytxt. All rights reserved.
Solution : Conditional Independence Tests
πœŒπ‘–,𝑗|π‘˜ =
𝜌 𝑖,𝑗|π‘˜hβˆ’πœŒ 𝑖,β„Ž|π‘˜h βˆ™ 𝜌 𝑗,β„Ž|π‘˜h
(1βˆ’πœŒπ‘–,β„Ž|π‘˜h
2 )βˆ™(1βˆ’πœŒ 𝑗,β„Ž|π‘˜h
2 )
𝑍 𝑖, 𝑗 π‘˜ =
1
2
π‘™π‘œπ‘”
1+𝜌 𝑖,𝑗|π‘˜
1βˆ’πœŒ 𝑖,𝑗|π‘˜
πœŒπ‘–,𝑗 =
𝑛 π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑖)
𝑋 𝑏
(𝑗)
βˆ’ π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑖)
π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑗)
𝑛 π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑖)2
βˆ’( π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑖)
)2 𝑛 π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑗)2
βˆ’( π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏
(𝑗)
)2
21Β© 2015 Flytxt. All rights reserved.
Solution : Temporal Causal Network
22Β© 2015 Flytxt. All rights reserved.
Concept Drift Rules
First type of concept drift : for new batch of data determine the KL divergence for
subscriber record to cluster distribution, if the KL divergence is beyond the threshold
and the independency test from the existing structure are different then re-learning of
type 1 is required.
Second type of concept drift : For the new batch of data likelihood of the records
associated with the clusters are determined. For any cluster if likelihood has dropped
below a threshold re-learning of type 2 is required. We use these rules in algorithm 1.
The information regarding type of re-learning associated with each type is explained in
the following section.
23Β© 2015 Flytxt. All rights reserved.
Algorithm
24Β© 2015 Flytxt. All rights reserved.
Solution Results : Comparision of Non-parameteric TCN vs
Others
Formulation Temporal Causal Network
β€’ Combining DPGMM and Linear Gaussian Causal Network
β€’ Distributed Learning Algorithm
Synthetic Data – From NIPS TiMINo Paper:
β€’ Confounder with time lag:𝑍𝑑 = π‘Ž βˆ™ π‘π‘‘βˆ’1 + 𝑁𝑍,𝑑, 𝑋𝑑 = 0.6 βˆ™ π‘‹π‘‘βˆ’1 + 0.5 βˆ™ π‘π‘‘βˆ’1 + 𝑁𝑋,𝑑, π‘Œπ‘‘ = 0.6 βˆ™ π‘Œπ‘‘βˆ’1 +
0.5 βˆ™ π‘π‘‘βˆ’2 + π‘π‘Œ,𝑑
β€’ Linear Gaussian with instantaneous effects.: 𝑋𝑑 = π‘Ž βˆ™ π‘‹π‘‘βˆ’1 + 𝑁𝑋,𝑑, π‘Šπ‘‘ = 0.6 βˆ™ π‘Šπ‘‘βˆ’1 + 0.5 βˆ™ 𝑋𝑑 +
𝑁 π‘Š,𝑑, π‘Œπ‘‘ = 0.6 βˆ™ π‘Œπ‘‘βˆ’1 + 0.5 βˆ™ π‘Šπ‘‘βˆ’1 + π‘π‘Œ,𝑑, 𝑍𝑑 = 0.6 βˆ™ π‘π‘‘βˆ’1 + 0.5 βˆ™ π‘Šπ‘‘ + 0.2 βˆ™ π‘Œπ‘‘βˆ’1 + 𝑁𝑍,𝑑
β€’ Non Linear without instantaneous effects: 𝑋𝑑 = 0.8 βˆ™ π‘‹π‘‘βˆ’1 + 0.3 βˆ™ 𝑁𝑋,𝑑, π‘Œπ‘‘ = 0.4 βˆ™ π‘Œπ‘‘βˆ’1 + π‘‹π‘‘βˆ’1 βˆ’
25Β© 2015 Flytxt. All rights reserved.
Solution : Results Synthetic Dataset
Linear Gaussian with instantaneous effects
Non Linear without instantaneous effects
26Β© 2015 Flytxt. All rights reserved.
Solution : Comparision of Non-parameteric TCN vs Others
27Β© 2015 Flytxt. All rights reserved.
Solution : Results : Telecommunication Dataset
28Β© 2015 Flytxt. All rights reserved.
Solution : Telecommunication Dataset
29Β© 2015 Flytxt. All rights reserved.
Conclusion
By formulating a non-parametric Temporal Causal Modelling as a combination of
DPGMM based Gaussian clusters, followed by Gaussian Bayesian Network we are able
to scale to large volume of data.
Then we propose to learn incremental learning algorithm to continuously learn the
temporal causal network.
The incremental learning algorithm defines rules to determine the concept drift and
then trigger relearning based on type of concept drift.
The defined algorithm identifies the concept drift that occurred in Indian
Telecommunication industry, without visiting the old data.
30Β© 2015 Flytxt. All rights reserved.
References
Causal inference in statistics: An overview
β€’ Judea Pearl, 2009 Statistics Survey
Incremental Learning of Nonparametric Bayesian Mixture Models:
β€’ Ryan Gomess, Max Welling, Pietro Perona :: Computer Vision and Pattern Recognition, 2008.
Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm
β€’ Markus Kalisch, Peter Buhlmann, 2007 JMLR
Order-Independent Constraint-Based Causal Structure Learning
β€’ Diego Colombo, Marloes H. Maathuis, 2014 JMLR
Streaming Clustering with Bayesian Nonparametric Models
β€’ Viet Huynha, Dinh Phunga, 2016 ELSEVIER
Thank You
www.flytxt.com | info@flytxt.com
Β© 2015 Flytxt. All rights reserved.

More Related Content

What's hot

Design of Efficient High Speed Vedic Multiplier
Design of Efficient High Speed Vedic MultiplierDesign of Efficient High Speed Vedic Multiplier
Design of Efficient High Speed Vedic Multiplierijsrd.com
Β 
010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian Process010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian ProcessHa Phuong
Β 
Dueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement LearningDueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement LearningYoonho Lee
Β 
Eeee2017 Conference - OR in the digital era - ICT challenges | Presentation
Eeee2017 Conference - OR in the digital era - ICT challenges | PresentationEeee2017 Conference - OR in the digital era - ICT challenges | Presentation
Eeee2017 Conference - OR in the digital era - ICT challenges | PresentationChristos Papalitsas
Β 
Approaches to online quantile estimation
Approaches to online quantile estimationApproaches to online quantile estimation
Approaches to online quantile estimationData Con LA
Β 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2IAEME Publication
Β 
The Predictron: End-to-end Learning and Planning
The Predictron: End-to-end Learning and PlanningThe Predictron: End-to-end Learning and Planning
The Predictron: End-to-end Learning and PlanningYoonho Lee
Β 
A hybrid approach to clustering in big data
A hybrid approach to clustering in big dataA hybrid approach to clustering in big data
A hybrid approach to clustering in big dataNexgen Technology
Β 
A graph theoretic approach to scheduling in cognitive radio networks
A graph theoretic approach to scheduling in cognitive radio networksA graph theoretic approach to scheduling in cognitive radio networks
A graph theoretic approach to scheduling in cognitive radio networksNexgen Technology
Β 
Deep Learning for Time Series Data
Deep Learning for Time Series DataDeep Learning for Time Series Data
Deep Learning for Time Series DataArun Kejariwal
Β 
Efficient projections
Efficient projectionsEfficient projections
Efficient projectionsTomasz Waszczyk
Β 
A Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means AlgorithmA Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means AlgorithmIRJET Journal
Β 
Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...
Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...
Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...AM Publications,India
Β 
Time series deep learning
Time series   deep learningTime series   deep learning
Time series deep learningAlberto Arrigoni
Β 
Kernel methods for data integration in systems biology
Kernel methods for data integration in systems biologyKernel methods for data integration in systems biology
Kernel methods for data integration in systems biologytuxette
Β 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentationSammy Alhashemi
Β 
How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...
How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...
How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...Paris Women in Machine Learning and Data Science
Β 

What's hot (17)

Design of Efficient High Speed Vedic Multiplier
Design of Efficient High Speed Vedic MultiplierDesign of Efficient High Speed Vedic Multiplier
Design of Efficient High Speed Vedic Multiplier
Β 
010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian Process010_20160216_Variational Gaussian Process
010_20160216_Variational Gaussian Process
Β 
Dueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement LearningDueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement Learning
Β 
Eeee2017 Conference - OR in the digital era - ICT challenges | Presentation
Eeee2017 Conference - OR in the digital era - ICT challenges | PresentationEeee2017 Conference - OR in the digital era - ICT challenges | Presentation
Eeee2017 Conference - OR in the digital era - ICT challenges | Presentation
Β 
Approaches to online quantile estimation
Approaches to online quantile estimationApproaches to online quantile estimation
Approaches to online quantile estimation
Β 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2
Β 
The Predictron: End-to-end Learning and Planning
The Predictron: End-to-end Learning and PlanningThe Predictron: End-to-end Learning and Planning
The Predictron: End-to-end Learning and Planning
Β 
A hybrid approach to clustering in big data
A hybrid approach to clustering in big dataA hybrid approach to clustering in big data
A hybrid approach to clustering in big data
Β 
A graph theoretic approach to scheduling in cognitive radio networks
A graph theoretic approach to scheduling in cognitive radio networksA graph theoretic approach to scheduling in cognitive radio networks
A graph theoretic approach to scheduling in cognitive radio networks
Β 
Deep Learning for Time Series Data
Deep Learning for Time Series DataDeep Learning for Time Series Data
Deep Learning for Time Series Data
Β 
Efficient projections
Efficient projectionsEfficient projections
Efficient projections
Β 
A Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means AlgorithmA Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means Algorithm
Β 
Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...
Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...
Evolution of regenerative Ca-ion wave-packet in Neuronal-firing Fields: Quant...
Β 
Time series deep learning
Time series   deep learningTime series   deep learning
Time series deep learning
Β 
Kernel methods for data integration in systems biology
Kernel methods for data integration in systems biologyKernel methods for data integration in systems biology
Kernel methods for data integration in systems biology
Β 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
Β 
How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...
How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...
How to Accelerate Molecular Simulations with Data? by Ε½ofia TrsΕ₯anovΓ‘, Machin...
Β 

Similar to Ikdd co ds2017presentation_v2

accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithmsFarhan Zaki
Β 
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataA fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataAlexander Decker
Β 
Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...
Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...
Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...ArchiLab 7
Β 
Application Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts PredictionApplication Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts PredictionCSCJournals
Β 
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
M3AT: Monitoring Agents Assignment Model for the Data-Intensive ApplicationsM3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
M3AT: Monitoring Agents Assignment Model for the Data-Intensive ApplicationsVladislavKashansky
Β 
Traffic Prediction from Street Network images.pptx
Traffic Prediction from  Street Network images.pptxTraffic Prediction from  Street Network images.pptx
Traffic Prediction from Street Network images.pptxchirantanGupta1
Β 
Short Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkShort Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkIJERA Editor
Β 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
Β 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET Journal
Β 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET Journal
Β 
DATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITODATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITOMarcoMellia
Β 
A Survey on Privacy-Preserving Data Aggregation Without Secure Channel
A Survey on Privacy-Preserving Data Aggregation Without Secure ChannelA Survey on Privacy-Preserving Data Aggregation Without Secure Channel
A Survey on Privacy-Preserving Data Aggregation Without Secure ChannelIRJET Journal
Β 
Journal paper 1
Journal paper 1Journal paper 1
Journal paper 1Editor IJCATR
Β 
Support Vector Machine for Wind Speed Prediction
Support Vector Machine for Wind Speed PredictionSupport Vector Machine for Wind Speed Prediction
Support Vector Machine for Wind Speed PredictionIJRST Journal
Β 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
Β 
IRJET- AC Duct Monitoring and Cleaning Vehicle for Train Coaches
IRJET- AC Duct Monitoring and Cleaning Vehicle for Train CoachesIRJET- AC Duct Monitoring and Cleaning Vehicle for Train Coaches
IRJET- AC Duct Monitoring and Cleaning Vehicle for Train CoachesIRJET Journal
Β 
IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...
IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...
IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...IRJET Journal
Β 
DA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solutionDA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solutiongitikasingh2004
Β 
Clustering big spatiotemporal interval data
Clustering big spatiotemporal interval dataClustering big spatiotemporal interval data
Clustering big spatiotemporal interval dataNexgen Technology
Β 

Similar to Ikdd co ds2017presentation_v2 (20)

accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithms
Β 
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataA fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
Β 
Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...
Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...
Ying hua, c. (2010): adopting co-evolution and constraint-satisfaction concep...
Β 
Application Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts PredictionApplication Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts Prediction
Β 
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
M3AT: Monitoring Agents Assignment Model for the Data-Intensive ApplicationsM3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
Β 
Traffic Prediction from Street Network images.pptx
Traffic Prediction from  Street Network images.pptxTraffic Prediction from  Street Network images.pptx
Traffic Prediction from Street Network images.pptx
Β 
Short Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkShort Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural Network
Β 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
Β 
rerngvit_phd_seminar
rerngvit_phd_seminarrerngvit_phd_seminar
rerngvit_phd_seminar
Β 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
Β 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
Β 
DATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITODATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITO
Β 
A Survey on Privacy-Preserving Data Aggregation Without Secure Channel
A Survey on Privacy-Preserving Data Aggregation Without Secure ChannelA Survey on Privacy-Preserving Data Aggregation Without Secure Channel
A Survey on Privacy-Preserving Data Aggregation Without Secure Channel
Β 
Journal paper 1
Journal paper 1Journal paper 1
Journal paper 1
Β 
Support Vector Machine for Wind Speed Prediction
Support Vector Machine for Wind Speed PredictionSupport Vector Machine for Wind Speed Prediction
Support Vector Machine for Wind Speed Prediction
Β 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
Β 
IRJET- AC Duct Monitoring and Cleaning Vehicle for Train Coaches
IRJET- AC Duct Monitoring and Cleaning Vehicle for Train CoachesIRJET- AC Duct Monitoring and Cleaning Vehicle for Train Coaches
IRJET- AC Duct Monitoring and Cleaning Vehicle for Train Coaches
Β 
IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...
IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...
IRJET- A Data Stream Mining Technique Dynamically Updating a Model with Dynam...
Β 
DA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solutionDA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solution
Β 
Clustering big spatiotemporal interval data
Clustering big spatiotemporal interval dataClustering big spatiotemporal interval data
Clustering big spatiotemporal interval data
Β 

Recently uploaded

Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
Β 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
Β 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
Β 
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...amitlee9823
Β 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
Β 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
Β 
Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...amitlee9823
Β 
Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...amitlee9823
Β 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
Β 
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Callshivangimorya083
Β 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
Β 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
Β 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
Β 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
Β 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
Β 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
Β 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
Β 

Recently uploaded (20)

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Β 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
Β 
CHEAP Call Girls in Saket (-DELHI )πŸ” 9953056974πŸ”(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )πŸ” 9953056974πŸ”(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )πŸ” 9953056974πŸ”(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )πŸ” 9953056974πŸ”(=)/CALL GIRLS SERVICE
Β 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
Β 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
Β 
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Β 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
Β 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Β 
Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore ...
Β 
Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call πŸ‘— 7737669865 πŸ‘— Top Class Call Girl Service B...
Β 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
Β 
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Β 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Β 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
Β 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
Β 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
Β 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Β 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
Β 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Β 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
Β 

Ikdd co ds2017presentation_v2

  • 1. 7th International Conference on Pattern Recognition and Machine Intelligence Dec 5-8th, ISI Kolkata Incremental Learning of Non-Stationary Temporal Causal Networks for Telecommunication Domain Ram Mohan R&D Department, Flytxt, Trivandrum, India Santanu Chaudhury CEERI, Pilani, India Brejesh Lall Department of Electrical Engineering, Indian Institute of Technology, Delhi, India
  • 2. 2Β© 2015 Flytxt. All rights reserved. Table Of Contents Why Causal Modelling? β€’ Simpson’s Paradox β€’ Decision Making Problem Statement Definition β€’ Volume of Data β€’ Non-Stationary Distributions β€’ Challenge Lies only in Structure Learning Solution β€’ Data and Model β€’ Algorithm β€’ Results
  • 3. 3Β© 2015 Flytxt. All rights reserved. Why Causal Modelling? Simpson’s Paradox
  • 4. 4Β© 2015 Flytxt. All rights reserved. Why Causal Modelling? Simpson’s Paradox Cancer Male/Female Smoker
  • 5. 5Β© 2015 Flytxt. All rights reserved. Why Causal Modelling? Simpson’s Paradox – Continuous Values
  • 6. 6Β© 2015 Flytxt. All rights reserved. Why Causal Modelling? Churn Recharge MRP Outgoing Call Minutes
  • 7. 7Β© 2015 Flytxt. All rights reserved. Causal Inference Judea Pearl in her work defines structural causal modelling. DATA Causal Inference Algorithms(Structure Learning Algorithms)
  • 8. 8Β© 2015 Flytxt. All rights reserved. Causal Model A causal diagram is a graphical tool that enables the visualization of causal relationships between variables in a causal model. A typical causal diagram will comprise a set of variables (or nodes) defined as being within the scope of the model being represented. Any variable in the diagram should be connected by an arrow to another variable with which it has a causal influence - the arrowhead delineates the direction of this causal relationship, e.g., an arrow connecting variables A and B with the arrowhead at B indicates a relationship whereby (all other factors being equal) a qualitative or quantitative change in A may cause change in B. Causal/Bayesian Network β€˜G’ is a Directed Acyclic Graph, where Probability Distribution P factorizes according to P(X)=∏_(𝑖=1)^𝑛▒〖𝑃(π‘₯_π‘–β”‚π‘ƒπ‘Ž(π‘₯_𝑖 ) ),γ€— P is specified as a set of CPDs associated with G and the I(G) βŠ† I(P)
  • 9. 9Β© 2015 Flytxt. All rights reserved. Problem Statement Definition Causal modelling on Large Volume Non-Stationary Data in a streaming setup. β€’ Big Data β€’ Volume β€’ Velocity β€’ Variety β€’ Under lying Distribution β€’ Non-Stationary
  • 10. 10Β© 2015 Flytxt. All rights reserved. Non-Stationary Distributions A time-series dataset can be classified as generated from Stationary Process or Non- Stationary Process In statistics, a stationary process is a stochastic process whose joint probability distribution does not change when shifted in time. Consequently, parameters such as mean and variance, if they are present, also do not change over time. Non-stationary process joint probability distribution undergoes change when shifted in time. Consequently parameters such as mean and variance, if they are present, do change over time. Why model Non-Stationary? β€’ There is a shift from detection-diagnosis-mitigation of unfavorable events(anomalies) to prediction- prognosis-prevention is seen across various engineering domains such as health care, telecommunication, financial institutions. β€’ To be able to forecast we have to model Non-Stationary time series data as a machine learning model.
  • 11. 11Β© 2015 Flytxt. All rights reserved. Concept Drift In predictive analytics and machine learning, the concept drift is a phenomenon a phenomenon associated with time instance when the underlying statistical properties of target variable actually undergoes a change.
  • 12. 12Β© 2015 Flytxt. All rights reserved. Learning in a Non Stationary domain Adaptive learning refers to updating predictive models online during their operation to react to concept drifts Have the following tasks: β€’ Detect concept drift β€’ distinguish drifts from noise and be adaptive to changes, but robust to noise β€’ operate in less than arrival time interval and use not more than a fixed amount of memory for any storage
  • 13. 13Β© 2015 Flytxt. All rights reserved. Solution :: Data Data of approximately ~90,000 subscribers for a period of 8 months from August 2016 to March 2017. The resultant dataset has a total of ~.7 million records and 32 features. One record includes subscriber info from 2 successive time instance(As we are learning a Temporal model under First Order Markovian Assumption).
  • 14. 14Β© 2015 Flytxt. All rights reserved. Solution : Data – Feature Info
  • 15. 15Β© 2015 Flytxt. All rights reserved. Solution: Ci-Sufficient Statistics Huynh et al. in their "Streaming Clustering with Bayesian Nonparametric Models " have described learning Bayesian mixture models from streaming data. The proposed Streaming algorithm identifies Gaussian cluster in the data incrementally without having to revisit older batches of data. For every cluster if we maintain the CI-Sufficient Statistics ( 𝑋 𝑏 (𝑖) 𝑋 𝑏 (𝑗) , 𝑋 𝑏 (𝑖) , 𝑋 𝑏 (𝑗) , 𝑋 𝑏 (𝑖)2 , 𝑋 𝑏 (𝑗)2 ), we will be able to construct the Temporal Causal network.
  • 16. 16Β© 2015 Flytxt. All rights reserved. Solution :: Data X - Axis TotalOutGoingCallMinutes Y – Axis TotalOutGoingCallRevenue
  • 17. 17Β© 2015 Flytxt. All rights reserved. Solution :: Data Gaussian Clusters in the data β€’ X - Axis TotalOutGoingCallMinutes β€’ Y – Axis TotalOutGoingCallRevenue
  • 18. 18Β© 2015 Flytxt. All rights reserved. Solution : Non-Parametric Temporal Causal Network Wu and Ghosal in their work β€œKullback Leibler property of kernel mixture priors in Bayesian density estimation" show that the mixture of multivariate Gaussians can approximate any density on 𝑅 𝑑 , provided that the number of components can get arbitrarily large”. Ickstadt et al. in their work "Nonparametric Bayesian Networks" use it as it overcomes the two limitations of Gaussian Bayesian network models*: β€’ We no longer make normality assumption for the underlying data, but assume a mixture of multivariate normal distributions for the density. β€’ We no longer assume that the variables X are in linear relationships. Friedman et al. have shown that the learning of Dynamic Bayesian network ( which under Markovian assumption is also a Temporal Causal Network) can be decomposed in to Prior network and Transition Network
  • 19. 19Β© 2015 Flytxt. All rights reserved. Solution : PC-Algorithm
  • 20. 20Β© 2015 Flytxt. All rights reserved. Solution : Conditional Independence Tests πœŒπ‘–,𝑗|π‘˜ = 𝜌 𝑖,𝑗|π‘˜hβˆ’πœŒ 𝑖,β„Ž|π‘˜h βˆ™ 𝜌 𝑗,β„Ž|π‘˜h (1βˆ’πœŒπ‘–,β„Ž|π‘˜h 2 )βˆ™(1βˆ’πœŒ 𝑗,β„Ž|π‘˜h 2 ) 𝑍 𝑖, 𝑗 π‘˜ = 1 2 π‘™π‘œπ‘” 1+𝜌 𝑖,𝑗|π‘˜ 1βˆ’πœŒ 𝑖,𝑗|π‘˜ πœŒπ‘–,𝑗 = 𝑛 π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑖) 𝑋 𝑏 (𝑗) βˆ’ π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑖) π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑗) 𝑛 π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑖)2 βˆ’( π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑖) )2 𝑛 π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑗)2 βˆ’( π‘βˆˆπ‘π‘Žπ‘‘π‘β„Žπ‘’π‘  𝑋 𝑏 (𝑗) )2
  • 21. 21Β© 2015 Flytxt. All rights reserved. Solution : Temporal Causal Network
  • 22. 22Β© 2015 Flytxt. All rights reserved. Concept Drift Rules First type of concept drift : for new batch of data determine the KL divergence for subscriber record to cluster distribution, if the KL divergence is beyond the threshold and the independency test from the existing structure are different then re-learning of type 1 is required. Second type of concept drift : For the new batch of data likelihood of the records associated with the clusters are determined. For any cluster if likelihood has dropped below a threshold re-learning of type 2 is required. We use these rules in algorithm 1. The information regarding type of re-learning associated with each type is explained in the following section.
  • 23. 23Β© 2015 Flytxt. All rights reserved. Algorithm
  • 24. 24Β© 2015 Flytxt. All rights reserved. Solution Results : Comparision of Non-parameteric TCN vs Others Formulation Temporal Causal Network β€’ Combining DPGMM and Linear Gaussian Causal Network β€’ Distributed Learning Algorithm Synthetic Data – From NIPS TiMINo Paper: β€’ Confounder with time lag:𝑍𝑑 = π‘Ž βˆ™ π‘π‘‘βˆ’1 + 𝑁𝑍,𝑑, 𝑋𝑑 = 0.6 βˆ™ π‘‹π‘‘βˆ’1 + 0.5 βˆ™ π‘π‘‘βˆ’1 + 𝑁𝑋,𝑑, π‘Œπ‘‘ = 0.6 βˆ™ π‘Œπ‘‘βˆ’1 + 0.5 βˆ™ π‘π‘‘βˆ’2 + π‘π‘Œ,𝑑 β€’ Linear Gaussian with instantaneous effects.: 𝑋𝑑 = π‘Ž βˆ™ π‘‹π‘‘βˆ’1 + 𝑁𝑋,𝑑, π‘Šπ‘‘ = 0.6 βˆ™ π‘Šπ‘‘βˆ’1 + 0.5 βˆ™ 𝑋𝑑 + 𝑁 π‘Š,𝑑, π‘Œπ‘‘ = 0.6 βˆ™ π‘Œπ‘‘βˆ’1 + 0.5 βˆ™ π‘Šπ‘‘βˆ’1 + π‘π‘Œ,𝑑, 𝑍𝑑 = 0.6 βˆ™ π‘π‘‘βˆ’1 + 0.5 βˆ™ π‘Šπ‘‘ + 0.2 βˆ™ π‘Œπ‘‘βˆ’1 + 𝑁𝑍,𝑑 β€’ Non Linear without instantaneous effects: 𝑋𝑑 = 0.8 βˆ™ π‘‹π‘‘βˆ’1 + 0.3 βˆ™ 𝑁𝑋,𝑑, π‘Œπ‘‘ = 0.4 βˆ™ π‘Œπ‘‘βˆ’1 + π‘‹π‘‘βˆ’1 βˆ’
  • 25. 25Β© 2015 Flytxt. All rights reserved. Solution : Results Synthetic Dataset Linear Gaussian with instantaneous effects Non Linear without instantaneous effects
  • 26. 26Β© 2015 Flytxt. All rights reserved. Solution : Comparision of Non-parameteric TCN vs Others
  • 27. 27Β© 2015 Flytxt. All rights reserved. Solution : Results : Telecommunication Dataset
  • 28. 28Β© 2015 Flytxt. All rights reserved. Solution : Telecommunication Dataset
  • 29. 29Β© 2015 Flytxt. All rights reserved. Conclusion By formulating a non-parametric Temporal Causal Modelling as a combination of DPGMM based Gaussian clusters, followed by Gaussian Bayesian Network we are able to scale to large volume of data. Then we propose to learn incremental learning algorithm to continuously learn the temporal causal network. The incremental learning algorithm defines rules to determine the concept drift and then trigger relearning based on type of concept drift. The defined algorithm identifies the concept drift that occurred in Indian Telecommunication industry, without visiting the old data.
  • 30. 30Β© 2015 Flytxt. All rights reserved. References Causal inference in statistics: An overview β€’ Judea Pearl, 2009 Statistics Survey Incremental Learning of Nonparametric Bayesian Mixture Models: β€’ Ryan Gomess, Max Welling, Pietro Perona :: Computer Vision and Pattern Recognition, 2008. Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm β€’ Markus Kalisch, Peter Buhlmann, 2007 JMLR Order-Independent Constraint-Based Causal Structure Learning β€’ Diego Colombo, Marloes H. Maathuis, 2014 JMLR Streaming Clustering with Bayesian Nonparametric Models β€’ Viet Huynha, Dinh Phunga, 2016 ELSEVIER
  • 31. Thank You www.flytxt.com | info@flytxt.com Β© 2015 Flytxt. All rights reserved.