SlideShare a Scribd company logo
Causality Detection in Time Series 
Tushar Mehndiratta 
IDD CSE ( V year) 
10211026
Overview 
 Introduction 
 Detecting Causality 
 Control Experimentation 
 Granger Causality 
 Building Causal Relationship Graphs 
 Exhaustive Granger Method 
 Lasso Granger Method 
 Forward backward Granger Method 
 Application of Causal Modelling 
 Brain Imaging 
 Topic Mining 
 Anomaly Detection 
 Conclusion
Why? 
Why did the apple fall 
down instead of going 
up? 
Why does average 
temperature rise? 
Why did the stock 
market fall? 
Why did a post go viral 
on facebook? 
CAUSE EFFECT RELATIONSHIPS
Cause - Effect Relationships 
◦ Causality is defined as the relation between two events: cause and effect where 
the effect occurs as a consequence of the cause. 
◦ Effect is “What happened?” and Cause is “Why it happened?” 
◦ e.g. In case of global warming, the increase in Greenhouse gases is the cause and 
increase in average temperature is the effect.[1]
Characteristics of Causal Relationships 
◦ Temporal Precedence: It states that the cause occurs prior to the effect. e.g. A person 
must smoke first and then he gets lung cancer. 
◦ Co-occurrence : Whenever cause happens, effect must also happen. Cause cannot be 
isolated from the effect. e.g. Whenever there is a net force on a body, it will accelerate. 
Is Causality same as Association then?
Correlation Vs Causation 
◦ Correlation does not imply Causation 
◦ Correlation only means that two events co-exist more often than ordinary chance.[2]
Physics 
Econometrics 
Types of Data: web metrics , stock prices, sales 
(all time series) 
Medicine 
Types of Data: experiments result, gene 
sequences(sequential data), brain signals(time 
series) 
Climate Science 
Types of Data: weather conditions (spatio-temporal 
or temporal data) 
Fields of Study 
HOW TO DETECT CAUSALITY?
Detecting Causality 
To test if X causes Y
Control Experimentation 
Aim: To find out what happens to a system when you interfere with it. 
Divide subjects 
randomly into two 
groups: Test and 
Control 
Introduce X only in 
the test group and 
observe Y in both. 
If X causes Y : 
ܲ(ܻ=ݕ|݀݋(ܺ)) > 
ܲ(ܻ=ݕ|!݀݋(ܺ)) 
implies Causality
Disadvantage of Control Experimentation 
◦ Not possible to always carry out the experiment. 
◦ Most time series data cannot be manipulated. e.g. Climate, Stock data 
◦ Have to resort to statistical methods to determine causality. 
HOW TO DO IT IN TIME SERIES?
Time Series 
◦ A time series is a sequence of data points, measured typically at successive 
points in time spaced at uniform time intervals.
Granger Causality 
◦ Also known as Predictive Causality. 
◦ Granger said that Causality could be reflected by measuring the ability of 
predicting the future values of a time series using past values of another time 
series. 
◦ Two main principles: 
 Cause must occur before the Effect. 
 The Cause can be used to predict the of Effect i.e. Cause has some unique information 
about the future values of the effect.
Granger Causality 
Suppose X and Y are two time series and for X to cause Y : 
푃[푌(푡 + 1)|훤 푡 ≠ 푃[푌(푡 + 1)| 훤−X 푡 
훤 푡 and 훤−X 푡 denote the “information in the universe up to time t” and “information in 
alternate universe up to time t in which X is excluded”.
Performing the Granger Causality test 
◦ Model 1: Build model 1 by regressing on the past values of both X and Y 
푚 훼푗푌푡−푗 + 푖=1 
퐸(푌|푌푡−푘 , 푋푡−푘 ) 푌푡 = 푗=1 
푛 훽푖 푋푡−푖 + 퐷푡 + 휀푡 
◦ Model 2: Build model 2 by regressing on the past values of Y only 
푚 훼푗푌푡−푗 + 퐷푡 + 휀푡 
퐸(푌|푌푡−푘 ) 푌푡 = 푗=1 
◦ Check whether the prediction accuracy has significantly increased by performing 
F-test.[11]
Granger Causality 
• CONS 
 It does not take into account the effect of hidden common 
causes(confounders) 
 It assumes that all the relationships are linear in nature and does not account 
for non-linear dependencies. 
HOW TO DEAL WITH MULTIPLE TIME SERIES?
Relationship Graphs in Time Series 
Extending the concept of Granger Causality to Mult iple Time Series
Relationship Graphs 
◦ Relationship graph has all time series as nodes and an edge between any two 
nodes denotes the direction of relationship between the two. 
◦ Input: 
 Matrix X of time series 
 Xlag which is the lagged versions of time series matrix X. 
◦ Output 
◦ Relationship graph between the time series with nodes xi’s each edge from xi to xj if xi 
causes xj. 
xi xj
Exhaustive Graphical Granger method 
◦ Algorithm: 
◦ For every pair of nodes(xi,xj) perform the following 
 Insert an edge xi → xj if Granger (xi,xj, Xlag) = ‘yes’ and Granger (xj,xi, Xlag) = ‘no’ 
 Insert an edge xi ← xj if Granger (xi,xj, Xlag)= ‘no’ and Granger (xj,xi, Xlag) = ‘yes’ 
 Insert an edge xi↔xj, if Granger (xi,xj, Xlag) = ‘yes’ and Granger (xj,xi, Xlag)= ‘yes’
Exhaustive Graphical Granger method 
◦ Complexity 
 A total of N time series with T lags each and P time stamps/sample size, makes the 
complexity as O(N2P2T2). 
◦ Shortcomings 
 Not considering the effect of other time series. 
 Computationally expensive.
The LASSO-Granger Method 
◦ Uses variable selection in Causality Detection 
◦ Aim is to identify the subset of time series on which xi is conditionally dependent 
and on what lag is it dependent. 
◦ Achieved by applying variable selection on the set of time series and the lags 
◦ Variable selection is done by LASSO. 
LASSO-Least Absolute Shrinkage and Selection Operator
LASSO 
◦ A selection method for linear regression 
◦ Selects a subset of variables subject to the following condition 
푤 = 푚푖푛 
1 
n 
(푤. 푥 − 푦)2+휆 푤 
Here w is the vector of coefficients, y is the variable to be predicted. 
◦ Aim is to minimize the OLS error and the sum of coefficients to prevent over 
fitting. 
◦ LARS(Least Angle Regression): best method to achieve LASSO.
LARS(Least Angle Regression Shrinkage) 
Step 1: Start with û0=0 
Step 2: The residual ŷ2-û0 has a 
greater correlation with x1 than 
with x2
LARS(Least Angle Regression Shrinkage) 
Step 3: Move in the direction of x1
LARS(Least Angle Regression Shrinkage) 
Step 4: First LARS estimate : 
û1 = û0 + ƛx1 
where the residual ŷ2-û1 
has equal correlation with 
both x1 and x2
LARS(Least Angle Regression Shrinkage) 
Step 5: Move in the direction of 
Angular bisector of x1 and x2
The Lasso-Granger Method 
◦ Algorithm 
 Obtain Xlag(the lagged version of the time series matrix X). 
 For each xi in X, 
 y= xi 
 Performs LASSO (y,Xlag) 
Wi : the set of time series for which the coefficients returned by are non-zero. 
 Add edge (xj, xi) to the graph if xj is in Wj
The Lasso-Granger Method 
◦ Complexity 
 Using LARS to solve the lasso problem: O(PN2T2). 
◦ Pros. 
 Computationally less expensive. 
 Can be used when number of series are quite large as compared to the number of data 
points. 
 Consistency: The probability of Lasso falsely including a non-neighboring feature in its 
neighborhood is very small even when the number of features are very large.
Forward Backward Granger Causality 
◦ Improvement on LASSO-Granger Algorithm 
◦ Inspired from Physics 
◦ Principle: Reverse time and all the relationships must remain same except for 
change in direction, i.e. if xi causes xj with a time lag of k then on reversing time xj 
will cause xi with time lag k. 
◦ Apply LASSO-Granger on both the forward and backward time series and 
combine the results of the two.
Application of Causal Modelling 
BRAIN IMAGING TOPIC MINING ANOMALY DETECTION
Brain Imaging 
◦ How different portions of the brain affect one another. 
 Identify the direction and order of influence 
◦ Apply Granger Causality to obtain the relationship between different components of 
the brain. 
 Obtain fMRI data from the brain corresponding to a stimulus and divided it into 
independent components corresponding to different sections of the brain. 
 Each independent component corresponds to a time series. 
 Apply Exhaustive Granger test to obtain the relationship between different time 
series.
Brain Imaging 
◦ Advantages: 
 No prior assumption about the nodes and their inter-connections. 
 Measures not only the connections but also the time lags between interactions. 
 Can work with a large number of regions.
Mining topics based on Causality 
◦ Identification of topics that are causally related with the non textual data 
iteratively. 
◦ InCaToMi (Integrative Causal Topic Miner) 
◦ Architecture: 
Topic modelling 
module 
Causality Module 
Feedback 
Text Data
InCaToMi: Integrative Causal Topic Miner 
◦ Topic Modelling Module: 
 Takes text and number of topics as input. 
 Creates topics based on word probabilities and the likelihood of each topic in the 
document using PLSA algorithm. 
 Time series of the topic formed by summation of likelihood of each word in the topic for 
a day.
InCaToMi: Integrative Causal Topic Miner 
◦ Causality Module: 
 Perform the Granger Causality test for the time series for each topic and for each word in 
the topic. 
 Form new candidate topic by selecting the words which are most causally related with the 
non textual series. 
 Use this as prior for the next round of Topic Modelling.
Anomaly Detection 
◦ Types of Anomalies: 
 Univariate Anomalies 
 Dependency Anomalies 
◦ Given two sets of data sequences A(training) and B(test) each containing p time 
series we have to find data points in B which significantly deviate from the 
normal pattern of data sequence. 
◦ Algorithm for finding dependency anomalies.
Anomaly Detection 
Learning temporal causal graphs 
by regularization 
Finding the Anomaly 
Score using Kullback- 
Leibler (KL) 
Divergence 
Determining 
Anomalies by 
specifying a threshold 
and finding the 
underlying causes 
Hypothesis: Causal Graphs of both remain the same
Anomaly Detection 
Learning temporal causal graphs 
by regularization 
Finding the Anomaly 
Score using Kullback- 
Leibler (KL) 
Divergence 
Determining 
Anomalies by 
specifying a threshold 
and finding the 
underlying causes 
Calculate the graph for A by LASSO Granger method. 
When finding the causal graph for B we need to apply additional constraints. This can be done using two 
methods: 
a) Neighborhood Similarity: This implies imposing an additional constraint that the values of β(a) should 
be zero or non-zero only when the value of β(b) are zero or non-zero. Here β(a) and β(b) are the coefficients 
obtained by running Lasso Granger on set A and Set B respectively. 
b) Coefficient similarity: The constraint is that the coefficients β(a) and β(b) should be similar.
Anomaly Detection 
Learning temporal causal graphs 
by regularization 
Finding the Anomaly 
Score using Kullback- 
Leibler (KL) 
Divergence 
Determining 
Anomalies by 
specifying a threshold 
and finding the 
underlying causes 
KL divergence is a measure of how much one distribution differs from another. 
Obtain the distributions for the two time series and the anomaly score is calculated using the KL 
formulae.
Anomaly Detection 
Learning temporal causal graphs 
by regularization 
Finding the Anomaly 
Score using Kullback- 
Leibler (KL) 
Divergence 
Determining 
Anomalies by 
specifying a threshold 
and finding the 
underlying causes 
• To set a threshold we calculate how a normal time series would score on the anomaly score. 
• We slide the window through the reference data and calculate the anomaly scores for each window. 
• We them use these to approximate the distribution of anomaly scores that a normal time series 
should have. 
• Given a significance level α, we set the α quantile of the distribution as threshold cutoff.
Conclusion 
◦ Widespread application of causal relationships motivates the study. 
◦ Completely data driven approach. So provides a new outlook in every field 
without making any assumptions. 
◦ Further Scope: 
 Applying the model to different domains. e.g Climate and Social media 
 Predicting anomalous behavior.
References 
[1] Lashof, Daniel A., and Dilip R. Ahuja. "Relative contributions of greenhouse gas emissions to global warming." (1990): 529-531. 
[2] Perry, Ronen. "Correlation versus Causality: Further Thoughts on the Law Review/Law School Liaison." Conn. L. Rev. 39 (2006): 77. 
[3] Diks, Cees, and Valentyn Panchenko. Modified hiemstra-jones test for Granger non-causality. No. 192. Society for Computational Economics, 2004. 
[4] Granger, Clive WJ. "Investigating causal relations by econometric models and cross-spectral methods." Econometrica: Journal of the Econometric Society (1969): 
424-438. 
[5] Arnold, Andrew, Yan Liu, and Naoki Abe. "Temporal causal modeling with graphical granger methods." Proceedings of the 13th ACM SIGKDD international 
conference on Knowledge discovery and data mining. ACM, 2007 
[6] Tibshirani, Robert. "Regression shrinkage and selection via the lasso." Journal of the Royal Statistical Society. Series B (Methodological) (1996): 267-288. 
[7] Cheng, Dehua, Mohammad Taha Bahadori, and Yan Liu. "FBLG: a simple and effective approach for temporal dependence discovery from time series 
data."Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2014. 
[8] Smith, Delmas, Iwabuchi, Kirk. “Demonstrating causal links between fMRI time series using time-lagged correlation”. 
[9] Kim, Hyun Duk, et al. "Incatomi: Integrative causal topic miner between textual and non-textual time series data." Proceedings of the 21st ACM international 
conference on Information and knowledge management. ACM, 2012. 
[10] Qiu, Liu, Subrahmanya, et al. "Granger Causality for Time-Series Anomaly Detection." Proceedings of the 12th IEEE international conference on data mining, 
2012. 
[11] Lomax, Richard G. (2007) Statistical Concepts: A Second Course, p. 10
Thanks!! 
Any Questions?

More Related Content

What's hot

Bivariate
BivariateBivariate
Bivariate
Vikas Saini
 
isi
isiisi
isi
ahfdv
 
gamdependence_revision1
gamdependence_revision1gamdependence_revision1
gamdependence_revision1
Thibault Vatter
 
Presentation on GMM
Presentation on GMMPresentation on GMM
Presentation on GMM
Moses sichei
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And Regression
Sumit Prajapati
 
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
Southern Range, Berhampur, Odisha
 
A Regularization Approach to the Reconciliation of Constrained Data Sets
A Regularization Approach to the Reconciliation of Constrained Data SetsA Regularization Approach to the Reconciliation of Constrained Data Sets
A Regularization Approach to the Reconciliation of Constrained Data Sets
Alkis Vazacopoulos
 
MAST30013 Techniques in Operations Research
MAST30013 Techniques in Operations ResearchMAST30013 Techniques in Operations Research
MAST30013 Techniques in Operations Research
Lachlan Russell
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
Geethu Rangan
 
Introduction to correlation and regression analysis
Introduction to correlation and regression analysisIntroduction to correlation and regression analysis
Introduction to correlation and regression analysis
Farzad Javidanrad
 
Chi sqr
Chi sqrChi sqr
Chapter13
Chapter13Chapter13
Chapter13
rwmiller
 
Regression and corelation (Biostatistics)
Regression and corelation (Biostatistics)Regression and corelation (Biostatistics)
Regression and corelation (Biostatistics)
Muhammadasif909
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
James Neill
 
Correlation in Statistics
Correlation in StatisticsCorrelation in Statistics
Correlation in Statistics
Avjinder (Avi) Kaler
 
Thesis_NickyGrant_2013
Thesis_NickyGrant_2013Thesis_NickyGrant_2013
Thesis_NickyGrant_2013
Nicky Grant
 
Chi Squared Test
Chi Squared TestChi Squared Test
Chi Squared Test
Darren Barton
 
Statistical technique exercise 23 and 24 correlational study
Statistical technique exercise 23 and 24 correlational studyStatistical technique exercise 23 and 24 correlational study
Statistical technique exercise 23 and 24 correlational study
Zelessay contents Ltd, Zelessaywritings.com
 
Multiple linear regression II
Multiple linear regression IIMultiple linear regression II
Multiple linear regression II
James Neill
 
Dong Zhang's project
Dong Zhang's projectDong Zhang's project
Dong Zhang's project
Dong Zhang
 

What's hot (20)

Bivariate
BivariateBivariate
Bivariate
 
isi
isiisi
isi
 
gamdependence_revision1
gamdependence_revision1gamdependence_revision1
gamdependence_revision1
 
Presentation on GMM
Presentation on GMMPresentation on GMM
Presentation on GMM
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And Regression
 
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
 
A Regularization Approach to the Reconciliation of Constrained Data Sets
A Regularization Approach to the Reconciliation of Constrained Data SetsA Regularization Approach to the Reconciliation of Constrained Data Sets
A Regularization Approach to the Reconciliation of Constrained Data Sets
 
MAST30013 Techniques in Operations Research
MAST30013 Techniques in Operations ResearchMAST30013 Techniques in Operations Research
MAST30013 Techniques in Operations Research
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Introduction to correlation and regression analysis
Introduction to correlation and regression analysisIntroduction to correlation and regression analysis
Introduction to correlation and regression analysis
 
Chi sqr
Chi sqrChi sqr
Chi sqr
 
Chapter13
Chapter13Chapter13
Chapter13
 
Regression and corelation (Biostatistics)
Regression and corelation (Biostatistics)Regression and corelation (Biostatistics)
Regression and corelation (Biostatistics)
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
Correlation in Statistics
Correlation in StatisticsCorrelation in Statistics
Correlation in Statistics
 
Thesis_NickyGrant_2013
Thesis_NickyGrant_2013Thesis_NickyGrant_2013
Thesis_NickyGrant_2013
 
Chi Squared Test
Chi Squared TestChi Squared Test
Chi Squared Test
 
Statistical technique exercise 23 and 24 correlational study
Statistical technique exercise 23 and 24 correlational studyStatistical technique exercise 23 and 24 correlational study
Statistical technique exercise 23 and 24 correlational study
 
Multiple linear regression II
Multiple linear regression IIMultiple linear regression II
Multiple linear regression II
 
Dong Zhang's project
Dong Zhang's projectDong Zhang's project
Dong Zhang's project
 

Viewers also liked

Granger causality test
Granger causality testGranger causality test
Granger causality test
Suniya Sheikh
 
Granger Causality
Granger CausalityGranger Causality
Granger Causality
Gaetan Lion
 
Cointegration and error correction model
Cointegration and error correction modelCointegration and error correction model
Cointegration and error correction model
Aditya KS
 
7. toda yamamoto-granger causality
7. toda yamamoto-granger causality7. toda yamamoto-granger causality
7. toda yamamoto-granger causality
Quang Hoang
 
Linear regression
Linear regressionLinear regression
Linear regression
ansrivas21
 
Brief Look at Association vs causation
Brief Look at Association vs causationBrief Look at Association vs causation
Brief Look at Association vs causation
Pat Barlow
 
Introduction to Wavelet Transform with Applications to DSP
Introduction to Wavelet Transform with Applications to DSPIntroduction to Wavelet Transform with Applications to DSP
Introduction to Wavelet Transform with Applications to DSP
Hicham Berkouk
 
The Law School Bubble
The Law School BubbleThe Law School Bubble
The Law School Bubble
Gaetan Lion
 
Applied Econometrics assignment3
Applied Econometrics assignment3Applied Econometrics assignment3
Applied Econometrics assignment3
Chenguang Li
 
Regresion lineal multiple_3
Regresion lineal multiple_3Regresion lineal multiple_3
Regresion lineal multiple_3
karinita12
 
Unrestricted var out
Unrestricted var outUnrestricted var out
Unrestricted var out
Gaetan Lion
 
Quantitative method intro variable_levels_measurement
Quantitative method intro variable_levels_measurementQuantitative method intro variable_levels_measurement
Quantitative method intro variable_levels_measurement
Keiko Ono
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
Gaetan Lion
 
STATA - Merge or Drop Data
STATA - Merge or Drop DataSTATA - Merge or Drop Data
STATA - Merge or Drop Data
stata_org_uk
 
Regression topics
Regression topicsRegression topics
Regression topics
Gaetan Lion
 
Multiple Linear Regression
Multiple Linear RegressionMultiple Linear Regression
Multiple Linear Regression
Nadzirah Hanis
 
Linear Regression
Linear RegressionLinear Regression
Linear Regression
nep_test_account
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
Avjinder (Avi) Kaler
 
Gini coefficient vs economic growth
Gini coefficient vs economic growthGini coefficient vs economic growth
Gini coefficient vs economic growth
Gaetan Lion
 
STATA - Linear Regressions
STATA - Linear RegressionsSTATA - Linear Regressions
STATA - Linear Regressions
stata_org_uk
 

Viewers also liked (20)

Granger causality test
Granger causality testGranger causality test
Granger causality test
 
Granger Causality
Granger CausalityGranger Causality
Granger Causality
 
Cointegration and error correction model
Cointegration and error correction modelCointegration and error correction model
Cointegration and error correction model
 
7. toda yamamoto-granger causality
7. toda yamamoto-granger causality7. toda yamamoto-granger causality
7. toda yamamoto-granger causality
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Brief Look at Association vs causation
Brief Look at Association vs causationBrief Look at Association vs causation
Brief Look at Association vs causation
 
Introduction to Wavelet Transform with Applications to DSP
Introduction to Wavelet Transform with Applications to DSPIntroduction to Wavelet Transform with Applications to DSP
Introduction to Wavelet Transform with Applications to DSP
 
The Law School Bubble
The Law School BubbleThe Law School Bubble
The Law School Bubble
 
Applied Econometrics assignment3
Applied Econometrics assignment3Applied Econometrics assignment3
Applied Econometrics assignment3
 
Regresion lineal multiple_3
Regresion lineal multiple_3Regresion lineal multiple_3
Regresion lineal multiple_3
 
Unrestricted var out
Unrestricted var outUnrestricted var out
Unrestricted var out
 
Quantitative method intro variable_levels_measurement
Quantitative method intro variable_levels_measurementQuantitative method intro variable_levels_measurement
Quantitative method intro variable_levels_measurement
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
STATA - Merge or Drop Data
STATA - Merge or Drop DataSTATA - Merge or Drop Data
STATA - Merge or Drop Data
 
Regression topics
Regression topicsRegression topics
Regression topics
 
Multiple Linear Regression
Multiple Linear RegressionMultiple Linear Regression
Multiple Linear Regression
 
Linear Regression
Linear RegressionLinear Regression
Linear Regression
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
Gini coefficient vs economic growth
Gini coefficient vs economic growthGini coefficient vs economic growth
Gini coefficient vs economic growth
 
STATA - Linear Regressions
STATA - Linear RegressionsSTATA - Linear Regressions
STATA - Linear Regressions
 

Similar to Causality detection

Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
The Statistical and Applied Mathematical Sciences Institute
 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining Techniques
Valerii Klymchuk
 
Cost indexes
Cost indexesCost indexes
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
The Statistical and Applied Mathematical Sciences Institute
 
Enm fy17nano qsar
Enm fy17nano qsarEnm fy17nano qsar
Enm fy17nano qsar
PaulHarten1
 
block-mdp-masters-defense.pdf
block-mdp-masters-defense.pdfblock-mdp-masters-defense.pdf
block-mdp-masters-defense.pdf
Junghyun Lee
 
Unit3
Unit3Unit3
Leiden_VU_Delft_seminar short.pdf
Leiden_VU_Delft_seminar short.pdfLeiden_VU_Delft_seminar short.pdf
Leiden_VU_Delft_seminar short.pdf
Chiheb Ben Hammouda
 
R Packages for Time-Varying Networks and Extremal Dependence
R Packages for Time-Varying Networks and Extremal DependenceR Packages for Time-Varying Networks and Extremal Dependence
R Packages for Time-Varying Networks and Extremal Dependence
Work-Bench
 
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
Taiji Suzuki
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
The Statistical and Applied Mathematical Sciences Institute
 
Arellano bond
Arellano bondArellano bond
Estimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample SetsEstimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample Sets
Förderverein Technische Fakultät
 
Modeling of EEG (Brain waves)
Modeling of EEG (Brain waves) Modeling of EEG (Brain waves)
Modeling of EEG (Brain waves)
Kenyu Uehara
 
Bayesian Neural Networks
Bayesian Neural NetworksBayesian Neural Networks
Bayesian Neural Networks
Natan Katz
 
Random vibrations
Random vibrationsRandom vibrations
Random vibrations
Koteswara Rao Unnam
 
linear model multiple predictors.pdf
linear model multiple predictors.pdflinear model multiple predictors.pdf
linear model multiple predictors.pdf
ssuser7d5314
 
SSC20160524
SSC20160524SSC20160524
SSC20160524
Dexen Xi
 
A General Framework for Enhancing Prediction Performance on Time Series Data
A General Framework for Enhancing Prediction Performance on Time Series DataA General Framework for Enhancing Prediction Performance on Time Series Data
A General Framework for Enhancing Prediction Performance on Time Series Data
HopeBay Technologies, Inc.
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
Baivab Nag
 

Similar to Causality detection (20)

Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining Techniques
 
Cost indexes
Cost indexesCost indexes
Cost indexes
 
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
 
Enm fy17nano qsar
Enm fy17nano qsarEnm fy17nano qsar
Enm fy17nano qsar
 
block-mdp-masters-defense.pdf
block-mdp-masters-defense.pdfblock-mdp-masters-defense.pdf
block-mdp-masters-defense.pdf
 
Unit3
Unit3Unit3
Unit3
 
Leiden_VU_Delft_seminar short.pdf
Leiden_VU_Delft_seminar short.pdfLeiden_VU_Delft_seminar short.pdf
Leiden_VU_Delft_seminar short.pdf
 
R Packages for Time-Varying Networks and Extremal Dependence
R Packages for Time-Varying Networks and Extremal DependenceR Packages for Time-Varying Networks and Extremal Dependence
R Packages for Time-Varying Networks and Extremal Dependence
 
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
 
Arellano bond
Arellano bondArellano bond
Arellano bond
 
Estimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample SetsEstimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample Sets
 
Modeling of EEG (Brain waves)
Modeling of EEG (Brain waves) Modeling of EEG (Brain waves)
Modeling of EEG (Brain waves)
 
Bayesian Neural Networks
Bayesian Neural NetworksBayesian Neural Networks
Bayesian Neural Networks
 
Random vibrations
Random vibrationsRandom vibrations
Random vibrations
 
linear model multiple predictors.pdf
linear model multiple predictors.pdflinear model multiple predictors.pdf
linear model multiple predictors.pdf
 
SSC20160524
SSC20160524SSC20160524
SSC20160524
 
A General Framework for Enhancing Prediction Performance on Time Series Data
A General Framework for Enhancing Prediction Performance on Time Series DataA General Framework for Enhancing Prediction Performance on Time Series Data
A General Framework for Enhancing Prediction Performance on Time Series Data
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
 

Recently uploaded

Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
EduSkills OECD
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
Priyankaranawat4
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
Jyoti Chand
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
S. Raj Kumar
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...
Diana Rendina
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
Dr. Mulla Adam Ali
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
Nguyen Thanh Tu Collection
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
Himanshu Rai
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
Israel Genealogy Research Association
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 

Recently uploaded (20)

Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 

Causality detection

  • 1. Causality Detection in Time Series Tushar Mehndiratta IDD CSE ( V year) 10211026
  • 2. Overview  Introduction  Detecting Causality  Control Experimentation  Granger Causality  Building Causal Relationship Graphs  Exhaustive Granger Method  Lasso Granger Method  Forward backward Granger Method  Application of Causal Modelling  Brain Imaging  Topic Mining  Anomaly Detection  Conclusion
  • 3. Why? Why did the apple fall down instead of going up? Why does average temperature rise? Why did the stock market fall? Why did a post go viral on facebook? CAUSE EFFECT RELATIONSHIPS
  • 4. Cause - Effect Relationships ◦ Causality is defined as the relation between two events: cause and effect where the effect occurs as a consequence of the cause. ◦ Effect is “What happened?” and Cause is “Why it happened?” ◦ e.g. In case of global warming, the increase in Greenhouse gases is the cause and increase in average temperature is the effect.[1]
  • 5. Characteristics of Causal Relationships ◦ Temporal Precedence: It states that the cause occurs prior to the effect. e.g. A person must smoke first and then he gets lung cancer. ◦ Co-occurrence : Whenever cause happens, effect must also happen. Cause cannot be isolated from the effect. e.g. Whenever there is a net force on a body, it will accelerate. Is Causality same as Association then?
  • 6. Correlation Vs Causation ◦ Correlation does not imply Causation ◦ Correlation only means that two events co-exist more often than ordinary chance.[2]
  • 7. Physics Econometrics Types of Data: web metrics , stock prices, sales (all time series) Medicine Types of Data: experiments result, gene sequences(sequential data), brain signals(time series) Climate Science Types of Data: weather conditions (spatio-temporal or temporal data) Fields of Study HOW TO DETECT CAUSALITY?
  • 8. Detecting Causality To test if X causes Y
  • 9. Control Experimentation Aim: To find out what happens to a system when you interfere with it. Divide subjects randomly into two groups: Test and Control Introduce X only in the test group and observe Y in both. If X causes Y : ܲ(ܻ=ݕ|݀݋(ܺ)) > ܲ(ܻ=ݕ|!݀݋(ܺ)) implies Causality
  • 10. Disadvantage of Control Experimentation ◦ Not possible to always carry out the experiment. ◦ Most time series data cannot be manipulated. e.g. Climate, Stock data ◦ Have to resort to statistical methods to determine causality. HOW TO DO IT IN TIME SERIES?
  • 11. Time Series ◦ A time series is a sequence of data points, measured typically at successive points in time spaced at uniform time intervals.
  • 12. Granger Causality ◦ Also known as Predictive Causality. ◦ Granger said that Causality could be reflected by measuring the ability of predicting the future values of a time series using past values of another time series. ◦ Two main principles:  Cause must occur before the Effect.  The Cause can be used to predict the of Effect i.e. Cause has some unique information about the future values of the effect.
  • 13. Granger Causality Suppose X and Y are two time series and for X to cause Y : 푃[푌(푡 + 1)|훤 푡 ≠ 푃[푌(푡 + 1)| 훤−X 푡 훤 푡 and 훤−X 푡 denote the “information in the universe up to time t” and “information in alternate universe up to time t in which X is excluded”.
  • 14. Performing the Granger Causality test ◦ Model 1: Build model 1 by regressing on the past values of both X and Y 푚 훼푗푌푡−푗 + 푖=1 퐸(푌|푌푡−푘 , 푋푡−푘 ) 푌푡 = 푗=1 푛 훽푖 푋푡−푖 + 퐷푡 + 휀푡 ◦ Model 2: Build model 2 by regressing on the past values of Y only 푚 훼푗푌푡−푗 + 퐷푡 + 휀푡 퐸(푌|푌푡−푘 ) 푌푡 = 푗=1 ◦ Check whether the prediction accuracy has significantly increased by performing F-test.[11]
  • 15. Granger Causality • CONS  It does not take into account the effect of hidden common causes(confounders)  It assumes that all the relationships are linear in nature and does not account for non-linear dependencies. HOW TO DEAL WITH MULTIPLE TIME SERIES?
  • 16. Relationship Graphs in Time Series Extending the concept of Granger Causality to Mult iple Time Series
  • 17. Relationship Graphs ◦ Relationship graph has all time series as nodes and an edge between any two nodes denotes the direction of relationship between the two. ◦ Input:  Matrix X of time series  Xlag which is the lagged versions of time series matrix X. ◦ Output ◦ Relationship graph between the time series with nodes xi’s each edge from xi to xj if xi causes xj. xi xj
  • 18. Exhaustive Graphical Granger method ◦ Algorithm: ◦ For every pair of nodes(xi,xj) perform the following  Insert an edge xi → xj if Granger (xi,xj, Xlag) = ‘yes’ and Granger (xj,xi, Xlag) = ‘no’  Insert an edge xi ← xj if Granger (xi,xj, Xlag)= ‘no’ and Granger (xj,xi, Xlag) = ‘yes’  Insert an edge xi↔xj, if Granger (xi,xj, Xlag) = ‘yes’ and Granger (xj,xi, Xlag)= ‘yes’
  • 19. Exhaustive Graphical Granger method ◦ Complexity  A total of N time series with T lags each and P time stamps/sample size, makes the complexity as O(N2P2T2). ◦ Shortcomings  Not considering the effect of other time series.  Computationally expensive.
  • 20. The LASSO-Granger Method ◦ Uses variable selection in Causality Detection ◦ Aim is to identify the subset of time series on which xi is conditionally dependent and on what lag is it dependent. ◦ Achieved by applying variable selection on the set of time series and the lags ◦ Variable selection is done by LASSO. LASSO-Least Absolute Shrinkage and Selection Operator
  • 21. LASSO ◦ A selection method for linear regression ◦ Selects a subset of variables subject to the following condition 푤 = 푚푖푛 1 n (푤. 푥 − 푦)2+휆 푤 Here w is the vector of coefficients, y is the variable to be predicted. ◦ Aim is to minimize the OLS error and the sum of coefficients to prevent over fitting. ◦ LARS(Least Angle Regression): best method to achieve LASSO.
  • 22. LARS(Least Angle Regression Shrinkage) Step 1: Start with û0=0 Step 2: The residual ŷ2-û0 has a greater correlation with x1 than with x2
  • 23. LARS(Least Angle Regression Shrinkage) Step 3: Move in the direction of x1
  • 24. LARS(Least Angle Regression Shrinkage) Step 4: First LARS estimate : û1 = û0 + ƛx1 where the residual ŷ2-û1 has equal correlation with both x1 and x2
  • 25. LARS(Least Angle Regression Shrinkage) Step 5: Move in the direction of Angular bisector of x1 and x2
  • 26. The Lasso-Granger Method ◦ Algorithm  Obtain Xlag(the lagged version of the time series matrix X).  For each xi in X,  y= xi  Performs LASSO (y,Xlag) Wi : the set of time series for which the coefficients returned by are non-zero.  Add edge (xj, xi) to the graph if xj is in Wj
  • 27. The Lasso-Granger Method ◦ Complexity  Using LARS to solve the lasso problem: O(PN2T2). ◦ Pros.  Computationally less expensive.  Can be used when number of series are quite large as compared to the number of data points.  Consistency: The probability of Lasso falsely including a non-neighboring feature in its neighborhood is very small even when the number of features are very large.
  • 28. Forward Backward Granger Causality ◦ Improvement on LASSO-Granger Algorithm ◦ Inspired from Physics ◦ Principle: Reverse time and all the relationships must remain same except for change in direction, i.e. if xi causes xj with a time lag of k then on reversing time xj will cause xi with time lag k. ◦ Apply LASSO-Granger on both the forward and backward time series and combine the results of the two.
  • 29. Application of Causal Modelling BRAIN IMAGING TOPIC MINING ANOMALY DETECTION
  • 30. Brain Imaging ◦ How different portions of the brain affect one another.  Identify the direction and order of influence ◦ Apply Granger Causality to obtain the relationship between different components of the brain.  Obtain fMRI data from the brain corresponding to a stimulus and divided it into independent components corresponding to different sections of the brain.  Each independent component corresponds to a time series.  Apply Exhaustive Granger test to obtain the relationship between different time series.
  • 31. Brain Imaging ◦ Advantages:  No prior assumption about the nodes and their inter-connections.  Measures not only the connections but also the time lags between interactions.  Can work with a large number of regions.
  • 32. Mining topics based on Causality ◦ Identification of topics that are causally related with the non textual data iteratively. ◦ InCaToMi (Integrative Causal Topic Miner) ◦ Architecture: Topic modelling module Causality Module Feedback Text Data
  • 33. InCaToMi: Integrative Causal Topic Miner ◦ Topic Modelling Module:  Takes text and number of topics as input.  Creates topics based on word probabilities and the likelihood of each topic in the document using PLSA algorithm.  Time series of the topic formed by summation of likelihood of each word in the topic for a day.
  • 34. InCaToMi: Integrative Causal Topic Miner ◦ Causality Module:  Perform the Granger Causality test for the time series for each topic and for each word in the topic.  Form new candidate topic by selecting the words which are most causally related with the non textual series.  Use this as prior for the next round of Topic Modelling.
  • 35. Anomaly Detection ◦ Types of Anomalies:  Univariate Anomalies  Dependency Anomalies ◦ Given two sets of data sequences A(training) and B(test) each containing p time series we have to find data points in B which significantly deviate from the normal pattern of data sequence. ◦ Algorithm for finding dependency anomalies.
  • 36. Anomaly Detection Learning temporal causal graphs by regularization Finding the Anomaly Score using Kullback- Leibler (KL) Divergence Determining Anomalies by specifying a threshold and finding the underlying causes Hypothesis: Causal Graphs of both remain the same
  • 37. Anomaly Detection Learning temporal causal graphs by regularization Finding the Anomaly Score using Kullback- Leibler (KL) Divergence Determining Anomalies by specifying a threshold and finding the underlying causes Calculate the graph for A by LASSO Granger method. When finding the causal graph for B we need to apply additional constraints. This can be done using two methods: a) Neighborhood Similarity: This implies imposing an additional constraint that the values of β(a) should be zero or non-zero only when the value of β(b) are zero or non-zero. Here β(a) and β(b) are the coefficients obtained by running Lasso Granger on set A and Set B respectively. b) Coefficient similarity: The constraint is that the coefficients β(a) and β(b) should be similar.
  • 38. Anomaly Detection Learning temporal causal graphs by regularization Finding the Anomaly Score using Kullback- Leibler (KL) Divergence Determining Anomalies by specifying a threshold and finding the underlying causes KL divergence is a measure of how much one distribution differs from another. Obtain the distributions for the two time series and the anomaly score is calculated using the KL formulae.
  • 39. Anomaly Detection Learning temporal causal graphs by regularization Finding the Anomaly Score using Kullback- Leibler (KL) Divergence Determining Anomalies by specifying a threshold and finding the underlying causes • To set a threshold we calculate how a normal time series would score on the anomaly score. • We slide the window through the reference data and calculate the anomaly scores for each window. • We them use these to approximate the distribution of anomaly scores that a normal time series should have. • Given a significance level α, we set the α quantile of the distribution as threshold cutoff.
  • 40. Conclusion ◦ Widespread application of causal relationships motivates the study. ◦ Completely data driven approach. So provides a new outlook in every field without making any assumptions. ◦ Further Scope:  Applying the model to different domains. e.g Climate and Social media  Predicting anomalous behavior.
  • 41. References [1] Lashof, Daniel A., and Dilip R. Ahuja. "Relative contributions of greenhouse gas emissions to global warming." (1990): 529-531. [2] Perry, Ronen. "Correlation versus Causality: Further Thoughts on the Law Review/Law School Liaison." Conn. L. Rev. 39 (2006): 77. [3] Diks, Cees, and Valentyn Panchenko. Modified hiemstra-jones test for Granger non-causality. No. 192. Society for Computational Economics, 2004. [4] Granger, Clive WJ. "Investigating causal relations by econometric models and cross-spectral methods." Econometrica: Journal of the Econometric Society (1969): 424-438. [5] Arnold, Andrew, Yan Liu, and Naoki Abe. "Temporal causal modeling with graphical granger methods." Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2007 [6] Tibshirani, Robert. "Regression shrinkage and selection via the lasso." Journal of the Royal Statistical Society. Series B (Methodological) (1996): 267-288. [7] Cheng, Dehua, Mohammad Taha Bahadori, and Yan Liu. "FBLG: a simple and effective approach for temporal dependence discovery from time series data."Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2014. [8] Smith, Delmas, Iwabuchi, Kirk. “Demonstrating causal links between fMRI time series using time-lagged correlation”. [9] Kim, Hyun Duk, et al. "Incatomi: Integrative causal topic miner between textual and non-textual time series data." Proceedings of the 21st ACM international conference on Information and knowledge management. ACM, 2012. [10] Qiu, Liu, Subrahmanya, et al. "Granger Causality for Time-Series Anomaly Detection." Proceedings of the 12th IEEE international conference on data mining, 2012. [11] Lomax, Richard G. (2007) Statistical Concepts: A Second Course, p. 10