SlideShare a Scribd company logo
1 of 26
Download to read offline
1
Temporal Causal Models for Massive Time-series Data
Mining: Climate Change Attribution and other Applications
Yan Liu
Computer Science Department
Viterbi School of Engineering
University of Southern California
2011 Japan-America Frontiers of Engineering Symposium June 6-8, 2011
Climate Change: One of the Most Critical Issues Mankind Faces in the 21st Century
Slide 2
Understanding Climate System is Imperative to Devising Potential Solutions
 Climate system involves complex relationships between large number of variables
 Need to understand and quantify “causal” effects of the various parameters
Slide 3
Challenges with Existing Climate Models
 23 widely used global climate models: Model inter-
comparison project:
http://www.clivar.org/organization/wgcm/cmip.php
Slide 4
 Forward-simulation approach
Massive Amount of Spatial-temporal Data on Climate and Climate-forcing Agents
Slide 5
Human agents:
atmospheric constituents
Surface and atmospheric climate
Snow, Ice and Frozen Ground
Human agents: Land Cover
Solar Radiation
Massive Climate Data: New Opportunities for Machine Learning
Slide 6
Slide 7
Machine Learning Solution for Climate Modeling and Analysis
Climate Change Attribution Analysis
Input
Output
Roadmap
 Introduction of Granger Graphical Models
 Examples of Granger Graphical Models
 Granger Graphical Models for Climate Change Attribution
 Experiment Results on Biology Applications
Slide 8
Roadmap
 Introduction of Granger Graphical Models
 Examples of Granger Graphical Models
 Granger Graphical Models for Climate Change Attribution
 Experiment Results on Biology Applications
Slide 9
Temporal Causal Modeling by Graphical Granger Modeling Methods
Slide 10
 Our proposed approach for time-series analysis
 Graphical modeling using the notions of Granger causality and methods of variable selection
 Granger causality by the Nobel prize winning economist, Clive Granger
 Definition: a time series x is said to “Granger cause” another time series y, if and only if
regressing for y in terms of both past values of y and x is statically significantly better than
that of regressing in terms of past values of y only
x y
Graph Structure Learning
 Graph Structure learning [Heckerman, 1995] has been an active research area
for decades
 Recent progress on L1-penalized regression method for graph structure learning
 LASSO regression for neighborhood selection [Meinshausen and Bühlmann, Ann. Stat. 06]
Consider the p-dimensional multivariate normal distributed random variable:$ $
The neighborhood selection can be solved efficiently with the LASSO
 Block sub-gradient algorithm for finding precision matrix [Banerjee, JMLR 08]
 Efficient fixed-point equations based on a sub-gradient algorithm [Friedman et al.,
Biostatistics 08]
Slide 11
Generic Temporal Causal Modeling Method [KDD 2007 joint work with Arnold, Abe]
Slide 12
An example of REG can be Lasso [Tibshirani, 1996] Granger Causality
Neighborhood
selection
Structure learning is possible even when the number of variables is
significantly larger than that of the samples
Temporal Causal Modeling for Time-series Data Analysis
 Natural grouping of variables
 Group Lasso and group boosting [KDD 2009; ISMB 2009, with Lozano, Abe and Rosset]
 Non-stationary
 Dynamic linear system [KDD 2009, with Kalagnanam and Johnsen]
 Non-linear time-series
 Non-parametric approach [AAAI 2010, with Chen, Liu and Carbonell]
 Spatial time-series
 Spatio-temporal regression via group elastic net [KDD 2009, with Lozano et al.]
 Relational time-series
 Hidden Markov random field [Snowbird, ICML 2010, with Niculescu-Mizi, Lozano and Lu]
 Extreme event modeling
 Spatial-temporal extreme value models [KDD 2009, with Lozano et al; NIPS 2011 in
preparation]
Slide 13
Roadmap
 Introduction of Granger Graphical Models
 Examples of Granger Graphical Models
 Granger Graphical Models for Climate Change Attribution
 Experiment Results on Biology Applications
Slide 14
Slide 15
Example 1: Relational Multivariate Time-Series Data [ICML 2010, Liu et al]
 Input: multivariate time-series X(1), …, X(M) and relational graph GM
 Goal: learn a reasonable temporal causal graph for each location/species ..
Proposed approach: Hidden Markov Random Field with L1 Penalty
(HMRF-L1)
Slide 16
Proposed approach: Hidden Markov Random Field with L1 –Penalty
(HMRF-L1)
Slide 17
 Define a hidden Markov Random Field on relational graph GM
 Assign a hidden state s(i) to each time-series X(i)
 Time-series that share the same state will share component networks
 Use EM to jointly infer the hidden state assignments and the causal
structure associated with each state
Slide 18
Climate Modeling and Analysis
 We used the following 18 variables containing climate, solar radiation and greenhouse gas data
 Data pre-processing (adhering to standard practices in climate modeling)
 2.5x2.5 degree grid for North America, Monthly data for 1989-2002 with 3 months temporal lag
 Data interpolation: a common grid to join multiple data sources using smoothing splines
 De-seasonalization: removing seasonal averages
Experiment Results: Location-Specific Climate Modeling
Slide 19
Clusters of US locations by our method
(number of clusters = 3)
Causal graphs associated with each state
Map of US CO2 Concentration
(http://www.purdue.edu/eas/carbon/vulcan/GEarth)
Slide 20
Example 2: Extreme Event Modeling
 Extreme weather events happen from time to time
 Examples include heat wave, hurricane, tornado, flooding
 They are rare events, but lead to severe consequences
Slide 21
Example 2: Extreme Event Modeling
 Key questions to be answered:
 Will the extreme weather happen more intensively?
 Will the extreme weather happen more frequently?
 Our approach: hierarchical Bayesian spatio-temporal dynamic model via extreme
value distribution
 Quantify the stochastic behavior of a process at unusually large or small levels
 A point process incorporating spatio-temporal dependence structures
Slide 22
Climate Extreme Event Attribution
 We used the following 18 variables containing climate, solar
radiation and greenhouse gas data
Output causal structures in
decreasing degrees of sparsity
Roadmap
 Introduction of Granger Graphical Models
 Examples of Granger Graphical Models
 Granger Graphical Models for Climate Change Attribution
 Experiment Results on Biology Applications
Slide 23
Gene Regulatory Network Discovery [ISMB 2010]
Slide 24
Causal graphs discovered by our method
Evaluation against BioGRID
BioGRID
Recent Literature
Precision Recall F1
Our method 0.50 0.72 0.59
Sambo et al. (2008) 0.36 0.44 0.40
 Gene expression regulatory networks for the human cancer cell HeLa S3 [Whitfield
et al., 2002]
 Existing methods in the literature are unable to
 Accommodate lags greater than one
 Handle causality tests involving a large number of genes simultaneously
 Our method addresses both limitations, achieved higher accuracy, and was able
to uncovered previously uncaptured relationships
 CCNA2 to PCNA verified in [Liu, et al 2007]
 CCNE1 to ETF1 verified in [Merdzhanova, et al 2007]
 CCNE1 to CDC6 verified in [Furstenthal, et al 2001]
Granger Graphical Models for Time-series Analysis
 A general framework to reveal important dependency information about time-
series data
 Extensions to application data with different properties
 Applications: computational biology, climate science, production management
 Data properties: non-stationary, non-paranormal, relational data, spatial data, natural grouping
 On-going work
 Scalable models to massive data: online algorithms, parallel algorithms
 Anomaly detection and prediction: scalable and interpretable solutions
 Hidden variables: automatically identifying the existence of hidden variables
 Other applications: social-media analysis
Slide 25
Acknowledge
 USC Melody Lab
 IBM Research
 Harvard Medical School
Slide 26
Taha Bahadori Yanting Wu Shiv Prakash
Aurelie Lozano Naoki Abe Hongfei Li Alexandru Niculescu-Mizil
Yong Lu

More Related Content

Similar to Data Mgmt-Liu.pdf

Short Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research HighlightsShort Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research HighlightsMohamed Abuella
 
PyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsPyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsColleen Farrelly
 
2007 santiago marchi_cobem_2007
2007 santiago marchi_cobem_20072007 santiago marchi_cobem_2007
2007 santiago marchi_cobem_2007CosmoSantiago
 
12. analyzing climate change risks
12. analyzing climate change risks12. analyzing climate change risks
12. analyzing climate change risksNAP Events
 
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Charles Martin
 
A Geometry Projection Method For Shape Optimization
A Geometry Projection Method For Shape OptimizationA Geometry Projection Method For Shape Optimization
A Geometry Projection Method For Shape OptimizationJoaquin Hamad
 
Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...
Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...
Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...IOSRJM
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...IAEME Publication
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...IAEME Publication
 
Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...
Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...
Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...iosrjce
 
Forecasting of electric consumption in a semiconductor plant using time serie...
Forecasting of electric consumption in a semiconductor plant using time serie...Forecasting of electric consumption in a semiconductor plant using time serie...
Forecasting of electric consumption in a semiconductor plant using time serie...Alexander Decker
 
A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...
A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...
A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...drboon
 
Pak eko 4412ijdms01
Pak eko 4412ijdms01Pak eko 4412ijdms01
Pak eko 4412ijdms01hyuviridvic
 
New emulation based approach for probabilistic seismic demand
New emulation based approach for probabilistic seismic demandNew emulation based approach for probabilistic seismic demand
New emulation based approach for probabilistic seismic demandSebastian Contreras
 
The Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems BiologyThe Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems Biologyinside-BigData.com
 
Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1butest
 
Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1butest
 

Similar to Data Mgmt-Liu.pdf (20)

Short Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research HighlightsShort Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research Highlights
 
PyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsPyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear Models
 
2007 santiago marchi_cobem_2007
2007 santiago marchi_cobem_20072007 santiago marchi_cobem_2007
2007 santiago marchi_cobem_2007
 
12. analyzing climate change risks
12. analyzing climate change risks12. analyzing climate change risks
12. analyzing climate change risks
 
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
 
A Geometry Projection Method For Shape Optimization
A Geometry Projection Method For Shape OptimizationA Geometry Projection Method For Shape Optimization
A Geometry Projection Method For Shape Optimization
 
Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...
Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...
Application of Semiparametric Non-Linear Model on Panel Data with Very Small ...
 
Metadynamics
MetadynamicsMetadynamics
Metadynamics
 
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
 
Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...
Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...
Higher-Order Conjugate Gradient Method (HCGM) For Solving Continuous Optimal ...
 
Forecasting of electric consumption in a semiconductor plant using time serie...
Forecasting of electric consumption in a semiconductor plant using time serie...Forecasting of electric consumption in a semiconductor plant using time serie...
Forecasting of electric consumption in a semiconductor plant using time serie...
 
Melles
MellesMelles
Melles
 
A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...
A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...
A Land Data Assimilation System Utilizing Low Frequency Passive Microwave Rem...
 
Pak eko 4412ijdms01
Pak eko 4412ijdms01Pak eko 4412ijdms01
Pak eko 4412ijdms01
 
New emulation based approach for probabilistic seismic demand
New emulation based approach for probabilistic seismic demandNew emulation based approach for probabilistic seismic demand
New emulation based approach for probabilistic seismic demand
 
The Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems BiologyThe Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems Biology
 
Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1
 
Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1Présentation PowerPoint - Diapositive 1
Présentation PowerPoint - Diapositive 1
 

Recently uploaded

Data Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster AnalysisData Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster AnalysisBoston Institute of Analytics
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token PredictionNABLAS株式会社
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Klinik Aborsi
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...ssuserf63bd7
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证zifhagzkk
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancingmohamed Elzalabany
 
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethDigital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethSamantha Rae Coolbeth
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证ppy8zfkfm
 
如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证
如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证
如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证a8om7o51
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.pptRachmaGhifari
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...BabaJohn3
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesBoston Institute of Analytics
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunksgmuir1066
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...yulianti213969
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证acoha1
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjadimosmejiaslendon
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...Amil baba
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样jk0tkvfv
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfRobertoOcampo24
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"John Sobanski
 

Recently uploaded (20)

Data Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster AnalysisData Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster Analysis
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethDigital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证
如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证
如何办理加州大学伯克利分校毕业证(UCB毕业证)成绩单留信学历认证
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 

Data Mgmt-Liu.pdf

  • 1. 1 Temporal Causal Models for Massive Time-series Data Mining: Climate Change Attribution and other Applications Yan Liu Computer Science Department Viterbi School of Engineering University of Southern California 2011 Japan-America Frontiers of Engineering Symposium June 6-8, 2011
  • 2. Climate Change: One of the Most Critical Issues Mankind Faces in the 21st Century Slide 2
  • 3. Understanding Climate System is Imperative to Devising Potential Solutions  Climate system involves complex relationships between large number of variables  Need to understand and quantify “causal” effects of the various parameters Slide 3
  • 4. Challenges with Existing Climate Models  23 widely used global climate models: Model inter- comparison project: http://www.clivar.org/organization/wgcm/cmip.php Slide 4  Forward-simulation approach
  • 5. Massive Amount of Spatial-temporal Data on Climate and Climate-forcing Agents Slide 5 Human agents: atmospheric constituents Surface and atmospheric climate Snow, Ice and Frozen Ground Human agents: Land Cover Solar Radiation
  • 6. Massive Climate Data: New Opportunities for Machine Learning Slide 6
  • 7. Slide 7 Machine Learning Solution for Climate Modeling and Analysis Climate Change Attribution Analysis Input Output
  • 8. Roadmap  Introduction of Granger Graphical Models  Examples of Granger Graphical Models  Granger Graphical Models for Climate Change Attribution  Experiment Results on Biology Applications Slide 8
  • 9. Roadmap  Introduction of Granger Graphical Models  Examples of Granger Graphical Models  Granger Graphical Models for Climate Change Attribution  Experiment Results on Biology Applications Slide 9
  • 10. Temporal Causal Modeling by Graphical Granger Modeling Methods Slide 10  Our proposed approach for time-series analysis  Graphical modeling using the notions of Granger causality and methods of variable selection  Granger causality by the Nobel prize winning economist, Clive Granger  Definition: a time series x is said to “Granger cause” another time series y, if and only if regressing for y in terms of both past values of y and x is statically significantly better than that of regressing in terms of past values of y only x y
  • 11. Graph Structure Learning  Graph Structure learning [Heckerman, 1995] has been an active research area for decades  Recent progress on L1-penalized regression method for graph structure learning  LASSO regression for neighborhood selection [Meinshausen and Bühlmann, Ann. Stat. 06] Consider the p-dimensional multivariate normal distributed random variable:$ $ The neighborhood selection can be solved efficiently with the LASSO  Block sub-gradient algorithm for finding precision matrix [Banerjee, JMLR 08]  Efficient fixed-point equations based on a sub-gradient algorithm [Friedman et al., Biostatistics 08] Slide 11
  • 12. Generic Temporal Causal Modeling Method [KDD 2007 joint work with Arnold, Abe] Slide 12 An example of REG can be Lasso [Tibshirani, 1996] Granger Causality Neighborhood selection Structure learning is possible even when the number of variables is significantly larger than that of the samples
  • 13. Temporal Causal Modeling for Time-series Data Analysis  Natural grouping of variables  Group Lasso and group boosting [KDD 2009; ISMB 2009, with Lozano, Abe and Rosset]  Non-stationary  Dynamic linear system [KDD 2009, with Kalagnanam and Johnsen]  Non-linear time-series  Non-parametric approach [AAAI 2010, with Chen, Liu and Carbonell]  Spatial time-series  Spatio-temporal regression via group elastic net [KDD 2009, with Lozano et al.]  Relational time-series  Hidden Markov random field [Snowbird, ICML 2010, with Niculescu-Mizi, Lozano and Lu]  Extreme event modeling  Spatial-temporal extreme value models [KDD 2009, with Lozano et al; NIPS 2011 in preparation] Slide 13
  • 14. Roadmap  Introduction of Granger Graphical Models  Examples of Granger Graphical Models  Granger Graphical Models for Climate Change Attribution  Experiment Results on Biology Applications Slide 14
  • 15. Slide 15 Example 1: Relational Multivariate Time-Series Data [ICML 2010, Liu et al]  Input: multivariate time-series X(1), …, X(M) and relational graph GM  Goal: learn a reasonable temporal causal graph for each location/species ..
  • 16. Proposed approach: Hidden Markov Random Field with L1 Penalty (HMRF-L1) Slide 16
  • 17. Proposed approach: Hidden Markov Random Field with L1 –Penalty (HMRF-L1) Slide 17  Define a hidden Markov Random Field on relational graph GM  Assign a hidden state s(i) to each time-series X(i)  Time-series that share the same state will share component networks  Use EM to jointly infer the hidden state assignments and the causal structure associated with each state
  • 18. Slide 18 Climate Modeling and Analysis  We used the following 18 variables containing climate, solar radiation and greenhouse gas data  Data pre-processing (adhering to standard practices in climate modeling)  2.5x2.5 degree grid for North America, Monthly data for 1989-2002 with 3 months temporal lag  Data interpolation: a common grid to join multiple data sources using smoothing splines  De-seasonalization: removing seasonal averages
  • 19. Experiment Results: Location-Specific Climate Modeling Slide 19 Clusters of US locations by our method (number of clusters = 3) Causal graphs associated with each state Map of US CO2 Concentration (http://www.purdue.edu/eas/carbon/vulcan/GEarth)
  • 20. Slide 20 Example 2: Extreme Event Modeling  Extreme weather events happen from time to time  Examples include heat wave, hurricane, tornado, flooding  They are rare events, but lead to severe consequences
  • 21. Slide 21 Example 2: Extreme Event Modeling  Key questions to be answered:  Will the extreme weather happen more intensively?  Will the extreme weather happen more frequently?  Our approach: hierarchical Bayesian spatio-temporal dynamic model via extreme value distribution  Quantify the stochastic behavior of a process at unusually large or small levels  A point process incorporating spatio-temporal dependence structures
  • 22. Slide 22 Climate Extreme Event Attribution  We used the following 18 variables containing climate, solar radiation and greenhouse gas data Output causal structures in decreasing degrees of sparsity
  • 23. Roadmap  Introduction of Granger Graphical Models  Examples of Granger Graphical Models  Granger Graphical Models for Climate Change Attribution  Experiment Results on Biology Applications Slide 23
  • 24. Gene Regulatory Network Discovery [ISMB 2010] Slide 24 Causal graphs discovered by our method Evaluation against BioGRID BioGRID Recent Literature Precision Recall F1 Our method 0.50 0.72 0.59 Sambo et al. (2008) 0.36 0.44 0.40  Gene expression regulatory networks for the human cancer cell HeLa S3 [Whitfield et al., 2002]  Existing methods in the literature are unable to  Accommodate lags greater than one  Handle causality tests involving a large number of genes simultaneously  Our method addresses both limitations, achieved higher accuracy, and was able to uncovered previously uncaptured relationships  CCNA2 to PCNA verified in [Liu, et al 2007]  CCNE1 to ETF1 verified in [Merdzhanova, et al 2007]  CCNE1 to CDC6 verified in [Furstenthal, et al 2001]
  • 25. Granger Graphical Models for Time-series Analysis  A general framework to reveal important dependency information about time- series data  Extensions to application data with different properties  Applications: computational biology, climate science, production management  Data properties: non-stationary, non-paranormal, relational data, spatial data, natural grouping  On-going work  Scalable models to massive data: online algorithms, parallel algorithms  Anomaly detection and prediction: scalable and interpretable solutions  Hidden variables: automatically identifying the existence of hidden variables  Other applications: social-media analysis Slide 25
  • 26. Acknowledge  USC Melody Lab  IBM Research  Harvard Medical School Slide 26 Taha Bahadori Yanting Wu Shiv Prakash Aurelie Lozano Naoki Abe Hongfei Li Alexandru Niculescu-Mizil Yong Lu