Quang-Huy Tran
Network Science Lab
Dept. of Artificial Intelligence
The Catholic University of Korea
E-mail: huytran1126@gmail.com
2024-04-15
Deciphering Spatio-Temporal Graph
Forecasting: A Causal Lens and Treatment
Yutong Xia et al.
NeurIPS 2023: 37th Conference on Neural Information Processing Systems
2
OUTLINE
• MOTIVATION
• INTRODUCTION
• METHODOLOGY
• EXPERIMENT & RESULT
• CONCLUSION
3
MOTIVATION
• A Spatio-Temporal Graph (STG) represents the spatial and temporal relationships
between nodes or entities and is widely used in various fields (e.g., transportation,
environment, and epidemiology).
Spatio-Temporal Graph Forecasting
• STG forecasting has become crucial in the context of smart cities (e.g. informed
decision-making, sustainable environments).
4
MOTIVATION
• STG data is subject to temporal dynamics.
o The data-generating distribution varies over time, i.e., temporal
out-of-distribution (OoD) shift.
o $P_A(x) \neq P_B(x) \neq P_{test}(x)$
Spatio-Temporal Forecasting Challenges
• Dynamic spatial causation: most previous works either
o rely on a distance-based adjacency matrix to perform message
passing, or
o use an attention mechanism to calculate dynamic spatial
correlations between nodes.
o The ripple effects of causal relations also need to be modeled.
5
INTRODUCTION
• Goal: concurrently tackle the temporal OoD issue and dynamic spatial causation via causal
treatments.
• The authors propose a novel framework, the Causal Spatio-Temporal neural network (CaST):
o They present a Structural Causal Model (SCM) to gain a deeper understanding of the data generation
process of STG data.
o Based on the SCM, they use back-door adjustment to enhance generalization to unseen data
(temporal OoD).
o They also apply front-door adjustment together with an edge-level convolution operator to
effectively capture the dynamic causation between nodes.
6
METHODOLOGY
Problem Definition: Causal Lens
• Causal inference:
o investigate causal relationships between variables, ensuring stable and robust learning and
inference.
o in STG forecasting, it is commonly used to address the temporal OoD issue by learning disentangled
seasonal-trend representations or environment-specific representations.
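• Denote the signals of $N$ nodes at time step $t$, each with $D$ features, as $X^t \in \mathbb{R}^{N \times D}$.
• Given the previous $T$ time steps, learn a function $\mathcal{F}(\cdot)$ to forecast the next $S$ steps:
$X^{(t-T):t} \xrightarrow{\mathcal{F}(\cdot)} Y^{(t+1):(t+S)}$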
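The forecasting setup above can be illustrated by slicing a series into (history, future) pairs. This is a minimal sketch, not the authors' data pipeline: the function name `make_windows` and the toy array are assumptions made for illustration.

```python
import numpy as np

def make_windows(signal, T, S):
    """Split a (num_steps, N, D) array into (history, future) pairs.

    history[i] covers T consecutive steps; future[i] covers the S steps
    that immediately follow them.
    """
    num_steps = signal.shape[0]
    n_samples = num_steps - T - S + 1
    if n_samples <= 0:
        raise ValueError("series is too short for the requested T and S")
    history = np.stack([signal[i : i + T] for i in range(n_samples)])
    future = np.stack([signal[i + T : i + T + S] for i in range(n_samples)])
    return history, future

# Toy data (arbitrary values): 100 time steps, N = 5 nodes, D = 2 features.
toy = np.arange(100 * 5 * 2, dtype=float).reshape(100, 5, 2)
X_hist, Y_future = make_windows(toy, T=24, S=24)
print(X_hist.shape, Y_future.shape)
```

A model $\mathcal{F}(\cdot)$ would then be trained to map each `X_hist[i]` to the matching `Y_future[i]`.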
7
METHODOLOGY
Structural Causal Model (SCM)
• Assume E & C are independent.
o 𝑿 ⟵ 𝑬 ⟶ 𝒀: The temporal OoD can arise due to changes in external variables over time.
(e.g., weather can affect traffic flow observations).
o 𝑿 ⟵ 𝑪 ⟶ 𝒀: X and Y are intrinsically affected by the surrounding spatial context, comprising
both spurious and genuine causal components.
o 𝑿 ⟶ 𝒀: Primary goal.
o The causal relationship: $P(X, Y \mid E, C) = P(X \mid E, C)\,P(Y \mid X, E, C)$.
𝐸: Temporal environment.
𝐶: Spatial context.
𝑋: Historical node signals.
𝑌: Future signals.
8
METHODOLOGY
Structural Causal Model (SCM)
• 𝐸 and 𝐶 are confounding factors: they open the back-door paths 𝑿 ⟵ 𝑬 ⟶ 𝒀 and 𝑿 ⟵ 𝑪 ⟶ 𝒀.
9
METHODOLOGY
Structural Causal Model (SCM)
• Back-door adjustment for E
• 𝑋 is affected by both 𝐸 and 𝐶. To mitigate the effect of 𝐸, they block the back-door path
𝑋 ⟵ 𝐸 ⟶ 𝑌.
o This removes 𝐸’s confounding effect.
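For a discrete confounder, the textbook back-door adjustment is $P(Y \mid do(X)) = \sum_{e} P(Y \mid X, E=e)\,P(E=e)$. The sketch below contrasts it with the plain conditional $P(Y \mid X)$ on a toy binary example. All probabilities are made-up illustrative numbers, not values from the paper.

```python
import numpy as np

# Hypothetical toy model with binary E, X, Y (numbers are illustrative only).
p_e = np.array([0.7, 0.3])               # P(E = e)
p_x1_given_e = np.array([0.2, 0.9])      # P(X = 1 | E = e)
p_y1_given_x1_e = np.array([0.3, 0.8])   # P(Y = 1 | X = 1, E = e)

# Observational quantity: P(Y=1 | X=1) weights environments by P(E=e | X=1),
# so environments that make X=1 likely are over-represented.
p_e_given_x1 = p_e * p_x1_given_e
p_e_given_x1 = p_e_given_x1 / p_e_given_x1.sum()
p_y1_obs = float(np.sum(p_y1_given_x1_e * p_e_given_x1))

# Back-door adjustment: P(Y=1 | do(X=1)) weights environments by the prior P(E=e).
p_y1_do = float(np.sum(p_y1_given_x1_e * p_e))

print(p_y1_obs, p_y1_do)
```

The gap between the two numbers is the bias introduced by the confounder 𝐸 in this toy setting.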
10
METHODOLOGY
Structural Causal Model (SCM)
• Front-door adjustment for C
• Introduce a mediating variable $X^*$ between 𝑋 and 𝑌 that mimics a more accurate
representation, excluding the spurious parts of 𝐶.
o This de-confounds 𝐶’s spurious effect.
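For reference, the textbook front-door adjustment with mediator $X^*$ has the form below. This is the general identity from causal inference; how CaST approximates it with neural modules is not stated on this slide, and the summation variable $x'$ is notation introduced here.

```latex
P(Y \mid do(X)) = \sum_{x^*} P(x^* \mid X) \sum_{x'} P(Y \mid x^*, x')\, P(x')
```

The inner sum averages over values $x'$ of the input, which is what removes the dependence on the unobserved confounder.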
11
METHODOLOGY
Main Architecture - CaST
12
METHODOLOGY
Backdoor Adjustment
• Two steps:
o Separate the environment features from the
input data.
o Discretize the environments.
• Two main modules:
o Environment Disentangler block.
o A learnable Environment Codebook + Representation Disentanglement.
13
METHODOLOGY
Backdoor Adjustment
• Environment Disentangler block:
o EnvEncoder: a series of 1D convolutions, average pooling, and a linear
projection.
o EntEncoder: a Fast Fourier Transform and a self-attention mechanism to
extract features from both the time and frequency domains.
• Environment Codebook: a latent embedding space $e = \{e_1, \dots, e_K\}$.
o A nearest-neighbor lookup in the shared embedding space 𝑒 identifies the closest
latent vector for each node’s environment representation.
o The final environment representation is the corresponding closest discrete vector in 𝑒.
• Representation Disentanglement: entity representations should carry minimal
mutual information (MI) about the environment.
o Employ Mutual Information Neural Estimation based on the Kullback-Leibler (KL) divergence.
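The nearest-neighbor codebook lookup described above can be sketched as follows. This is a minimal illustration assuming squared Euclidean distance; the function name `quantize` and the toy codebook values are hypothetical and not taken from the paper.

```python
import numpy as np

def quantize(env_repr, codebook):
    """For each row of env_repr, return the index and vector of its nearest code."""
    # Pairwise squared Euclidean distances, shape (N, K).
    diffs = env_repr[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    idx = np.argmin(dists, axis=1)
    return idx, codebook[idx]

# Toy codebook with K = 3 entries and N = 4 node-level environment representations.
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 0.0]])
env_repr = np.array([[0.1, -0.2], [0.9, 1.2], [3.5, 0.4], [0.6, 0.6]])
indices, quantized = quantize(env_repr, codebook)
print(indices)
```

Each node's continuous environment representation is replaced by one of the K shared discrete vectors, so several nodes can map to the same environment.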
14
METHODOLOGY
Front-door Adjustment
• Construct boundary edge graph:
o First-order: maps pairs of nodes to edges.
o Second-order: maps pairs of edges to
triangles.
• Introduce the Hodge-Laplacian (HL) Deconfounder:
o Build a higher-order graph over edges and perform edge convolution to filter
the edge signals.
o Goal: capture the dynamic causal relations between nodes, and use position
embeddings to learn the nodes’ global location information.
o Then, use a linear transformation and a graph convolutional network
(GCN) to create the causal surrogate.
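To make the edge-level operator concrete, the sketch below builds the standard first-order Hodge Laplacian $L_1 = B_1^\top B_1 + B_2 B_2^\top$ from node-edge and edge-triangle incidence matrices on a four-node toy graph. The graph, the helper name `boundary_operators`, and the one-step filter at the end are assumptions for illustration; they are not the paper's HL Deconfounder.

```python
import numpy as np

def boundary_operators(num_nodes, edges, triangles):
    """Build the node-edge (B1) and edge-triangle (B2) incidence matrices."""
    edge_index = {e: i for i, e in enumerate(edges)}
    B1 = np.zeros((num_nodes, len(edges)))
    for j, (u, v) in enumerate(edges):          # edge oriented u -> v, with u < v
        B1[u, j] = -1.0
        B1[v, j] = 1.0
    B2 = np.zeros((len(edges), len(triangles)))
    for k, (a, b, c) in enumerate(triangles):   # triangle with a < b < c
        B2[edge_index[(a, b)], k] = 1.0
        B2[edge_index[(b, c)], k] = 1.0
        B2[edge_index[(a, c)], k] = -1.0
    return B1, B2

# Toy graph: a triangle 0-1-2 plus a tail edge 2-3.
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
triangles = [(0, 1, 2)]
B1, B2 = boundary_operators(4, edges, triangles)

# First-order Hodge Laplacian: acts on signals that live on edges.
L1 = B1.T @ B1 + B2 @ B2.T

# Hypothetical one-step filter on an edge signal (illustrative, not from the paper).
edge_signal = np.array([1.0, 2.0, 3.0, 4.0])
filtered = edge_signal - 0.1 * (L1 @ edge_signal)
print(L1.shape, filtered.shape)
```

The check `B1 @ B2 == 0` (the boundary of a boundary is empty) is a quick sanity test that the orientations are consistent.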
15
METHODOLOGY
Loss Function - Optimization
• Mutual Information Regularization: prevent the classifier from discerning the true labels.
o Ensure the classifier cannot determine the true corresponding environment from the
information provided by the hidden features.
• Environment Codebook: prediction loss and codebook loss.
𝛼: balancing hyperparameter.
𝑠𝑔[. ]: stop gradient operator.
• Overall loss function:
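The slide names the ingredients of the codebook objective (a prediction loss, a codebook loss, 𝛼, and 𝑠𝑔[·]) without giving the formula. A common objective with exactly these ingredients is the VQ-VAE loss. The form below is an assumption based on that standard formulation, with $z_e$ (encoder output) and $e$ (selected codebook vector) as notation introduced here; it may differ from the paper's exact expression.

```latex
\mathcal{L}_{cb} = \mathcal{L}_{pred}
  + \lVert sg[z_e] - e \rVert_2^2
  + \alpha \, \lVert z_e - sg[e] \rVert_2^2
```

In this form the second term moves the codebook vector toward the encoder output, while the third term, weighted by $\alpha$, keeps the encoder output close to its selected code.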
16
EXPERIMENT AND RESULT
EXPERIMENT
• Measurement:
o Mean Absolute Errors (MAE) and Root Mean Squared Errors (RMSE).
• Dataset:
o PEMS08: traffic flow data on 8 roads with a time interval of 5 minutes.
o AIR-BJ and AIR-GZ: one-year PM2.5 readings collected from air quality monitoring stations in Beijing
and Guangzhou.
• Task:
o predict over the next 24 steps given the past 24 steps.
• Variants:
o CaST-ADP: using a self-adaptive adjacency matrix.
o CaST-GAT: using the graph attention mechanism for causal scoring.
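The two evaluation metrics listed under Measurement can be computed as below. This is a minimal sketch with made-up toy values; masking of missing readings, which some benchmark pipelines apply, is not handled here.

```python
import numpy as np

def mae(y_true, y_pred):
    """Mean Absolute Error over all entries."""
    return float(np.mean(np.abs(y_true - y_pred)))

def rmse(y_true, y_pred):
    """Root Mean Squared Error over all entries."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

# Toy values (illustrative only).
y_true = np.array([[10.0, 12.0], [14.0, 16.0]])
y_pred = np.array([[11.0, 12.0], [12.0, 19.0]])
print(mae(y_true, y_pred), rmse(y_true, y_pred))
```

RMSE penalizes large errors more heavily than MAE, which is why the two are usually reported together.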
17
EXPERIMENT AND RESULT
EXPERIMENT
• Baselines:
o Historical Average (HA).
o Vector Autoregression (VAR).
o DCRNN [1]: Diffusion Convolutional Recurrent Neural Network.
o STSGCN [2]: Spatial-Temporal Synchronous Graph Convolutional Networks.
o ASTGCN [3]: Attention-Based Spatial-Temporal Graph Convolutional Networks.
o MTGNN [4]: Multivariate Time Series Graph Neural Network.
o AGCRN [5]: Adaptive Graph Convolutional Recurrent Network.
o GMSDR [6]: Graph Multi-Step Dependency Relation.
o STGNCDE [7]: Spatio-Temporal Graph Neural Controlled Differential Equation.
[1] Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In ICLR.
[2] Chao Song, Youfang Lin, Shengnan Guo, and Huaiyu Wan. 2020. Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In AAAI.
[3] Shengnan Guo, Youfang Lin, Ning Feng, Chao Song, and Huaiyu Wan. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 922–929.
[4] Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Xiaojun Chang, and Chengqi Zhang. 2020. Connecting the dots: Multivariate time series forecasting with graph neural networks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 753–763.
[5] Lei Bai, Lina Yao, Can Li, Xianzhi Wang, and Can Wang. 2020. Adaptive graph convolutional recurrent network for traffic forecasting. Advances in Neural Information Processing Systems 33 (2020), 17804–17815.
[6] Dachuan Liu, Jin Wang, Shuo Shang, and Peng Han. 2022. MSDR: Multi-step dependency relation networks for spatial temporal forecasting. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1042–1050.
[7] Jeongwhan Choi, Hwangyong Choi, Jeehyun Hwang, and Noseong Park. 2022. Graph neural controlled differential equations for traffic forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 6367–6374.
18
EXPERIMENT AND RESULT
RESULT
19
EXPERIMENT AND RESULT
RESULT – Edge and Component Analysis
Fig: Performance comparison of different variants on the
PEMS08 and England datasets.
• Effects of Edge Convolution
• Effects of Components and Visualization of Dynamic Spatial Causation
20
EXPERIMENT AND RESULT
RESULT – Interpretation Analysis
• Analysis on Environmental Codebook
• Interpretation of Temporal Environments
21
CONCLUSION
• Took a causal look at the STG forecasting problem, addressing the temporal
out-of-distribution (OoD) issue and dynamic spatial causation.
• Proposed a novel Causal Spatio-Temporal neural network (CaST).
• Used back-door and front-door adjustments to resolve these two challenges.
• Verified effectiveness, generalizability, and interpretability through extensive
experiments on three datasets.
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment.pptx
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment.pptx

More Related Content

Similar to [20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment.pptx

Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...SamanthaGallone
 
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...Hiroki Nakahara
 
IEEE International Conference Presentation
IEEE International Conference PresentationIEEE International Conference Presentation
IEEE International Conference PresentationAnmol Dwivedi
 
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODNONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODijwmn
 
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODNONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODijwmn
 
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...IOSR Journals
 
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...IOSR Journals
 
Testing the global grid of master events for waveform cross correlation with ...
Testing the global grid of master events for waveform cross correlation with ...Testing the global grid of master events for waveform cross correlation with ...
Testing the global grid of master events for waveform cross correlation with ...Ivan Kitov
 
ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21Dae Woon Kim
 
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...Globus
 
CSS_Workshop_Stockholm.pdf
CSS_Workshop_Stockholm.pdfCSS_Workshop_Stockholm.pdf
CSS_Workshop_Stockholm.pdfMariodiBernardo2
 
P REDICTION F OR S HORT -T ERM T RAFFIC F LOW B ASED O N O PTIMIZED W...
P REDICTION  F OR  S HORT -T ERM  T RAFFIC  F LOW  B ASED  O N  O PTIMIZED  W...P REDICTION  F OR  S HORT -T ERM  T RAFFIC  F LOW  B ASED  O N  O PTIMIZED  W...
P REDICTION F OR S HORT -T ERM T RAFFIC F LOW B ASED O N O PTIMIZED W...ijcsit
 
“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...
“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...
“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...Edge AI and Vision Alliance
 
BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R
BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R
BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R UVCE
 
Pedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IVPedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IVYu Huang
 
Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Ganesan Narayanasamy
 
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...acijjournal
 

Similar to [20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment.pptx (20)

Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...
 
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
 
IEEE International Conference Presentation
IEEE International Conference PresentationIEEE International Conference Presentation
IEEE International Conference Presentation
 
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODNONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
 
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODNONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD
 
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
 
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
 
Testing the global grid of master events for waveform cross correlation with ...
Testing the global grid of master events for waveform cross correlation with ...Testing the global grid of master events for waveform cross correlation with ...
Testing the global grid of master events for waveform cross correlation with ...
 
ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21
 
ME Synopsis
ME SynopsisME Synopsis
ME Synopsis
 
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
 
CSS_Workshop_Stockholm.pdf
CSS_Workshop_Stockholm.pdfCSS_Workshop_Stockholm.pdf
CSS_Workshop_Stockholm.pdf
 
P REDICTION F OR S HORT -T ERM T RAFFIC F LOW B ASED O N O PTIMIZED W...
P REDICTION  F OR  S HORT -T ERM  T RAFFIC  F LOW  B ASED  O N  O PTIMIZED  W...P REDICTION  F OR  S HORT -T ERM  T RAFFIC  F LOW  B ASED  O N  O PTIMIZED  W...
P REDICTION F OR S HORT -T ERM T RAFFIC F LOW B ASED O N O PTIMIZED W...
 
“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...
“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...
“New Methods for Implementation of 2-D Convolution for Convolutional Neural N...
 
BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R
BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R
BU (UVCE)5th Sem Electronics syllabus copy from Lohith kumar R
 
Pedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IVPedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IV
 
SBU072811_short.ppt
SBU072811_short.pptSBU072811_short.ppt
SBU072811_short.ppt
 
Automatic reading cr39
Automatic reading cr39Automatic reading cr39
Automatic reading cr39
 
Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9
 
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
 

More from thanhdowork

[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...thanhdowork
 
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...thanhdowork
 
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...thanhdowork
 
240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...
240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...
240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...thanhdowork
 
240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...
240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...
240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...thanhdowork
 
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...thanhdowork
 
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...thanhdowork
 
240115_Attention Is All You Need (2017 NIPS).pptx
240115_Attention Is All You Need (2017 NIPS).pptx240115_Attention Is All You Need (2017 NIPS).pptx
240115_Attention Is All You Need (2017 NIPS).pptxthanhdowork
 
240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...
240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...
240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...thanhdowork
 
240122_Attention Is All You Need (2017 NIPS)2.pptx
240122_Attention Is All You Need (2017 NIPS)2.pptx240122_Attention Is All You Need (2017 NIPS)2.pptx
240122_Attention Is All You Need (2017 NIPS)2.pptxthanhdowork
 
240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...
240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...
240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...thanhdowork
 
[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....
[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....
[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....thanhdowork
 
240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx
240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx
240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptxthanhdowork
 
240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...
240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...
240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...thanhdowork
 
240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx
240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx
240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptxthanhdowork
 
[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx
[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx
[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptxthanhdowork
 
240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...
240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...
240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...thanhdowork
 
240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...
240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...
240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...thanhdowork
 
240318_JW_labseminar[Attention Is All You Need].pptx
240318_JW_labseminar[Attention Is All You Need].pptx240318_JW_labseminar[Attention Is All You Need].pptx
240318_JW_labseminar[Attention Is All You Need].pptxthanhdowork
 
240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...
240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...
240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...thanhdowork
 

More from thanhdowork (20)

[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
 
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
 
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
 
240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...
240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...
240422_Thanh_LabSeminar[Dynamic Graph Enhanced Contrastive Learning for Chest...
 
240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...
240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...
240422_Thuy_Labseminar[Large Graph Property Prediction via Graph Segment Trai...
 
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
 
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
240415_Thuy_Labseminar[Simple and Asymmetric Graph Contrastive Learning witho...
 
240115_Attention Is All You Need (2017 NIPS).pptx
240115_Attention Is All You Need (2017 NIPS).pptx240115_Attention Is All You Need (2017 NIPS).pptx
240115_Attention Is All You Need (2017 NIPS).pptx
 
240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...
240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...
240115_Thanh_LabSeminar[Don't walk, skip! online learning of multi-scale netw...
 
240122_Attention Is All You Need (2017 NIPS)2.pptx
240122_Attention Is All You Need (2017 NIPS)2.pptx240122_Attention Is All You Need (2017 NIPS)2.pptx
240122_Attention Is All You Need (2017 NIPS)2.pptx
 
240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...
240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...
240226_Thanh_LabSeminar[Structure-Aware Transformer for Graph Representation ...
 
[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....
[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....
[20240304_LabSeminar_Huy]DeepWalk: Online Learning of Social Representations....
 
240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx
240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx
240304_Thanh_LabSeminar[Pure Transformers are Powerful Graph Learners].pptx
 
240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...
240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...
240304_Thuy_Labseminar[SimGRACE: A Simple Framework for Graph Contrastive Lea...
 
240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx
240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx
240311_JW_labseminar[Sequence to Sequence Learning with Neural Networks].pptx
 
[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx
[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx
[20240311_LabSeminar_Huy]LINE: Large-scale Information Network Embedding.pptx
 
240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...
240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...
240311_Thanh_LabSeminar[Translating Embeddings for Modeling Multi-relational ...
 
240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...
240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...
240311_Thuy_Labseminar[Contrastive Multi-View Representation Learning on Grap...
 
240318_JW_labseminar[Attention Is All You Need].pptx
240318_JW_labseminar[Attention Is All You Need].pptx240318_JW_labseminar[Attention Is All You Need].pptx
240318_JW_labseminar[Attention Is All You Need].pptx
 
240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...
240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...
240318_Thanh_LabSeminar[Learning Graph Embeddings for Compositional Zero-shot...
 

[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment.pptx

  • 1. Quang-Huy Tran Network Science Lab Dept. of Artificial Intelligence The Catholic University of Korea E-mail: huytran1126@gmail.com 2024-04-15 Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment Yutong Xia et al. NeurIPS’37: 2023 Conference on Neural Information Processing Systems
  • 2. 2 OUTLINE • MOTIVATION • INTRODUCTION • METHODOLOGY • EXPERIMENT & RESULT • CONCLUSION
  • 3. 3 MOTIVATION • Spatio-Temporal Graph (STG) represents the spatial and temporal relationships between nodes or entities, which is widely used in various fields (e.g., transportation, environment and epidemiology). Spatio-Temporal Graph Forecasting • STG forecasting has become crucial in the context of smart cities (e.g. informed decision-making, sustainable environments).
  • 4. 4 MOTIVATION Spatio-Temporal Forecasting Challenges • STG data is subject to temporal dynamics. o The data-generating distribution varies over time, i.e., temporal out-of-distribution (OoD) shift. o $P_A(x) \neq P_B(x) \neq P_{test}(x)$ • Dynamic spatial causation: most previous works do one of the following. o Rely on a distance-based adjacency matrix to perform message passing. o Or use an attention mechanism to calculate dynamic spatial correlations between nodes. o Ripple effects of causal relations.
  • 5. 5 INTRODUCTION • Concurrently tackle the temporal OoD issue and dynamic spatial causation via causal treatments. • Proposed a novel framework, the Causal Spatio-Temporal neural network (CaST): o Presented a Structural Causal Model (SCM) to gain a deeper understanding of the data generation process of STG data. o Based on the SCM, they: o utilize back-door adjustment to enhance the generalization capability for unseen data (temporal OoD). o apply front-door adjustment along with an edge-level convolution operator to effectively capture the dynamic causation between nodes.
  • 6. 6 METHODOLOGY Problem Definition: Causal Lens • Causal inference: o investigates causal relationships between variables, ensuring stable and robust learning and inference. o in STG forecasting, it is commonly used to address the temporal OoD issue by learning disentangled seasonal-trend representations or environment-specific representations. • Denote the signal $X_t \in \mathbb{R}^{N \times D}$ of $N$ nodes at time step $t$, with $D$ features. • Given the previous $T$ time steps, learn a function $\mathcal{F}(\cdot)$ to forecast the next $S$ steps: $[X_{(t-T):t}] \xrightarrow{\mathcal{F}(\cdot)} [Y_{(t+1):(t+S)}]$
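The problem definition on slide 6 can be illustrated with a small windowing sketch. This is only an illustration under assumptions: the helper name `make_windows` and the toy array are hypothetical, and the signal is assumed to be stored as an array of shape (steps, N, D).

```python
import numpy as np

def make_windows(signal, T, S):
    """Split a (steps, N, D) signal into (history, future) training pairs.

    Each history window holds T consecutive steps and each future window
    holds the S steps that immediately follow it.
    """
    steps = signal.shape[0]
    xs, ys = [], []
    for t in range(T, steps - S + 1):
        xs.append(signal[t - T:t])   # previous T steps
        ys.append(signal[t:t + S])   # next S steps
    return np.stack(xs), np.stack(ys)

# Toy signal: 10 time steps, N = 3 nodes, D = 2 features (made-up values).
signal = np.arange(10 * 3 * 2, dtype=float).reshape(10, 3, 2)
X, Y = make_windows(signal, T=4, S=2)
print(X.shape, Y.shape)
```

A forecasting model then plays the role of the function that maps each history window in `X` to the matching future window in `Y`.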
  • 7. 7 METHODOLOGY Structural Causal Model (SCM) • Assume E & C are independent. o 𝑿 ⟵ 𝑬 ⟶ 𝒀: The temporal OoD can arise due to changes in external variables over time (e.g., weather can affect traffic flow observations). o 𝑿 ⟵ 𝑪 ⟶ 𝒀: X and Y are intrinsically affected by the surrounding spatial context, comprising both spurious and genuine causal components. o 𝑿 ⟶ 𝒀: Primary goal. o The causal relationship: $P(X, Y \mid E, C) = P(X \mid E, C)\, P(Y \mid X, E, C)$. 𝐸: Temporal Environment. 𝐶: Spatial Context. 𝑋: Historical Node Signals. 𝑌: Future Signals.
  • 8. 8 METHODOLOGY Structural Causal Model (SCM) • Assume E & C are independent. o 𝑿 ⟵ 𝑬 ⟶ 𝒀: The temporal OoD can arise due to changes in external variables over time (e.g., weather can affect traffic flow observations). o 𝑿 ⟵ 𝑪 ⟶ 𝒀: X and Y are intrinsically affected by the surrounding spatial context, comprising both spurious and genuine causal components. o 𝑿 ⟶ 𝒀: Primary goal. o The causal relationship: $P(X, Y \mid E, C) = P(X \mid E, C)\, P(Y \mid X, E, C)$. 𝐸: Temporal Environment. 𝐶: Spatial Context. 𝑋: Historical Node Signals. 𝑌: Future Signals. Figure labels: confounding factors; back-door paths.
  • 9. 9 METHODOLOGY Structural Causal Model (SCM) • Back-door adjustment for E • 𝑋 is affected by both 𝐸 and 𝐶. To mitigate the influence of 𝐸, they block the back-door path from 𝐸 to 𝑋. o Remove 𝐸’s confounding effect.
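For reference, the textbook form of the back-door adjustment, written with 𝐸 as the adjustment variable, is shown below. This is the standard identity from causal inference, not a formula copied from the slides; the discrete sum assumes the environment takes finitely many values, which matches the later discretization of environments into a codebook.

```latex
P(Y \mid do(X)) = \sum_{e} P(Y \mid X, E = e)\, P(E = e)
```

Averaging over environments with weights $P(E = e)$, rather than $P(E = e \mid X)$, is what removes the dependence of the environment on the observed input.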
  • 10. 10 METHODOLOGY Structural Causal Model (SCM) • Front-door adjustment for C • Introduce a mediating variable 𝑋∗ between 𝑋 and 𝑌 to mimic a more accurate representation that excludes the spurious parts of 𝐶. o De-confound 𝐶’s spurious effect.
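The textbook front-door adjustment with mediator $X^*$ can be written as follows. Again, this is the standard identity rather than an expression taken from the slides, and $x'$ is just a summation variable ranging over values of $X$.

```latex
P(Y \mid do(X)) = \sum_{x^*} P(x^* \mid X) \sum_{x'} P(Y \mid x^*, x')\, P(x')
```

The formula needs no access to 𝐶 itself, which is why a mediator is useful when the spatial context cannot be observed or enumerated directly.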
  • 12. 12 METHODOLOGY Backdoor Adjustment • 2 steps: o separating the environment feature from the input data. o discretizing the environments. • 2 main modules: o Environment Disentangler block. o A learnable Environment Codebook + Representation Disentanglement.
  • 13. 13 METHODOLOGY Backdoor Adjustment • Environment Disentangler block: o EnvEncoder: a series of 1D convolutions, average pooling, and a linear projection. o EntEncoder: Fast Fourier Transform and a self-attention mechanism to extract features from both the time and frequency domains. • Environment Codebook: latent embedding space $e = \{e_1, \ldots, e_K\}$ o a nearest-neighbor lookup in the shared embedding space $e$ identifies the closest latent vector for each node’s environment representation. o The final environment representation is the corresponding closest discrete vector in $e$. • Representation Disentanglement: entity representations should carry minimal mutual information (MI) about the environment. o Employ Mutual Information Neural Estimation, based on the Kullback-Leibler (KL) divergence.
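The nearest-neighbor codebook lookup described on slide 13 can be sketched as below. This is a minimal illustration, assuming squared Euclidean distance and a fixed (not learned) codebook; the function name `quantize` and the toy values are hypothetical.

```python
import numpy as np

def quantize(env_repr, codebook):
    """Replace each node's environment vector by its closest codebook entry.

    env_repr: (N, d) continuous environment representations, one per node.
    codebook: (K, d) discrete latent vectors e_1, ..., e_K.
    Returns the quantized (N, d) vectors and the chosen indices (N,).
    """
    diff = env_repr[:, None, :] - codebook[None, :, :]  # (N, K, d)
    dist = (diff ** 2).sum(axis=-1)                     # squared distances, (N, K)
    idx = dist.argmin(axis=1)                           # nearest entry per node
    return codebook[idx], idx

# Toy example with K = 3 environments and d = 2 (made-up values).
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 0.0]])
env_repr = np.array([[0.1, -0.2], [0.9, 1.2], [3.5, 0.4]])
quantized, idx = quantize(env_repr, codebook)
print(idx)
```

In training, the codebook entries would be learnable parameters; the sketch only shows the lookup step.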
  • 14. 14 METHODOLOGY Front-door Adjustment • Construct the boundary edge graph: o First-order: maps pairs of nodes to edges. o Second-order: maps pairs of edges to triangles. • Introduce the Hodge-Laplacian (HL) Deconfounder: o build a higher-order graph over edges and perform edge convolution to filter edge signals. o Goal: capture the dynamic causal relations between nodes, and use position embeddings to learn the nodes’ global location information. o Then, use a linear transformation and graph convolutional networks (GCN) to create the causal surrogate.
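The first- and second-order boundary operators on slide 14 can be made concrete with a small sketch. It uses the standard definition of the Hodge 1-Laplacian, $L_1 = B_1^\top B_1 + B_2 B_2^\top$, on a hypothetical 4-node graph with one triangle; the graph, the orientation convention, and the variable names are assumptions for illustration, not the presented model's implementation.

```python
import numpy as np

# Toy graph (assumed): 4 nodes, 4 oriented edges, one filled triangle.
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
triangles = [(0, 1, 2)]
num_nodes = 4
edge_index = {e: i for i, e in enumerate(edges)}

# First-order boundary operator B1 (nodes x edges):
# edge (u, v) leaves u (-1) and enters v (+1).
B1 = np.zeros((num_nodes, len(edges)))
for j, (u, v) in enumerate(edges):
    B1[u, j] = -1.0
    B1[v, j] = 1.0

# Second-order boundary operator B2 (edges x triangles):
# triangle (a, b, c) has boundary (a, b) + (b, c) - (a, c).
B2 = np.zeros((len(edges), len(triangles)))
for j, (a, b, c) in enumerate(triangles):
    B2[edge_index[(a, b)], j] = 1.0
    B2[edge_index[(b, c)], j] = 1.0
    B2[edge_index[(a, c)], j] = -1.0

# Hodge 1-Laplacian acting on edge signals.
L1 = B1.T @ B1 + B2 @ B2.T

# One linear filtering step on a toy edge signal (illustrative only).
edge_signal = np.array([1.0, 0.0, -1.0, 2.0])
filtered = L1 @ edge_signal
print(L1.shape, filtered.shape)
```

An edge convolution would typically apply a learned polynomial of `L1` to the edge signal; the single matrix product above only shows where the operator enters.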
  • 15. 15 METHODOLOGY Loss Function - Optimization • Mutual Information Regularization: prevent the classifier from discerning the true labels. o Ensure the classifier cannot determine the true corresponding environment from the information provided by the hidden features. • Environment Codebook: prediction loss and codebook loss. 𝛼: balancing hyperparameter. 𝑠𝑔[. ]: stop gradient operator. • Overall loss function:
  • 16. 16 EXPERIMENT AND RESULT EXPERIMENT • Measurement: o Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). • Dataset: o PEMS08: traffic flow data on 8 roads with a time interval of 5 minutes. o AIR-BJ and AIR-GJ: one-year PM2.5 readings collected from air quality monitoring stations in Beijing and Guangzhou. • Task: o predict the next 24 steps given the past 24 steps. • Variants: o CaST-ADP: using a self-adaptive adjacency matrix. o CaST-GAT: using the graph attention mechanism for causal scoring.
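The two evaluation metrics named on slide 16 follow their usual definitions, sketched below. The toy arrays are made up for illustration and are not results from the experiments.

```python
import numpy as np

def mae(y_true, y_pred):
    """Mean Absolute Error: average magnitude of the errors."""
    return float(np.mean(np.abs(y_true - y_pred)))

def rmse(y_true, y_pred):
    """Root Mean Squared Error: penalizes large errors more heavily."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

# Toy values: three exact predictions and one that is off by 4.
y_true = np.array([1.0, 2.0, 3.0, 4.0])
y_pred = np.array([1.0, 2.0, 3.0, 8.0])
print(mae(y_true, y_pred), rmse(y_true, y_pred))
```

Because RMSE squares each error before averaging, a single large miss raises it more than it raises MAE, which is why the two are usually reported together.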
  • 17. 17 EXPERIMENT AND RESULT EXPERIMENT • Baseline: o Historical Average (HA). o Vector Autoregression (VAR). o DCRNN[1]: Diffusion Convolutional Recurrent Neural Network. o STSGCN[2]: Spatial-Temporal Synchronous Graph Convolutional Networks. o ASTGCN[3]: Attention-based Spatial-Temporal Graph Convolutional Networks. o MTGNN[4]: Multivariate Time Series Graph Neural Network. o AGCRN[5]: Adaptive Graph Convolutional Recurrent Network. o GMSDR[6]: Graph Multi-Step Dependency Relation. o STGNCDE[7]: Spatio-Temporal Graph Neural Controlled Differential Equation. [1] Li, Y.; Yu, R.; Shahabi, C.; and Liu, Y. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In ICLR. [2] Song, C.; Lin, Y.; Guo, S.; and Wan, H. 2020. Spatial temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In AAAI. [3] Shengnan Guo, Youfang Lin, Ning Feng, Chao Song, and Huaiyu Wan. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 922–929. [4] Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Xiaojun Chang, and Chengqi Zhang. 2020. Connecting the dots: Multivariate time series forecasting with graph neural networks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 753–763. [5] Lei Bai, Lina Yao, Can Li, Xianzhi Wang, and Can Wang. 2020. Adaptive graph convolutional recurrent network for traffic forecasting. Advances in Neural Information Processing Systems 33 (2020), 17804–17815. [6] Dachuan Liu, Jin Wang, Shuo Shang, and Peng Han. 2022. MSDR: Multi-step dependency relation networks for spatial temporal forecasting. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1042–1050. [7] Jeongwhan Choi, Hwangyong Choi, Jeehyun Hwang, and Noseong Park. 2022. Graph neural controlled differential equations for traffic forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 6367–6374.
  • 19. 19 EXPERIMENT AND RESULT RESULT – Edge and Component Analysis Fig: Performance comparison of different variants on PEMS08 and England datasets. • Effects of Edge Convolution • Effects of Components and Visualization of Dynamic Spatial Causation
  • 20. 20 EXPERIMENT AND RESULT RESULT – Interpretation Analysis • Analysis on Environmental Codebook • Interpretation of Temporal Environments
  • 21. 21 CONCLUSION • Took a causal look at the STG forecasting problem: • Temporal out-of-distribution (OoD) issue. • Proposed a novel Causal Spatio-Temporal neural network (CaST). • Utilized back-door and front-door adjustments for resolving challenges. • Verified effectiveness, generalizability, and interpretability through extensive experiments on three datasets

Editor's Notes

  1. 𝐾: discrete space size/ total number of environments.
  2. Variants: CaST-ADP uses a self-adaptive adjacency matrix; CaST-GAT uses the graph attention mechanism for causal scoring. Effects of components: w/o Env excludes environment features for prediction; w/o Ent omits entity features for prediction; w/o Edge does not use the causal score to guide the spatial message passing.