SlideShare a Scribd company logo
Quang-Huy Tran
Network Science Lab
Dept. of Artificial Intelligence
The Catholic University of Korea
E-mail: huytran1126@gmail.com
2024-05-13
Spatio-Temporal Graph Few-Shot Learning
with Cross-CityKnowledge Transfer
Bin Lu Lin et al.
SIGKDD’28: 2022 ACM SIGKDD Conference on Knowledge Discovery and Data Mining
2
OUTLINE
• MOTIVATION
• INTRODUCTION
• METHODOLOGY
• EXPERIMENT & RESULT
• CONCLUSION
3
MOTIVATION
• Spatio-temporal graph learning is a key method for urban computing tasks, such as
traffic flow, taxi demand and air quality forecasting.
• However, due to the high cost of data collection, developing cities have few available
data:
o infeasible to train a well-performed model.
Overview
• Knowledge transfer, such as few-shot learning, made a progress in research.
• There are challenges from previous works:
o Transfer one single source: risk of negative transfer due to the great difference.
o Transfer multiple sources: no consider the varied feature differences across cities
and within cities.
4
INTRODUCTION
• Goal: transfer the cross-city knowledge in graph-based few-shot learning scenarios.
• Research challenges:
o How to adapt feature extraction in target city via the knowledge from multiple source cities?
o How to alleviate the impacts of varied graph structure on transferring among different cities?
• Propose a novel and model-agnostic Spatio-Temporal Graph Few-Shot Learning
framework(ST-GFSL).
o generate non-shared model parameters based on node-level meta knowledge to enhance specific
feature extraction.
o reconstruct the graph structure of different cities based on meta-knowledge.
5
METHODOLOGY
PROBLEM SETTING
• For ST graph forecasting tasks, our goal is to learn a function 𝑓(·) for approximating
the true mapping of historical 𝑇 observed signals to the future signals:
• Given a ST graph 𝐺𝑆𝑇 = (𝑉, ℰ, 𝐴, 𝑋).
• 𝑉: set of 𝑁 nodes, ℰ is set of edges, 𝐴 is binary adjacency matrix, and 𝑋 is node
features matrix.
• For ST graph few-show learning, suppose we have P source of graph cities 𝐺1:𝑃
𝑠𝑜𝑢𝑟𝑐𝑒
=
{𝐺1
𝑠𝑜𝑢𝑟𝑐𝑒
, … , 𝐺𝑃
𝑠𝑜𝑢𝑟𝑐𝑒
} and target 𝐺𝑡𝑎𝑟𝑔𝑒𝑡.
• After training on 𝐺1:𝑃
𝑠𝑜𝑢𝑟𝑐𝑒
,the model can leverage the meta knowledge from multiple source graphs
and is tasked to predict on a disjoint target scenario, where only few-shot structured data of
𝐺𝑡𝑎𝑟𝑔𝑒𝑡 is available.
6
METHODOLOGY
Overall Architecture
• Spatio-Temporal Neural Network (STNN)
• Cross-city Knowledge Transfer
7
METHODOLOGY
Spatio-Temporal Meta Knowledge Learner (ST-Meta Learner)
• Employ Gated Recurrent Unit (GRU).
• Utilize spatial-based graph attention network (GAT) to encode the spatial correlations.
• Meta knowledge:
8
METHODOLOGY
ST-Meta Graph Reconstruction
• ST-Meta Graph is reconstructed by meta knowledge for structure aware learning.
o We predict the likelihood of an edge existing between nodes
• To guide the structure-aware learning of meta knowledge, we introduce graph
reconstruction loss:
o ST-meta graph A𝑚𝑒𝑡𝑎 can be constructed
9
METHODOLOGY
Parameter Generation
• Linear layer: 2 linear transformation with reshape in center.
• Function 𝐹 that takes node-level meta knowledge as input and outputs the non-shared
feature extractor parameters 𝜃𝑆𝑇.
• Obtain the non-shared parameters of feature extractors for different scenarios.
• Convolutional layer: 2 2D-Convolution with reshape in center.
10
METHODOLOGY
ST-GFSL Learning Process
o Samples batches of task sets from source datasets.
o Each task 𝑇𝑖 ∈ 𝑇𝑆𝑇 belongs to one single city and is divided into support set ST𝑖
, query set 𝑄T𝑖
and
ST𝑖
∩ 𝑄T𝑖
= ∅.
o When learning a task, consider a joint loss:
• Model-agnostic methods are an approach to understand the predictive
response of a black box model, instead of the response from the original
dataset.
• Two stage: base-model meta training and adaptation.
o Meta objective: minimize the sum of task loss on query sets
11
EXPERIMENT AND RESULT
EXPERIMENT - BASELINES
• Measurement:
o Mean Absolute Error (MAE).
o Root Mean Square Error (RMSE).
• Dataset: METR-LA,PEMS-BAY, Didi-Chengdu, Didi-Shenzhen
o Traffic flow dataset.
• Baselines:
o HA: Historical Average, which formulates the traffic flow as a seasonal process, and uses average of
previous seasons as the prediction.
o ARIMA: Auto-regressive integrated moving average is a well-known model that can understand and
predict future values in a time series.
o Target-only: Directly training the model on few-shot data in target domain.
o Fine-tuned (Vanilla): We first train the model on source datasets, and then fine-tune the model on
few-shot data in target domain.
12
EXPERIMENT AND RESULT
EXPERIMENT – BASELINE
[1] Du, Y., Wang, J., Feng, W., Pan, S., Qin, T., Xu, R., & Wang, C. (2021, October). Adarnn: Adaptive learning and forecasting of time series. In Proceedings of the 30th ACM international conference on information & knowledge management (pp. 402-411).
[2] Finn, C., Abbeel, P., & Levine, S. (2017, July). Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning (pp. 1126-1135). PMLR.
[3] Lea, C., Flynn, M. D., Vidal, R., Reiter, A., & Hager, G. D. (2017). Temporal convolutional networks for action segmentation and detection. In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 156-165).
[4] Yu, B., Yin, H., & Zhu, Z. (2017). Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875.
[5] Wu, Z., Pan, S., Long, G., Jiang, J., & Zhang, C. (2019). Graph wavenet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121.
• Fine-tuned (ST-Meta): Compared with “Fine-tuned (Vanilla)” method, we combine the proposed
parameter generation based on meta knowledge to generate non-shared parameters for the model.
• AdaRNN [1]: A state-of-the-art transfer learning framework for non-stationary time series.
• MAML [2]: Model-Agnostic Meta Learning (MAML).
• Apply some advanced spatio-temporal data graph learning algorithms to our ST-GFSL
framework:
o TCN [3]: 1D dilated convolution network-based temporal convolution network.
o STGCN [4]: Spatial temporal graph convolution network, which combines graph convolution with 1D
convolution.
o GWN [5]: A convolution network structure combines graph convolution with dilated casual
convolution and a self-adaptive graph.
13
EXPERIMENT AND RESULT
RESULT – Overall Performance
14
EXPERIMENT AND RESULT
RESULT – Overall Performance
15
CONCLUSION
• Propose a spatio-temporal graph few-shot learning framework called ST-GFSL for
cross-city knowledge transfer.
o Non-shared feature extractor parameters based on node-level meta knowledge.
o improve the effectiveness of spatio-temporal representation on multiple datasets and transfer the
cross-city knowledge via parameter matching from similar spatio-temporal meta knowledge.
• ST-GFSL integrates the graph reconstruction loss to achieve structure-aware learning.
• ST-GFSL not only apply to traffic speed prediction, but also apply to other few-shot
scenarios: taxi demand prediction, indoor environment monitoring indifferent
warehouses.
[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx
[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

More Related Content

Similar to [20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

mathematics-10-01599-v2.pdf
mathematics-10-01599-v2.pdfmathematics-10-01599-v2.pdf
mathematics-10-01599-v2.pdf
TadiyosHailemichael
 
Clustering big spatiotemporal interval data
Clustering big spatiotemporal interval dataClustering big spatiotemporal interval data
Clustering big spatiotemporal interval data
Nexgen Technology
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
tsysglobalsolutions
 
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
thanhdowork
 
[Seminar] hyunwook 0624
[Seminar] hyunwook 0624[Seminar] hyunwook 0624
[Seminar] hyunwook 0624
ivaderivader
 
PointNet
PointNetPointNet
0505.pdf
0505.pdf0505.pdf
A Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphsA Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphs
aftab alam
 
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
thanhdowork
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
Moazzem Hossain
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
LEE HOSEONG
 
Traffic Prediction from Street Network images.pptx
Traffic Prediction from  Street Network images.pptxTraffic Prediction from  Street Network images.pptx
Traffic Prediction from Street Network images.pptx
chirantanGupta1
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
Mokhtar SELLAMI
 
Chapter 3.pptx
Chapter 3.pptxChapter 3.pptx
Chapter 3.pptx
AbanobZakaria1
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
IRJET Journal
 
240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...
240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...
240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...
thanhdowork
 
Applications of machine learning in Wireless sensor networks.
Applications of machine learning in Wireless sensor networks.Applications of machine learning in Wireless sensor networks.
Applications of machine learning in Wireless sensor networks.
Sahana B S
 
Combinatorial optimization and deep reinforcement learning
Combinatorial optimization and deep reinforcement learningCombinatorial optimization and deep reinforcement learning
Combinatorial optimization and deep reinforcement learning
민재 정
 

Similar to [20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx (20)

mathematics-10-01599-v2.pdf
mathematics-10-01599-v2.pdfmathematics-10-01599-v2.pdf
mathematics-10-01599-v2.pdf
 
Clustering big spatiotemporal interval data
Clustering big spatiotemporal interval dataClustering big spatiotemporal interval data
Clustering big spatiotemporal interval data
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
[20240429_LabSeminar_Huy]Spatio-Temporal Graph Neural Point Process for Traff...
 
[Seminar] hyunwook 0624
[Seminar] hyunwook 0624[Seminar] hyunwook 0624
[Seminar] hyunwook 0624
 
PointNet
PointNetPointNet
PointNet
 
0505.pdf
0505.pdf0505.pdf
0505.pdf
 
A Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphsA Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphs
 
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
 
230727_HB_JointJournalClub.pptx
230727_HB_JointJournalClub.pptx230727_HB_JointJournalClub.pptx
230727_HB_JointJournalClub.pptx
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
Traffic Prediction from Street Network images.pptx
Traffic Prediction from  Street Network images.pptxTraffic Prediction from  Street Network images.pptx
Traffic Prediction from Street Network images.pptx
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
 
Chapter 3.pptx
Chapter 3.pptxChapter 3.pptx
Chapter 3.pptx
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
 
240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...
240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...
240401_Thuy_Labseminar[Train Once and Explain Everywhere: Pre-training Interp...
 
Applications of machine learning in Wireless sensor networks.
Applications of machine learning in Wireless sensor networks.Applications of machine learning in Wireless sensor networks.
Applications of machine learning in Wireless sensor networks.
 
Combinatorial optimization and deep reinforcement learning
Combinatorial optimization and deep reinforcement learningCombinatorial optimization and deep reinforcement learning
Combinatorial optimization and deep reinforcement learning
 

More from thanhdowork

A CONVERGENCE ANALYSIS OF GRADIENT_version1
A CONVERGENCE ANALYSIS OF GRADIENT_version1A CONVERGENCE ANALYSIS OF GRADIENT_version1
A CONVERGENCE ANALYSIS OF GRADIENT_version1
thanhdowork
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
LLAMA PRO: Progressive LLaMA with Block Expansion.pptx
LLAMA PRO: Progressive LLaMA with Block Expansion.pptxLLAMA PRO: Progressive LLaMA with Block Expansion.pptx
LLAMA PRO: Progressive LLaMA with Block Expansion.pptx
thanhdowork
 
Prefix-Tuning: Optimizing Continuous Prompts for Generation.pptx
Prefix-Tuning: Optimizing Continuous Prompts for Generation.pptxPrefix-Tuning: Optimizing Continuous Prompts for Generation.pptx
Prefix-Tuning: Optimizing Continuous Prompts for Generation.pptx
thanhdowork
 
240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...
240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...
240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...
thanhdowork
 
240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...
240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...
240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...
thanhdowork
 
[20240527_LabSeminar_Huy]Meta-Graph.pptx
[20240527_LabSeminar_Huy]Meta-Graph.pptx[20240527_LabSeminar_Huy]Meta-Graph.pptx
[20240527_LabSeminar_Huy]Meta-Graph.pptx
thanhdowork
 
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
thanhdowork
 
240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...
240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...
240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...
thanhdowork
 
240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...
240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...
240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...
thanhdowork
 
240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...
240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...
240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...
thanhdowork
 
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
thanhdowork
 
240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx
240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx
240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx
thanhdowork
 
240506_JW_labseminar[Structural Deep Network Embedding].pptx
240506_JW_labseminar[Structural Deep Network Embedding].pptx240506_JW_labseminar[Structural Deep Network Embedding].pptx
240506_JW_labseminar[Structural Deep Network Embedding].pptx
thanhdowork
 
[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...
[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...
[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...
thanhdowork
 
240506_Thanh_LabSeminar[ASG2Caption].pptx
240506_Thanh_LabSeminar[ASG2Caption].pptx240506_Thanh_LabSeminar[ASG2Caption].pptx
240506_Thanh_LabSeminar[ASG2Caption].pptx
thanhdowork
 
240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...
240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...
240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...
thanhdowork
 
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
thanhdowork
 
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
thanhdowork
 

More from thanhdowork (20)

A CONVERGENCE ANALYSIS OF GRADIENT_version1
A CONVERGENCE ANALYSIS OF GRADIENT_version1A CONVERGENCE ANALYSIS OF GRADIENT_version1
A CONVERGENCE ANALYSIS OF GRADIENT_version1
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
LLAMA PRO: Progressive LLaMA with Block Expansion.pptx
LLAMA PRO: Progressive LLaMA with Block Expansion.pptxLLAMA PRO: Progressive LLaMA with Block Expansion.pptx
LLAMA PRO: Progressive LLaMA with Block Expansion.pptx
 
Prefix-Tuning: Optimizing Continuous Prompts for Generation.pptx
Prefix-Tuning: Optimizing Continuous Prompts for Generation.pptxPrefix-Tuning: Optimizing Continuous Prompts for Generation.pptx
Prefix-Tuning: Optimizing Continuous Prompts for Generation.pptx
 
240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...
240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...
240527_Thanh_LabSeminar[Transitivity Recovering Decompositions: Interpretable...
 
240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...
240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...
240527_Thuy_Labseminar[Self-supervised Heterogeneous Graph Pre-training Based...
 
[20240527_LabSeminar_Huy]Meta-Graph.pptx
[20240527_LabSeminar_Huy]Meta-Graph.pptx[20240527_LabSeminar_Huy]Meta-Graph.pptx
[20240527_LabSeminar_Huy]Meta-Graph.pptx
 
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
 
240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...
240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...
240603_Thanh_LabSeminar[Imagine That! Abstract-to-Intricate Text-to-Image Syn...
 
240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...
240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...
240603_Thuy_Labseminar[One For All: Towards Training One Graph Model For All ...
 
240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...
240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...
240520_Thanh_LabSeminar[G-MSM: Unsupervised Multi-Shape Matching with Graph-b...
 
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
 
240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx
240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx
240513_Thuy_Labseminar[Universal Prompt Tuning for Graph Neural Networks].pptx
 
240506_JW_labseminar[Structural Deep Network Embedding].pptx
240506_JW_labseminar[Structural Deep Network Embedding].pptx240506_JW_labseminar[Structural Deep Network Embedding].pptx
240506_JW_labseminar[Structural Deep Network Embedding].pptx
 
[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...
[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...
[20240506_LabSeminar_Huy]Conditional Local Convolution for Spatio-Temporal Me...
 
240506_Thanh_LabSeminar[ASG2Caption].pptx
240506_Thanh_LabSeminar[ASG2Caption].pptx240506_Thanh_LabSeminar[ASG2Caption].pptx
240506_Thanh_LabSeminar[ASG2Caption].pptx
 
240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...
240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...
240506_Thuy_Labseminar[GraphPrompt: Unifying Pre-Training and Downstream Task...
 
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
240429_Thanh_LabSeminar[TranSG: Transformer-Based Skeleton Graph Prototype Co...
 
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
240429_Thuy_Labseminar[Simplifying and Empowering Transformers for Large-Grap...
 

Recently uploaded

Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
PedroFerreira53928
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
AzmatAli747758
 

Recently uploaded (20)

Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 

[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

  • 1. Quang-Huy Tran Network Science Lab Dept. of Artificial Intelligence The Catholic University of Korea E-mail: huytran1126@gmail.com 2024-05-13 Spatio-Temporal Graph Few-Shot Learning with Cross-CityKnowledge Transfer Bin Lu Lin et al. SIGKDD’28: 2022 ACM SIGKDD Conference on Knowledge Discovery and Data Mining
  • 2. 2 OUTLINE • MOTIVATION • INTRODUCTION • METHODOLOGY • EXPERIMENT & RESULT • CONCLUSION
  • 3. 3 MOTIVATION • Spatio-temporal graph learning is a key method for urban computing tasks, such as traffic flow, taxi demand and air quality forecasting. • However, due to the high cost of data collection, developing cities have few available data: o infeasible to train a well-performed model. Overview • Knowledge transfer, such as few-shot learning, made a progress in research. • There are challenges from previous works: o Transfer one single source: risk of negative transfer due to the great difference. o Transfer multiple sources: no consider the varied feature differences across cities and within cities.
  • 4. 4 INTRODUCTION • Goal: transfer the cross-city knowledge in graph-based few-shot learning scenarios. • Research challenges: o How to adapt feature extraction in target city via the knowledge from multiple source cities? o How to alleviate the impacts of varied graph structure on transferring among different cities? • Propose a novel and model-agnostic Spatio-Temporal Graph Few-Shot Learning framework(ST-GFSL). o generate non-shared model parameters based on node-level meta knowledge to enhance specific feature extraction. o reconstruct the graph structure of different cities based on meta-knowledge.
  • 5. 5 METHODOLOGY PROBLEM SETTING • For ST graph forecasting tasks, our goal is to learn a function 𝑓(·) for approximating the true mapping of historical 𝑇 observed signals to the future signals: • Given a ST graph 𝐺𝑆𝑇 = (𝑉, ℰ, 𝐴, 𝑋). • 𝑉: set of 𝑁 nodes, ℰ is set of edges, 𝐴 is binary adjacency matrix, and 𝑋 is node features matrix. • For ST graph few-show learning, suppose we have P source of graph cities 𝐺1:𝑃 𝑠𝑜𝑢𝑟𝑐𝑒 = {𝐺1 𝑠𝑜𝑢𝑟𝑐𝑒 , … , 𝐺𝑃 𝑠𝑜𝑢𝑟𝑐𝑒 } and target 𝐺𝑡𝑎𝑟𝑔𝑒𝑡. • After training on 𝐺1:𝑃 𝑠𝑜𝑢𝑟𝑐𝑒 ,the model can leverage the meta knowledge from multiple source graphs and is tasked to predict on a disjoint target scenario, where only few-shot structured data of 𝐺𝑡𝑎𝑟𝑔𝑒𝑡 is available.
  • 6. 6 METHODOLOGY Overall Architecture • Spatio-Temporal Neural Network (STNN) • Cross-city Knowledge Transfer
  • 7. 7 METHODOLOGY Spatio-Temporal Meta Knowledge Learner (ST-Meta Learner) • Employ Gated Recurrent Unit (GRU). • Utilize spatial-based graph attention network (GAT) to encode the spatial correlations. • Meta knowledge:
  • 8. 8 METHODOLOGY ST-Meta Graph Reconstruction • ST-Meta Graph is reconstructed by meta knowledge for structure aware learning. o We predict the likelihood of an edge existing between nodes • To guide the structure-aware learning of meta knowledge, we introduce graph reconstruction loss: o ST-meta graph A𝑚𝑒𝑡𝑎 can be constructed
  • 9. 9 METHODOLOGY Parameter Generation • Linear layer: 2 linear transformation with reshape in center. • Function 𝐹 that takes node-level meta knowledge as input and outputs the non-shared feature extractor parameters 𝜃𝑆𝑇. • Obtain the non-shared parameters of feature extractors for different scenarios. • Convolutional layer: 2 2D-Convolution with reshape in center.
  • 10. 10 METHODOLOGY ST-GFSL Learning Process o Samples batches of task sets from source datasets. o Each task 𝑇𝑖 ∈ 𝑇𝑆𝑇 belongs to one single city and is divided into support set ST𝑖 , query set 𝑄T𝑖 and ST𝑖 ∩ 𝑄T𝑖 = ∅. o When learning a task, consider a joint loss: • Model-agnostic methods are an approach to understand the predictive response of a black box model, instead of the response from the original dataset. • Two stage: base-model meta training and adaptation. o Meta objective: minimize the sum of task loss on query sets
  • 11. 11 EXPERIMENT AND RESULT EXPERIMENT - BASELINES • Measurement: o Mean Absolute Error (MAE). o Root Mean Square Error (RMSE). • Dataset: METR-LA,PEMS-BAY, Didi-Chengdu, Didi-Shenzhen o Traffic flow dataset. • Baselines: o HA: Historical Average, which formulates the traffic flow as a seasonal process, and uses average of previous seasons as the prediction. o ARIMA: Auto-regressive integrated moving average is a well-known model that can understand and predict future values in a time series. o Target-only: Directly training the model on few-shot data in target domain. o Fine-tuned (Vanilla): We first train the model on source datasets, and then fine-tune the model on few-shot data in target domain.
  • 12. 12 EXPERIMENT AND RESULT EXPERIMENT – BASELINE [1] Du, Y., Wang, J., Feng, W., Pan, S., Qin, T., Xu, R., & Wang, C. (2021, October). Adarnn: Adaptive learning and forecasting of time series. In Proceedings of the 30th ACM international conference on information & knowledge management (pp. 402-411). [2] Finn, C., Abbeel, P., & Levine, S. (2017, July). Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning (pp. 1126-1135). PMLR. [3] Lea, C., Flynn, M. D., Vidal, R., Reiter, A., & Hager, G. D. (2017). Temporal convolutional networks for action segmentation and detection. In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 156-165). [4] Yu, B., Yin, H., & Zhu, Z. (2017). Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875. [5] Wu, Z., Pan, S., Long, G., Jiang, J., & Zhang, C. (2019). Graph wavenet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121. • Fine-tuned (ST-Meta): Compared with “Fine-tuned (Vanilla)” method, we combine the proposed parameter generation based on meta knowledge to generate non-shared parameters for the model. • AdaRNN [1]: A state-of-the-art transfer learning framework for non-stationary time series. • MAML [2]: Model-Agnostic Meta Learning (MAML). • Apply some advanced spatio-temporal data graph learning algorithms to our ST-GFSL framework: o TCN [3]: 1D dilated convolution network-based temporal convolution network. o STGCN [4]: Spatial temporal graph convolution network, which combines graph convolution with 1D convolution. o GWN [5]: A convolution network structure combines graph convolution with dilated casual convolution and a self-adaptive graph.
  • 13. 13 EXPERIMENT AND RESULT RESULT – Overall Performance
  • 14. 14 EXPERIMENT AND RESULT RESULT – Overall Performance
  • 15. 15 CONCLUSION • Propose a spatio-temporal graph few-shot learning framework called ST-GFSL for cross-city knowledge transfer. o Non-shared feature extractor parameters based on node-level meta knowledge. o improve the effectiveness of spatio-temporal representation on multiple datasets and transfer the cross-city knowledge via parameter matching from similar spatio-temporal meta knowledge. • ST-GFSL integrates the graph reconstruction loss to achieve structure-aware learning. • ST-GFSL not only apply to traffic speed prediction, but also apply to other few-shot scenarios: taxi demand prediction, indoor environment monitoring indifferent warehouses.

Editor's Notes

  1. Type equation here.
  2. Type equation here.