[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

Quang-Huy Tran
Network Science Lab
Dept. of Artificial Intelligence
The Catholic University of Korea
E-mail: huytran1126@gmail.com
2024-05-13
Spatio-Temporal Graph Few-Shot Learning
with Cross-CityKnowledge Transfer
Bin Lu Lin et al.
SIGKDD’28: 2022 ACM SIGKDD Conference on Knowledge Discovery and Data Mining

2
OUTLINE
• MOTIVATION
• INTRODUCTION
• METHODOLOGY
• EXPERIMENT & RESULT
• CONCLUSION

3
MOTIVATION
• Spatio-temporal graph learning is a key method for urban computing tasks, such as
traffic flow, taxi demand and air quality forecasting.
• However, due to the high cost of data collection, developing cities have few available
data:
o infeasible to train a well-performed model.
Overview
• Knowledge transfer, such as few-shot learning, made a progress in research.
• There are challenges from previous works:
o Transfer one single source: risk of negative transfer due to the great difference.
o Transfer multiple sources: no consider the varied feature differences across cities
and within cities.

4
INTRODUCTION
• Goal: transfer the cross-city knowledge in graph-based few-shot learning scenarios.
• Research challenges:
o How to adapt feature extraction in target city via the knowledge from multiple source cities?
o How to alleviate the impacts of varied graph structure on transferring among different cities?
• Propose a novel and model-agnostic Spatio-Temporal Graph Few-Shot Learning
framework(ST-GFSL).
o generate non-shared model parameters based on node-level meta knowledge to enhance specific
feature extraction.
o reconstruct the graph structure of different cities based on meta-knowledge.

5
METHODOLOGY
PROBLEM SETTING
• For ST graph forecasting tasks, our goal is to learn a function 𝑓(·) for approximating
the true mapping of historical 𝑇 observed signals to the future signals:
• Given a ST graph 𝐺𝑆𝑇 = (𝑉, ℰ, 𝐴, 𝑋).
• 𝑉: set of 𝑁 nodes, ℰ is set of edges, 𝐴 is binary adjacency matrix, and 𝑋 is node
features matrix.
• For ST graph few-show learning, suppose we have P source of graph cities 𝐺1:𝑃
𝑠𝑜𝑢𝑟𝑐𝑒
=
{𝐺1
, … , 𝐺𝑃
} and target 𝐺𝑡𝑎𝑟𝑔𝑒𝑡.
• After training on 𝐺1:𝑃
,the model can leverage the meta knowledge from multiple source graphs
and is tasked to predict on a disjoint target scenario, where only few-shot structured data of
𝐺𝑡𝑎𝑟𝑔𝑒𝑡 is available.

6
METHODOLOGY
Overall Architecture
• Spatio-Temporal Neural Network (STNN)
• Cross-city Knowledge Transfer

7
METHODOLOGY
Spatio-Temporal Meta Knowledge Learner (ST-Meta Learner)
• Employ Gated Recurrent Unit (GRU).
• Utilize spatial-based graph attention network (GAT) to encode the spatial correlations.
• Meta knowledge:

8
METHODOLOGY
ST-Meta Graph Reconstruction
• ST-Meta Graph is reconstructed by meta knowledge for structure aware learning.
o We predict the likelihood of an edge existing between nodes
• To guide the structure-aware learning of meta knowledge, we introduce graph
reconstruction loss:
o ST-meta graph A𝑚𝑒𝑡𝑎 can be constructed

9
METHODOLOGY
Parameter Generation
• Linear layer: 2 linear transformation with reshape in center.
• Function 𝐹 that takes node-level meta knowledge as input and outputs the non-shared
feature extractor parameters 𝜃𝑆𝑇.
• Obtain the non-shared parameters of feature extractors for different scenarios.
• Convolutional layer: 2 2D-Convolution with reshape in center.

10
METHODOLOGY
ST-GFSL Learning Process
o Samples batches of task sets from source datasets.
o Each task 𝑇𝑖 ∈ 𝑇𝑆𝑇 belongs to one single city and is divided into support set ST𝑖
, query set 𝑄T𝑖
and
ST𝑖
∩ 𝑄T𝑖
= ∅.
o When learning a task, consider a joint loss:
• Model-agnostic methods are an approach to understand the predictive
response of a black box model, instead of the response from the original
dataset.
• Two stage: base-model meta training and adaptation.
o Meta objective: minimize the sum of task loss on query sets

11
EXPERIMENT AND RESULT
EXPERIMENT - BASELINES
• Measurement:
o Mean Absolute Error (MAE).
o Root Mean Square Error (RMSE).
• Dataset: METR-LA,PEMS-BAY, Didi-Chengdu, Didi-Shenzhen
o Traffic flow dataset.
• Baselines:
o HA: Historical Average, which formulates the traffic flow as a seasonal process, and uses average of
previous seasons as the prediction.
o ARIMA: Auto-regressive integrated moving average is a well-known model that can understand and
predict future values in a time series.
o Target-only: Directly training the model on few-shot data in target domain.
o Fine-tuned (Vanilla): We first train the model on source datasets, and then fine-tune the model on
few-shot data in target domain.

12
EXPERIMENT – BASELINE
[1] Du, Y., Wang, J., Feng, W., Pan, S., Qin, T., Xu, R., & Wang, C. (2021, October). Adarnn: Adaptive learning and forecasting of time series. In Proceedings of the 30th ACM international conference on information & knowledge management (pp. 402-411).
[2] Finn, C., Abbeel, P., & Levine, S. (2017, July). Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning (pp. 1126-1135). PMLR.
[3] Lea, C., Flynn, M. D., Vidal, R., Reiter, A., & Hager, G. D. (2017). Temporal convolutional networks for action segmentation and detection. In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 156-165).
[4] Yu, B., Yin, H., & Zhu, Z. (2017). Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875.
[5] Wu, Z., Pan, S., Long, G., Jiang, J., & Zhang, C. (2019). Graph wavenet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121.
• Fine-tuned (ST-Meta): Compared with “Fine-tuned (Vanilla)” method, we combine the proposed
parameter generation based on meta knowledge to generate non-shared parameters for the model.
• AdaRNN [1]: A state-of-the-art transfer learning framework for non-stationary time series.
• MAML [2]: Model-Agnostic Meta Learning (MAML).
• Apply some advanced spatio-temporal data graph learning algorithms to our ST-GFSL
framework:
o TCN [3]: 1D dilated convolution network-based temporal convolution network.
o STGCN [4]: Spatial temporal graph convolution network, which combines graph convolution with 1D
convolution.
o GWN [5]: A convolution network structure combines graph convolution with dilated casual
convolution and a self-adaptive graph.

13
RESULT – Overall Performance

14
RESULT – Overall Performance

15
CONCLUSION
• Propose a spatio-temporal graph few-shot learning framework called ST-GFSL for
cross-city knowledge transfer.
o Non-shared feature extractor parameters based on node-level meta knowledge.
o improve the effectiveness of spatio-temporal representation on multiple datasets and transfer the
cross-city knowledge via parameter matching from similar spatio-temporal meta knowledge.
• ST-GFSL integrates the graph reconstruction loss to achieve structure-aware learning.
• ST-GFSL not only apply to traffic speed prediction, but also apply to other few-shot
scenarios: taxi demand prediction, indoor environment monitoring indifferent
warehouses.

[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

Recommended

Recommended

More Related Content

Similar to [20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

Similar to [20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx (20)

More from thanhdowork

More from thanhdowork (20)

Recently uploaded

Recently uploaded (20)

[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx

Editor's Notes