Towards Deep Attention in Graph Neural Networks: Problems and Remedies
1. Van Thuy Hoang
Network Science Lab
Dept. of Artificial Intelligence
The Catholic University of Korea
E-mail: hoangvanthuy90@gmail.com
2023-12-18
PMLR 2023
2. 2
Graph Convolutional Networks (GCNs)
Generate node embeddings based on local network neighborhoods.
Nodes have embeddings at each layer, repeatedly combining messages
from their neighbors using neural networks (sketched below).
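A minimal PyTorch sketch of one such layer (illustrative, not the lecture's code); `a_hat` is assumed to be the symmetrically normalized adjacency matrix with self-loops:

```python
import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    """One GCN layer: aggregate neighbor messages, then transform.

    Dense sketch for illustration; a_hat is assumed to be the
    symmetrically normalized adjacency matrix with self-loops.
    """
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.linear = nn.Linear(in_dim, out_dim)

    def forward(self, a_hat, h):
        # Combine each node's neighborhood messages, then apply the
        # shared neural network (linear map + nonlinearity).
        return torch.relu(self.linear(a_hat @ h))
```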
4. 4
Graph Attention Networks
GATs compute node representations by employing self-attention over
the node features.
This choice was well motivated: self-attention had previously been
shown to be sufficient for state-of-the-art results on machine
translation, as demonstrated by the Transformer architecture (a
single-head sketch follows).
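A single-head dense sketch of GAT-style edge attention (illustrative only; `a_src`/`a_dst` follow the usual GAT scoring parameterization):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GATLayer(nn.Module):
    """Single-head GAT layer (dense sketch, not the authors' code)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.W = nn.Linear(in_dim, out_dim, bias=False)
        self.a_src = nn.Parameter(torch.randn(out_dim))
        self.a_dst = nn.Parameter(torch.randn(out_dim))

    def forward(self, adj, h):
        # adj is assumed to include self-loops so every row has an edge.
        z = self.W(h)                                    # (N, d)
        # e_ij = LeakyReLU(a_src . z_i + a_dst . z_j)
        e = F.leaky_relu(z @ self.a_src[:, None]
                         + (z @ self.a_dst[:, None]).T)  # (N, N)
        e = e.masked_fill(adj == 0, float("-inf"))       # real edges only
        alpha = torch.softmax(e, dim=1)                  # edge attention
        return alpha @ z
```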
5. 5
Questions
Can the model remain expressive over deep layers?
How can we design a deep GAT?
6. 6
From Hard to Soft Attentions
Message-Passing GNNs
Edge Attention: edge-attention GNNs (e.g., GAT and its variants)
learn an edge-attention matrix A^(k) at each layer k.
Hop Attention: with hop attention, a different importance γ_i^(k) can be
assigned at each layer k for every node i; stacking these per-node
weights gives the diagonal hop-attention matrix Γ^(k) = diag(γ^(k))
(see the sketch below).
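A short sketch of how hop attention combines per-layer representations; the Γ^(k) = diag(γ^(k)) form is an assumption based on the definition above:

```python
import torch

def apply_hop_attention(layer_reprs, gamma):
    """Combine per-layer representations H^(0..K) with hop attention.

    layer_reprs: list of K+1 tensors, each of shape (N, d)
    gamma:       (N, K+1) per-node, per-layer weights γ_i^(k)
    Returns Σ_k diag(γ^(k)) H^(k).
    """
    out = torch.zeros_like(layer_reprs[0])
    for k, h_k in enumerate(layer_reprs):
        out = out + gamma[:, k:k + 1] * h_k  # diag(γ^(k)) H^(k)
    return out
```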
7. 7
Cumulative Attention
A concept of a cumulative attention matrix, denoted by T^(k),
intuitively represents the attention between all node pairs within k
hops (or equivalently, at layer k), taking both edge and hop attention
into account: the edge-attention matrices are multiplied along the
layers and weighted by the hop attentions (see the sketch below).
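A sketch under one plausible reading of the definition, T^(k) = Σ_{ℓ=0}^{k} diag(γ^(ℓ)) A^(ℓ)···A^(1); this exact formula is an assumption, not taken verbatim from the paper:

```python
import torch

def cumulative_attention(edge_atts, hop_atts):
    """Cumulative attention matrix T^(k) (illustrative sketch).

    edge_atts: list of k (N, N) edge-attention matrices A^(1..k)
    hop_atts:  (N, k+1) hop attentions γ^(0..k)
    """
    n = edge_atts[0].shape[0]
    prod = torch.eye(n)                          # empty product = identity
    t = hop_atts[:, 0:1] * prod                  # layer-0 term
    for ell, a in enumerate(edge_atts, start=1):
        prod = a @ prod                          # A^(ell) ... A^(1)
        t = t + hop_atts[:, ell:ell + 1] * prod  # add hop-weighted term
    return t
```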
8. 8
Proposed Method: AERO-GNN
Attentive dEep pROpagation-GNN (AERO-GNN)
The feature transformation and propagation of AERO-GNN
use layer-aggregated features, which are more stable than
features from any single layer (see the sketch below).
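A heavily simplified propagation skeleton in this spirit; the `edge_attention` and `hop_attention` callables are placeholders for the attention functions on the next slide, not the paper's exact interface:

```python
import torch

def aero_propagate(x, edge_attention, hop_attention, num_layers):
    """Propagation loop using layer-aggregated features (sketch only)."""
    h = x                                        # H^(0): transformed input
    gamma = hop_attention(x, layer=0)            # γ^(0), shape (N, 1)
    h_agg = gamma * x                            # layer-aggregated features
    for k in range(1, num_layers + 1):
        # Attention at layer k is computed from features aggregated over
        # layers 0..k-1, which is more stable than single-layer features.
        alpha = edge_attention(h_agg, layer=k)   # A^(k), shape (N, N)
        h = alpha @ h                            # propagate one hop
        gamma = hop_attention(h_agg, layer=k)    # γ^(k)
        h_agg = h_agg + gamma * h                # update aggregation
    return h_agg
```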
9. 9
Proposed Method: AERO-GNN
Attention Functions
Edge Attention: compute the pre-normalized edge attention at each layer.
Softplus is used to map the edge attention to positive values, with
two primary advantages over the alternative mapping functions, exp
and tanh.
Hop Attention: the hop attention γ^(k) is likewise computed at each
layer (a sketch of the softplus-mapped edge attention follows).
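A sketch of softplus-mapped, pre-normalized edge attention; the scoring function used here (a dot product with a learned vector per endpoint) is an illustrative assumption, and the slide does not name the two advantages, so the comment only gestures at them:

```python
import torch
import torch.nn.functional as F

def prenorm_edge_attention(h, att_vec, adj):
    """Softplus-mapped edge attention, then row normalization (sketch)."""
    s = h @ att_vec                              # (N,) per-node score
    # Softplus maps scores to positive values, which the slide argues
    # is preferable to the exp and tanh mappings.
    e = F.softplus(s[:, None] + s[None, :])      # positive edge scores
    e = e * adj                                  # keep real edges only
    alpha = e / e.sum(1, keepdim=True).clamp_min(1e-12)  # normalize rows
    return alpha
```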
10. 10
Experiments
Datasets:
12 node classification benchmark datasets, among which 6 are
homophilic and 6 are heterophilic
Baseline Methods:
edge-attention GNNs (e.g., GAT and GATv2)
12. 12
Discussion
Bridges the two research directions, addressing two underexplored
questions:
What are the unique challenges in deep graph attention?
How can we design provably more expressive deep graph attention?
In a broader context, these findings extend the prior literature on
the limitations of deep attention in general,
demonstrate that attention-based GNNs face related, yet distinct,
problems, and propose a novel solution.
This study may inspire future research on deep attention and graph
learning in various directions.