Graph-based learning using Graph Neural Networks: This is a beginner-friendly exploration of Graph Neural Networks (GNNs), where we unravel the fundamentals of this powerful technique for analyzing interconnected data structures and pave the way for a deeper understanding and practical applications. This will be a precursor to a subsequent hands-on workshop that will be announced later.
This talk was delivered as part of the neo4j meetup held on 19th August 2023 at Thoughtworks, Bangalore. Meetup link: https://www.meetup.com/graph-database-bangalore/events/294780261
4. Spotlight: A chemist’s tale
What’s this? Is a molecule a graph? If so, what are its nodes, edges and features?
5. Predict if a molecule is a potent drug
1. Train a GNN on a curated dataset where the response is known
2. Once trained, apply the model to any candidate molecule
3. Select the ~top-100 candidates
4. Get chemists to thoroughly investigate them
6. Halicin - a powerful drug discovered with the help of a GNN in 2020
“We wanted to develop a platform that would allow us to harness the power of artificial intelligence to usher in a new age of antibiotic drug discovery. Our approach revealed this amazing molecule which is arguably one of the more powerful antibiotics that has been discovered.” - James Collins, Professor of Medical Engineering and Science, MIT
8. Machine Learning problems using Graph data
Node-level predictions - Does this person smoke? (unlabelled node)
Link-level predictions - Next Prime Video?
Graph-level predictions - Is this molecule a suitable drug?
19. Two-step message passing: a recap
[Diagram: a target node and its neighbours (nodes 1-5); the neighbours' messages are AGGREGATEd into the target node]
1. AGGREGATE - pass information (the “message”) from the target node’s neighbours to the target node
2. UPDATE - update each node’s features based on the “message” to form an embedded representation
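A minimal sketch of one round of this two-step scheme (my own illustration in plain NumPy, not code from the talk), with node 0 as the target node and nodes 1-4 as its neighbours, mirroring the diagram above:

import numpy as np

# Node features: one 3-dimensional vector per node (5 nodes)
h = np.random.rand(5, 3)

# Adjacency list: node 0 is connected to nodes 1-4
neighbours = {0: [1, 2, 3, 4], 1: [0], 2: [0], 3: [0], 4: [0]}

def aggregate(v):
    # 1. AGGREGATE: collect the "message" from v's neighbours (here, their sum)
    return sum(h[u] for u in neighbours[v])

def update(h_v, message, w_self=0.5, w_neigh=0.5):
    # 2. UPDATE: combine the node's own features with the aggregated message
    return w_self * h_v + w_neigh * message

# One round of message passing over every node
h_new = np.stack([update(h[v], aggregate(v)) for v in range(5)])
print(h_new.shape)  # (5, 3): same graph, one updated embedding per node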
20. Generic form of message passing
[Diagram: the target node and its neighbours again, highlighting the AGGREGATE step]
h = node features or embeddings
k = number of hops
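In symbols, the generic form of message passing at hop k (a standard formulation, consistent with the legend above) can be written as:

h_v^{(k)} = \mathrm{UPDATE}\Big( h_v^{(k-1)},\ \mathrm{AGGREGATE}\big(\{\, h_u^{(k-1)} : u \in \mathcal{N}(v) \,\}\big) \Big)

i.e. at hop k, each node v aggregates the hop-(k-1) embeddings of its neighbours N(v) and updates its own embedding with the result.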
21. Using neural networks for aggregate and update
Each node’s updated value becomes a weighting of its previous value plus a weighting of its neighbours’ values
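In symbols (a common instantiation, not necessarily the slide's exact notation), with separate weight matrices for the node itself and its neighbours:

h_v^{(k)} = \sigma\Big( W_{\mathrm{self}}\, h_v^{(k-1)} + W_{\mathrm{neigh}} \sum_{u \in \mathcal{N}(v)} h_u^{(k-1)} + b \Big)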
22. Make message passing more efficient by simplifying, generalizing and sharing parameters
Collapse the two weight matrices into a single W by adding self-loops to the adjacency matrix
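In matrix form (a standard way to write this simplification, with A the adjacency matrix, I the identity and H^{(k)} the matrix of all node embeddings at hop k):

H^{(k)} = \sigma\big( (A + I)\, H^{(k-1)}\, W^{(k)} \big)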
23. Base GNNs on a “convolution” perspective
Normalize by the number of nodes in the neighbourhood
“Original” GNN (2009) vs. GCN (2016)
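Two common ways of writing the normalization (a hedged rendering; the slide's exact formulas may differ): a plain mean over the neighbourhood, versus the symmetric normalization used by the GCN:

\text{Mean over neighbourhood:}\quad h_v^{(k)} = \sigma\Big( W^{(k)} \frac{1}{|\mathcal{N}(v)|} \sum_{u \in \mathcal{N}(v)} h_u^{(k-1)} \Big)

\text{GCN:}\quad h_v^{(k)} = \sigma\Big( W^{(k)} \sum_{u \in \mathcal{N}(v) \cup \{v\}} \frac{h_u^{(k-1)}}{\sqrt{|\mathcal{N}(u)|\,|\mathcal{N}(v)|}} \Big)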
25. A wealth of libraries
You can mix-and-match some of these libraries to train and predict node/edge/graph classification problems.
1. pyTorch and pyG
2. tensorflow
3. GDS from neo4j
Set expectation for the session
You should be able to understand why people use GNNs
You should have a 20,000-foot view of what they actually do - at least develop an intuition of what they do
Appreciate the math (if you have some exposure to Neural networks)
Overall, we will be setting the stage for a subsequent workshop (perhaps at the next neo4j meetup) where we will do some hands-on work.
Must-watch: https://www.youtube.com/watch?v=cWIeTMklzNg
Just like graph databases are found everywhere, predictions based on graphs also have applications everywhere.
From a social graph such as LinkedIn's, you can suggest new connections.
Could be used in e-commerce recommender systems.
Could be used in medicine/pharmacy - say, is this drug potent?
Could be used in social networks - say, providing connection recommendations.
https://www.youtube.com/watch?v=fOctJB4kVlM&list=PLV8yxwGOxvvoNkzPfCx2i8an--Tkt7O8Z&index=1 2:14
Molecule
Nodes - atoms
Edges - bonds
Features of nodes - atom types, number of protons, charge
Features of edges - bond type
Once you have a molecule represented as a graph, you can train a Graph Neural Network to perform, say, a binary classification task: will this molecule inhibit a given bacterium?
First, you will have a curated dataset of several molecules for which the outcome is known - i.e. whether each one inhibits E. coli or not.
You train a Graph Neural Network with this training data.
Once you have trained the GNN, you can feed it a bunch of candidate molecules.
Take the N most promising molecules.
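A minimal sketch of this pipeline (my own illustration, not code from the talk) using pyTorch and pyG. It assumes a hypothetical train_dataset of molecules with a known 0/1 inhibition label and a candidate_dataset of unlabelled molecules, both as lists of torch_geometric.data.Data objects:

import torch
import torch.nn.functional as F
from torch_geometric.loader import DataLoader
from torch_geometric.nn import GCNConv, global_mean_pool

class MoleculeClassifier(torch.nn.Module):
    def __init__(self, num_node_features, hidden=64):
        super().__init__()
        self.conv1 = GCNConv(num_node_features, hidden)
        self.conv2 = GCNConv(hidden, hidden)
        self.readout = torch.nn.Linear(hidden, 1)    # one graph-level score

    def forward(self, x, edge_index, batch):
        x = F.relu(self.conv1(x, edge_index))        # message passing, hop 1
        x = F.relu(self.conv2(x, edge_index))        # message passing, hop 2
        g = global_mean_pool(x, batch)               # aggregate node embeddings per molecule
        return self.readout(g).squeeze(-1)           # one logit per molecule

model = MoleculeClassifier(train_dataset[0].num_node_features)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# 1-2. Train on the curated, labelled dataset
model.train()
for epoch in range(50):
    for batch in DataLoader(train_dataset, batch_size=32, shuffle=True):
        optimizer.zero_grad()
        logits = model(batch.x, batch.edge_index, batch.batch)
        loss = F.binary_cross_entropy_with_logits(logits, batch.y.float())
        loss.backward()
        optimizer.step()

# 3. Score every candidate molecule and keep the most promising ones
model.eval()
with torch.no_grad():
    scores = [model(d.x, d.edge_index, torch.zeros(d.num_nodes, dtype=torch.long)).item()
              for d in candidate_dataset]
top_candidates = sorted(range(len(scores)), key=scores.__getitem__, reverse=True)[:100]
# 4. Hand the top candidates over to chemists for thorough investigation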
Halicin in 2020
GNNs in the 1990s in the chemical industry - a bit of history
So what can you do with graph data?
Node-level predictions - If you have a graph with unlabelled nodes, you may want to predict attributes about them or classify them. GNNs, as you'll see soon, use information from other nodes to infer on these unlabelled nodes.
Edge prediction - Typically used by companies to predict which product a customer will purchase next. Here, it can be used to predict whether a link will form between a person node and an item node.
Graph-level prediction - Whether a molecule is a potent drug or not.
https://www.youtube.com/watch?v=fOctJB4kVlM&list=PLV8yxwGOxvvoNkzPfCx2i8an--Tkt7O8Z&index=2
3:20
This is a commonly encountered concept in NLP and deep learning. How do you represent discrete things in a large set, say the words in a dictionary?
We have a huge sparse vector that is mostly 0s, with a 1 at the index of the word under consideration. The problem is that such vectors don't capture how the words correlate with one another.
Most deep learning models tend to learn a distributed vector representation. In this case, you can see that “banana” and “mango” have some similarity - maybe because they are both fruits and are yellow.
So V could be 10,000, and D maybe 128 or 500. In a way, it compresses as well.
Distributed Vector representations - https://www.youtube.com/watch?v=zCEYiCxrL_0 5:29
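For example (a tiny sketch with made-up sizes, assuming V = 10,000 words and D = 128 dimensions), in PyTorch the dense, learned representation is just an embedding table:

import torch
import torch.nn as nn

V, D = 10_000, 128                  # vocabulary size and embedding dimension (example values)
one_hot = torch.zeros(V)            # sparse representation: all zeros ...
one_hot[42] = 1.0                   # ... except a 1 at the word's index

embedding = nn.Embedding(V, D)      # learned, dense ("distributed") representation
dense = embedding(torch.tensor(42))
print(one_hot.shape, dense.shape)   # torch.Size([10000]) torch.Size([128])
# Similar words (e.g. "banana" and "mango") can end up with nearby dense vectors,
# which a one-hot encoding cannot express.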
You start with a graph - a modelling decision. It should encode the problem.
For each of the nodes, there could be some information - either an embedding or a set of features.
The output of a GNN is the same graph. For each node, there is a vector representation. Each node now has information not simply about itself, but about how it belongs in the context of the graph.
Training and recommendation
Recommendation
The aggregate function should be permutation invariant.
Training
Too much message passing - over-smoothing
In the first step, the nodes only know about themselves.
In the next step, all of them know about their neighbours.
Then their neighbours’ neighbours.
In each step, the receptive field increases.
Finally, every node would have learnt how it “belongs” in the graph.
Once you have these vectors, you can do:
Node classification - classify each node independently by applying a shared layer
Link classification
Graph classification - aggregate all the h vectors
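A small sketch (my own illustration) of how the final node vectors can feed each of these tasks, assuming the GNN has produced a tensor h of shape [num_nodes, dim]:

import torch
import torch.nn as nn

num_nodes, dim, num_classes = 6, 16, 3
h = torch.randn(num_nodes, dim)           # final node embeddings from the GNN

# Node classification: one shared linear layer applied to every node independently
node_head = nn.Linear(dim, num_classes)
node_logits = node_head(h)                # shape [num_nodes, num_classes]

# Link classification: score a pair of nodes, e.g. via a dot product of their embeddings
link_score = (h[0] * h[3]).sum()          # scalar score for a potential edge (0, 3)

# Graph-level prediction: aggregate all the h vectors, then classify the whole graph
graph_head = nn.Linear(dim, 1)
graph_logit = graph_head(h.mean(dim=0))   # e.g. "is this molecule a suitable drug?"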
12:59 https://www.youtube.com/watch?v=8owQBFAHw7E
Recommendation
Show n layers of convolution
The aggregate function needs to be permutation invariant. For example, if there is a new neighbour, the output shape (whether it is a number or a matrix) should not change.
Summation is one choice - you could also use mean, max, min, etc.
Do not treat target and neighbour nodes differently.
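To make the permutation-invariance point concrete, a tiny sketch (my own illustration) with a few neighbour embeddings:

import torch

neigh = torch.randn(3, 4)                 # 3 neighbours, 4-dimensional embeddings

# Permutation-invariant aggregators: reordering the neighbours never changes the result,
# and adding a neighbour never changes the output shape
agg_sum  = neigh.sum(dim=0)               # shape [4]
agg_mean = neigh.mean(dim=0)              # shape [4]
agg_max  = neigh.max(dim=0).values        # shape [4]

shuffled = neigh[torch.randperm(3)]
assert torch.allclose(agg_sum, shuffled.sum(dim=0))   # order of neighbours does not matter

one_more = torch.cat([neigh, torch.randn(1, 4)])      # a new neighbour appears
print(one_more.sum(dim=0).shape)                      # still torch.Size([4])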
The adjacency matrix defines which nodes are neighbours of which.
pyTorch - From Meta, now under the Linux Foundation. Optimized tensor library for deep learning using GPUs and CPUs.
pyG - PyTorch extension for graph learning. Supports GNN architectures like GCN and GAT.
Tensorflow
GDS - Random-walk-with-restarts sampling (server-side and client-side components; uses Apache Arrow for in-memory analytics)