
PFN Summer Internship 2019 / Kenshin Abe: Extension of Chainer-Chemistry for Large and Sparse Graph

Graph Neural Network implementation with the sparse data structure.
PFN summer internship 2019 by Kenshin Abe.



  1. Extension of Chainer-Chemistry for Large and Sparse Graph. Preferred Networks Summer Internship 2019, Kenshin Abe
  2. Introduction to Graph Neural Networks (GNN): 2D convolution vs. graph convolution. [2019 Zonghan+] https://arxiv.org/pdf/1901.00596.pdf
  3. Example of GNN. https://www.schrodinger.com/science-articles/autoqsardeepchem
  4. Typical end-to-end GNN framework: a stack of graph-convolution layers turns the input graph into node embeddings. For node classification / regression, a linear layer is applied to each node embedding; for graph classification / regression, a graph readout first aggregates the node embeddings into a graph embedding (graph representation), which then goes through the linear layer.
  5. For large and sparse graphs. Padding pattern (adjacency matrix): matrix multiplication, zero padding. Sparse pattern: scatter operation, graph concatenation, etc. (figure: a network graph)
  6. Graph data pattern. Padding pattern: adjacency matrix + node features. Sparse pattern: edge list (src, dst) + node features.
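The two data patterns above can be sketched in NumPy. This is a minimal illustration on a made-up 3-node chain graph, not code from the slides; the key point is that the edge list carries exactly the non-zeros of the adjacency matrix.

```python
import numpy as np

# Toy 3-node chain graph 0-1-2 (undirected, stored as directed edges both ways).

# Padding pattern: dense N x N adjacency matrix + node feature matrix.
adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]], dtype=np.float32)
x = np.eye(3, dtype=np.float32)  # one-hot node features

# Sparse pattern: edge list (src, dst) + the same node features.
src = np.array([0, 1, 1, 2])
dst = np.array([1, 0, 2, 1])

# The edge list is exactly the set of non-zero entries of adj.
rebuilt = np.zeros((3, 3), dtype=np.float32)
rebuilt[src, dst] = 1.0
assert np.array_equal(rebuilt, adj)
```

For a sparse graph the edge list stores O(E) entries instead of the O(N^2) dense matrix, which is what makes the sparse pattern scale.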
  7. Batching. Padding pattern: pad the adjacency matrices and node features to a common size. Sparse pattern: concatenate the edge lists (src, dst) and node features, and handle the whole batch as one big graph. https://github.com/tkipf/gcn/issues/4
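The "one big graph" batching can be sketched as follows. This is an illustrative snippet (the toy graphs and the `batch` index vector are my own naming, not from the slides): node ids of each graph are shifted by a running offset before concatenation, so no padding is needed.

```python
import numpy as np

# Two toy graphs given as (num_nodes, edge_list).
graphs = [
    (3, [(0, 1), (1, 2)]),  # graph 0: a 3-node chain
    (2, [(0, 1)]),          # graph 1: a single edge
]

# Sparse-pattern batching: shift node ids by a running offset and
# concatenate; a `batch` vector remembers which graph each node
# belongs to (used later by the readout).
src, dst, batch = [], [], []
offset = 0
for gid, (n, edges) in enumerate(graphs):
    for s, d in edges:
        src.append(s + offset)
        dst.append(d + offset)
    batch.extend([gid] * n)
    offset += n

src = np.array(src)      # [0, 1, 3]
dst = np.array(dst)      # [1, 2, 4]
batch = np.array(batch)  # [0, 0, 0, 1, 1]
```

Because the shifted node ids never collide, a single scatter-based convolution over the combined edge list processes all graphs in the batch at once.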
  8. Scatter operation: add each value of the input to the element of the output specified by index. [PyTorch Scatter] https://pytorch-scatter.readthedocs.io/en/latest/functions/add.html
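In NumPy the same scatter-add can be expressed with `np.add.at` (the unbuffered form of `out[index] += inp`, so repeated indices accumulate correctly). The values below are illustrative:

```python
import numpy as np

# scatter_add: out[index[i]] += inp[i], mirroring pytorch_scatter's
# scatter_add. Plain fancy-index assignment would drop the repeated
# index 0; np.add.at applies every update.
inp = np.array([1.0, 2.0, 3.0, 4.0])
index = np.array([0, 1, 0, 2])
out = np.zeros(3)
np.add.at(out, index, inp)
# out == [1+3, 2, 4] == [4., 2., 4.]
```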
  9. Graph convolution. Padding pattern: matrix multiplication with the adjacency matrix, inefficient for large sparse graphs. Sparse pattern: scatter operation over the edge list (src → scatter_add(dst)), efficient, but with broad memory consumption.
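The equivalence of the two convolution patterns can be checked on a toy symmetric graph. This is a minimal sketch of one aggregation step A·X·W (weights and features are random placeholders); the scatter version touches only the E existing edges instead of all N×N adjacency entries:

```python
import numpy as np

# Toy symmetric adjacency (chain 0-1-2) in both representations.
adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]], dtype=np.float32)
src = np.array([0, 1, 1, 2])
dst = np.array([1, 0, 2, 1])

rng = np.random.default_rng(0)
x = rng.random((3, 4)).astype(np.float32)  # node features
w = rng.random((4, 2)).astype(np.float32)  # layer weights

# Padding pattern: dense O(N^2) matrix multiplication.
dense_out = adj @ x @ w

# Sparse pattern: transform features, then scatter messages along
# edges, out[dst] += h[src] -- an O(E) operation.
h = x @ w
sparse_out = np.zeros_like(h)
np.add.at(sparse_out, dst, h[src])

assert np.allclose(dense_out, sparse_out, atol=1e-5)
```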
  10. Readout. Padding pattern: mask the node embeddings of each graph and aggregate them into graph embeddings (mask and aggregate). Sparse pattern: scatter-aggregate the node embeddings into graph embeddings using the per-node graph index (0, 1, 2, ...).
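A sum readout in the sparse pattern is a single scatter over the per-node graph index. The node embeddings and the `batch` vector below are illustrative (a batch of a 3-node and a 2-node graph):

```python
import numpy as np

# Node embeddings of a batch of two graphs (3 + 2 nodes, 2 features).
h = np.arange(10, dtype=np.float32).reshape(5, 2)
batch = np.array([0, 0, 0, 1, 1])  # node -> graph id

# Sparse-pattern readout: scatter-sum node embeddings into one
# slot per graph.
readout = np.zeros((2, 2), dtype=np.float32)
np.add.at(readout, batch, h)
# readout[0] = h[0] + h[1] + h[2]; readout[1] = h[3] + h[4]
```

Mean or max readouts follow the same scheme, dividing by per-graph node counts or using a segment-wise maximum instead of the sum.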
  11. Experiment - Chemical Dataset - Training of RelGCN [2019 Schlichtkrull+], layer_num=2, feature_num=16, batchsize=256, on an Intel(R) Xeon(R) Gold 6254 CPU @ 3.10GHz.
      Dataset                           Padding (s/epoch)   Sparse (s/epoch)
      QM9  (V=1-9,  133,885 graphs)     6.92                5.56   (1.24 times faster!)
      ZINC (V<=38,  249,455 graphs)     16.67               11.47  (1.45 times faster!!)
  12. Memory problem. The scatter operation (src → scatter_add(dst)) is efficient, but has broad memory consumption: gathering the src features materializes one message per edge.
  13. Chainer sparse matmul. https://docs.chainer.org/en/stable/reference/generated/chainer.functions.sparse_matmul.html
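The Coo Matrix pattern stores the adjacency as (data, row, col) triples; `chainer.functions.sparse_matmul` multiplies such a `chainer.utils.CooMatrix` against dense features. Below is a minimal NumPy sketch of what that multiply does (the arrays are illustrative, not Chainer code); note the multiplication by the all-ones `data`, which is the "unnecessary multiplication by 1" mentioned on the next slide:

```python
import numpy as np

# COO layout of the chain-graph adjacency: (data, row, col) triples.
row = np.array([0, 1, 1, 2])
col = np.array([1, 0, 2, 1])
data = np.ones(4, dtype=np.float32)  # every edge weight is 1

rng = np.random.default_rng(0)
x = rng.random((3, 4)).astype(np.float32)

# y = A @ x without materializing the dense N x N matrix:
# for each stored entry (r, c, v): y[r] += v * x[c].
y = np.zeros_like(x)
np.add.at(y, row, data[:, None] * x[col])

# Check against the dense product.
dense = np.zeros((3, 3), dtype=np.float32)
dense[row, col] = data
assert np.allclose(y, dense @ x, atol=1e-5)
```

Memory is O(E) for the triples plus the output, which is why this pattern fits graphs like Reddit that run out of memory in the other two patterns.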
  14. Memory problem. Coo Matrix pattern: sparse_matmul(coo_adj) reduces the memory consumption, at the cost of unnecessary multiplications by 1 (the stored adjacency entries).
  15. Experiment - Network Dataset on GPU - Training of GIN [2019 Keyulu+], layer_num=2, feature_num=64, on a single Tesla V100-SXM2.
      Dataset                               Padding (s/100 epochs)   Sparse (s/100 epochs)   Coo Matrix (s/100 epochs)
      Cora     (V=2,708,   E=5,278)         3.3760                   3.0190                  3.5500
      Citeseer (V=3,312,   E=4,660)         6.8128                   3.3024                  6.2707
      Reddit   (V=232,965, E=11,606,919)    Out of Memory            Out of Memory           318.76 (5.452 GB)
      (Reddit: 230K vertices, 11M edges!!)
  16. Experiment - Network Dataset on CPU - Training of GIN [2019 Keyulu+], layer_num=2, feature_num=64, on an Intel(R) Xeon(R) Gold 6254 CPU @ 3.10GHz.
      Dataset                               Padding (s/100 epochs)   Sparse (s/100 epochs)   Coo Matrix (s/100 epochs)
      Cora     (V=2,708,   E=5,278)         224.439                  22.8092                 12.1168
      Citeseer (V=3,312,   E=4,660)         1346.11                  23.3707                 39.8982
      Reddit   (V=232,965, E=11,606,919)    Out of Memory            Out of Memory           28097.187
      (Reddit: 230K vertices, 11M edges!!)
  17. Conclusion. The sparse pattern is good in most cases, since it avoids dense matrix multiplication. For very large graphs, the CooMatrix pattern saves memory, though it is not as fast as the sparse pattern.
  18. Summary
                            Chainer Chemistry                      Added in this internship
      Goal                  GNN for chemical data                  General framework of GNN
      Graph data pattern    Padding pattern                        + Sparse pattern
      Dataset               Chemical (small): qm9, tox21, etc.     + Network (large): citation networks, Reddit
      Task                  Graph regression / classification      + Node regression / classification
      Additional                                                   + Sparse matmul
