SlideShare a Scribd company logo
1 of 35
Download to read offline
Jure Leskovec
Stanford
Graph Learning
Workshop
Graphs
Graphs represent objects and their
relationships as nodes and edges
Jure Leskovec (@jure), Stanford University 2
Graphs capture relations, which
leads to better models.
Graphs in Many Domains
Often most valuable data are graphs:
Jure Leskovec (@jure), Stanford University 3
Finance & Insurance
Transportation
Healthcare IT industry Commerce & Retail
Manufacturing
Today We are Showing
§ Research advancements in Graph ML
§ PyG ecosystem and partnerships
§ Industrial applications
§ How these technologies propel
advancements of AI and applications of
AI to next wave for science and industry
§ We are going to announce new tools
and partnerships that power these
applications
Jure Leskovec (@jure), Stanford University 4
Thank you!
Thank you to all the researchers,
developers and partners who join us to
make this event amazing.
Welcome to Stanford
Graph Learning Workshop!
Jure Leskovec (@jure), Stanford University 5
INPUT GRAPH
TARGET NODE B
D
E
F
C
A
B
C
D
A
A
A
C
F
B
E
A
Deep Learning for Graphs
6
Jure Leskovec (@jure), Stanford University
Each node defines a computation graph
§ Each edge is a message function MSG()
§ ⨁ is a message aggregation function
Scarselli et al. 2005. The Graph Neural Network Model. IEEE Transactions on Neural Networks.
M
SG()
⨁
MSG()
⨁
Graph Neural Networks
Graph Neural Networks (GNNs) can learn
patterns and relations on a giant scale
and train predictive models:
§ 1) GNNs learn to
combine features from
neighboring nodes
§ 2) GNNs learn the graph
patterns and relations
Jure Leskovec, Stanford University 7
Key Benefits of GNNs
§ GNNs adapt to local shape of data
§ Other methods assume fixed input
(matrix, sequence) while GNNs capture
local patterns around each node
§ GNNs glue together other Neural
Network architectures (CNNs,
Transformers) and integrate
multimodal data
Jure Leskovec (@jure), Stanford University 8
Key Benefits of GNNs
GNNs are extremely general: They subsume
CNNs and Transformers as special cases:
§ Example: CNN layer with 3x3 filter
Jure Leskovec (@jure), Stanford University 9
Convolutional neural networks (on grids)
(Animation by
Vincent Dumoulin)
Single CNN layer with 3x3 filter:
Image Graph
GNN formulation: h!
(#$%)
= 𝜎(𝐖𝒍 ∑(∈*(!)
+!
(#)
*(!)
+ B#h!
(#)
), ∀𝑙 ∈ {0, … , 𝐿 − 1}
CNN formulation: h!
(#$%)
= 𝜎(∑(∈*(!) 𝐖𝒍
𝒖
h(
(#)
+ B#h!
(#)
), ∀𝑙 ∈ {0, … , 𝐿 − 1}
Many Applications of GNNs
Jure Leskovec (@jure), Stanford University 10
Fraud detection
Drug discovery Recommender
systems
Customer 360 journey
analysis
Supply chain
optimization
Fake news detection
The hottest subfield in ML
Jure Leskovec (@jure), Stanford University 11
Learning on Graphs
Conference
9th – 12th December 2022
https://logconference.org
Dynamic Financial Networks
Jure Leskovec (@jure), Stanford University 12
Financial Networks: Describe financial entities and
their connections
Tasks: Fraud detection, Anti-money laundering,
Anomaly detection
ROLAND: Graph Learning Framework for Dynamic Graphs. Jiaxuan You, Tianyu Du, Jure Leskovec. KDD 2022.
Forecasting Transactions
Jure Leskovec (@jure), Stanford University 13
Central Bank of a European country forecasted
financial transactions:
§ Central Bank was able to process dynamic
graphs with tens of millions of transactions:
§ 43-73% improvement in accuracy
§ Robust performance with changes in transaction
pattern
ROLAND: Tool for Dynamic Graphs
We developed ROLAND:
§ Easy creation of dynamic GNN from any
static GNN
§ Scalable and adaptive training
Jure Leskovec (@jure), Stanford University 14
Key idea: Recurrently update node
embeddings at each layer, by injecting a
new module to a static GNN:
Input:
§ Previous embeddings from the same layer
§ Current embeddings from the previous layer
Output: Updated embeddings
https://github.com/snap-stanford/roland
ROLAND: Implementation
ROLAND is built with PyG GraphGym to
efficiently explore the GNN design space
Jure Leskovec (@jure), Stanford University 15
Design Space for Graph Neural Networks. Jiaxuan You, Rex Ying, Jure Leskovec. NeurIPS 2020
PyG: GNN Library
PyG provides the state-of-the-art in
Graph Representation Learning
Jure Leskovec (@jure), Stanford University 16
PyG 2.1: What’s New
§ pyg-lib: A low-level GNN engine to
further accelerate PyG, joint effort
across different partnerships
§ Improved GNN design via principled
aggregations
§ New model milestones: Scalable Link
prediction, 1000+ layer deep GNNs,
GNNs for heterophily graphs, ...
Jure Leskovec (@jure), Stanford University 17
PyG: GNN Library
PyG is the most used Graph
Representation Learning framework
Jure Leskovec (@jure), Stanford University 18
§ 80+ GNN architectures
§ 200+ benchmark datasets
§ Extendable via a message
passing interface
§ Dedicated sparsity-aware
CUDA kernels
§ Scalable and easy to use
PyG Powers Products
§ anomaly and outlier detection
knowledge graph for drug
discovery
recommender system for audio
books and podcasts
fraud detection
fluid simulations
Jure Leskovec (@jure), Stanford University 19
Graph ML Case Studies
Graph ML Tutorials: https://medium.com/stanford-cs224w
§ Graph Machine Learning tutorials created
by Stanford students of CS224W.
Jure Leskovec (@jure), Stanford University 20
Stanford Graph ML Videos
Stanford CS224W: Machine Learning
with Graphs on YouTube
https://youtube.com/playlist?list=PLoROMvodv4rPLKxIpqhjhPgdQy7imNkDn
§ Over 700,000 views
Jure Leskovec (@jure), Stanford University 21
PyG: Huge Community
PYG.ORG:
§ ~1600 research papers written using PyG
§ ~100K monthly downloads
§ ~300 external contributors/developers
§ ~3k members on Slack (Join us: https://data.pyg.org/slack.html)
Jure Leskovec (@jure), Stanford University 22
Scaling-up Graph Learning
§ In real-world applications graph sizes
easily reach billions of nodes and
edges
§ Now with PyG we can learn deep
learning models at a giant scale and
train a predictive model for a graph
Jure Leskovec (@jure), Stanford University 23
New Abstractions
§ Graphs contain both
the wireframe and rich
features on its nodes.
§ Scaling up requires
distributing this
information out-of-core.
§ New PyG graph store
and PyG feature store
abstractions enable
modular scalability.
Jure Leskovec (@jure), Stanford University 24
Partnership with NVIDIA
NVIDIA joins as an official partner to
accelerate GPU needs:
§ GPU-accelerated neighbor sampling via
cugraph integration
§ GPU-accelerated heterogeneous GNN
execution via typed matrix multiply
Jure Leskovec (@jure), Stanford University 25
&
Partnership with Intel
Intel joins as an official partner to
accelerate CPU needs:
§ Accelerated inference of GNNs
on the CPU
§ Accelerated neighbor sampling
techniques (up to 20x speed-ups)
Jure Leskovec (@jure), Stanford University 26
&
OGB-LSC-2
Jure Leskovec (@jure), Stanford University 27
In 2021 we run OGB-LSC with over
500+ participating teams
§ Huge leap forward on
model performance
We are announcing OGB-LSC-2:
Winners
Best baseline
Knowledge Graphs
Knowledge Graph stores facts as triples ℎ, 𝑟, 𝑡
§ (‘Mona Lisa’, ‘created_by’, ‘Leonardo da Vinci’)
§ (‘Fulvestrant’, ‘causes’, ‘Migrane’)
§ (‘Fulvestrant’, ‘treats’, ‘Breast Neoplasm’)
§ Examples:
28
Jure Leskovec (@jure), Stanford University
Predictive Queries over KGs
29
Biomedical knowledge graph
Jure Leskovec (@jure), Stanford University
[Embedding Logical Queries on Knowledge Graphs. Hamilton, et al., NeurIPS 2018]
[Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings. Ren, et al., ICLR 2020]
KG Reasoning with SMORE
SMORE: Scalable framework for multi-
hop Knowledge Graph reasoning
Scales to full Freebase KG:
86M nodes, 338M edges
Jure Leskovec (@jure), Stanford University 30
https://github.com/google-research/smore
Graphs & Language Models
Knowledge graphs help train
more effective large language
models.
Jure Leskovec (@jure), Stanford University 31
https://arxiv.org/abs/2203.15827
Conclusion
§ Today we announce new advances
in research, new tools and platforms
and exciting partnerships and
updates to the PyG ecosystem
§ We have an exciting program for you
with talks from brightest minds in
science and industry
Jure Leskovec (@jure), Stanford University 32
Workshop
Program 09:30 - 10:00 Matthias Fey, PyG – What’s New in PyG
10:00 - 10:20 Ivaylo Bahtchevanov, PyG – Building PyG Open Source Community
10:20 - 10:40 Manan Shah & Dong Wang, Kumo.ai – Scaling-up PyG
11:00 - 11:20 Rishi Puri, Nvidia – Accelerating PyG with Nvidia GPUs
11:20 - 11:40 Ke Ding, Intel – Accelerating PyG with Intel CPUs
11:40 - 12:00 Andreas Damianou, Spotify – Podcast Recommendations Spotify
12:00 - 12:20 Hema Raghavan, Kumo.ai – Query the Future using PyG
12:20 - 12:30 Joseph Huang, Stanford – Stanford CS LINXS Summer Program
13:30 - 13:50 Marinka Zitnik, Harvard – Graph AI to Enable Precision Medicine
13:50 - 14:10 Bryan Perozzi, Google – Challenges and Solutions for GNNs at Google
14:10 - 14:30 Srijan Kumar, GaTech –GNNs for Web Safety and Integrity
14:30 - 14:50 Luna Dong, Meta – Graph Mining for Intelligent Assistants
14:50 - 15:10 Michi Yasunaga, Stanford – Graph Learning in NLP Applications
15:30 - 15:50 Weihua Hu, Stanford – Learning Backward Compatible Embeddings
15:50 - 16:10 Hongyu Ren, Stanford – Multi-hop Reasoning in Knowledge Graphs
16:10 - 17:00 Panel – Challenges and Opportunities for Graph Learning
•Naren Chittar, JPMorgan Chase (moderator)
•Evan Feinberg, Genesis Therapeutics
•Yunyao Li, Apple
•Neil Shah, Snap
•Karthik Subbian, Amazon
Logistics
§ Q&A: Subscribe to PyG Slack:
https://data.pyg.org/slack.html and
join #workshop-2022
§ Live stream:
https://youtu.be/GYW286H3SKw
Jure Leskovec (@jure), Stanford University 34
Thank you!!
Thank you for attending!
Thank you Stanford Data Science
Initiative for organizing:
Joseph Huang
Malwana Adalat
Rok Sosic
Ivaylo Buhtchevanov (PyG)
Jure Leskovec (@jure), Stanford University 35

More Related Content

Similar to stanford_graph-learning_workshop.pdf

[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)
[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)
[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)台灣資料科學年會
 
OpenACC Monthly Highlights: March 2021
OpenACC Monthly Highlights: March 2021OpenACC Monthly Highlights: March 2021
OpenACC Monthly Highlights: March 2021OpenACC
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsLarry Smarr
 
Global Network Advancement Group Next Generation Network-Integrated Sys...
      Global Network Advancement GroupNext Generation Network-Integrated Sys...      Global Network Advancement GroupNext Generation Network-Integrated Sys...
Global Network Advancement Group Next Generation Network-Integrated Sys...Larry Smarr
 
ASGARD Splunk Conf 2016
ASGARD Splunk Conf 2016ASGARD Splunk Conf 2016
ASGARD Splunk Conf 2016Keith Kraus
 
Pioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCPioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCinside-BigData.com
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability InstituteNeil Chue Hong
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_Titash Mandal
 
Samsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonSamsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonInsuk (Chris) Cho
 
OpenACC Monthly Highlights - March 2018
OpenACC Monthly Highlights - March 2018OpenACC Monthly Highlights - March 2018
OpenACC Monthly Highlights - March 2018NVIDIA
 
OpenACC Monthly Highlights: February 2022
OpenACC Monthly Highlights: February 2022OpenACC Monthly Highlights: February 2022
OpenACC Monthly Highlights: February 2022OpenACC
 
SCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual worldSCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual worldBritish Science Association
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Larry Smarr
 
OpenACC Monthly Highlights: January 2024
OpenACC Monthly Highlights: January 2024OpenACC Monthly Highlights: January 2024
OpenACC Monthly Highlights: January 2024OpenACC
 
building intelligent systems with large scale deep learning
building intelligent systems with large scale deep learningbuilding intelligent systems with large scale deep learning
building intelligent systems with large scale deep learningmustafa sarac
 
Big Data and Predictive Analysis
Big Data and Predictive AnalysisBig Data and Predictive Analysis
Big Data and Predictive AnalysisJongwook Woo
 
Using OptIPuter Innovations to Enable LambdaGrid Applications
Using OptIPuter Innovations to Enable LambdaGrid ApplicationsUsing OptIPuter Innovations to Enable LambdaGrid Applications
Using OptIPuter Innovations to Enable LambdaGrid ApplicationsLarry Smarr
 

Similar to stanford_graph-learning_workshop.pdf (20)

[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)
[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)
[台灣人工智慧學校] 主題演講 - 張智威總經理 (President of HTC DeepQ)
 
OpenACC Monthly Highlights: March 2021
OpenACC Monthly Highlights: March 2021OpenACC Monthly Highlights: March 2021
OpenACC Monthly Highlights: March 2021
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated Systems
 
Global Network Advancement Group Next Generation Network-Integrated Sys...
      Global Network Advancement GroupNext Generation Network-Integrated Sys...      Global Network Advancement GroupNext Generation Network-Integrated Sys...
Global Network Advancement Group Next Generation Network-Integrated Sys...
 
ASGARD Splunk Conf 2016
ASGARD Splunk Conf 2016ASGARD Splunk Conf 2016
ASGARD Splunk Conf 2016
 
Pioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCPioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSC
 
CV _Manoj
CV _ManojCV _Manoj
CV _Manoj
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability Institute
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_
 
Samsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonSamsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of Python
 
OpenACC Monthly Highlights - March 2018
OpenACC Monthly Highlights - March 2018OpenACC Monthly Highlights - March 2018
OpenACC Monthly Highlights - March 2018
 
OpenACC Monthly Highlights: February 2022
OpenACC Monthly Highlights: February 2022OpenACC Monthly Highlights: February 2022
OpenACC Monthly Highlights: February 2022
 
SCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual worldSCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual world
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
Session 33 - Production Grids
Session 33 - Production GridsSession 33 - Production Grids
Session 33 - Production Grids
 
OpenACC Monthly Highlights: January 2024
OpenACC Monthly Highlights: January 2024OpenACC Monthly Highlights: January 2024
OpenACC Monthly Highlights: January 2024
 
building intelligent systems with large scale deep learning
building intelligent systems with large scale deep learningbuilding intelligent systems with large scale deep learning
building intelligent systems with large scale deep learning
 
Portfolio
PortfolioPortfolio
Portfolio
 
Big Data and Predictive Analysis
Big Data and Predictive AnalysisBig Data and Predictive Analysis
Big Data and Predictive Analysis
 
Using OptIPuter Innovations to Enable LambdaGrid Applications
Using OptIPuter Innovations to Enable LambdaGrid ApplicationsUsing OptIPuter Innovations to Enable LambdaGrid Applications
Using OptIPuter Innovations to Enable LambdaGrid Applications
 

Recently uploaded

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 

Recently uploaded (20)

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 

stanford_graph-learning_workshop.pdf

  • 2. Graphs Graphs represent objects and their relationships as nodes and edges Jure Leskovec (@jure), Stanford University 2 Graphs capture relations, which leads to better models.
  • 3. Graphs in Many Domains Often most valuable data are graphs: Jure Leskovec (@jure), Stanford University 3 Finance & Insurance Transportation Healthcare IT industry Commerce & Retail Manufacturing
  • 4. Today We are Showing § Research advancements in Graph ML § PyG ecosystem and partnerships § Industrial applications § How these technologies propel advancements of AI and applications of AI to next wave for science and industry § We are going to announce new tools and partnerships that power these applications Jure Leskovec (@jure), Stanford University 4
  • 5. Thank you! Thank you to all the researchers, developers and partners who join us to make this event amazing. Welcome to Stanford Graph Learning Workshop! Jure Leskovec (@jure), Stanford University 5
  • 6. INPUT GRAPH TARGET NODE B D E F C A B C D A A A C F B E A Deep Learning for Graphs 6 Jure Leskovec (@jure), Stanford University Each node defines a computation graph § Each edge is a message function MSG() § ⨁ is a message aggregation function Scarselli et al. 2005. The Graph Neural Network Model. IEEE Transactions on Neural Networks. M SG() ⨁ MSG() ⨁
  • 7. Graph Neural Networks Graph Neural Networks (GNNs) can learn patterns and relations on a giant scale and train predictive models: § 1) GNNs learn to combine features from neighboring nodes § 2) GNNs learn the graph patterns and relations Jure Leskovec, Stanford University 7
  • 8. Key Benefits of GNNs § GNNs adapt to local shape of data § Other methods assume fixed input (matrix, sequence) while GNNs capture local patterns around each node § GNNs glue together other Neural Network architectures (CNNs, Transformers) and integrate multimodal data Jure Leskovec (@jure), Stanford University 8
  • 9. Key Benefits of GNNs GNNs are extremely general: They subsume CNNs and Transformers as special cases: § Example: CNN layer with 3x3 filter Jure Leskovec (@jure), Stanford University 9 Convolutional neural networks (on grids) (Animation by Vincent Dumoulin) Single CNN layer with 3x3 filter: Image Graph GNN formulation: h! (#$%) = 𝜎(𝐖𝒍 ∑(∈*(!) +! (#) *(!) + B#h! (#) ), ∀𝑙 ∈ {0, … , 𝐿 − 1} CNN formulation: h! (#$%) = 𝜎(∑(∈*(!) 𝐖𝒍 𝒖 h( (#) + B#h! (#) ), ∀𝑙 ∈ {0, … , 𝐿 − 1}
  • 10. Many Applications of GNNs Jure Leskovec (@jure), Stanford University 10 Fraud detection Drug discovery Recommender systems Customer 360 journey analysis Supply chain optimization Fake news detection
  • 11. The hottest subfield in ML Jure Leskovec (@jure), Stanford University 11 Learning on Graphs Conference 9th – 12th December 2022 https://logconference.org
  • 12. Dynamic Financial Networks Jure Leskovec (@jure), Stanford University 12 Financial Networks: Describe financial entities and their connections Tasks: Fraud detection, Anti-money laundering, Anomaly detection ROLAND: Graph Learning Framework for Dynamic Graphs. Jiaxuan You, Tianyu Du, Jure Leskovec. KDD 2022.
  • 13. Forecasting Transactions Jure Leskovec (@jure), Stanford University 13 Central Bank of a European country forecasted financial transactions: § Central Bank was able to process dynamic graphs with tens of millions of transactions: § 43-73% improvement in accuracy § Robust performance with changes in transaction pattern
  • 14. ROLAND: Tool for Dynamic Graphs We developed ROLAND: § Easy creation of dynamic GNN from any static GNN § Scalable and adaptive training Jure Leskovec (@jure), Stanford University 14 Key idea: Recurrently update node embeddings at each layer, by injecting a new module to a static GNN: Input: § Previous embeddings from the same layer § Current embeddings from the previous layer Output: Updated embeddings https://github.com/snap-stanford/roland
  • 15. ROLAND: Implementation ROLAND is built with PyG GraphGym to efficiently explore the GNN design space Jure Leskovec (@jure), Stanford University 15 Design Space for Graph Neural Networks. Jiaxuan You, Rex Ying, Jure Leskovec. NeurIPS 2020
  • 16. PyG: GNN Library PyG provides the state-of-the-art in Graph Representation Learning Jure Leskovec (@jure), Stanford University 16
  • 17. PyG 2.1: What’s New § pyg-lib: A low-level GNN engine to further accelerate PyG, joint effort across different partnerships § Improved GNN design via principled aggregations § New model milestones: Scalable Link prediction, 1000+ layer deep GNNs, GNNs for heterophily graphs, ... Jure Leskovec (@jure), Stanford University 17
  • 18. PyG: GNN Library PyG is the most used Graph Representation Learning framework Jure Leskovec (@jure), Stanford University 18 § 80+ GNN architectures § 200+ benchmark datasets § Extendable via a message passing interface § Dedicated sparsity-aware CUDA kernels § Scalable and easy to use
  • 19. PyG Powers Products § anomaly and outlier detection knowledge graph for drug discovery recommender system for audio books and podcasts fraud detection fluid simulations Jure Leskovec (@jure), Stanford University 19
  • 20. Graph ML Case Studies Graph ML Tutorials: https://medium.com/stanford-cs224w § Graph Machine Learning tutorials created by Stanford students of CS224W. Jure Leskovec (@jure), Stanford University 20
  • 21. Stanford Graph ML Videos Stanford CS224W: Machine Learning with Graphs on YouTube https://youtube.com/playlist?list=PLoROMvodv4rPLKxIpqhjhPgdQy7imNkDn § Over 700,000 views Jure Leskovec (@jure), Stanford University 21
  • 22. PyG: Huge Community PYG.ORG: § ~1600 research papers written using PyG § ~100K monthly downloads § ~300 external contributors/developers § ~3k members on Slack (Join us: https://data.pyg.org/slack.html) Jure Leskovec (@jure), Stanford University 22
  • 23. Scaling-up Graph Learning § In real-world applications graph sizes easily reach billions of nodes and edges § Now with PyG we can learn deep learning models at a giant scale and train a predictive model for a graph Jure Leskovec (@jure), Stanford University 23
  • 24. New Abstractions § Graphs contain both the wireframe and rich features on its nodes. § Scaling up requires distributing this information out-of-core. § New PyG graph store and PyG feature store abstractions enable modular scalability. Jure Leskovec (@jure), Stanford University 24
  • 25. Partnership with NVIDIA NVIDIA joins as an official partner to accelerate GPU needs: § GPU-accelerated neighbor sampling via cugraph integration § GPU-accelerated heterogeneous GNN execution via typed matrix multiply Jure Leskovec (@jure), Stanford University 25 &
  • 26. Partnership with Intel Intel joins as an official partner to accelerate CPU needs: § Accelerated inference of GNNs on the CPU § Accelerated neighbor sampling techniques (up to 20x speed-ups) Jure Leskovec (@jure), Stanford University 26 &
  • 27. OGB-LSC-2 Jure Leskovec (@jure), Stanford University 27 In 2021 we run OGB-LSC with over 500+ participating teams § Huge leap forward on model performance We are announcing OGB-LSC-2: Winners Best baseline
  • 28. Knowledge Graphs Knowledge Graph stores facts as triples ℎ, 𝑟, 𝑡 § (‘Mona Lisa’, ‘created_by’, ‘Leonardo da Vinci’) § (‘Fulvestrant’, ‘causes’, ‘Migrane’) § (‘Fulvestrant’, ‘treats’, ‘Breast Neoplasm’) § Examples: 28 Jure Leskovec (@jure), Stanford University
  • 29. Predictive Queries over KGs 29 Biomedical knowledge graph Jure Leskovec (@jure), Stanford University [Embedding Logical Queries on Knowledge Graphs. Hamilton, et al., NeurIPS 2018] [Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings. Ren, et al., ICLR 2020]
  • 30. KG Reasoning with SMORE SMORE: Scalable framework for multi- hop Knowledge Graph reasoning Scales to full Freebase KG: 86M nodes, 338M edges Jure Leskovec (@jure), Stanford University 30 https://github.com/google-research/smore
  • 31. Graphs & Language Models Knowledge graphs help train more effective large language models. Jure Leskovec (@jure), Stanford University 31 https://arxiv.org/abs/2203.15827
  • 32. Conclusion § Today we announce new advances in research, new tools and platforms and exciting partnerships and updates to the PyG ecosystem § We have an exciting program for you with talks from brightest minds in science and industry Jure Leskovec (@jure), Stanford University 32
  • 33. Workshop Program 09:30 - 10:00 Matthias Fey, PyG – What’s New in PyG 10:00 - 10:20 Ivaylo Bahtchevanov, PyG – Building PyG Open Source Community 10:20 - 10:40 Manan Shah & Dong Wang, Kumo.ai – Scaling-up PyG 11:00 - 11:20 Rishi Puri, Nvidia – Accelerating PyG with Nvidia GPUs 11:20 - 11:40 Ke Ding, Intel – Accelerating PyG with Intel CPUs 11:40 - 12:00 Andreas Damianou, Spotify – Podcast Recommendations Spotify 12:00 - 12:20 Hema Raghavan, Kumo.ai – Query the Future using PyG 12:20 - 12:30 Joseph Huang, Stanford – Stanford CS LINXS Summer Program 13:30 - 13:50 Marinka Zitnik, Harvard – Graph AI to Enable Precision Medicine 13:50 - 14:10 Bryan Perozzi, Google – Challenges and Solutions for GNNs at Google 14:10 - 14:30 Srijan Kumar, GaTech –GNNs for Web Safety and Integrity 14:30 - 14:50 Luna Dong, Meta – Graph Mining for Intelligent Assistants 14:50 - 15:10 Michi Yasunaga, Stanford – Graph Learning in NLP Applications 15:30 - 15:50 Weihua Hu, Stanford – Learning Backward Compatible Embeddings 15:50 - 16:10 Hongyu Ren, Stanford – Multi-hop Reasoning in Knowledge Graphs 16:10 - 17:00 Panel – Challenges and Opportunities for Graph Learning •Naren Chittar, JPMorgan Chase (moderator) •Evan Feinberg, Genesis Therapeutics •Yunyao Li, Apple •Neil Shah, Snap •Karthik Subbian, Amazon
  • 34. Logistics § Q&A: Subscribe to PyG Slack: https://data.pyg.org/slack.html and join #workshop-2022 § Live stream: https://youtu.be/GYW286H3SKw Jure Leskovec (@jure), Stanford University 34
  • 35. Thank you!! Thank you for attending! Thank you Stanford Data Science Initiative for organizing: Joseph Huang Malwana Adalat Rok Sosic Ivaylo Buhtchevanov (PyG) Jure Leskovec (@jure), Stanford University 35