SlideShare a Scribd company logo
https://nebula-graph.io
An Open-Source Distributed Graph Database
Sherman Ye
Founder & CEO
sherman.ye@vesoft.com
https://nebula-graph.io
Agenda
l Who We Are?
l What is a Graph Database?
l Why Open Source?
l Architecture
l Advantages
• In Architecture
• In Data Amount
• In Performance
l Adopters
l Product Roadmap
l Summary
https://nebula-graph.io
Founder: A Graph Database Forerunner
Founder & CEO
Infra Software
Engineer
(2010-2015)
Started and led the high-performance
distributed Graph Database project - Dragon
• First distributed Graph Database ever in the industry
• Deployed on more than 500 nodes by Jan. 2015
• Together with the write-through cache system TAO, Dragon served all
relationship queries across the entire Facebook application stacks.
Principal
Software
Engineer
(2015-2018)
Formed the team to build the high-performance Graph
Database GeaBase from the ground up
• GeaBase is the only technical product that was rooted in Ant Financial
and widely adopted in Alibaba Group
• Deployed on more than 1000 nodes and served more than one billion
queries every day. The average latency is about 40ms
• Served in almost one hundred scenario and covered business groups
from Ant Financial, to AliExpress, GaoDe Map, AliMama, TianMao,
CaiNiao, UC, etc.
Sherman Ye
https://nebula-graph.io
Who We Are?
Team
l 40+ full-time employees, more than 30 of them are
technical persons
l More than half of the technical employees are from Alibaba,
Hauawei, NetEase, etc.
l vesoft Inc. was founded in Oct. 2018, Nebula Graph became
open source in May 2019
l Finalized $3M angel funding: Matrix Partners China
l Raised $17M Pre-A funding: Source Code Capital, Red
Point Ventures, Matrix Partners China
History
https://nebula-graph.io
What is a Graph Database?
l Typical database consists of tables filled with same type of data, useful for quick retrieval of such
data.
l A graph database can uncover deep relationships between many data sets
l In a graph, a VERTEX (or NODE) defines an item, a defined EDGE connects it with other data sets
or items (one way, two way or multiple ways), and PROPERTIES can further classify a VERTEX
for more granular data mining
l So, instead of just calling up people with the name SMITH data scientists can instead intelligently reveal
how SMITH might relate to JONES or DOE, in a given time, a location, and more
l These capabilities lead to big users: Adobe, Facebook, Microsoft, Netflix, and many more
l Graph database market expected to grow $650M (2018) to $4.13B by 2026 (Verified Market
Research)
https://nebula-graph.io
Why Open Source?
l To make the graph technology more accessible to the world
l To build a healthy ecosystem around Nebula Graph
l To expand globally
GitHub star: 5200+ WeChat group
members: 1000+
Contributors: 50+ Forum posts: 1,100/month
https://nebula-graph.io
Architecture
l Meta Service
l Query Service
l Storage Service
l Proven Highest Performance
l The Most Scalable
l Industry’s Highest Availability
Three Components:
Advantages:
https://nebula-graph.io
Advantages
Data Amount in Example:
l Data amount: 150TB
l Graph size: One trillion edges/connections
l An hourly update of 10 billion connections
Compared with other graph database solutions, Nebula Graph has the following advantages:
In Architecture
l Shared-nothing structure - ensures high availability
l Storage and computation separation - ensures high scalability and cloud ready
https://nebula-graph.io
In Performance: Meituan
Link to the topic on the forum:
https://discuss.nebula-graph.io/t/benchmarking-the-mainstream-open-source-distributed-graph-databases-at-meituan-nebula-
graph-vs-dgraph-vs-hugegraph/715
Real-Time Write
We invite you to read a real large customer’s own performance benchmarking, conducted by the
NLP team at Meituan: NebulaGraph vs. Dgraph vs. HugeGraph
https://nebula-graph.io
In Performance: Meituan (Cont’d)
N-Hop Queries Shared Friends Queries
https://nebula-graph.io
In Performance: Tencent Cloud
Data import 1-degree friends query 2-degree friends query Common friends query
Performance comparison conducted by the Tencent Cloud team:
NebulaGraph vs. Neo4j vs. HugeGraph
Link to the topic on the forum:
https://discuss.nebula-graph.io/t/performance-comparison-neo4j-vs-nebula-graph-vs-janusgraph/619
https://nebula-graph.io
In Performance: 360 Digitech
360 Digitech has shared their experience migrating from JanusGraph to NebulaGraph and the
huge performance gains after the migration.
Link to the topic on the forum:
https://discuss.nebula-graph.io/t/data-migration-from-janusgraph-to-nebula-graph-practice-at-360-finance/672
HBase network I/O Nebula Graph network I/O
HBase disk I/O Nebula Graph disk I/O
https://nebula-graph.io
In Performance: 360 Digitech (Cont’d)
Test Results from 360 Digitech
l NebulaGraph significantly outperforms in disk or network I/O
l Performance achieved using only 30% of HBase cluster machine resources
l When JanusGraph needs 2-3 seconds per query, Nebula Graph just needs 100 ms
l When JanusGraph needs 10-20 seconds per query, Nebula Graph needs 2 seconds
l Overall Nebula Graph performance is more than 20 times improvement over others
https://nebula-graph.io
Adopters
Real-Time
Recommendations
Fraud
Detection
Cyber Security Knowledge Graph
...... ...... ...... ......
https://nebula-graph.io
Product Roadmap
https://nebula-graph.io
Summary
l Nebula Graph is a VC-funded solution already adopted by some of the world’s
largest Internet companies
l Nebula Graph is proven the world’s highest performing Graph Database
l It can store and process hundreds of billions of data points with trillions of relational connections in a
shared-nothing distributed architecture
l Graph database market to quadruple in size by 2026
https://nebula-graph.io
Unleash the Power of
Connections!
https://nebula-graph.io
Thank You
GitHub: vesoft-inc/nebula
Twitter: @NebulaGraph
Facebook: @NebulaGraph
https://discuss.nebula-graph.io
https://nebula-graph.io

More Related Content

What's hot

Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
James Serra
 
Netflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering MeetupNetflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering Meetup
Blake Irvine
 
Data Mesh
Data MeshData Mesh
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
AWS Chicago
 
Snowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat SheetSnowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat Sheet
Jeno Yamma
 
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship ManagementBig MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
Caserta
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)
Michael Olschimke
 
Big Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best PracticesBig Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best Practices
Amazon Web Services
 
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
StreamNative
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
Snowflake Computing
 
Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake Architecture
DATAVERSITY
 
Présentation data vault et bi v20120508
Présentation data vault et bi v20120508Présentation data vault et bi v20120508
Présentation data vault et bi v20120508
Empowered Holdings, LLC
 
Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...
Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...
Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...
HostedbyConfluent
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Vivek Aanand Ganesan
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
 
Graph based data models
Graph based data modelsGraph based data models
Graph based data models
Moumie Soulemane
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
spark-project
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
Snowflake Computing
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 

What's hot (20)

Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Netflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering MeetupNetflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering Meetup
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
 
Snowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat SheetSnowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat Sheet
 
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship ManagementBig MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)
 
Big Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best PracticesBig Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best Practices
 
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
 
Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake Architecture
 
Présentation data vault et bi v20120508
Présentation data vault et bi v20120508Présentation data vault et bi v20120508
Présentation data vault et bi v20120508
 
Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...
Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...
Standing on the Shoulders of Open-Source Giants: The Serverless Realtime Lake...
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
 
Graph based data models
Graph based data modelsGraph based data models
Graph based data models
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
 

Similar to Introduction to Nebula Graph, an Open-Source Distributed Graph Database

GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
Neo4j
 
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
InfluxData
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
PRBETTER
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscape
Linkurious
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscape
Linkurious
 
Power of Polyglot Search
Power of Polyglot SearchPower of Polyglot Search
Power of Polyglot Search
Janos Szendi-Varga
 
The power of polyglot searching
The power of polyglot searchingThe power of polyglot searching
The power of polyglot searching
GraphAware
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric Teams
Data Con LA
 
Big Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil JadhavBig Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil JadhavSwapnil (Neil) Jadhav
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesCédric Fauvet
 
Deep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous DrivingDeep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous Driving
Alok Jain
 
Deep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex FridmanDeep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex Fridman
Peerasak C.
 
Graph-Oriented NoSQL Databases
Graph-Oriented NoSQL  Databases Graph-Oriented NoSQL  Databases
Graph-Oriented NoSQL Databases
Abdelkader OUARED
 
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
 Apache AGE and the synergy effect in the combination of Postgres and NoSQL Apache AGE and the synergy effect in the combination of Postgres and NoSQL
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
EDB
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Big Data Spain
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
Neo4j
 
Anaconda and PyData Solutions
Anaconda and PyData SolutionsAnaconda and PyData Solutions
Anaconda and PyData Solutions
Travis Oliphant
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge
Karin Patenge
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
Neo4j
 

Similar to Introduction to Nebula Graph, an Open-Source Distributed Graph Database (20)

GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
 
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscape
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscape
 
Power of Polyglot Search
Power of Polyglot SearchPower of Polyglot Search
Power of Polyglot Search
 
The power of polyglot searching
The power of polyglot searchingThe power of polyglot searching
The power of polyglot searching
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric Teams
 
Big Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil JadhavBig Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil Jadhav
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
 
Deep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous DrivingDeep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous Driving
 
Deep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex FridmanDeep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex Fridman
 
Graph-Oriented NoSQL Databases
Graph-Oriented NoSQL  Databases Graph-Oriented NoSQL  Databases
Graph-Oriented NoSQL Databases
 
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
 Apache AGE and the synergy effect in the combination of Postgres and NoSQL Apache AGE and the synergy effect in the combination of Postgres and NoSQL
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier Dominguez
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
 
Anaconda and PyData Solutions
Anaconda and PyData SolutionsAnaconda and PyData Solutions
Anaconda and PyData Solutions
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
 
20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 

Recently uploaded

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 

Recently uploaded (20)

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 

Introduction to Nebula Graph, an Open-Source Distributed Graph Database

  • 1. https://nebula-graph.io An Open-Source Distributed Graph Database Sherman Ye Founder & CEO sherman.ye@vesoft.com
  • 2. https://nebula-graph.io Agenda l Who We Are? l What is a Graph Database? l Why Open Source? l Architecture l Advantages • In Architecture • In Data Amount • In Performance l Adopters l Product Roadmap l Summary
  • 3. https://nebula-graph.io Founder: A Graph Database Forerunner Founder & CEO Infra Software Engineer (2010-2015) Started and led the high-performance distributed Graph Database project - Dragon • First distributed Graph Database ever in the industry • Deployed on more than 500 nodes by Jan. 2015 • Together with the write-through cache system TAO, Dragon served all relationship queries across the entire Facebook application stacks. Principal Software Engineer (2015-2018) Formed the team to build the high-performance Graph Database GeaBase from the ground up • GeaBase is the only technical product that was rooted in Ant Financial and widely adopted in Alibaba Group • Deployed on more than 1000 nodes and served more than one billion queries every day. The average latency is about 40ms • Served in almost one hundred scenario and covered business groups from Ant Financial, to AliExpress, GaoDe Map, AliMama, TianMao, CaiNiao, UC, etc. Sherman Ye
  • 4. https://nebula-graph.io Who We Are? Team l 40+ full-time employees, more than 30 of them are technical persons l More than half of the technical employees are from Alibaba, Hauawei, NetEase, etc. l vesoft Inc. was founded in Oct. 2018, Nebula Graph became open source in May 2019 l Finalized $3M angel funding: Matrix Partners China l Raised $17M Pre-A funding: Source Code Capital, Red Point Ventures, Matrix Partners China History
  • 5. https://nebula-graph.io What is a Graph Database? l Typical database consists of tables filled with same type of data, useful for quick retrieval of such data. l A graph database can uncover deep relationships between many data sets l In a graph, a VERTEX (or NODE) defines an item, a defined EDGE connects it with other data sets or items (one way, two way or multiple ways), and PROPERTIES can further classify a VERTEX for more granular data mining l So, instead of just calling up people with the name SMITH data scientists can instead intelligently reveal how SMITH might relate to JONES or DOE, in a given time, a location, and more l These capabilities lead to big users: Adobe, Facebook, Microsoft, Netflix, and many more l Graph database market expected to grow $650M (2018) to $4.13B by 2026 (Verified Market Research)
  • 6. https://nebula-graph.io Why Open Source? l To make the graph technology more accessible to the world l To build a healthy ecosystem around Nebula Graph l To expand globally GitHub star: 5200+ WeChat group members: 1000+ Contributors: 50+ Forum posts: 1,100/month
  • 7. https://nebula-graph.io Architecture l Meta Service l Query Service l Storage Service l Proven Highest Performance l The Most Scalable l Industry’s Highest Availability Three Components: Advantages:
  • 8. https://nebula-graph.io Advantages Data Amount in Example: l Data amount: 150TB l Graph size: One trillion edges/connections l An hourly update of 10 billion connections Compared with other graph database solutions, Nebula Graph has the following advantages: In Architecture l Shared-nothing structure - ensures high availability l Storage and computation separation - ensures high scalability and cloud ready
  • 9. https://nebula-graph.io In Performance: Meituan Link to the topic on the forum: https://discuss.nebula-graph.io/t/benchmarking-the-mainstream-open-source-distributed-graph-databases-at-meituan-nebula- graph-vs-dgraph-vs-hugegraph/715 Real-Time Write We invite you to read a real large customer’s own performance benchmarking, conducted by the NLP team at Meituan: NebulaGraph vs. Dgraph vs. HugeGraph
  • 10. https://nebula-graph.io In Performance: Meituan (Cont’d) N-Hop Queries Shared Friends Queries
  • 11. https://nebula-graph.io In Performance: Tencent Cloud Data import 1-degree friends query 2-degree friends query Common friends query Performance comparison conducted by the Tencent Cloud team: NebulaGraph vs. Neo4j vs. HugeGraph Link to the topic on the forum: https://discuss.nebula-graph.io/t/performance-comparison-neo4j-vs-nebula-graph-vs-janusgraph/619
  • 12. https://nebula-graph.io In Performance: 360 Digitech 360 Digitech has shared their experience migrating from JanusGraph to NebulaGraph and the huge performance gains after the migration. Link to the topic on the forum: https://discuss.nebula-graph.io/t/data-migration-from-janusgraph-to-nebula-graph-practice-at-360-finance/672 HBase network I/O Nebula Graph network I/O HBase disk I/O Nebula Graph disk I/O
  • 13. https://nebula-graph.io In Performance: 360 Digitech (Cont’d) Test Results from 360 Digitech l NebulaGraph significantly outperforms in disk or network I/O l Performance achieved using only 30% of HBase cluster machine resources l When JanusGraph needs 2-3 seconds per query, Nebula Graph just needs 100 ms l When JanusGraph needs 10-20 seconds per query, Nebula Graph needs 2 seconds l Overall Nebula Graph performance is more than 20 times improvement over others
  • 16. https://nebula-graph.io Summary l Nebula Graph is a VC-funded solution already adopted by some of the world’s largest Internet companies l Nebula Graph is proven the world’s highest performing Graph Database l It can store and process hundreds of billions of data points with trillions of relational connections in a shared-nothing distributed architecture l Graph database market to quadruple in size by 2026
  • 18. https://nebula-graph.io Thank You GitHub: vesoft-inc/nebula Twitter: @NebulaGraph Facebook: @NebulaGraph https://discuss.nebula-graph.io https://nebula-graph.io