SlideShare a Scribd company logo
1 of 23
Download to read offline
Building an Enterprise
Knowledge
Graph @Uber:
Lessons from Reality
Joshua Shinavier, PhD
Knowledge Graph Conference
May 8th
, 2019
...
The future
Half empty
Half full
The present
Knowledge @Uber
● Uber is an ideal proving ground for an enterprise knowledge graph (EKG)
● 200k managed data sets
● Billions and billions of trips served
○ Low thousands of new entities per second
○ Totally doable!
● Even more sensor data
○ Use cases for graph stream processing
● Genuine need for knowledge and real-time inference
Knowledge @Uber
EKG hierarchy of needs
● Real data is messy
● Real data is messy
● We are not all ontologists
● Real data is messy
● We are not all ontologists
● Good enough does not scale
● Real data is messy
● We are not all ontologists
● Good enough does not scale
● Beware of the hype cycle
● Real data is messy
● We are not all ontologists
● Good enough does not scale
● Beware of the hype cycle
● RDF is a hard sell
● Real data is messy
● We are not all ontologists
● Good enough does not scale
● Beware of the hype cycle
● RDF is a hard sell
● Property Graphs are not enough
● Use and promote standards
● Use and promote standards
● Invest in shared vocabulary
● Use and promote standards
● Invest in shared vocabulary
● Fit the tooling to the infrastructure
● Use and promote standards
● Invest in shared vocabulary
● Fit the tooling to the infrastructure
● Fit the data model to the data
● Use and promote standards
● Invest in shared vocabulary
● Fit the tooling to the infrastructure
● Fit the data model to the data
● Budget for “other stuff”
● Use and promote standards
● Invest in shared vocabulary
● Fit the tooling to the infrastructure
● Fit the data model to the data
● Budget for “other stuff”
● Collaborate early and often
Risk & Safety Knowledge Graph
This slide intentionally left blank to save entropy.
UBER KNOWLEDGE GRAPH
● Controlled vocabularies for all of Uber
○ Basic type aliases
○ Structured types for geospatial data, sensor data, money, etc. etc.
○ Entities and relationships (User, Vehicle, Trip, etc.)
○ Metadata vocabularies
● Elevates domain-specific RPC and storage schemas to ontologies
● Tooling carries schemas between data representation languages
○ Protobuf, Thrift, Avro, RDF, PG, etc.
Data Standardization
● Hundreds of thousands of structured datasets at Uber
● Data protections and user trust
○ GDPR and other regulations, Uber’s own data policies
○ What kind of user data? Where is it?
○ Heroic numbers of manual annotations
■ Limited expressivity, limited guarantees
■ Inference is required
● Two birds: in annotating datasets, standardize and compose schemas
○ Now we have a true global knowledge graph
○ Investigating efficient reasoning and “No ETL” solutions
Metadata graph
● Common data model for RPC, storage, and KR at Uber
● In progress: alignment with the Property Graph Schema Working Group
● In progress: “Universal structure” of TinkerPop4
Algebraic Property Graphs
● Real data is messy
● We are not all ontologists
● Good enough does not scale
● Beware of the hype cycle
● RDF is a hard sell
● The Property Graph is not enough
● Use and promote standards
● Invest in shared vocabulary
● Fit the tooling to the infrastructure
● Fit the data model to the data
● Budget for “other stuff”
● Collaborate early and often
joshsh@uber.com
Thanks

More Related Content

What's hot

Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
Neo4j
 

What's hot (20)

Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]
 
Introduction of Knowledge Graphs
Introduction of Knowledge GraphsIntroduction of Knowledge Graphs
Introduction of Knowledge Graphs
 
Graph Databases and Machine Learning | November 2018
Graph Databases and Machine Learning | November 2018Graph Databases and Machine Learning | November 2018
Graph Databases and Machine Learning | November 2018
 
How Expedia’s Entity Graph Powers Global Travel
How Expedia’s Entity Graph Powers Global TravelHow Expedia’s Entity Graph Powers Global Travel
How Expedia’s Entity Graph Powers Global Travel
 
Using an employee knowledge graph for employee engagement and career mobility
Using an employee knowledge graph for employee engagement and career mobilityUsing an employee knowledge graph for employee engagement and career mobility
Using an employee knowledge graph for employee engagement and career mobility
 
Slides: Knowledge Graphs vs. Property Graphs
Slides: Knowledge Graphs vs. Property GraphsSlides: Knowledge Graphs vs. Property Graphs
Slides: Knowledge Graphs vs. Property Graphs
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
 
Demystifying Graph Neural Networks
Demystifying Graph Neural NetworksDemystifying Graph Neural Networks
Demystifying Graph Neural Networks
 
Smarter Fraud Detection With Graph Data Science
Smarter Fraud Detection With Graph Data ScienceSmarter Fraud Detection With Graph Data Science
Smarter Fraud Detection With Graph Data Science
 
Property graph vs. RDF Triplestore comparison in 2020
Property graph vs. RDF Triplestore comparison in 2020Property graph vs. RDF Triplestore comparison in 2020
Property graph vs. RDF Triplestore comparison in 2020
 
ntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technologyntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technology
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
 
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
 
Graph Machine Learning in Production with Neo4j
Graph Machine Learning in Production with Neo4jGraph Machine Learning in Production with Neo4j
Graph Machine Learning in Production with Neo4j
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph Solution
 
Democratizing Data at Airbnb
Democratizing Data at AirbnbDemocratizing Data at Airbnb
Democratizing Data at Airbnb
 
Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020
 
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
 
Benefits of Knowledge Graphs and AI For Enterprise PLM Platforms
Benefits of Knowledge Graphs and AI For Enterprise PLM PlatformsBenefits of Knowledge Graphs and AI For Enterprise PLM Platforms
Benefits of Knowledge Graphs and AI For Enterprise PLM Platforms
 

Similar to Building an Enterprise Knowledge Graph @Uber: Lessons from Reality

Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Sarah Aerni
 

Similar to Building an Enterprise Knowledge Graph @Uber: Lessons from Reality (20)

Data mining with Rattle For R
Data mining with Rattle For RData mining with Rattle For R
Data mining with Rattle For R
 
AI & Personalised Experiences
AI & Personalised ExperiencesAI & Personalised Experiences
AI & Personalised Experiences
 
Domain Semantics
Domain SemanticsDomain Semantics
Domain Semantics
 
Data Science as Scale
Data Science as ScaleData Science as Scale
Data Science as Scale
 
"The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor...
"The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor..."The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor...
"The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor...
 
Smart User Experiences and the World of Work: Context is King
Smart User Experiences and the World of Work: Context is KingSmart User Experiences and the World of Work: Context is King
Smart User Experiences and the World of Work: Context is King
 
Context is King: Smart User Experiences and the World of Work
Context is King: Smart User Experiences and the World of WorkContext is King: Smart User Experiences and the World of Work
Context is King: Smart User Experiences and the World of Work
 
Artificial Intelligence and Antitrust (Hal Varian)
Artificial Intelligence and Antitrust (Hal Varian)Artificial Intelligence and Antitrust (Hal Varian)
Artificial Intelligence and Antitrust (Hal Varian)
 
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
 
SDSC18 and DSATL Meetup March 2018
SDSC18 and DSATL Meetup March 2018 SDSC18 and DSATL Meetup March 2018
SDSC18 and DSATL Meetup March 2018
 
Big Data Pipelines and Machine Learning at Uber
Big Data Pipelines and Machine Learning at UberBig Data Pipelines and Machine Learning at Uber
Big Data Pipelines and Machine Learning at Uber
 
Publishing Linked Data using Schema.org
Publishing Linked Data using Schema.orgPublishing Linked Data using Schema.org
Publishing Linked Data using Schema.org
 
[Strata NYC 2019] Turning big data into knowledge: Managing metadata and data...
[Strata NYC 2019] Turning big data into knowledge: Managing metadata and data...[Strata NYC 2019] Turning big data into knowledge: Managing metadata and data...
[Strata NYC 2019] Turning big data into knowledge: Managing metadata and data...
 
Big Data with IOT approach and trends with case study
Big Data with IOT approach and trends with case studyBig Data with IOT approach and trends with case study
Big Data with IOT approach and trends with case study
 
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
 
Hadoop Training Tutorial for Freshers
Hadoop Training Tutorial for FreshersHadoop Training Tutorial for Freshers
Hadoop Training Tutorial for Freshers
 
Data science: use cases and tools
Data science: use cases and toolsData science: use cases and tools
Data science: use cases and tools
 
L15.pptx
L15.pptxL15.pptx
L15.pptx
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital Preservation
 
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
 

More from Joshua Shinavier

The Real-time Web in the Age of Agents
The Real-time Web in the Age of AgentsThe Real-time Web in the Age of Agents
The Real-time Web in the Age of Agents
Joshua Shinavier
 

More from Joshua Shinavier (16)

Anything-to-Graph
Anything-to-GraphAnything-to-Graph
Anything-to-Graph
 
Transpilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing HydraTranspilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing Hydra
 
TinkerPop 2020
TinkerPop 2020TinkerPop 2020
TinkerPop 2020
 
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
 
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
 
In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)
 
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
 
TinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBsTinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBs
 
Semantics and Sensors
Semantics and SensorsSemantics and Sensors
Semantics and Sensors
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.org
 
The Real-time Web in the Age of Agents
The Real-time Web in the Age of AgentsThe Real-time Web in the Age of Agents
The Real-time Web in the Age of Agents
 
Linked Process
Linked ProcessLinked Process
Linked Process
 
Real-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter AnnotationsReal-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter Annotations
 
Real-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 charsReal-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 chars
 
The state of the art in Linked Data
The state of the art in Linked DataThe state of the art in Linked Data
The state of the art in Linked Data
 

Recently uploaded

CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
Wonjun Hwang
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 

Recently uploaded (20)

CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 

Building an Enterprise Knowledge Graph @Uber: Lessons from Reality

  • 1. Building an Enterprise Knowledge Graph @Uber: Lessons from Reality Joshua Shinavier, PhD Knowledge Graph Conference May 8th , 2019 ...
  • 2. The future Half empty Half full The present Knowledge @Uber
  • 3. ● Uber is an ideal proving ground for an enterprise knowledge graph (EKG) ● 200k managed data sets ● Billions and billions of trips served ○ Low thousands of new entities per second ○ Totally doable! ● Even more sensor data ○ Use cases for graph stream processing ● Genuine need for knowledge and real-time inference Knowledge @Uber
  • 5.
  • 6. ● Real data is messy
  • 7. ● Real data is messy ● We are not all ontologists
  • 8. ● Real data is messy ● We are not all ontologists ● Good enough does not scale
  • 9. ● Real data is messy ● We are not all ontologists ● Good enough does not scale ● Beware of the hype cycle
  • 10. ● Real data is messy ● We are not all ontologists ● Good enough does not scale ● Beware of the hype cycle ● RDF is a hard sell
  • 11. ● Real data is messy ● We are not all ontologists ● Good enough does not scale ● Beware of the hype cycle ● RDF is a hard sell ● Property Graphs are not enough
  • 12. ● Use and promote standards
  • 13. ● Use and promote standards ● Invest in shared vocabulary
  • 14. ● Use and promote standards ● Invest in shared vocabulary ● Fit the tooling to the infrastructure
  • 15. ● Use and promote standards ● Invest in shared vocabulary ● Fit the tooling to the infrastructure ● Fit the data model to the data
  • 16. ● Use and promote standards ● Invest in shared vocabulary ● Fit the tooling to the infrastructure ● Fit the data model to the data ● Budget for “other stuff”
  • 17. ● Use and promote standards ● Invest in shared vocabulary ● Fit the tooling to the infrastructure ● Fit the data model to the data ● Budget for “other stuff” ● Collaborate early and often
  • 18. Risk & Safety Knowledge Graph This slide intentionally left blank to save entropy. UBER KNOWLEDGE GRAPH
  • 19. ● Controlled vocabularies for all of Uber ○ Basic type aliases ○ Structured types for geospatial data, sensor data, money, etc. etc. ○ Entities and relationships (User, Vehicle, Trip, etc.) ○ Metadata vocabularies ● Elevates domain-specific RPC and storage schemas to ontologies ● Tooling carries schemas between data representation languages ○ Protobuf, Thrift, Avro, RDF, PG, etc. Data Standardization
  • 20. ● Hundreds of thousands of structured datasets at Uber ● Data protections and user trust ○ GDPR and other regulations, Uber’s own data policies ○ What kind of user data? Where is it? ○ Heroic numbers of manual annotations ■ Limited expressivity, limited guarantees ■ Inference is required ● Two birds: in annotating datasets, standardize and compose schemas ○ Now we have a true global knowledge graph ○ Investigating efficient reasoning and “No ETL” solutions Metadata graph
  • 21. ● Common data model for RPC, storage, and KR at Uber ● In progress: alignment with the Property Graph Schema Working Group ● In progress: “Universal structure” of TinkerPop4 Algebraic Property Graphs
  • 22. ● Real data is messy ● We are not all ontologists ● Good enough does not scale ● Beware of the hype cycle ● RDF is a hard sell ● The Property Graph is not enough ● Use and promote standards ● Invest in shared vocabulary ● Fit the tooling to the infrastructure ● Fit the data model to the data ● Budget for “other stuff” ● Collaborate early and often