Crafting a Knowledge Graph Strategy - What to think about

Connected Data World
Connected Data WorldConnected Data World
Crafting a Knowledge Graph Strategy
What to think about
Panos Alexopoulos
Head of Ontology
2.5 years ago I got a mission ...
• “Good morning Mr. Alexopoulos. Your mission,
should you decide to accept it, is to build an
HR and Recruitment Knowledge Graph.”
• “As always, should you or any member of your
team be caught or killed, the company will
disavow any knowledge of your actions. “
• “This tape will self-destruct in 5 seconds.”
… which I accepted as I had the perfect recipe ...
… that often didn’t quite work as expected ...
… leading to several hard-learned lessons ...
… one of them being:
Is it easy to build a
Knowledge Graph?
A tale of two graphs
• EU’s ESCO knowledge graph:
• Released September 2017
• 12K Skill and 3K Profession interrelated concepts in 27
languages
• Built completely manually by experts in 6 years and cost
~15 million Euros
• To be updated once a year by a maintenance committee
• DBPedia knowledge graph
• First release in 2007
• Currently >17M entities in 127 languages
• Automatically extracted from Wikipedia infoboxes
• Live version and monthly releases
Why the difference?
• Types of entities: Abstract entities more difficult to define than concrete
ones (e.g. “Ontology” v.s. “The Netherlands”)
• Types of relations: Vague relations harder to define and achieve
consensus (e.g. “has essential skill” v.s. “was born in”)
• Available data sources: There are no Wikipedia-like infoboxes for
Professions and Skills that could enable automatic extraction.
• Quality requirements: ESCO wanted very high accuracy.
Knowledge Graphs and
Strategy
Strategy is about “What” rather than “How”
• What domains of knowledge we want our
Knowledge Graph to cover?
• What data we want to connect?
• What applications we want to power?
• What users we want to serve?
• What quality dimensions are important and
at what levels?
Strategy is about Choices, Dilemmas and Decisions
• Domain A or Domain B?
• Accuracy or Completeness?
• Application A or Application B?
• Data-Driven or Human-Driven?
• Democracy, Oligarchy or Dictatorship?
• Single or Multiple Truths?
Crafting a Knowledge
Graph Strategy
Know thy Goals
• What business/technical goals are you trying to
achieve via the Knowledge Graph?
• Imitate the others?
• Enhance current products/processes and get
ahead of the competition ? If so, which?
• Lay the foundations for new products/processes?
If so, which?
• Integrate dispersed data and improve their
governance?
• …
• Extract!: Entity and Relation Tagging and Disambiguation in CVs and Vacancies
• Search! and Match!: Query Understanding and Expansion, Semantic Similarity
• Jobfeed: Labour Market Analytics
Example: Products enhanced by the Textkernel Knowledge
Graph
Know thy Context
• Domain(s) and Semantics
• What elements does the Knowledge Graph need to provide
(entity types, relations, etc)?
• Is there ambiguity, vagueness, semantic drift? To what extent?
• Technology Context
• What knowledge can be automatically mined with the
technology we have?
• How fast and how accurately?
• Organizational Context
• Data, technology and products
• People, knowledge and capabilities
• Processes, practices and attitudes
• Business Context
• Business strategy and model(s)
• Clients, partners and competitors
• Trends, hypes, expectations and narratives
Use Context to make choices and break dilemmas
• If your semantics are highly vague and cause disagreements you
might need to support multiple truths.
• If the best extraction system for relation X is only 50% accurate, you
will probably need manual curation.
• If there is already a non-RDF semantics management infrastructure
in the organization, it may not worth replacing it with an RDF one!
• If product A generates more revenue than product B, you might need
to optimize the KG for A.
• If your competitor outperforms you in coverage because of more data
and better infrastructure, you might want to compete on accuracy or
some niche complex domain..
Questions to think when you return to the office
Do w w y e
bu n (or t) a
Kno d G ap ?
Do w w o t
an r u “Wha ” an
b e d e m ?
➔ High-level goals are ok but
you need concrete goals
➔ Any answer with “AI”, “Big
Data” or other buzzword
should be ignored!
➔ Semantics
➔ Data
➔ Technology
➔ Organization
➔ Business
➔ Basic philosophy and
principles
➔ Definition of success (and
failure)
➔ Decision making processes
Do w w e c l e
Con t?
Thank you!
Panos Alexopoulos
Head of Ontology
E-mail: alexopoulos@textkernel.nl
Web: http://www.panosalexopoulos.com
LinkedIn: www.linkedin.com/in/panosalexopoulos
Twitter: @PAlexop
1 of 19

Recommended

Tara Raafat by
Tara RaafatTara Raafat
Tara RaafatConnected Data World
975 views28 slides
Sebastian Hellmann by
Sebastian HellmannSebastian Hellmann
Sebastian HellmannConnected Data World
381 views26 slides
Charles Ivie by
Charles Ivie Charles Ivie
Charles Ivie Connected Data World
259 views17 slides
Connected data meetup group - introduction & scope by
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected Data World
342 views11 slides
Mike Bennett by
Mike BennettMike Bennett
Mike BennettConnected Data World
899 views43 slides
moOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOn by
moOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOnmoOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOn
moOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOnNeo4j
853 views40 slides

More Related Content

What's hot

Edmc use cases 2018 nyc by
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nycMarty Loughlin
379 views12 slides
How a global manufacturing company built a data science capability from scratch by
How a global manufacturing company built a data science capability from scratchHow a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratchCarlo Torniai
2.1K views19 slides
State street edmc swaps pilot by
State street edmc swaps pilotState street edmc swaps pilot
State street edmc swaps pilotMarty Loughlin
844 views7 slides
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha... by
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...Dr. Haxel Consult
383 views21 slides
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow by
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlowMachine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlowandrejusb
173 views23 slides
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu... by
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...Dr. Haxel Consult
482 views15 slides

What's hot(20)

How a global manufacturing company built a data science capability from scratch by Carlo Torniai
How a global manufacturing company built a data science capability from scratchHow a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratch
Carlo Torniai2.1K views
State street edmc swaps pilot by Marty Loughlin
State street edmc swaps pilotState street edmc swaps pilot
State street edmc swaps pilot
Marty Loughlin844 views
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha... by Dr. Haxel Consult
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
Dr. Haxel Consult383 views
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow by andrejusb
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlowMachine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
andrejusb173 views
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu... by Dr. Haxel Consult
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
Dr. Haxel Consult482 views
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework by Neo4j
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
Neo4j565 views
Neo4j: What's Under the Hood & How Knowing This Can Help You by Neo4j
Neo4j: What's Under the Hood & How Knowing This Can Help You Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j805 views
Talk to me Goose: Going beyond your regular Chatbot by Luc Bors
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular Chatbot
Luc Bors326 views
How LinkedIn leverages data to build scalable payments strategy by Chi-Yi Kuan
How LinkedIn leverages data to build scalable payments strategyHow LinkedIn leverages data to build scalable payments strategy
How LinkedIn leverages data to build scalable payments strategy
Chi-Yi Kuan650 views
GraphTour - DXC - Digital Explorer by Neo4j
GraphTour - DXC - Digital ExplorerGraphTour - DXC - Digital Explorer
GraphTour - DXC - Digital Explorer
Neo4j686 views
Neo4j: What's Under the Hood by Neo4j
Neo4j: What's Under the HoodNeo4j: What's Under the Hood
Neo4j: What's Under the Hood
Neo4j531 views
Compliance made easy: Lynx webinar #1 by Lynx Project
Compliance made easy: Lynx webinar #1Compliance made easy: Lynx webinar #1
Compliance made easy: Lynx webinar #1
Lynx Project516 views
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w... by semanticsconference
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Session 2.1 ontological representation of the telecom domain for advanced a... by semanticsconference
Session 2.1   ontological representation of the telecom domain for advanced a...Session 2.1   ontological representation of the telecom domain for advanced a...
Session 2.1 ontological representation of the telecom domain for advanced a...
Graph Data Science: The Secret to Accelerating Innovation with AI/ML by Neo4j
Graph Data Science: The Secret to Accelerating Innovation with AI/MLGraph Data Science: The Secret to Accelerating Innovation with AI/ML
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Neo4j415 views
W3C-XBRL International Workshop 2009 by ashubhatnagar
W3C-XBRL International Workshop 2009W3C-XBRL International Workshop 2009
W3C-XBRL International Workshop 2009
ashubhatnagar286 views
Test strategy for Conversational AI by Shama Ugale
Test strategy for Conversational AITest strategy for Conversational AI
Test strategy for Conversational AI
Shama Ugale290 views
AI-SDV 2020: Using Transformer technology to build an AI based personal News ... by Dr. Haxel Consult
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
Dr. Haxel Consult797 views
Intro to Graphs and Neo4j by jexp
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
jexp2.9K views

Similar to Crafting a Knowledge Graph Strategy - What to think about

Lean Analytics: How to get more out of your data science team by
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamDigital Transformation EXPO Event Series
148 views25 slides
What Managers Need to Know about Data Science by
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceAnnie Flippo
802 views45 slides
Crafting a Compelling Data Science Resume by
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeArushi Prakash, Ph.D.
340 views33 slides
Synergis60: 6 Critical Steps to Implementing Data Managment by
Synergis60: 6 Critical Steps to Implementing Data ManagmentSynergis60: 6 Critical Steps to Implementing Data Managment
Synergis60: 6 Critical Steps to Implementing Data ManagmentSynergis Engineering Design Solutions
416 views33 slides
CTO School Meetup - Jan 2013 Becoming Better Technical Leader by
CTO School Meetup - Jan 2013   Becoming Better Technical LeaderCTO School Meetup - Jan 2013   Becoming Better Technical Leader
CTO School Meetup - Jan 2013 Becoming Better Technical LeaderJean Barmash
2.2K views32 slides
Be the Captain of Your Career by
Be the Captain of Your Career Be the Captain of Your Career
Be the Captain of Your Career Jack Molisani
550 views71 slides

Similar to Crafting a Knowledge Graph Strategy - What to think about(20)

What Managers Need to Know about Data Science by Annie Flippo
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data Science
Annie Flippo802 views
CTO School Meetup - Jan 2013 Becoming Better Technical Leader by Jean Barmash
CTO School Meetup - Jan 2013   Becoming Better Technical LeaderCTO School Meetup - Jan 2013   Becoming Better Technical Leader
CTO School Meetup - Jan 2013 Becoming Better Technical Leader
Jean Barmash2.2K views
Be the Captain of Your Career by Jack Molisani
Be the Captain of Your Career Be the Captain of Your Career
Be the Captain of Your Career
Jack Molisani550 views
Lessons after working as a data scientist for 1 year by Yao Yao
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 year
Yao Yao439 views
InfoVision_PM101_RPadaki by Ravi Padaki
InfoVision_PM101_RPadakiInfoVision_PM101_RPadaki
InfoVision_PM101_RPadaki
Ravi Padaki92 views
DACS Presentation January 2015 by amandablair203
DACS Presentation January 2015 DACS Presentation January 2015
DACS Presentation January 2015
amandablair203195 views
DACS Presentation January 2015 by John Barry
DACS Presentation January 2015DACS Presentation January 2015
DACS Presentation January 2015
John Barry1K views
Are you ready for Data science? A 12 point test by Bertil Hatt
Are you ready for Data science? A 12 point testAre you ready for Data science? A 12 point test
Are you ready for Data science? A 12 point test
Bertil Hatt593 views
Scaling Training Data for AI Applications by Applause
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
Applause128 views
Data and analytic strategies for developing ethical it by Hyoun Park
Data and analytic strategies for developing ethical itData and analytic strategies for developing ethical it
Data and analytic strategies for developing ethical it
Hyoun Park231 views
Data Refinement: The missing link between data collection and decisions by Vivastream
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
Vivastream1K views
How to start your data career by Adwait Bhave
How to start your data careerHow to start your data career
How to start your data career
Adwait Bhave173 views
Enterprise search Information by Netwoven Inc.
Enterprise search Information Enterprise search Information
Enterprise search Information
Netwoven Inc. 338 views
Data Solutions for Products by Leadspace Director of Analytics by Product School
Data Solutions for Products by Leadspace Director of AnalyticsData Solutions for Products by Leadspace Director of Analytics
Data Solutions for Products by Leadspace Director of Analytics
Product School369 views
5 Reasons Content Strategy & Content Engineering Go Together Like Milk and Or... by Kanban Solutions
5 Reasons Content Strategy & Content Engineering Go Together Like Milk and Or...5 Reasons Content Strategy & Content Engineering Go Together Like Milk and Or...
5 Reasons Content Strategy & Content Engineering Go Together Like Milk and Or...
Kanban Solutions690 views

More from Connected Data World

Κnowledge Architecture: Combining Strategy, Data Science and Information Arch... by
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Connected Data World
485 views39 slides
How to get started with Graph Machine Learning by
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine LearningConnected Data World
186 views22 slides
Graphs in sustainable finance by
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable financeConnected Data World
1K views13 slides
The years of the graph: The future of the future is here by
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is hereConnected Data World
716 views27 slides
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2 by
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2Connected Data World
1.8K views22 slides
From Taxonomies and Schemas to Knowledge Graphs: Part 3 by
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3Connected Data World
778 views14 slides

More from Connected Data World(20)

Κnowledge Architecture: Combining Strategy, Data Science and Information Arch... by Connected Data World
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
The years of the graph: The future of the future is here by Connected Data World
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2 by Connected Data World
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Part 3 by Connected Data World
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
Graph in Apache Cassandra. The World’s Most Scalable Graph Database by Connected Data World
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d... by Connected Data World
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne... by Connected Data World
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Semantic similarity for faster Knowledge Graph delivery at scale by Connected Data World
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at... by Connected Data World
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
RAPIDS cuGraph – Accelerating all your Graph needs by Connected Data World
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
Elegant and Scalable Code Querying with Code Property Graphs by Connected Data World
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle... by Connected Data World
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
What are we Talking About, When we Talk About Ontology? by Connected Data World
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?

Recently uploaded

Future of Learning - Yap Aye Wee.pdf by
Future of Learning - Yap Aye Wee.pdfFuture of Learning - Yap Aye Wee.pdf
Future of Learning - Yap Aye Wee.pdfNUS-ISS
41 views11 slides
Melek BEN MAHMOUD.pdf by
Melek BEN MAHMOUD.pdfMelek BEN MAHMOUD.pdf
Melek BEN MAHMOUD.pdfMelekBenMahmoud
14 views1 slide
Understanding GenAI/LLM and What is Google Offering - Felix Goh by
Understanding GenAI/LLM and What is Google Offering - Felix GohUnderstanding GenAI/LLM and What is Google Offering - Felix Goh
Understanding GenAI/LLM and What is Google Offering - Felix GohNUS-ISS
41 views33 slides
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor... by
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...Vadym Kazulkin
75 views64 slides
Report 2030 Digital Decade by
Report 2030 Digital DecadeReport 2030 Digital Decade
Report 2030 Digital DecadeMassimo Talia
14 views41 slides
Attacking IoT Devices from a Web Perspective - Linux Day by
Attacking IoT Devices from a Web Perspective - Linux Day Attacking IoT Devices from a Web Perspective - Linux Day
Attacking IoT Devices from a Web Perspective - Linux Day Simone Onofri
15 views68 slides

Recently uploaded(20)

Future of Learning - Yap Aye Wee.pdf by NUS-ISS
Future of Learning - Yap Aye Wee.pdfFuture of Learning - Yap Aye Wee.pdf
Future of Learning - Yap Aye Wee.pdf
NUS-ISS41 views
Understanding GenAI/LLM and What is Google Offering - Felix Goh by NUS-ISS
Understanding GenAI/LLM and What is Google Offering - Felix GohUnderstanding GenAI/LLM and What is Google Offering - Felix Goh
Understanding GenAI/LLM and What is Google Offering - Felix Goh
NUS-ISS41 views
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor... by Vadym Kazulkin
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
Vadym Kazulkin75 views
Attacking IoT Devices from a Web Perspective - Linux Day by Simone Onofri
Attacking IoT Devices from a Web Perspective - Linux Day Attacking IoT Devices from a Web Perspective - Linux Day
Attacking IoT Devices from a Web Perspective - Linux Day
Simone Onofri15 views
[2023] Putting the R! in R&D.pdf by Eleanor McHugh
[2023] Putting the R! in R&D.pdf[2023] Putting the R! in R&D.pdf
[2023] Putting the R! in R&D.pdf
Eleanor McHugh38 views
Spesifikasi Lengkap ASUS Vivobook Go 14 by Dot Semarang
Spesifikasi Lengkap ASUS Vivobook Go 14Spesifikasi Lengkap ASUS Vivobook Go 14
Spesifikasi Lengkap ASUS Vivobook Go 14
Dot Semarang35 views
Data-centric AI and the convergence of data and model engineering: opportunit... by Paolo Missier
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier34 views
PharoJS - Zürich Smalltalk Group Meetup November 2023 by Noury Bouraqadi
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023
Noury Bouraqadi120 views
Business Analyst Series 2023 - Week 3 Session 5 by DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10209 views
How the World's Leading Independent Automotive Distributor is Reinventing Its... by NUS-ISS
How the World's Leading Independent Automotive Distributor is Reinventing Its...How the World's Leading Independent Automotive Distributor is Reinventing Its...
How the World's Leading Independent Automotive Distributor is Reinventing Its...
NUS-ISS15 views
DALI Basics Course 2023 by Ivory Egg
DALI Basics Course  2023DALI Basics Course  2023
DALI Basics Course 2023
Ivory Egg14 views
AMAZON PRODUCT RESEARCH.pdf by JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta15 views
.conf Go 2023 - Data analysis as a routine by Splunk
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine
Splunk93 views
Transcript: The Details of Description Techniques tips and tangents on altern... by BookNet Canada
Transcript: The Details of Description Techniques tips and tangents on altern...Transcript: The Details of Description Techniques tips and tangents on altern...
Transcript: The Details of Description Techniques tips and tangents on altern...
BookNet Canada130 views
Empathic Computing: Delivering the Potential of the Metaverse by Mark Billinghurst
Empathic Computing: Delivering  the Potential of the MetaverseEmpathic Computing: Delivering  the Potential of the Metaverse
Empathic Computing: Delivering the Potential of the Metaverse
Mark Billinghurst470 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software225 views
Perth MeetUp November 2023 by Michael Price
Perth MeetUp November 2023 Perth MeetUp November 2023
Perth MeetUp November 2023
Michael Price15 views
Black and White Modern Science Presentation.pptx by maryamkhalid2916
Black and White Modern Science Presentation.pptxBlack and White Modern Science Presentation.pptx
Black and White Modern Science Presentation.pptx
maryamkhalid291614 views

Crafting a Knowledge Graph Strategy - What to think about

  • 1. Crafting a Knowledge Graph Strategy What to think about Panos Alexopoulos Head of Ontology
  • 2. 2.5 years ago I got a mission ... • “Good morning Mr. Alexopoulos. Your mission, should you decide to accept it, is to build an HR and Recruitment Knowledge Graph.” • “As always, should you or any member of your team be caught or killed, the company will disavow any knowledge of your actions. “ • “This tape will self-destruct in 5 seconds.”
  • 3. … which I accepted as I had the perfect recipe ...
  • 4. … that often didn’t quite work as expected ...
  • 5. … leading to several hard-learned lessons ...
  • 6. … one of them being:
  • 7. Is it easy to build a Knowledge Graph?
  • 8. A tale of two graphs • EU’s ESCO knowledge graph: • Released September 2017 • 12K Skill and 3K Profession interrelated concepts in 27 languages • Built completely manually by experts in 6 years and cost ~15 million Euros • To be updated once a year by a maintenance committee • DBPedia knowledge graph • First release in 2007 • Currently >17M entities in 127 languages • Automatically extracted from Wikipedia infoboxes • Live version and monthly releases
  • 9. Why the difference? • Types of entities: Abstract entities more difficult to define than concrete ones (e.g. “Ontology” v.s. “The Netherlands”) • Types of relations: Vague relations harder to define and achieve consensus (e.g. “has essential skill” v.s. “was born in”) • Available data sources: There are no Wikipedia-like infoboxes for Professions and Skills that could enable automatic extraction. • Quality requirements: ESCO wanted very high accuracy.
  • 11. Strategy is about “What” rather than “How” • What domains of knowledge we want our Knowledge Graph to cover? • What data we want to connect? • What applications we want to power? • What users we want to serve? • What quality dimensions are important and at what levels?
  • 12. Strategy is about Choices, Dilemmas and Decisions • Domain A or Domain B? • Accuracy or Completeness? • Application A or Application B? • Data-Driven or Human-Driven? • Democracy, Oligarchy or Dictatorship? • Single or Multiple Truths?
  • 14. Know thy Goals • What business/technical goals are you trying to achieve via the Knowledge Graph? • Imitate the others? • Enhance current products/processes and get ahead of the competition ? If so, which? • Lay the foundations for new products/processes? If so, which? • Integrate dispersed data and improve their governance? • …
  • 15. • Extract!: Entity and Relation Tagging and Disambiguation in CVs and Vacancies • Search! and Match!: Query Understanding and Expansion, Semantic Similarity • Jobfeed: Labour Market Analytics Example: Products enhanced by the Textkernel Knowledge Graph
  • 16. Know thy Context • Domain(s) and Semantics • What elements does the Knowledge Graph need to provide (entity types, relations, etc)? • Is there ambiguity, vagueness, semantic drift? To what extent? • Technology Context • What knowledge can be automatically mined with the technology we have? • How fast and how accurately? • Organizational Context • Data, technology and products • People, knowledge and capabilities • Processes, practices and attitudes • Business Context • Business strategy and model(s) • Clients, partners and competitors • Trends, hypes, expectations and narratives
  • 17. Use Context to make choices and break dilemmas • If your semantics are highly vague and cause disagreements you might need to support multiple truths. • If the best extraction system for relation X is only 50% accurate, you will probably need manual curation. • If there is already a non-RDF semantics management infrastructure in the organization, it may not worth replacing it with an RDF one! • If product A generates more revenue than product B, you might need to optimize the KG for A. • If your competitor outperforms you in coverage because of more data and better infrastructure, you might want to compete on accuracy or some niche complex domain..
  • 18. Questions to think when you return to the office Do w w y e bu n (or t) a Kno d G ap ? Do w w o t an r u “Wha ” an b e d e m ? ➔ High-level goals are ok but you need concrete goals ➔ Any answer with “AI”, “Big Data” or other buzzword should be ignored! ➔ Semantics ➔ Data ➔ Technology ➔ Organization ➔ Business ➔ Basic philosophy and principles ➔ Definition of success (and failure) ➔ Decision making processes Do w w e c l e Con t?
  • 19. Thank you! Panos Alexopoulos Head of Ontology E-mail: alexopoulos@textkernel.nl Web: http://www.panosalexopoulos.com LinkedIn: www.linkedin.com/in/panosalexopoulos Twitter: @PAlexop