SlideShare a Scribd company logo
1 of 17
Linked Metadata by Design
Presented by: Shirley Bacso
Data Architect at Ingka Digital, Netherlands
A few words
about me
• Data Architect at Ingka Digital, Netherlands
• Specialize in metadata
• Love problem-solving
• Love learning
• Graph enthusiast since June 2021
June 2021
Oct 2021
May 2022
Implementing
Business
Glossary
Ingka Graph
Event with
Neo4j
Feb 2023
Knowledge Graphs
2023, Prof. Dr.
Harald Sack
Oct 2023
DKG MVP
Dec 2023
NeoDash
Feb 2024
Learn graph
technology
Exam case
studies
Research metadata
management
best practices
Study organization
realities
Observe people
Hunt for metadata
Hunt for metadata
Hunt for metadata
Observe people
Study organization
realities
Hunt for metadata
Observe people
My journey
Metadata – the knowledge of our data
What is linked
metadata by
design?
• Linked metadata represents the integration
of outcomes from human collaboration. It
collectively forms the knowledge about
data, which is captured in the data
knowledge graph.
• By design emphasizes that the capture and
integration of metadata should start from
the conceptual level and continue through
the logical level, as human collaboration
begins at the design phase of data product
development.
“Metadata as a product”
Conceptual
• Right competence
• Well-defined processes
• Supporting tools
• Smart curation
Pain points of “Metadata as a by-product” and relieves
DAMA DMBOK says:
1. Do these metadata exist?
2. Which ones are correct to use?
3. Do I have the time to link
everything to my data?
Architecture
Various
metadata
sources
L0 L1 L2 L3
“Sources of knowledge” “Data knowledge graph”
The Ontology Development Process
Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology
Ontology development in practice is an iterative process
that repeats continuously and improves the ontology.
Which knowledge
areas should be
covered by the
knowledge graph?
Collectively capture the broader knowledge about data
What concepts
and relationships
should be in the
knowledge graph?
arrows.app
What types of questions should be answered
by the knowledge graph?
Do all the data
columns/fields have
business names?
Are the data compliant
to (retention) rules?
What are the gaps
between the data quality
dimension goals and the
reality?
Do all the databases
connected to the
software systems?
What systems does a
certain digital domain
have?
What type of personal
data does a software
system have?
Find the most relevant
dataset
(recommendation
engine) and request
access.
Which data products are
overlapping? (Think
about the Principle of
Data as a Product)
How was the data
transformed throughout
the lifetime?
(Provenance and
operational lineage)
…
Example of
manual
knowledge
curation in
NeoDash
Idea on augmented solution
- Word embeddings for semantic similarity
Idea on augmented solution
- Knowledge graph embeddings for link prediction
• Translational Embeddings
o Unsupervised methods, e.g. TransE
o Supervised methods for prediction,
based on embedding vectors
• Transductive Link Prediction
• Inductive Link Prediction:
o Fully-inductive Link Prediction
o Semi-inductive Link Prediction
Knowledge Graph, Semantic Web Technology
Bring data to people
where they work
The importance of User Interface for
seamless experience
• Pros and cons of interaction models:
“Point-to-point” vs “Hub-and-spoke”
• How to solve the pain of “hop-
between-the-solutions”?
An inspiration
- A browser extension
Challenges on the Graph
Journey
The understanding of the benefits in graph
technology is not there yet at all levels. We need
more advocates to get buy-in and fund for
resources.
• Ontology and modeling
• Data governance
• Data engineering
• Graph data science
• UX design and UI building
• …
Thank you!
Reach out on Shirley Bacso

More Related Content

Similar to Ingka Digital: Linked Metadata by Design

Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsSusanMRob
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION Elvis Muyanja
 
DATA SCIENCE COURSE FEATURES
DATA SCIENCE COURSE FEATURESDATA SCIENCE COURSE FEATURES
DATA SCIENCE COURSE FEATURESUncodemy
 
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptxExplorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptxwindu19
 
Neo4j on Microsoft Azure
Neo4j on Microsoft AzureNeo4j on Microsoft Azure
Neo4j on Microsoft AzureNeo4j
 
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphClimbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphEnterprise Knowledge
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceJuuso Parkkinen
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
 
Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Memoori
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration James Hendler
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-JanuaryDataMites
 
Data Science Training in Chennai-January
Data Science Training in Chennai-JanuaryData Science Training in Chennai-January
Data Science Training in Chennai-JanuaryDataMites
 
Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1DataMites
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-JanuaryDataMites
 
Starting a career in data science
Starting a career in data scienceStarting a career in data science
Starting a career in data scienceBrian Spiering
 
KM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris Marino
KM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris MarinoKM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris Marino
KM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris MarinoKM Institute
 
Getting started with with SharePoint Syntex
Getting started with with SharePoint SyntexGetting started with with SharePoint Syntex
Getting started with with SharePoint SyntexDrew Madelung
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceMark West
 

Similar to Ingka Digital: Linked Metadata by Design (20)

Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libs
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
 
DataScience.pptx
DataScience.pptxDataScience.pptx
DataScience.pptx
 
DATA SCIENCE COURSE FEATURES
DATA SCIENCE COURSE FEATURESDATA SCIENCE COURSE FEATURES
DATA SCIENCE COURSE FEATURES
 
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptxExplorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
 
Neo4j on Microsoft Azure
Neo4j on Microsoft AzureNeo4j on Microsoft Azure
Neo4j on Microsoft Azure
 
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphClimbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data Science
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!
 
What is data science ?
What is data science ?What is data science ?
What is data science ?
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-January
 
Data Science Training in Chennai-January
Data Science Training in Chennai-JanuaryData Science Training in Chennai-January
Data Science Training in Chennai-January
 
Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-January
 
Starting a career in data science
Starting a career in data scienceStarting a career in data science
Starting a career in data science
 
KM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris Marino
KM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris MarinoKM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris Marino
KM SHOWCASE 2020 - "Lessons Learned Building a Knowledge Graph" - Chris Marino
 
Getting started with with SharePoint Syntex
Getting started with with SharePoint SyntexGetting started with with SharePoint Syntex
Getting started with with SharePoint Syntex
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data Science
 

More from Neo4j

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansNeo4j
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...Neo4j
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosNeo4j
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Neo4j
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Neo4j
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeNeo4j
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsNeo4j
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j
 
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...Neo4j
 

More from Neo4j (20)

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
 

Ingka Digital: Linked Metadata by Design

  • 1. Linked Metadata by Design Presented by: Shirley Bacso Data Architect at Ingka Digital, Netherlands
  • 2. A few words about me • Data Architect at Ingka Digital, Netherlands • Specialize in metadata • Love problem-solving • Love learning • Graph enthusiast since June 2021
  • 3. June 2021 Oct 2021 May 2022 Implementing Business Glossary Ingka Graph Event with Neo4j Feb 2023 Knowledge Graphs 2023, Prof. Dr. Harald Sack Oct 2023 DKG MVP Dec 2023 NeoDash Feb 2024 Learn graph technology Exam case studies Research metadata management best practices Study organization realities Observe people Hunt for metadata Hunt for metadata Hunt for metadata Observe people Study organization realities Hunt for metadata Observe people My journey Metadata – the knowledge of our data
  • 4. What is linked metadata by design? • Linked metadata represents the integration of outcomes from human collaboration. It collectively forms the knowledge about data, which is captured in the data knowledge graph. • By design emphasizes that the capture and integration of metadata should start from the conceptual level and continue through the logical level, as human collaboration begins at the design phase of data product development.
  • 5. “Metadata as a product” Conceptual • Right competence • Well-defined processes • Supporting tools • Smart curation Pain points of “Metadata as a by-product” and relieves DAMA DMBOK says: 1. Do these metadata exist? 2. Which ones are correct to use? 3. Do I have the time to link everything to my data?
  • 6. Architecture Various metadata sources L0 L1 L2 L3 “Sources of knowledge” “Data knowledge graph”
  • 7. The Ontology Development Process Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology Ontology development in practice is an iterative process that repeats continuously and improves the ontology.
  • 8. Which knowledge areas should be covered by the knowledge graph? Collectively capture the broader knowledge about data
  • 9. What concepts and relationships should be in the knowledge graph? arrows.app
  • 10. What types of questions should be answered by the knowledge graph? Do all the data columns/fields have business names? Are the data compliant to (retention) rules? What are the gaps between the data quality dimension goals and the reality? Do all the databases connected to the software systems? What systems does a certain digital domain have? What type of personal data does a software system have? Find the most relevant dataset (recommendation engine) and request access. Which data products are overlapping? (Think about the Principle of Data as a Product) How was the data transformed throughout the lifetime? (Provenance and operational lineage) …
  • 12. Idea on augmented solution - Word embeddings for semantic similarity
  • 13. Idea on augmented solution - Knowledge graph embeddings for link prediction • Translational Embeddings o Unsupervised methods, e.g. TransE o Supervised methods for prediction, based on embedding vectors • Transductive Link Prediction • Inductive Link Prediction: o Fully-inductive Link Prediction o Semi-inductive Link Prediction Knowledge Graph, Semantic Web Technology
  • 14. Bring data to people where they work The importance of User Interface for seamless experience • Pros and cons of interaction models: “Point-to-point” vs “Hub-and-spoke” • How to solve the pain of “hop- between-the-solutions”?
  • 15. An inspiration - A browser extension
  • 16. Challenges on the Graph Journey The understanding of the benefits in graph technology is not there yet at all levels. We need more advocates to get buy-in and fund for resources. • Ontology and modeling • Data governance • Data engineering • Graph data science • UX design and UI building • …
  • 17. Thank you! Reach out on Shirley Bacso