SlideShare a Scribd company logo
1 of 42
keynote presentation at
DC-2010 Conference
Pittsburgh, PA
October 22, 2010
Bridging the Gaps:
Adaptive Approaches to Data Interoperability
Michael K. Bergman
2
The Iconoclast Cometh
3
Outline of Talk
Linked Data
Data Web, Structured Data and Semantic Web
Players and Roles  DCMI
Conclusions
4
Three Overall Assertions
<LinkedData> <isA> <ValuableTechnique>
<DataWeb> <hasNeedOf> <Semantics>
<DCMI> <hasRole> <Unique>
Linked Data
6
Three Linked Data Assertions
<LinkedData> <isA> <PreferredTechnique>
<Techniques> <doNotSolve> <RootChallenges>
<RDF> <hasBestRoleAs> <CanonicalDataModel>
7
Three More Linked Data Assertions
<LinkedData> <hasGrowing> <Triples>
<LDUsers> <wronglyUse> <ManyPredicates>
<LinkedData> <hasLack> <MajorUptake>
8
25 Billion Linked Data Triples
9
Bad Results from sameAs Misuse
10
The State of Linked Data
 Growing, but not as fast as promise would suggest
 Not used much, except curated settings
 Few actual dataset linkages
 NO true interoperability, except curated (life science,
some others)
 Difficult to publish
 If done right, best form to consume
Data, Structure and Semantic Web
12
Three Structured Data Web Assertions
<Heterogeneity> <isA> <Reality>
<LinkedData> <isOnly> <TinyContributor>
<Semantics> <isThe> <MissingLink>
13
Hundreds of Formats in the Wild
14
How to Aliquot the Firehose ?
15
Three Semantics Assertions (+ Axiom)
<ReferenceVocabs> <organize> <MassiveContent>
<LinkingPredicates> <gather> <RelatedContent>
<intersectionOf>
<SemanticContent> <enables> <MeaningfulWork>
16
Fixed References Help Orient
17
Concepts are the Fixed References
18
Design Aspects of Reference Concepts
 Truly are concepts, the idea of a thing
 Labels are language independent (à la SKOS):
 Preferred, human-readable label (prefLabel)
 Many, alternate synonyms, jargon, etc. (altLabel)
 Misspellings (hiddenLabel)
 all combined for tagging, IE purposes
 MUST have definition: what does this concept mean ?
 Organized into coherent structures (graphs)
 Inferencing
 Discovery and navigation
 Act as both classes and instances (RDF / OWL-speak)
 MUST have persistent URIs
19
Mappings Get Stuff into the Right Room
20
Many Mappings Should be Approximate
 skos:broadMatch
 skos:related
 ore:similarTo
 umbel:isAbout
 vmf:isInVocabulary
 skos:closeMatch
 lvont:nearlySameAs
 umbel:isLike
 umbel:hasCharacteristic
 lvont:somewhatSameAs
 rdfs:seeAlso
 ore:describes
 map:narrowerThan
 skos:narrower
 map:broaderThan
 skos:broader
 dc:subject
 link:uri
 foaf:isPrimaryTopicOf
21
Some Conditions for Interoperability
<Interoperability> <needsMapping> <Predicates>
<Interoperability> <needsReference> <Nouns>
Three Major Players
23
World Role
<World> <hasRole> <ContentAndStructure>
24
W3C Role
<W3C> <hasRole> <Standards>
25
DCMI Role
<DCMI> <hasRole> <ReferenceMetadata>
26
Three Going Forward Assertions
<LinkedData> <hasNeedOf> <MapPredicates>
<DataWeb> <hasNeedOf> <ReferenceConcepts>
<DCMI> <hasUniqueRole> <BothRequirements>
27
DCMI: the Unique Franchise
 DCMI already has unique authority in:
1. dc:subject
2. dc:subject qualifiers
3. initial Open Registry effort
4. core foundational properties
 DCMI has unique experience in:
1. diverse vocabularies
2. cataloging and classification
3. semantics
28
Reference Authority - Needed DCMI Role
<RefMetadata> <notSameAs> <OneRingRulesAll>
29
Reference Metadata is Not a Third Rail
30
The Web is Parched for Semantics
 Reference vocabularies
 Persistent URIs
 Re-use of vocabs
 Vetting + ranking
 Alignment services
 Annotation services
 RDFa injection
 Open source frameworks
31
We’re also Ready to Help
+
+ + + ???
32
A First Exemplar: FactForge
 A “reason-able” view to linked open data
 Pre-loaded semantic repository: reasoning, querying, exploration
 Ontologies
 Dublin Core, SKOS, RSS, FOAF
 Datasets
 DBpedia, Freebase, Geonames, UMBEL, MusicBrainz, Wordnet, CIA
World Factbook, Lingvoj
 Very large scale
 1.2B explicit + 0.9B inferred  10B retrievable statements
 Managed by BigOWLIM
 Free public service with many features:
 Auto-suggest
 Query and explore through Forest, RelFinder and Tabulator
 RDF search
 SPARQL end-point
33
Next Step, RENDER
 New EU project
 Large-scale LOD interoperability, methods
 Players:
 Karlsruher Institut fuer Technologie (DE)
 Ontotext (BG)
 Institut Jozef Stefan (SI)
 Telefonica (ES)
 Google (IE)
 Wikimedia (DE)
 STI Innsbruck (AT)
 Testbed for possible follow-ons ??
34
Possible Ontotext + SD Contributions
1. Mapping services to all comers (“vocabulary neutrality”)
2. Tagging services
3. Software + systems for other tagging services
4. Possible technical support for Metadata Registry
5. Lead / support for possible EU grant-seeking efforts
↓↓↓
If DCMI willing to partner, Ontotext + SD willing to
contribute in a neutral, open source manner
35
Ontotext + SD Links
 FactForge
http://www.factforge.net
 PROTON
http://proton.semanticweb.com
 Ontotext
http://www.ontotext.com
 RENDER
http://render-project.eu
 UMBEL
http://www.umbel.org
 Structured Dynamics
http://structureddynamics.com
Conclusion
37
Main Assertions Re-visited
 Interoperability on the Web not working:
1. Not (generally) fulfilled by linked data in current state
2. Predicates for approximate mappings lacking
3. Reference vocabularies essential as connecting nodes
 DCMI is the best (only?) player to plug these gaps
 We are willing to help find the resources + right
process to help plug the interoperability gap
38
DCMI Interoperability Services ?
Q & A
41
Contacts & Information
Michael K. Bergman
CEO
319.621.5225
mike@structureddynamics.com
blog: www.mkbergman.com
Web Sites
structureddynamics.com
citizen-dan.org (community indicator
systems)
openstructs.org (open source software)
techwiki.openstructs.org (open license
technical documentation)
umbel.org
umbel.structureddynamics.com (UMBEL
Web services)
DCMI Keynote: Bridging the Semantic Gaps and Interoperability

More Related Content

What's hot

Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
Srinath Srinivasa
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACL
Connected Data World
 

What's hot (20)

Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
External CV support in Dataverse 5.7
External CV support in Dataverse 5.7External CV support in Dataverse 5.7
External CV support in Dataverse 5.7
 
The Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of LeipzigThe Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of Leipzig
 
Linking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process DescriptionsLinking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process Descriptions
 
Semantic Web Nature
Semantic Web NatureSemantic Web Nature
Semantic Web Nature
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge Graph
 
Linked data life cycles
Linked data life cyclesLinked data life cycles
Linked data life cycles
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 
Cognitive data
Cognitive dataCognitive data
Cognitive data
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data Challenges
 
Structured Data for the Financial Industry
Structured Data for the Financial Industry Structured Data for the Financial Industry
Structured Data for the Financial Industry
 
Going for GOLD - Adventures in Open Linked Metadata
Going for GOLD - Adventures in Open Linked MetadataGoing for GOLD - Adventures in Open Linked Metadata
Going for GOLD - Adventures in Open Linked Metadata
 
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge ScientistEthics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACL
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Semantics for Big Data Integration and Analysis
Semantics for Big Data Integration and AnalysisSemantics for Big Data Integration and Analysis
Semantics for Big Data Integration and Analysis
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Creating Linked Data from Relational Databases
Creating Linked Data from Relational DatabasesCreating Linked Data from Relational Databases
Creating Linked Data from Relational Databases
 

Viewers also liked

Context & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab AppContext & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab App
Joshua Underwood
 
Ticclassiques
TicclassiquesTicclassiques
Ticclassiques
iesrb4
 
Coretta Scott King Shihab
Coretta Scott King ShihabCoretta Scott King Shihab
Coretta Scott King Shihab
anaq
 
Programas De
Programas DeProgramas De
Programas De
tat
 
Ch14 OS
Ch14 OSCh14 OS
Ch14 OS
C.U
 
Tales of an Open Scholar
Tales of an Open ScholarTales of an Open Scholar
Tales of an Open Scholar
ethan.watrall
 

Viewers also liked (20)

Context & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab AppContext & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab App
 
Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...
Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...
Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...
 
miLexicon @ Eurocall2010
miLexicon @ Eurocall2010miLexicon @ Eurocall2010
miLexicon @ Eurocall2010
 
Foundations of Open Source Economic Development Presentation 2 Curve 1
Foundations of Open Source Economic Development Presentation 2 Curve 1Foundations of Open Source Economic Development Presentation 2 Curve 1
Foundations of Open Source Economic Development Presentation 2 Curve 1
 
Ticclassiques
TicclassiquesTicclassiques
Ticclassiques
 
Salzburg
SalzburgSalzburg
Salzburg
 
User Testing Tactics
User Testing TacticsUser Testing Tactics
User Testing Tactics
 
Coretta Scott King Shihab
Coretta Scott King ShihabCoretta Scott King Shihab
Coretta Scott King Shihab
 
Social and business activities alignment
Social and business activities alignmentSocial and business activities alignment
Social and business activities alignment
 
The commoditization and fragmentation of the ia community
The commoditization and fragmentation of the ia communityThe commoditization and fragmentation of the ia community
The commoditization and fragmentation of the ia community
 
Intj0808pdf
Intj0808pdfIntj0808pdf
Intj0808pdf
 
Top Reasons To Recycle
Top Reasons To RecycleTop Reasons To Recycle
Top Reasons To Recycle
 
Programas De
Programas DeProgramas De
Programas De
 
Is This Clickable? - Change how you look at the web
Is This Clickable? - Change how you look at the webIs This Clickable? - Change how you look at the web
Is This Clickable? - Change how you look at the web
 
Ch14 OS
Ch14 OSCh14 OS
Ch14 OS
 
Tales of an Open Scholar
Tales of an Open ScholarTales of an Open Scholar
Tales of an Open Scholar
 
Data-driven Applications with conStruct
Data-driven Applications with conStructData-driven Applications with conStruct
Data-driven Applications with conStruct
 
Publicness
PublicnessPublicness
Publicness
 
Googley Family Philanthropy
Googley Family PhilanthropyGoogley Family Philanthropy
Googley Family Philanthropy
 
User Experience Utopia (Ad Club Seattle)
User Experience Utopia (Ad Club Seattle)User Experience Utopia (Ad Club Seattle)
User Experience Utopia (Ad Club Seattle)
 

Similar to DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Tomek Pluskiewicz
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
Mediabistro
 

Similar to DCMI Keynote: Bridging the Semantic Gaps and Interoperability (20)

The Future of LOD
The Future of LODThe Future of LOD
The Future of LOD
 
OSLC & The Future of Interoperability
OSLC & The Future of InteroperabilityOSLC & The Future of Interoperability
OSLC & The Future of Interoperability
 
Web Data Management in the RDF Age
Web Data Management in the RDF AgeWeb Data Management in the RDF Age
Web Data Management in the RDF Age
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Web
 
Linking Open Data
Linking Open DataLinking Open Data
Linking Open Data
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA Keynote
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data Cloud
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
Towards Virtual Knowledge Graphs over Web APIs
Towards Virtual Knowledge Graphs over Web APIsTowards Virtual Knowledge Graphs over Web APIs
Towards Virtual Knowledge Graphs over Web APIs
 
Linked data and voyager
Linked data and voyagerLinked data and voyager
Linked data and voyager
 
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 PittsburghDCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
 
Map of the CETIS metadata and digital repository interoperability domain
Map of the CETIS metadata and digital repository interoperability domainMap of the CETIS metadata and digital repository interoperability domain
Map of the CETIS metadata and digital repository interoperability domain
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
 
Virtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFVirtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDF
 
The Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge RepresentationThe Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge Representation
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 

More from Mike Bergman

More from Mike Bergman (6)

Context, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge OntologyContext, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge Ontology
 
Seven Arguments for Semantic Technologies
Seven Arguments for Semantic TechnologiesSeven Arguments for Semantic Technologies
Seven Arguments for Semantic Technologies
 
The Rationale for Semantic Technologies
The Rationale for Semantic TechnologiesThe Rationale for Semantic Technologies
The Rationale for Semantic Technologies
 
Structured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product StackStructured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product Stack
 
UMBEL: Subject Concepts Layer for the Web
UMBEL: Subject Concepts Layer for the WebUMBEL: Subject Concepts Layer for the Web
UMBEL: Subject Concepts Layer for the Web
 
UMBEL Semantic Web Services
UMBEL Semantic Web ServicesUMBEL Semantic Web Services
UMBEL Semantic Web Services
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

DCMI Keynote: Bridging the Semantic Gaps and Interoperability

  • 1. keynote presentation at DC-2010 Conference Pittsburgh, PA October 22, 2010 Bridging the Gaps: Adaptive Approaches to Data Interoperability Michael K. Bergman
  • 3. 3 Outline of Talk Linked Data Data Web, Structured Data and Semantic Web Players and Roles  DCMI Conclusions
  • 4. 4 Three Overall Assertions <LinkedData> <isA> <ValuableTechnique> <DataWeb> <hasNeedOf> <Semantics> <DCMI> <hasRole> <Unique>
  • 6. 6 Three Linked Data Assertions <LinkedData> <isA> <PreferredTechnique> <Techniques> <doNotSolve> <RootChallenges> <RDF> <hasBestRoleAs> <CanonicalDataModel>
  • 7. 7 Three More Linked Data Assertions <LinkedData> <hasGrowing> <Triples> <LDUsers> <wronglyUse> <ManyPredicates> <LinkedData> <hasLack> <MajorUptake>
  • 8. 8 25 Billion Linked Data Triples
  • 9. 9 Bad Results from sameAs Misuse
  • 10. 10 The State of Linked Data  Growing, but not as fast as promise would suggest  Not used much, except curated settings  Few actual dataset linkages  NO true interoperability, except curated (life science, some others)  Difficult to publish  If done right, best form to consume
  • 11. Data, Structure and Semantic Web
  • 12. 12 Three Structured Data Web Assertions <Heterogeneity> <isA> <Reality> <LinkedData> <isOnly> <TinyContributor> <Semantics> <isThe> <MissingLink>
  • 13. 13 Hundreds of Formats in the Wild
  • 14. 14 How to Aliquot the Firehose ?
  • 15. 15 Three Semantics Assertions (+ Axiom) <ReferenceVocabs> <organize> <MassiveContent> <LinkingPredicates> <gather> <RelatedContent> <intersectionOf> <SemanticContent> <enables> <MeaningfulWork>
  • 17. 17 Concepts are the Fixed References
  • 18. 18 Design Aspects of Reference Concepts  Truly are concepts, the idea of a thing  Labels are language independent (à la SKOS):  Preferred, human-readable label (prefLabel)  Many, alternate synonyms, jargon, etc. (altLabel)  Misspellings (hiddenLabel)  all combined for tagging, IE purposes  MUST have definition: what does this concept mean ?  Organized into coherent structures (graphs)  Inferencing  Discovery and navigation  Act as both classes and instances (RDF / OWL-speak)  MUST have persistent URIs
  • 19. 19 Mappings Get Stuff into the Right Room
  • 20. 20 Many Mappings Should be Approximate  skos:broadMatch  skos:related  ore:similarTo  umbel:isAbout  vmf:isInVocabulary  skos:closeMatch  lvont:nearlySameAs  umbel:isLike  umbel:hasCharacteristic  lvont:somewhatSameAs  rdfs:seeAlso  ore:describes  map:narrowerThan  skos:narrower  map:broaderThan  skos:broader  dc:subject  link:uri  foaf:isPrimaryTopicOf
  • 21. 21 Some Conditions for Interoperability <Interoperability> <needsMapping> <Predicates> <Interoperability> <needsReference> <Nouns>
  • 23. 23 World Role <World> <hasRole> <ContentAndStructure>
  • 25. 25 DCMI Role <DCMI> <hasRole> <ReferenceMetadata>
  • 26. 26 Three Going Forward Assertions <LinkedData> <hasNeedOf> <MapPredicates> <DataWeb> <hasNeedOf> <ReferenceConcepts> <DCMI> <hasUniqueRole> <BothRequirements>
  • 27. 27 DCMI: the Unique Franchise  DCMI already has unique authority in: 1. dc:subject 2. dc:subject qualifiers 3. initial Open Registry effort 4. core foundational properties  DCMI has unique experience in: 1. diverse vocabularies 2. cataloging and classification 3. semantics
  • 28. 28 Reference Authority - Needed DCMI Role <RefMetadata> <notSameAs> <OneRingRulesAll>
  • 29. 29 Reference Metadata is Not a Third Rail
  • 30. 30 The Web is Parched for Semantics  Reference vocabularies  Persistent URIs  Re-use of vocabs  Vetting + ranking  Alignment services  Annotation services  RDFa injection  Open source frameworks
  • 31. 31 We’re also Ready to Help + + + + ???
  • 32. 32 A First Exemplar: FactForge  A “reason-able” view to linked open data  Pre-loaded semantic repository: reasoning, querying, exploration  Ontologies  Dublin Core, SKOS, RSS, FOAF  Datasets  DBpedia, Freebase, Geonames, UMBEL, MusicBrainz, Wordnet, CIA World Factbook, Lingvoj  Very large scale  1.2B explicit + 0.9B inferred  10B retrievable statements  Managed by BigOWLIM  Free public service with many features:  Auto-suggest  Query and explore through Forest, RelFinder and Tabulator  RDF search  SPARQL end-point
  • 33. 33 Next Step, RENDER  New EU project  Large-scale LOD interoperability, methods  Players:  Karlsruher Institut fuer Technologie (DE)  Ontotext (BG)  Institut Jozef Stefan (SI)  Telefonica (ES)  Google (IE)  Wikimedia (DE)  STI Innsbruck (AT)  Testbed for possible follow-ons ??
  • 34. 34 Possible Ontotext + SD Contributions 1. Mapping services to all comers (“vocabulary neutrality”) 2. Tagging services 3. Software + systems for other tagging services 4. Possible technical support for Metadata Registry 5. Lead / support for possible EU grant-seeking efforts ↓↓↓ If DCMI willing to partner, Ontotext + SD willing to contribute in a neutral, open source manner
  • 35. 35 Ontotext + SD Links  FactForge http://www.factforge.net  PROTON http://proton.semanticweb.com  Ontotext http://www.ontotext.com  RENDER http://render-project.eu  UMBEL http://www.umbel.org  Structured Dynamics http://structureddynamics.com
  • 37. 37 Main Assertions Re-visited  Interoperability on the Web not working: 1. Not (generally) fulfilled by linked data in current state 2. Predicates for approximate mappings lacking 3. Reference vocabularies essential as connecting nodes  DCMI is the best (only?) player to plug these gaps  We are willing to help find the resources + right process to help plug the interoperability gap
  • 39.
  • 40. Q & A
  • 41. 41 Contacts & Information Michael K. Bergman CEO 319.621.5225 mike@structureddynamics.com blog: www.mkbergman.com Web Sites structureddynamics.com citizen-dan.org (community indicator systems) openstructs.org (open source software) techwiki.openstructs.org (open license technical documentation) umbel.org umbel.structureddynamics.com (UMBEL Web services)

Editor's Notes

  1. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  2. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  3. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  4. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  5. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  6. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  7. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated