Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

ESWC2008 Identity OpenLink - On The Evolution of Terms

2,543 views

Published on

Published in: Technology, Education
  • Be the first to comment

  • Be the first to like this

ESWC2008 Identity OpenLink - On The Evolution of Terms

  1. 1. On The Evolution of Terms <ul><li>“ appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli et omnes bestias terrae...“ </li></ul><ul><li>Genesis, 2:20 </li></ul>© 2008 OpenLink Software, All rights reserved. Orri Erling - Program Manager, Virtuoso Yrjänä Rankka - Developer, Virtuoso
  2. 2. “ Adam called all things by their names” <ul><li>We are not the first to try </li></ul><ul><li>The history of the &quot;search for the perfect language&quot; favors natural languages </li></ul><ul><li>What gets used becomes fit for the task </li></ul><ul><li>Language building from scratch has generally not been successful </li></ul><ul><li>Forcing nature to fit preconceived, idealized taxonomies has generally failed </li></ul>© 2008 OpenLink Software, All rights reserved.
  3. 3. Usage for Linked Data <ul><li>New layer to the document web </li></ul><ul><li>Use de-referenceable HTTP URIs </li></ul><ul><li>Use #this to distinguish subject matter from document </li></ul><ul><li>Reuse terms where can </li></ul><ul><li>Human readable URI's are best </li></ul><ul><li>From HTML to XML, most formats are somewhat human readable, same for the data web </li></ul>© 2008 OpenLink Software, All rights reserved.
  4. 4. Universal “Data Language”? <ul><li>RDF will do for grammar </li></ul><ul><li>Useful vocabulary can only evolve in a community of practice </li></ul><ul><li>Distributed evolution of vocabulary and diversity of names for things is a given </li></ul><ul><li>Application determines the data structure and vocabulary </li></ul>© 2008 OpenLink Software, All rights reserved.
  5. 5. Emerging Hubs <ul><li>SIOC/FOAF For Web Data Spaces </li></ul><ul><li>Dbpedia/UMBEL/OpenCYC for names of real world entities </li></ul><ul><li>BFO for epistemology of structures and processes </li></ul>© 2008 OpenLink Software, All rights reserved.
  6. 6. Can’t Put Genie Back in the Bottle <ul><li>Once data is out, there it stays </li></ul><ul><li>Single identifiers for entities are possible only in closed, application specific DWs </li></ul><ul><li>Heterogeneity of names, overlap of descriptions, taxonomies etc is a given </li></ul><ul><li>Meaning exists only in context, so make this explicit </li></ul>© 2008 OpenLink Software, All rights reserved.
  7. 7. Infrastructure Scenarios <ul><li>Application-specific warehouse or mapped RDBMS </li></ul><ul><li>General warehouse with lots of graphs, a la search engine or Billion Triples Challenge </li></ul><ul><li>Query driven harvesting a la OpenLink Sponger </li></ul><ul><li>On-line discovery and federated SPARQL </li></ul>© 2008 OpenLink Software, All rights reserved.
  8. 8. Implications for Query <ul><li>Make it explicit </li></ul><ul><li>Report what data sets and SameAs's and graphs went into producing an answer </li></ul><ul><li>Allow the app to explicitly choose what graphs, SameAs's, taxonomies, etc. are considered </li></ul><ul><li>Search and discoverability will drive vocabulary convergence </li></ul>© 2008 OpenLink Software, All rights reserved.
  9. 9. Sameness <ul><li>SameAs and equivalent are necessary and permanent features </li></ul><ul><li>What can be considered identical depends on context </li></ul><ul><li>Universal agreement will not happen, so let people choose whose SameAs they trust </li></ul><ul><li>SameAs adds query cost and must be resolved at time of query </li></ul><ul><li>SameAs cannot be forward chained at web scale because which of them are relevant is not fixed, </li></ul><ul><li>Malicious/SPAM SameAs is inevitable </li></ul>© 2008 OpenLink Software, All rights reserved.
  10. 10. Implications For Publishers <ul><li>SPARQL and data self description practices are urgently needed </li></ul><ul><li>Describe what names are used and what other data meshes with yours </li></ul><ul><li>Explicit license </li></ul>© 2008 OpenLink Software, All rights reserved.
  11. 11. Implications for Entity NS <ul><li>DNS is good because of distributed, resilient storage and admin </li></ul><ul><li>Convergence cannot be forced but should be encouraged </li></ul><ul><li>Make administration compartmentalized a la DNS for No SPAM, No censorship </li></ul><ul><li>People can say things in their own spaces </li></ul><ul><li>Offer classification, e.g. UMBEL </li></ul><ul><li>In searching for terms, rank most reused the highest </li></ul>© 2008 OpenLink Software, All rights reserved.
  12. 12. Conclusion <ul><li>Since uniformity is impossible, make diversity of identifiers explicit </li></ul><ul><li>Build alongside the document web, de-referenceably and with #this </li></ul><ul><li>Encourage reuse but allow innovation </li></ul><ul><li>Needs of communities will differ according to stage of development </li></ul>© 2008 OpenLink Software, All rights reserved.
  13. 13. OpenLink Software © 2008 OpenLink Software, All rights reserved. Thank You! http://www.openlinksw.com

×