ESWC2008 Identity OpenLink - On The Evolution of Terms

    ESWC2008 Identity OpenLink - On The Evolution of Terms - Presentation Transcript

    1. On The Evolution of Terms
      • “appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli et omnes bestias terrae...”
      • Genesis, 2:20
      © 2008 OpenLink Software, All rights reserved.
      Orri Erling - Program Manager, Virtuoso
      Yrjänä Rankka - Developer, Virtuoso
    2. “Adam called all things by their names”
      • We are not the first to try
      • The history of the "search for the perfect language" favors natural languages
      • What gets used becomes fit for the task
      • Language building from scratch has generally not been successful
      • Forcing nature to fit preconceived, idealized taxonomies has generally failed
    3. Usage for Linked Data
      • New layer to the document web
      • Use de-referenceable HTTP URIs
      • Use #this to distinguish subject matter from document
      • Reuse terms where possible
      • Human-readable URIs are best
      • From HTML to XML, most formats are somewhat human-readable; the same should hold for the data web
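The #this convention can be illustrated with a minimal sketch. It assumes a hypothetical entity URI under example.org and uses only the Python standard library. The part before the fragment is the document that an HTTP client would fetch, and the full URI with #this names the subject matter described by that document.

```python
from urllib.parse import urldefrag


def split_subject_uri(uri):
    """Split an entity URI into (document URL, fragment).

    The document URL is what gets dereferenced over HTTP; the
    fragment (for example "this") marks the thing being described.
    """
    document_url, fragment = urldefrag(uri)
    return document_url, fragment


if __name__ == "__main__":
    # Hypothetical URI, used for illustration only.
    entity = "http://example.org/people/alice#this"
    document, fragment = split_subject_uri(entity)
    print("document:", document)
    print("fragment:", fragment)
```

A URI without a fragment comes back with an empty fragment string, which is one simple way to tell a plain document URI from an entity URI under this convention.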
    4. Universal “Data Language”?
      • RDF will do for grammar
      • Useful vocabulary can only evolve in a community of practice
      • Distributed evolution of vocabulary and diversity of names for things is a given
      • Application determines the data structure and vocabulary
    5. Emerging Hubs
      • SIOC/FOAF for Web Data Spaces
      • DBpedia/UMBEL/OpenCyc for names of real-world entities
      • BFO for epistemology of structures and processes
    6. Can’t Put Genie Back in the Bottle
      • Once data is out, there it stays
      • Single identifiers for entities are possible only in closed, application-specific data warehouses (DWs)
      • Heterogeneity of names, overlap of descriptions, taxonomies, etc. is a given
      • Meaning exists only in context, so make this explicit
    7. Infrastructure Scenarios
      • Application-specific warehouse or mapped RDBMS
      • General warehouse with lots of graphs, a la search engine or Billion Triples Challenge
      • Query-driven harvesting à la OpenLink Sponger
      • On-line discovery and federated SPARQL
    8. Implications for Query
      • Make it explicit
      • Report which data sets, SameAs statements, and graphs went into producing an answer
      • Allow the app to explicitly choose which graphs, SameAs statements, taxonomies, etc. are considered
      • Search and discoverability will drive vocabulary convergence
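The two query requirements above (let the application choose the graphs, and report which graphs produced each answer) can be sketched with a tiny in-memory quad store. This is an illustration only, not how Virtuoso or any SPARQL engine implements it; the `Quad` type, the graph names, and the sample data are all hypothetical.

```python
from collections import namedtuple

# A statement plus the named graph it came from.
Quad = namedtuple("Quad", "subject predicate object graph")

# Hypothetical sample data: three graphs disagree about one name.
QUADS = [
    Quad("ex:alice", "foaf:name", "Alice", "graph:a"),
    Quad("ex:alice", "foaf:name", "Alice Smith", "graph:b"),
    Quad("ex:alice", "foaf:name", "A. Spammer", "graph:untrusted"),
]


def query_objects(quads, subject, predicate, allowed_graphs):
    """Return {object: sorted list of graphs asserting it}.

    Only quads whose graph is in allowed_graphs are considered, so the
    caller decides the context, and every answer carries its provenance.
    """
    answers = {}
    for quad in quads:
        if (
            quad.graph in allowed_graphs
            and quad.subject == subject
            and quad.predicate == predicate
        ):
            answers.setdefault(quad.object, set()).add(quad.graph)
    return {obj: sorted(graphs) for obj, graphs in answers.items()}


if __name__ == "__main__":
    result = query_objects(QUADS, "ex:alice", "foaf:name", {"graph:a", "graph:b"})
    for value, graphs in result.items():
        print(value, "from", graphs)
```

Because the untrusted graph is not in the allowed set, its statement never reaches the answer, and the application can show the user exactly where each remaining value came from.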
    9. Sameness
      • SameAs and equivalent are necessary and permanent features
      • What can be considered identical depends on context
      • Universal agreement will not happen, so let people choose whose SameAs they trust
      • SameAs adds query cost and must be resolved at time of query
      • SameAs cannot be forward-chained at web scale, because which SameAs statements are relevant is not fixed in advance
      • Malicious/SPAM SameAs is inevitable
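Resolving SameAs at query time against a caller-chosen set of trusted sources might look like the sketch below. It is a simplified illustration, not a description of any particular engine: the identifiers, the source names, and the `equivalents` helper are hypothetical, and a plain breadth-first search stands in for whatever a real system would use.

```python
from collections import defaultdict, deque

# (left, right, source): hypothetical SameAs assertions, each attributed
# to the party that published it.
SAME_AS = [
    ("a:Berlin", "b:berlin", "source:a"),
    ("b:berlin", "c:city-42", "source:b"),
    ("a:Berlin", "spam:buy-now", "source:spammer"),
]


def equivalents(term, same_as, trusted_sources):
    """Return (names, sources_used) for term at query time.

    names is every identifier reachable from term through SameAs links
    published by trusted sources; sources_used records whose links were
    followed, so the answer can report its provenance.
    """
    neighbours = defaultdict(list)
    for left, right, source in same_as:
        if source in trusted_sources:
            # SameAs is symmetric, so index both directions.
            neighbours[left].append((right, source))
            neighbours[right].append((left, source))

    seen = {term}
    sources_used = set()
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for other, source in neighbours[current]:
            sources_used.add(source)
            if other not in seen:
                seen.add(other)
                queue.append(other)
    return seen, sources_used


if __name__ == "__main__":
    names, sources = equivalents("a:Berlin", SAME_AS, {"source:a", "source:b"})
    print(sorted(names))
    print(sorted(sources))
```

Nothing is materialized ahead of time: changing the trusted set changes the equivalence class on the next query, which is the reason forward chaining does not fit when trust is chosen per application.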
    10. Implications For Publishers
      • SPARQL and data self-description practices are urgently needed
      • Describe what names are used and what other data meshes with yours
      • State the license explicitly
    11. Implications for Entity NS
      • DNS is good because of distributed, resilient storage and admin
      • Convergence cannot be forced but should be encouraged
      • Compartmentalize administration à la DNS: no spam, no censorship
      • People can say things in their own spaces
      • Offer classification, e.g. UMBEL
      • When searching for terms, rank the most reused highest
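Ranking terms by reuse could be approximated as in the following sketch. It assumes, purely for illustration, that reuse is measured as the number of distinct data sets in which a term appears; the data set names and the `rank_terms_by_reuse` helper are hypothetical.

```python
from collections import Counter


def rank_terms_by_reuse(datasets):
    """Rank terms by how many distinct data sets use them.

    datasets maps a data set name to the list of terms it uses. A term
    repeated inside one data set counts once. Ties are broken
    alphabetically so the ordering is deterministic.
    """
    counts = Counter()
    for terms in datasets.values():
        counts.update(set(terms))
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Hypothetical usage data.
    usage = {
        "dataset-1": ["foaf:name", "foaf:knows"],
        "dataset-2": ["foaf:name", "ex:fullName"],
        "dataset-3": ["foaf:name", "foaf:knows", "foaf:name"],
    }
    for term, count in rank_terms_by_reuse(usage):
        print(term, count)
```

Counting data sets rather than raw occurrences keeps one very large publisher from dominating the ranking, which matters if the goal is to encourage convergence across communities.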
    12. Conclusion
      • Since uniformity is impossible, make diversity of identifiers explicit
      • Build alongside the document web, de-referenceably and with #this
      • Encourage reuse but allow innovation
      • Needs of communities will differ according to stage of development
    13. OpenLink Software
      • Thank You!
      • http://www.openlinksw.com