Your SlideShare is downloading. ×
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Upcoming SlideShare
Loading in...5

Thanks for flagging this SlideShare!

Oops! An error has occurred.

Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Datalift: A Catalyser for the Web of Data - Francois Scharffe


Published on

Talk at Web Science Montpellier Meetup - 13th May 2011

Talk at Web Science Montpellier Meetup - 13th May 2011

Published in: Education
  • Be the first to comment

  • Be the first to like this

No Downloads
Total Views
On Slideshare
From Embeds
Number of Embeds
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

No notes for slide


  • 1. Datalift: A Catalyser for the Web of Data François Scharffe University of Montpellier, LIRMM, INRIA @lechatpitoWith the help of the Datalift teamAnd the support of the French National Research Agency Webscience meetup 5/02/2011 1
  • 2. Datalift and Web-Science
  • 3. DataliftA large scale Web data publication experiment.Objectives:- Publish reference datasets- Automate the data publication process- Show the interest of publishing linked data 3
  • 4. DataliftMotivation:- Two phenomena: - Society – Open Data - Technology – Semantic WebData revolution going on : the web of data is explosing as the web of documents exploded in the 90 4
  • 5. DataliftDatasets publicationR&D to automate the publication processA modular architecture to assist datapublicationTraining, tutorials, data publication camps
  • 6. Welcome aboard the data lift Published and interlinked data on the Web Applications InterconnexionPublication infrastructure Data convertion Vocabulary selection Raw data
  • 7. st 1 floor - SelectionSemWebPro 18/01/2011 7
  • 8. Vocabulary selection Vocabularies for linked-data ● Are meant to describe resources in RDF ● Are based on one of the standard W3C language RDFS and OWLØ What makes a good vocabulary ? ● A good vocabulary is a used vocabulary ● Other usability criterias : Simplicity, visibility, documentation, flexibility, semantic integration, social integrationØ Types of vocabularies ● Metadata, reference, domain, general
  • 9. Vocabulary of a FriendØØ A simple vocabulary...Ø To represent interconnexions between vocabulariesØ A unique entry point to vocabularies and Datasets of the linked-data cloud Linked Data CloudØ Ongoing work in Datalift
  • 10. nd 2 floor - ConversionSemWebPro 18/01/2011 10
  • 11. Reference datasets, URI design● Providing reference datasets for the French ecosystem: geographical, topological, statistical, political. Ex:● Providing URI design guidelines ● Opaque or transparent URIs ? ● Usage of accents in URIs
  • 12. Convertion tools to RDFØ How is the raw data to be converted ? § Relational Database ? § (Semi-)structured formats ? § Programmatic acces (API) ?Ø There are solutions for all cases
  • 13. rd 3 floor - PublicationSemWebPro 18/01/2011 13
  • 14. th 4 floor - InterconnexionSemWebPro 18/01/2011 15
  • 15. Towards automated interconnexion servicesØ Record linkage, entity reconciliation, instance, ontology, schema matching § Using alignments between vocabularies § Detection of discriminating properties § Indicating comparison methods by attaching metadata to ontologiesØ Work in progress in Datalift
  • 16. 5th floor - ApplicationsSemWebPro 18/01/2011 17
  • 17. “It is a time when, even if nets were to guide all consciousness that had been converted to photons and electrons toward coalescing, standaloneindividuals have not yet been converted into data tothe extent that they can form unique components of a larger complex” Mamoru Oshii, Ghost in the Shell