Your SlideShare is downloading. ×
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Datalift: A Catalyser for the Web of Data - Francois Scharffe

506

Published on

Talk at Web Science Montpellier Meetup - 13th May 2011

Talk at Web Science Montpellier Meetup - 13th May 2011

Published in: Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
506
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Datalift: A Catalyser for the Web of Data François Scharffe University of Montpellier, LIRMM, INRIA francois.scharffe@lirmm.fr @lechatpitoWith the help of the Datalift teamAnd the support of the French National Research Agency Webscience meetup 5/02/2011 1
  • 2. Datalift and Web-Science
  • 3. DataliftA large scale Web data publication experiment.Objectives:- Publish reference datasets- Automate the data publication process- Show the interest of publishing linked data 3
  • 4. DataliftMotivation:- Two phenomena: - Society – Open Data - Technology – Semantic WebData revolution going on : the web of data is explosing as the web of documents exploded in the 90 4
  • 5. DataliftDatasets publicationR&D to automate the publication processA modular architecture to assist datapublicationTraining, tutorials, data publication camps
  • 6. Welcome aboard the data lift Published and interlinked data on the Web Applications InterconnexionPublication infrastructure Data convertion Vocabulary selection Raw data
  • 7. st 1 floor - SelectionSemWebPro 18/01/2011 7
  • 8. Vocabulary selection Vocabularies for linked-data ● Are meant to describe resources in RDF ● Are based on one of the standard W3C language RDFS and OWLØ What makes a good vocabulary ? ● A good vocabulary is a used vocabulary ● Other usability criterias : Simplicity, visibility, documentation, flexibility, semantic integration, social integrationØ Types of vocabularies ● Metadata, reference, domain, general
  • 9. Vocabulary of a FriendØ http://www.mondeca.com/foaf/voafØ A simple vocabulary...Ø To represent interconnexions between vocabulariesØ A unique entry point to vocabularies and Datasets of the linked-data cloud Linked Data CloudØ Ongoing work in Datalift
  • 10. nd 2 floor - ConversionSemWebPro 18/01/2011 10
  • 11. Reference datasets, URI design● Providing reference datasets for the French ecosystem: geographical, topological, statistical, political. Ex: http://parisemantique.fr● Providing URI design guidelines ● Opaque or transparent URIs ? ● Usage of accents in URIs
  • 12. Convertion tools to RDFØ How is the raw data to be converted ? § Relational Database ? § (Semi-)structured formats ? § Programmatic acces (API) ?Ø There are solutions for all cases
  • 13. rd 3 floor - PublicationSemWebPro 18/01/2011 13
  • 14. th 4 floor - InterconnexionSemWebPro 18/01/2011 15
  • 15. Towards automated interconnexion servicesØ Record linkage, entity reconciliation, instance, ontology, schema matching § Using alignments between vocabularies § Detection of discriminating properties § Indicating comparison methods by attaching metadata to ontologiesØ Work in progress in Datalift
  • 16. 5th floor - ApplicationsSemWebPro 18/01/2011 17
  • 17. “It is a time when, even if nets were to guide all consciousness that had been converted to photons and electrons toward coalescing, standaloneindividuals have not yet been converted into data tothe extent that they can form unique components of a larger complex” Mamoru Oshii, Ghost in the Shell

×