Datalift: A Catalyser for the Web of Data                    François Scharffe                    University of Montpellie...
Datalift and Web-Science
DataliftA large scale Web data publication experiment.Objectives:- Publish reference datasets- Automate the data publicati...
DataliftMotivation:- Two phenomena: - Society – Open Data - Technology – Semantic WebData revolution going on : the web of...
DataliftDatasets publicationR&D to automate the publication processA modular architecture to assist datapublicationTrainin...
Welcome aboard the data lift                Published and interlinked data on the Web                             Applicat...
st                       1 floor - SelectionSemWebPro 18/01/2011            7
Vocabulary selection   Vocabularies for linked-data    ●     Are meant to describe resources in RDF   ●     Are based on o...
Vocabulary of a FriendØ http://www.mondeca.com/foaf/voafØ A simple vocabulary...Ø To represent interconnexions between voc...
nd                   2 floor - ConversionSemWebPro 18/01/2011         10
Reference datasets, URI design●   Providing reference datasets for the French    ecosystem: geographical, topological, sta...
Convertion tools to RDFØ How is the raw data to be converted ?  § Relational Database ?  § (Semi-)structured formats ?  § ...
rd                       3 floor - PublicationSemWebPro 18/01/2011             13
th                 4 floor - InterconnexionSemWebPro 18/01/2011         15
Towards automated interconnexion servicesØ Record linkage, entity reconciliation, instance,  ontology, schema matching   §...
5th floor - ApplicationsSemWebPro 18/01/2011          17
“It is a time when, even if nets were to guide all consciousness that had been converted to photons     and electrons towa...
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Upcoming SlideShare
Loading in …5
×

Datalift: A Catalyser for the Web of Data - Francois Scharffe

818 views

Published on

Talk at Web Science Montpellier Meetup - 13th May 2011

Published in: Education
  • Be the first to comment

  • Be the first to like this

Datalift: A Catalyser for the Web of Data - Francois Scharffe

  1. 1. Datalift: A Catalyser for the Web of Data François Scharffe University of Montpellier, LIRMM, INRIA francois.scharffe@lirmm.fr @lechatpitoWith the help of the Datalift teamAnd the support of the French National Research Agency Webscience meetup 5/02/2011 1
  2. 2. Datalift and Web-Science
  3. 3. DataliftA large scale Web data publication experiment.Objectives:- Publish reference datasets- Automate the data publication process- Show the interest of publishing linked data 3
  4. 4. DataliftMotivation:- Two phenomena: - Society – Open Data - Technology – Semantic WebData revolution going on : the web of data is explosing as the web of documents exploded in the 90 4
  5. 5. DataliftDatasets publicationR&D to automate the publication processA modular architecture to assist datapublicationTraining, tutorials, data publication camps
  6. 6. Welcome aboard the data lift Published and interlinked data on the Web Applications InterconnexionPublication infrastructure Data convertion Vocabulary selection Raw data
  7. 7. st 1 floor - SelectionSemWebPro 18/01/2011 7
  8. 8. Vocabulary selection Vocabularies for linked-data ● Are meant to describe resources in RDF ● Are based on one of the standard W3C language RDFS and OWLØ What makes a good vocabulary ? ● A good vocabulary is a used vocabulary ● Other usability criterias : Simplicity, visibility, documentation, flexibility, semantic integration, social integrationØ Types of vocabularies ● Metadata, reference, domain, general
  9. 9. Vocabulary of a FriendØ http://www.mondeca.com/foaf/voafØ A simple vocabulary...Ø To represent interconnexions between vocabulariesØ A unique entry point to vocabularies and Datasets of the linked-data cloud Linked Data CloudØ Ongoing work in Datalift
  10. 10. nd 2 floor - ConversionSemWebPro 18/01/2011 10
  11. 11. Reference datasets, URI design● Providing reference datasets for the French ecosystem: geographical, topological, statistical, political. Ex: http://parisemantique.fr● Providing URI design guidelines ● Opaque or transparent URIs ? ● Usage of accents in URIs
  12. 12. Convertion tools to RDFØ How is the raw data to be converted ? § Relational Database ? § (Semi-)structured formats ? § Programmatic acces (API) ?Ø There are solutions for all cases
  13. 13. rd 3 floor - PublicationSemWebPro 18/01/2011 13
  14. 14. th 4 floor - InterconnexionSemWebPro 18/01/2011 15
  15. 15. Towards automated interconnexion servicesØ Record linkage, entity reconciliation, instance, ontology, schema matching § Using alignments between vocabularies § Detection of discriminating properties § Indicating comparison methods by attaching metadata to ontologiesØ Work in progress in Datalift
  16. 16. 5th floor - ApplicationsSemWebPro 18/01/2011 17
  17. 17. “It is a time when, even if nets were to guide all consciousness that had been converted to photons and electrons toward coalescing, standaloneindividuals have not yet been converted into data tothe extent that they can form unique components of a larger complex” Mamoru Oshii, Ghost in the Shell

×