121004 linking open_data_with_drupal_v1

  1. Linking Open Data with Drupal. Emmanuel Jamin, Drupal.cat, October 4th, 2012, Citilab, Cornellà
  2. Who am I? Emmanuel Jamin. PhD: at Paris XI University (LIMSI-CNRS, Orsay). Research and development (EU projects): at Edelweiss (INRIA, Sophia Antipolis); at the Knowledge Lab (ATOS, Barcelona). Now: Semantic Web consultant in Barcelona; www.OpenData-consulting.com; @openDataC
  3. Plan: Introduction to Open Data; Introduction to the Semantic Web; From Open Data to Linked Data
  4. A – OD – Definition: “Open data is data that can be freely used, reused and redistributed by anyone – subject only, at most, to the requirement to attribute and sharealike.” http://OpenDefinition.org
  5. A – OD – Principles: Availability and Access; Reuse and Redistribution; Universal Participation
  6. A – OD – A brief history: 1957-1958: 1st concept, “open access to scientific data”; 2001: 1st definition, “the web of data” (Tim Berners-Lee); 2004-05: 1st foundation, the Open Knowledge Foundation (http://okfn.org/); 2009-05: 1st Open Government platform in the US, http://data.gov; 2012-09: 1st Open Knowledge Festival, http://okfestival.org
  7. Image by Peter Ito (2009): http://www.flickr.com/photos/peterito/3054501076/lightbox/
  8. A – OD – Platforms: Open Government; Open Science; Open Cities; Open Education; Open Culture; Open Health; … Diagram labels: Transparency, Participation, Collaboration
  9. A – OD – Status of OD: Topics. From: http://okfn.org/opendata/
  10. A – OD – Status of OD: Types of data: documents; raw data; structured data; linked data; databases; geo data
  11. A – OD – Status of OD: Heterogeneous standards (open standards): TXT, PDF, DOC, CSV, ZIP, XML, ODT, RDF, JSON, KML-KMZ, XSL
  12. A – OD – Comparison (Barcelona / Catalunya / España). Websites: Barcelona http://w20.bcn.cat/opendata/; Catalunya http://www20.gencat.cat/portal/site/dadesobertes/; España http://datos.gob.es/datos/. Topics (all three portals): Economy, Cartography and Public sector, Cartography, maps, Facilities, Culture and hobbies, Population, Statistics, Science and Environment, Meteorology, technologies, Administration, Nomenclators, Environment, Health, Public Education, Transport, Tourism. Formats (all three portals): CSV, PDF, XLS, TMX, ZIP, XHTML, HTML, XML, RDF, TXT, KML-KMZ, DOC, JSON, SHP, SPARQL
  13. A – OD – Why open up data?
  14. A – OD – Why open up data? Faceted search and browsing; data integration, to compare easily. http://civio.es
  15. A – OD – Why open up data? http://manybills.researchlabs.ibm.com/
  16. A – OD – Why open up data? Big Data analysis and statistics; graphic representation of datasets, to visualize them easily; data reuse and combination; data integration, to compare easily; faceted search and browsing, to contextualize information easily; data mapping
  17. A – OD – Why open up data? Opening the data lets you analyze it, reuse it, mix it and visualize it … for a better comprehension!
  18. OD – The big challenge: The OD movement has the energy, the open-mind philosophy, the public resources, etc. But something is missing ...
  19. From: http://www.nathan.com/thoughts/unified/3.html
  20. OD – The big challenge: Opening the data is great! But it is not enough … Linking Open Data!
  21. Semantic Web
  22. From: http://salesenablement.wordpress.com/2010/09/07/the-importance-of-context/
  23. B – SW – Principles: Do not read the next slide!
  24. B – SW – Principles: You lose!
  25. B – SW – Principles: Humans identify and interpret information. Machines don't.
  26. B – Towards the Structured Web: Separate the form and the content. XML and metadata
  27. B – Towards the Structured Web: XML and metadata: <book/> | <chapter/> | <paragraph/>
  28. B – What do machines really understand? Arbitrary metadata: <hbskm/> | <rzañokt/> | <kmcsuhdd/>
  29. B – What is the last document you read?
  30. B – Document? Document { book, newspaper, paper, postcard … }
  31. B – The answer is based on a shared ontology, that is, on shared knowledge: we can understand, you can reason
  32. B – Document → Book → Roman / Novel
  33. B – “An ontology is a specification of a conceptualization” (i.e. the logical description of the concepts and relationships that can exist for an agent or a community of agents). Tom Gruber (1993)
  34. B – Towards the Semantic Web
  35. B – SW – Definition: The Semantic Web is "a web of data that can be processed directly and indirectly by machines." Tim Berners-Lee (2001)
  36. B – The W3C standardization stack. From: http://mmt.me.uk/slides/london011209/#(2)
  37. B – SW – Resources: Everything is a resource: Person (Berners-Lee); Organisation (W3C); Document (paper.html); Event (SW conference 2012); … etc.
  38. B – SW – Resources: Each resource is identified with a unique reference, a URI: www.w3c.org/people/timbl.html#this (Berners-Lee); www.w3c.org/index.html#this (W3C); www.w3c.org/papers/paper.html#this (paper.html); www.w3c.org/events/swcon12.html#this (SW con12)
  39. B – SW – Resources: A namespace simplifies the URI. Namespace: www.w3c.org/people/timbl.html#; prefix: tbl: www.w3c.org/people/timbl.html#; CURIE: tbl:this
  40. B – SW – Resources: A CURIE simplifies the URI: w3c:timbl (foaf:Person); w3c:this (foaf:Organisation); dblp:this (foaf:Document); event:this (foaf:Event)
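
The prefix mechanism on the two Resources slides can be sketched in a few lines of Python. This is only an illustration: the `http://` scheme on the `tbl` namespace, the `PREFIXES` table and the function names `expand_curie` and `compact_uri` are assumptions, not part of the talk.

```python
# Hypothetical prefix table. The slides give the namespace without a scheme;
# "http://" is added here as an assumption so the result is a full URI.
PREFIXES = {
    "tbl": "http://www.w3c.org/people/timbl.html#",
    "foaf": "http://xmlns.com/foaf/0.1/",
}


def expand_curie(curie, prefixes=PREFIXES):
    """Turn a CURIE such as 'tbl:this' into the full URI it abbreviates."""
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in prefixes:
        raise ValueError(f"unknown prefix in CURIE: {curie!r}")
    return prefixes[prefix] + local


def compact_uri(uri, prefixes=PREFIXES):
    """Abbreviate a URI with the longest matching namespace, if any."""
    best = None
    for prefix, namespace in prefixes.items():
        if uri.startswith(namespace):
            if best is None or len(namespace) > len(prefixes[best]):
                best = prefix
    if best is None:
        return uri  # no known namespace: leave the URI untouched
    return f"{best}:{uri[len(prefixes[best]):]}"


print(expand_curie("tbl:this"))
print(compact_uri("http://xmlns.com/foaf/0.1/Person"))
```

Expansion is plain string concatenation, which is why a namespace conventionally ends in `#` or `/`.
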
  41. B – SW – Triples: RDF: (subject, predicate, object)
  42. B – SW – Triples: RDF triples: web.html has author Tim Berners-Lee; LinkedData.html has author Hausenblas; W3C has employee Tim Berners-Lee; web.html is published at SW conference
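
As a rough sketch, the four example triples can be held as plain (subject, predicate, object) tuples and filtered by position. The `Triple` type and the helpers `objects_of` and `subjects_of` are hypothetical names chosen for this illustration. Real RDF would use URIs rather than the readable labels from the slide.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# The four statements from the slide, kept as readable labels.
TRIPLES = [
    Triple("web.html", "has author", "Tim Berners-Lee"),
    Triple("LinkedData.html", "has author", "Hausenblas"),
    Triple("W3C", "has employee", "Tim Berners-Lee"),
    Triple("web.html", "is published at", "SW conference"),
]


def objects_of(triples, subject, predicate):
    """All objects o such that (subject, predicate, o) is in the data."""
    return [t.obj for t in triples
            if t.subject == subject and t.predicate == predicate]


def subjects_of(triples, predicate, obj):
    """All subjects s such that (s, predicate, obj) is in the data."""
    return [t.subject for t in triples
            if t.predicate == predicate and t.obj == obj]


print(objects_of(TRIPLES, "web.html", "has author"))
print(subjects_of(TRIPLES, "has author", "Tim Berners-Lee"))
```
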
  43. B – SW – Ontologies: RDF-S → RDF-Schema. Definition of the classes (concepts) and properties (conceptual relations). Hierarchical organisation with conceptual relations
  44. B – SW – Ontologies: RDFS: Book is sub-type of Document; Novel is sub-type of Book; Roman is sub-type of Book
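
The value of the sub-type statements is that a machine can chain them. In the minimal sketch below, the dictionary layout and the names `ancestors` and `is_a` are assumptions, and each class is assumed to have at most one direct parent, which RDFS itself does not require.

```python
# Direct sub-type links from the slide: child -> parent.
SUBCLASS_OF = {
    "Book": "Document",
    "Novel": "Book",
    "Roman": "Book",
}


def ancestors(cls, subclass_of=SUBCLASS_OF):
    """Follow sub-type links upwards and return every super-type, nearest first."""
    result = []
    seen = {cls}
    current = subclass_of.get(cls)
    while current is not None and current not in seen:  # guard against cycles
        result.append(current)
        seen.add(current)
        current = subclass_of.get(current)
    return result


def is_a(cls, target, subclass_of=SUBCLASS_OF):
    """True if cls is target or a (transitive) sub-type of target."""
    return cls == target or target in ancestors(cls, subclass_of)


print(ancestors("Novel"))
print(is_a("Roman", "Document"))
```

No statement says that a Novel is a Document. It follows from the two links Novel → Book and Book → Document.
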
  45. B – SW – RDF graph: RDF triples => Linked Data: W3C.html has author Tim Berners-Lee; W3C.html is type of Document; Tim Berners-Lee is type of Person; W3C.html is presented at Web Conference 2012; Web Conference 2012 is type of Conference; Conference is sub class of Event
  46. B – SW – RDF graph: RDF triples => RDF graph. Nodes: Organisation, Event, Document, Person, Conference, W3C, web.html, Tim Berners-Lee, SW conference
  47. B – SW – Federated dataset: All resources are connected over the Web. LOD site 1 and LOD site 2; resources: w3c:this, tim:this, ivan:this, doc1:this, doc2:this, doc3:this
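
Federation works because two sites that use the same identifier are talking about the same resource, so their triples can simply be pooled. The sketch below is illustrative only: the slide names the resources but not the statements, so the predicates (`author`, `worksFor`) and the split of triples between `SITE_1` and `SITE_2` are invented for the example.

```python
# Hypothetical content of two LOD sites; only the resource names come from the slide.
SITE_1 = {
    ("doc1:this", "author", "tim:this"),
    ("doc2:this", "author", "tim:this"),
    ("tim:this", "worksFor", "w3c:this"),
}
SITE_2 = {
    ("doc2:this", "author", "ivan:this"),
    ("doc3:this", "author", "ivan:this"),
    ("ivan:this", "worksFor", "w3c:this"),
}


def resources(triples):
    """Every identifier used as a subject or an object."""
    return {s for s, _, _ in triples} | {o for _, _, o in triples}


# Identifiers known to both sites are the join points of the federated graph.
shared = resources(SITE_1) & resources(SITE_2)

# Merging is a set union: identical triples collapse, the rest accumulate.
merged = SITE_1 | SITE_2
doc2_authors = {o for s, p, o in merged if s == "doc2:this" and p == "author"}

print(sorted(shared))
print(sorted(doc2_authors))
```

Neither site alone knows both authors of `doc2:this`. The merged graph does.
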
  48. B – SW – SPARQL: Find and retrieve information from the graph with SPARQL:
      PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
      PREFIX foaf: <http://xmlns.com/foaf/0.1/>
      SELECT ?document ?authorName
      WHERE {
        ?person rdf:type foaf:Person .
        ?person foaf:name ?authorName .
        ?person foaf:made ?document .
      }
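
To show what a SPARQL engine does with a WHERE clause of this shape (a person typed foaf:Person, with a name, who made a document), here is a toy pattern matcher in Python. It is a sketch, not a SPARQL implementation: the `ex:` resources in `GRAPH` are invented sample data, and `match` and `query` are hypothetical helper names. Requires Python 3.8 or later.

```python
def match(pattern, triple, binding):
    """Try to extend a variable binding so that pattern equals triple."""
    new = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable: bind it, or check it against its earlier binding.
            if new.setdefault(term, value) != value:
                return None
        elif term != value:
            return None  # a constant that does not match
    return new


def query(triples, patterns):
    """Evaluate a list of triple patterns as a conjunction (a basic graph pattern)."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [nb for b in bindings for t in triples
                    if (nb := match(pattern, t, b)) is not None]
    return bindings


# Invented sample data, written with CURIEs.
GRAPH = [
    ("ex:tim", "rdf:type", "foaf:Person"),
    ("ex:tim", "foaf:name", "Tim Berners-Lee"),
    ("ex:tim", "foaf:made", "ex:web.html"),
    ("ex:w3c", "rdf:type", "foaf:Organization"),
    ("ex:w3c", "foaf:name", "W3C"),
]

PATTERNS = [
    ("?person", "rdf:type", "foaf:Person"),
    ("?person", "foaf:name", "?authorName"),
    ("?person", "foaf:made", "?document"),
]

rows = [(b["?document"], b["?authorName"]) for b in query(GRAPH, PATTERNS)]
print(rows)
```

Each pattern narrows the set of candidate bindings. Reusing `?person` in all three patterns is what joins them.
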
  49. B – SW – Giant Global Graph: The web becomes one giant database
  50. B – SW: Is this fiction?
  51. B – SW: Google Rich Snippets. From: http://openspring.net/blog/2011/09/30/schemaorg-rich-snippets-drupal-7-rdfa
  52. B – SW: Facebook Open Graph
  53. B – SW: Google Knowledge Graph
  54. C – OD + LD
  55. C – OD + LD: From Open Data to Linked Data: formats from PDF, CSV, XML and JSON up to RDF and RDFS. Diagram labels: Open Data, Structured Data, Linked Data
  56. C – OD + LD: From PDF to RDF. 1. Document engineering: content extraction; content format; multimedia extraction. 2. Knowledge engineering: term extraction (indexing); named-entity recognition; ontology engineering; conceptual recognition and mapping
  57. C – OD + LD: Synthesis of data formats (table)
  58. C – OD + LD: To succeed with Linked Open Data: 1. Data formalization: create or reuse ontologies (RDF, RDFS, OWL). 2. Data annotation: associate semantic metadata (RDF, RDFa, Microdata). 3. Data publication: publish your semantic data (RDFa, Microdata). 4. Data consumption: reuse all available data (SPARQL endpoints)
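
The formalization and annotation steps can be illustrated by turning rows of an open CSV file into RDF statements in N-Triples syntax. This is a sketch under stated assumptions: the `BASE` and `VOCAB` namespaces under example.org, the sample CSV and the function names are all invented, and every value is emitted as a plain string literal, whereas a real conversion would map columns to an existing ontology.

```python
import csv
import io
from urllib.parse import quote

BASE = "http://example.org/dataset/"  # hypothetical namespace for the rows
VOCAB = "http://example.org/vocab#"   # hypothetical namespace for the columns


def escape_literal(value):
    """Escape the characters that N-Triples forbids inside a quoted literal."""
    return (value.replace("\\", "\\\\").replace('"', '\\"')
                 .replace("\n", "\\n").replace("\r", "\\r"))


def csv_to_ntriples(text, id_column):
    """One subject per row, one predicate per column, one triple per filled cell."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = f"<{BASE}{quote(row[id_column])}>"
        for column, value in row.items():
            if column is None or column == id_column or not value:
                continue  # skip the key column, empty cells and stray extra cells
            predicate = f"<{VOCAB}{quote(column)}>"
            lines.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return lines


# Invented sample data.
CSV_TEXT = "id,name,district\n1,Citilab,Cornella\n2,Library,\n"

for line in csv_to_ntriples(CSV_TEXT, "id"):
    print(line)
```

Publication and consumption then amount to serving these statements and querying them, for instance through a SPARQL endpoint.
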
  59. C – OD + LD: From Open Data to Linked Data: data quality
  60. B – SW – Giant Global Graph: Open Data + Data Interconnection = Linked Open Data. 25 billion RDF triples over the web
  61. From: http://www.w3.org/DesignIssues/diagrams/lod/2010-color.png
  62. B – SW – Giant Global Graph: Open Data + Data Interconnection = Linked Open Data. 25 billion RDF triples over the web. http://dbpedia.org
  63. B – SW – Giant Global Graph: Open Data + Data Interconnection = Linked Open Data. 25 billion RDF triples over the web. The Web 3.0 is already here ...
  64. Linking Open Data with Drupal
  65. D – LODrupal – LOD and Drupal: Entities ↔ Resources; RDF in Drupal Core; Semantic Web modules
  66. Main Semantic Web modules: schema.org; Microdata; RDFx; SPARQL; SPARQL Views; Import Linked Data
  67. D – LODrupal – Mod1 ... RDFx. From: http://drupal.org/project/rdfx
  68. D – LODrupal – Mod1 ... schemaorg. From: http://drupal.org/project/schemaorg
  69. D – LODrupal – Mod1 ... SPARQL. From: http://drupal.org/project/sparql
  70. D – LODrupal – Drupal Prototype: Demonstration
  71. E – LODrupal Hackathon: LOD + Drupal hackathon
  72. E – LODrupal Hackathon: Sprint 1 (Saturday 10/11/2012): A1 - Consume OD; A2 - OD Integration. Sprint 2 (Saturday 08/12/2012): B1 - Publish LOD; B2 - Build LOD applications
  73. E – LODrupal Hackathon: References: http://okfn.org/opendata/; http://www.slideshare.net/fabien_gandon/web-smantique-donnes-lies-et-smantique-des-schmas-2184768; http://www.slideshare.net/scorlosquet/how-to-build-linked-data-sites-with-drupal-7-and-rdfa; http://www20.gencat.cat/portal/site/dadesobertes/; http://w20.bcn.cat/opendata/; http://datos.gob.es/datos/; http://drupal.org/project/odv
  74. Questions? Thanks!