LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion

State of Play presentation at the LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion by Robert Isele of FUB.


  1. Creating Knowledge out of Interlinked Data
     LOD2 Plenary Meeting Vienna – 2012/03/21 – http://lod2.eu
     WP4: Reuse, Interlinking and Knowledge Fusion
     Robert Isele, Freie Universität Berlin
  2. WP4 Goals
     Translate heterogeneous data from the Web of Linked Data into a clean local target representation.
     Provide open-source software components for:
       – Link Generation
       – Vocabulary Mapping
       – Linked Data quality assessment
       – Linked Data Fusion
  3. WP4 in the LOD Stack
  4. Task 4.1: Semi-Automatic Data Interlinking
     Partners: ULEI, NUIG, FUB, KAIST
     Goals:
       – Develop a Linking Assist, which guides the knowledge engineer through the linking process (FUB, ULEI).
       – (New) Provide a platform for automatic linking with Korean, Chinese, and Japanese RDF resources (KAIST).
  5. Task 4.1: Progress
     The first Linking Assist/Silk Workbench (D4.1.1) was delivered in February 2012. It lets the user:
       – Define data sources (e.g. SPARQL endpoint, RDF dump)
       – Specify the types of resources which should be interlinked
       – Build linkage rules supported by machine learning
       – Evaluate the quality of linkage rules
     Preliminary work on the Korean Resource Linking Assist:
       – Transformed test datasets into RDF.
       – This data will be an input to the Korean resource linking module.
       – Finished the preliminary design of the Korean resource linking module.
  6. Task 4.1: Improving Silk Workbench (1/2)
     Use active learning to reduce the manual effort and required expertise to interlink data sources:
       – Automating the generation of a linkage rule.
       – The user only confirms or declines a set of example links.
  7. Task 4.1: Improving Silk Workbench (2/2)
       – Improving the usability based on user feedback
       – First results for the Y2 review meeting
       – Final deliverable D4.1.2 (Second Linking Assist Release) in February 2013
  8. Task 4.2: Data Interlinking Environment
     Partners: NUIG
     Goals:
       – To research and develop LATC well beyond 2012 into 2014
       – Interlinking recommendations
       – Interaction with data linkage validator from WP3
     Progress:
       – First version of Data Interlinking Environment (D4.2.1) submitted in December 2011
       – Combines the Analytics Graph produced from Sindice data sources with the Silk Link Discovery Framework
  9. Task 4.2: Silk Workbench Extension
       – New Sindice data source for the linking of datasets.
       – Dataset suggestion based on keywords, classes, and datasets.
       – Autocompletion for data types when executing linking tasks.
       – A retrieval method for entity properties to also aid in the execution of linking tasks.
     [Figure: Dataset suggestion]
  10. Task 4.3: Linked Data Quality Assessment
      Partners: FUB, NUIG, ULEI, SWCG
      Goals:
        – Research into recent advances in quality assessment of Linked Data
        – Develop design metrics for quality assessment
        – Release a Linked Data Quality Assessment Component
  11. Task 4.3: Progress
      Survey on the State of the Art in Mapping, Quality Assessment and Data Fusion (D4.3.1) finished in February 2011.
      Conceptual Design and Implementation of Metrics (D4.3.2) finished in February 2012.
      Released first prototype of Sieve, a Linked Data Quality Assessment and Fusion framework:
        – Allows Web data to be filtered according to different data quality assessment policies
        – Provides for fusing Web data according to different conflict resolution methods
        – http://sieve.wbsg.de
        – D4.3.2: Release of the data quality assessment tool (August 2012)
  12. Task 4.4: Schema Mapping Publication and Discovery
      Partners: FUB, ULEI, OGL, SWCG, UEP
      Goals:
        – Specification of the vocabulary mapping publication and discovery language
        – Implementation of the Vocabulary Mapping Component
  13. Task 4.4: Progress
      Specification of the Mapping Publication and Discovery Language (D4.4.1) finished in June 2011.
      Implementation of the Mapping Publication and Discovery Framework (D4.4.2) finished in February 2012:
        – Adapted the R2R Framework based on the use cases in LOD2.
        – Conducted various experiments to demonstrate the performance and scaling behavior for translating data sets (http://www.assembla.com/spaces/ldif/wiki/Benchmark)
        – Implementation published under the terms of the Apache License
  14. Task 4.4: Future Work
      Integration of the Mapping Publication and Discovery Framework into the LOD2 stack (D4.4.3)
      Deadline: February 2013
  15. Task 4.4a: Schema Mapping Robust to Modeling Style
      Partners: UEP
      Goal: Extend the methods and tools of schema matching discovery (from the original Task 4.4) by ontology transformation methods implemented within the (enhanced) PatOMat framework
      Start: March 2012
      First deliverable in December 2012
  16. Task 4.5: Linked Data Fusion
      Partners: FUB, ULEI
      Goal:
        – Build a Data Fusion Component which fuses data from multiple sources
        – Fuse multiple entities representing the same real-world object into a single, consistent and clean representation
      First deliverable:
        – Initial release of Data Fusion Component (D4.5.1).
        – Deadline: 31.08.12
        – Integrating the data quality assessment module (Sieve) developed in Task 4.3 with a data fusion module.
  17. Task 4.5a: Multilingual Linked Data Fusion
      Involved: KAIST, ULEI
      Goal: Fusion of multilingual datasets
        – DBpedia serves as the pivot multilingual dataset, since it is extracted in various languages
        – First step: Bilingual fusion between the Korean DBpedia and the English DBpedia
        – Next: Include other languages such as Chinese and Japanese
      First deliverable in February 2013: Korean Data Fusion Assistant
        – The component will support Korean data fusion into English LOD by combining Deliverable 4.5.1 with the fused dataset of English and Korean DBpedia.
  18. Task 4.6: Tools for Cleansing Entity Data and Crowdsourcing of Cleansing
      Involved: Zemanta
      Goals:
        – Adapt Google Refine for Linked Open Data based on the existing DERI plugin
        – Integrate crowdsourcing services such as Amazon Mechanical Turk for LOD data cleansing.
      Progress:
        – D4.6.1 (M18) Release of an LOD-Enabled Version of Google Refine submitted.
      Next deliverable:
        – D4.6.2 (M30) Release of Documentation and Software Infrastructure for Using Google Refine along with Amazon Mechanical Turk
  19. WP4 Summary (M12 - M18)
      5 deliverables submitted in the last 6 months:
        – ULEI and FUB submitted the First Linking Assist (D4.1.1)
        – NUIG submitted the first version of the Data Linking Environment Release (D4.2.1)
        – FUB finished the Conceptual Design and Implementation of Quality Assessment Metrics (D4.3.2)
        – FUB finished the Implementation of the Mapping Publication and Discovery Framework (D4.4.2)
        – Zemanta submitted the first release of the LOD-enabled version of Google Refine for review (D4.6.1)
  20. Contact
      Address: Freie Universität Berlin, School of Business & Economics, Web-based Systems Group, Garystr. 21, 14195 Berlin, Germany
      Presenter: Robert Isele, mail@robertisele.com
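
The Task 4.1 slides describe building a linkage rule and evaluating its quality. The following is a minimal Python sketch of that idea, not the Silk Workbench API: it assumes a rule made of a single label comparison with a threshold, uses `difflib` as a stand-in similarity measure, and scores the generated links against a set of reference links. All URIs, labels, and function names are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import product


def label_similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def generate_links(source, target, threshold):
    """Apply a one-comparison linkage rule to every source/target pair.

    A pair is linked when the label similarity reaches the threshold.
    """
    return {
        (s_uri, t_uri)
        for (s_uri, s_label), (t_uri, t_label) in product(source.items(), target.items())
        if label_similarity(s_label, t_label) >= threshold
    }


def evaluate(generated, reference):
    """Return precision, recall and F1 of generated links against reference links."""
    true_positives = len(generated & reference)
    precision = true_positives / len(generated) if generated else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical resources from two data sources (URI -> label).
source = {"ex:a/Berlin": "Berlin", "ex:a/Vienna": "Vienna", "ex:a/Seoul": "Seoul"}
target = {"ex:b/berlin": "berlin", "ex:b/wien": "Wien", "ex:b/seoul": "Seoul (city)"}

# Hypothetical reference links, e.g. confirmed by a knowledge engineer.
reference = {
    ("ex:a/Berlin", "ex:b/berlin"),
    ("ex:a/Vienna", "ex:b/wien"),
    ("ex:a/Seoul", "ex:b/seoul"),
}

links = generate_links(source, target, threshold=0.9)
precision, recall, f1 = evaluate(links, reference)
print(f"links={sorted(links)} precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

With a strict threshold the rule is precise but misses links whose labels differ across sources. A learning approach such as the one on the active learning slide would search for a better rule using the links a user confirms or declines.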

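
Tasks 4.3 and 4.5 describe filtering Web data by a quality assessment policy and fusing conflicting values with a conflict resolution method. The sketch below illustrates one possible combination under stated assumptions. It is not Sieve's configuration or API: the quality metric (recency of the source), the conflict resolution method (keep the value from the best-scored source), and all source names, dates, and values are hypothetical.

```python
from datetime import date


def recency_score(last_updated: date, today: date, max_age_days: int = 365) -> float:
    """Hypothetical quality metric: 1.0 for data updated today, 0.0 once it is max_age_days old."""
    age_days = (today - last_updated).days
    return max(0.0, 1.0 - age_days / max_age_days)


def fuse(claims, scores, min_score=0.0):
    """Fuse (source, property, value) claims about one entity.

    Claims from sources scoring below min_score are filtered out; for each
    remaining property, the value from the best-scored source is kept.
    """
    best = {}
    for source, prop, value in claims:
        score = scores[source]
        if score < min_score:
            continue  # quality filter
        if prop not in best or score > best[prop][0]:
            best[prop] = (score, value)  # conflict resolution: best score wins
    return {prop: value for prop, (_, value) in best.items()}


# Made-up provenance: when each source was last updated.
today = date(2012, 3, 21)
last_updated = {
    "source_en": date(2012, 3, 1),
    "source_ko": date(2011, 9, 21),
    "source_old": date(2010, 1, 1),
}
scores = {name: recency_score(day, today) for name, day in last_updated.items()}

# Made-up, conflicting claims about a single entity.
claims = [
    ("source_old", "population", 3000000),
    ("source_ko", "population", 3400000),
    ("source_en", "population", 3500000),
    ("source_ko", "label_ko", "베를린"),
    ("source_old", "area_km2", 900.0),
]

fused = fuse(claims, scores, min_score=0.1)
print(fused)
```

The stale source is dropped by the filter, so its `area_km2` claim does not reach the fused result, while the `population` conflict is resolved in favor of the most recently updated source.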