Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Pal gov.tutorial2.session13 integration and fusion using rdf


Published on

Published in: Education
  • Be the first to comment

Pal gov.tutorial2.session13 integration and fusion using rdf

  1. 1. ‫أكاديمية الحكومة اإللكترونية الفلسطينية‬ The Palestinian eGovernment Academy www.egovacademy.psTutorial II: Data Integration and Open Information Systems Module 13.3 Data Integration and Fusion using RDF Dr. Mustafa Jarrar University of Birzeit PalGov © 2011 1
  2. 2. AboutThis tutorial is part of the PalGov project, funded by the TEMPUS IV program of theCommission of the European Communities, grant agreement 511159-TEMPUS-1-2010-1-PS-TEMPUS-JPHES. The project website: www.egovacademy.psProject Consortium: Birzeit University, Palestine University of Trento, Italy (Coordinator ) Palestine Polytechnic University, Palestine Vrije Universiteit Brussel, Belgium Palestine Technical University, Palestine Université de Savoie, France Ministry of Telecom and IT, Palestine University of Namur, Belgium Ministry of Interior, Palestine TrueTrust, UK Ministry of Local Government, PalestineCoordinator:Dr. Mustafa JarrarBirzeit University, P.O.Box 14- Birzeit, PalestineTelfax:+972 2 2982935 mjarrar@birzeit.eduPalGov © 2011 2
  3. 3. © Copyright NotesEveryone is encouraged to use this material, or part of it, but shouldproperly cite the project (logo and website), and the author of that part.No part of this tutorial may be reproduced or modified in any form or byany means, without prior written permission from the project, who havethe full copyrights on the material. Attribution-NonCommercial-ShareAlike CC-BY-NC-SAThis license lets others remix, tweak, and build upon your work non-commercially, as long as they credit you and license their new creationsunder the identical terms. PalGov © 2011 3
  4. 4. Tutorial Map Topic h Intended Learning Objectives Session 1: XML Basics and Namespaces 3A: Knowledge and Understanding Session 2: XML DTD’s 3 2a1: Describe tree and graph data models. Session 3: XML Schemas 3 2a2: Understand the notation of XML, RDF, RDFS, and OWL. 2a3: Demonstrate knowledge about querying techniques for data Session 4: Lab-XML Schemas 3 models as SPARQL and XPath. Session 5: RDF and RDFs 3 2a4: Explain the concepts of identity management and Linked data. Session 6: Lab-RDF and RDFs 3 2a5: Demonstrate knowledge about Integration &fusion of Session 7: OWL (Ontology Web Language) 3 heterogeneous data. Session 8: Lab-OWL 3B: Intellectual Skills Session 9: Lab-RDF Stores -Challenges and Solutions 3 2b1: Represent data using tree and graph data models (XML & Session 10: Lab-SPARQL 3 RDF). Session 11: Lab-Oracle Semantic Technology 3 2b2: Describe data semantics using RDFS and OWL. Session 12_1: The problem of Data Integration 1.5 2b3: Manage and query data represented in RDF, XML, OWL. Session 12_2: Architectural Solutions for the Integration Issues 1.5 2b4: Integrate and fuse heterogeneous data. Session 13_1: Data Schema Integration 1C: Professional and Practical Skills Session 13_2: GAV and LAV Integration 1 2c1: Using Oracle Semantic Technology and/or Virtuoso to store Session 13_3: Data Integration and Fusion using RDF 1 and query RDF stores. Session 14: Lab-Data Integration and Fusion using RDF 3D: General and Transferable Skills 2d1: Working with team. Session 15_1: Data Web and Linked Data 1.5 2d2: Presenting and defending ideas. Session 15_2: RDFa 1.5 2d3: Use of creativity and innovation in problem solving. 2d4: Develop communication skills and logical reasoning abilities. Session 16: Lab-RDFa 3 PalGov © 2011 4
  5. 5. Module ILOsAfter completing this module students will be able to: - Explain the concepts of identity management and linked data. - Integrate and fuse heterogeneous data. - Represent data using the graph data model (RDF). - Manage and query data represented in RDF. PalGov © 2011 5
  6. 6. Example from the Government Domain• Consider this simplified example from the Government domain. Consider three governmental agencies that record information about companies.• In this example, we will integrate the three databases by transforming each one into RDF and then concatenating the resultant RDF tables into one table. After that, we investigate the concatenated data and link the different resources.• Data integration is simply achieved through concatenation of RDF graphs and linking different resources. It is also achieved when building and executing the queries over the concatenated dataset. Companies DB in Companies DB in Companies DB in Ministry of Justice Chamber of Commerce Ministry of Economy PalGov © 2011 6
  7. 7. Ministry of Justice • Ministry of Justice records some information about companies in addition to the advocates that represent the companies.CompanyAdvocate PalGov © 2011 7
  8. 8. Ministry of Justice: To RDFCompanyAdvocate To RDF … PalGov © 2011 8
  9. 9. Chamber of Commerce • Chamber of Commerce records information about companies in addition to information about companies’ owners.CompanyOwnerCompany_Owner PalGov © 2011 9
  10. 10. Chamber of Commerce: To RDF To RDF … PalGov © 2011 10
  11. 11. Ministry of Economy • Ministry of Economy records information about companies, their owners, and their advocates.CompanyOwnerLawyer PalGov © 2011 11
  12. 12. Ministry of Economy: To RDF To RDF … PalGov © 2011 12
  13. 13. Integration of RDF DataAs simple as …S P O S P O S P O PalGov © 2011 13
  14. 14. In our example PalGov © 2011 14
  15. 15. Linking resources • How are same entities described in different datasets linked? • By linking the Global Identifier, that is, the URI**! • Let’s have a look::YH852 owl:sameAs :8327848 : H782YU owl:sameAs :L85652r:YH852 owl:sameAs :4354JU- Links the company called “Palestine - Links the lawyer called “Tony Deik” recorded in Antiques” in the three databases. the ministry of Justice and the ministry of- This is called entity resolution/ national economy. disambiguation. - This is called entity resolution/ disambiguation.**Note that in our example we used colons to distinguish URIs. For example :JK452, :H782YU,:Country, and :Name are all URIs.For example: “:H782YU” might actually be something like: PalGov © 2011 15
  16. 16. Data Integration and Fusion Source: Christian Bizer• Concatenating RDF graphs and linking entities in different datasets forms an integrated view where applications see all datasets as one integrated database. PalGov © 2011 16
  17. 17. References• Chris Bizer: The Emerging Web of Linked Data. Presentation at SRI International, Artificial Intelligence Center. Menlo Park, USA. 2009. PalGov © 2011 17