أكاديمية الحكومة الإلكترونية الفلسطينية
    The Palestinian eGovernment Academy
                        www.egovacademy.ps



Data Integration and Open
  Information Systems
              Introduction & Outline


                   Prof. Mustafa Jarrar
                  Sina Institute, University of Birzeit
                          mjarrar@birzeit.edu
                             www.jarrar.info

                              Reviewed by
  Prof. Marco Ronchetti and Prof. Paolo Bouquet, Trento University, Italy
                          PalGov © 2011
About

This tutorial is part of the PalGov project, funded by the TEMPUS IV program of the
Commission of the European Communities, grant agreement
511159-TEMPUS-1-2010-1-PS-TEMPUS-JPHES. Project website: www.egovacademy.ps
Project Consortium:

• Birzeit University, Palestine (Coordinator)
• Palestine Polytechnic University, Palestine
• Palestine Technical University, Palestine
• Ministry of Telecom and IT, Palestine
• Ministry of Interior, Palestine
• Ministry of Local Government, Palestine
• University of Trento, Italy
• Vrije Universiteit Brussel, Belgium
• Université de Savoie, France
• University of Namur, Belgium
• TrueTrust, UK


Coordinator:
Dr. Mustafa Jarrar
Birzeit University, P.O. Box 14, Birzeit, Palestine
Telfax: +972 2 2982935, mjarrar@birzeit.edu
© Copyright Notes
Everyone is encouraged to use this material, or part of it, but should
properly cite the project (logo and website), and the author of that part.


No part of this tutorial may be reproduced or modified in any form or by
any means without prior written permission from the project, which holds
the full copyright on the material.




                 Attribution-NonCommercial-ShareAlike
                              CC-BY-NC-SA

This license lets others remix, tweak, and build upon your work
non-commercially, as long as they credit you and license their new creations
under the identical terms.

Intended Learning Outcomes and Tutorial Topics
Data Integration and Open Information Systems
Prerequisites: Tutorial 1 (Sessions 1-10), HTML and web programming basics.

General Skills (eGov Engineer)
• Service Integration: Ability to integrate services in a centralized or decentralized manner.
• Integration: Ability to integrate and fuse data in a centralized or decentralized manner.
• Ontology: Build and engineer good-quality ontologies.
• Lexical: Engineer multilingual knowledge and its lexical semantics.
• Entity: Resolve and manage entity identities.
• Database: Describe database schemas at the conceptual level.
• Database: Build data dictionaries and describe data semantics.
• Process Modeling: Identify, model, and re-engineer business processes.

Intended Learning Objectives
A: Knowledge and Understanding
  2a1: Describe tree and graph data models.
  2a2: Understand the notation of XML, RDF, RDFS, and OWL.
  2a3: Demonstrate knowledge of querying techniques for data models, such as SPARQL and XPath.
  2a4: Explain the concepts of identity management and Linked Data.
  2a5: Demonstrate knowledge of the integration and fusion of heterogeneous data.
B: Intellectual Skills
  2b1: Represent data using tree and graph data models (XML & RDF).
  2b2: Describe data semantics using RDFS and OWL.
  2b3: Manage and query data represented in RDF, XML, and OWL.
  2b4: Integrate and fuse heterogeneous data.
C: Professional and Practical Skills
  2c1: Use Oracle Semantic Technology and/or Virtuoso to store and query RDF data.
D: General and Transferable Skills
  2d1: Work in a team.
  2d2: Present and defend ideas.
  2d3: Use creativity and innovation in problem solving.
  2d4: Develop communication skills and logical reasoning abilities.
Intended Learning Outcomes and Tutorial Topics
Data Integration and Open Information Systems
Prerequisites: HTML & Web Programming; Sessions 5-16 depend on Sessions 1-10 in Tutorial 1.

Topic                                                              Hours
Session 1: XML Basics and Namespaces                               3
Session 2: XML DTDs                                                3
Session 3: XML Schemas                                             3
Session 4: Lab-XML Schemas                                         3
Session 5: RDF and RDFS                                            3
Session 6: Lab-RDF and RDFS                                        3
Session 7: OWL (Web Ontology Language)                             3
Session 8: Lab-OWL                                                 3
Session 9: Lab-RDF Stores: Challenges and Solutions                3
Session 10: Lab-SPARQL                                             3
Session 11: Lab-Oracle Semantic Technology                         3
Session 12_1: The Problem of Data Integration                      1.5
Session 12_2: Architectural Solutions for the Integration Issues   1.5
Session 13_1: Data Schema Integration                              1
Session 13_2: GAV and LAV Integration                              1
Session 13_3: Data Integration and Fusion using RDF                1
Session 14: Lab-Data Integration and Fusion using RDF              3
Session 15_1: Data Web and Linked Data                             1.5
Session 15_2: RDFa                                                 1.5
Session 16: Lab-RDFa                                               3
Some Rules

• Attendance: Attendance is mandatory.

• Etiquette: Cell phones must be turned off. Don’t come late. If you
  must go out during the lecture, don’t let us notice.




