This document discusses the query-driven method for data integration. It lists the team members and provides an index of topics to be covered, including an introduction to data integration techniques, challenges, and the query-driven method steps. The key aspects of the query-driven approach are that it uses mediators to integrate heterogeneous databases by translating queries, mapping them to individual data sites, executing the queries locally, and integrating the results into a global answer set.
4. What is Data Integration??
Data integration is the combination of technical and
business processes used to combine data from disparate
sources into meaningful and valuable information.
A complete data integration solution delivers
trusted data from a variety of sources.
5. Data Integration Techniques
Manual Data Integration
Middleware Data Integration
Data Virtualization Integration Approach
Data Warehouse Approach
Query Driven Method
6. At present, the mediator/wrapper integration
methods are widely used in ontology based data
integration because they solve the data update
problems of all the previous data integration
method.
Why Query Driven Method??
7. Schema Integration Has Two Major Challenges
Data Integration Challenges
Identification of all portions of schemas
that pertain to the same concept, in a such
a way to unify such different
representations in the global schema.
Identification, analysis and resolution of
the different types of conflicts in different
schema.
8. Query-Driven Method Steps
»Pre-processing of Information Sources
»Analysis of Data Modification Statements
»Analysis of Data Selection Statements
»Conflict Resolution
»Generation of Potential Improvements
9. Query-Driven Approach
This is the traditional approach to integrate
heterogeneous databases. This approach was
used to build wrappers and integrators on top of
multiple heterogeneous databases. These
integrators are also known as mediators.
10. Process of Query-Driven
Approach
»When a query is issued to a client side, a
metadata dictionary translates the query into an
appropriate form for individual heterogeneous
sites involved.
»Now these queries are mapped and sent to the
local query processor.
»The results from heterogeneous sites are
integrated into a global answer set.
12. SEMANTIC MAPPING
»Semantic mapping is a strategy for graphically
representing concepts.
»Semantic maps portray the schematic relations that
compose a concept.
»It assumes that there are multiple relations between a
concept and the knowledge that is associated with the
concept.
Editor's Notes
Wrappers are custom-built programs that transform data from the source native format to something acceptable to the mediator.
A heterogeneous database system is an automated system for the integration of heterogeneous, disparate database management systems to present a user with a single, unified query interface.
In computer science and information science, an ontology is a formal naming and definition of the types, properties, and interrelationships of the entities that really or fundamentally exist for a particular domain of discourse.