Semantic data integration proof of concept - Presentation Transcript
Semantic Data Integration
I6 Core Group
Nic Bertrand
Herbert Schentz
LTER-Europe Conference, Mallorca, Dec. 2008
Overview
■Testing goals
■Test Architecture
■Results
■Outlook: Applicability for LTER-Europe
Architecture
Goal:
Enable seamless access to distributed data
Allow local data analysis for all members with their own tools
Distributed Distributed
Socio-Ecological Data mining
Data
with local tools
Portal
See all data as if it came from ONE Data Source
Longer term vision
Extend seamless access to distributed services (SOA)
Allow local data analysis for all members with their own tools and common services
Distributed
Distributed
Data Mining
Socio-ecological With local tools
Data
Distributed
Applications
See all data as if data came from ONE Data Source processed within ONE application
Role of Ontology
SERONTO
common concepts
and structures
Portal
Distributed Socio- Distributed Data
Ecological Data Mining
with local tools
SERONTO: basis to discover, retrieve and integrate
distributed heterogenous data
Testing... Why?
■To validate the use of SERONTO for data
integration of ALTER-Net and LTER Europe
■Test the feasibility of mapping REAL ecological
data to SERONTO
■Test the querying of the connected
database(s) from the semantic concepts in
SERONTO
Proof of concept:
Acceptance Criteria
• The databases must have different structures and must have been developed
independently of SERONTO;
• The databases must feature reference lists (e.g. species lists);
• The database structures must not be altered as a result of the integration work;
• New concepts may be imported into SERONTO as and when required;
• The databases must contain data relevant to Long Term Ecological Research (e.g.
vegetation surveys, records of species occurrences, measurement of biotic and
abiotic components).
Testing: Connecting 5 databases
2835
floodplain More about the databases:
Independently developed,
Pythia Not developed for the purpose of data
vegetation integration
JOKL Different data models
cultural Different languages
landscapes Similar data types collected in ALTER-Net,
Some obvious integration points
ECN Summary
(e.g. Vegetation)
Database
JODI
vegetation
SERONTO
Data Integration using SERONTO
Connect Databases
Import
Ontology
SERONTO Query
Results
Getting value sets back
SERONTO
parameter method
parameter_method
Value_sets Unit
Scale
Data Integration Results
➢ Import SERONTO and Units Ontologies into Ontostudio
SERONTO
Data Integration Results
import diverse ecological databases
2835
floodplain
Pythia
vegetation
ECN Summary
Database
JOKL
cultural
landscapes
JODI
vegetation
12
Data Integration Results
Extend SERONTO Classes (SERONTO Core does not
Using the content of the databases contain domain specific concepts)
Map databases to SERONTO (Simple and complex mappings)
Query individual databases directly
Query multiple databases from the SERONTO (Simple and Complex queries)
Map once, reuse data many times, querying does not require
knowledge of the structures of the databases
Semantic data integration is possible 13
Open Questions
<?xml version=\"1.0\" encoding=\"UTF-8\"?>
<flg:flogic xmlns:flg=\"http://www.wsmo.org/2004/d16/d16.2/v0.1/\">
<!-- Test data to test the WSML F-Logic XML syntax -->
SERONTO
<!-- The following <rule></rule> encodes this fact (taken from the F-Logic
JACM paper, page 7):
bob[name -> \"Bob\";
age -> 40;
affiliation -> cs1[dname -> \"CS\";
mngr -> bob;
Core
assistents -> {john, sally}]
this encoding writes only elementary molecules
-->
<rule>
<head>
<molecule>
<object>
<constant name=\"bob\"/>
</object>
<superclass isaType=\":\">
Databases
<class>
<constant name=\"empl\"/>
</class>
</superclass>
<methodSpec arrow=\"->\">
<name>
<constant name=\"name\"/>
</name>
?
<result>
<oid>
<constant name=\"\"Bob\"\"/>
</oid>
</result>
domain ontologies
Query
Portal
Performance
Possible uses for LTER Europe
SERONTO & Domain Ontologies
common concepts
and domain knowledge
Portal
Distributed Socio- Distributed Data
Ecological Data Mining
with local tools
Seamless access... Ready for use now
0 comments
Post a comment