Falcon-AO: Results for OAEI 2007


Published on

Presentation given at OM-2007. It describes Falcon-AO, an automatic ontology matching tool, which participates OAEI'2007.

Published in: Technology, Education
1 Like
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Falcon-AO: Results for OAEI 2007

  1. 1. Falcon-AO: Results for OAEI 2007 Wei Hu, Yuanyuan Zhao, Dan Li Gong Cheng , Honghan Wu, and Yuzhong Qu School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
  2. 2. Outline <ul><li>Overview </li></ul><ul><li>Features and Capabilities </li></ul><ul><li>OAEI 2007 Results </li></ul><ul><li>Conclusion </li></ul>
  3. 3. Overview of Falcon-AO <ul><li>Falcon-AO is an automatic ontology matching system to help establish interoperability between (Semantic) Web applications using different while related ontologies. </li></ul><ul><ul><li>Matching Web ontologies described in RDF(S) and OWL </li></ul></ul><ul><ul><li>Implemented in Java </li></ul></ul><ul><ul><li>An open source project under the Apache 2.0 license </li></ul></ul><ul><ul><li>http:// iws.seu.edu.cn /projects/matching/ </li></ul></ul>
  4. 4. History of Falcon-AO – v0.3 (2005) 2005 Linguistic Matching + Graph Matching LMO + GMO
  5. 5. History of Falcon-AO – v0.6 (2006) 2005 Linguistic Matching + Graph Matching + Partition-based V-Doc + GMO + I-Sub + PBM 2006
  6. 6. History of Falcon-AO – v0.7 (2007) Linguistic Matching + Graph Matching + Partition-based V-Doc + GMO + I-Sub + PBM 2006 2005 2007
  7. 7. What’s New in Falcon-AO v0.7 <ul><li>Bug repair </li></ul><ul><li>Optimization </li></ul><ul><li>Open source </li></ul>
  8. 8. Architecture <ul><li>Model Pool </li></ul><ul><li>Parsing ontologies into in-memory models using Jena </li></ul><ul><li>Adjusting models using coordination rules </li></ul>
  9. 9. Architecture (cont’d.) <ul><li>Matcher Library </li></ul><ul><li>V-Doc, I-Sub: two light- weighted linguistic matchers </li></ul><ul><li>GMO: an iterative structural matcher </li></ul><ul><li>PBM: partition-based block matching of large ontologies </li></ul>
  10. 10. Architecture (cont’d.) <ul><li>Alignment Set </li></ul><ul><li>Generating alignments using a widely-accepted RDF/XML format </li></ul><ul><li>Evaluating generated alignments against reference alignments </li></ul>
  11. 11. Architecture (cont’d.) <ul><li>Central Controller </li></ul><ul><li>Configuring matching strategies </li></ul><ul><li>Executing matchers </li></ul><ul><li>Combining similarity in terms of the measures of the linguistic and structural comparability </li></ul>
  12. 12. Architecture (cont’d.) <ul><li>Repository </li></ul><ul><li>Storing useful data during the matching process </li></ul>
  13. 13. Outline <ul><li>Overview </li></ul><ul><li>Features and Capabilities </li></ul><ul><li>OAEI 2007 Results </li></ul><ul><li>Conclusion </li></ul>
  14. 14. Features and Capabilities – V-Doc <ul><li>V-Doc – constructing virtual documents for ontology matching </li></ul><ul><ul><li>A light-weighted linguistic matcher </li></ul></ul><ul><ul><li>The virtual document of a domain entity in an ontology contains: </li></ul></ul><ul><ul><ul><li>its local description, such as labels or comments </li></ul></ul></ul><ul><ul><ul><li>its neighboring information </li></ul></ul></ul><ul><ul><li>Document similarity is calculated via the TF/IDF technique. </li></ul></ul>
  15. 15. Features and Capabilities – GMO <ul><li>GMO – graph matching for ontologies </li></ul><ul><ul><li>An iterative structural matcher </li></ul></ul><ul><ul><li>Using RDF bipartite graphs to represent ontologies </li></ul></ul><ul><ul><li>Computing structural similarity between domain entities and between statements in ontologies by recursively propagating similarity in the bipartite graphs </li></ul></ul><ul><ul><li>Ontologies + external mappings  GMO  additional mappings </li></ul></ul>
  16. 16. Features and Capabilities – PBM <ul><li>PBM – partition-based block matching of large-scale ontologies </li></ul><ul><ul><li>Based on the divide-and-conquer idea </li></ul></ul><ul><ul><li>Advantages: </li></ul></ul><ul><ul><ul><li>Avoiding OUT-OF-MEMORY </li></ul></ul></ul><ul><ul><ul><li>Decreasing the runtime </li></ul></ul></ul>
  17. 17. Features and Capabilities – Coordination Rules <ul><li>Heterogeneous ways in expressing ontologies </li></ul><ul><li>Falcon-AO implements 21 coordination rules to eliminate “useless” axioms and reduce structural heterogeneity. </li></ul><ul><ul><li>Removing redundant and worthless statements </li></ul></ul><ul><ul><ul><li>owl:imports </li></ul></ul></ul><ul><ul><li>Complementing statements </li></ul></ul><ul><ul><ul><li>owl:inverseOf </li></ul></ul></ul><ul><ul><li>Reconstructing the RDF List structure </li></ul></ul><ul><ul><ul><li>Replacing RDF collection vocabularies (rdf:first …) with rdfs:member </li></ul></ul></ul>
  18. 18. Features and Capabilities – Similarity Combination <ul><li>Falcon-AO develops an approach to automatically tune the thresholds of different matchers based on the measures of </li></ul><ul><ul><li>The linguistic comparability: </li></ul></ul><ul><ul><ul><li># mappings found by I-Sub / min (#domain entities in two ontologies) </li></ul></ul></ul><ul><ul><li>The structural comparability (confidence): </li></ul></ul><ul><ul><ul><li># mappings found by GMO / #mappings found by I-Sub or V-Doc </li></ul></ul></ul><ul><li>General scheme </li></ul><ul><ul><li>Higher comparability , lower cutoff </li></ul></ul><ul><ul><li>Unmatchable if both lower </li></ul></ul>
  19. 19. Outline <ul><li>Presentation of the system </li></ul><ul><li>Features and capabilities </li></ul><ul><li>OAEI 2007 Results </li></ul><ul><li>Conclusion </li></ul>
  20. 20. Towards a Generic Practical Tool 5 min for all Benchmark 40 min Library 3 min for all 91 tasks Conference 1.2 h GEMET vs. NALT Environment GEMET vs. AGROVOC 33 min 5.75 h Food 2 min for all 4,639 tasks Directory
  21. 21. Outline <ul><li>Presentation of the system </li></ul><ul><li>Features and capabilities </li></ul><ul><li>OAEI 2007 Results </li></ul><ul><li>Conclusion </li></ul>
  22. 22. Conclusion <ul><li>Falcon-AO (v0.7) performs well on most of matching tasks. </li></ul><ul><li>In the near future, we are looking forward to extending Falcon-AO to: </li></ul><ul><ul><li>Integrating new matchers, especially instance-based approaches </li></ul></ul><ul><ul><li>Discovering mappings between ontologies and RDB schemas </li></ul></ul><ul><ul><li>Supporting data transformation by deriving executable mappings from schema matching </li></ul></ul>
  23. 23. Thanks for your attention! Any comments are welcome! Welcome to Beijing 2008!