Ontology Alignment using Linked Data

  • 1,112 views
Uploaded on

A mini project by 'Don't move the plants' at the 8th Summer School on Ontology Engineering and the Semantic Web 2011. This Project was completed by: Andrea Nuzzolese, Esther Lozano, Ferdinand …

A mini project by 'Don't move the plants' at the 8th Summer School on Ontology Engineering and the Semantic Web 2011. This Project was completed by: Andrea Nuzzolese, Esther Lozano, Ferdinand Dhombres, Luca Greco and Tim Hodson.

More in: Education , Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
1,112
On Slideshare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
30
Comments
0
Likes
2

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • Lack of ontological resource : \n
  • Lack of ontological resource : \n
  • Lack of ontological resource : \n
  • Lack of ontological resource : \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n

Transcript

  • 1. Ontology Alignment Discovery using Linked Open Data Esther LOZANO, Andrea NUZZOLESE, Luca GRECO, Tim HODSON, Ferdinand DHOMBRES JE1: What linked data tells us about ontology relations?
  • 2. Objective • Explore dataset links in the LOD Cloud to infer alignments between ontologies Sussex St. Reading Andrews NDL Audio- Lists Resource subjects t4gm MySpace scrobbler Lists Moseley (DBTune) (DBTune) RAMEAU Folk NTU SH lobid GTAA Plymouth Resource Lists Organi- Reading Lists sations Music The Open ECS Magna- Brainz Music DB tune Library LCSH South- (Data Brainz LIBRIS ampton Tropes lobid Ulm Incubator) (zitgist) Man- EPrints Resources chester Surge Reading biz. Music RISKS Radio Lists The Open ECS data. John Brainz Discogs Library PSH Gem. UB South- gov.uk Peel (DBTune) FanHubz (Data In- (Talis) Norm- Mann- ampton (DB cubator) Jamendo datei heim RESEX Tune) Popula- Poké- DEPLOY Last.fm tion (En- pédia Artists Last.FM Linked RDF AKTing) research EUTC (DBTune) (rdfize) LCCN VIAF Book Wiki data.gov Produc- Pisa Eurécom P20 Mashup semantic NHS .uk tions classical web.org (EnAKTing) Pokedex (DB Mortality Tune) PBAC ECS (En- AKTing) BBC MARC (RKB Budapest Program Codes Explorer) Energy education OpenEI BBC List Semantic Lotico Revyu OAI (En- CO2 data.gov mes Music Crunch SW AKTing) (En- .uk Chronic- Linked Dog NSZL Base AKTing) ling Event- MDB RDF Food IRIT America Media Catalog ohloh BBC DBLP ACM IBM Good- BibBase Ord- Wildlife (RKB Openly Recht- win nance Finder Explorer) Local spraak. Family DBLPlegislation Survey Tele- New VIVO UF .gov.uk nl graphis York flickr (L3S) New- VIVO castle Times URI wrappr Open Indiana RAE2001 UK Post- Burner Calais DBLP codes statistics (FU VIVO CiteSeer Roma data.gov LOIUS Taxon iServe Berlin) IEEE .uk Cornell Concept Geo World data ESD Fact- OS dcs Names book dotAC stan- reference Project Linked Data NASA (FUB) Freebase dards data.gov Guten- .uk for Intervals (Data GESIS Course- transport DBpedia berg STW ePrints CORDIS Incu- ware data.gov bator) (FUB) Fishes ERA UN/ .uk of Texas Geo LOCODE Uberblic Euro- Species The stat dbpedia TCM SIDER Pub KISTI (FUB) lite Gene STITCH Chem JISC London Geo KEGG DIT LAAS Gazette TWC LOGD Linked Daily OBO Drug Eurostat Data UMBEL lingvoj Med (es) Disea- YAGO Medi some Care ChEBI KEGG NSF Linked KEGG KEGG Linked Drug Cpd GovTrack rdfabout Glycan Sensor Data CT Bank Pathway US SEC Open Reactome (Kno.e.sis) riese Uni Cyc Lexvo Path- way PDB Media Semantic totl.net Pfam HGNC XBRL WordNet KEGG KEGG Geographic Linked Taxo- CAS Reaction Twarql (VUA) UniProt Enzyme rdfabout EUNIS Open nomy US Census Publications Numbers PRO- ProDom SITE Chem2 UniRef Bio2RDF User-generated content Climbing WordNet SGD Homolo Linked (W3C) Affy- Gene GeoData Cornetto metrix Government PubMed Gene UniParc Ontology GeneID Cross-domain Airports Product DB UniSTS MGI Gen Life sciences Bank OMIM InterPro As of September 2010 http://richard.cyganiak.de/2007/10/lod/lod- http://www.webology.org/2006/v3n3/images/sample.JPG datasets_2010-09-22_colored.pdf
  • 3. Methods• Find appropriate datasets in the LOD CLOUD (with an ontology and a SPARQL Endpoint)• Retrieve/build linksets between datasets• Generate new graph describing candidate alignments• Infer mappings between ontologies• Alignment evaluation
  • 4. PrincipleClass C1 owl:equivalentClass C2Individual D1 owl:sameAs D2
  • 5. PrincipleClass C1 owl:equivalentClass C2Individual D1 owl:sameAs D2
  • 6. PrincipleProperty isPossibleDrug rdfs:domain owl:equivalentClass rdfs:range rdfs:subClassOf healthClass Antalgic disease condition AspirinIndividual isPossibleDrug hangover (500 mg)
  • 7. PrincipleProperty isPossibleDrug rdfs:domain owl:equivalentClass rdfs:range rdfs:subClassOf healthClass Antalgic disease condition AspirinIndividual isPossibleDrug hangover (500 mg)
  • 8. From links to candidatesA metric and a threshold to identify potential alignments Class C1 owl:equivalentClass C2 D1 owl:sameAs D2 D1 Individual D1 owl:sameAs D2 D2 D1 D1 D1" owl:sameAs D2"
  • 9. From links to candidatesA metric and a threshold to identify potential alignments Class C1 owl:equivalentClass C2 P1 P2 D1 owl:sameAs D2 D1 Individual D1 owl:sameAs D2 D2 D1 D1 D1" owl:sameAs D2" P1=0.5 P2=0.75
  • 10. From links to candidatesA metric and a threshold to identify potential alignments Class C1 owl:equivalentClass C2 (P1+P2)/2 > x P1 P2 D1 owl:sameAs D2 D1 Individual D1 owl:sameAs D2 D2 D1 D1 D1" owl:sameAs D2" P1=0.5 P2=0.75
  • 11. Results - 1 Material collection issues OS Freebase Project Guten- DBpedia berg STW (FUB) ERA dbpedia lite TCM Gene SIDER STITCH • Datasets without Daily Med DIT Disea- ontology or with one AGO Medi Care some ChEBI class Linked Drug • CT Banktotl.net Uni Path- way Pfam PDB SPARQL Endpoint unavailabilty Taxo- UniProt • nomyUniRef PRO- SITE ProDom Linksets without owl:sameAs • Linksets with only one class
  • 12. Results - 1 Material collection issues OS Freebase Project Guten- DBpedia berg STW (FUB) ERA dbpedia lite TCM Gene SIDER STITCH • Datasets without Daily Med DIT Disea- ontology or with one AGO Medi Care some ChEBI class Linked Drug • CT Banktotl.net Uni Path- way Pfam PDB SPARQL Endpoint unavailabilty Taxo- UniProt • nomyUniRef PRO- SITE ProDom Linksets without owl:sameAs • Linksets with only one class
  • 13. Result - 2 Poké- pédia Linked Linked Movie DB & DBpedia LCCN classical Pokedex (DB • Tune) PBAC MARC Codes List Datasets with good ontologies Linked • Event- MDB NSZL Media Catalog Good- win SPARQL endpoints available • FamilyTimes URI Burner flickr wrappr Open Calais owl:sameAs links available World act- book iServe • Linked individuals belonging to (FUB) Freebase DBpedia different classes Uberblic dbpedia TCM lite Gene DIT Daily Med
  • 14. Result - 3 Poké- Correlation pédia Linked LCCN Linked classical DBpedia (DB Tune) Pokedex PBAC MDB MARC Codes List Artist Linked NSZL Actor 1869 Event- Media MDB Catalog 83847 Good- win Family 1735 Actor 37751Times URI Burner flickr wrappr Open Calais 50603 2027 Person World iServe 363751 act- book (FUB) Freebase 30 ... DBpedia Comedian Uberblic dbpedia TCM 675 lite Gene DIT Daily Med Movie Film
  • 15. Result - 3 Poké- Correlation pédia Linked LCCN Linked classical DBpedia (DB Tune) Pokedex PBAC MDB MARC Codes List Event- Linked MDB NSZL Catalog Actor 2.95 Artist 2.2 % Media 3.7 % Good- win Family 3.4 % 4 Actor 4.6 % flickrTimes URI wrappr Open 2.0 % 2.3 Burner Calais Person 0.6 % iServe World act- book Freebase 0.06 % 2.23 ... (FUB) DBpedia Uberblic Comedian 4.4 % dbpedia TCM lite Gene DIT Daily Med Movie Film
  • 16. EvaluationONTOLOGIES 0 alignment ext rac tor DATASETS ALIGNMENT 2 alignments
  • 17. EvaluationONTOLOGIES matcher ALIGNMENT 0 alignment validation ext rac tor DATASETS ALIGNMENT 2 alignments
  • 18. Conclusion• Quality issue in datasets• Existing ontology matcher was not effective• New alignments discovered using LOD• Approach could be used where natural languages of ontologies differ• Assessment of the approach has to be made (other datasets, ontologies, matchers)
  • 19. Thank you