Case acquisition from text: Ontology-based information extraction with SCOOBIE for myCBR

myCBR is a freely available tool for rapid prototyping of similarity-based retrieval applications such as case-based product recommender systems. It provides easy-to-use model generation, data import, similarity modelling, explanation, and testing functionality, together with comfortable graphical user interfaces. SCOOBIE is an ontology-based information extraction system that uses symbolic background knowledge to extract information from text. Extraction results depend on existing knowledge fragments. In this paper we show how to use SCOOBIE to generate cases from texts. More concretely, we use ontologies of the Web of Data, published as so-called Linked Data and interlinked with myCBR’s case model. We present a way of formalising a case model as a Linked Data-ready ontology and of connecting it with other ontologies of the Web of Data in order to obtain richer cases.

Usage rights: CC Attribution-ShareAlike License

    Presentation Transcript

    • Competence Center Case-Based Reasoning. CASE ACQUISITION FROM TEXT: ONTOLOGY-BASED INFORMATION EXTRACTION WITH SCOOBIE FOR MYCBR. Thomas Roth-Berghofer, Benjamin Adrian, and Andreas Dengel, German Research Center for Artificial Intelligence (DFKI GmbH). Thursday, 5 August 2010
    • COMPETENCE CENTER CASE-BASED REASONING (CC CBR): Klaus-Dieter Althoff, Thomas Roth-Berghofer, Armin Stahl, Kerstin Bach, Régis Newo. © 2010 DFKI CC CBR
    • MOTIVATION: SCOOBIE performs ontology-based information extraction, taking ontologies and texts as input and producing RDF. [Figure: Linked Data cloud diagram of interlinked datasets (e.g. DBpedia, Freebase, WordNet, UniProt, PubMed), as of July 2009.]
    • OVERVIEW • Ontology-based Information Extraction with SCOOBIE • Recap of myCBR • myCBR + SCOOBIE • Outlook and future work
    • SCOOBIE: ONTOLOGY-BASED INFORMATION EXTRACTION
    • SCOOBIE [Figure: Ontologies and Texts feed Ontology-based Information Extraction, which outputs RDF.]
    • EXTRACT PLAIN TEXT [Figure: screenshot, not transcribed.]
    • EXTRACT TOKENS [Figure: screenshots, not transcribed.]
    • RECOGNISE SYMBOLS [Figure: screenshots, not transcribed.]
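The two pipeline steps named on these slides, token extraction and symbol recognition, can be illustrated with a minimal sketch. This is not SCOOBIE's implementation: the lexicon contents, the function names, and the greedy longest-match strategy are all assumptions made for illustration. Only the `Recipe#` URIs are taken from the slides.

```python
import re

# Hypothetical label-to-URI lexicon. A real system would derive it from
# the labels found in an ontology; this hand-written one is an assumption.
LEXICON = {
    "shallots": "http://mycbr-project.net/models/Recipe#Shallots",
    "onions": "http://mycbr-project.net/models/Recipe#Onions",
    "velveeta cheese": "http://mycbr-project.net/models/Recipe#velveeta_cheese",
}


def extract_tokens(text):
    """Split plain text into lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def recognise_symbols(tokens, lexicon, max_len=3):
    """Greedy longest-match lookup of token n-grams in the lexicon."""
    found = []
    i = 0
    while i < len(tokens):
        # Try the longest phrase first so multi-word labels win.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in lexicon:
                found.append((phrase, lexicon[phrase]))
                i += n
                break
        else:
            i += 1  # no label starts at this token
    return found


tokens = extract_tokens("Saute the shallots and onions, then add Velveeta cheese.")
symbols = recognise_symbols(tokens, LEXICON)
```

Matching the longest phrase first keeps a multi-word label such as "velveeta cheese" from being split into unrelated single-word matches.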
    • RECAP: MOTIVATION FOR DEVELOPING myCBR • Need for a freely available “out of the box” tool: • compact and easy to use • comfortable graphical user interface for: defining case representations; modeling knowledge-intensive similarity measures; testing of retrieval functionality • support for rapid prototyping • adaptable & extendable
    • ➜ ECCBR 2008: Armin Stahl and Thomas R. Roth-Berghofer. Rapid prototyping of CBR applications with the open source tool myCBR. In Ralph Bergmann and Klaus-Dieter Althoff, editors, Advances in Case-Based Reasoning. Springer Verlag, 2008.
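As a rough illustration of what similarity-based retrieval means here, the sketch below ranks cases by a weighted sum of per-attribute (local) similarities. It does not use myCBR's API; the attribute names, the similarity table, the weights, and the value range are invented for the example.

```python
def numeric_similarity(a, b, value_range):
    """Local similarity for numbers: 1.0 when equal, falling linearly to 0.0."""
    return max(0.0, 1.0 - abs(a - b) / value_range)


# Hypothetical symbolic similarity table for an ingredient attribute.
SYMBOL_SIM = {
    ("onions", "shallots"): 0.8,
    ("onions", "noodles"): 0.1,
}


def symbol_similarity(a, b):
    """Local similarity for symbols, looked up symmetrically in the table."""
    if a == b:
        return 1.0
    return SYMBOL_SIM.get((a, b), SYMBOL_SIM.get((b, a), 0.0))


def global_similarity(query, case, weights):
    """Weighted average of the local similarities."""
    score = weights["ingredient"] * symbol_similarity(
        query["ingredient"], case["ingredient"])
    score += weights["minutes"] * numeric_similarity(
        query["minutes"], case["minutes"], 60)
    return score / sum(weights.values())


def retrieve(query, case_base, weights):
    """Return the cases ordered from most to least similar to the query."""
    return sorted(case_base,
                  key=lambda c: global_similarity(query, c, weights),
                  reverse=True)


case_base = [
    {"name": "noodle salad", "ingredient": "noodles", "minutes": 20},
    {"name": "shallot soup", "ingredient": "shallots", "minutes": 30},
]
query = {"ingredient": "onions", "minutes": 30}
weights = {"ingredient": 2.0, "minutes": 1.0}
ranked = retrieve(query, case_base, weights)
```

The symbolic table is where domain knowledge enters: it encodes that shallots are a closer substitute for onions than noodles are.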
    • SEMANTIC WEB VISION: “The Semantic Web is an extension of the current Web in which information is given well-defined meaning, better enabling computers and people to work in cooperation.” (T. Berners-Lee, J. Hendler, O. Lassila, “The Semantic Web”, Scientific American, May 2001) • Web of content • Web pages linked by semantic relations • Machines are able to process contents and links
    • WEB OF DATA • Characteristics: • Expressed in RDF • Identified by URIs • Accessible via HTTP
    • WEB OF TRIPLES • Characteristics: • Expressed in RDF • Identified by URIs • Accessible via HTTP • Example:
      <rdf:Description rdf:about="http://dbtropes.org/resource/Main/Ratatouille#Remy">
        <does-not-like rdf:resource="http://mycbr-project.net/models/Recipe#velveeta_cheese"/>
      </rdf:Description>
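To make the subject, predicate, object structure of the example concrete, here is a minimal sketch that reads the slide's RDF/XML with Python's standard `xml.etree.ElementTree`. The slide omits namespace declarations, so the `rdf:RDF` wrapper and the default namespace `http://example.org/vocab#` are assumptions added to make the snippet parseable. A full RDF parser handles many more syntax forms than this does.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# The slide's snippet wrapped in an rdf:RDF root. The default namespace
# (http://example.org/vocab#) is an assumption; the slide declares none.
DOCUMENT = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns="http://example.org/vocab#">
  <rdf:Description
      rdf:about="http://dbtropes.org/resource/Main/Ratatouille#Remy">
    <does-not-like
        rdf:resource="http://mycbr-project.net/models/Recipe#velveeta_cheese"/>
  </rdf:Description>
</rdf:RDF>
"""


def read_triples(rdf_xml):
    """Collect (subject, predicate, object) for resource-valued properties."""
    root = ET.fromstring(rdf_xml.strip())
    triples = []
    for description in root.findall(f"{{{RDF_NS}}}Description"):
        subject = description.get(f"{{{RDF_NS}}}about")
        for prop in description:
            obj = prop.get(f"{{{RDF_NS}}}resource")
            if obj is not None:
                # ElementTree tags look like "{namespace}local";
                # dropping the braces yields the predicate URI.
                predicate = prop.tag.replace("{", "").replace("}", "")
                triples.append((subject, predicate, obj))
    return triples


triples = read_triples(DOCUMENT)
```

Each triple is a plain tuple of three URIs, which is all the "Web of triples" slide claims: every part of the statement is identified by a URI.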
    • USING LINKED DATA FOR CASE GENERATION [Figure: Linked Data cloud (as of July 2009), joined to the myCBR case model by a connection model of owl:sameAs links; Texts → Ontology-based Information Extraction → Case Base.]
      Case model (excerpt):
        <skos:Concept rdf:about="http://mycbr-project.net/models/Recipe#Shallots">
          <skos:prefLabel>Shallots</skos:prefLabel>
          <rdf:type rdf:resource="ingredients_vegetables"/>
        </skos:Concept>
        <skos:Concept rdf:about="http://mycbr-project.net/models/Recipe#Onions">
          <skos:prefLabel>Onions</skos:prefLabel>
          <rdf:type rdf:resource="ingredients_vegetables"/>
        </skos:Concept>
      Connection model (excerpt):
        <http://mycbr-project.net/models/Recipe#onions> owl:sameAs <http://dbpedia.org/resource/Onion>
        <http://mycbr-project.net/models/Recipe#green_fettuccine> owl:sameAs <http://dbpedia.org/resource/Fettucine>
        <http://mycbr-project.net/models/Recipe#spinach_noodles> owl:sameAs <http://dbpedia.org/resource/Noodle>
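One way to read the connection model is as a lookup that turns entities recognised against a Linked Data source into values of the case model. The sketch below assumes exactly that. The three owl:sameAs pairs come from the slides; the function names, the symmetric index, and the sample input list (including the unlinked `Cheese` resource) are hypothetical.

```python
CASE_PREFIX = "http://mycbr-project.net/models/Recipe#"

# owl:sameAs pairs as listed on the slides (case-model URI, DBpedia URI).
SAME_AS = [
    (CASE_PREFIX + "onions", "http://dbpedia.org/resource/Onion"),
    (CASE_PREFIX + "green_fettuccine", "http://dbpedia.org/resource/Fettucine"),
    (CASE_PREFIX + "spinach_noodles", "http://dbpedia.org/resource/Noodle"),
]


def build_index(pairs):
    """owl:sameAs is symmetric, so index every pair in both directions."""
    index = {}
    for left, right in pairs:
        index.setdefault(left, set()).add(right)
        index.setdefault(right, set()).add(left)
    return index


def to_case_values(extracted_uris, index, case_prefix=CASE_PREFIX):
    """Map extracted URIs to case-model value names; drop unlinked ones."""
    values = []
    for uri in extracted_uris:
        candidates = {uri} | index.get(uri, set())
        for candidate in sorted(candidates):
            if candidate.startswith(case_prefix):
                values.append(candidate[len(case_prefix):])
    return values


index = build_index(SAME_AS)
# Hypothetical extraction result: two linked entities and one unlinked one.
extracted = [
    "http://dbpedia.org/resource/Onion",
    "http://dbpedia.org/resource/Noodle",
    "http://dbpedia.org/resource/Cheese",
]
case_values = to_case_values(extracted, index)
```

Entities without an owl:sameAs link are silently dropped here, which shows why the coverage of the connection model bounds how rich the generated cases can be.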
    • myCBR 3 • Improved UI based on Rich Client Platform • Use of Perspectives, e.g., for text-to-case transformation via SCOOBIE • Plus SDK • Import of myCBR 2.6.x files • …
    • myCBR 3 PREVIEW • Configuration of source XML file • Assignment of attribute to XML path • Copy or information extraction [Figure: series of screenshots, not transcribed.]
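The three configuration steps listed for the preview can be sketched as follows: pick a source XML file, assign each case attribute to an XML path, and choose per attribute whether the text is copied or passed through information extraction. This is not myCBR 3 code. The XML layout, the mapping format, and the keyword-based `extract` stand-in are assumptions for illustration only.

```python
import xml.etree.ElementTree as ET

# Hypothetical source document.
SOURCE = """
<recipes>
  <recipe>
    <title>Onion soup</title>
    <text>Slice the onions and shallots.</text>
  </recipe>
</recipes>
"""

# Hypothetical assignment: attribute name -> (XML path, mode).
MAPPING = {
    "title": ("title", "copy"),
    "ingredients": ("text", "extract"),
}

# Stand-in for ontology-based extraction: a fixed set of known labels.
KNOWN_INGREDIENTS = {"onions", "shallots"}


def extract(text):
    """Return the known ingredient labels mentioned in the text."""
    words = (w.strip(".,;").lower() for w in text.split())
    return sorted({w for w in words if w in KNOWN_INGREDIENTS})


def build_cases(xml_text, mapping, record_path="recipe"):
    """Create one case (a dict) per record element in the XML source."""
    root = ET.fromstring(xml_text.strip())
    cases = []
    for record in root.findall(record_path):
        case = {}
        for attribute, (path, mode) in mapping.items():
            raw = record.findtext(path, default="")
            case[attribute] = raw if mode == "copy" else extract(raw)
        cases.append(case)
    return cases


cases = build_cases(SOURCE, MAPPING)
```

Keeping the copy-or-extract choice in the mapping means structured fields pass through unchanged while free-text fields are the only ones that depend on extraction quality.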
    • OUTLOOK AND FUTURE WORK
    • ITERATIVE IMPROVEMENT OF CONNECTION MODEL [Figure: Linked Data cloud (as of July 2009), owl:sameAs connection model, and myCBR case model; Texts → Ontology-based Information Extraction → Case Base.]
    • Competence Center Case-Based Reasoning. Thank you! CASE ACQUISITION FROM TEXT: ONTOLOGY-BASED INFORMATION EXTRACTION WITH SCOOBIE FOR MYCBR. Thomas Roth-Berghofer, Benjamin Adrian, and Andreas Dengel, German Research Center for Artificial Intelligence (DFKI GmbH). http://mycbr-project.net http://www.dfki.de/~roth @thorob67