Semi-Automatic Data-Driven  Ontology Construction System Blaz Fortuna, Marko Grobelnik, Dunja Mladenic Jozef Stefan Instit...
How does it work? <ul><li>OntoGen suggests concepts </li></ul><ul><ul><li>Suggestions are generated automatically </li></u...
Main Features <ul><li>Interactive user interface </li></ul><ul><ul><li>User can interact in real-time with the integrated ...
Main view Concept hierarchy List of suggested sub-concepts Ontology visualization Selected concept
Concept suggestion Selected concept Suggested subconcepts Add new concept New concept HCII2007, July 26th Blaz Fortuna, Jo...
Personalized suggestions UK takeovers and mergers The following are additions and deletions to the takeovers and mergers l...
Concept learning Query New Concept Finish HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
Concept’s instances visualization <ul><li>Instances are visualized as points on 2D map  </li></ul><ul><ul><li>The distance...
Concept management Concept’s details Concept’s instance management Selected concept Keywords Selected instance
Adding new documents to ontology New documents Classification of selected document Content of selected document HCII2007, ...
Evaluation <ul><li>First prototype was successfully used in several commercial projects: </li></ul><ul><ul><li>Applied in ...
Future work <ul><li>Tools for suggestion and learning of more complex relations </li></ul><ul><li>Extended support for col...
<ul><li>Questions? Comments? </li></ul>Thank you for listening! HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, ...
Upcoming SlideShare
Loading in...5
×

OntoGen

9,961

Published on

http://ontogen.ijs.si

Published in: Economy & Finance, Education
1 Comment
2 Likes
Statistics
Notes
No Downloads
Views
Total Views
9,961
On Slideshare
0
From Embeds
0
Number of Embeds
8
Actions
Shares
0
Downloads
0
Comments
1
Likes
2
Embeds 0
No embeds

No notes for slide

Transcript of "OntoGen"

  1. 1. Semi-Automatic Data-Driven Ontology Construction System Blaz Fortuna, Marko Grobelnik, Dunja Mladenic Jozef Stefan Institute http://ontogen.ijs.si OntoGen
  2. 2. How does it work? <ul><li>OntoGen suggests concepts </li></ul><ul><ul><li>Suggestions are generated automatically </li></ul></ul><ul><ul><ul><li>… from the text corpus by clustering similar documents </li></ul></ul></ul><ul><ul><ul><li>… based on user query </li></ul></ul></ul><ul><ul><ul><li>… through text corpus map </li></ul></ul></ul><ul><li>User selects appropriate suggestions and adds them to the ontology </li></ul><ul><ul><li>OntoGen helps deciding which suggestions to include </li></ul></ul><ul><ul><ul><li>… by extracting main keywords from the documents </li></ul></ul></ul><ul><ul><ul><li>… with ontology and concept visualizations </li></ul></ul></ul><ul><ul><ul><li>… by list documents behind concepts </li></ul></ul></ul><ul><li>Behind each concept there is a set of documents </li></ul><ul><ul><li>Documents are automatically assigned to concepts </li></ul></ul><ul><ul><li>Document assignments can be edited manually </li></ul></ul>HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  3. 3. Main Features <ul><li>Interactive user interface </li></ul><ul><ul><li>User can interact in real-time with the integrated machine learning and text mining methods </li></ul></ul><ul><li>Concept discovery methods: </li></ul><ul><ul><li>Unsupervised </li></ul></ul><ul><ul><ul><li>System provides suggestions </li></ul></ul></ul><ul><ul><li>Supervised </li></ul></ul><ul><ul><ul><li>Concept learning </li></ul></ul></ul><ul><ul><ul><li>Concept visualization </li></ul></ul></ul><ul><li>Methods for helping at understanding the discovered concepts: </li></ul><ul><ul><li>Keyword extraction </li></ul></ul><ul><ul><ul><li>Generates a list of characteristic keywords of a given concept </li></ul></ul></ul><ul><ul><li>Concept visualization </li></ul></ul><ul><ul><ul><li>Creates a map of documents from a given concept </li></ul></ul></ul><ul><ul><ul><li>Also available as a separate tool named Document Atlas </li></ul></ul></ul><ul><ul><ul><ul><li>http://docatlas.ijs.si </li></ul></ul></ul></ul>HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  4. 4. Main view Concept hierarchy List of suggested sub-concepts Ontology visualization Selected concept
  5. 5. Concept suggestion Selected concept Suggested subconcepts Add new concept New concept HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  6. 6. Personalized suggestions UK takeovers and mergers The following are additions and deletions to the takeovers and mergers list for the week beginning August 19, as provided by the Takeover … Lloyd’s CEO questioned in recovery suit in U.S. Ronald Sandler, chief executive of Lloyd's of London, on Tuesday underwent a second day of court interrogation about … HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia Topics view Countries view
  7. 7. Concept learning Query New Concept Finish HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  8. 8. Concept’s instances visualization <ul><li>Instances are visualized as points on 2D map </li></ul><ul><ul><li>The distance between two instances on the map correspond to their content similarity </li></ul></ul><ul><ul><li>Characteristic keywords are shown for all parts of the map </li></ul></ul><ul><li>User can select groups of instances on the map to create sub-concepts. </li></ul>HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  9. 9. Concept management Concept’s details Concept’s instance management Selected concept Keywords Selected instance
  10. 10. Adding new documents to ontology New documents Classification of selected document Content of selected document HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia Selected document
  11. 11. Evaluation <ul><li>First prototype was successfully used in several commercial projects: </li></ul><ul><ul><li>Applied in multiple domains: business, legislations and digital libraries </li></ul></ul><ul><ul><li>Users were always domain experts with limited knowledge and experience with ontology construction / knowledge engineering </li></ul></ul><ul><ul><li>Valuable data from first trails was used as input for the interface design of the second prototype (the one presented here). </li></ul></ul><ul><li>Feedback from the users of the second prototype </li></ul><ul><ul><li>Main impression was that the tool saves time and is especially useful when working with large collections of documents </li></ul></ul><ul><ul><li>Among main disadvantages were abstraction and unattractive look </li></ul></ul><ul><ul><li>Many users use the program for exploration of the data </li></ul></ul>HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  12. 12. Future work <ul><li>Tools for suggestion and learning of more complex relations </li></ul><ul><li>Extended support for collaborative editing of ontologies </li></ul><ul><li>Easier input of background knowledge </li></ul><ul><li>Improvement of the user interface based on the feedback from user trails and real-world users </li></ul>HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia
  13. 13. <ul><li>Questions? Comments? </li></ul>Thank you for listening! HCII2007, July 26th Blaz Fortuna, Jozef Stefan Institute, Slovenia http://ontogen.ijs.si

×