PoolParty Thesaurus Management Quick Overview


PoolParty is a thesaurus management system and SKOS editor for the Semantic Web that includes text mining and linked data capabilities. Its easy-to-use interface helps users build and maintain multilingual thesauri. PoolParty Server provides semantic services for integrating semantic search or recommender systems into systems such as CMS, DMS, CRM or wikis.

Published in: Technology

  1. PoolParty: Thesaurus Management, Semantic Search, Linked Data (Andreas Blumauer)
  2. Company profile
     - Established in 1998
     - Located in Vienna, Austria
     - 15 employees
     - Main areas of work
       - Search engines & knowledge management
       - Wikis & communities
       - Semantic Web & text mining
     - Long-term experience in agile software development
  3. PoolParty at a glance
     - Developed by punkt. netServices
     - Current release: PoolParty 2.7
     - Main focus on three application areas:
       - SKOS thesaurus management
       - Linked data (publishing & consuming)
       - Semantic search & semantic indexing
  4. Challenge for content management
     - Annotation: add meaning to the content
     - Link content: bring content together in a meaningful way
     - Make content searchable: add background knowledge to the content
  5. Traditional approach: putting metadata on top of content
     - Text: "Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices."
     - Tags: Apple, application, merchandise, iPod touch, iPad, iPhone
  6. Semantic Web approach: concepts & relations instead of simple text
     - Text: "Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices."
     - Concept URIs: http://my.com/Apple, http://my.com/iPhone, http://my.com/iPhone3G, http://my.com/smartphone
     - Labels: Apple, Apple Inc., iPhone, iPhone 3GS, iPhone 3G
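
The contrast between slides 5 and 6 can be illustrated with a minimal Python sketch. It is not PoolParty code: the URIs are taken from slide 6, while the data layout, the function names `resolve` and `broader_chain`, and the broader/narrower links (iPhone 3G under iPhone under smartphone) are assumptions made for illustration.

```python
# Hypothetical mini concept model. URIs come from slide 6; the
# "broader" links and the label-to-concept assignments are assumptions.
CONCEPTS = {
    "http://my.com/Apple": {"labels": ["Apple", "Apple Inc."], "broader": None},
    "http://my.com/smartphone": {"labels": ["smartphone"], "broader": None},
    "http://my.com/iPhone": {
        "labels": ["iPhone"],
        "broader": "http://my.com/smartphone",
    },
    "http://my.com/iPhone3G": {
        "labels": ["iPhone 3G"],
        "broader": "http://my.com/iPhone",
    },
}


def resolve(label):
    """Map a label (case-insensitive) to its concept URI, or None."""
    wanted = label.lower()
    for uri, concept in CONCEPTS.items():
        if wanted in (l.lower() for l in concept["labels"]):
            return uri
    return None


def broader_chain(uri):
    """Follow 'broader' links upward and return the visited URIs in order."""
    chain = []
    parent = CONCEPTS[uri]["broader"]
    while parent is not None:
        chain.append(parent)
        parent = CONCEPTS[parent]["broader"]
    return chain
```

With plain tags, "Apple" and "Apple Inc." are two unrelated strings; here both resolve to the same URI, and a search for smartphones could also reach content annotated with the narrower iPhone 3G concept.
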
  7. Comparison of approaches

     | | Traditional approach | Semantic Web approach |
     |---|---|---|
     | Indexing paradigm | Text & phrases | Concepts & named entities |
     | Background knowledge | Not available | Thesauri, linked data |
     | Semantic search | Statistical approaches | Statistical approaches & methods based on knowledge models |
     | Similarity search | Based on clustering & vector models | Based on clustering & vector models enriched with background knowledge |
     | Application scenarios | Out-of-the-box applications, SMEs | Large enterprises, domain-specific search engines |
  8. Thesauri & Semantic Web: SKOS (Simple Knowledge Organization System)
     - W3C standard since 2009
     - Based on Semantic Web standards
     - Open for linking with additional linked data
  9. PoolParty in a nutshell
     - W3C Semantic Web standards: management & maintenance of multilingual (corporate) thesauri & taxonomies on top of Semantic Web standards (SKOS, RDF, OWL & SPARQL)
     - Usability: PoolParty provides an easy-to-use, web-based AJAX user interface
     - Scalable semantic technologies: PoolParty Server is based on an RDF triple store (SAIL), including a reasoner, a (Lucene) index engine and a phrase-extraction component
     - Service-oriented: PoolParty Server offers a Java API & several interfaces: HTTP/REST, SOAP, SPARQL endpoint, linked data
  11. PoolParty GUI
  12. Full compatibility with SKOS/RDF
  13. Some highlights: PoolParty thesaurus management
      - Auto-complete
      - Advanced reporting functionality
      - Import of thesauri (validator!) and CSV files
      - Drag & drop
      - Quality checker
      - Visual browsing and map navigation
      - Document analysis: phrase extraction
      - Enrich concepts by using linked data
      - Publish thesauri as linked data
      - Wiki-style collaborative editing of thesauri
  14. Built-in automatic phrase extraction
      - Support for different formats (HTML, DOC, PDF, PPT, …)
      - Thesaurus-based extraction
      - Can be integrated with CMS, CRM, etc.
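
As a rough illustration of what thesaurus-based extraction means, the following Python sketch counts how often thesaurus labels occur in a text. It is not PoolParty's extraction component: the function name `extract_concepts`, the thesaurus layout and the iPad URI are hypothetical, and a real extractor would also handle file formats, inflection and multilingual labels.

```python
import re
from collections import Counter

# Hypothetical thesaurus: lower-cased label -> concept URI.
# The Apple and iPhone URIs follow slide 6; the iPad URI is made up.
THESAURUS = {
    "apple": "http://my.com/Apple",
    "iphone": "http://my.com/iPhone",
    "ipad": "http://my.com/iPad",
}


def extract_concepts(text, thesaurus):
    """Count whole-word, case-insensitive label matches per concept URI."""
    counts = Counter()
    for label, uri in thesaurus.items():
        # \b keeps "apple" from matching inside longer words.
        pattern = r"\b" + re.escape(label) + r"\b"
        hits = re.findall(pattern, text, flags=re.IGNORECASE)
        if hits:
            counts[uri] += len(hits)
    return counts


TEXT = (
    "Apple is in the process of launching an application to allow iPhone, "
    "iPad and iPod Touch users to purchase Apple merchandise straight from "
    "their devices."
)
CONCEPT_COUNTS = extract_concepts(TEXT, THESAURUS)
```

Because matching is driven by the thesaurus, only known concepts are returned, and several labels of one concept would add up under the same URI.
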
  15. Some applications on top of PoolParty
      - Glossary plugin: import SKOS thesauri, generate glossaries and create automatic links within WordPress
      - Tag recommendation: support users and content managers when annotating text
      - Semantic indexing: PoolParty TagEvent Store as a basis for a semantic index (IndexBuilder)
      - Similarity search: "similarity" is configurable; certain features of a document can be "boosted" (for example persons, places, user tags)
      - Semantic search: the thesaurus can be used for faceted and moderated search (examples: emteba.at, ecoi.net)
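
The idea of configurable similarity with boosted features can be sketched as a weighted overlap score. This is only one possible reading of the slide, not PoolParty's actual algorithm: the function `similarity`, the feature-type names and the use of Jaccard overlap are all assumptions.

```python
def similarity(doc_a, doc_b, boosts):
    """Weighted average of per-feature-type Jaccard overlaps.

    doc_a, doc_b: dicts mapping a feature type (e.g. "places") to a
    collection of values. boosts: dict mapping feature type to a weight.
    Feature types that are empty in both documents are skipped.
    """
    score = 0.0
    total_weight = 0.0
    for feature_type, weight in boosts.items():
        a = set(doc_a.get(feature_type, ()))
        b = set(doc_b.get(feature_type, ()))
        union = a | b
        if not union:
            continue
        score += weight * len(a & b) / len(union)
        total_weight += weight
    return score / total_weight if total_weight else 0.0


# Hypothetical documents described by extracted features.
expert = {"topics": ["wiki", "confluence"], "places": ["sydney"]}
project = {"topics": ["wiki"], "places": ["sydney"]}

unboosted = similarity(expert, project, {"topics": 1.0, "places": 1.0})
place_boosted = similarity(expert, project, {"topics": 1.0, "places": 3.0})
```

Raising the weight for places makes the shared location count for more, so the same pair of documents scores higher when location matters to the application.
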
  16. Similarity search: finding the unexpected…
      - Expert #4532: Senior Product Manager Enterprise Wiki at MitchelLake Consulting in Sydney Area …
      - Project #AZ67: Integration of Confluence, which is a web-based corporate wiki. It is developed and marketed by Atlassian, Australia. …
      - Matches: same topic, near location
  17. PoolParty DemoZone
      - Compare the thesaurus-based approach with the traditional approach
      - Tag recommender
      - Similar documents
      - Find images that fit your document
      - Browser bookmarklet
  18. WordPress Glossary Plugin
      - Automatic generation of glossaries for WordPress blogs
      - SKOS compatibility
      - Automatic link detection and linking to glossary terms
  19. System requirements
      - Server side
        - Windows or UNIX server
        - Java 5 or 6
        - Tomcat 6
        - At least 2 GB of RAM (the more, the better)
      - Client side
        - PoolParty Thesaurus Editor works with Firefox 3.0 (or higher)
  20. Programmatic access via REST
      - getProposedTagsForDocument
      - addTaggingEvent
      - getTagFrequencies
      - addDocumentToSimilarityIndex
      - findSimilarDocuments
      - getConceptSuggestions
      - …
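
To show how such operations might be addressed from client code, here is a minimal Python sketch that only builds request URLs. The operation names are the ones listed on slide 20; the base URL, the URL layout and every parameter name (`documentId`, `limit`) are assumptions, since the slides do not document them.

```python
from urllib.parse import urlencode

# Assumed base URL; the slides do not state the real endpoint layout.
BASE_URL = "http://localhost:8080/poolparty/api"

# Operation names as listed on slide 20 (the slide's list is not exhaustive).
KNOWN_OPERATIONS = {
    "getProposedTagsForDocument",
    "addTaggingEvent",
    "getTagFrequencies",
    "addDocumentToSimilarityIndex",
    "findSimilarDocuments",
    "getConceptSuggestions",
}


def build_request_url(operation, **params):
    """Return a URL for the given operation with sorted query parameters.

    Parameter names are hypothetical; consult the server's API
    documentation for the real ones.
    """
    if operation not in KNOWN_OPERATIONS:
        raise ValueError(f"unknown operation: {operation}")
    url = f"{BASE_URL}/{operation}"
    if params:
        url += "?" + urlencode(sorted(params.items()))
    return url


example_url = build_request_url("findSimilarDocuments", documentId="42", limit=5)
```

The resulting URL could then be fetched with any HTTP client; sorting the parameters keeps the generated URLs stable, which helps with caching and logging.
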
  21. Programmatic access: example emteba.at
  22. Contact
      - punkt. netServices GmbH, Lerchenfelder Guertel 43, A-1160 Wien / Austria, http://www.punkt.at/, http://poolparty.punkt.at/
      - Mag. Andreas Blumauer, Managing Director, [email_address], +43-1-8974122-27