Nuxeo Iks 2009 11 13

Short introductory slides presenting some of the work done on the Scribo project to extract named entities from text documents with a UIMA engine.

Published in: Technology, News & Politics

  1. Olivier Grisel - 2009-11-13 - IKS Semantic Lifting: Named Entities Extraction with UIMA (Thursday, November 12, 2009)
  2. Nuxeo • Open Source ECM • Nuxeo DM 5.3 available • office document management with workspaces • download it at http://nuxeo.com • Soon: Nuxeo DAM • Multimedia content • Full ajax search based browsing
  3. http://SCRIBO.ws • Goal: content to knowledge using ontologies • 3 academic research teams • 2 NLP startups • 2 Open Source ECM / Wiki software editors • 2 use case providers: • News agency • Linux distribution
  4. UIMA • Chain components to extract annotations on text and images • Initially developed by IBM • Now an Apache Software Foundation project • Several existing components (OpenNLP, ClearTK, ...) • Easy to wrap new libraries as UIMA annotators
  5. Scribo UIMA chain
  6. Scribo UIMA chain editor
  7. Embedded UIMA chain
  8. It’s Open Source • Clone it! • http://hg.nuxeo.org/sandbox/scribo • http://hg.nuxeo.org/sandbox/nuxeo-uima • Give me feedback! • http://twitter.com/ogrisel
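
The "chain components to extract annotations" idea from the UIMA slide can be illustrated with a toy sketch. This is not the UIMA API and not the Scribo code: `Document`, `Annotation`, `token_annotator`, `capitalized_entity_annotator` and `run_chain` are hypothetical names, and the "named entity" rule (runs of capitalized tokens) is a deliberately naive stand-in for a real extractor. It only shows the general shape: each component reads the shared document and adds stand-off annotations that later components can build on.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Annotation:
    """Stand-off annotation: a typed span over the document text."""
    type: str
    begin: int
    end: int


@dataclass
class Document:
    """Shared container passed down the chain (hypothetical, loosely
    analogous to the role of the analysis structure in UIMA)."""
    text: str
    annotations: list = field(default_factory=list)

    def covered_text(self, ann):
        return self.text[ann.begin:ann.end]

    def select(self, type_name):
        return [a for a in self.annotations if a.type == type_name]


def token_annotator(doc):
    # First component: mark every word-like span as a Token.
    for m in re.finditer(r"\w+", doc.text):
        doc.annotations.append(Annotation("Token", m.start(), m.end()))


def capitalized_entity_annotator(doc):
    # Second component: relies on the Token annotations added earlier.
    # Naive assumption: a run of consecutive capitalized tokens is an entity.
    run = []
    for tok in doc.select("Token") + [None]:  # None flushes the last run
        if tok is not None and doc.covered_text(tok)[0].isupper():
            run.append(tok)
            continue
        if run:
            doc.annotations.append(
                Annotation("NamedEntity", run[0].begin, run[-1].end))
            run = []


def run_chain(text, annotators):
    # Apply each annotator in order to the same document.
    doc = Document(text)
    for annotate in annotators:
        annotate(doc)
    return doc


doc = run_chain(
    "Olivier Grisel works at Nuxeo in Paris.",
    [token_annotator, capitalized_entity_annotator],
)
entities = [doc.covered_text(a) for a in doc.select("NamedEntity")]
print(entities)
```

Wrapping a new library in this style means writing one more function that reads existing annotations and appends its own. The capitalization rule would also flag ordinary sentence-initial words, which is one reason real chains plug in trained components instead.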
