
Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications


Work on integrating semantic technologies developed in several R&D projects is now progressing at full speed. Expect to see creative new uses of semantic technologies in Nuxeo open source content management products in 2011!

Published in: Technology, Education

Transcript of "Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications"

  1. Nov. 23 2010 - S. Fermigier & O. Grisel, Nuxeo. Semantic ECM @ Nuxeo: A progress report - Nov. 2010
  2. Agenda
      • From ECM to Semantic ECM
      • Scribo & IKS
      • Fise & Apache Stanbol
      • Nuxeo Integration
      • Roadmap for 2011
  3. Nuxeo: from ECM...
  4. Nuxeo: an open source ECM vendor
      • Our Focus is Enterprise Content Management
      • ECM as a Platform for Content Applications
      • Open Source as Efficient Development Model
      • Modern architecture for 21st Century business
      • “Lean, mobile, social, interoperable”
      • A Social Marketplace in action
      • Innovation driven by community of customers, partners, and our core developers
  5. Nuxeo ECM - From Platform to Products
      [Layered diagram; label grouping reconstructed from the flattened text]
      • Business Solutions: Construction, Media, Government, Life Sciences; Correspondence Management, Contracts Management, Records Management, Invoice Processing
      • Horizontal Packages: Document Management, Digital Asset Management, Case Management Framework, Structured Content Server, Document Aggregator
      • Platform: Nuxeo Enterprise Platform. Complete set of components covering all aspects of ECM
      • Content Infrastructure: Nuxeo Core. Lightweight, scalable, embeddable content repository
  6. Major Customers
  7. ... to Semantic ECM
  8. Picture source: http://www.flickr.com/photos/pixelydixel/
  9. Linked Online Data in 2007
      “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  10. 2008
      “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  11. 2009
      “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  12. 2010
      “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  13. Good for Enterprise apps too!
      Diagram source: http://www.w3.org/2007/Talks/0130-sb-W3CTechSemWeb/
  14. Key Enablers
      • Open Data and Linked Online Data
      • Advances in automatic content analysis (linguistics, image processing) and machine learning
      • Classical logic and classical AI
      • Computing power (Moore’s law + MapReduce)
  15. The technologies and data are available. Let’s put them to use!
  16. Semantic ECM
  17. Semantic ECM
      [Diagram labels] Content: Text, Sound, Image, Video
  18. Semantic ECM
      [Diagram labels] Content: Text, Sound, Image, Video. Meaning: Metadata, Tags, Entities, Relations, Reasoning
  19. Semantic ECM
      [Diagram labels] Content: Text, Sound, Image, Video. Meaning: Metadata, Tags, Entities, Relations, Reasoning
  20. Goals for Semantic ECM
      • Repurpose existing content
      • Improve search and collaboration
      • Make information contextual
      • Extract and use information from your content
      • Make your content smarter!
  21. Challenges
      • Extract meaning from content
      • Enrich content with knowledge
      • Enhance interaction with content thanks to added meaning
  22. Architectural Challenge: Content Stack vs. Knowledge Cake
  23. Business value from semantic ECM
      • Efficiency gains: 20% to 90% (e.g. in search, collaboration)
      • Effectiveness gains: better returns from your assets (e.g. news and images from AFP)
      • Strategic edge: growth, value capture, new services, an unfair strategic advantage (e.g. vertical ontologies for CEVAs / CCAs)
  24. SCRIBO and IKS
  25. SCRIBO
      • Project under the French FUI program, with 9 partners and a budget of 4.7 M€
      • Goal: to develop algorithms and collaborative tools for extracting knowledge from unstructured documents and images
      • Started in 2008, finishing in Dec. 2010, with results already integrated as a Nuxeo plugin
  26. IKS
      • European project under the FP7, with 13 partners (6 SMEs) and an 8.5 M€ budget
      • Goal: create a semantic software “stack” that will be used by CMS vendors to add semantic features to their products
      • Started in Jan. 2009, will last until Dec. 2012
      • First tangible result: FISE, already integrated in a Nuxeo plugin
  27. Linking Semantic Entities: Apache Stanbol - Nuxeo integration
  28. Demo time! Screencast online at http://blogs.nuxeo.com/dev
  29. How does this work?
  30. (no text on this slide)
  31. • Open Source Semantic Engine
      • HTTP Services
      • For content driven applications
      • OSGi: loosely coupled components
      • Analysis Engines
      • Knowledge RDF vocabularies
  32. What is a semantic engine?
      • Unstructured content => Knowledge
      • Language guessing
      • Topic classification (Business, Sports, Media, ...)
      • Named entity extraction and linking
      • Relationship and property extraction
  33. (no text on this slide)
  34. (no text on this slide)
  35. Request:

          curl -X POST \
            -H "Accept: application/json" \
            -H "Content-type: text/plain" \
            --data "John Smith works at Smith Consulting in Paris." \
            http://fise.demo.nuxeo.com/engines

      Response (excerpt):

          {
            "urn:enhancement-1564680b-861c-df6f-fdf9-d34a75d68dfe": {
              "http://fise.iks-project.eu/ontology/selected-text": [
                { "datatype": "http://www.w3.org/2001/XMLSchema#string",
                  "type": "literal",
                  "value": "Paris" }
              ],
              "http://fise.iks-project.eu/ontology/selection-context": [
                { "datatype": "http://www.w3.org/2001/XMLSchema#string",
                  "type": "literal",
                  "value": "John Smith works at Smith Consulting in Paris." }
              ],
              "http://purl.org/dc/terms/type": [
                { "type": "uri",
                  "value": "http://dbpedia.org/ontology/Place" }
              ]
            }, …
  36. (no text on this slide)
  37. (no text on this slide)
  38. = fise + fast Linked Data local index + semantic rule engine + more?
  39. Apache Stanbol / Nuxeo integration
  40. [Architecture diagram. Labels: Apache Stanbol (Engine 1, Engine 2, Engine 3); DBpedia, Freebase, Geonames; Nuxeo DM addon; LDAP; Local IT infrastructure (LAN); markers numbered 1, 2, 3]
  41. Roadmap 2010-2011
  42. Nuxeo DM Improvement
      • Automated document categorization (language, subject, geo coverage based on fixed lists)
      • Semantic entity detection and linking
      • Available as add-ons on the Nuxeo Marketplace in December!
  43. Nuxeo DM: Upcoming Work
      • Stanbol + Scribo integration
      • Multilingual support
      • Extraction of relations between entities
      • Topic classification and linking to external taxonomies
  44. Nuxeo DAM
      • Clustering pictures by similarity
      • Face detection
      • Face recognition using contextual information
      • Speech to text integration for full-text search on audio and video files
  45. Nuxeo CMF / Correspondence
      • Document OCR and structure extraction
      • Scanned document categorization (e.g. invoice vs. contract vs. claim...) and routing
      • Structured field extraction with configurable document masks
  46. Questions?
  47. More info
      • http://www.nuxeo.com/
      • http://blogs.nuxeo.com/dev
      • http://iks-project.eu
      • http://fise.demo.nuxeo.com
      • http://scribo.ws
      • http://incubator.apache.org/stanbol
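
The curl call and JSON excerpt on slide 35 can be sketched in Python. This is a minimal illustration, not the engine's client API: it assumes the response is a mapping of subject URI to predicate URI to a list of value objects, as the excerpt suggests. The helper names (`build_request`, `first_value`, `summarize_enhancements`) are hypothetical, and the sample data is trimmed to two properties from the slide. The request is only built, not sent, since the 2010 demo host may no longer be reachable.

```python
import json
import urllib.request

# Endpoint and predicate URIs are copied from the slide; everything else is illustrative.
ENGINES_URL = "http://fise.demo.nuxeo.com/engines"
SELECTED_TEXT = "http://fise.iks-project.eu/ontology/selected-text"
DC_TYPE = "http://purl.org/dc/terms/type"

# Trimmed copy of the response excerpt shown on the slide.
SAMPLE = """
{
  "urn:enhancement-1564680b-861c-df6f-fdf9-d34a75d68dfe": {
    "http://fise.iks-project.eu/ontology/selected-text": [
      {"datatype": "http://www.w3.org/2001/XMLSchema#string",
       "type": "literal", "value": "Paris"}
    ],
    "http://purl.org/dc/terms/type": [
      {"type": "uri", "value": "http://dbpedia.org/ontology/Place"}
    ]
  }
}
"""


def build_request(text, url=ENGINES_URL):
    """Build (but do not send) the POST request equivalent to the curl call."""
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        headers={"Accept": "application/json", "Content-type": "text/plain"},
        method="POST",
    )


def first_value(properties, predicate):
    """Return the first value recorded for a predicate, or None if absent."""
    values = properties.get(predicate, [])
    return values[0]["value"] if values else None


def summarize_enhancements(graph):
    """List the enhancements that carry a selected text, with their type."""
    rows = []
    for subject, properties in graph.items():
        text = first_value(properties, SELECTED_TEXT)
        if text is None:
            continue  # not a text annotation, skip it
        rows.append({"id": subject, "text": text,
                     "type": first_value(properties, DC_TYPE)})
    return rows


if __name__ == "__main__":
    for row in summarize_enhancements(json.loads(SAMPLE)):
        print(row["text"], "->", row["type"])
```

Against a live engine, the body returned by `urllib.request.urlopen(build_request(...))` would be decoded and passed to `json.loads` in place of `SAMPLE`.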