Nuxeo world-2010

Presentation of the latest innovations in document management: intelligent auto-tagging, automatic invoice processing, and integration with DBpedia and OpenCalais.
Use of CMIS in a smartphone client named ECM Mobile, available on Android and iPhone.

Usage Rights

© All Rights Reserved

Nuxeo world-2010 Presentation Transcript

  • Yerbabuena Software ~ 2010. Nuxeo World 2010: Yerbabuena applications for Nuxeo DM and some successful cases. Francisco José González Barea, Victor Manuel Sánchez Sánchez
  • Francisco José González – Victor Manuel Sánchez, Yerbabuena Software ~ 2010. Nuxeo World 2010 - Summary
        • Who is Yerbabuena Software?
        • Yerbabuena Applications for Nuxeo DM
            • Using OCR smartly
            • Intelligent Document Management.
                • Auto-tagging
                • Semantic Features
            • Mobile Clients
                • Windows Mobile
                • iPhone
                • Android
  • Who is Yerbabuena Software?
    • Company founded in 2005
    • ~ 20 employees
    • Activity
        • Development of Nuxeo DM applications
        • Support
        • Training
        • R&D projects
            • Nuxeo 2010 Eureka project
  • Using OCR smartly
    • SCR = Smart Character Recognition
    • Not the same as ICR (Intelligent Character Recognition)
    • Architecture
            • Image Treatment
            • OCR
            • Text Treatment
    • Adaptability to customer needs
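The three architecture stages above can be sketched as a small pipeline. This is a minimal illustration, not Yerbabuena's implementation: the function names, the binarization threshold, the digit look-alike table, and the `fake_ocr` stand-in for a real OCR engine are all assumptions made for the example.

```python
import re
from typing import Callable, List

Image = List[List[int]]  # grayscale pixels, 0 (black) to 255 (white)


def treat_image(image: Image, threshold: int = 128) -> Image:
    """Image treatment: binarize so the OCR engine sees pure black/white."""
    return [[0 if px < threshold else 255 for px in row] for row in image]


def treat_text(raw: str) -> str:
    """Text treatment: collapse spaces and repair digit look-alikes."""
    text = re.sub(r"[ \t]+", " ", raw).strip()
    lookalikes = str.maketrans({"O": "0", "o": "0", "l": "1", "I": "1"})

    def fix(match):
        token = match.group(0)
        # Only touch tokens that already contain a digit, so words stay intact.
        if any(c.isdigit() for c in token):
            return token.translate(lookalikes)
        return token

    return re.sub(r"\S+", fix, text)


def scr(image: Image, ocr_engine: Callable[[Image], str]) -> str:
    """Run the stages in order: image treatment, OCR, text treatment."""
    return treat_text(ocr_engine(treat_image(image)))


def fake_ocr(image: Image) -> str:
    """Hypothetical OCR stand-in that returns typical noisy output."""
    return "Total:   1O5.l0  EUR"


print(scr([[10, 200], [130, 90]], fake_ocr))  # prints "Total: 105.10 EUR"
```

Passing the OCR engine in as a parameter keeps the image and text stages independent of any particular OCR product, which is one way to read "adaptability to customer needs".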
  • Using OCR smartly -> SCR
    • Successful cases:
        • Extraction of invoice fields
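As a rough illustration of invoice field extraction, the sketch below pulls a few fields out of OCR text with regular expressions. The field names, label wording, and patterns are assumptions chosen for the example; the slides do not describe how the real extraction works.

```python
import re
from typing import Dict, Optional

# Hypothetical patterns for an English-language invoice layout.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:Number|No\.?|#)\s*:?\s*([A-Z0-9][A-Z0-9/-]*)", re.I
    ),
    "date": re.compile(r"Date\s*:?\s*(\d{1,2}[/.-]\d{1,2}[/.-]\d{2,4})", re.I),
    "total": re.compile(r"Total\s*:?\s*(\d+(?:[.,]\d{2})?)", re.I),
}


def extract_invoice_fields(text: str) -> Dict[str, Optional[str]]:
    """Return the first match for each field, or None when it is absent."""
    fields: Dict[str, Optional[str]] = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1) if match else None
    return fields


sample = "ACME S.L.\nInvoice No: 2010/0042\nDate: 15/11/2010\nTotal: 105.10 EUR"
print(extract_invoice_fields(sample))
```

Fields that are not found come back as `None`, so a caller can route incomplete invoices to manual review instead of storing guessed values.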
  • Intelligent Document Management
    • Intelligent ~ Automatic
        • Increase machine work
        • Decrease human work
    • Two different ways
        • Classify and search documents
        • Identify and work with documents
  • Intelligent DM. Auto-Tagging
    • Extract document full text
        • Run OCR first if the document is an image
    • OpenCalais [1] analysis
        • External web service
        • Extracts tags from plain text based on the meaning of the content
        • Returns an RDF [2] file as the result
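To show what reading tags out of an RDF result can look like, here is a minimal sketch using only the Python standard library. The `ex:` vocabulary (`http://example.org/tags#`) and the sample document are invented for illustration; they are not the actual OpenCalais response schema, which is not shown in the slides.

```python
import xml.etree.ElementTree as ET
from typing import List, Optional, Tuple

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "ex": "http://example.org/tags#",  # hypothetical vocabulary
}

SAMPLE_RDF = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:ex="http://example.org/tags#">
  <rdf:Description rdf:about="http://example.org/doc/1#tag1">
    <ex:name>Nuxeo</ex:name>
    <ex:type>Company</ex:type>
  </rdf:Description>
  <rdf:Description rdf:about="http://example.org/doc/1#tag2">
    <ex:name>Paris</ex:name>
    <ex:type>City</ex:type>
  </rdf:Description>
</rdf:RDF>
""".strip()


def extract_tags(rdf_xml: str) -> List[Tuple[str, Optional[str]]]:
    """Collect (name, type) pairs from every rdf:Description that has a name."""
    root = ET.fromstring(rdf_xml)
    tags = []
    for description in root.findall("rdf:Description", NS):
        name = description.findtext("ex:name", namespaces=NS)
        kind = description.findtext("ex:type", namespaces=NS)
        if name:
            tags.append((name, kind))
    return tags


print(extract_tags(SAMPLE_RDF))
```

A dedicated RDF library would handle the other RDF serializations as well; plain XML parsing is used here only to keep the example self-contained.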
  • Intelligent DM. Auto-Tagging
    • Extract relevant words
        • Depending on document type
        • Depending on style features
    • DBpedia [3] analysis
        • Semantic Wikipedia (RDF)
        • Semantic query to extract the fields related to each relevant word
            • SPARQL [4]
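A semantic query for one relevant word might be built as follows. This is a sketch under assumptions: the choice of `rdfs:label` and `dbo:abstract` as the properties to fetch, the function names, and the `format` request parameter are illustrative, and the slides do not say which fields the real service retrieves. The code only builds the request; it does not contact the endpoint.

```python
from urllib.parse import urlencode

DBPEDIA_ENDPOINT = "https://dbpedia.org/sparql"


def build_query(word: str, lang: str = "en", limit: int = 5) -> str:
    """Build a SPARQL query that looks up resources labelled exactly `word`."""
    # Escape backslashes and double quotes so the word stays a valid literal.
    escaped = word.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?resource ?abstract WHERE {\n"
        f'  ?resource rdfs:label "{escaped}"@{lang} ;\n'
        "            dbo:abstract ?abstract .\n"
        f'  FILTER (lang(?abstract) = "{lang}")\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def build_request_url(word: str) -> str:
    """Encode the query as a GET request URL for the SPARQL endpoint."""
    params = {
        "query": build_query(word),
        "format": "application/sparql-results+json",  # assumed endpoint option
    }
    return f"{DBPEDIA_ENDPOINT}?{urlencode(params)}"


print(build_query("Nuxeo"))
```

Exact label matching is the simplest possible lookup; a production service would likely need disambiguation when a word maps to several resources.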
  • Intelligent DM. Semantic Features
    • New Nuxeo service
    • Based on Semantic Web Technologies [5]
    • Needs:
        • Language to describe DM world to a machine -> OWL [6]
        • External tools -> Jena [7], Pellet [8], etc.
        • R&D: Database storage instead of RAM -> Persistent Reasoner
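The idea of keeping inferred facts in a database rather than in RAM can be illustrated with a toy example. This is not Jena, Pellet, or the Persistent Reasoner from the slides: it is a hypothetical sketch that stores triples in SQLite and applies a single rule (if X has type C and C is a subclass of D, then X has type D) until nothing new is derived. The table layout and all names are assumptions.

```python
import sqlite3
from typing import List


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open a triple store; pass a file path to keep facts across runs."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS triples "
        "(s TEXT, p TEXT, o TEXT, UNIQUE (s, p, o))"
    )
    return conn


def add(conn: sqlite3.Connection, s: str, p: str, o: str) -> None:
    conn.execute("INSERT OR IGNORE INTO triples VALUES (?, ?, ?)", (s, p, o))


def infer_types(conn: sqlite3.Connection) -> int:
    """Materialize inherited types in the database; return how many were added."""
    start = conn.total_changes
    while True:
        before = conn.total_changes
        conn.execute(
            "INSERT OR IGNORE INTO triples "
            "SELECT t.s, 'type', sc.o FROM triples t "
            "JOIN triples sc ON sc.p = 'subClassOf' AND sc.s = t.o "
            "WHERE t.p = 'type'"
        )
        if conn.total_changes == before:  # fixpoint: nothing new was derived
            break
    conn.commit()
    return conn.total_changes - start


def types_of(conn: sqlite3.Connection, subject: str) -> List[str]:
    rows = conn.execute(
        "SELECT o FROM triples WHERE s = ? AND p = 'type' ORDER BY o", (subject,)
    )
    return [row[0] for row in rows]


store = open_store()
add(store, "Invoice", "subClassOf", "FinancialDocument")
add(store, "FinancialDocument", "subClassOf", "Document")
add(store, "doc42", "type", "Invoice")
infer_types(store)
print(types_of(store, "doc42"))
```

Because the derived triples are written to the table, reopening the same file later gives the inferred types without re-running the reasoning step, which is the point of the database-backed approach.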
  • Intelligent DM. Semantic Features
    • Architecture: (diagram not reproduced in this transcript)
  • Intelligent DM. Semantic Features
    • So, what have we achieved?
      • Now, Nuxeo:
          • is able to tag documents automatically
          • is able to identify document types automatically
          • is able to classify documents automatically
          • is continuously learning
          • is able to start operations on documents automatically (e.g. workflows)
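The last point, starting an operation once a document type is known, can be pictured with a small dispatch table. This sketch swaps in plain keyword counting for type detection, which is much simpler than the semantic approach described in the slides. The keyword lists, type names, and workflow names are all hypothetical.

```python
from typing import Dict, List, Optional

# Hypothetical keyword lists per document type.
TYPE_KEYWORDS: Dict[str, List[str]] = {
    "invoice": ["invoice", "total", "vat"],
    "contract": ["contract", "party", "clause"],
}

# Hypothetical mapping from document type to the workflow to start.
WORKFLOWS: Dict[str, str] = {
    "invoice": "invoice-approval",
    "contract": "legal-review",
}


def detect_type(text: str) -> Optional[str]:
    """Pick the type whose keywords occur most often, or None if none occur."""
    words = text.lower().split()
    scores = {
        doc_type: sum(words.count(keyword) for keyword in keywords)
        for doc_type, keywords in TYPE_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def workflow_for(text: str) -> Optional[str]:
    """Return the name of the workflow to start, or None for unknown types."""
    doc_type = detect_type(text)
    return WORKFLOWS.get(doc_type) if doc_type else None


print(workflow_for("Invoice 2010/0042 total 105.10 VAT 18%"))  # prints "invoice-approval"
```

Keeping detection and dispatch separate means the detection step could be replaced by a reasoner-based classifier without touching the workflow mapping.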
  • Intelligent Document Management
    • Successful cases:
          • Document auto-tagging
          • Detecting document type
          • Learning to identify document types
          • Automatic operations on documents (workflows)
  • Mobile Clients
      • REST ( Windows Mobile )
      • CMIS specification ( Android & iPhone )
      • Features
        • Multiple servers
        • Viewing documents
        • Share documents in various ways (e-mail, QR code, etc.)
        • Upload documents from the phone camera
        • Create notes and folders
        • Favourite documents
        • Document search (full-text and title search)
  • Mobile Client – Windows Mobile: Video Demonstration
  • Mobile Client – iPhone: Video Demonstration
  • Mobile Client – Android: Video Demonstration
  • References
    • [1] OpenCalais home page: http://www.opencalais.com/
    • [2] RDF Concepts: http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/
    • [3] DBpedia home page: http://dbpedia.org/About
    • [4] SPARQL Query Language for RDF: http://www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/
    • [5] W3C: http://www.w3.org/
    • [6] OWL Quick Reference Guide: http://www.w3.org/TR/2009/REC-owl2-quick-reference-20091027/
    • [7] Jena on SourceForge: http://jena.sourceforge.net/
    • [8] Pellet OWL reasoner: http://clarkparsia.com/pellet
    • [9] CMIS on Wikipedia: http://en.wikipedia.org/wiki/Content_Management_Interoperability_Services
  • Yerbabuena Software on WWW
    • Main web: http://www.yerbabuena.es
    • Spanish blog: http://blog.yerbabuena.es/
    • English blog: http://blog.yerbabuenasoftware.com/
    • Research blog: http://yerbabuenaresearch.blogspot.com/
    • YouTube channel: http://www.youtube.com/user/YerbabuenaSoftware
  • Nuxeo World 2010 - Questions. Thank you.