Presentation of the latest innovations in document management: intelligent auto-tagging, automatic invoice processing, and integration with DBpedia and OpenCalais.
CMIS is used by a smartphone client named ECM Mobile, available on Android and iPhone.

Published in Technology, Education
Transcript

  • 1. Nuxeo World 2010: Yerbabuena applications for Nuxeo DM and some successful cases. Francisco José González Barea, Victor Manuel Sánchez Sánchez. Yerbabuena Software ~ 2010
  • 2. Nuxeo World 2010 - Summary
        • Who is Yerbabuena Software?
        • Yerbabuena Applications for Nuxeo DM
            • Using OCR smartly
            • Intelligent Document Management
                • Auto-tagging
                • Semantic Features
            • Mobile Clients
    Francisco José González, Victor Manuel Sánchez. Yerbabuena Software ~ 2010
  • 8. Who is Yerbabuena Software?
    • Company founded in 2005
    • ~ 20 workers
    • Activity
        • Development of Nuxeo DM applications
        • Support
        • Training
        • R&D projects
            • Nuxeo 2010 Eureka project
  • 14. Using OCR smartly
    • SCR = Smart Character Recognition
    • Not the same as ICR (Intelligent Character Recognition)
    • Architecture
    • Adaptability to customer needs
  • 19. Using OCR smartly -> SCR
    • Successful Cases:
        • Extraction of invoice fields
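The invoice case above can be illustrated with a minimal sketch, assuming the OCR step has already produced plain text. The field names, the regular expressions, and the sample invoice are hypothetical; the slides do not describe how SCR actually locates fields, and real invoices vary widely by supplier and locale.

```python
import re

# Hypothetical field patterns for illustration only; they are not taken
# from the SCR product described in the slides.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:Number|No\.?|#)\s*:?\s*([A-Z0-9-]+)", re.IGNORECASE
    ),
    "date": re.compile(r"Date\s*:?\s*(\d{2}/\d{2}/\d{4})", re.IGNORECASE),
    "total": re.compile(r"Total\s*:?\s*([\d.,]+)", re.IGNORECASE),
}


def extract_invoice_fields(ocr_text):
    """Return the fields found in the OCR output; missing fields are omitted."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        if match:
            fields[name] = match.group(1)
    return fields


# Made-up OCR output of a scanned invoice.
sample = """ACME S.L.
Invoice No: F-2010-0042
Date: 15/11/2010
Total: 1.250,00 EUR
"""

print(extract_invoice_fields(sample))
# {'invoice_number': 'F-2010-0042', 'date': '15/11/2010', 'total': '1.250,00'}
```

The extracted values could then be stored as document metadata; how they map onto a Nuxeo schema is outside what the slides state.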
  • 20. Intelligent Document Management
    • Intelligent ~ Automatic
        • Increase machine work
        • Decrease human work
    • Two different ways
        • Classify and search documents
        • Identify and work with documents
  • 23. Intelligent DM. Auto-Tagging
    • Extract the document's full text
        • Run OCR first if the document is an image
    • OpenCalais [1] analysis
        • External web service
        • Extracts tags from plain text depending on the meaning of the content
        • Returns an RDF [2] file as the result
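As a rough sketch of the last step above, the following reads tags out of an RDF/XML result using only the Python standard library. The `ex:` vocabulary and the sample document are hypothetical stand-ins; the actual OpenCalais output uses its own schema, which the slides do not show.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
EX_NS = "http://example.org/tags#"  # hypothetical vocabulary, not OpenCalais's

# Made-up RDF/XML standing in for the web service response.
SAMPLE_RDF = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:ex="http://example.org/tags#">
  <rdf:Description rdf:about="http://example.org/entity/1">
    <ex:type>Company</ex:type>
    <ex:name>Nuxeo</ex:name>
  </rdf:Description>
  <rdf:Description rdf:about="http://example.org/entity/2">
    <ex:type>City</ex:type>
    <ex:name>Paris</ex:name>
  </rdf:Description>
</rdf:RDF>
""".strip()


def tags_from_rdf(rdf_xml):
    """Return (type, name) pairs for every described entity that has a name."""
    root = ET.fromstring(rdf_xml)
    tags = []
    for description in root.findall(f"{{{RDF_NS}}}Description"):
        name = description.findtext(f"{{{EX_NS}}}name")
        kind = description.findtext(f"{{{EX_NS}}}type")
        if name:
            tags.append((kind, name))
    return tags


print(tags_from_rdf(SAMPLE_RDF))
# [('Company', 'Nuxeo'), ('City', 'Paris')]
```

A dedicated RDF library would be the more robust choice for real responses; plain XML parsing is used here only to keep the sketch self-contained.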
  • 26. Intelligent DM. Auto-Tagging
    • Extract relevant words
        • Depending on document type
        • Depending on style features
    • DBpedia [3] analysis
        • Semantic Wikipedia (RDF)
        • Semantic query to extract the fields related to each relevant word
            • SPARQL [4]
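A semantic query of the kind mentioned above might look like the following sketch. It only builds the SPARQL text and the request URL for the public DBpedia endpoint and sends nothing over the network. Which properties the original application actually asked for is not stated in the slides, so the choice of `rdfs:label` and `dbo:abstract`, the function names, and the `format` parameter (a convention of the endpoint software) are assumptions.

```python
from urllib.parse import urlencode

DBPEDIA_ENDPOINT = "http://dbpedia.org/sparql"  # public DBpedia SPARQL endpoint


def build_query(word, lang="en", limit=10):
    """Build a SPARQL query for resources whose label equals `word`.

    The selected properties are illustrative; the slides do not say which
    fields were retrieved for each relevant word.
    """
    # Escape backslashes and double quotes inside the SPARQL string literal.
    escaped = word.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?resource ?abstract WHERE {\n"
        f'  ?resource rdfs:label "{escaped}"@{lang} .\n'
        "  ?resource dbo:abstract ?abstract .\n"
        f'  FILTER (lang(?abstract) = "{lang}")\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def build_request_url(word):
    """Return the GET URL that would submit the query to the endpoint."""
    params = {
        "query": build_query(word),
        "format": "application/sparql-results+json",  # assumed endpoint option
    }
    return DBPEDIA_ENDPOINT + "?" + urlencode(params)


print(build_query("Nuxeo"))
```

Fetching `build_request_url("Nuxeo")` with any HTTP client would then return the bindings for `?resource` and `?abstract`, if the endpoint finds a match.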
  • 29. Intelligent DM. Semantic Features
    • New Nuxeo service
    • Based on Semantic Web technologies [5]
    • Needs:
        • A language to describe the DM world to a machine -> OWL [6]
        • External tools -> Jena [7], Pellet [8], etc.
        • R&D: database storage instead of RAM -> Persistent Reasoner
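The idea of keeping facts and inferences in a database rather than in RAM can be sketched as below. This is not Jena, Pellet, or the Persistent Reasoner from the slides: it is a toy triple store on SQLite with two RDFS-style rules, and the table layout, function names, and sample classes are all hypothetical.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open a triple store; pass a file path to keep facts on disk across runs."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS triples ("
        "s TEXT, p TEXT, o TEXT, UNIQUE (s, p, o))"
    )
    return conn


def add(conn, s, p, o):
    """Store one (subject, predicate, object) fact, ignoring duplicates."""
    conn.execute("INSERT OR IGNORE INTO triples VALUES (?, ?, ?)", (s, p, o))


def infer(conn):
    """Apply two RDFS-style rules repeatedly until no new triple is added."""
    rules = [
        # subClassOf is transitive: A sub B and B sub C give A sub C.
        "INSERT OR IGNORE INTO triples "
        "SELECT a.s, 'subClassOf', b.o FROM triples a JOIN triples b "
        "ON a.o = b.s WHERE a.p = 'subClassOf' AND b.p = 'subClassOf'",
        # An instance of a class is also an instance of its superclasses.
        "INSERT OR IGNORE INTO triples "
        "SELECT a.s, 'type', b.o FROM triples a JOIN triples b "
        "ON a.o = b.s WHERE a.p = 'type' AND b.p = 'subClassOf'",
    ]
    while True:
        before = conn.total_changes
        for rule in rules:
            conn.execute(rule)
        if conn.total_changes == before:  # fixpoint reached
            break
    conn.commit()


conn = open_store()
add(conn, "Invoice", "subClassOf", "FinancialDocument")
add(conn, "FinancialDocument", "subClassOf", "Document")
add(conn, "doc-42", "type", "Invoice")
infer(conn)

types = sorted(
    o for (o,) in conn.execute(
        "SELECT o FROM triples WHERE s = 'doc-42' AND p = 'type'"
    )
)
print(types)
# ['Document', 'FinancialDocument', 'Invoice']
```

Because inferred triples are written back to the table, they survive a restart when a file path is used, which is the property the "database storage instead of RAM" bullet points at. A real OWL reasoner supports far more than these two rules.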
  • 34. Intelligent DM. Semantic Features
    • Architecture:
  • 35. Intelligent DM. Semantic Features
    • So, what have we achieved?
      • Now, Nuxeo:
        • is able to tag documents automatically
        • is able to identify document types automatically
        • is able to classify documents automatically
        • is continuously learning
        • is able to start operations on documents automatically (i.e. workflows)
  • 40. Intelligent Document Management
    • Successful Cases:
          • Document auto-tagging
          • Detecting document type
          • Learning to identify document types
          • Automatic operations on documents (workflows)
  • 44. Mobile Clients
      • REST (Windows Mobile)
      • CMIS specification (Android & iPhone)
      • Features
        • Multiple servers
        • Viewing documents
        • Sharing documents in various ways (e-mail, QR code, etc.)
        • Uploading documents from the phone camera
        • Creating notes and folders
        • Favourite documents
        • Document search (full-text and title search)
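For the document search feature, a CMIS client would typically send a query in the CMIS query language. The sketch below builds such query strings; `CONTAINS`, `LIKE`, `cmis:document`, `cmis:name`, and `cmis:objectId` come from the CMIS specification, while the function name and the mapping of "title search" onto `cmis:name` are assumptions about how a client such as ECM Mobile might do it.

```python
def cmis_search_query(text, by="fulltext"):
    """Build a CMIS query string for a full-text or a title search.

    Assumption: the title search is mapped onto cmis:name; a repository
    may expose the title through a different property.
    """
    # Escape backslashes and single quotes inside the string literal.
    # LIKE wildcards (% and _) in `text` are left untouched in this sketch.
    escaped = text.replace("\\", "\\\\").replace("'", "\\'")
    if by == "fulltext":
        condition = f"CONTAINS('{escaped}')"
    elif by == "title":
        condition = f"cmis:name LIKE '%{escaped}%'"
    else:
        raise ValueError("by must be 'fulltext' or 'title'")
    return "SELECT cmis:objectId, cmis:name FROM cmis:document WHERE " + condition


print(cmis_search_query("invoice"))
# SELECT cmis:objectId, cmis:name FROM cmis:document WHERE CONTAINS('invoice')
print(cmis_search_query("report", by="title"))
# SELECT cmis:objectId, cmis:name FROM cmis:document WHERE cmis:name LIKE '%report%'
```

The resulting string would be submitted through whichever CMIS binding the client uses; the slides do not say which one the mobile clients rely on.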
  • 53. Mobile Client: Windows Mobile. Video Demonstration
  • 54. Mobile Client: iPhone. Video Demonstration
  • 55. Mobile Client: Android. Video Demonstration
  • 56. References
    • [1] OpenCalais home page: http://www.opencalais.com/
    • [2] RDF Concepts: http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/
    • [3] DBpedia home page: http://dbpedia.org/About
    • [4] SPARQL Query Language for RDF: http://www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/
    • [5] W3C: http://www.w3.org/
    • [6] OWL Quick Reference Guide: http://www.w3.org/TR/2009/REC-owl2-quick-reference-20091027/
    • [7] Jena on SourceForge: http://jena.sourceforge.net/
    • [8] Pellet OWL reasoner: http://clarkparsia.com/pellet
    • [9] CMIS on Wikipedia: http://en.wikipedia.org/wiki/Content_Management_Interoperability_Services
  • 57. Yerbabuena Software on WWW
    • Main web: http://www.yerbabuena.es
    • Spanish blog: http://blog.yerbabuena.es/
    • English blog: http://blog.yerbabuenasoftware.com/
    • Research blog: http://yerbabuenaresearch.blogspot.com/
    • YouTube channel: http://www.youtube.com/user/YerbabuenaSoftware
  • 58. Nuxeo World 2010 - Questions. Thank you