Your SlideShare is downloading. ×
0
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Document Recognition Technologies
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Document Recognition Technologies

4,173

Published on

From the leading expert in document automation technologies Chris Riley learn the basics of what document recognition is and how it works, as well as some best practices.

From the leading expert in document automation technologies Chris Riley learn the basics of what document recognition is and how it works, as well as some best practices.

Published in: Technology, Business
0 Comments
10 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
4,173
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
0
Likes
10
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Document Recognition a technology overview Presented by: Chris Riley, ecm P , ioa P President AIIM Golden Gate
  • 2. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 3. Why Chris?
    • What qualifies Chris to talk to me?
      • When a developer turns to sales
      • Leading expert in document automation technologies
  • 4. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 5. Who knows what OCR is?
  • 6. The Technologies
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones created for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
  • 7. The Technologies: OCR
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    Ship To:
  • 8. The Technologies: ICR
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    Ilya
  • 9. The Technologies: OMR
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    Card Account
  • 10. The Technologies: Barcode
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    1889094476620
  • 11. The Technologies: Handwriting
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    * Critical *
  • 12. The Technologies: Acronym Heaven
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
  • 13. The Technologies: CAR/LAR
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    2 hundred dollars & no cents
  • 14. The Technologies: Assisted Capture
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
  • 15. The Technologies: Fixed Form Processing
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    Name: Ilya Date: 12/21/2982
  • 16. The Technologies: Fixed Form Processing Name: Ilya Date: 12/21/2982
  • 17. 80% of business end-user documents are semi-structured
  • 18. The Technologies: Semi-Structured Forms
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    Invoice No: 99044 Date: 06/09/04 Invoice No: 24567 Date: 06/09/04
  • 19. Invoice No: 99044 Date: 06/09/04 Invoice No: 24567 Date: 06/09/04 (06/09/2004) The Technologies: Semi-Structured Forms
  • 20. The Technologies: Semi-Structured Forms
    • OCR – Optical Character Recognition
    • ICR – Intelligent Character Recognition
    • OMR – Optical Mark Recognition
    • Barcode
    • Handwriting
    • All the other ones made up for marketing purposes
    • CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
    • Assisted Capture
    • Fixed Form Process
    • Semi-Structured Forms Processing
    • Unstructured Document Processing
    Consignee Consignor Date Term
  • 21. The Technologies: Common Processes
    • Full page conversion
    • Classification
    • Index level extraction
    • Redaction
    • Routing
    • Auto Filing
    • Re-Purposing
    • Image Rotation
  • 22. The Technologies: Full page conversion
    • Image file to electronic data file
    • ALL text on the page
    • Includes:
      • Image Pre-processing
      • Document Analysis/Zoning
      • Extraction
      • Export ( Commonly PDF, DOC )
  • 23. The Technologies: Classification
    • Software tells you the document type
    • Scan batches of mixed documents
    Bill of Lading Invoice Check PO
  • 24. The Technologies: Index Level Extraction
    • Just certain required fields extracted
    • Normalization of data
    • Export usually to a database
    Invoice Number Invoice Date Total Amt Due Term
  • 25. The Technologies: How Accurate
    • Better question is how do you determine accuracy
    • Document Type Accuracy
    • Field/Zone Location Accuracy
    • Data Type Accuracy
    • Character Accuracy
  • 26. The Technologies: Common usage scenarios
    • Document Conversion
    • Document Archival / Retrieval
    • Invoice Processing
    • Insurance Processing( medical, mortgage )
    • Waybill processing
    • Survey processing
  • 27. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 28. There Really are only 4 core technology providers It takes 50 man-years to develop OCR using current computing abilities
  • 29. Who Makes Them: Core Engines
    • ABBYY
    • Nuance ( formally ScanSoft )
    • ReadI.R.I.S
    • Oc é
    • CharacTell
    • ParaScript
    • A2iA
    • Handful of Open Source
    • Handful of Other Vendors
    • Two handfuls of OLD engines
  • 30. Who Makes Them: Who Licenses Them
    • EVERYONE ELSE!
    • AnaComp
    • Anydoc
    • BancTec
    • BrainWare
    • Captaris
    • Captivation
    • Cardiff
    • CVision
    • DataCap
    • DigiTech
    • eCopy
    • EMC Documentum
    • Kofax
    • LaserFiche
    • LeadTools
    • Microsoft
    • NSi AutoStore
    • OnBase
    • Perceptive Imaging
    • ReadSoft
    • SER
    • Top Image Systems
    • Tower
    • Westbrook
    • Xerox
    • Hundreds More
  • 31. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 32. 30% of organizations that purchase, purchase the wrong thing Over 50 % of organizations that purchase never use it properly
  • 33. Buyer Beware
    • If OCR is the reason for buying a solution know what Engine it is!
    • Talk about the WHOLE solution not the pieces
    • Get past marketing gimmicks
    • Trust, Love, Be Certain of your reseller / vendor
  • 34. Buyer Beware: Know your engine
    • What version?
    • Will they upgrade?
  • 35. Buyer Beware: Talk about Whole Solution
    • Scanner / Input
    • Capture
    • Storage
    • Have Requirements List Before
  • 36. Buyer Beware: Get past Gimmicks
    • NOTHING! Is 100%
    • All canned demos work perfect
    • Always see test on your documents
    • Version numbers are really arbitrary
  • 37. Buyer Beware: Trust your vendor / reseller
    • Support after sale ( test them )
    • Where to get professional services
    • Do they understand the solution and not just the pieces?
  • 38. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 39. The Future
    • Full-page OCR will be a commodity
    • Advance Document Processing will become main-stream but less required
    • Think about what to do now that you will be gathering data rapidly
    • There will be a new approach to OCR
  • 40. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 41. Questions and Answers
    • Before you ask
  • 42. What we will cover:
    • Why Chris?
    • What Are the Document Recognition Technologies
    • Who Makes Them
    • Buyer Beware
    • The future
    • Q & A
    • Free Stuff!
  • 43. Free Stuff
    • Copy of ABBYY FineReader Pro 9.0
    • Copy of Nuance OmniPage 16
    • Copy of ReadI.R.I.S Pro 11
    • 4 Hour Consulting Session with ME!

×