AN UPDATE




   Prepared by Nadia Millington & Luis Rosenthal
Quality of phone



                   •   Ideally, a Nokia 6300 (or above) will
                       allow appropriate visualisation of the
                       image, given its resolution and screen size.

                   •   If microworkers do not have an
                       appropriate phone, they can obtain
                       one via a microfinance loan, or
                       we can develop a scheme whereby
                       refurbished high-end phones from the
                       first world (which have been fully
                       depreciated) are sent to the BOP
                       at a fraction of the cost (some as low
                       as 20 USD), allowing for good
                       visualisation and screen quality.
Data transmission costs

                          The money that the
                          microworkers earn is expected to
                          be significantly higher than the
                          data costs, based on our quick,
                          rough review of phone costs
                          in 3 developing countries.
                          Assuming each job pays 20 US
                          cents, we see data charges, their
                          only cost, as a small percentage
                          of their earnings (2-15%).
                          We expect even these
                          percentages to fall after
                          a thorough review of all the
                          available packages.
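The figures above can be turned into a per-job estimate. The sketch below is only a worked illustration: the 20 US cents per job and the 2-15% range come from the slide, while the function and variable names are assumptions introduced here.

```python
def data_cost_per_job(pay_cents: float, cost_share: float) -> float:
    """Data charge per job implied by a given share of earnings."""
    return pay_cents * cost_share


PAY_CENTS = 20.0                 # pay per job, as assumed on the slide
LOW_SHARE, HIGH_SHARE = 0.02, 0.15  # data cost as a share of earnings (2-15%)

low_cost = data_cost_per_job(PAY_CENTS, LOW_SHARE)
high_cost = data_cost_per_job(PAY_CENTS, HIGH_SHARE)

# What the microworker keeps per job after paying for data.
net_worst = PAY_CENTS - high_cost
net_best = PAY_CENTS - low_cost

print(f"Data cost per job: {low_cost:.1f} to {high_cost:.1f} US cents")
print(f"Net earnings per job: {net_worst:.1f} to {net_best:.1f} US cents")
```

Under these assumptions, data charges come to roughly 0.4 to 3 US cents per job, leaving the microworker with about 17 to 19.6 US cents.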
Can the services be automated by a computer?

High-accuracy OCR software can read more than 400 characters/second.

However:
OCR software is not efficient at recognizing handwriting or
distinguishing between fonts that closely resemble
handwriting. In such cases manual entry performs better
than the OCR process.
Data entry provides complete flexibility, allowing micro-operators
to prepare digital documents from multiple
formats. Even audio recordings of spending can be included,
as can notes on partial payments scribbled on the receipts.

OCR may be efficient during the initial level of a data entry
service, but it cannot be a substitute for a data entry service
because recognition of typewritten text is still not 100%
accurate, even where clear imaging is available. OCR
software accuracy ranges from 71% to 95%, but total accuracy can
be achieved only by human review. Errors occur because
of:
• Distinguishing noise from text: dots and accents may be
mistaken for noise, and vice versa.
• Mistaking graphics or geometry for text: this leads to
non-text being sent to recognition.
• Mistaking text for graphics or geometry: in this case the
text will not be passed to the recognition stage. This often
happens if characters are connected to graphics.

The accuracy of OCR systems is, in practice, directly
dependent upon the quality of the input documents.
Unlike human readers, OCR is not very tolerant of bad picture quality.
As such, OCR use with receipts is expected to have higher error rates.
The main difficulties with receipts, invoices, etc. that
are a challenge to OCR are:

• Variations in shape, due to serifs and style
variations.
• Deformations, caused by broken characters,
smudged characters and speckle.
• Variations in spacing, due to subscripts,
superscripts, skew and variable spacing.
• Mixture of text and graphics.

[Figure: ni = m. Common OCR issues include mistaking an “ni” for an “m”.]
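To see why the 71% to 95% range quoted above still calls for human review, it helps to look at whole fields rather than single characters. The sketch below assumes that the quoted figures are per-character accuracies and that character errors are independent; neither is stated in the slides, so the numbers are indicative only. The function name is hypothetical.

```python
def field_accuracy(char_accuracy: float, n_chars: int) -> float:
    """Chance that every character in a field is read correctly,
    assuming independent per-character errors (a simplification)."""
    return char_accuracy ** n_chars


# Compare the two ends of the quoted accuracy range on short and long fields.
for acc in (0.71, 0.95):
    for n in (10, 30):
        p = field_accuracy(acc, n)
        print(f"char accuracy {acc:.0%}, {n}-character field: {p:.1%} fully correct")
```

Even at the top of the range, a 10-character field is fully correct only about 60% of the time under these assumptions, which is consistent with the claim that total accuracy requires a human check.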
When OCR doesn’t work

These imperfections may cause problems in different parts of the recognition process of an
OCR system, resulting in misclassifications.
Finally

          Most OCR involves some human interaction. Modern optical character
          recognition software relies on human interaction to correct
          misrecognized characters. Even though the software often reliably
          identifies low-confidence output, the simple language and
          vocabulary models employed are insufficient to automatically
          correct mistakes. A developer of the software lemon.com confirms
          this, stating: “Whenever the machine learning system or the OCR
          system have a low confidence result, it can ask for human
          assistance, usually with a multiple choice answer or a request to
          edit an entry”.

          In models where OCR does not use human intervention, the
          consumer is expected to correct their own errors. This is not a
          value proposition AskMom would ever employ, as we are selling
          convenience.

          It is possible to enhance the AskMom business model with OCR
          technology on the front end, utilising microworkers for quality
          assurance and low-confidence results. The use of microworkers
          would still mean that we operate at costs below other
          players. However, the human element is key, as it differentiates
          us. It gives AskMom greater flexibility to record complex,
          poorly printed receipts accurately from all parts
          of the world (offering a global solution), as opposed to other
          options like lemon, which only works within the US.
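The hybrid model described above (OCR on the front end, microworkers for low-confidence results) can be sketched as a simple routing step. This is a minimal illustration, not AskMom's or lemon's actual implementation: the `OcrField` type, the `route_fields` function, the 0.90 threshold and the sample receipt values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class OcrField:
    name: str
    text: str
    confidence: float  # 0.0 to 1.0, as a hypothetical OCR engine might report


def route_fields(fields, threshold=0.90):
    """Split OCR output into fields accepted as-is and fields
    sent to a microworker for review (threshold is an assumption)."""
    accepted = [f for f in fields if f.confidence >= threshold]
    for_review = [f for f in fields if f.confidence < threshold]
    return accepted, for_review


# Made-up receipt fields for illustration only.
receipt = [
    OcrField("merchant", "Cornmi Store", 0.62),
    OcrField("total", "14.50", 0.97),
    OcrField("date", "2012-04-02", 0.93),
]

accepted, for_review = route_fields(receipt)
print("Accepted:", [f.name for f in accepted])
print("Sent to microworker:", [f.name for f in for_review])
```

Only the uncertain fields reach a person, so the cost of human review scales with how often the OCR is unsure rather than with the total number of receipts.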
