SlideShare a Scribd company logo
1 of 11
Presented by
Deijee Kalita
MLISc. 4th sem
Roll No. 03
Department of Library & Information
Science Gauhati University
 Nowadays, a lot of paper documents are transformed to
electronic form, which makes information processing
easier, like searching, analysis and conversion.
 Many companies and other institutions decide to digitalize
their documents. Working with files is cheaper than
processing traditional documents, because there is no
space required for document storage. There are three
main steps of document digitalization: scanning,
indexation (data entry) and presentation of digitalized
documents.
 Researchers proved that the recognition of both barcodes
and printed text through Optical Character Recognition
or OCR is reliable and significantly accelerates data
processing. On the contrary, the handwritten text
appeared difficult to recognize by OCR systems.
 Optical Character Recognition or OCR is a system
that provides a full alphanumeric recognition of
printed or handwritten characters at electronic
speed by simply scanning the form.
 It is the mechanical or electronic conversion of
scanned or photographed images of typewritten or
printed text into machine-encoded/computer-
readable text.
 OCR is a field of research in pattern recognition,
artificial intelligence and computer vision. It is the
electronic translation of handwritten, typewritten or
printed text into machine translated images.
History of OCR:
 1928/9: Gustav Tauschek of Vienna, Austria patents a basic OCR "reading
machine.“
 1949: L.E. Flory and W.S. Pike of RCA Laboratories develop a photocell-based
machine that can read text to blind people at a rate of 60 words per minute.
 1950: David H. Shepard develops machines that can turn printed information
into machine-readable form for the US military and later founds a pioneering
OCR company called Intelligent Machines Research (IMR).
 1960: Lawrence (Larry) Roberts, a computer graphics researcher working at
MIT, develops early text recognition using specially simplified fonts such as
OCR-A.
 1950s/1960s: Reader's Digest and RCA work together to develop some of the
first commercial OCR systems.
 1960s: Postal services around the world begin to use OCR technology for mail-
sorting.
 1974: Raymond Kurzweil develops the Kurzweil Reading Machine that can read
printed pages aloud to blind people. Kurzweil's OCR software is acquired by
Xerox and marketed under the names ScanSoft and (later) Nuance
Communications.
 1993: The Apple Newton MessagePad (PDA) is one of the first handheld
computers to feature handwriting recognition on a touch-sensitive screen.
 2000: Researchers at Carnegie Mellon University flip the problem of developing
a good OCR system on its head—and develop a spam-busting system called
CAPTCHA
 Pre-processing:
Deals with improving quality of the image for better
recognition by the system. Techniques include –
 De- skew
 Despeckle
 Binarization
 Line removal
 Zoning etc..
 Character recognition:
There are two basic types of core OCR algorithm which may
produce a ranked list of candidate character –
 Matrix matching
 Feature extraction
 Post-processing:
OCR accuracy can be increased if the output is constrained by
lexicon. Eg. all the words in the English language can be
problematic if the document contains words that are not in
the lexicon, like proper nouns.
 Data entry for business documents, e.g. check, passport,
invoice, bank statement and receipt
 Automatic number plate recognition
 Automatic insurance documents key information
extraction
 Extracting business card information into a contact list
 More quickly make textual versions of printed documents,
e.g. book scanning for Project Gutenberg
 Make electronic images of printed documents searchable,
e.g. Google Books
 Converting handwriting in real time to control a computer
(pen computing)
 Assistive technology for blind and visually impaired users
 Once a printed page is in this machine-
readable text form, one can do all kinds of thing
that couldn't do before.
 Machine-readable text can also be decoded by
screen readers, tools that use speech synthesizers
to read out the words on a screen so blind and
visually impaired people can understand them.
 In the 1970s, one of the first major uses of OCR
was in a photocopier-like device called the
Kurzweil Reading Machine, which could read
printed books out loud to blind people.
 Institutional repositories are digital collections of the
outputs created within an institution. It collects
intellectual data of an institution, especially a
research institution where it is collected, preserved
and aired. It is basically a collection of peer reviewed
journal articles, conference proceedings, research
data, monographs, books, theses and dissertations
and presentations. Practical implementation of this
includes setting up a system which consists of
scanner which scans the documents. This scanned
document is then fed as an input to an Optical
Character Recognition system where information is
acquired and retained in digitized form.
 Nowadays, a lot of documents are produced in paper form but it
is obvious, that automatic data recognition systems are very
popular.
 Though researchers have suggested various sophisticated ideas
and techniques, practical OCR systems suffer from a lack of
various characteristics. It is because of the claims made by the
researchers are not adequately justified by exposure of the
systems into real working environments and the lack of practical
feasibility of such advanced techniques with the available
hardware from an economical viewpoint. From these constraints
and the lack of performances it can be concluded that the ability
to read text by machines with the same fluency as the human
remains an unachieved goal, though a great amount of effort has
already been expended on the subject.
 However, the frontiers of character recognition have now moved
to the recognition of cursive script that is the recognition of
characters which may be connected or written in calligraphy.
 Asif , Ali Mir Arif Mir, Hannan, Shaikh Abdul, Perwej,
Yusuf, Vithalrao, Mane Arjun. An Overview and
Applications of Optical Character Recognition.
International Journal of Advance Research In Science
And Engineering , Vol. 3(7), 261-274p.
 https://en.wikipedia.org/wiki/Optical_character_reco
gnition (accessed in 10/03/2017)
 http://www.webopedia.com/TERM/O/optical_charact
er_recognition.html (accessed in 10/03/2017)
 http://www.computerhope.com/jargon/o/ocr.htm
(accessed in 10/03/2017)
 http://www.explainthatstuff.com/how-ocr-
works.html (accessed in 10/03/2017)
Optical character recognition (ocr) ppt

More Related Content

What's hot

Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character RecognitionRahul Mallik
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESA STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESijcsitcejournal
 
Handwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionHandwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionNaiyan Noor
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Vidyut Singhania
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition pptSantosh Kumar
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyEr. Ashish Pandey
 
OCR 's Functions
OCR 's FunctionsOCR 's Functions
OCR 's Functionsprithvi764
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using PythonYogeshIJTSRD
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character RecognitionDurjoy Saha
 
Presentation on OCR
Presentation on OCRPresentation on OCR
Presentation on OCRxsconfused
 
Attendance system based on face recognition using python by Raihan Sikdar
Attendance system based on face recognition using python by Raihan SikdarAttendance system based on face recognition using python by Raihan Sikdar
Attendance system based on face recognition using python by Raihan Sikdarraihansikdar
 
Handwriting Recognition
Handwriting RecognitionHandwriting Recognition
Handwriting RecognitionBindu Karki
 

What's hot (20)

Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESA STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
 
Handwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionHandwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer Version
 
Text reader [OCR]
Text reader [OCR]Text reader [OCR]
Text reader [OCR]
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition
 
Ocr abstract
Ocr abstractOcr abstract
Ocr abstract
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper Study
 
OCR 's Functions
OCR 's FunctionsOCR 's Functions
OCR 's Functions
 
Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
 
OCR Text Extraction
OCR Text ExtractionOCR Text Extraction
OCR Text Extraction
 
Basics of-optical-character-recognition
Basics of-optical-character-recognitionBasics of-optical-character-recognition
Basics of-optical-character-recognition
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using Python
 
Biometric Authentication PPT
Biometric Authentication PPTBiometric Authentication PPT
Biometric Authentication PPT
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Image Processing ppt
Image Processing pptImage Processing ppt
Image Processing ppt
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
Presentation on OCR
Presentation on OCRPresentation on OCR
Presentation on OCR
 
Attendance system based on face recognition using python by Raihan Sikdar
Attendance system based on face recognition using python by Raihan SikdarAttendance system based on face recognition using python by Raihan Sikdar
Attendance system based on face recognition using python by Raihan Sikdar
 
Handwriting Recognition
Handwriting RecognitionHandwriting Recognition
Handwriting Recognition
 

Viewers also liked

SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...Urjanet
 
SPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product VisionSPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product VisionUrjanet
 
SPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & SustainabilitySPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & SustainabilityUrjanet
 
The Credit Score Present and Future
The Credit Score Present and FutureThe Credit Score Present and Future
The Credit Score Present and FutureUrjanet
 
How to Access Utility Data
How to Access Utility DataHow to Access Utility Data
How to Access Utility DataUrjanet
 
OCR vs. Urjanet
OCR vs. UrjanetOCR vs. Urjanet
OCR vs. UrjanetUrjanet
 
Spark 2017 Key Takeaways
Spark 2017 Key TakeawaysSpark 2017 Key Takeaways
Spark 2017 Key TakeawaysUrjanet
 
SPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through GamificationSPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through GamificationUrjanet
 

Viewers also liked (8)

SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
 
SPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product VisionSPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product Vision
 
SPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & SustainabilitySPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & Sustainability
 
The Credit Score Present and Future
The Credit Score Present and FutureThe Credit Score Present and Future
The Credit Score Present and Future
 
How to Access Utility Data
How to Access Utility DataHow to Access Utility Data
How to Access Utility Data
 
OCR vs. Urjanet
OCR vs. UrjanetOCR vs. Urjanet
OCR vs. Urjanet
 
Spark 2017 Key Takeaways
Spark 2017 Key TakeawaysSpark 2017 Key Takeaways
Spark 2017 Key Takeaways
 
SPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through GamificationSPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through Gamification
 

Similar to Optical character recognition (ocr) ppt

How to create a corpus of machine-readable texts: challenges and solutions
How to create a corpus of machine-readable texts: challenges and solutionsHow to create a corpus of machine-readable texts: challenges and solutions
How to create a corpus of machine-readable texts: challenges and solutionsMonika Renate Barget
 
A detailed study and recent research on handwritten recognition
A detailed study and recent research on handwritten recognitionA detailed study and recent research on handwritten recognition
A detailed study and recent research on handwritten recognitionShruthiamar
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?ARC Document Solutions
 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PIijtsrd
 
A Detailed Study And Recent Research On OCR
A Detailed Study And Recent Research On OCRA Detailed Study And Recent Research On OCR
A Detailed Study And Recent Research On OCRDaniel Wachtel
 
Applications and benefits of optical character recognition technology
Applications and benefits of optical character recognition technologyApplications and benefits of optical character recognition technology
Applications and benefits of optical character recognition technologySameerShaik43
 
Pattern recognition research, conclusion inforamtion (2020)
Pattern recognition research, conclusion inforamtion (2020)Pattern recognition research, conclusion inforamtion (2020)
Pattern recognition research, conclusion inforamtion (2020)Ahmed Magdy
 
Optical character recognization word
Optical character recognization wordOptical character recognization word
Optical character recognization wordDhana K
 
A brief history of Optical Character Recognition (OCR)
A brief history of Optical Character Recognition (OCR)A brief history of Optical Character Recognition (OCR)
A brief history of Optical Character Recognition (OCR)Pitney Bowes
 
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...E42 (Light Information Systems Pvt Ltd)
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Editor IJARCET
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Editor IJARCET
 

Similar to Optical character recognition (ocr) ppt (20)

How to create a corpus of machine-readable texts: challenges and solutions
How to create a corpus of machine-readable texts: challenges and solutionsHow to create a corpus of machine-readable texts: challenges and solutions
How to create a corpus of machine-readable texts: challenges and solutions
 
A detailed study and recent research on handwritten recognition
A detailed study and recent research on handwritten recognitionA detailed study and recent research on handwritten recognition
A detailed study and recent research on handwritten recognition
 
D017222226
D017222226D017222226
D017222226
 
CRC Final Report
CRC Final ReportCRC Final Report
CRC Final Report
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?
 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PI
 
A Detailed Study And Recent Research On OCR
A Detailed Study And Recent Research On OCRA Detailed Study And Recent Research On OCR
A Detailed Study And Recent Research On OCR
 
Applications and benefits of optical character recognition technology
Applications and benefits of optical character recognition technologyApplications and benefits of optical character recognition technology
Applications and benefits of optical character recognition technology
 
Bj35343348
Bj35343348Bj35343348
Bj35343348
 
Pattern recognition research, conclusion inforamtion (2020)
Pattern recognition research, conclusion inforamtion (2020)Pattern recognition research, conclusion inforamtion (2020)
Pattern recognition research, conclusion inforamtion (2020)
 
Optical character recognization word
Optical character recognization wordOptical character recognization word
Optical character recognization word
 
OCR, optical character reader
OCR, optical character readerOCR, optical character reader
OCR, optical character reader
 
Ijetcas14 371
Ijetcas14 371Ijetcas14 371
Ijetcas14 371
 
Z04405149151
Z04405149151Z04405149151
Z04405149151
 
Ocr 1
Ocr 1Ocr 1
Ocr 1
 
A brief history of Optical Character Recognition (OCR)
A brief history of Optical Character Recognition (OCR)A brief history of Optical Character Recognition (OCR)
A brief history of Optical Character Recognition (OCR)
 
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
 
Paper based interaction
Paper based interactionPaper based interaction
Paper based interaction
 

Recently uploaded

Karol Bagh Call Girls : ☎ 8527673949, Low rate Call Girls
Karol Bagh Call Girls : ☎ 8527673949, Low rate Call GirlsKarol Bagh Call Girls : ☎ 8527673949, Low rate Call Girls
Karol Bagh Call Girls : ☎ 8527673949, Low rate Call Girlsashishs7044
 
FULL ENJOY - 9953040155 Call Girls in Noida | Delhi
FULL ENJOY - 9953040155 Call Girls in Noida | DelhiFULL ENJOY - 9953040155 Call Girls in Noida | Delhi
FULL ENJOY - 9953040155 Call Girls in Noida | DelhiMalviyaNagarCallGirl
 
Russian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts Service
Russian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts ServiceRussian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts Service
Russian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts Servicedoor45step
 
9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr
9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr
9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi NcrSapana Sha
 
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | DelhiMalviyaNagarCallGirl
 
Roadrunner Lodge, Motel/Residence, Tucumcari NM
Roadrunner Lodge, Motel/Residence, Tucumcari NMRoadrunner Lodge, Motel/Residence, Tucumcari NM
Roadrunner Lodge, Motel/Residence, Tucumcari NMroute66connected
 
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | DelhiMalviyaNagarCallGirl
 
FULL ENJOY - 9953040155 Call Girls in Mahipalpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Mahipalpur | DelhiFULL ENJOY - 9953040155 Call Girls in Mahipalpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Mahipalpur | DelhiMalviyaNagarCallGirl
 
Faridabad Call Girls : ☎ 8527673949, Low rate Call Girls
Faridabad Call Girls : ☎ 8527673949, Low rate Call GirlsFaridabad Call Girls : ☎ 8527673949, Low rate Call Girls
Faridabad Call Girls : ☎ 8527673949, Low rate Call Girlsashishs7044
 
San Jon Motel, Motel/Residence, San Jon NM
San Jon Motel, Motel/Residence, San Jon NMSan Jon Motel, Motel/Residence, San Jon NM
San Jon Motel, Motel/Residence, San Jon NMroute66connected
 
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | DelhiMalviyaNagarCallGirl
 
FULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | DelhiMalviyaNagarCallGirl
 
Pragati Maidan Call Girls : ☎ 8527673949, Low rate Call Girls
Pragati Maidan Call Girls : ☎ 8527673949, Low rate Call GirlsPragati Maidan Call Girls : ☎ 8527673949, Low rate Call Girls
Pragati Maidan Call Girls : ☎ 8527673949, Low rate Call Girlsashishs7044
 
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | DelhiMalviyaNagarCallGirl
 
MinSheng Gaofeng Estate commercial storyboard
MinSheng Gaofeng Estate commercial storyboardMinSheng Gaofeng Estate commercial storyboard
MinSheng Gaofeng Estate commercial storyboardjessica288382
 
Khanpur Call Girls : ☎ 8527673949, Low rate Call Girls
Khanpur Call Girls : ☎ 8527673949, Low rate Call GirlsKhanpur Call Girls : ☎ 8527673949, Low rate Call Girls
Khanpur Call Girls : ☎ 8527673949, Low rate Call Girlsashishs7044
 
Indian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts Service
Indian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts ServiceIndian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts Service
Indian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts Servicedoor45step
 
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur DubaiBur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubaidajasot375
 

Recently uploaded (20)

Karol Bagh Call Girls : ☎ 8527673949, Low rate Call Girls
Karol Bagh Call Girls : ☎ 8527673949, Low rate Call GirlsKarol Bagh Call Girls : ☎ 8527673949, Low rate Call Girls
Karol Bagh Call Girls : ☎ 8527673949, Low rate Call Girls
 
call girls in Noida New Ashok Nagar 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
call girls in Noida New Ashok Nagar 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...call girls in Noida New Ashok Nagar 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
call girls in Noida New Ashok Nagar 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
 
FULL ENJOY - 9953040155 Call Girls in Noida | Delhi
FULL ENJOY - 9953040155 Call Girls in Noida | DelhiFULL ENJOY - 9953040155 Call Girls in Noida | Delhi
FULL ENJOY - 9953040155 Call Girls in Noida | Delhi
 
Call~Girl in Rajendra Nagar New Delhi 8448380779 Full Enjoy Escort Service
Call~Girl in Rajendra Nagar New Delhi 8448380779 Full Enjoy Escort ServiceCall~Girl in Rajendra Nagar New Delhi 8448380779 Full Enjoy Escort Service
Call~Girl in Rajendra Nagar New Delhi 8448380779 Full Enjoy Escort Service
 
Russian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts Service
Russian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts ServiceRussian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts Service
Russian⚡ Call Girls In Sector 104 Noida✨8375860717⚡Escorts Service
 
9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr
9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr
9654467111 Full Enjoy @24/7 Call Girls In Saket Delhi Ncr
 
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
 
Roadrunner Lodge, Motel/Residence, Tucumcari NM
Roadrunner Lodge, Motel/Residence, Tucumcari NMRoadrunner Lodge, Motel/Residence, Tucumcari NM
Roadrunner Lodge, Motel/Residence, Tucumcari NM
 
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
 
FULL ENJOY - 9953040155 Call Girls in Mahipalpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Mahipalpur | DelhiFULL ENJOY - 9953040155 Call Girls in Mahipalpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Mahipalpur | Delhi
 
Faridabad Call Girls : ☎ 8527673949, Low rate Call Girls
Faridabad Call Girls : ☎ 8527673949, Low rate Call GirlsFaridabad Call Girls : ☎ 8527673949, Low rate Call Girls
Faridabad Call Girls : ☎ 8527673949, Low rate Call Girls
 
San Jon Motel, Motel/Residence, San Jon NM
San Jon Motel, Motel/Residence, San Jon NMSan Jon Motel, Motel/Residence, San Jon NM
San Jon Motel, Motel/Residence, San Jon NM
 
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
 
FULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Lajpat Nagar | Delhi
 
Pragati Maidan Call Girls : ☎ 8527673949, Low rate Call Girls
Pragati Maidan Call Girls : ☎ 8527673949, Low rate Call GirlsPragati Maidan Call Girls : ☎ 8527673949, Low rate Call Girls
Pragati Maidan Call Girls : ☎ 8527673949, Low rate Call Girls
 
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
 
MinSheng Gaofeng Estate commercial storyboard
MinSheng Gaofeng Estate commercial storyboardMinSheng Gaofeng Estate commercial storyboard
MinSheng Gaofeng Estate commercial storyboard
 
Khanpur Call Girls : ☎ 8527673949, Low rate Call Girls
Khanpur Call Girls : ☎ 8527673949, Low rate Call GirlsKhanpur Call Girls : ☎ 8527673949, Low rate Call Girls
Khanpur Call Girls : ☎ 8527673949, Low rate Call Girls
 
Indian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts Service
Indian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts ServiceIndian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts Service
Indian High Profile Call Girls In Sector 18 Noida 8375860717 Escorts Service
 
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur DubaiBur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
 

Optical character recognition (ocr) ppt

  • 1. Presented by Deijee Kalita MLISc. 4th sem Roll No. 03 Department of Library & Information Science Gauhati University
  • 2.  Nowadays, a lot of paper documents are transformed to electronic form, which makes information processing easier, like searching, analysis and conversion.  Many companies and other institutions decide to digitalize their documents. Working with files is cheaper than processing traditional documents, because there is no space required for document storage. There are three main steps of document digitalization: scanning, indexation (data entry) and presentation of digitalized documents.  Researchers proved that the recognition of both barcodes and printed text through Optical Character Recognition or OCR is reliable and significantly accelerates data processing. On the contrary, the handwritten text appeared difficult to recognize by OCR systems.
  • 3.  Optical Character Recognition or OCR is a system that provides a full alphanumeric recognition of printed or handwritten characters at electronic speed by simply scanning the form.  It is the mechanical or electronic conversion of scanned or photographed images of typewritten or printed text into machine-encoded/computer- readable text.  OCR is a field of research in pattern recognition, artificial intelligence and computer vision. It is the electronic translation of handwritten, typewritten or printed text into machine translated images.
  • 4. History of OCR:  1928/9: Gustav Tauschek of Vienna, Austria patents a basic OCR "reading machine.“  1949: L.E. Flory and W.S. Pike of RCA Laboratories develop a photocell-based machine that can read text to blind people at a rate of 60 words per minute.  1950: David H. Shepard develops machines that can turn printed information into machine-readable form for the US military and later founds a pioneering OCR company called Intelligent Machines Research (IMR).  1960: Lawrence (Larry) Roberts, a computer graphics researcher working at MIT, develops early text recognition using specially simplified fonts such as OCR-A.  1950s/1960s: Reader's Digest and RCA work together to develop some of the first commercial OCR systems.  1960s: Postal services around the world begin to use OCR technology for mail- sorting.  1974: Raymond Kurzweil develops the Kurzweil Reading Machine that can read printed pages aloud to blind people. Kurzweil's OCR software is acquired by Xerox and marketed under the names ScanSoft and (later) Nuance Communications.  1993: The Apple Newton MessagePad (PDA) is one of the first handheld computers to feature handwriting recognition on a touch-sensitive screen.  2000: Researchers at Carnegie Mellon University flip the problem of developing a good OCR system on its head—and develop a spam-busting system called CAPTCHA
  • 5.  Pre-processing: Deals with improving quality of the image for better recognition by the system. Techniques include –  De- skew  Despeckle  Binarization  Line removal  Zoning etc..  Character recognition: There are two basic types of core OCR algorithm which may produce a ranked list of candidate character –  Matrix matching  Feature extraction  Post-processing: OCR accuracy can be increased if the output is constrained by lexicon. Eg. all the words in the English language can be problematic if the document contains words that are not in the lexicon, like proper nouns.
  • 6.  Data entry for business documents, e.g. check, passport, invoice, bank statement and receipt  Automatic number plate recognition  Automatic insurance documents key information extraction  Extracting business card information into a contact list  More quickly make textual versions of printed documents, e.g. book scanning for Project Gutenberg  Make electronic images of printed documents searchable, e.g. Google Books  Converting handwriting in real time to control a computer (pen computing)  Assistive technology for blind and visually impaired users
  • 7.  Once a printed page is in this machine- readable text form, one can do all kinds of thing that couldn't do before.  Machine-readable text can also be decoded by screen readers, tools that use speech synthesizers to read out the words on a screen so blind and visually impaired people can understand them.  In the 1970s, one of the first major uses of OCR was in a photocopier-like device called the Kurzweil Reading Machine, which could read printed books out loud to blind people.
  • 8.  Institutional repositories are digital collections of the outputs created within an institution. It collects intellectual data of an institution, especially a research institution where it is collected, preserved and aired. It is basically a collection of peer reviewed journal articles, conference proceedings, research data, monographs, books, theses and dissertations and presentations. Practical implementation of this includes setting up a system which consists of scanner which scans the documents. This scanned document is then fed as an input to an Optical Character Recognition system where information is acquired and retained in digitized form.
  • 9.  Nowadays, a lot of documents are produced in paper form but it is obvious, that automatic data recognition systems are very popular.  Though researchers have suggested various sophisticated ideas and techniques, practical OCR systems suffer from a lack of various characteristics. It is because of the claims made by the researchers are not adequately justified by exposure of the systems into real working environments and the lack of practical feasibility of such advanced techniques with the available hardware from an economical viewpoint. From these constraints and the lack of performances it can be concluded that the ability to read text by machines with the same fluency as the human remains an unachieved goal, though a great amount of effort has already been expended on the subject.  However, the frontiers of character recognition have now moved to the recognition of cursive script that is the recognition of characters which may be connected or written in calligraphy.
  • 10.  Asif , Ali Mir Arif Mir, Hannan, Shaikh Abdul, Perwej, Yusuf, Vithalrao, Mane Arjun. An Overview and Applications of Optical Character Recognition. International Journal of Advance Research In Science And Engineering , Vol. 3(7), 261-274p.  https://en.wikipedia.org/wiki/Optical_character_reco gnition (accessed in 10/03/2017)  http://www.webopedia.com/TERM/O/optical_charact er_recognition.html (accessed in 10/03/2017)  http://www.computerhope.com/jargon/o/ocr.htm (accessed in 10/03/2017)  http://www.explainthatstuff.com/how-ocr- works.html (accessed in 10/03/2017)