SlideShare a Scribd company logo
1 of 11
Download to read offline
OPTICAL CHARACTER
RECOGNITION
Divyanshu Sagar
Ahmed Zaid Faizee
Vidyut Singhania
INTRO
1. Ingenious piece of software.
2. Involves the mechanical/electronic
conversion of scanned images of
typewritten/printed text into machine-
encoded/computer-readable text.
• 3. Heavily used in the
industry.
INTRO ii
• Common method of digitizing printed texts
• Subtle software which is as highly overlooked as it is simple.
• Numerous applications and uses – editing, scanning,
searching, comparison, compact storage and many more!
• OCR is a field of research in pattern recognition, artificial
intelligence and computer vision.
Problem Statement
Ever since Charles Babbage invented the computer back in the early 19th
century, Computer machines have held man's imagination for numerous reasons - the
primary being what all is this collection of nuts, bolts and wires capable of doing.
Character Recognition is one such concept which has beheld mankind’s attention. There
can be no greater testimony to the same than the fact that people were already working on
this idea - a few decades before John McCarthy even coined the term "Artificial
Intelligence".
Today, especially, Character Recognition plays a very important part of our daily lives as
they are incorporated so subtly that we even forget their presence. Some examples are
their implementation in Microsoft Word, Adobe Acrobat and even Pen computing.
Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned
or photoed images of typewritten or printed text into machine-encoded/computer-
readable text. This text can then be used in numerous ways - ranging from assisting the
visually impaired (text-to-speech), extracting information from the image, pen computing
and so on. Optical Character Recognition (OCR) is a result of cross-linking various avenues
of technology like Machine Learning, Artificial Intelligence and Neural Networks. We
propose to develop a system based on mathematical algorithms and principles which
involve all the aforementioned technologies. That being said, Optical Character Recognition
(OCR) also depends on a few other factors : the quality of the image taken, the orientation
of and the dialect being used. Our paper aims to address the aforementioned
problems, which enables its application in numerous new fields as well as the obvious &
established aspects of our surroundings.
Tech Jargon - I
• Pre-processing
Used to improve the successful
recognition of the image (include De-
skew, Layout analysis, Despeckle)
• Character/glyph recognition
• Post-processing
• Application specific optimization
Tweaking the system to better deal
with specific or different inputs.
Tech Jargon - II
Segmentation
Includes two important phases:
1) Obtaining training samples
2) Recognizing new images after
training
Feature Extraction
Feature of the character are extracted
and hence are compared with the glyph
Classification
After the extraction, neural network is
trained using the training data
Our Current Progress
• We started with the Neural Networks / Machine Learning
aspect of the project.
• We have implemented Univariate / Multivariate
Linear/Regularized Linear Regression, Gradient Descent for
Multiple Variables and Logistic/ Regularized Logistic
Regression.
• Currently, we are studying & working on the
implementation of Neural Nets using Forward Propogation.
• We plan on tackling character segmentation and feature
extraction next.
Technology to be used
• We are using the following technology
platforms :
– GNU Octave
To develop and test the OCR software.
– 5MP HD camera (720p @ 30fps)
To take images for detection
Timeline
Literature Review
• Microsoft One Note
• Adobe PDF scanner
• HP scanner
ocrppt-140415204404-phpapp01.pdf

More Related Content

Similar to ocrppt-140415204404-phpapp01.pdf

Similar to ocrppt-140415204404-phpapp01.pdf (20)

Computer architecture for vision system
Computer architecture for vision systemComputer architecture for vision system
Computer architecture for vision system
 
IRJET- Sign Language Interpreter
IRJET- Sign Language InterpreterIRJET- Sign Language Interpreter
IRJET- Sign Language Interpreter
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
 
IRJET- Object Detection in an Image using Deep Learning
IRJET- Object Detection in an Image using Deep LearningIRJET- Object Detection in an Image using Deep Learning
IRJET- Object Detection in an Image using Deep Learning
 
Optical Recognition of Handwritten Text
Optical Recognition of Handwritten TextOptical Recognition of Handwritten Text
Optical Recognition of Handwritten Text
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten Characters
 
Saksham presentation
Saksham presentationSaksham presentation
Saksham presentation
 
AI GRPOUP 4 PRESENTATION.pptx
AI GRPOUP 4 PRESENTATION.pptxAI GRPOUP 4 PRESENTATION.pptx
AI GRPOUP 4 PRESENTATION.pptx
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using Python
 
Utilization of Machine Learning in Computer Vision
Utilization of Machine Learning in Computer VisionUtilization of Machine Learning in Computer Vision
Utilization of Machine Learning in Computer Vision
 
IRJET- Navigation and Camera Reading System for Visually Impaired
IRJET- Navigation and Camera Reading System for Visually ImpairedIRJET- Navigation and Camera Reading System for Visually Impaired
IRJET- Navigation and Camera Reading System for Visually Impaired
 
IRJET- Text Recognization of Product for Blind Person using MATLAB
IRJET- Text Recognization of Product for Blind Person using MATLABIRJET- Text Recognization of Product for Blind Person using MATLAB
IRJET- Text Recognization of Product for Blind Person using MATLAB
 
4 Best Computer Vision Use Cases for Solving Business Challenges
4 Best Computer Vision Use Cases for Solving Business Challenges4 Best Computer Vision Use Cases for Solving Business Challenges
4 Best Computer Vision Use Cases for Solving Business Challenges
 
Artificial Intelligence (A.I) and Its Application -Seminar
Artificial Intelligence (A.I) and Its Application -SeminarArtificial Intelligence (A.I) and Its Application -Seminar
Artificial Intelligence (A.I) and Its Application -Seminar
 
ARTIFICIAL INTELLIGENCE.pptx
ARTIFICIAL INTELLIGENCE.pptxARTIFICIAL INTELLIGENCE.pptx
ARTIFICIAL INTELLIGENCE.pptx
 
ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062
ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062
ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062
 
YOLOv4: A Face Mask Detection System
YOLOv4: A Face Mask Detection SystemYOLOv4: A Face Mask Detection System
YOLOv4: A Face Mask Detection System
 
IRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe ModelIRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe Model
 
What Is a Computer Vision Engineer- Unlocking the Power of Sight in Machines
What Is a Computer Vision Engineer- Unlocking the Power of Sight in MachinesWhat Is a Computer Vision Engineer- Unlocking the Power of Sight in Machines
What Is a Computer Vision Engineer- Unlocking the Power of Sight in Machines
 

Recently uploaded

Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
drm1699
 

Recently uploaded (20)

Modern binary build systems - PyCon 2024
Modern binary build systems - PyCon 2024Modern binary build systems - PyCon 2024
Modern binary build systems - PyCon 2024
 
The mythical technical debt. (Brooke, please, forgive me)
The mythical technical debt. (Brooke, please, forgive me)The mythical technical debt. (Brooke, please, forgive me)
The mythical technical debt. (Brooke, please, forgive me)
 
Spring into AI presented by Dan Vega 5/14
Spring into AI presented by Dan Vega 5/14Spring into AI presented by Dan Vega 5/14
Spring into AI presented by Dan Vega 5/14
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
 
The Strategic Impact of Buying vs Building in Test Automation
The Strategic Impact of Buying vs Building in Test AutomationThe Strategic Impact of Buying vs Building in Test Automation
The Strategic Impact of Buying vs Building in Test Automation
 
Lessons Learned from Building a Serverless Notifications System.pdf
Lessons Learned from Building a Serverless Notifications System.pdfLessons Learned from Building a Serverless Notifications System.pdf
Lessons Learned from Building a Serverless Notifications System.pdf
 
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
 
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
 
Weeding your micro service landscape.pdf
Weeding your micro service landscape.pdfWeeding your micro service landscape.pdf
Weeding your micro service landscape.pdf
 
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
Auto Affiliate  AI Earns First Commission in 3 Hours..pdfAuto Affiliate  AI Earns First Commission in 3 Hours..pdf
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
 
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
 
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
 
architecting-ai-in-the-enterprise-apis-and-applications.pdf
architecting-ai-in-the-enterprise-apis-and-applications.pdfarchitecting-ai-in-the-enterprise-apis-and-applications.pdf
architecting-ai-in-the-enterprise-apis-and-applications.pdf
 
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with GraphGraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
 
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
 
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
 

ocrppt-140415204404-phpapp01.pdf

  • 2. INTRO 1. Ingenious piece of software. 2. Involves the mechanical/electronic conversion of scanned images of typewritten/printed text into machine- encoded/computer-readable text. • 3. Heavily used in the industry.
  • 3. INTRO ii • Common method of digitizing printed texts • Subtle software which is as highly overlooked as it is simple. • Numerous applications and uses – editing, scanning, searching, comparison, compact storage and many more! • OCR is a field of research in pattern recognition, artificial intelligence and computer vision.
  • 4. Problem Statement Ever since Charles Babbage invented the computer back in the early 19th century, Computer machines have held man's imagination for numerous reasons - the primary being what all is this collection of nuts, bolts and wires capable of doing. Character Recognition is one such concept which has beheld mankind’s attention. There can be no greater testimony to the same than the fact that people were already working on this idea - a few decades before John McCarthy even coined the term "Artificial Intelligence". Today, especially, Character Recognition plays a very important part of our daily lives as they are incorporated so subtly that we even forget their presence. Some examples are their implementation in Microsoft Word, Adobe Acrobat and even Pen computing. Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned or photoed images of typewritten or printed text into machine-encoded/computer- readable text. This text can then be used in numerous ways - ranging from assisting the visually impaired (text-to-speech), extracting information from the image, pen computing and so on. Optical Character Recognition (OCR) is a result of cross-linking various avenues of technology like Machine Learning, Artificial Intelligence and Neural Networks. We propose to develop a system based on mathematical algorithms and principles which involve all the aforementioned technologies. That being said, Optical Character Recognition (OCR) also depends on a few other factors : the quality of the image taken, the orientation of and the dialect being used. Our paper aims to address the aforementioned problems, which enables its application in numerous new fields as well as the obvious & established aspects of our surroundings.
  • 5. Tech Jargon - I • Pre-processing Used to improve the successful recognition of the image (include De- skew, Layout analysis, Despeckle) • Character/glyph recognition • Post-processing • Application specific optimization Tweaking the system to better deal with specific or different inputs.
  • 6. Tech Jargon - II Segmentation Includes two important phases: 1) Obtaining training samples 2) Recognizing new images after training Feature Extraction Feature of the character are extracted and hence are compared with the glyph Classification After the extraction, neural network is trained using the training data
  • 7. Our Current Progress • We started with the Neural Networks / Machine Learning aspect of the project. • We have implemented Univariate / Multivariate Linear/Regularized Linear Regression, Gradient Descent for Multiple Variables and Logistic/ Regularized Logistic Regression. • Currently, we are studying & working on the implementation of Neural Nets using Forward Propogation. • We plan on tackling character segmentation and feature extraction next.
  • 8. Technology to be used • We are using the following technology platforms : – GNU Octave To develop and test the OCR software. – 5MP HD camera (720p @ 30fps) To take images for detection
  • 10. Literature Review • Microsoft One Note • Adobe PDF scanner • HP scanner