SlideShare a Scribd company logo
1 of 16
An Introduction demo on

Mobile Camera Based Text Detection & Translation

Under The Guidance Of: Prof. Shweta Patil.
Presented By:
Akash Y Shindhe
Manjunath N Nayak
Sachin B Biradar
Vallabh G Potadar
Contents
 Introduction
 History
 Existing System
 Proposed System
 System Flow
 Requirement
 Block Diagram
 Test & Results
 Applications
 Advantages & Limitation
 Conclusion
 Bibliography

2
Introduction…
Our project ‘Mobile camera based text detection and
translation’ retrieves text from an images and converts it into
text format, then it is translated to specified language.

3
Existing System
 In 1929, first OCR device was invented but it was mechanical device
 In about 1965, earliest form of OCR was implemented in one of the
first generation computers for Airline Ticket stock.
 Revolutionary in 1971, it was implemented in postal services OCR
systems where reading and printing of routing bar code was done on
the postal code.
 In 1974, the modifications was done which would allow blind
people to have a computer read text to them out loud.
 In late 90’s, Webcam was used for OCR process.
4
Working…
 Capture image
 Detect edges
 Detect corners
 Match with stored image file
 Retrieve text from image
 Translate using Google API
 Show Result
5
Working Diagram

Fig. a: Working diagram
6
System flow
 Algorithms:
 Edge detection
 Image feature filtering
 Image binarization
 Optical character recognition
 Text correction
 Text translation
 Display of translation
7
Requirement
Mobile Hardware Requirements:
• ARM 11 processor or higher
• Memory 1 GB
• 256 MB RAM
• Mobile camera 3.2 mega pixel and above

Software Requirements:
• Operating System – Android Mob OS 2.2+
• Windows 7 OS
• Mat lab OCR,ADT bundles
Communication Requirements:
• Internet Connection is required
• Android Mobile OS inbuilt web browser
8
Block Diagram
Captured
Image

Text Feature
Filtering

Match
Image

File Library

Retrieve
Text

Google
APIs

Translate
Text

Display
Output Text
Fig. b: Block diagram

9
Example

c.1

c.2

c.5

Fig c.1
Fig c.2
Fig c.3
Fig c.4
Fig c.5

c.3

c.4
Fig. c: Example

10
Test & Results
Image quality :
As image quality degraded recognition rate will decrease

Recognition rate of character ‘A’ , ‘B’ , ‘L’ will be higher than recognition
rate of character ‘y’ , ‘u’ , ‘c’.
Fig. d: Test & result

11
Applications
Tourist understanding native language.

 Instant recognition of texts, street and e-mail
addresses, links, and telephone numbers.
 Unknown language guideline.
 Easy to recognize road signs scripts.

12
Advantages
 Android Mobile OS based platform.
 No tiresome manual data entry.
 Versatility and ease of use.
 No database is needed
 For data entry

13
Limitations
 Image taken by Mobile camera should be of good quality.
 Mobile should be of high specifications

 For translation of extracted text , Internet connection is
required.

14
Conclusion
This project which we want to implement is an Android
Mobile OS based application which is web based real time mobile
application for real-time text extraction, recognition and
translation.

15
Bibliography
1. Michael Hsueh “Interactive Text Recognition and Translation on a Mobile Device “
[Technical Report No. UCB/EECS-2011-57 ]

2. Yassin M.Y.Hasan and Lina J.Karam “Morphological Text Extraction from Images” IEEE
Transaction on Image Processing Vol.9 No.11, Nov 2000
3. Nobuyuki Otsu, A threshold selection method from gray-level histograms. IEEE
Trans.Sys.,Man., Cyber 9(1):62-66
4. Celine Mancas-Thillou, Bernard Gosselin, Color text extraction with selective metric
based clustering. Computer Vision and Image Understanding 2007
5. B. Epshtein, Detecting Text in Natural Scenes with Stroke Width Transform. Image
Rochester NY, pp. 1-8.
6. Derek Ma , Qiuhau Lin, Tong Zhang “Mobile Camera Based Text Detection and
Translation” – research paper
7. WWW.wikipedia.org/optical_character_recognization
16

More Related Content

What's hot

Ui ux designing principles
Ui ux designing principlesUi ux designing principles
Ui ux designing principlesDzung Nguyen
 
Augmented Reality (AR)
Augmented Reality (AR)Augmented Reality (AR)
Augmented Reality (AR)Samsil Arefin
 
A Web for Everyone: Accessibility as a design challenge
A Web for Everyone: Accessibility as a design challengeA Web for Everyone: Accessibility as a design challenge
A Web for Everyone: Accessibility as a design challengeWhitney Quesenbery
 
An Introduction to Computer Vision
An Introduction to Computer VisionAn Introduction to Computer Vision
An Introduction to Computer Visionguestd1b1b5
 
Web Development on Web Project Presentation
Web Development on Web Project PresentationWeb Development on Web Project Presentation
Web Development on Web Project PresentationMilind Gokhale
 
The memex presentation
The memex presentationThe memex presentation
The memex presentationDavid Lavenda
 
IRJET- Automated Detection of Gender from Face Images
IRJET-  	  Automated Detection of Gender from Face ImagesIRJET-  	  Automated Detection of Gender from Face Images
IRJET- Automated Detection of Gender from Face ImagesIRJET Journal
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using PythonYogeshIJTSRD
 
Indirect Identity Control: delegation, guardianship, and controllership – Dan...
Indirect Identity Control: delegation, guardianship, and controllership – Dan...Indirect Identity Control: delegation, guardianship, and controllership – Dan...
Indirect Identity Control: delegation, guardianship, and controllership – Dan...SSIMeetup
 
Understanding UI/UX Design by Aroyewun Babajide
Understanding UI/UX Design by Aroyewun BabajideUnderstanding UI/UX Design by Aroyewun Babajide
Understanding UI/UX Design by Aroyewun BabajideBabajide Aroyewun
 

What's hot (20)

Ui ux designing principles
Ui ux designing principlesUi ux designing principles
Ui ux designing principles
 
Google chrome
Google chromeGoogle chrome
Google chrome
 
Augmented Reality (AR)
Augmented Reality (AR)Augmented Reality (AR)
Augmented Reality (AR)
 
Google Maps
Google MapsGoogle Maps
Google Maps
 
A Web for Everyone: Accessibility as a design challenge
A Web for Everyone: Accessibility as a design challengeA Web for Everyone: Accessibility as a design challenge
A Web for Everyone: Accessibility as a design challenge
 
An Introduction to Computer Vision
An Introduction to Computer VisionAn Introduction to Computer Vision
An Introduction to Computer Vision
 
The Importance of UX
The Importance of UXThe Importance of UX
The Importance of UX
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Web Development on Web Project Presentation
Web Development on Web Project PresentationWeb Development on Web Project Presentation
Web Development on Web Project Presentation
 
The memex presentation
The memex presentationThe memex presentation
The memex presentation
 
UI/UX Fundamentals
UI/UX FundamentalsUI/UX Fundamentals
UI/UX Fundamentals
 
UX/UI Introduction
UX/UI IntroductionUX/UI Introduction
UX/UI Introduction
 
IRJET- Automated Detection of Gender from Face Images
IRJET-  	  Automated Detection of Gender from Face ImagesIRJET-  	  Automated Detection of Gender from Face Images
IRJET- Automated Detection of Gender from Face Images
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using Python
 
UI / UX Design Presentation
UI / UX Design PresentationUI / UX Design Presentation
UI / UX Design Presentation
 
Object Recognition
Object RecognitionObject Recognition
Object Recognition
 
Indirect Identity Control: delegation, guardianship, and controllership – Dan...
Indirect Identity Control: delegation, guardianship, and controllership – Dan...Indirect Identity Control: delegation, guardianship, and controllership – Dan...
Indirect Identity Control: delegation, guardianship, and controllership – Dan...
 
Virtual reality
Virtual realityVirtual reality
Virtual reality
 
Sign language recognizer
Sign language recognizerSign language recognizer
Sign language recognizer
 
Understanding UI/UX Design by Aroyewun Babajide
Understanding UI/UX Design by Aroyewun BabajideUnderstanding UI/UX Design by Aroyewun Babajide
Understanding UI/UX Design by Aroyewun Babajide
 

Similar to mobile camera based text detection

Word Detection & Translation from image on an android device
Word Detection & Translation from image on an android deviceWord Detection & Translation from image on an android device
Word Detection & Translation from image on an android deviceRitwik Kumar
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET Journal
 
IRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry PiIRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry PiIRJET Journal
 
Optical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdfOptical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdfnavenjeyamurugan
 
Resume_embedded_systems_Onkar_Gulavani_01-05-2017
Resume_embedded_systems_Onkar_Gulavani_01-05-2017Resume_embedded_systems_Onkar_Gulavani_01-05-2017
Resume_embedded_systems_Onkar_Gulavani_01-05-2017Onkar Gulavani
 
Smart Tracking Utilizing GPS and Google Maps API_Grad Paper
Smart Tracking Utilizing GPS and Google Maps API_Grad PaperSmart Tracking Utilizing GPS and Google Maps API_Grad Paper
Smart Tracking Utilizing GPS and Google Maps API_Grad PaperVaidas Karnickas
 
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...IJERA Editor
 
Smart Face Recognition System Analysis
Smart Face Recognition System AnalysisSmart Face Recognition System Analysis
Smart Face Recognition System AnalysisVishal Aditya
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Vidyut Singhania
 
IRJET- Book Reader using Raspberry Pi for Visually Impaired
IRJET- Book Reader using Raspberry Pi for Visually ImpairedIRJET- Book Reader using Raspberry Pi for Visually Impaired
IRJET- Book Reader using Raspberry Pi for Visually ImpairedIRJET Journal
 
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCRA SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCRIRJET Journal
 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translationVivek Bharadwaj
 
Cloud report
Cloud reportCloud report
Cloud reportDeep Shah
 
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNINGIMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNINGIRJET Journal
 

Similar to mobile camera based text detection (20)

Word Detection & Translation from image on an android device
Word Detection & Translation from image on an android deviceWord Detection & Translation from image on an android device
Word Detection & Translation from image on an android device
 
Mob ocr
Mob ocrMob ocr
Mob ocr
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for Blind
 
IRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry PiIRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry Pi
 
Optical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdfOptical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdf
 
201001162_report
201001162_report201001162_report
201001162_report
 
Resume_embedded_systems_Onkar_Gulavani_01-05-2017
Resume_embedded_systems_Onkar_Gulavani_01-05-2017Resume_embedded_systems_Onkar_Gulavani_01-05-2017
Resume_embedded_systems_Onkar_Gulavani_01-05-2017
 
Smart Tracking Utilizing GPS and Google Maps API_Grad Paper
Smart Tracking Utilizing GPS and Google Maps API_Grad PaperSmart Tracking Utilizing GPS and Google Maps API_Grad Paper
Smart Tracking Utilizing GPS and Google Maps API_Grad Paper
 
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
 
Smart Face Recognition System Analysis
Smart Face Recognition System AnalysisSmart Face Recognition System Analysis
Smart Face Recognition System Analysis
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition
 
WIFI CONTROLLED SPY ROBOT CAR
WIFI CONTROLLED SPY ROBOT CARWIFI CONTROLLED SPY ROBOT CAR
WIFI CONTROLLED SPY ROBOT CAR
 
Resume
ResumeResume
Resume
 
IRJET- Book Reader using Raspberry Pi for Visually Impaired
IRJET- Book Reader using Raspberry Pi for Visually ImpairedIRJET- Book Reader using Raspberry Pi for Visually Impaired
IRJET- Book Reader using Raspberry Pi for Visually Impaired
 
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCRA SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translation
 
Portfolio
PortfolioPortfolio
Portfolio
 
ocr
ocrocr
ocr
 
Cloud report
Cloud reportCloud report
Cloud report
 
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNINGIMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
 

Recently uploaded

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Recently uploaded (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

mobile camera based text detection

  • 1. An Introduction demo on Mobile Camera Based Text Detection & Translation Under The Guidance Of: Prof. Shweta Patil. Presented By: Akash Y Shindhe Manjunath N Nayak Sachin B Biradar Vallabh G Potadar
  • 2. Contents  Introduction  History  Existing System  Proposed System  System Flow  Requirement  Block Diagram  Test & Results  Applications  Advantages & Limitation  Conclusion  Bibliography 2
  • 3. Introduction… Our project ‘Mobile camera based text detection and translation’ retrieves text from an images and converts it into text format, then it is translated to specified language. 3
  • 4. Existing System  In 1929, first OCR device was invented but it was mechanical device  In about 1965, earliest form of OCR was implemented in one of the first generation computers for Airline Ticket stock.  Revolutionary in 1971, it was implemented in postal services OCR systems where reading and printing of routing bar code was done on the postal code.  In 1974, the modifications was done which would allow blind people to have a computer read text to them out loud.  In late 90’s, Webcam was used for OCR process. 4
  • 5. Working…  Capture image  Detect edges  Detect corners  Match with stored image file  Retrieve text from image  Translate using Google API  Show Result 5
  • 6. Working Diagram Fig. a: Working diagram 6
  • 7. System flow  Algorithms:  Edge detection  Image feature filtering  Image binarization  Optical character recognition  Text correction  Text translation  Display of translation 7
  • 8. Requirement Mobile Hardware Requirements: • ARM 11 processor or higher • Memory 1 GB • 256 MB RAM • Mobile camera 3.2 mega pixel and above Software Requirements: • Operating System – Android Mob OS 2.2+ • Windows 7 OS • Mat lab OCR,ADT bundles Communication Requirements: • Internet Connection is required • Android Mobile OS inbuilt web browser 8
  • 9. Block Diagram Captured Image Text Feature Filtering Match Image File Library Retrieve Text Google APIs Translate Text Display Output Text Fig. b: Block diagram 9
  • 10. Example c.1 c.2 c.5 Fig c.1 Fig c.2 Fig c.3 Fig c.4 Fig c.5 c.3 c.4 Fig. c: Example 10
  • 11. Test & Results Image quality : As image quality degraded recognition rate will decrease Recognition rate of character ‘A’ , ‘B’ , ‘L’ will be higher than recognition rate of character ‘y’ , ‘u’ , ‘c’. Fig. d: Test & result 11
  • 12. Applications Tourist understanding native language.  Instant recognition of texts, street and e-mail addresses, links, and telephone numbers.  Unknown language guideline.  Easy to recognize road signs scripts. 12
  • 13. Advantages  Android Mobile OS based platform.  No tiresome manual data entry.  Versatility and ease of use.  No database is needed  For data entry 13
  • 14. Limitations  Image taken by Mobile camera should be of good quality.  Mobile should be of high specifications  For translation of extracted text , Internet connection is required. 14
  • 15. Conclusion This project which we want to implement is an Android Mobile OS based application which is web based real time mobile application for real-time text extraction, recognition and translation. 15
  • 16. Bibliography 1. Michael Hsueh “Interactive Text Recognition and Translation on a Mobile Device “ [Technical Report No. UCB/EECS-2011-57 ] 2. Yassin M.Y.Hasan and Lina J.Karam “Morphological Text Extraction from Images” IEEE Transaction on Image Processing Vol.9 No.11, Nov 2000 3. Nobuyuki Otsu, A threshold selection method from gray-level histograms. IEEE Trans.Sys.,Man., Cyber 9(1):62-66 4. Celine Mancas-Thillou, Bernard Gosselin, Color text extraction with selective metric based clustering. Computer Vision and Image Understanding 2007 5. B. Epshtein, Detecting Text in Natural Scenes with Stroke Width Transform. Image Rochester NY, pp. 1-8. 6. Derek Ma , Qiuhau Lin, Tong Zhang “Mobile Camera Based Text Detection and Translation” – research paper 7. WWW.wikipedia.org/optical_character_recognization 16