SlideShare a Scribd company logo
1 of 28
PORTABLE CAMERA-BASED
ASSISTIVE TEXT AND PRODUCT
LABEL READING FROM HAND-
HELD OBJECTS FOR BLIND
PERSONS
By: Sathmica k
Abstract
Camera based assistive text reading framework help blind
persons to read text labels and product packing from hand-
held objects in their daily lives. To isolate the objects in the
camera view, an efficient and effective motion-based method
has been proposed to define a region of interest(ROI). In the
extracted ROI, text localization and recognition are conducted
to acquire text information. The recognized text codes are
output to blind users in speech.
What is Assistive Technology?
“Any product, instrument,
equipment or technical system
used by a disabled or
elderly person, made specially or
existing on the market,
aimed to prevent, compensate,
relieve or neutralise the deficiency,
the inability or the handicap.”
Introduction
• Of the 314 million visually impaired people worldwide,
45 million are blind.
• Developments in computer vision, digital cameras, and
portable computers make it feasible to assist these
individuals.
• By developing camera-based products which combine
computer vision technology along with OCR systems.
• Already few portable system exist like portable bar code
reader, pen scanner, k mobile reader.
K mobile reader
Pen scanner Bar code reader
K mobile reader
Drawbacks
• Cannot handle screen image with complex
background.
• Hard to find position of barcode.
• Object must be placed on a clear dark surface and
must contain text.
• Cannot handle screen image with complex
background.
• Hard to find the position of barcode.
• Objects must be placed on a clear dark surface and
must contain text.
Proposed method
• The camera-based label reader help blind persons
to read names of labels on the products.
• Camera acts as main vision in detecting the label
image of the product then image is processed
internally .
• Separates label from image , and finally identifies
the product and identified product name is
pronounced through voice.
• Received label image is then converted to text.
• Once the identified label name is converted to text
and converted text is displayed on display unit
connected to controller.
• Now converted text should be converted to voice to
hear label name as voice through ear phones
connected to audio.
3 FUNCTIONAL COMPONENTS
SCENE CAPTURE
DATA PROCESSOR
AUDIO OUTPUT
• The scene capture component collects scenes
containing objects of interest in the form of images
or video.
• In this prototype, it corresponds to a camera
attached to a pair of sunglasses.
• The data processing component is used for
deploying proposed algorithms, they are
object-of-interest detection to selectively extract
the image of the object held by the blind user from
the cluttered background or other neutral objects in
the camera view.
 Text localization to obtain image regions containing
text, and text recognition to transform image-based
text information into readable codes.
• The audio output component is to inform the blind
user of recognized text codes.
• A Bluetooth earpiece with mini microphone is
employed for speech output.
Flowchart of the proposed framework to read text from hand-held objects for blind users.
Object of interest
• Frame sequence v is captured by a camera worn by
blind users.
• User’s object of interest S by shaking the object while
recording.
S= 1
|𝑣| 𝑖 𝑅(𝑣, 𝐵)
V is ith frame in the captured sequence
|v| is the number of frames
B is the estimated background from motion based object
detection
R is calculated foreground object at each frame
Text localization
• To extract text region
X𝑐=argmax 𝑠 ∈ 𝑠 L (s)
L is suitability responses of text layout
Xc is candidate text regions from object of interest S
Object region detection
• To ensure that the hand-held object appears in the
camera view, a camera with a reasonably wide
angle is proposed(since the blind user may not aim
accurately).
• Users are asked to shake the hand-held objects
containing the text they wish identify.
• Employ a motion-based method to localize the
objects from cluttered background.
• Background subtraction (BGS) approach is used to
detect moving objects for video surveillance
systems with stationary cameras.
• This method is done based on the frame variations.
• Since background imagery is nearly constant in all
frames, a Gaussian method is applied.
• Gaussian mixture model method is robust to slow
lighting changes.
• Texture information is employed to remove false
positive foreground area.
• Texture similarity is measured.
• Its subsequent frame pixel distribution is more likely
to be the background model.
• To detect moving objects in a dynamic scene,
many adaptive BGS techniques have been
developed.
Localizing the image region of the hand-held object of interest. (a)Capturing images by a camera
mounted on a pair of sunglasses;(b)an example of a captured image;(c)detected moving areas in
the image while the user shaking the object;(d)detected region of the hand-held object for
further processing of text recognition.
Automatic text extraction
• Text extraction can be done by two features,
Stroke orientation.
Edge distribution.
A sample of text strokes showing relationship between stroke orientations and
gradient orientation at pixels of stroke boundaries. Blue arrows denote the stroke
orientations at the sections and red arrow denotes the gradient orientations at
stroke boundaries.
Text stroke orientation
• Stroke orientation describes the local structure of
text characters.
• Stroke orientation will be perpendicular to the
gradient orientation.
A text patch and its 16-bin histogram of quantized
stroke orientations.
Distributed of edge pixels
• Text characters appear in the form of stroke
boundaries.
• Describes the density of text region.
• Used to distinguish between text region from
background regions.
• Edge detection is performed to obtain an edge map.
• Number of edges in pixels in each row Y and column
X is calculated as NR(Y) and Nc(X).
Each pixel is labelled with product value of number of
edge pixels in its located rows and columns
respectively.
Then a 3X3 smooth operator Wn is applied to obtain
the edge distribution feature map.
D(X,Y)=∑Wn.NR(Yn).NC(Xn)
(Xn,Yn) is neighbouring pixel of (X,Y)
Wn is 1/9(weight value)
Text recognition and audio output
• Text recognition is performed by off-the-shelf OCR
prior to output of informative words from the
localized text regions.
• A text region labels the minimum rectangular area
for the accommodation of characters inside it.
• So the border of the text region contacts the edge
boundary of the text character.
• OCR generates better performance if text regions
are first assigned proper margin areas and
binarized to segment text characters from
background.
• Thus, each localized text region is enlarged by
enhancing the height and width by pixels,
respectively.
Conclusion
• To read printed text on hand-held objects
for assisting blind person.
• In order to solve the common aiming
problem for blind users.
• This method can effectively distinguish the
object of interest from background or
other objects in the camera view.
• To extract text regions from complex
backgrounds, proposed a text localization
algorithm based on models of stroke
orientation and edge distributions.
• OCR is used to perform word recognition
on the localized text regions and
transform into audio output for blind
users.
References
• Base paper by Chucai Yi, student member,IEEE,
YingLi Tian, Senior member, IEEE, Aries Arditi.
• T.Phan, P.Shivakumara and C.L.Tan, “A Laplacian
Method for Video /text Detection,”.
• C.Stauffer and W.E.L. Grimson, “Adaptive
Background mixture Model for real-time tracking”,.
• Vision Pattern Recognit., Fort Collins, CO, USA,
2013.
THANK YOU..

More Related Content

What's hot

Mind reading computers
Mind  reading computersMind  reading computers
Mind reading computersMaryam Fatima
 
Mind reading computer report
Mind reading computer reportMind reading computer report
Mind reading computer reportIshan Khan
 
Blue Brain Seminar Report
Blue Brain Seminar ReportBlue Brain Seminar Report
Blue Brain Seminar ReportVarun A M
 
Location based reminder
Location based reminderLocation based reminder
Location based reminderjunnubabu
 
Eye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer InteractionEye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer InteractionSharath Raj
 
eye phone technology
eye phone technologyeye phone technology
eye phone technologyNaga Dinesh
 
Mind-reading computer seminar ppt
Mind-reading computer seminar pptMind-reading computer seminar ppt
Mind-reading computer seminar pptsanjeev kumar suman
 
PPT on mind reading computer
 PPT on mind reading computer PPT on mind reading computer
PPT on mind reading computerAnjali Agarwal
 
Brain Fingerprinting
Brain FingerprintingBrain Fingerprinting
Brain Fingerprintingveera
 
Sixth sense-final-ppt
Sixth sense-final-pptSixth sense-final-ppt
Sixth sense-final-pptThedarkangel1
 
Mind Reading Computers Report
Mind Reading Computers ReportMind Reading Computers Report
Mind Reading Computers ReportAman Raj
 
eyegaze communication system
eyegaze communication system eyegaze communication system
eyegaze communication system Manasa Reddy
 
IRJET- Heart Attack Detection by Heartbeat Sensing using Internet of thin...
IRJET-  	  Heart Attack Detection by Heartbeat Sensing using Internet of thin...IRJET-  	  Heart Attack Detection by Heartbeat Sensing using Internet of thin...
IRJET- Heart Attack Detection by Heartbeat Sensing using Internet of thin...IRJET Journal
 
concealed weapon detection using digital image processing
concealed weapon detection using digital image processingconcealed weapon detection using digital image processing
concealed weapon detection using digital image processingKongara Sudharshan
 
Arduino based health monitoring system
Arduino based health monitoring systemArduino based health monitoring system
Arduino based health monitoring systemYousuf Shaikh
 
The sixth sense technology seminar
The sixth sense technology seminarThe sixth sense technology seminar
The sixth sense technology seminarRam
 

What's hot (20)

Mind reading computers
Mind  reading computersMind  reading computers
Mind reading computers
 
Mind reading computer report
Mind reading computer reportMind reading computer report
Mind reading computer report
 
Blue Brain Seminar Report
Blue Brain Seminar ReportBlue Brain Seminar Report
Blue Brain Seminar Report
 
Location based reminder
Location based reminderLocation based reminder
Location based reminder
 
Eye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer InteractionEye Tracking Based Human - Computer Interaction
Eye Tracking Based Human - Computer Interaction
 
eye phone technology
eye phone technologyeye phone technology
eye phone technology
 
Mind-reading computer seminar ppt
Mind-reading computer seminar pptMind-reading computer seminar ppt
Mind-reading computer seminar ppt
 
PPT on mind reading computer
 PPT on mind reading computer PPT on mind reading computer
PPT on mind reading computer
 
Eye ring
Eye ringEye ring
Eye ring
 
Dip fingerprint
Dip fingerprintDip fingerprint
Dip fingerprint
 
Brain Fingerprinting
Brain FingerprintingBrain Fingerprinting
Brain Fingerprinting
 
Mind Reading Computer
Mind Reading ComputerMind Reading Computer
Mind Reading Computer
 
Sixth sense-final-ppt
Sixth sense-final-pptSixth sense-final-ppt
Sixth sense-final-ppt
 
Mind Reading Computers Report
Mind Reading Computers ReportMind Reading Computers Report
Mind Reading Computers Report
 
eyegaze communication system
eyegaze communication system eyegaze communication system
eyegaze communication system
 
IRJET- Heart Attack Detection by Heartbeat Sensing using Internet of thin...
IRJET-  	  Heart Attack Detection by Heartbeat Sensing using Internet of thin...IRJET-  	  Heart Attack Detection by Heartbeat Sensing using Internet of thin...
IRJET- Heart Attack Detection by Heartbeat Sensing using Internet of thin...
 
concealed weapon detection using digital image processing
concealed weapon detection using digital image processingconcealed weapon detection using digital image processing
concealed weapon detection using digital image processing
 
Arduino based health monitoring system
Arduino based health monitoring systemArduino based health monitoring system
Arduino based health monitoring system
 
Digital smell technology
Digital smell technologyDigital smell technology
Digital smell technology
 
The sixth sense technology seminar
The sixth sense technology seminarThe sixth sense technology seminar
The sixth sense technology seminar
 

Similar to PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- HELD OBJECTS FOR BLIND PERSONS

A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...IRJET Journal
 
An approach for text detection and reading of product label for blind persons
An approach for text detection and reading of product label for blind personsAn approach for text detection and reading of product label for blind persons
An approach for text detection and reading of product label for blind personsVivek Chamorshikar
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learningpratik pratyay
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overviewLEE HOSEONG
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong LeeMoazzem Hossain
 
Various object detection and tracking methods
Various object detection and tracking methodsVarious object detection and tracking methods
Various object detection and tracking methodssujeeshkumarj
 
Text extraction from natural scene image, a survey
Text extraction from natural scene image, a surveyText extraction from natural scene image, a survey
Text extraction from natural scene image, a surveySOYEON KIM
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverviewMotaz El-Saban
 
Pattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptxPattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptxEngRSMY2
 
3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image IIIYu Huang
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformIOSR Journals
 
Rendering Algorithms.pptx
Rendering Algorithms.pptxRendering Algorithms.pptx
Rendering Algorithms.pptxSherinRappai
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildPrerana Mukherjee
 
Neuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the Microscope
Neuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the MicroscopeNeuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the Microscope
Neuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the MicroscopeMBF Bioscience
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosCSCJournals
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosCSCJournals
 
introduction to Reverse Engineering.pptx
introduction to Reverse Engineering.pptxintroduction to Reverse Engineering.pptx
introduction to Reverse Engineering.pptxGRajesh21
 

Similar to PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- HELD OBJECTS FOR BLIND PERSONS (20)

New Technology
New TechnologyNew Technology
New Technology
 
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
 
An approach for text detection and reading of product label for blind persons
An approach for text detection and reading of product label for blind personsAn approach for text detection and reading of product label for blind persons
An approach for text detection and reading of product label for blind persons
 
slide-171212080528.pptx
slide-171212080528.pptxslide-171212080528.pptx
slide-171212080528.pptx
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 
Various object detection and tracking methods
Various object detection and tracking methodsVarious object detection and tracking methods
Various object detection and tracking methods
 
Text extraction from natural scene image, a survey
Text extraction from natural scene image, a surveyText extraction from natural scene image, a survey
Text extraction from natural scene image, a survey
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverview
 
Pattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptxPattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptx
 
3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III
 
C04741319
C04741319C04741319
C04741319
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
 
Rendering Algorithms.pptx
Rendering Algorithms.pptxRendering Algorithms.pptx
Rendering Algorithms.pptx
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wild
 
Neuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the Microscope
Neuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the MicroscopeNeuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the Microscope
Neuron Analysis Workshop: Neuron Tracing from Tissue Specimens at the Microscope
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
introduction to Reverse Engineering.pptx
introduction to Reverse Engineering.pptxintroduction to Reverse Engineering.pptx
introduction to Reverse Engineering.pptx
 

Recently uploaded

History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 

Recently uploaded (20)

History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 

PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- HELD OBJECTS FOR BLIND PERSONS

  • 1. PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- HELD OBJECTS FOR BLIND PERSONS By: Sathmica k
  • 2. Abstract Camera based assistive text reading framework help blind persons to read text labels and product packing from hand- held objects in their daily lives. To isolate the objects in the camera view, an efficient and effective motion-based method has been proposed to define a region of interest(ROI). In the extracted ROI, text localization and recognition are conducted to acquire text information. The recognized text codes are output to blind users in speech.
  • 3. What is Assistive Technology? “Any product, instrument, equipment or technical system used by a disabled or elderly person, made specially or existing on the market, aimed to prevent, compensate, relieve or neutralise the deficiency, the inability or the handicap.”
  • 4. Introduction • Of the 314 million visually impaired people worldwide, 45 million are blind. • Developments in computer vision, digital cameras, and portable computers make it feasible to assist these individuals. • By developing camera-based products which combine computer vision technology along with OCR systems. • Already few portable system exist like portable bar code reader, pen scanner, k mobile reader.
  • 5. K mobile reader Pen scanner Bar code reader K mobile reader
  • 6. Drawbacks • Cannot handle screen image with complex background. • Hard to find position of barcode. • Object must be placed on a clear dark surface and must contain text. • Cannot handle screen image with complex background. • Hard to find the position of barcode. • Objects must be placed on a clear dark surface and must contain text.
  • 7. Proposed method • The camera-based label reader help blind persons to read names of labels on the products. • Camera acts as main vision in detecting the label image of the product then image is processed internally . • Separates label from image , and finally identifies the product and identified product name is pronounced through voice.
  • 8. • Received label image is then converted to text. • Once the identified label name is converted to text and converted text is displayed on display unit connected to controller. • Now converted text should be converted to voice to hear label name as voice through ear phones connected to audio.
  • 9. 3 FUNCTIONAL COMPONENTS SCENE CAPTURE DATA PROCESSOR AUDIO OUTPUT
  • 10. • The scene capture component collects scenes containing objects of interest in the form of images or video. • In this prototype, it corresponds to a camera attached to a pair of sunglasses. • The data processing component is used for deploying proposed algorithms, they are object-of-interest detection to selectively extract the image of the object held by the blind user from the cluttered background or other neutral objects in the camera view.
  • 11.  Text localization to obtain image regions containing text, and text recognition to transform image-based text information into readable codes. • The audio output component is to inform the blind user of recognized text codes. • A Bluetooth earpiece with mini microphone is employed for speech output.
  • 12. Flowchart of the proposed framework to read text from hand-held objects for blind users.
  • 13. Object of interest • Frame sequence v is captured by a camera worn by blind users. • User’s object of interest S by shaking the object while recording. S= 1 |𝑣| 𝑖 𝑅(𝑣, 𝐵) V is ith frame in the captured sequence |v| is the number of frames B is the estimated background from motion based object detection R is calculated foreground object at each frame
  • 14. Text localization • To extract text region X𝑐=argmax 𝑠 ∈ 𝑠 L (s) L is suitability responses of text layout Xc is candidate text regions from object of interest S
  • 15. Object region detection • To ensure that the hand-held object appears in the camera view, a camera with a reasonably wide angle is proposed(since the blind user may not aim accurately). • Users are asked to shake the hand-held objects containing the text they wish identify. • Employ a motion-based method to localize the objects from cluttered background.
  • 16. • Background subtraction (BGS) approach is used to detect moving objects for video surveillance systems with stationary cameras. • This method is done based on the frame variations. • Since background imagery is nearly constant in all frames, a Gaussian method is applied. • Gaussian mixture model method is robust to slow lighting changes. • Texture information is employed to remove false positive foreground area.
  • 17. • Texture similarity is measured. • Its subsequent frame pixel distribution is more likely to be the background model. • To detect moving objects in a dynamic scene, many adaptive BGS techniques have been developed.
  • 18. Localizing the image region of the hand-held object of interest. (a)Capturing images by a camera mounted on a pair of sunglasses;(b)an example of a captured image;(c)detected moving areas in the image while the user shaking the object;(d)detected region of the hand-held object for further processing of text recognition.
  • 19. Automatic text extraction • Text extraction can be done by two features, Stroke orientation. Edge distribution. A sample of text strokes showing relationship between stroke orientations and gradient orientation at pixels of stroke boundaries. Blue arrows denote the stroke orientations at the sections and red arrow denotes the gradient orientations at stroke boundaries.
  • 20. Text stroke orientation • Stroke orientation describes the local structure of text characters. • Stroke orientation will be perpendicular to the gradient orientation. A text patch and its 16-bin histogram of quantized stroke orientations.
  • 21. Distributed of edge pixels • Text characters appear in the form of stroke boundaries. • Describes the density of text region. • Used to distinguish between text region from background regions. • Edge detection is performed to obtain an edge map. • Number of edges in pixels in each row Y and column X is calculated as NR(Y) and Nc(X).
  • 22. Each pixel is labelled with product value of number of edge pixels in its located rows and columns respectively. Then a 3X3 smooth operator Wn is applied to obtain the edge distribution feature map. D(X,Y)=∑Wn.NR(Yn).NC(Xn) (Xn,Yn) is neighbouring pixel of (X,Y) Wn is 1/9(weight value)
  • 23. Text recognition and audio output • Text recognition is performed by off-the-shelf OCR prior to output of informative words from the localized text regions. • A text region labels the minimum rectangular area for the accommodation of characters inside it. • So the border of the text region contacts the edge boundary of the text character.
  • 24. • OCR generates better performance if text regions are first assigned proper margin areas and binarized to segment text characters from background. • Thus, each localized text region is enlarged by enhancing the height and width by pixels, respectively.
  • 25. Conclusion • To read printed text on hand-held objects for assisting blind person. • In order to solve the common aiming problem for blind users. • This method can effectively distinguish the object of interest from background or other objects in the camera view.
  • 26. • To extract text regions from complex backgrounds, proposed a text localization algorithm based on models of stroke orientation and edge distributions. • OCR is used to perform word recognition on the localized text regions and transform into audio output for blind users.
  • 27. References • Base paper by Chucai Yi, student member,IEEE, YingLi Tian, Senior member, IEEE, Aries Arditi. • T.Phan, P.Shivakumara and C.L.Tan, “A Laplacian Method for Video /text Detection,”. • C.Stauffer and W.E.L. Grimson, “Adaptive Background mixture Model for real-time tracking”,. • Vision Pattern Recognit., Fort Collins, CO, USA, 2013.

Editor's Notes

  1. By: Sathmica k
  2. K mobile reader
  3. Cannot handle screen image with complex background. Hard to find the position of barcode. Objects must be placed on a clear dark surface and must contain text.
  4. SCREEN CAPTURE DATA PROCESSOR AUDIO OUTPUT
  5. Flowchart of the proposed framework to read text from hand-held objects for blind users.
  6. th
  7. c
  8. Localizing the image region of the hand-held object of interest. (a)Capturing images by a camera mounted on a pair of sunglasses;(b)an example of a captured image;(c)detected moving areas in the image while the user shaking the object;(d)detected region of the hand-held object for further processing of text recognition.
  9. A text patch and its 16-bin histogram of quantized stroke orientations.
  10. Thank you.