SlideShare a Scribd company logo
1 of 81
Download to read offline
Magdi Mohamed, 2017:06:02 1 / 81
Gradient Direction Transform
Magdi Mohamed
QCT, Multimedia and Standards R&D, Computer Vision Systems, 2017:06:02
Magdi Mohamed, 2017:06:02 2 / 81
Agenda
 Motivations
 Existing Solutions
 Gradient Direction Transform - Definition & Characteristics
 Iris Detection Application
 Handwriting Recognition Application
 Future Thoughts
 Summary & Conclusions
Magdi Mohamed, 2017:06:02 3 / 81
Optical Eye Tracking System Diagram – Motivation for Idea
Hough
Transform
Processing
And / Or
Other Methods
(Timm & Barth)
Frame
Normalization
Size
Light
Pose
Image
Preprocessing
Binarization
Noise Filtering
Smoothing
Image
Segmentation
Morphological
Operations,
Clustering,
&
Relaxation
Techniques
Feature
Extraction
Edge
Operations:
Detection
Linking
Thinning
Analysis &
Classification
Search
&
Image
Interpretation
Tasks
Post-
processing
Spatial
Relationships,
Sanity Checks
&
Accept / Reject
Iris detection is essential for several applications including gaze tracking
Magdi Mohamed, 2017:06:02 4 / 81
Overview of Techniques
Practical solutions are usually engineering combination of multiple techniques
Magdi Mohamed, 2017:06:02 5 / 81
Hough Transform for Iris Detection – Existing Solution
β
α
(α1, β1)
Hough Space for Circles
(α− x1 ) 2 + ( β − y1 ) 2 = r2
x
y Image Space
( x - α1 ) 2 + ( y – β1 ) 2 = r 2
(x1,y1)
Iris Detection Requirements
Deals with variations in light and reflections
Deals with partially occluded, missing, and noisy features
No special markers or makeup required
Features realtime processing
Quantifies action codes
Hough method can detect multiple curves and is resilient to noisy inputs
Magdi Mohamed, 2017:06:02 6 / 81
Other Methods: Timm & Barth – Existing Solution
{timm,barth}@inb.uni-luebeck.de
Proposing a faster and novel extension of this method
Magdi Mohamed, 2017:06:02 7 / 81
Gradient Direction Transform: New Idea
after initializing the transform matrix to zeros, for each gradient vector in the
input image region of interest, increment / decrement the value of the cells in
the transform matrix inline with the gradient vector accordingly
Magdi Mohamed, 2017:06:02 8 / 81
Gradient Direction Transform: Solution for Iris Detection
Pre-Processing
Blur-Convolution
Vertical-Edge Horizontal-Edge
Gradient Direction Transform
Weighted Gradient Direction
Thinning
Find Eye
Otsu Dynamic Threshold
Edge-Convolution
i-image
n-image
b-image e-image
o-imagev-image h-image
g-image
w-image h-space
iris-location
Hough Transform for Elliptical Shapes
t-image
Magdi Mohamed, 2017:06:02 9 / 81
Bresenham’s Algorithm – Implementation Details
Magdi Mohamed, 2017:06:02 10 / 81
Bresenham’s Algorithm – Implementation Details
Magdi Mohamed, 2017:06:02 11 / 81
Bresenham’s Algorithm – Implementation Details
Magdi Mohamed, 2017:06:02 12 / 81
Gradient Direction Transform: Implementation Details
Magdi Mohamed, 2017:06:02 13 / 81
References
• Hough:
Method and means for recognizing complex patterns
https://docs.google.com/viewer?url=patentimages.storage.googleapis.com/pdfs/US3069654.pdf
• Ballard:
Generalizing Hough transform to recognize arbitrarily shapes
http://www.cs.utexas.edu/~dana/HoughT.pdf
• Mohamed & Nasir:
Method and system for parallel processing of Hough transform computations
https://docs.google.com/viewer?url=patentimages.storage.googleapis.com/pdfs/US7406212.pdf
• Timm & Barth:
Accurate eye center localization by means of gradients
http://cjee.lakeheadu.ca/public/journals/22/TiBa11b.pdf
• Bresenham:
A rasterizing algorithm for drawing curves
http://members.chello.at/easyfilter/bresenham.pdf
Magdi Mohamed, 2017:06:02 14 / 81
Major Characteristics – Complexities & Capabilities
 Face detection takes ~ 30 ms (24 x 24)
 Eye corner detection takes ~ 12 ms (256 x 256)
 Iris detection time complexity for region of interest (C x R)
 T&B = K1 (C * R)
2
(K1 ~ cost for normalized floating point vector dot product)
 GDT = K2 (C * R) * C (K2 ~ cost for integer addition and bit-wise operations, C>R)
 T&B ignores sign of vector in squaring dot products to avoid square root
computations
 GDT is capable of efficient
 Consideration of sign of vectors at no extra cost (inward/outward directions)
 Extension of gradient normal-vector (by choice) to other directions such as gradient
tangent-vector that may suit describing other (binary/gray/color) image analysis tasks
Magdi Mohamed, 2017:06:02 15 / 81
Timm & Barth Versus Gradient Direction Transform
Method
Size (C x R)
Conventional
T&B
T1 (Seconds)
Novel
GDT
T2 (Seconds)
Speedup
Ratio
T1 / T2
040 x 030 0001.291 0000.053 024.538
080 x 060 0022.630 0000.345 065.672
160 x 120 0394.155 0002.689 146.580
240 x 180 2063.608 0013.426 153.705
320 x 240 6603.897 0036.665 180.116
Measured Speedup Ratio = T1 / T2
Magdi Mohamed, 2017:06:02 16 / 81
Sample Image 320x240
Magdi Mohamed, 2017:06:02 17 / 81
Input Image 320x240
Magdi Mohamed, 2017:06:02 18 / 81
Gradient Image (Gx)
Magdi Mohamed, 2017:06:02 19 / 81
Gradient Image (Gy)
Magdi Mohamed, 2017:06:02 20 / 81
Gradient Direction Transform 320x240 (In Grey)
Magdi Mohamed, 2017:06:02 21 / 81
Gradient Direction Transform 320x240
Magdi Mohamed, 2017:06:02 22 / 81
Gradient Direction Transform 320x240
Magdi Mohamed, 2017:06:02 23 / 81
Gradient Direction Transform 320x240
Magdi Mohamed, 2017:06:02 24 / 81
Timm & Barth Transform 320x240
Magdi Mohamed, 2017:06:02 25 / 81
Timm & Barth Transform 320x240
Magdi Mohamed, 2017:06:02 26 / 81
Timm & Barth Transform 320x240
Magdi Mohamed, 2017:06:02 27 / 81
Iris Detection – Major Challenges
 Eye boarder localization
 Occlusion
 Eyelashes
 Glasses
 Makeup
 Shadows
 Resolution
 Non-circularity
 Wrinkles
 Redness
 Water
 Abrupt motion
 Blinks
 Differences between left and right eyes
 Pose, color variations, and light reflections
 Age sensitive
 Manual annotation and evaluation metrics
 Strong/Weak eye concerns for gaze
Magdi Mohamed, 2017:06:02 28 / 81
Iris Detection - Notes On Used Data
 The data is not constructed particularly for evaluating iris detection algorithms
 It contains illegitimate images, and less accurate annotation for iris centers
 It contains some faces with sun glasses
 It contains many cases with corneal reflections from strong light sources
 It significantly represents almost frontal faces
 It contains mostly colored images
 It represents different categories (age groups, makeup, styles, …)
 Since it is the only annotated data available to us, we used it to construct our
facial landmark detector, and to evaluate the iris detector
 Ideally, a balanced data set with “casual” iris positions, not mostly looking at the
camera, with more accurate manual annotation of iris centers is needed
Magdi Mohamed, 2017:06:02 29 / 81
Iris Detection – Experiment # 1 Setup
 Using 15800 (mostly frontal view) face images with 40 landmarks per face
manually annotated as Ground Truth,
 Using Omron Face Detector,
 Using Omron Eye Corner Locator,
 Conducting a Blind Test to Compare GDT Versus T&B Solution,
 Compute normalized Cumulative Error Distribution (CED) for each estimator
Magdi Mohamed, 2017:06:02 30 / 81
Iris Detection – Experiment # 1 Performance Summary
Magdi Mohamed, 2017:06:02 31 / 81
Iris Detection – Experiment # 2 Setup
 Using 15800 (mostly frontal view) face images with 40 landmarks per face
manually annotated as Ground Truth,
 Using Omron Face Detector,
 Using Omron Eye Corner Locator,
 Conducting a Blind Test to Compare GDT Versus Omron Solution,
 Compute normalized Cumulative Error Distribution (CED) for each estimator
Magdi Mohamed, 2017:06:02 32 / 81
Iris Detection – Experiment # 2 Performance Summary
Magdi Mohamed, 2017:06:02 33 / 81
Iris Detection – Experiment # 3 Setup
 Using 15800 (mostly frontal view) face images with 40 landmarks per face
manually annotated as Ground Truth,
 Using Omron Face Detector,
 Using Supervised Newton Facial Landmark Detection method with Qualcomm
feature descriptor (HSG) and training algorithm,
 Conducting 5 fold cross validation for evaluation,
 Compute normalized Cumulative Error Distribution (CDF) for each estimator
Magdi Mohamed, 2017:06:02 34 / 81
Iris Detection – Experiment # 3 Performance Summary
Magdi Mohamed, 2017:06:02 35 / 81
Magdi Mohamed, 2017:06:02 36 / 81
Magdi Mohamed, 2017:06:02 37 / 81
Magdi Mohamed, 2017:06:02 38 / 81
Magdi Mohamed, 2017:06:02 39 / 81
Magdi Mohamed, 2017:06:02 40 / 81
Magdi Mohamed, 2017:06:02 41 / 81
Magdi Mohamed, 2017:06:02 42 / 81
Magdi Mohamed, 2017:06:02 43 / 81
Magdi Mohamed, 2017:06:02 44 / 81
Magdi Mohamed, 2017:06:02 45 / 81
Magdi Mohamed, 2017:06:02 46 / 81
Magdi Mohamed, 2017:06:02 47 / 81
Magdi Mohamed, 2017:06:02 48 / 81
Magdi Mohamed, 2017:06:02 49 / 81
Magdi Mohamed, 2017:06:02 50 / 81
Magdi Mohamed, 2017:06:02 51 / 81
Magdi Mohamed, 2017:06:02 52 / 81
Magdi Mohamed, 2017:06:02 53 / 81
Magdi Mohamed, 2017:06:02 54 / 81
Magdi Mohamed, 2017:06:02 55 / 81
Magdi Mohamed, 2017:06:02 56 / 81
Handwriting Recognition Using BAR Features
A Character Image and the Feature Image for the Foreground Horizontal Direction Pseudo-Code for Computing the Bar Transform on the Foreground
BAR transform is an existing technique proven to provide superior performance
Magdi Mohamed, 2017:06:02 57 / 81
Handwriting Recognition Using BAR Features
An Upper Case “B” and the Foreground and Background Bar Transform Feature Images Corresponding to the East-West Directions
BAR Feature Vector Uses Both Foreground & Background BAR Transforms
Magdi Mohamed, 2017:06:02 58 / 81
GDT for Handwriting (Normal Direction)
Magdi Mohamed, 2017:06:02 59 / 81
GDT for Handwriting (Tangent Direction)
Magdi Mohamed, 2017:06:02 60 / 81
Handwriting Recognition Using GDT Features
Magdi Mohamed, 2017:06:02 61 / 81
Handwriting Recognition Using GDT Features
Magdi Mohamed, 2017:06:02 62 / 81
Magdi Mohamed, 2017:06:02 63 / 81
Magdi Mohamed, 2017:06:02 64 / 81
Testing
Magdi Mohamed, 2017:06:02 65 / 81
Testing
Magdi Mohamed, 2017:06:02 66 / 81
Testing
Magdi Mohamed, 2017:06:02 67 / 81
Testing
Magdi Mohamed, 2017:06:02 68 / 81
Testing
Magdi Mohamed, 2017:06:02 69 / 81
Testing
Magdi Mohamed, 2017:06:02 70 / 81
Testing
Magdi Mohamed, 2017:06:02 71 / 81
Testing
Magdi Mohamed, 2017:06:02 72 / 81
Testing
Magdi Mohamed, 2017:06:02 73 / 81
Testing
Magdi Mohamed, 2017:06:02 74 / 81
Testing
Magdi Mohamed, 2017:06:02 75 / 81
Testing
Magdi Mohamed, 2017:06:02 76 / 81
Testing
Magdi Mohamed, 2017:06:02 77 / 81
Testing
Magdi Mohamed, 2017:06:02 78 / 81
Testing
Magdi Mohamed, 2017:06:02 79 / 81
Testing
Magdi Mohamed, 2017:06:02 80 / 81
Testing
Magdi Mohamed, 2017:06:02 81 / 81
Testing

More Related Content

Recently uploaded

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsvanyagupta248
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxhublikarsn
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...ronahami
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxkalpana413121
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
Introduction to Geographic Information Systems
Introduction to Geographic Information SystemsIntroduction to Geographic Information Systems
Introduction to Geographic Information SystemsAnge Felix NSANZIYERA
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...ppkakm
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxpritamlangde
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdfKamal Acharya
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxNANDHAKUMARA10
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxMustafa Ahmed
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfsumitt6_25730773
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR
 

Recently uploaded (20)

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptx
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
Introduction to Geographic Information Systems
Introduction to Geographic Information SystemsIntroduction to Geographic Information Systems
Introduction to Geographic Information Systems
 
Signal Processing and Linear System Analysis
Signal Processing and Linear System AnalysisSignal Processing and Linear System Analysis
Signal Processing and Linear System Analysis
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdf
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdf
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 

Featured

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 

Featured (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Gradient Direction Transform

  • 1. Magdi Mohamed, 2017:06:02 1 / 81 Gradient Direction Transform Magdi Mohamed QCT, Multimedia and Standards R&D, Computer Vision Systems, 2017:06:02
  • 2. Magdi Mohamed, 2017:06:02 2 / 81 Agenda  Motivations  Existing Solutions  Gradient Direction Transform - Definition & Characteristics  Iris Detection Application  Handwriting Recognition Application  Future Thoughts  Summary & Conclusions
  • 3. Magdi Mohamed, 2017:06:02 3 / 81 Optical Eye Tracking System Diagram – Motivation for Idea Hough Transform Processing And / Or Other Methods (Timm & Barth) Frame Normalization Size Light Pose Image Preprocessing Binarization Noise Filtering Smoothing Image Segmentation Morphological Operations, Clustering, & Relaxation Techniques Feature Extraction Edge Operations: Detection Linking Thinning Analysis & Classification Search & Image Interpretation Tasks Post- processing Spatial Relationships, Sanity Checks & Accept / Reject Iris detection is essential for several applications including gaze tracking
  • 4. Magdi Mohamed, 2017:06:02 4 / 81 Overview of Techniques Practical solutions are usually engineering combination of multiple techniques
  • 5. Magdi Mohamed, 2017:06:02 5 / 81 Hough Transform for Iris Detection – Existing Solution β α (α1, β1) Hough Space for Circles (α− x1 ) 2 + ( β − y1 ) 2 = r2 x y Image Space ( x - α1 ) 2 + ( y – β1 ) 2 = r 2 (x1,y1) Iris Detection Requirements Deals with variations in light and reflections Deals with partially occluded, missing, and noisy features No special markers or makeup required Features realtime processing Quantifies action codes Hough method can detect multiple curves and is resilient to noisy inputs
  • 6. Magdi Mohamed, 2017:06:02 6 / 81 Other Methods: Timm & Barth – Existing Solution {timm,barth}@inb.uni-luebeck.de Proposing a faster and novel extension of this method
  • 7. Magdi Mohamed, 2017:06:02 7 / 81 Gradient Direction Transform: New Idea after initializing the transform matrix to zeros, for each gradient vector in the input image region of interest, increment / decrement the value of the cells in the transform matrix inline with the gradient vector accordingly
  • 8. Magdi Mohamed, 2017:06:02 8 / 81 Gradient Direction Transform: Solution for Iris Detection Pre-Processing Blur-Convolution Vertical-Edge Horizontal-Edge Gradient Direction Transform Weighted Gradient Direction Thinning Find Eye Otsu Dynamic Threshold Edge-Convolution i-image n-image b-image e-image o-imagev-image h-image g-image w-image h-space iris-location Hough Transform for Elliptical Shapes t-image
  • 9. Magdi Mohamed, 2017:06:02 9 / 81 Bresenham’s Algorithm – Implementation Details
  • 10. Magdi Mohamed, 2017:06:02 10 / 81 Bresenham’s Algorithm – Implementation Details
  • 11. Magdi Mohamed, 2017:06:02 11 / 81 Bresenham’s Algorithm – Implementation Details
  • 12. Magdi Mohamed, 2017:06:02 12 / 81 Gradient Direction Transform: Implementation Details
  • 13. Magdi Mohamed, 2017:06:02 13 / 81 References • Hough: Method and means for recognizing complex patterns https://docs.google.com/viewer?url=patentimages.storage.googleapis.com/pdfs/US3069654.pdf • Ballard: Generalizing Hough transform to recognize arbitrarily shapes http://www.cs.utexas.edu/~dana/HoughT.pdf • Mohamed & Nasir: Method and system for parallel processing of Hough transform computations https://docs.google.com/viewer?url=patentimages.storage.googleapis.com/pdfs/US7406212.pdf • Timm & Barth: Accurate eye center localization by means of gradients http://cjee.lakeheadu.ca/public/journals/22/TiBa11b.pdf • Bresenham: A rasterizing algorithm for drawing curves http://members.chello.at/easyfilter/bresenham.pdf
  • 14. Magdi Mohamed, 2017:06:02 14 / 81 Major Characteristics – Complexities & Capabilities  Face detection takes ~ 30 ms (24 x 24)  Eye corner detection takes ~ 12 ms (256 x 256)  Iris detection time complexity for region of interest (C x R)  T&B = K1 (C * R) 2 (K1 ~ cost for normalized floating point vector dot product)  GDT = K2 (C * R) * C (K2 ~ cost for integer addition and bit-wise operations, C>R)  T&B ignores sign of vector in squaring dot products to avoid square root computations  GDT is capable of efficient  Consideration of sign of vectors at no extra cost (inward/outward directions)  Extension of gradient normal-vector (by choice) to other directions such as gradient tangent-vector that may suit describing other (binary/gray/color) image analysis tasks
  • 15. Magdi Mohamed, 2017:06:02 15 / 81 Timm & Barth Versus Gradient Direction Transform Method Size (C x R) Conventional T&B T1 (Seconds) Novel GDT T2 (Seconds) Speedup Ratio T1 / T2 040 x 030 0001.291 0000.053 024.538 080 x 060 0022.630 0000.345 065.672 160 x 120 0394.155 0002.689 146.580 240 x 180 2063.608 0013.426 153.705 320 x 240 6603.897 0036.665 180.116 Measured Speedup Ratio = T1 / T2
  • 16. Magdi Mohamed, 2017:06:02 16 / 81 Sample Image 320x240
  • 17. Magdi Mohamed, 2017:06:02 17 / 81 Input Image 320x240
  • 18. Magdi Mohamed, 2017:06:02 18 / 81 Gradient Image (Gx)
  • 19. Magdi Mohamed, 2017:06:02 19 / 81 Gradient Image (Gy)
  • 20. Magdi Mohamed, 2017:06:02 20 / 81 Gradient Direction Transform 320x240 (In Grey)
  • 21. Magdi Mohamed, 2017:06:02 21 / 81 Gradient Direction Transform 320x240
  • 22. Magdi Mohamed, 2017:06:02 22 / 81 Gradient Direction Transform 320x240
  • 23. Magdi Mohamed, 2017:06:02 23 / 81 Gradient Direction Transform 320x240
  • 24. Magdi Mohamed, 2017:06:02 24 / 81 Timm & Barth Transform 320x240
  • 25. Magdi Mohamed, 2017:06:02 25 / 81 Timm & Barth Transform 320x240
  • 26. Magdi Mohamed, 2017:06:02 26 / 81 Timm & Barth Transform 320x240
  • 27. Magdi Mohamed, 2017:06:02 27 / 81 Iris Detection – Major Challenges  Eye boarder localization  Occlusion  Eyelashes  Glasses  Makeup  Shadows  Resolution  Non-circularity  Wrinkles  Redness  Water  Abrupt motion  Blinks  Differences between left and right eyes  Pose, color variations, and light reflections  Age sensitive  Manual annotation and evaluation metrics  Strong/Weak eye concerns for gaze
  • 28. Magdi Mohamed, 2017:06:02 28 / 81 Iris Detection - Notes On Used Data  The data is not constructed particularly for evaluating iris detection algorithms  It contains illegitimate images, and less accurate annotation for iris centers  It contains some faces with sun glasses  It contains many cases with corneal reflections from strong light sources  It significantly represents almost frontal faces  It contains mostly colored images  It represents different categories (age groups, makeup, styles, …)  Since it is the only annotated data available to us, we used it to construct our facial landmark detector, and to evaluate the iris detector  Ideally, a balanced data set with “casual” iris positions, not mostly looking at the camera, with more accurate manual annotation of iris centers is needed
  • 29. Magdi Mohamed, 2017:06:02 29 / 81 Iris Detection – Experiment # 1 Setup  Using 15800 (mostly frontal view) face images with 40 landmarks per face manually annotated as Ground Truth,  Using Omron Face Detector,  Using Omron Eye Corner Locator,  Conducting a Blind Test to Compare GDT Versus T&B Solution,  Compute normalized Cumulative Error Distribution (CED) for each estimator
  • 30. Magdi Mohamed, 2017:06:02 30 / 81 Iris Detection – Experiment # 1 Performance Summary
  • 31. Magdi Mohamed, 2017:06:02 31 / 81 Iris Detection – Experiment # 2 Setup  Using 15800 (mostly frontal view) face images with 40 landmarks per face manually annotated as Ground Truth,  Using Omron Face Detector,  Using Omron Eye Corner Locator,  Conducting a Blind Test to Compare GDT Versus Omron Solution,  Compute normalized Cumulative Error Distribution (CED) for each estimator
  • 32. Magdi Mohamed, 2017:06:02 32 / 81 Iris Detection – Experiment # 2 Performance Summary
  • 33. Magdi Mohamed, 2017:06:02 33 / 81 Iris Detection – Experiment # 3 Setup  Using 15800 (mostly frontal view) face images with 40 landmarks per face manually annotated as Ground Truth,  Using Omron Face Detector,  Using Supervised Newton Facial Landmark Detection method with Qualcomm feature descriptor (HSG) and training algorithm,  Conducting 5 fold cross validation for evaluation,  Compute normalized Cumulative Error Distribution (CDF) for each estimator
  • 34. Magdi Mohamed, 2017:06:02 34 / 81 Iris Detection – Experiment # 3 Performance Summary
  • 56. Magdi Mohamed, 2017:06:02 56 / 81 Handwriting Recognition Using BAR Features A Character Image and the Feature Image for the Foreground Horizontal Direction Pseudo-Code for Computing the Bar Transform on the Foreground BAR transform is an existing technique proven to provide superior performance
  • 57. Magdi Mohamed, 2017:06:02 57 / 81 Handwriting Recognition Using BAR Features An Upper Case “B” and the Foreground and Background Bar Transform Feature Images Corresponding to the East-West Directions BAR Feature Vector Uses Both Foreground & Background BAR Transforms
  • 58. Magdi Mohamed, 2017:06:02 58 / 81 GDT for Handwriting (Normal Direction)
  • 59. Magdi Mohamed, 2017:06:02 59 / 81 GDT for Handwriting (Tangent Direction)
  • 60. Magdi Mohamed, 2017:06:02 60 / 81 Handwriting Recognition Using GDT Features
  • 61. Magdi Mohamed, 2017:06:02 61 / 81 Handwriting Recognition Using GDT Features
  • 64. Magdi Mohamed, 2017:06:02 64 / 81 Testing
  • 65. Magdi Mohamed, 2017:06:02 65 / 81 Testing
  • 66. Magdi Mohamed, 2017:06:02 66 / 81 Testing
  • 67. Magdi Mohamed, 2017:06:02 67 / 81 Testing
  • 68. Magdi Mohamed, 2017:06:02 68 / 81 Testing
  • 69. Magdi Mohamed, 2017:06:02 69 / 81 Testing
  • 70. Magdi Mohamed, 2017:06:02 70 / 81 Testing
  • 71. Magdi Mohamed, 2017:06:02 71 / 81 Testing
  • 72. Magdi Mohamed, 2017:06:02 72 / 81 Testing
  • 73. Magdi Mohamed, 2017:06:02 73 / 81 Testing
  • 74. Magdi Mohamed, 2017:06:02 74 / 81 Testing
  • 75. Magdi Mohamed, 2017:06:02 75 / 81 Testing
  • 76. Magdi Mohamed, 2017:06:02 76 / 81 Testing
  • 77. Magdi Mohamed, 2017:06:02 77 / 81 Testing
  • 78. Magdi Mohamed, 2017:06:02 78 / 81 Testing
  • 79. Magdi Mohamed, 2017:06:02 79 / 81 Testing
  • 80. Magdi Mohamed, 2017:06:02 80 / 81 Testing
  • 81. Magdi Mohamed, 2017:06:02 81 / 81 Testing