SlideShare a Scribd company logo
1 of 38
Download to read offline
1
AI uses cases
in
Computer Vision
Presented By:
SAMeh Zaghloul – |ABCD> - IBM
linkedin.com/in/sameh-zaghloul-00b5151
2
Computer Vision: How Machines See?
3
Computer Vision: How Machines See?
4
Computer Vision
Example
Egyptian Hieroglyph Alphabet
5
Deep Learning in Hieroglyph Recognition
6
Hieroglyph Alphabet Recognition – Early Stages
“Ibn Wahshiyya”
985 CE “Arabic”
translation of the
Ancient Egyptian
Hieroglyph alphabet
7
Hieroglyph Alphabet
8
Hieroglyph Alphabet https://www.unicode.org/charts/PDF/U13000.pdf
9
Hieroglyph Alphabet Recognition – Raw Data
10
Hieroglyph Alphabet Recognition – Training Data
11
Hieroglyph Alphabet - Sample Tagged/Labeled Datasets
12
Hieroglyph Alphabet – Sample Tagged/Labeled Dataset
13
Deep Learning in Hieroglyph Recognition
14
Computer Vision: Overview
15
Computer vision history
Started with the emergence of AI In 1956.
Convolutional neural networks were proposed in the
well-known 1998 research paper by Yann LeCun and
Léon Bottou.
- 99.2% recognition accuracy on the MNIST data set.
© Copyright IBM Corporation 2019, 2021
16
Computer vision tasks
Object detection and recognition: Detect certain
patterns within the image.
Examples:
- Detecting red eyes when taking photos in certain
conditions.
- Face recognition.
© Copyright IBM Corporation 2019, 2021
17
Computer vision tasks (cont.)
Content-based image retrieval: Image retrieval from a
database based on user’s image query.
- By using image actual feature contents such as colors,
shapes, and textures
- Not using image metadata (keywords, tags, or
descriptions)
Optical character recognition (OCR): Converting hand-
written text to a digital format.
© Copyright IBM Corporation 2019, 2021
18
Computer vision tasks (cont.)
Object tracking: Following the position changes of a
target object from one frame to another in an image
sequence or video.
The following photo shows an example of human
tracking.
© Copyright IBM Corporation 2019, 2021
19
Computer vision tasks
Image restoration: Fixing and restoring images that are corrupted by noise, such as motion
blur, to their default state.
Scene reconstruction: Creation of a 3D model by supplying the system with multiple 2D
images from different views. The computer constructs a 3D model based on those images.
(Sinha, Steedly, Szeliskiet al. 2008) © 2008 ACM.
© Copyright IBM Corporation 2019, 2021
20
Computer Vision tools
OpenCV: CV open source library
- C++, Python, Java, and MATLAB interfaces
PyTorchCV is based on PyTorch framework.
• Used for various computer vision tasks.
• Includes a collection of pretrained models for image classification, segmentation, detection, and pose
estimation.
scikit-image is an open source library for image processing.
• Includes a set of algorithms for image processing.
• Implements algorithms and utilities that are used in research, education, and industry applications.
• Well-documented API in the Python programming language.
© Copyright IBM Corporation 2019, 2021
21
Computer Vision – Deep Learning – CNN (Convolutional Neural Network)
© Copyright IBM Corporation 2019, 2021
Convolution is a
Sliding-Window
over a
Matrix-Representation
of an Image
Loose Mimicking
of the
Overlapping Tiling
of the
Biological Visual Field
22
Computer Vision Annotation Tools
IBM Cloud Annotations
cloud.annotations.ai
- Makes labeling images and training machine learning models easy.
- Supports both photos and videos.
- Object detection: An object detection model provides the following information:
• Type of object and prediction confidence level
• Location: The coordinates and area of where the object is in the image.
• Count: The number of objects found in the image.
• Size: How large the object is with respect to the image dimensions.
- Object classification: An object classification model provides the following information
• Type of object and prediction confidence level
© Copyright IBM Corporation 2019, 2021
23
Computer Vision Annotation Tools
VoTT - Visual Object Tagging Tool
https://vott.z22.web.core.windows.net/#/
CVAT - Computer Vision Annotation Tool
https://cvat.org/
Google’s Teachable Machine
https://teachablemachine.withgoogle.com/
© Copyright IBM Corporation 2019, 2021
24
Computer Vision Trained Models Public Datasets
Model Zoo
https://models.roboflow.com/
Computer Vision Datasets
https://public.roboflow.com/
Image Datasets for Computer Vision Training
https://imerit.net/blog/22-free-image-datasets-for-computer-vision-all-pbm/
© Copyright IBM Corporation 2019, 2021
25
Computer Vision: Basics
26
Image representation
Images are stored as a 2D array of pixels on computers.
Each pixel has a certain value representing its intensity.
Example of grayscale representation:
• Image is black and white with shades of gray in between.
• Pixel intensity is a number between 0 (black) and 255 (white).
int[ ][ ] array = { {255, 170, 170, 0},
{220, 80, 80, 170},
{255, 80, 0, 0},
{175, 20, 170, 0} };
© Copyright IBM Corporation 2019, 2021
27
Image representation (cont.)
- Example of color representation:
• The pixel color is represented as a mix of Red, Green, and Blue.
• The pixel intensity becomes three 2D arrays or one 2D array, where each entry is an
object containing the 3 color values of RGB
© Copyright IBM Corporation 2019, 2021
28
Computer vision pipeline
The steps and functions that are included are highly
dependent on the application.
Here is a conventional visual pattern recognition pipeline.
Image
Acquisition
Pre-
Processing
Segmentation
Feature
Extraction &
Selection
Classification
© Copyright IBM Corporation 2019, 2021
29
Computer vision pipeline (cont.)
1. Image acquisition:
The process of acquiring images and saving them in a
digital image format for processing.
Images often use common formats, such as .jpeg, .png,
and .bmp.
Images are 2D images that are stored as arrays of pixels
according to their color model.
© Copyright IBM Corporation 2019, 2021
30
Computer vision pipeline (cont.)
2. Pre-processing:
Preparing the image for the processing stage
Examples:
- Resizing images
- Noise reduction
- Contrast adjustment
© Copyright IBM Corporation 2019, 2021
31
Computer vision pipeline (cont.)
3. Segmentation:
Partitioning an image into regions of similarity.
Grouping pixels and features with similar characteristics
together.
Helps with selecting regions of interest within the images.
These regions can contain objects of interest that we want to
capture.
Segmenting an image into foreground and background.
© Copyright IBM Corporation 2019, 2021
32
Computer vision pipeline (cont.)
4. Feature extraction and selection:
Find distinguishing information about the image.
Image features examples: distinct color in an image or a
specific shape such as a line, edge, corner, or an image
segment.
© Copyright IBM Corporation 2019, 2021
33
Computer vision pipeline (cont.)
5. Classification:
The extracted features are used to classify the image.
More processing might be done on the classified images
to identify more features from the image.
Example: After face detection, identify features on the
face, such as hair style, age, and gender.
© Copyright IBM Corporation 2019, 2021
34
Computer Vision: Hands-on
https://cloud.annotations.ai/
35
36
37
38

More Related Content

What's hot

Image Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learningImage Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learningPRATHAMESH REGE
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognitionAshiq Ullah
 
Computer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC AlgorithmComputer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC Algorithmallyn joy calcaben
 
Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)Herman Kurnadi
 
Object Detection & Tracking
Object Detection & TrackingObject Detection & Tracking
Object Detection & TrackingAkshay Gujarathi
 
(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imagingKyuhwan Jung
 
Prospects of Deep Learning in Medical Imaging
Prospects of Deep Learning in Medical ImagingProspects of Deep Learning in Medical Imaging
Prospects of Deep Learning in Medical ImagingGodswll Egegwu
 
General introduction to computer vision
General introduction to computer visionGeneral introduction to computer vision
General introduction to computer visionbutest
 
face recognition based on PCA
face recognition based on PCAface recognition based on PCA
face recognition based on PCA@zenafaris91
 
Elements of visual perception
Elements of visual perceptionElements of visual perception
Elements of visual perceptionDr INBAMALAR T M
 

What's hot (20)

Object detection
Object detectionObject detection
Object detection
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer vision Computer vision
Computer vision
 
Image Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learningImage Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learning
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
 
Computer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC AlgorithmComputer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC Algorithm
 
Medical image analysis
Medical image analysisMedical image analysis
Medical image analysis
 
Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)
 
Object Detection & Tracking
Object Detection & TrackingObject Detection & Tracking
Object Detection & Tracking
 
Computer vision
Computer visionComputer vision
Computer vision
 
(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging
 
Prospects of Deep Learning in Medical Imaging
Prospects of Deep Learning in Medical ImagingProspects of Deep Learning in Medical Imaging
Prospects of Deep Learning in Medical Imaging
 
General introduction to computer vision
General introduction to computer visionGeneral introduction to computer vision
General introduction to computer vision
 
face recognition based on PCA
face recognition based on PCAface recognition based on PCA
face recognition based on PCA
 
Elements of visual perception
Elements of visual perceptionElements of visual perception
Elements of visual perception
 
Final ppt
Final pptFinal ppt
Final ppt
 

Similar to IEEE EED2021 AI use cases in Computer Vision

Computer graphics notes
Computer graphics notesComputer graphics notes
Computer graphics notessmruti sarangi
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET Journal
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET Journal
 
Scaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownScaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownDatabricks
 
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningMakine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningAli Alkan
 
IRJET - Content based Image Classification
IRJET -  	  Content based Image ClassificationIRJET -  	  Content based Image Classification
IRJET - Content based Image ClassificationIRJET Journal
 
Challenges of Deep Learning in Computer Vision Webinar - Tessellate Imaging
Challenges of Deep Learning in Computer Vision Webinar - Tessellate ImagingChallenges of Deep Learning in Computer Vision Webinar - Tessellate Imaging
Challenges of Deep Learning in Computer Vision Webinar - Tessellate ImagingAdhesh Shrivastava
 
Scaling up deep learning by scaling down
Scaling up deep learning by scaling downScaling up deep learning by scaling down
Scaling up deep learning by scaling downNick Pentreath
 
Color based image processing , tracking and automation using matlab
Color based image processing , tracking and automation using matlabColor based image processing , tracking and automation using matlab
Color based image processing , tracking and automation using matlabKamal Pradhan
 
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...IRJET Journal
 
Lecture 1 computer vision introduction
Lecture 1 computer vision introductionLecture 1 computer vision introduction
Lecture 1 computer vision introductioncairo university
 
IMAGE SEGMENTATION AND ITS TECHNIQUES
IMAGE SEGMENTATION AND ITS TECHNIQUESIMAGE SEGMENTATION AND ITS TECHNIQUES
IMAGE SEGMENTATION AND ITS TECHNIQUESIRJET Journal
 
Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...IRJET Journal
 
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...Antoinette Williams
 
01 foundations
01 foundations01 foundations
01 foundationsankit_ppt
 
A Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question AnsweringA Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question AnsweringIRJET Journal
 

Similar to IEEE EED2021 AI use cases in Computer Vision (20)

OpenCV+Android.pptx
OpenCV+Android.pptxOpenCV+Android.pptx
OpenCV+Android.pptx
 
Computer graphics by bahadar sher
Computer graphics by bahadar sherComputer graphics by bahadar sher
Computer graphics by bahadar sher
 
Computer graphics notes
Computer graphics notesComputer graphics notes
Computer graphics notes
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for Blind
 
OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural Network
 
Scaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownScaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling Down
 
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningMakine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
 
IRJET - Content based Image Classification
IRJET -  	  Content based Image ClassificationIRJET -  	  Content based Image Classification
IRJET - Content based Image Classification
 
Challenges of Deep Learning in Computer Vision Webinar - Tessellate Imaging
Challenges of Deep Learning in Computer Vision Webinar - Tessellate ImagingChallenges of Deep Learning in Computer Vision Webinar - Tessellate Imaging
Challenges of Deep Learning in Computer Vision Webinar - Tessellate Imaging
 
Scaling up deep learning by scaling down
Scaling up deep learning by scaling downScaling up deep learning by scaling down
Scaling up deep learning by scaling down
 
Color based image processing , tracking and automation using matlab
Color based image processing , tracking and automation using matlabColor based image processing , tracking and automation using matlab
Color based image processing , tracking and automation using matlab
 
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...
 
Lecture 1 computer vision introduction
Lecture 1 computer vision introductionLecture 1 computer vision introduction
Lecture 1 computer vision introduction
 
Cg
CgCg
Cg
 
IMAGE SEGMENTATION AND ITS TECHNIQUES
IMAGE SEGMENTATION AND ITS TECHNIQUESIMAGE SEGMENTATION AND ITS TECHNIQUES
IMAGE SEGMENTATION AND ITS TECHNIQUES
 
Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...
 
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
 
01 foundations
01 foundations01 foundations
01 foundations
 
A Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question AnsweringA Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question Answering
 

Recently uploaded

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Recently uploaded (20)

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

IEEE EED2021 AI use cases in Computer Vision

  • 1. 1 AI uses cases in Computer Vision Presented By: SAMeh Zaghloul – |ABCD> - IBM linkedin.com/in/sameh-zaghloul-00b5151
  • 2. 2 Computer Vision: How Machines See?
  • 3. 3 Computer Vision: How Machines See?
  • 5. 5 Deep Learning in Hieroglyph Recognition
  • 6. 6 Hieroglyph Alphabet Recognition – Early Stages “Ibn Wahshiyya” 985 CE “Arabic” translation of the Ancient Egyptian Hieroglyph alphabet
  • 11. 11 Hieroglyph Alphabet - Sample Tagged/Labeled Datasets
  • 12. 12 Hieroglyph Alphabet – Sample Tagged/Labeled Dataset
  • 13. 13 Deep Learning in Hieroglyph Recognition
  • 15. 15 Computer vision history Started with the emergence of AI In 1956. Convolutional neural networks were proposed in the well-known 1998 research paper by Yann LeCun and Léon Bottou. - 99.2% recognition accuracy on the MNIST data set. © Copyright IBM Corporation 2019, 2021
  • 16. 16 Computer vision tasks Object detection and recognition: Detect certain patterns within the image. Examples: - Detecting red eyes when taking photos in certain conditions. - Face recognition. © Copyright IBM Corporation 2019, 2021
  • 17. 17 Computer vision tasks (cont.) Content-based image retrieval: Image retrieval from a database based on user’s image query. - By using image actual feature contents such as colors, shapes, and textures - Not using image metadata (keywords, tags, or descriptions) Optical character recognition (OCR): Converting hand- written text to a digital format. © Copyright IBM Corporation 2019, 2021
  • 18. 18 Computer vision tasks (cont.) Object tracking: Following the position changes of a target object from one frame to another in an image sequence or video. The following photo shows an example of human tracking. © Copyright IBM Corporation 2019, 2021
  • 19. 19 Computer vision tasks Image restoration: Fixing and restoring images that are corrupted by noise, such as motion blur, to their default state. Scene reconstruction: Creation of a 3D model by supplying the system with multiple 2D images from different views. The computer constructs a 3D model based on those images. (Sinha, Steedly, Szeliskiet al. 2008) © 2008 ACM. © Copyright IBM Corporation 2019, 2021
  • 20. 20 Computer Vision tools OpenCV: CV open source library - C++, Python, Java, and MATLAB interfaces PyTorchCV is based on PyTorch framework. • Used for various computer vision tasks. • Includes a collection of pretrained models for image classification, segmentation, detection, and pose estimation. scikit-image is an open source library for image processing. • Includes a set of algorithms for image processing. • Implements algorithms and utilities that are used in research, education, and industry applications. • Well-documented API in the Python programming language. © Copyright IBM Corporation 2019, 2021
  • 21. 21 Computer Vision – Deep Learning – CNN (Convolutional Neural Network) © Copyright IBM Corporation 2019, 2021 Convolution is a Sliding-Window over a Matrix-Representation of an Image Loose Mimicking of the Overlapping Tiling of the Biological Visual Field
  • 22. 22 Computer Vision Annotation Tools IBM Cloud Annotations cloud.annotations.ai - Makes labeling images and training machine learning models easy. - Supports both photos and videos. - Object detection: An object detection model provides the following information: • Type of object and prediction confidence level • Location: The coordinates and area of where the object is in the image. • Count: The number of objects found in the image. • Size: How large the object is with respect to the image dimensions. - Object classification: An object classification model provides the following information • Type of object and prediction confidence level © Copyright IBM Corporation 2019, 2021
  • 23. 23 Computer Vision Annotation Tools VoTT - Visual Object Tagging Tool https://vott.z22.web.core.windows.net/#/ CVAT - Computer Vision Annotation Tool https://cvat.org/ Google’s Teachable Machine https://teachablemachine.withgoogle.com/ © Copyright IBM Corporation 2019, 2021
  • 24. 24 Computer Vision Trained Models Public Datasets Model Zoo https://models.roboflow.com/ Computer Vision Datasets https://public.roboflow.com/ Image Datasets for Computer Vision Training https://imerit.net/blog/22-free-image-datasets-for-computer-vision-all-pbm/ © Copyright IBM Corporation 2019, 2021
  • 26. 26 Image representation Images are stored as a 2D array of pixels on computers. Each pixel has a certain value representing its intensity. Example of grayscale representation: • Image is black and white with shades of gray in between. • Pixel intensity is a number between 0 (black) and 255 (white). int[ ][ ] array = { {255, 170, 170, 0}, {220, 80, 80, 170}, {255, 80, 0, 0}, {175, 20, 170, 0} }; © Copyright IBM Corporation 2019, 2021
  • 27. 27 Image representation (cont.) - Example of color representation: • The pixel color is represented as a mix of Red, Green, and Blue. • The pixel intensity becomes three 2D arrays or one 2D array, where each entry is an object containing the 3 color values of RGB © Copyright IBM Corporation 2019, 2021
  • 28. 28 Computer vision pipeline The steps and functions that are included are highly dependent on the application. Here is a conventional visual pattern recognition pipeline. Image Acquisition Pre- Processing Segmentation Feature Extraction & Selection Classification © Copyright IBM Corporation 2019, 2021
  • 29. 29 Computer vision pipeline (cont.) 1. Image acquisition: The process of acquiring images and saving them in a digital image format for processing. Images often use common formats, such as .jpeg, .png, and .bmp. Images are 2D images that are stored as arrays of pixels according to their color model. © Copyright IBM Corporation 2019, 2021
  • 30. 30 Computer vision pipeline (cont.) 2. Pre-processing: Preparing the image for the processing stage Examples: - Resizing images - Noise reduction - Contrast adjustment © Copyright IBM Corporation 2019, 2021
  • 31. 31 Computer vision pipeline (cont.) 3. Segmentation: Partitioning an image into regions of similarity. Grouping pixels and features with similar characteristics together. Helps with selecting regions of interest within the images. These regions can contain objects of interest that we want to capture. Segmenting an image into foreground and background. © Copyright IBM Corporation 2019, 2021
  • 32. 32 Computer vision pipeline (cont.) 4. Feature extraction and selection: Find distinguishing information about the image. Image features examples: distinct color in an image or a specific shape such as a line, edge, corner, or an image segment. © Copyright IBM Corporation 2019, 2021
  • 33. 33 Computer vision pipeline (cont.) 5. Classification: The extracted features are used to classify the image. More processing might be done on the classified images to identify more features from the image. Example: After face detection, identify features on the face, such as hair style, age, and gender. © Copyright IBM Corporation 2019, 2021
  • 35. 35
  • 36. 36
  • 37. 37
  • 38. 38