DEEP LEARNING
Computer Vision
Annisa Darmawahyuni Machine Learning Study Jams, 2024
DEEP LEARNING BASIC
“ARTIFICIAL NEURAL NETWORKS”
ARTIFICIAL NEURAL NETWORKS
MACHINE VS DEEP LEARNING
“Deep learning allows computational models of multiple
processing layers to learn and represent data with multiple
levels of abstraction mimicking how the brain perceives and
understands multimodal information, thus implicitly capturing intricate
structures of large-scale data”
(a) Face Detection
(b) Object Instance Segmentation
(c) Structure from motion (3D)
(d) Stereo Matching (3D)
COMPUTER
VISION
Computer vision is a field of artificial intelligence (AI) that enables
computers and systems to derive meaningful information from digital
images, videos and other visual inputs — and take actions or make
recommendations based on that information. If AI enables computers to
think, computer vision enables them to see, observe and understand.
COMPUTER
VISION
Timeline of topic research in computer vision
COMPUTER VISION: Haar-like wavelet features and the integral image method.
MACHINE LEARNING: K-means, Naive Bayes classifier, Decision Tree, Boosting, Random Forest, Haar Classifier, Expectation–Maximization (EM), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM).
DEEP LEARNING: Convolutional Neural Networks (CNNs), Restricted Boltzmann Machines (RBMs), Autoencoders, Sparse Coding.
CNN FOR
COMPUTER VISION
OBJECT DETECTION
Object detection is the process of detecting instances of semantic objects of a certain class (such as humans,
airplanes, or birds) in digital images and video.
Figure panels: ground truth; bounding box with region approach; bounding box with region and semantic segmentation approach.
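A common way to compare a predicted bounding box against a ground-truth box is intersection over union (IoU). The slides do not name this metric, so the sketch below is only an illustration; the `(x_min, y_min, x_max, y_max)` box format and the example coordinates are assumptions.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x_min, y_min, x_max, y_max)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlapping rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical boxes, chosen only for illustration.
ground_truth = (10, 10, 50, 50)
predicted = (30, 30, 70, 70)
print(f"IoU = {iou(ground_truth, predicted):.3f}")
```

An IoU of 1.0 means the two boxes coincide, and 0.0 means they do not overlap at all.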
OBJECT DETECTION
You can choose from two key approaches to get started with object detection using deep learning:
Create and train a custom object detector.
To train a custom object detector from scratch, you need to design a network architecture to learn
the features for the objects of interest. You also need to compile a very large set of labeled data to
train the CNN. The results of a custom object detector can be remarkable. That said, you need to
manually set up the layers and weights in the CNN, which requires a lot of time and training data.
Use a pretrained object detector.
Many object detection workflows using deep learning leverage transfer learning, an approach that
enables you to start with a pretrained network and then fine-tune it for your application. This
method can provide faster results because the object detectors have already been trained on
thousands, or even millions, of images.
OBJECT DETECTION
SEGMENTATION
SEMANTIC SEGMENTATION
Semantic segmentation is a deep learning technique that assigns a label or category to every pixel in an
image. It is used to recognize collections of pixels that form distinct categories.
A simple example of semantic segmentation is separating the images into two classes. For example, in Figure 1, an image showing a person
at the beach is paired with a version showing the image's pixels segmented into two separate classes: person and background.
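As a minimal sketch of the two-class case, assume a network has already produced a score for each class at every pixel; the label map is then the highest-scoring class per pixel. The tiny 3 × 4 score grid and the class names below are made up for illustration and do not come from the slides.

```python
CLASSES = ["background", "person"]


def segment(scores):
    """Return a label map: for every pixel, the index of the highest-scoring class."""
    return [
        [max(range(len(pixel)), key=lambda c: pixel[c]) for pixel in row]
        for row in scores
    ]


# Hypothetical per-pixel scores: each pixel holds [background_score, person_score].
scores = [
    [[0.9, 0.1], [0.8, 0.2], [0.3, 0.7], [0.9, 0.1]],
    [[0.7, 0.3], [0.2, 0.8], [0.1, 0.9], [0.6, 0.4]],
    [[0.9, 0.1], [0.4, 0.6], [0.2, 0.8], [0.8, 0.2]],
]

label_map = segment(scores)
for row in label_map:
    # Print the first letter of each class name: "b" for background, "p" for person.
    print(" ".join(CLASSES[c][0] for c in row))
```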
HOW DOES SEMANTIC SEGMENTATION
DIFFER FROM OBJECT DETECTION?
Semantic segmentation can be a useful alternative to object detection because it allows the object of interest to span
multiple areas in the image at the pixel level. This technique cleanly detects objects that are irregularly shaped, in
contrast to object detection, where objects must fit within a bounding box (Figure 2).
Figure 2. Object detection, showing bounding boxes to identify objects.
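To make the contrast concrete, the sketch below takes a binary pixel mask of an irregular object, computes the tightest bounding box around it, and reports how much of that box is actually object. The L-shaped mask and the helper names are hypothetical, chosen only for illustration.

```python
def bounding_box(mask):
    """Tightest (row_min, col_min, row_max, col_max) box around the 1-pixels, inclusive."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for c in range(len(mask[0])) if any(row[c] for row in mask)]
    if not rows:
        return None  # the mask contains no object pixels
    return rows[0], cols[0], rows[-1], cols[-1]


def box_fill_ratio(mask):
    """Fraction of the bounding box that is covered by object pixels."""
    box = bounding_box(mask)
    if box is None:
        return 0.0
    r0, c0, r1, c1 = box
    box_area = (r1 - r0 + 1) * (c1 - c0 + 1)
    object_pixels = sum(sum(row) for row in mask)
    return object_pixels / box_area


# Hypothetical L-shaped object: 1 marks an object pixel, 0 marks background.
mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]

print("bounding box:", bounding_box(mask))
print(f"fraction of box that is object: {box_fill_ratio(mask):.2f}")
```

The lower this fraction, the more background a bounding box includes, which is where a pixel-level mask describes the object more precisely.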
SEMANTIC
SEGMENTATION
The process of training a semantic segmentation network to
classify images follows these steps:
Analyze a collection of pixel-labeled images.
Create a semantic segmentation network.
Train the network to classify images into pixel categories.
Assess the accuracy of the network.
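For the last step, two commonly used measures are pixel accuracy and per-class intersection over union (IoU). The slides do not say which metric to use, so this is only a sketch under that assumption; the small label maps are invented for illustration.

```python
def pixel_accuracy(pred, truth):
    """Fraction of pixels whose predicted label equals the ground-truth label."""
    total = 0
    correct = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            total += 1
            correct += (p == t)
    return correct / total


def class_iou(pred, truth, cls):
    """IoU for one class: pixels labeled cls in both maps / pixels labeled cls in either."""
    inter = 0
    union = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            inter += (p == cls and t == cls)
            union += (p == cls or t == cls)
    return inter / union if union else float("nan")


# Hypothetical 3x4 label maps with classes 0 (background) and 1 (object).
truth = [
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 1],
]
pred = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

print(f"pixel accuracy: {pixel_accuracy(pred, truth):.3f}")
for cls in (0, 1):
    print(f"IoU for class {cls}: {class_iou(pred, truth, cls):.3f}")
```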
SEMANTIC SEGMENTATION
Highway scene showing color image (left) and corresponding labeled pixels (right)
DATASET FOR COMPUTER VISION
Grayscale Images. The most widely used grayscale image dataset is MNIST
(https://www.kaggle.com/datasets/hojjatk/mnist-dataset) and its variations, that is, NIST and perturbed
NIST. The application scenario is the recognition of handwritten digits.
RGB Natural Images. Examples include the Caltech RGB image datasets
(https://euclid.caltech.edu/image/euclid20231107b-ngc-6822) and the CIFAR datasets
(https://www.cs.toronto.edu/~kriz/cifar.html). CIFAR-10 and CIFAR-100 each consist of 60,000 32 × 32
color images, in 10 and 100 classes respectively.
Hyperspectral Images. SCIEN hyperspectral image data and AVIRIS sensor based datasets, for example,
contain hyperspectral images.
Facial Characteristics Images. The Adience benchmark dataset is used for age and gender classification from face images.
Medical Images. The ChestX-ray14 dataset comprises 112,120 frontal-view X-ray images of 30,805 unique patients.
A separate, smaller chest X-ray pneumonia dataset is available on Kaggle
(https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia).
Video Streams. The WR datasets can be used for video-based activity recognition in assembly lines.
YouTube-8M is a dataset of 8 million YouTube video URLs, along with video-level labels from a diverse set
of 4800 Knowledge Graph entities.
PARAMETER VS HYPERPARAMETER
HYPERPARAMETER TUNING (DL)
Learning rate (LR). If the learning rate is too small, training converges slowly and overfitting can occur.
Larger learning rates help to regularize the training, but if the learning rate is too large, the training will diverge.
Number of hidden layers.
Number of nodes/neurons per layer.
Optimizer.
Batch size.
Epochs.
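The effect of the learning rate can be seen on a toy problem. The sketch below runs plain gradient descent on f(w) = w², a stand-in chosen for illustration rather than an actual network, with three hypothetical learning rates. It only shows slow convergence versus divergence; it does not demonstrate the overfitting or regularization effects mentioned above.

```python
def gradient_descent(lr, steps=20, w=1.0):
    """Minimize f(w) = w**2 with plain gradient descent and return the final w."""
    for _ in range(steps):
        grad = 2.0 * w      # derivative of w**2 with respect to w
        w = w - lr * grad   # standard gradient descent update
    return w


# Hypothetical learning rates: small, moderate, and too large.
for lr in (0.01, 0.4, 1.1):
    print(f"lr={lr}: final w = {gradient_descent(lr):.4g}")
```

Each update multiplies w by (1 − 2·lr), so a small rate shrinks w toward the minimum at 0 only slowly, a moderate rate shrinks it quickly, and any rate above 1.0 makes |w| grow at every step.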
Scientific Articles on Computer Vision and Deep Learning
Intelligent System Research Group
https://docs.google.com/spreadsheets/d/13MLJnecd32B3H-f342M-Uoqd_y5wRVgGDK1aT-bQg3w/edit#gid=0
annisadarmawahyuni@unsri.ac.id
riset.annisadarmawahyuni@gmail.com

Deep Learning For Computer Vision- Day 3 Study Jams GDSC Unsri.pptx

  • 1. DEEP LEARNING Computer Vision Annisa Darmawahyuni Machine Learning Study Jams, 2024
  • 7. MACHINE VS DEEP LEARNING
  • 10. “Deep learning allows computational models of multiple processing layers to learn and represent data with multiple levels of abstraction mimicking how the brain perceives and understands multimodal information, thus implicitly capturing intricate structures of large-scale data”
  • 11. COMPUTER VISION: (a) Face Detection, (b) Object Instance Segmentation, (c) Structure from Motion (3D), (d) Stereo Matching (3D)
  • 12. COMPUTER VISION. Computer vision is a field of artificial intelligence (AI) that enables computers and systems to derive meaningful information from digital images, videos and other visual inputs, and to take actions or make recommendations based on that information. If AI enables computers to think, computer vision enables them to see, observe and understand.
  • 13. Timeline of topic research in computer vision
  • 16. COMPUTER VISION. Machine learning: Haar-like wavelet features and the integral graph method; K-means, Naive Bayes classifier, Decision Tree, Boosting, Random Forest, Haar Classifier, Expectation–Maximization (EM), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM). Deep learning: Convolutional Neural Networks (CNNs), Restricted Boltzmann Machines (RBMs), Autoencoders, Sparse Coding.
  • 18. OBJECT DETECTION. Object detection is the process of detecting instances of semantic objects of a certain class (such as humans, airplanes, or birds) in digital images and video. Figure panels: ground truth; bounding box with region approach; bounding box with region and semantic segmentation approach.
  • 19. OBJECT DETECTION. You can choose from two key approaches to get started with object detection using deep learning. (1) Create and train a custom object detector. To train a custom object detector from scratch, you need to design a network architecture that learns the features of the objects of interest, and you need to compile a very large set of labeled data to train the CNN. The results of a custom object detector can be remarkable, but you must manually set up the layers and weights in the CNN, which requires a lot of time and training data. (2) Use a pretrained object detector. Many object detection workflows using deep learning leverage transfer learning, an approach that lets you start with a pretrained network and then fine-tune it for your application. This method can provide faster results because the object detectors have already been trained on thousands, or even millions, of images.
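Whichever approach is chosen, a predicted bounding box is usually judged against the ground-truth box from slide 18. A common measure for this (not named on the slides, so treat it as an added illustration) is intersection over union (IoU). The sketch below assumes boxes are given as `(x_min, y_min, x_max, y_max)` tuples; the function name `box_iou` and the sample coordinates are made up for the example.

```python
def box_iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes.

    Each box is (x_min, y_min, x_max, y_max); this layout is an assumption
    of the sketch, not something the slides specify.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical ground-truth and predicted boxes.
ground_truth = (10, 10, 50, 50)
prediction = (30, 30, 70, 70)
iou = box_iou(ground_truth, prediction)
print(f"IoU = {iou:.3f}")  # 400 / 2800, about 0.143
```

An IoU of 1.0 means the prediction matches the ground truth exactly, and 0.0 means the boxes do not overlap at all.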
  • 26. SEMANTIC SEGMENTATION. Semantic segmentation is a deep learning technique that associates a label or category with every pixel in an image. It is used to recognize collections of pixels that form distinct categories. A simple example is separating an image into two classes: in Figure 1, an image of a person at the beach is paired with a version in which the pixels are segmented into two classes, person and background.
  • 27. HOW DOES SEMANTIC SEGMENTATION DIFFER FROM OBJECT DETECTION? Semantic segmentation can be a useful alternative to object detection because it allows the object of interest to span multiple areas in the image at the pixel level. This technique cleanly detects objects that are irregularly shaped, in contrast to object detection, where objects must fit within a bounding box (Figure 2). Figure 2. Object detection, showing bounding boxes to identify objects.
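The difference between a pixel mask and a bounding box can be made concrete with a toy example. The sketch below uses a hand-written 6 × 6 binary mask of an irregular (triangular) object, derives the tightest box around it, and counts how many background pixels that box is forced to include. The mask values and the helper name `mask_to_box` are invented for illustration.

```python
# Toy binary mask: 1 = object pixel, 0 = background (values are made up).
mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]


def mask_to_box(mask):
    """Tightest box (x_min, y_min, x_max, y_max), inclusive, around the object."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for c in range(len(mask[0])) if any(row[c] for row in mask)]
    if not rows:
        raise ValueError("mask contains no object pixels")
    return (min(cols), min(rows), max(cols), max(rows))


box = mask_to_box(mask)
x0, y0, x1, y1 = box
box_pixels = (x1 - x0 + 1) * (y1 - y0 + 1)
object_pixels = sum(sum(row) for row in mask)
background_in_box = box_pixels - object_pixels

print(box)                # (1, 1, 4, 4)
print(box_pixels)         # 16
print(object_pixels)      # 10
print(background_in_box)  # 6
```

Here 6 of the 16 pixels inside the box are background, which the mask excludes but the box cannot.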
  • 29. SEMANTIC SEGMENTATION. The process of training a semantic segmentation network to classify images follows these steps: (1) analyze a collection of pixel-labeled images; (2) create a semantic segmentation network; (3) train the network to classify images into pixel categories; (4) assess the accuracy of the network.
  • 30. SEMANTIC SEGMENTATION. Highway scene showing color image (left) and corresponding labeled pixels (right).
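For the last training step, assessing accuracy, the slides do not say which metric to use. Two common choices are pixel accuracy and mean intersection over union (mIoU) across classes. The sketch below computes both on small hand-written label maps; the maps, the class ids (0 = background, 1 = object), and the function names are assumptions made for the example.

```python
def pixel_accuracy(pred, target):
    """Fraction of pixels whose predicted class equals the labeled class."""
    total = correct = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            total += 1
            correct += (p == t)
    return correct / total


def mean_iou(pred, target, num_classes):
    """Average per-class IoU, skipping classes absent from both maps."""
    ious = []
    for c in range(num_classes):
        inter = union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                inter += (p == c and t == c)
                union += (p == c or t == c)
        if union:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical pixel labels and network output for a 3 x 4 image.
target = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 1],
]

acc = pixel_accuracy(pred, target)
miou = mean_iou(pred, target, num_classes=2)
print(f"pixel accuracy = {acc:.3f}")  # 10 of 12 pixels, about 0.833
print(f"mean IoU       = {miou:.3f}")  # (6/8 + 4/6) / 2, about 0.708
```

Mean IoU is lower than pixel accuracy here because the two mislabeled pixels weigh more heavily on the small object class than on the overall pixel count.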
  • 31. DATASET FOR COMPUTER VISION. Grayscale Images. The most used grayscale image dataset is MNIST (https://www.kaggle.com/datasets/hojjatk/mnist-dataset) and its variations, that is, NIST and perturbed NIST. The application scenario is the recognition of handwritten digits. RGB Natural Images. Caltech RGB image datasets (https://euclid.caltech.edu/image/euclid20231107b-ngc-6822) and the CIFAR datasets (https://www.cs.toronto.edu/~kriz/cifar.html); CIFAR-10 and CIFAR-100 each consist of 60,000 32 × 32 color images, in 10 and 100 classes respectively. Hyperspectral Images. SCIEN hyperspectral image data and AVIRIS sensor based datasets, for example, contain hyperspectral images. Facial Characteristics Images. Adience benchmark dataset. Medical Images. The NIH ChestX-ray14 dataset comprises 112,120 frontal-view X-ray images of 30,805 unique patients; a smaller chest X-ray pneumonia dataset is available on Kaggle (https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia). Video Streams. The WR datasets can be used for video-based activity recognition in assembly lines. YouTube-8M is a dataset of 8 million YouTube video URLs, along with video-level labels from a diverse set of 4800 Knowledge Graph entities.
  • 33. HYPERPARAMETER TUNING (DL). Learning rate (LR): if the learning rate is too small, overfitting can occur; large learning rates help to regularize the training, but if the learning rate is too large, the training will diverge. Other hyperparameters: number of hidden layers; number of nodes/neurons per layer; optimizer; batch size; epochs. Scientific articles on computer vision and deep learning (Intelligent System Research Group): https://docs.google.com/spreadsheets/d/13MLJnecd32B3H-f342M-Uoqd_y5wRVgGDK1aT-bQg3w/edit#gid=0
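The divergence claim about large learning rates can be seen on a problem much simpler than a neural network. The sketch below runs plain gradient descent on the one-parameter loss f(w) = w², whose gradient is 2w. The loss, the starting point, the step count, and the three learning rates are all chosen for illustration. It shows slow progress with a small rate and divergence with a large one; it does not demonstrate the overfitting or regularization effects mentioned on the slide, which need a real model and dataset.

```python
def run_gradient_descent(lr, steps=50, w0=5.0):
    """Minimize f(w) = w**2 with fixed-step gradient descent; return final w."""
    w = w0
    for _ in range(steps):
        grad = 2.0 * w      # derivative of w**2
        w = w - lr * grad   # standard gradient descent update
    return w


w_small = run_gradient_descent(lr=0.01)  # shrinks by 2% per step: slow
w_good = run_gradient_descent(lr=0.4)    # shrinks by 80% per step: fast
w_large = run_gradient_descent(lr=1.1)   # overshoots and grows each step

print(f"lr=0.01 -> |w| = {abs(w_small):.4g}")
print(f"lr=0.4  -> |w| = {abs(w_good):.4g}")
print(f"lr=1.1  -> |w| = {abs(w_large):.4g}")
```

For this loss each update multiplies w by (1 - 2·lr), so any learning rate above 1.0 makes |w| grow instead of shrink. Real networks have no such closed-form threshold, which is why the learning rate is tuned empirically.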