SlideShare a Scribd company logo
1 of 31
Download to read offline
Deep Fine-Grained
Image Classification
Overview and state-of-the-art
Carmine Paolino & Hagop Boghazdeklian, OLX
2
Carmine Paolino
MSc in Artificial Intelligence at Vrije Universiteit Amsterdam
Data Scientist at OLX (Search and Recommendations)
carmine.paolino@olx.com
@paolino
http://paolino.me
3
Hagop Boghazdeklian
MSc in DS and Econometrics from Aix-Marseille University
Data Scientist at OLX (Fraud, Trust and Safety)
hagop.boghazdeklian@olx.com
Offline OLX
5
Deep
Fine-Grained
Image
Classification
● Introduction
● Overview of Approaches
● Standard Datasets
● State-of-the-art
Introduction
7
Are these cars the same model/year?
Introduction
8
Are these cars the same model/year?
Introduction
9
Are these cars the same model/year?
Introduction
YES: BMW Series 6 Convertible 2012
High intra-class variance
10
Are these cars the same model/year?
Introduction
Low inter-class variance
NO: BMW 2 series 2016 BMW 2 series 2017 BMW 2 series 2018
11
Discriminating features
Introduction
BMW Series 6 Convertible 2012
High intra-class variance
12
Discriminating features
Introduction
Low inter-class variance
BMW 2 series 2016 BMW 2 series 2017 BMW 2 series 2018
Overview of Approaches
14
Deep Neural Networks
Overview of approaches
❖ Basic and pragmatic approach
❖ Two-steps classification
➢ Object detection (bounding-boxes)
➢ Classification of cropped images
❖ Increase of robustness in recent CNN architectures
❖ Transfer Learning, Data Augmentation and Fine-Tuning
15
Deep Neural Networks
Overview of approaches. Valev, Schuman and Sommer
A Systematic Evaluation of Recent Deep Learning Architectures for Fine-Grained Vehicle Classification (2018)
16
Part-detection and alignment
Overview of approaches
Part-detection Templates Pose alignment Classification
y
17
Example of pose-normalized part-CNNs
Figure 3: Overview of the part-CNN pipeline on pose normalized parts
part-CNN
Overview of approaches. Source: Branson, Van Horn, Belogie, Peronize.
Bird species categorization using prose normalized deep convolutional nets (2014)
18
Ensemble of networks
Overview of approaches. Lin, RoyChowdhury, Maji. Bilinear CNN models for fine-grained classification (2015)
19
Attention mechanism
Overview of approaches. Zheng Fu, Luo. Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition (2017)
Standard Datasets
21
Caltech-UCSD Birds-200-2011 (CUB-200-2011)
Standard Datasets
200
Categories
15 Part Locations
312 Binary
Attributes
1 Bounding Box
Per image
11,788
Images
22
Stanford Cars
Standard Datasets
196
Categories
1 Bounding Box
3 Labels: Make,
Model, Year
Per image
16,185
Images
23
Fine-Grained Visual Classification of Aircraft
(FGVC-Aircraft)
Standard Datasets
102
Categories
1 Bounding Box
4 Labels:
Manufacturer,
Family, Variant,
Model
Per image
10,200
Images
State-of-the-art
25
Classical approaches
State-of-the-art: H. Zheng, J. Fu, T. Mei, and J. Luo, “Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition,” Proc. IEEE
Int. Conf. Comput. Vis., vol. 2017–October, pp. 5219–5227, 2017.
Part-detection Templates Pose alignment Classification
y
26
Multi-attention
State-of-the-art: H. Zheng, J. Fu, T. Mei, and J. Luo, “Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition,” Proc. IEEE
Int. Conf. Comput. Vis., vol. 2017–October, pp. 5219–5227, 2017.
Highest activations
VGG-19
Grouping
Candidate part attentions
27
Final classification
State-of-the-art: H. Zheng, J. Fu, T. Mei, and J. Luo, “Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition,” Proc. IEEE
Int. Conf. Comput. Vis., vol. 2017–October, pp. 5219–5227, 2017.
28
Part-CNN for final classification
Figure 3: Overview of the part-CNN pipeline on pose normalized parts
State-of-the-art. Source: Branson, Van Horn, Belogie, Peronize.
Bird species categorization using prose normalized deep convolutional nets (2014)
Region zooming
29
Joint learning
State-of-the-art: H. Zheng, J. Fu, T. Mei, and J. Luo, “Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition,” Proc. IEEE
Int. Conf. Comput. Vis., vol. 2017–October, pp. 5219–5227, 2017.
Thank you!
H. Zheng, J. Fu, T. Mei, and J. Luo, “Learning
Multi-attention Convolutional Neural Network for
Fine-Grained Image Recognition,” Proc. IEEE Int. Conf.
Comput. Vis., vol. 2017–October, pp. 5219–5227, 2017.
B. Zhao, J. Feng, X. Wu, and S. Yan, “A survey on deep
learning-based fine-grained object classification and
semantic segmentation,” Int. J. Autom. Comput., vol.
14, no. 2, pp. 119–135, 2017.
31

More Related Content

Similar to Deep Fine-Grained Image Classification [Keynote at IFIP WCC 2018]

LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...Pieter Pauwels
 
Accelerating the Design of Optical Networks using Surrogate Models
Accelerating the Design of Optical Networks using Surrogate ModelsAccelerating the Design of Optical Networks using Surrogate Models
Accelerating the Design of Optical Networks using Surrogate ModelsCPqD
 
Helmet Detection Based on Convolutional Neural Networks
Helmet Detection Based on Convolutional Neural NetworksHelmet Detection Based on Convolutional Neural Networks
Helmet Detection Based on Convolutional Neural NetworksIRJET Journal
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
Machine Learning for Chemistry: Representing and Intervening
Machine Learning for Chemistry: Representing and InterveningMachine Learning for Chemistry: Representing and Intervening
Machine Learning for Chemistry: Representing and InterveningIchigaku Takigawa
 
[ADBIS2022] Insight-based Vocalization of OLAP Sessions
[ADBIS2022] Insight-based Vocalization of OLAP Sessions[ADBIS2022] Insight-based Vocalization of OLAP Sessions
[ADBIS2022] Insight-based Vocalization of OLAP SessionsUniversity of Bologna
 
Knowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sureKnowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sureSteffen Staab
 
Justification Factory : From justification requirements elicitation to their ...
Justification Factory : From justification requirements elicitation to their ...Justification Factory : From justification requirements elicitation to their ...
Justification Factory : From justification requirements elicitation to their ...Clément DUFFAU
 
Real Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A ReviewReal Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A ReviewSpringer
 
NVIDIA GTC 2018 Presentation
NVIDIA GTC 2018 PresentationNVIDIA GTC 2018 Presentation
NVIDIA GTC 2018 PresentationTomasz Bednarz
 
Algoritma genetika
Algoritma genetikaAlgoritma genetika
Algoritma genetikaHendra Arie
 
Performance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use casePerformance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use caseFlorian Wilhelm
 
Performance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use casePerformance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use caseinovex GmbH
 
II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...
II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...
II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...Dr. Haxel Consult
 
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...Universitat Politècnica de Catalunya
 
Capstone Design(2) 최종 발표
Capstone Design(2) 최종 발표Capstone Design(2) 최종 발표
Capstone Design(2) 최종 발표Hyunwoo Kim
 

Similar to Deep Fine-Grained Image Classification [Keynote at IFIP WCC 2018] (20)

LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
 
Accelerating the Design of Optical Networks using Surrogate Models
Accelerating the Design of Optical Networks using Surrogate ModelsAccelerating the Design of Optical Networks using Surrogate Models
Accelerating the Design of Optical Networks using Surrogate Models
 
Helmet Detection Based on Convolutional Neural Networks
Helmet Detection Based on Convolutional Neural NetworksHelmet Detection Based on Convolutional Neural Networks
Helmet Detection Based on Convolutional Neural Networks
 
OOD_PPT.pptx
OOD_PPT.pptxOOD_PPT.pptx
OOD_PPT.pptx
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
Machine Learning for Chemistry: Representing and Intervening
Machine Learning for Chemistry: Representing and InterveningMachine Learning for Chemistry: Representing and Intervening
Machine Learning for Chemistry: Representing and Intervening
 
[ADBIS2022] Insight-based Vocalization of OLAP Sessions
[ADBIS2022] Insight-based Vocalization of OLAP Sessions[ADBIS2022] Insight-based Vocalization of OLAP Sessions
[ADBIS2022] Insight-based Vocalization of OLAP Sessions
 
Knowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sureKnowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sure
 
Tools and Methods for Continuously Expanding Software Applications
Tools and Methods for Continuously Expanding Software ApplicationsTools and Methods for Continuously Expanding Software Applications
Tools and Methods for Continuously Expanding Software Applications
 
Justification Factory : From justification requirements elicitation to their ...
Justification Factory : From justification requirements elicitation to their ...Justification Factory : From justification requirements elicitation to their ...
Justification Factory : From justification requirements elicitation to their ...
 
Real Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A ReviewReal Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A Review
 
NVIDIA GTC 2018 Presentation
NVIDIA GTC 2018 PresentationNVIDIA GTC 2018 Presentation
NVIDIA GTC 2018 Presentation
 
Algoritma genetika
Algoritma genetikaAlgoritma genetika
Algoritma genetika
 
Performance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use casePerformance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use case
 
Performance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use casePerformance evaluation of GANs in a semisupervised OCR use case
Performance evaluation of GANs in a semisupervised OCR use case
 
II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...
II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...
II-SDV 2012 From (Text) Mining to Models: Applying Large-Scale Text Mining on...
 
Technical Writing Paper Presentation .pptx
Technical Writing Paper Presentation .pptxTechnical Writing Paper Presentation .pptx
Technical Writing Paper Presentation .pptx
 
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...
 
An introduction to automated analysis of feature models through propositional...
An introduction to automated analysis of feature models through propositional...An introduction to automated analysis of feature models through propositional...
An introduction to automated analysis of feature models through propositional...
 
Capstone Design(2) 최종 발표
Capstone Design(2) 최종 발표Capstone Design(2) 최종 발표
Capstone Design(2) 최종 발표
 

Recently uploaded

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Recently uploaded (20)

Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Deep Fine-Grained Image Classification [Keynote at IFIP WCC 2018]