SlideShare a Scribd company logo
1 of 18
Multi-class Image Classification using
Deep Convolutional Networks on
extremely large dataset
Marija Stanojevic
Ashis Chanda
CIS Department, Temple University
5th December 2017
Problem Definition
❖ Kaggle competition: Cdiscount’s Image Classification Challenge
❖ Classify user posted images into 5270 categories
❖ Challenges of dataset:
➢ Huge amount of data and categories
➢ Background clutter (objects blend into environment)
➢ Viewpoint and data scale variation
➢ Deformation
➢ Occlusion
➢ Illumination conditions
❖ Deep learning models seem suitable
Data sets Size # products
Train data 58.2 GB 7,069,896
Test data 14.5 GB 1,768,182
Category # Labels
Category1 49
Category2 483
Category3 5270
Dataset: Product complexity
Dataset: Categories
A sample list of product categories:
Our used models
● CNN
● DenseNet
● ResNet
● ResNext
● Wide-res Net
Models: CNN
❖ Our CNN
➢ 3 Conv, 2 FC layers, 3 Pool layers
➢ thinner layers
➢ downsampling three times
❖ Tried to use for three types of categories
❖ l th layer has connection with all previous layers (l -1)
❖ Normally, each dense block contains 40, 100, 160 layers
❖ We used 3 blocks, 3 layers. Total connection: l x (l -1) /2 = 15
❖ Each layer has BN, ReLU, Conv
❖ No transition layer (downsampling), but used dropout
Models: DenseNet
Models: ResNet
❖ 34, 50, 101, 152 layers
❖ Residual block example
❖ Batch normalization, ReLu
❖ Our residual network
➢ thinner layers (25 conv + 4 pool + fc)
➢ downsampling after each block
➢ no bottleneck
❖ Uses residual block as bases
❖ Block width is divided in k parts
which learn separately
❖ Blocks are wider than in resNet
❖ Distinct cardinalities
❖ 29, 50 and 101 layers
❖ Our resNext:
➢ thinner layers
➢ downsampling after each block
➢ less layers (19 conv + 4 pool + fc)
➢ cardinality 32 always - best
results according to paper
Models: ResNext
❖ Uses residual block as bases
❖ Block is k times wider (k=1, 2, 4, 8, 10, 12)
❖ Dropout beside batch normalization
❖ 16, 22, 28, 40 layers
❖ Our wideResNet:
➢ k = 2 - doesn’t increase much number
of parameters, but shows biggest
improvement in original results
➢ dropout keep = 0.7 - best in original results
➢ less layers (13 conv + 4 pool + fc)
Models: WideResNet
Challenges in dataset
❖ 5.6 TB after splitting data
❖ Train data (707 Batches) & Test data (177 Batches)
❖ Smaller batch: more reads and writes
❖ Bigger batch: memory error
❖ Each batch has 10,000 products ~ 20,000 images
❖ Cross-validation data: 707 products ~ 1500 images
Challenges in implementation
❖ Used owlsnesttwo high performance computing system
❖ GPU: NVIDIA Tesla P100 PCIe 12GB
❖ Only two jobs allowed in parallel
❖ Implemented with tensorflow and tflearn in python
❖ Network
➢ Complexity in debugging
➢ Bigger network: memory error => tuning network
➢ Thinner and less layers
➢ Small number of epoch
Baseline (CNN) Result
❖ CNN: (10 Epochs, 10 Batches)
❖ CNN: (50 Epochs, 50 Batches) (in progress)
Label # class Accuracy Error Time
Category 1 49 52% 10.7 4 hours
Category 2 483 47% 11.68 5 hours
Category 3 5270 32% 4.89 11 hours
Category 1 49 12%
(1 epoch)
3.42 20 hours
Experimental Result (category 3)
Model # Batch # Epoch Accuracy Error Time
DenseNet 50/707 1 (running) 4% 9.83 10 days
ResNet 50/707 10 34% 3.94489 5.69 hours
ResNext 50/707 10 28.9% 4.51031 16.8 hours
WideResNext 50/707 10 41.93% 3.53318 6.11 hours
ResNet 707/707 3 (running) 36.69% 3.84427 3.35 days
ResNext 707/707 1 (running) 28.47% 4.50394 9.90 days
WideResNext 707/707 2 (running) 40.51% 3.49804 3.60 days
We are here now
Experimental results: Error in classifying
Mobile cover bag Mobile Case Mobile
Laptop cover bag Mobile film protector Mobile Case
Conclusions and Future Work
❖ Baseline: CNN; Proposed: resNet, resNext, denseNet, wideResNet
❖ All proposed networks have similar number of parameters
❖ wideResNet performs the best
❖ resNext gave worst results and is 3 times slower than resNet
❖ DenseNet requires GPU with huge memory
❖ Requires a lot of time, huge memory and
fast computational resources
❖ Number of epochs has to be 70+
❖ Future: Submit result to Kaggle competition
Thank you
Questions?

More Related Content

What's hot

Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learningAntonio Rueda-Toicen
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detectionBrodmann17
 
Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...
Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...
Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...Ulaş Bağcı
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction systemSWAMI06
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network Yan Xu
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition Intel Nervana
 
CNN and its applications by ketaki
CNN and its applications by ketakiCNN and its applications by ketaki
CNN and its applications by ketakiKetaki Patwari
 
Diabetic Retinopathy.pptx
Diabetic Retinopathy.pptxDiabetic Retinopathy.pptx
Diabetic Retinopathy.pptxNGOKUL3
 
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Simplilearn
 
Convolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep LearningConvolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep LearningMohamed Loey
 
Computer Vision: Image Augmentation
Computer Vision: Image AugmentationComputer Vision: Image Augmentation
Computer Vision: Image AugmentationEng Teong Cheah
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architecturesananth
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksUsman Qayyum
 
Image captioning using DL and NLP.pptx
Image captioning using DL and NLP.pptxImage captioning using DL and NLP.pptx
Image captioning using DL and NLP.pptxMrUnknown820784
 
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow) Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow) Lalit Jain
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learningpratik pratyay
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetSungminYou
 

What's hot (20)

Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
 
Stroke Prediction
Stroke PredictionStroke Prediction
Stroke Prediction
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
 
Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...
Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...
Lec7: Medical Image Segmentation (I) (Radiology Applications of Segmentation,...
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction system
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network
 
Image captioning
Image captioningImage captioning
Image captioning
 
Siamese networks.pptx.pdf
Siamese networks.pptx.pdfSiamese networks.pptx.pdf
Siamese networks.pptx.pdf
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
 
CNN and its applications by ketaki
CNN and its applications by ketakiCNN and its applications by ketaki
CNN and its applications by ketaki
 
Diabetic Retinopathy.pptx
Diabetic Retinopathy.pptxDiabetic Retinopathy.pptx
Diabetic Retinopathy.pptx
 
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
 
Convolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep LearningConvolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep Learning
 
Computer Vision: Image Augmentation
Computer Vision: Image AugmentationComputer Vision: Image Augmentation
Computer Vision: Image Augmentation
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architectures
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural Networks
 
Image captioning using DL and NLP.pptx
Image captioning using DL and NLP.pptxImage captioning using DL and NLP.pptx
Image captioning using DL and NLP.pptx
 
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow) Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
 

Similar to Multi-class Image Classification using Deep Convolutional Networks on extremely large dataset

Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)Shien-Chun Luo
 
Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...Universitat Politècnica de Catalunya
 
Introduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetIntroduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetKrishnakoumarC
 
Once-for-All: Train One Network and Specialize it for Efficient Deployment
 Once-for-All: Train One Network and Specialize it for Efficient Deployment Once-for-All: Train One Network and Specialize it for Efficient Deployment
Once-for-All: Train One Network and Specialize it for Efficient Deploymenttaeseon ryu
 
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...Codemotion
 
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...e8xu
 
Distributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedDistributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedWee Hyong Tok
 
Intelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyIntelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyQualcomm Research
 
YOLO9000 - PR023
YOLO9000 - PR023YOLO9000 - PR023
YOLO9000 - PR023Jinwon Lee
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsHPCC Systems
 
Salt Identification Challenge
Salt Identification ChallengeSalt Identification Challenge
Salt Identification Challengekenluck2001
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsShunta Saito
 
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetsonNVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetsonNVIDIA Taiwan
 
Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterYousef Fadila
 
Machine Learning approaches at video compression
Machine Learning approaches at video compression Machine Learning approaches at video compression
Machine Learning approaches at video compression Roberto Iacoviello
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)DonghyunKang12
 
Ceph Day Netherlands - Ceph @ BIT
Ceph Day Netherlands - Ceph @ BIT Ceph Day Netherlands - Ceph @ BIT
Ceph Day Netherlands - Ceph @ BIT Ceph Community
 

Similar to Multi-class Image Classification using Deep Convolutional Networks on extremely large dataset (20)

Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)
 
Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...
 
Manycores for the Masses
Manycores for the MassesManycores for the Masses
Manycores for the Masses
 
Introduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetIntroduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNet
 
Once-for-All: Train One Network and Specialize it for Efficient Deployment
 Once-for-All: Train One Network and Specialize it for Efficient Deployment Once-for-All: Train One Network and Specialize it for Efficient Deployment
Once-for-All: Train One Network and Specialize it for Efficient Deployment
 
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
 
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
 
Distributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedDistributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learned
 
Intelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyIntelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiency
 
YOLO9000 - PR023
YOLO9000 - PR023YOLO9000 - PR023
YOLO9000 - PR023
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
 
Chatbot
ChatbotChatbot
Chatbot
 
Salt Identification Challenge
Salt Identification ChallengeSalt Identification Challenge
Salt Identification Challenge
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
 
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetsonNVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
 
Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity Calorimeter
 
Machine Learning approaches at video compression
Machine Learning approaches at video compression Machine Learning approaches at video compression
Machine Learning approaches at video compression
 
Deep Learning at Scale
Deep Learning at ScaleDeep Learning at Scale
Deep Learning at Scale
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
Ceph Day Netherlands - Ceph @ BIT
Ceph Day Netherlands - Ceph @ BIT Ceph Day Netherlands - Ceph @ BIT
Ceph Day Netherlands - Ceph @ BIT
 

More from Ashis Chanda

Understanding medical concepts and codes through NLP methods
Understanding medical concepts and codes through NLP methodsUnderstanding medical concepts and codes through NLP methods
Understanding medical concepts and codes through NLP methodsAshis Chanda
 
Information extraction from EHR
Information extraction from EHRInformation extraction from EHR
Information extraction from EHRAshis Chanda
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesAshis Chanda
 
Full resolution image compression with recurrent neural networks
Full resolution image compression with recurrent neural networksFull resolution image compression with recurrent neural networks
Full resolution image compression with recurrent neural networksAshis Chanda
 
Iterative deepening search
Iterative deepening searchIterative deepening search
Iterative deepening searchAshis Chanda
 
Periodic pattern mining
Periodic pattern miningPeriodic pattern mining
Periodic pattern miningAshis Chanda
 
An efficient approach to mine flexible periodic patterns in time series datab...
An efficient approach to mine flexible periodic patterns in time series datab...An efficient approach to mine flexible periodic patterns in time series datab...
An efficient approach to mine flexible periodic patterns in time series datab...Ashis Chanda
 
Frequent Pattern Growth Algorithm (FP growth method)
Frequent Pattern Growth Algorithm (FP growth method)Frequent Pattern Growth Algorithm (FP growth method)
Frequent Pattern Growth Algorithm (FP growth method)Ashis Chanda
 

More from Ashis Chanda (11)

Understanding medical concepts and codes through NLP methods
Understanding medical concepts and codes through NLP methodsUnderstanding medical concepts and codes through NLP methods
Understanding medical concepts and codes through NLP methods
 
Word2vector
Word2vectorWord2vector
Word2vector
 
Information extraction from EHR
Information extraction from EHRInformation extraction from EHR
Information extraction from EHR
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Full resolution image compression with recurrent neural networks
Full resolution image compression with recurrent neural networksFull resolution image compression with recurrent neural networks
Full resolution image compression with recurrent neural networks
 
Iterative deepening search
Iterative deepening searchIterative deepening search
Iterative deepening search
 
Periodic pattern mining
Periodic pattern miningPeriodic pattern mining
Periodic pattern mining
 
An efficient approach to mine flexible periodic patterns in time series datab...
An efficient approach to mine flexible periodic patterns in time series datab...An efficient approach to mine flexible periodic patterns in time series datab...
An efficient approach to mine flexible periodic patterns in time series datab...
 
Data Mining
Data MiningData Mining
Data Mining
 
Frequent Pattern Growth Algorithm (FP growth method)
Frequent Pattern Growth Algorithm (FP growth method)Frequent Pattern Growth Algorithm (FP growth method)
Frequent Pattern Growth Algorithm (FP growth method)
 
Apriori algorithm
Apriori algorithmApriori algorithm
Apriori algorithm
 

Recently uploaded

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 

Recently uploaded (20)

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 

Multi-class Image Classification using Deep Convolutional Networks on extremely large dataset

  • 1. Multi-class Image Classification using Deep Convolutional Networks on extremely large dataset Marija Stanojevic Ashis Chanda CIS Department, Temple University 5th December 2017
  • 2. Problem Definition ❖ Kaggle competition: Cdiscount’s Image Classification Challenge ❖ Classify user posted images into 5270 categories ❖ Challenges of dataset: ➢ Huge amount of data and categories ➢ Background clutter (objects blend into environment) ➢ Viewpoint and data scale variation ➢ Deformation ➢ Occlusion ➢ Illumination conditions ❖ Deep learning models seem suitable Data sets Size # products Train data 58.2 GB 7,069,896 Test data 14.5 GB 1,768,182 Category # Labels Category1 49 Category2 483 Category3 5270
  • 4. Dataset: Categories A sample list of product categories:
  • 5. Our used models ● CNN ● DenseNet ● ResNet ● ResNext ● Wide-res Net
  • 6. Models: CNN ❖ Our CNN ➢ 3 Conv, 2 FC layers, 3 Pool layers ➢ thinner layers ➢ downsampling three times ❖ Tried to use for three types of categories
  • 7. ❖ l th layer has connection with all previous layers (l -1) ❖ Normally, each dense block contains 40, 100, 160 layers ❖ We used 3 blocks, 3 layers. Total connection: l x (l -1) /2 = 15 ❖ Each layer has BN, ReLU, Conv ❖ No transition layer (downsampling), but used dropout Models: DenseNet
  • 8. Models: ResNet ❖ 34, 50, 101, 152 layers ❖ Residual block example ❖ Batch normalization, ReLu ❖ Our residual network ➢ thinner layers (25 conv + 4 pool + fc) ➢ downsampling after each block ➢ no bottleneck
  • 9. ❖ Uses residual block as bases ❖ Block width is divided in k parts which learn separately ❖ Blocks are wider than in resNet ❖ Distinct cardinalities ❖ 29, 50 and 101 layers ❖ Our resNext: ➢ thinner layers ➢ downsampling after each block ➢ less layers (19 conv + 4 pool + fc) ➢ cardinality 32 always - best results according to paper Models: ResNext
  • 10. ❖ Uses residual block as bases ❖ Block is k times wider (k=1, 2, 4, 8, 10, 12) ❖ Dropout beside batch normalization ❖ 16, 22, 28, 40 layers ❖ Our wideResNet: ➢ k = 2 - doesn’t increase much number of parameters, but shows biggest improvement in original results ➢ dropout keep = 0.7 - best in original results ➢ less layers (13 conv + 4 pool + fc) Models: WideResNet
  • 11. Challenges in dataset ❖ 5.6 TB after splitting data ❖ Train data (707 Batches) & Test data (177 Batches) ❖ Smaller batch: more reads and writes ❖ Bigger batch: memory error ❖ Each batch has 10,000 products ~ 20,000 images ❖ Cross-validation data: 707 products ~ 1500 images
  • 12. Challenges in implementation ❖ Used owlsnesttwo high performance computing system ❖ GPU: NVIDIA Tesla P100 PCIe 12GB ❖ Only two jobs allowed in parallel ❖ Implemented with tensorflow and tflearn in python ❖ Network ➢ Complexity in debugging ➢ Bigger network: memory error => tuning network ➢ Thinner and less layers ➢ Small number of epoch
  • 13. Baseline (CNN) Result ❖ CNN: (10 Epochs, 10 Batches) ❖ CNN: (50 Epochs, 50 Batches) (in progress) Label # class Accuracy Error Time Category 1 49 52% 10.7 4 hours Category 2 483 47% 11.68 5 hours Category 3 5270 32% 4.89 11 hours Category 1 49 12% (1 epoch) 3.42 20 hours
  • 14. Experimental Result (category 3) Model # Batch # Epoch Accuracy Error Time DenseNet 50/707 1 (running) 4% 9.83 10 days ResNet 50/707 10 34% 3.94489 5.69 hours ResNext 50/707 10 28.9% 4.51031 16.8 hours WideResNext 50/707 10 41.93% 3.53318 6.11 hours ResNet 707/707 3 (running) 36.69% 3.84427 3.35 days ResNext 707/707 1 (running) 28.47% 4.50394 9.90 days WideResNext 707/707 2 (running) 40.51% 3.49804 3.60 days
  • 15. We are here now
  • 16. Experimental results: Error in classifying Mobile cover bag Mobile Case Mobile Laptop cover bag Mobile film protector Mobile Case
  • 17. Conclusions and Future Work ❖ Baseline: CNN; Proposed: resNet, resNext, denseNet, wideResNet ❖ All proposed networks have similar number of parameters ❖ wideResNet performs the best ❖ resNext gave worst results and is 3 times slower than resNet ❖ DenseNet requires GPU with huge memory ❖ Requires a lot of time, huge memory and fast computational resources ❖ Number of epochs has to be 70+ ❖ Future: Submit result to Kaggle competition