SlideShare a Scribd company logo
1 of 28
Semantic Segmentation State of The Art on UAV
Imagery for Plant Species Classification
Pierre Lague
Université de Rennes 2 / Université Bretagne Sud
Presentation for the 6p meeting
CMI Internship 2021-2022 2/28 Internship Oral Defense
Table of Contents
1 Introduction
Introduction to the project
Introduction to Semantic Segmentation
2 The Data
3 Tools and Methods
Overall Process
Tools
4 Metrics
5 First Results
6 Finding an Adapted Metric
Object Detection Evaluation
Algorithm to compute TP, FP and FN
Results with the new method
Conclusion
Pierre Lague (UBS, UR2) Introduction
CMI Internship 2021-2022 3/28 Internship Oral Defense
Introduction to the project
1 The internship aims to deploy innovative methods of
characterization and identification of the vegetation via remote
sensing and artificial intelligence.
Objective 1:
Combine remote sensing
approaches and in situ species
identification via artificial
intelligence methods to allow
mapping of species present in sites
of interest.
Objective 2:
Establish a state of the art on
semantic segmentation methods
that can be used for our data
(UAV imagery) and that achieve
significant scores.
Pierre Lague (UBS, UR2) Introduction Introduction to the project
CMI Internship 2021-2022 4/28 Internship Oral Defense
Introduction to Semantic Segmentation
Definition
The goal of semantic image segmentation is to label each pixel of an
image with a corresponding class of what is being represented.
Figure 1: An example of semantic segmentation, where the goal is to predict
class labels for each pixel in the image
Pierre Lague (UBS, UR2) Introduction Introduction to Semantic Segmentation
CMI Internship 2021-2022 5/28 Internship Oral Defense
Understanding the U-net
Figure 2: Block Diagram of the U-net architecture
Pierre Lague (UBS, UR2) Introduction Introduction to Semantic Segmentation
CMI Internship 2021-2022 6/28 Internship Oral Defense
Understanding the U-net
Figure 3: Encoder-Decoder and Skip connections
Pierre Lague (UBS, UR2) Introduction Introduction to Semantic Segmentation
CMI Internship 2021-2022 7/28 Internship Oral Defense
Table of Contents
1 Introduction
Introduction to the project
Introduction to Semantic Segmentation
2 The Data
3 Tools and Methods
Overall Process
Tools
4 Metrics
5 First Results
6 Finding an Adapted Metric
Object Detection Evaluation
Algorithm to compute TP, FP and FN
Results with the new method
Conclusion
Pierre Lague (UBS, UR2) The Data
CMI Internship 2021-2022 8/28 Internship Oral Defense
Figure 4: The private company’s logo "l’Avion Jaune"
Study Sites:
1 Talus -> 513 images (103 test / 410 train)
2 Chichoue_Bas -> 95 images (19 test / 76 train)
3 Chichoue_Haut -> 280 images (56 test / 224 train)
4 Chichoue_Milieu_Bas -> 186 images (38 test / 148 train)
5 Chichoue_Milieu_Haut -> 295 images (59 test / 236 train)
Pierre Lague (UBS, UR2) The Data
CMI Internship 2021-2022 9/28 Internship Oral Defense
Remark
The particularity of the labels are such that a plant is represented by a
full circle :
(a) Sample from the Talus site (b) Ground Truth
Figure 5: An image and it’s associated label
Pierre Lague (UBS, UR2) The Data
CMI Internship 2021-2022 10/28 Internship Oral Defense
Here are the things we can expect during the training phases :
Remark
The model might have issues detecting the edges of the plant
(leaves, highly uncanny shapes ...).
The amount of data is not as important as the standards for
semantic segmentation studies, this will affect the performance of
the model.
Pierre Lague (UBS, UR2) The Data
CMI Internship 2021-2022 11/28 Internship Oral Defense
Table of Contents
1 Introduction
Introduction to the project
Introduction to Semantic Segmentation
2 The Data
3 Tools and Methods
Overall Process
Tools
4 Metrics
5 First Results
6 Finding an Adapted Metric
Object Detection Evaluation
Algorithm to compute TP, FP and FN
Results with the new method
Conclusion
Pierre Lague (UBS, UR2) Tools and Methods
CMI Internship 2021-2022 12/28 Internship Oral Defense
Overall Process
The complete process of the study is as follows :
Figure 6: Flow Chart of the project
Pierre Lague (UBS, UR2) Tools and Methods Overall Process
CMI Internship 2021-2022 13/28 Internship Oral Defense
Tools
"segmentation_models_pytorch" (SMP) allows the access to a variety of
architectures and backbones.
Architectures :
Unet
Unet++
PSPNet
MANet
LinkNet
FPN
DeepLab-V3
Backbones :
ResNet-50
ResNeXt-50
Inception-V4
MobileNet-V2
EfficientNet-B2
Pierre Lague (UBS, UR2) Tools and Methods Tools
CMI Internship 2021-2022 14/28 Internship Oral Defense
The programming language
The whole project was implemented in Python. Libraries such as pytorch,
numpy and sklearn were used during the implementation of the training /
testing phases.
Pierre Lague (UBS, UR2) Tools and Methods Tools
CMI Internship 2021-2022 15/28 Internship Oral Defense
Table of Contents
1 Introduction
Introduction to the project
Introduction to Semantic Segmentation
2 The Data
3 Tools and Methods
Overall Process
Tools
4 Metrics
5 First Results
6 Finding an Adapted Metric
Object Detection Evaluation
Algorithm to compute TP, FP and FN
Results with the new method
Conclusion
Pierre Lague (UBS, UR2) Metrics
CMI Internship 2021-2022 16/28 Internship Oral Defense
Remark
Asses the models quality and capacity to fulfill it’s main task.
Need to use multiple in order to have a reliable evaluation.
Pierre Lague (UBS, UR2) Metrics
CMI Internship 2021-2022 17/28 Internship Oral Defense
The metrics we decided to use are extracted from a confusion matrix :
Used Metrics:
Precision = TP
TP+FP
Recall = TP
TP+FN
IoU = TP
TP+FP+FN
F1 = Precision∗Recall
Precision+Recall ∗ 2
Figure 7: Confusion Matrix
Pierre Lague (UBS, UR2) Metrics
CMI Internship 2021-2022 18/28 Internship Oral Defense
Table of Contents
1 Introduction
Introduction to the project
Introduction to Semantic Segmentation
2 The Data
3 Tools and Methods
Overall Process
Tools
4 Metrics
5 First Results
6 Finding an Adapted Metric
Object Detection Evaluation
Algorithm to compute TP, FP and FN
Results with the new method
Conclusion
Pierre Lague (UBS, UR2) First Results
CMI Internship 2021-2022 19/28 Internship Oral Defense
First Results
Here are the results we obtained on the Talus Dataset only the best
results are displayed :
Network Accuracy Mean IoU Mean DICE
Manet 21.94% 0.0972 0.1444
Table 1: ResNet50 - Talus - 9 classes
Network Accuracy Mean IoU Mean DICE
Unet 19.99% 0.0864 0.1345
Table 2: ResNeXt50 - Talus - 9 classes
PSPNet 23.49% 0.0760 0.1233
LinkNet 23.43% 0.0813 0.1266
Table 3: EfficientNetB2 - Talus - 9 classes
Pierre Lague (UBS, UR2) First Results
CMI Internship 2021-2022 20/28 Internship Oral Defense
Network Accuracy Mean IoU Mean DICE
Unet++ 25.15% 0.0965 0.1429
PSPNet 25.49% 0.0731 0.1136
Table 4: InceptionV4 - Talus - 9 classes
Network Accuracy Mean IoU Mean DICE
PSPNet 29.86% 0.0792 0.1181
Table 5: MobileNetV2 - Talus - 9 classes
Remark
Overall the results are very low and not convincing. There are two
reasons for this :
approximate labelling -> metrics are not entirely reliable
lack of data
Pierre Lague (UBS, UR2) First Results
CMI Internship 2021-2022 21/28 Internship Oral Defense
Table of Contents
1 Introduction
Introduction to the project
Introduction to Semantic Segmentation
2 The Data
3 Tools and Methods
Overall Process
Tools
4 Metrics
5 First Results
6 Finding an Adapted Metric
Object Detection Evaluation
Algorithm to compute TP, FP and FN
Results with the new method
Conclusion
Pierre Lague (UBS, UR2) Finding an Adapted Metric
CMI Internship 2021-2022 22/28 Internship Oral Defense
Object Detection Evaluation
New Method
Change the scale of evaluation : from a pixel-based evaluation to an
object-based evaluation.
Figure 8: New ways of computing the metrics to evaluate our models
Remark
The goal is to compute the Average Precision for each classes and then
the Mean Average Precision.
Pierre Lague (UBS, UR2) Finding an Adapted Metric Object Detection Evaluation
CMI Internship 2021-2022 23/28 Internship Oral Defense
Algorithm to compute TP, FP and FN
Figure 9: Illustrated algorithm to compute TP, FP and FN with an example
Pierre Lague (UBS, UR2) Finding an Adapted Metric Algorithm to compute TP, FP and FN
CMI Internship 2021-2022 24/28 Internship Oral Defense
Results with the new method
Steps to evaluate the models
Include the algorithm into the testing phase
For all classes and for multiple thresholds, compute Average
Precision
Compute Mean Average Precision
Pierre Lague (UBS, UR2) Finding an Adapted Metric Results with the new method
CMI Internship 2021-2022 25/28 Internship Oral Defense
Class Average Precision
1 0.09787
Table 6: Average Precisions class 1
Figure 10: Precision-Recall Curve for Class 1, Average Precision is 0.097
Remark
Mean Average Precision The Average Precision for all the other classes
was 0. The Mean Average Precision is 0.009787
Pierre Lague (UBS, UR2) Finding an Adapted Metric Results with the new method
CMI Internship 2021-2022 26/28 Internship Oral Defense
Conclusion
The new way of computing the metrics is adapted to the setup of our
experiment and our labels. This way we can reliably evaluate our models
:
The Mean Average Precision shows that our models are of very poor
quality.
The discriminant capacities of our models are also very low, even
after trying out multiple configurations.
Our models did not achieve significant scores for them to be used
on the field. Semantic Segmentation may not be the most
effective approach, given the setup of the experiment.
Pierre Lague (UBS, UR2) Finding an Adapted Metric Conclusion
CMI Internship 2021-2022 27/28 Internship Oral Defense
Future Work
Object Detection has a lot of potential. Bounding boxes can easily be
constructed out of the labels we already have. The problem has to be
less granular in order to achieve better scores.
Pierre Lague (UBS, UR2) Finding an Adapted Metric Conclusion
CMI Internship 2021-2022 28/28 Internship Oral Defense
Thank you for your time.
Pierre Lague (UBS, UR2) Finding an Adapted Metric Conclusion

More Related Content

Similar to RESEARCH_PRESENTATION.pdf

Similar to RESEARCH_PRESENTATION.pdf (20)

CenterAttentionFaceNet: A improved network with the CBAM attention mechanism
CenterAttentionFaceNet: A improved network with the CBAM attention mechanismCenterAttentionFaceNet: A improved network with the CBAM attention mechanism
CenterAttentionFaceNet: A improved network with the CBAM attention mechanism
 
IRJET- Crowd Density Estimation using Image Processing
IRJET- Crowd Density Estimation using Image ProcessingIRJET- Crowd Density Estimation using Image Processing
IRJET- Crowd Density Estimation using Image Processing
 
CovidAID: COVID-19 Detection using Chest X-Ray Images
CovidAID: COVID-19 Detection using Chest X-Ray ImagesCovidAID: COVID-19 Detection using Chest X-Ray Images
CovidAID: COVID-19 Detection using Chest X-Ray Images
 
IRJET - Comparative Study of Flight Delay Prediction using Back Propagati...
IRJET -  	  Comparative Study of Flight Delay Prediction using Back Propagati...IRJET -  	  Comparative Study of Flight Delay Prediction using Back Propagati...
IRJET - Comparative Study of Flight Delay Prediction using Back Propagati...
 
A Real-Time Emotion Recognition from Facial Expression Using Conventional Neu...
A Real-Time Emotion Recognition from Facial Expression Using Conventional Neu...A Real-Time Emotion Recognition from Facial Expression Using Conventional Neu...
A Real-Time Emotion Recognition from Facial Expression Using Conventional Neu...
 
Unit 28 topographic_surveying_in_construction_and_civil_engineering
Unit 28 topographic_surveying_in_construction_and_civil_engineeringUnit 28 topographic_surveying_in_construction_and_civil_engineering
Unit 28 topographic_surveying_in_construction_and_civil_engineering
 
Five Minute Speech: An Overview of Activities Developed in Disciplines and Gu...
Five Minute Speech: An Overview of Activities Developed in Disciplines and Gu...Five Minute Speech: An Overview of Activities Developed in Disciplines and Gu...
Five Minute Speech: An Overview of Activities Developed in Disciplines and Gu...
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
 
Is Your Software Development Process Green?
Is Your Software Development Process Green?Is Your Software Development Process Green?
Is Your Software Development Process Green?
 
IRJET- Automated Detection of Diabetic Retinopathy using Compressed Sensing
IRJET- Automated Detection of Diabetic Retinopathy using Compressed SensingIRJET- Automated Detection of Diabetic Retinopathy using Compressed Sensing
IRJET- Automated Detection of Diabetic Retinopathy using Compressed Sensing
 
Parkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMParkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVM
 
ReTV AI4TV Summarization
ReTV AI4TV SummarizationReTV AI4TV Summarization
ReTV AI4TV Summarization
 
Sentiment Analysis using Naïve Bayes, CNN, SVM
Sentiment Analysis using Naïve Bayes, CNN, SVMSentiment Analysis using Naïve Bayes, CNN, SVM
Sentiment Analysis using Naïve Bayes, CNN, SVM
 
Thermal Imaging in Surgical Site Infection (SSI) Detection
Thermal Imaging in Surgical Site Infection (SSI) DetectionThermal Imaging in Surgical Site Infection (SSI) Detection
Thermal Imaging in Surgical Site Infection (SSI) Detection
 
IRJET- Glaucoma Detection using Convolutional Neural Network
IRJET- Glaucoma Detection using Convolutional Neural NetworkIRJET- Glaucoma Detection using Convolutional Neural Network
IRJET- Glaucoma Detection using Convolutional Neural Network
 
Deep Learning-based Diagnosis of Pneumonia using X-Ray Scans
Deep Learning-based Diagnosis of Pneumonia using X-Ray ScansDeep Learning-based Diagnosis of Pneumonia using X-Ray Scans
Deep Learning-based Diagnosis of Pneumonia using X-Ray Scans
 
Sign Detection from Hearing Impaired
Sign Detection from Hearing ImpairedSign Detection from Hearing Impaired
Sign Detection from Hearing Impaired
 
IRJET - An Robust and Dynamic Fire Detection Method using Convolutional N...
IRJET -  	  An Robust and Dynamic Fire Detection Method using Convolutional N...IRJET -  	  An Robust and Dynamic Fire Detection Method using Convolutional N...
IRJET - An Robust and Dynamic Fire Detection Method using Convolutional N...
 
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVCSVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
SVM Based Saliency Map Technique for Reducing Time Complexity in HEVC
 
IRJET- Sketch-Verse: Sketch Image Inversion using DCNN
IRJET- Sketch-Verse: Sketch Image Inversion using DCNNIRJET- Sketch-Verse: Sketch Image Inversion using DCNN
IRJET- Sketch-Verse: Sketch Image Inversion using DCNN
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

RESEARCH_PRESENTATION.pdf

  • 1. Semantic Segmentation State of The Art on UAV Imagery for Plant Species Classification Pierre Lague Université de Rennes 2 / Université Bretagne Sud Presentation for the 6p meeting
  • 2. CMI Internship 2021-2022 2/28 Internship Oral Defense Table of Contents 1 Introduction Introduction to the project Introduction to Semantic Segmentation 2 The Data 3 Tools and Methods Overall Process Tools 4 Metrics 5 First Results 6 Finding an Adapted Metric Object Detection Evaluation Algorithm to compute TP, FP and FN Results with the new method Conclusion Pierre Lague (UBS, UR2) Introduction
  • 3. CMI Internship 2021-2022 3/28 Internship Oral Defense Introduction to the project 1 The internship aims to deploy innovative methods of characterization and identification of the vegetation via remote sensing and artificial intelligence. Objective 1: Combine remote sensing approaches and in situ species identification via artificial intelligence methods to allow mapping of species present in sites of interest. Objective 2: Establish a state of the art on semantic segmentation methods that can be used for our data (UAV imagery) and that achieve significant scores. Pierre Lague (UBS, UR2) Introduction Introduction to the project
  • 4. CMI Internship 2021-2022 4/28 Internship Oral Defense Introduction to Semantic Segmentation Definition The goal of semantic image segmentation is to label each pixel of an image with a corresponding class of what is being represented. Figure 1: An example of semantic segmentation, where the goal is to predict class labels for each pixel in the image Pierre Lague (UBS, UR2) Introduction Introduction to Semantic Segmentation
  • 5. CMI Internship 2021-2022 5/28 Internship Oral Defense Understanding the U-net Figure 2: Block Diagram of the U-net architecture Pierre Lague (UBS, UR2) Introduction Introduction to Semantic Segmentation
  • 6. CMI Internship 2021-2022 6/28 Internship Oral Defense Understanding the U-net Figure 3: Encoder-Decoder and Skip connections Pierre Lague (UBS, UR2) Introduction Introduction to Semantic Segmentation
  • 7. CMI Internship 2021-2022 7/28 Internship Oral Defense Table of Contents 1 Introduction Introduction to the project Introduction to Semantic Segmentation 2 The Data 3 Tools and Methods Overall Process Tools 4 Metrics 5 First Results 6 Finding an Adapted Metric Object Detection Evaluation Algorithm to compute TP, FP and FN Results with the new method Conclusion Pierre Lague (UBS, UR2) The Data
  • 8. CMI Internship 2021-2022 8/28 Internship Oral Defense Figure 4: The private company’s logo "l’Avion Jaune" Study Sites: 1 Talus -> 513 images (103 test / 410 train) 2 Chichoue_Bas -> 95 images (19 test / 76 train) 3 Chichoue_Haut -> 280 images (56 test / 224 train) 4 Chichoue_Milieu_Bas -> 186 images (38 test / 148 train) 5 Chichoue_Milieu_Haut -> 295 images (59 test / 236 train) Pierre Lague (UBS, UR2) The Data
  • 9. CMI Internship 2021-2022 9/28 Internship Oral Defense Remark The particularity of the labels are such that a plant is represented by a full circle : (a) Sample from the Talus site (b) Ground Truth Figure 5: An image and it’s associated label Pierre Lague (UBS, UR2) The Data
  • 10. CMI Internship 2021-2022 10/28 Internship Oral Defense Here are the things we can expect during the training phases : Remark The model might have issues detecting the edges of the plant (leaves, highly uncanny shapes ...). The amount of data is not as important as the standards for semantic segmentation studies, this will affect the performance of the model. Pierre Lague (UBS, UR2) The Data
  • 11. CMI Internship 2021-2022 11/28 Internship Oral Defense Table of Contents 1 Introduction Introduction to the project Introduction to Semantic Segmentation 2 The Data 3 Tools and Methods Overall Process Tools 4 Metrics 5 First Results 6 Finding an Adapted Metric Object Detection Evaluation Algorithm to compute TP, FP and FN Results with the new method Conclusion Pierre Lague (UBS, UR2) Tools and Methods
  • 12. CMI Internship 2021-2022 12/28 Internship Oral Defense Overall Process The complete process of the study is as follows : Figure 6: Flow Chart of the project Pierre Lague (UBS, UR2) Tools and Methods Overall Process
  • 13. CMI Internship 2021-2022 13/28 Internship Oral Defense Tools "segmentation_models_pytorch" (SMP) allows the access to a variety of architectures and backbones. Architectures : Unet Unet++ PSPNet MANet LinkNet FPN DeepLab-V3 Backbones : ResNet-50 ResNeXt-50 Inception-V4 MobileNet-V2 EfficientNet-B2 Pierre Lague (UBS, UR2) Tools and Methods Tools
  • 14. CMI Internship 2021-2022 14/28 Internship Oral Defense The programming language The whole project was implemented in Python. Libraries such as pytorch, numpy and sklearn were used during the implementation of the training / testing phases. Pierre Lague (UBS, UR2) Tools and Methods Tools
  • 15. CMI Internship 2021-2022 15/28 Internship Oral Defense Table of Contents 1 Introduction Introduction to the project Introduction to Semantic Segmentation 2 The Data 3 Tools and Methods Overall Process Tools 4 Metrics 5 First Results 6 Finding an Adapted Metric Object Detection Evaluation Algorithm to compute TP, FP and FN Results with the new method Conclusion Pierre Lague (UBS, UR2) Metrics
  • 16. CMI Internship 2021-2022 16/28 Internship Oral Defense Remark Asses the models quality and capacity to fulfill it’s main task. Need to use multiple in order to have a reliable evaluation. Pierre Lague (UBS, UR2) Metrics
  • 17. CMI Internship 2021-2022 17/28 Internship Oral Defense The metrics we decided to use are extracted from a confusion matrix : Used Metrics: Precision = TP TP+FP Recall = TP TP+FN IoU = TP TP+FP+FN F1 = Precision∗Recall Precision+Recall ∗ 2 Figure 7: Confusion Matrix Pierre Lague (UBS, UR2) Metrics
  • 18. CMI Internship 2021-2022 18/28 Internship Oral Defense Table of Contents 1 Introduction Introduction to the project Introduction to Semantic Segmentation 2 The Data 3 Tools and Methods Overall Process Tools 4 Metrics 5 First Results 6 Finding an Adapted Metric Object Detection Evaluation Algorithm to compute TP, FP and FN Results with the new method Conclusion Pierre Lague (UBS, UR2) First Results
  • 19. CMI Internship 2021-2022 19/28 Internship Oral Defense First Results Here are the results we obtained on the Talus Dataset only the best results are displayed : Network Accuracy Mean IoU Mean DICE Manet 21.94% 0.0972 0.1444 Table 1: ResNet50 - Talus - 9 classes Network Accuracy Mean IoU Mean DICE Unet 19.99% 0.0864 0.1345 Table 2: ResNeXt50 - Talus - 9 classes PSPNet 23.49% 0.0760 0.1233 LinkNet 23.43% 0.0813 0.1266 Table 3: EfficientNetB2 - Talus - 9 classes Pierre Lague (UBS, UR2) First Results
  • 20. CMI Internship 2021-2022 20/28 Internship Oral Defense Network Accuracy Mean IoU Mean DICE Unet++ 25.15% 0.0965 0.1429 PSPNet 25.49% 0.0731 0.1136 Table 4: InceptionV4 - Talus - 9 classes Network Accuracy Mean IoU Mean DICE PSPNet 29.86% 0.0792 0.1181 Table 5: MobileNetV2 - Talus - 9 classes Remark Overall the results are very low and not convincing. There are two reasons for this : approximate labelling -> metrics are not entirely reliable lack of data Pierre Lague (UBS, UR2) First Results
  • 21. CMI Internship 2021-2022 21/28 Internship Oral Defense Table of Contents 1 Introduction Introduction to the project Introduction to Semantic Segmentation 2 The Data 3 Tools and Methods Overall Process Tools 4 Metrics 5 First Results 6 Finding an Adapted Metric Object Detection Evaluation Algorithm to compute TP, FP and FN Results with the new method Conclusion Pierre Lague (UBS, UR2) Finding an Adapted Metric
  • 22. CMI Internship 2021-2022 22/28 Internship Oral Defense Object Detection Evaluation New Method Change the scale of evaluation : from a pixel-based evaluation to an object-based evaluation. Figure 8: New ways of computing the metrics to evaluate our models Remark The goal is to compute the Average Precision for each classes and then the Mean Average Precision. Pierre Lague (UBS, UR2) Finding an Adapted Metric Object Detection Evaluation
  • 23. CMI Internship 2021-2022 23/28 Internship Oral Defense Algorithm to compute TP, FP and FN Figure 9: Illustrated algorithm to compute TP, FP and FN with an example Pierre Lague (UBS, UR2) Finding an Adapted Metric Algorithm to compute TP, FP and FN
  • 24. CMI Internship 2021-2022 24/28 Internship Oral Defense Results with the new method Steps to evaluate the models Include the algorithm into the testing phase For all classes and for multiple thresholds, compute Average Precision Compute Mean Average Precision Pierre Lague (UBS, UR2) Finding an Adapted Metric Results with the new method
  • 25. CMI Internship 2021-2022 25/28 Internship Oral Defense Class Average Precision 1 0.09787 Table 6: Average Precisions class 1 Figure 10: Precision-Recall Curve for Class 1, Average Precision is 0.097 Remark Mean Average Precision The Average Precision for all the other classes was 0. The Mean Average Precision is 0.009787 Pierre Lague (UBS, UR2) Finding an Adapted Metric Results with the new method
  • 26. CMI Internship 2021-2022 26/28 Internship Oral Defense Conclusion The new way of computing the metrics is adapted to the setup of our experiment and our labels. This way we can reliably evaluate our models : The Mean Average Precision shows that our models are of very poor quality. The discriminant capacities of our models are also very low, even after trying out multiple configurations. Our models did not achieve significant scores for them to be used on the field. Semantic Segmentation may not be the most effective approach, given the setup of the experiment. Pierre Lague (UBS, UR2) Finding an Adapted Metric Conclusion
  • 27. CMI Internship 2021-2022 27/28 Internship Oral Defense Future Work Object Detection has a lot of potential. Bounding boxes can easily be constructed out of the labels we already have. The problem has to be less granular in order to achieve better scores. Pierre Lague (UBS, UR2) Finding an Adapted Metric Conclusion
  • 28. CMI Internship 2021-2022 28/28 Internship Oral Defense Thank you for your time. Pierre Lague (UBS, UR2) Finding an Adapted Metric Conclusion