AIxIA 2021 Main Track Presentation

•

0 likes•125 views

Gennaro Vessio

Human Detection in Drone Images Using YOLO for Search-and-Rescue Operations

Science

Context
Drones can provide a cost-eﬃcient aid to
search-and-rescue operations:
● swarms of aerial vehicles can be rapidly
spread across a disaster area providing
mobile ad-hoc networks
● they can rapidly overﬂy and traverse
diﬃcult to reach regions, such as
mountains, islands, etc.
● they can deliver rescue apparatus, such as
medications, much faster than rescue
teams
2

Motivations
However, in such a scenario, a manual search performed by a ﬂight operator
(based on the aerial video captured by the drone) can prove extremely diﬃcult:
● it requires a long concentration to perform the ﬂight operation and the
searching task at the same time
● the operator could work in poor conditions, because of the small size of the
monitor he is equipped with, as well as the brightness of the screen outdoor
The use of autonomous drones can reduce manual human intervention, thereby
increasing detection rate, while reducing rescue time
3

Goal
This opportunity motivates research eﬀorts
towards the development of real-time intelligent
tools to be mounted directly on-board drones
Nowadays, drones embed quite powerful GPUs,
so even a simple UAV can be transformed into
an advanced computer vision ﬂying machine
4

Detection method
We considered the lighter versions of YOLOv5:
● YOLOv5s (small-size)
● YOLOv5m (medium-size)
YOLOv5 is diﬀerent from all other previous
versions; in particular, it introduced mosaic data
augmentation and the ability to autonomously
learn bounding box anchors
5

Datasets
● HERIDAL
○ 1700 images of wildlife at high resolution
● SARD
○ 1981 images of wildlife at high resolution
○ pose estimation labels
6

Setting
● Software:
○ Roboﬂow library
● Hardware:
○ Google Colab NVIDIA Tesla K80
● Hyper-parameter setting:
○ Pretraining: COCO
○ Mini-batch size: 32
○ Learning rate: 0.01
○ Early stopping on a validation set
○ Input: 800×800
○ IoU: 50%
● Performance metrics:
○ Precision
○ Recall
○ AP
7

Conclusion
Promising human detection performance using the latest YOLOv5 detection
algorithm have been obtained
Future work: improve the results obtained on the multi-class classiﬁcation based
on human pose by properly augmenting the under-represented classes
Thanks for the attention!
10

What's hot

Introduction to TLS Applications PresentationSERC at Carleton College

CSEO CURVES project (june)Andrey Klimenko

Photogrametry_3D_Modelling[1]Joachim Nkendeys

MSc Proposal Presentation: A comparison of TLS and PhotogrammetryPeter McCready

Night vision device 2brads112

COMIT Community Day Winter 2018 - GeoSLAMComit Projects Ltd

Hyperspectral ImagingParikshith Beenaveni

Remote Sensing: Meaning, Concept and Components | GeographySrimantaKarak

REMOTE SENSINGmusadoto

Remote sensing and gis pptpreeti patil

Nanosat eye in the sky Astronomy Society of VictoriaMark Smith

Optical remote sensingMohsin Siddique

Unmanned Aerial Vehicles: COMP4DRONES (ECSEL JU)Big Data Value Association

(2015/09) Drone Imagery EconomicsMartin Scholl

What's hot (14)

Introduction to TLS Applications Presentation

CSEO CURVES project (june)

Photogrametry_3D_Modelling[1]

MSc Proposal Presentation: A comparison of TLS and Photogrammetry

Night vision device 2

COMIT Community Day Winter 2018 - GeoSLAM

Hyperspectral Imaging

Remote Sensing: Meaning, Concept and Components | Geography

REMOTE SENSING

Remote sensing and gis ppt

Nanosat eye in the sky Astronomy Society of Victoria

Optical remote sensing

Unmanned Aerial Vehicles: COMP4DRONES (ECSEL JU)

(2015/09) Drone Imagery Economics

Recently uploaded

Topography and sediments of the floor of the Bay of BengalMd Hasan Tareq

FAIR & AI Ready KGs for Explainable PredictionsMichel Dumontier

SAMPLING.pptx for analystical chemistry sample techniquesrodneykiptoo8

INSIGHT Partner Profile: Tampere UniversitySteffi Friedrichs

GEOLOGICAL FIELD REPORT On Kaptai Rangamati Road-Cut Section.pdfUniversity of Barishal

insect taxonomy importance systematics and classificationanitaento25

National Biodiversity protection initiatives and Convention on Biological Di...PABOLU TEJASREE

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243Sérgio Sacani

Lab report on liquid viscosity of glycerinossaicprecious19

ESR_factors_affect-clinic significance-Pathysiology.pptxmuralinath2

GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptxSultanMuhammadGhauri

NuGOweek 2024 full programme - hosted by Ghent Universitypablovgd

Pests of sugarcane_Binomics_IPM_Dr.UPR.pdfPirithiRaju

NuGOweek 2024 Ghent - programme - final versionpablovgd

Richard's entangled aventures in wonderlandRichard Gill

Cancer cell metabolism: special Reference to Lactate PathwayAADYARAJPANDEY1

Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Sérgio Sacani

Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...NathanBaughman3

Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Sérgio Sacani

Recently uploaded (20)

Topography and sediments of the floor of the Bay of Bengal

FAIR & AI Ready KGs for Explainable Predictions

SAMPLING.pptx for analystical chemistry sample techniques

INSIGHT Partner Profile: Tampere University

GEOLOGICAL FIELD REPORT On Kaptai Rangamati Road-Cut Section.pdf

insect taxonomy importance systematics and classification

National Biodiversity protection initiatives and Convention on Biological Di...

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...

Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243

Lab report on liquid viscosity of glycerin

ESR_factors_affect-clinic significance-Pathysiology.pptx

GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptx

NuGOweek 2024 full programme - hosted by Ghent University

Pests of sugarcane_Binomics_IPM_Dr.UPR.pdf

NuGOweek 2024 Ghent - programme - final version

Richard's entangled aventures in wonderland

Cancer cell metabolism: special Reference to Lactate Pathway

Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...

Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...

Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...

AIxIA 2021 Main Track Presentation

1. Human Detection in Drone Images Using YOLO for Search-and-Rescue Operations Sergio Caputo, Giovanna Castellano, Francesco Greco, Corrado Mencar, Niccolò Petti, Gennaro Vessio gennaro.vessio@uniba.it

2. Context Drones can provide a cost-efficient aid to search-and-rescue operations: ● swarms of aerial vehicles can be rapidly spread across a disaster area providing mobile ad-hoc networks ● they can rapidly overfly and traverse difficult to reach regions, such as mountains, islands, etc. ● they can deliver rescue apparatus, such as medications, much faster than rescue teams 2

3. Motivations However, in such a scenario, a manual search performed by a flight operator (based on the aerial video captured by the drone) can prove extremely difficult: ● it requires a long concentration to perform the flight operation and the searching task at the same time ● the operator could work in poor conditions, because of the small size of the monitor he is equipped with, as well as the brightness of the screen outdoor The use of autonomous drones can reduce manual human intervention, thereby increasing detection rate, while reducing rescue time 3

4. Goal This opportunity motivates research eﬀorts towards the development of real-time intelligent tools to be mounted directly on-board drones Nowadays, drones embed quite powerful GPUs, so even a simple UAV can be transformed into an advanced computer vision ﬂying machine 4

5. Detection method We considered the lighter versions of YOLOv5: ● YOLOv5s (small-size) ● YOLOv5m (medium-size) YOLOv5 is diﬀerent from all other previous versions; in particular, it introduced mosaic data augmentation and the ability to autonomously learn bounding box anchors 5

6. Datasets ● HERIDAL ○ 1700 images of wildlife at high resolution ● SARD ○ 1981 images of wildlife at high resolution ○ pose estimation labels 6

7. Setting ● Software: ○ Roboﬂow library ● Hardware: ○ Google Colab NVIDIA Tesla K80 ● Hyper-parameter setting: ○ Pretraining: COCO ○ Mini-batch size: 32 ○ Learning rate: 0.01 ○ Early stopping on a validation set ○ Input: 800×800 ○ IoU: 50% ● Performance metrics: ○ Precision ○ Recall ○ AP 7

8. Results 8

9. Results 9

10. Conclusion Promising human detection performance using the latest YOLOv5 detection algorithm have been obtained Future work: improve the results obtained on the multi-class classiﬁcation based on human pose by properly augmenting the under-represented classes Thanks for the attention! 10

AIxIA 2021 Main Track Presentation

Recommended

Recommended

More Related Content

What's hot

What's hot (14)

Similar to AIxIA 2021 Main Track Presentation

Similar to AIxIA 2021 Main Track Presentation (20)

More from Gennaro Vessio

More from Gennaro Vessio (12)

Recently uploaded

Recently uploaded (20)

AIxIA 2021 Main Track Presentation