ODS.ai Odessa Meetup #3: Object Detection in the Wild

•

0 likes•377 views

Borys Tymchenko (Senior ML Research Engineer at VITech) Поговорим о том, что делать, если из органов зрения есть только IP/CCTV камеры, а хочется детектировать объекты и их свойства. https://dataphoenix.info/ods-ai-odessa-meetup-3/

Data & Analytics

Object Detection in the Wild
Borys Tymchenko
VITech
(an embarrassingly simple approach)

What do we need to solve?
- Detect objects
- Detect their attributes
- Track them in time

What do we need to solve in particular?
- Detect people
- Detect their clothes
- Helmet
- Vest
- Glasses
- Headphones
- Track them in time
- Challenging environment

Environment challenges us to be creative!
- People can occlude each other
- People can carry things
- People can be occluded by environment
- Tracks must be consistent
- Clothes must be detected per person

Detecting clothes: three approaches
- Top down approach:
- Detect person
- Within it detect clothes
- Play around matching
- Bottom-up approach
- Detect clothes
- Try to stitch them into person
- Single pass
- Detect person in the frame
- Detect clothes in the frame
- Match them together

Top-down and bottom-up downsides
- Sensitive to occlusions
- Unattended clothes items can lead to false positives
- Overlapping people lead to false reconstructions
- Temporal inconsistency

Doing everything in a single pass: the method
Combine CenteNet and FCOS to detect simultaneously:
- object classes
- bounding boxes
- attributes
- tracking features

Overall approach
- Predict everything for every pixel, sort out in post-processing

Training
- Focal Loss on objectness head
- Training target is gaussian with the value 1 in the center and 0 on the edge of the box
- L1 loss on bbox coordinates head
- Training target: for every pixel inside bbox there are 4 coordinates (lop, left, right, bottom)
- Focal Loss on classes head
- Training target is gaussian with the value 1 in the center and 0 on the edge of the box
- Focal Loss on attributes head
- Training target is gaussian with the value 1 in the center and 0 on the edge of the box
- Cross-Entropy on features head
- Training target: class for every object in the dataset

Inference
- Select top-k peaks on the objectness heatmap
- From their coordinates get values from other heatmaps
- Apply scaling/sigmoid if needed
Tracking?
DeepSORT with provided features

Problem detected: heatmaps are not aligned
Solution:
Multiply inputs to all heads by objectness during training (cue: CenterNetv2)

What about data? And how to validate this?
- Pretrain on synthetic dataset (in-house)
- Collect data from different locations, label everything)
- Train jointly, validate separately
- Ensure to leave some locations to validate if it generalizes well
If your project is non-commercial, use CrowdHuman, COCO, etc. to pretrain

Useful papers
- CenterNet: arxiv.org/abs/1904.07850
- FCOS: arxiv.org/abs/1904.01355
- CenterNet v2: arxiv.org/abs/2103.07461
- TTFNet: arxiv.org/abs/1909.00700
- YOLOv5: github.com/ultralytics/yolov5

Thanks for watching!
Feel free to say “WT# was this?”!
18
ods.ai: spsancti
t.me/spsancti
fb.com/borys.tymc

Recently uploaded

edited gordis ebook sixth edition david d.pdfgreat91

Bios of leading Astrologers & Researchersdarmandersingh4580

一比一原版麦考瑞大学毕业证成绩单如何办理cyebo

一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理pyhepag

社内勉強会資料_Object Recognition as Next Token PredictionNABLAS株式会社

一比一原版阿德莱德大学毕业证成绩单如何办理pyhepag

123.docx. .jipohal318

一比一原版西悉尼大学毕业证成绩单如何办理pyhepag

原件一样(UWO毕业证书）西安大略大学毕业证成绩单留信学历认证pwgnohujw

社内勉強会資料　Mamba - A new era or ephemeralNABLAS株式会社

如何办理(UPenn毕业证书）宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证acoha1

Aggregations - The Elasticsearch "GROUP BY"John Sobanski

How to Transform Clinical Trial Management with Advanced Data AnalyticsBrainSell Technologies

一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理pyhepag

NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...Amil baba

Predictive Precipitation: Advanced Rain Forecasting TechniquesBoston Institute of Analytics

Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh +966572737505 get cytotec

Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeBoston Institute of Analytics

Abortion Clinic in Randfontein +27791653574 Randfontein WhatsApp Abortion Cli...mikehavy0

Formulas dax para power bI de microsoft.pdfRobertoOcampo24

Recently uploaded (20)

edited gordis ebook sixth edition david d.pdf

Bios of leading Astrologers & Researchers

一比一原版麦考瑞大学毕业证成绩单如何办理

一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理

社内勉強会資料_Object Recognition as Next Token Prediction

一比一原版阿德莱德大学毕业证成绩单如何办理

123.docx. .

一比一原版西悉尼大学毕业证成绩单如何办理

原件一样(UWO毕业证书）西安大略大学毕业证成绩单留信学历认证

社内勉強会資料　Mamba - A new era or ephemeral

如何办理(UPenn毕业证书）宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证

Aggregations - The Elasticsearch "GROUP BY"

How to Transform Clinical Trial Management with Advanced Data Analytics

一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理

NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...

Predictive Precipitation: Advanced Rain Forecasting Techniques

Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec

Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age

Abortion Clinic in Randfontein +27791653574 Randfontein WhatsApp Abortion Cli...

Formulas dax para power bI de microsoft.pdf

Featured

2024 State of Marketing Report – by HubspotMarius Sescu

Everything You Need To Know About ChatGPTExpeed Software

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

Skeleton Culture CodeSkeleton Technologies

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

Content Methodology: A Best Practices Report (Webinar)contently

How to Prepare For a Successful Job Search for 2024Albert Qian

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

5 Public speaking tips from TED - Visualized summarySpeakerHub

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

Getting into the tech field. what next Tessa Mero

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

Introduction to Data ScienceChristy Abraham Joy

Time Management & Productivity - Best PracticesVit Horky

The six step guide to practical project managementMindGenius

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36

Featured (20)

2024 State of Marketing Report – by Hubspot

Everything You Need To Know About ChatGPT

Product Design Trends in 2024 | Teenage Engineerings

How Race, Age and Gender Shape Attitudes Towards Mental Health

AI Trends in Creative Operations 2024 by Artwork Flow.pdf

Skeleton Culture Code

PEPSICO Presentation to CAGNY Conference Feb 2024

Content Methodology: A Best Practices Report (Webinar)

How to Prepare For a Successful Job Search for 2024

Social Media Marketing Trends 2024 // The Global Indie Insights

Trends In Paid Search: Navigating The Digital Landscape In 2024

5 Public speaking tips from TED - Visualized summary

ChatGPT and the Future of Work - Clark Boyd

Getting into the tech field. what next

Google's Just Not That Into You: Understanding Core Updates & Search Intent

How to have difficult conversations

Introduction to Data Science

Time Management & Productivity - Best Practices

The six step guide to practical project management

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...

ODS.ai Odessa Meetup #3: Object Detection in the Wild

1. Object Detection in the Wild Borys Tymchenko VITech (an embarrassingly simple approach)

2. What do we need to solve? - Detect objects - Detect their attributes - Track them in time

3. What do we need to solve in particular? - Detect people - Detect their clothes - Helmet - Vest - Glasses - Headphones - Track them in time - Challenging environment

4. Environment challenges us to be creative! - People can occlude each other - People can carry things - People can be occluded by environment - Tracks must be consistent - Clothes must be detected per person

5. Detecting clothes: three approaches - Top down approach: - Detect person - Within it detect clothes - Play around matching - Bottom-up approach - Detect clothes - Try to stitch them into person - Single pass - Detect person in the frame - Detect clothes in the frame - Match them together

6. Top-down and bottom-up downsides - Sensitive to occlusions - Unattended clothes items can lead to false positives - Overlapping people lead to false reconstructions - Temporal inconsistency

8. Doing everything in a single pass: the method Combine CenteNet and FCOS to detect simultaneously: - object classes - bounding boxes - attributes - tracking features

9. Overall approach - Predict everything for every pixel, sort out in post-processing

10. Training - Focal Loss on objectness head - Training target is gaussian with the value 1 in the center and 0 on the edge of the box - L1 loss on bbox coordinates head - Training target: for every pixel inside bbox there are 4 coordinates (lop, left, right, bottom) - Focal Loss on classes head - Training target is gaussian with the value 1 in the center and 0 on the edge of the box - Focal Loss on attributes head - Training target is gaussian with the value 1 in the center and 0 on the edge of the box - Cross-Entropy on features head - Training target: class for every object in the dataset

11. Inference - Select top-k peaks on the objectness heatmap - From their coordinates get values from other heatmaps - Apply scaling/sigmoid if needed Tracking? DeepSORT with provided features

12. Problem detected: heatmaps are not aligned Solution: Multiply inputs to all heads by objectness during training (cue: CenterNetv2)

13. What about data?

14. What about data? And how to validate this? - Pretrain on synthetic dataset (in-house) - Collect data from different locations, label everything) - Train jointly, validate separately - Ensure to leave some locations to validate if it generalizes well If your project is non-commercial, use CrowdHuman, COCO, etc. to pretrain

15. Results

16. Results

17. Useful papers - CenterNet: arxiv.org/abs/1904.07850 - FCOS: arxiv.org/abs/1904.01355 - CenterNet v2: arxiv.org/abs/2103.07461 - TTFNet: arxiv.org/abs/1909.00700 - YOLOv5: github.com/ultralytics/yolov5

18. Thanks for watching! Feel free to say “WT# was this?”! 18 ods.ai: spsancti t.me/spsancti fb.com/borys.tymc

ODS.ai Odessa Meetup #3: Object Detection in the Wild

Recommended

Recommended

More Related Content

Recently uploaded

Recently uploaded (20)

Featured

Featured (20)

ODS.ai Odessa Meetup #3: Object Detection in the Wild