PR-365: Fast object detection in compressed video

•

0 likes•165 views

이번 PR12 365번째 논문으로 소개드릴 내용은 조금 특이한 접근법입니다. 우리가 실생활에서 접하는 대부분의 비디오는 Compressed 된 형태의 Video인데요, 실제 Computer Vision Task에서 input이 Compressed Video라는 가정을 하게 되면 생각보다 큰 이점을 얻을 수 있습니다. 바로 Compressed Video에는 Motion Vector가 포함되어있다는 점입니다. 이를 이용하면 생각보다 많은 것들을 할 수 있게 됩니다. 그 예시로 Object Detection의 연산량을 크게 줄인 case를 하나 소개드려보고자 합니다. paper link: https://openaccess.thecvf.com/content_ICCV_2019/html/Wang_Fast_Object_Detection_in_Compressed_Video_ICCV_2019_paper.html video link: https://youtu.be/9n6OtHtJvJ0

Engineering

Fast Object Detection in Compressed Video
Hyeongmin Lee
Yonsei University
PR-365
2022.1.9

H.264 Codec
 Codec? [PR-340]
0011010100111...

H.264 Codec
 Decoding Time Stamp (DTS)
Intra Coding: I-Frame compression
Inter Coding: P-Frame compression

H.264 Codec
 Inter Compression
𝑰𝑰𝟏𝟏, 𝑰𝑰𝟐𝟐, 𝑰𝑰𝟑𝟑, 𝑰𝑰𝟒𝟒, … , 𝑰𝑰𝑵𝑵
𝑰𝑰𝟏𝟏, 𝑹𝑹𝟐𝟐, 𝑹𝑹𝟑𝟑, 𝑹𝑹𝟒𝟒, … , 𝑹𝑹𝑵𝑵
𝑭𝑭𝟏𝟏𝟏𝟏, 𝑭𝑭𝟐𝟐𝟐𝟐, 𝑭𝑭𝟑𝟑𝟑𝟑, … , 𝑭𝑭𝑵𝑵−𝟏𝟏,𝑵𝑵

H.264 Codec
 Traditional Video Compression
• H.264
• H.265 (HEVC)
1
𝒙𝒙𝒕𝒕 �
𝒙𝒙𝒕𝒕−𝟏𝟏
Motion Estimation
𝒗𝒗𝒕𝒕
Entropy Coding
11000110101

H.264 Codec
 Traditional Video Compression
• H.264
• H.265 (HEVC)
2
�
𝒙𝒙𝒕𝒕−𝟏𝟏
Warping
�
𝒙𝒙𝒕𝒕
𝒗𝒗𝒕𝒕

H.264 Codec
 Traditional Video Compression
• H.264
• H.265 (HEVC)
3
𝒓𝒓𝒕𝒕 = 𝒙𝒙𝒕𝒕 − �
𝒙𝒙𝒕𝒕

H.264 Codec
 Traditional Video Compression
• H.264
• H.265 (HEVC)
4
𝒓𝒓𝒕𝒕
Transform
𝒚𝒚𝒕𝒕
Quantization
�
𝒚𝒚𝒕𝒕
Inverse
Transform
�
𝒓𝒓𝒕𝒕
Entropy Coding
11000110101
Buffer
�
𝒙𝒙𝒕𝒕 = �
𝒙𝒙𝒕𝒕 + �
𝒓𝒓𝒕𝒕

H.264 Codec
 Inter Compression
𝑰𝑰𝟏𝟏, 𝑰𝑰𝟐𝟐, 𝑰𝑰𝟑𝟑, 𝑰𝑰𝟒𝟒, … , 𝑰𝑰𝑵𝑵
𝑰𝑰𝟏𝟏, 𝑹𝑹𝟐𝟐, 𝑹𝑹𝟑𝟑, 𝑹𝑹𝟒𝟒, … , 𝑹𝑹𝑵𝑵
𝑭𝑭𝟏𝟏𝟏𝟏, 𝑭𝑭𝟐𝟐𝟐𝟐, 𝑭𝑭𝟑𝟑𝟑𝟑, … , 𝑭𝑭𝑵𝑵−𝟏𝟏,𝑵𝑵
𝑰𝑰𝟏𝟏, 𝑹𝑹𝟐𝟐, 𝑹𝑹𝟑𝟑, 𝑹𝑹𝟒𝟒, … , 𝑹𝑹𝑵𝑵
𝑭𝑭𝟏𝟏𝟏𝟏, 𝑭𝑭𝟐𝟐𝟐𝟐, 𝑭𝑭𝟑𝟑𝟑𝟑, … , 𝑭𝑭𝑵𝑵−𝟏𝟏,𝑵𝑵
𝑰𝑰𝟏𝟏, 𝑰𝑰𝟐𝟐, 𝑰𝑰𝟑𝟑, 𝑰𝑰𝟒𝟒, … , 𝑰𝑰𝑵𝑵
11000110101
Encoder
(Premiere Pro)
Decoder
(Viewer)
[Channel]

Fast Object Detection in
Compressed Video

Fast Object Detection in Compressed Video
 Overview

Fast Object Detection in Compressed Video
 Network Structure
Feature Extractor Memory Network Detection Network

Fast Object Detection in Compressed Video
 Pyramidal Feature Attention
ResNet-101

Fast Object Detection in Compressed Video
 Motion-aided LSTM
[Motion-Based Warping]
[LSTM]

Experiments
 Ablation Study – motion alignment

What's hot

Lzw coding technique for image compressionTata Consultancy Services

Algorithm of standard videocodec H.264 chintapallisantoshkumar

Robust Watermarking of Video StreamsTamás Polyák

h.264 video compression standard.Videoguy

Speech Compression using LPCDisha Modi

New 47 lcd hdtv 1080 p 120hz b stockmathrixpolo

martelli.pptVideoguy

Introduction to video reverse engineeringVittorio Giovara

Porting To SymbianMark Wilcox

Speech technology basicsHemaraja Nayaka S

Speaker Segmentation (2006)Luís Gustavo Martins

Real time SHVC decoderwassim hamidouche

What's hot (12)

Lzw coding technique for image compression

Algorithm of standard videocodec H.264

Robust Watermarking of Video Streams

h.264 video compression standard.

Speech Compression using LPC

New 47 lcd hdtv 1080 p 120hz b stock

martelli.ppt

Introduction to video reverse engineering

Porting To Symbian

Speech technology basics

Speaker Segmentation (2006)

Real time SHVC decoder

Similar to PR-365: Fast object detection in compressed video

H263.pptVideoguy

Video Compression Standards - History & IntroductionChamp Yen

Emerging H.264 Standard:Videoguy

Video coding standards pptLokesh Reddy Avula

An Overview of High Efficiency Video Codec HEVC (H.265)Varun Ravi

Performance Analysis of Various Video Compression TechniquesInternational Journal of Science and Research (IJSR)

Encoding at Scale for Live Video StreamingRay Adensamer

Iain Richardson: An Introduction to Video CompressionIain Richardson

H.264 vs HEVCMarcin Walendowski

The H.265/MPEG-HEVC StandardIMTC

H.264 video standardSajan Sahu

Barcelona keynote webPptblog Pptblogcom

A short history of video codingIain Richardson

Dcpflipbook

Emerging H.264 Standard: Overview and TMS320DM642- Based ...Videoguy

Compressed Video QualityIain Richardson

從音樂走向影音服務 - KKBOX 的影音之路奮鬥史 - 序章Shuen-Huei Guan

E010132529IOSR Journals

Spatial Scalable Video Compression Using H.264IOSR Journals

Video Compression TechnologyTong Teerayuth

Similar to PR-365: Fast object detection in compressed video (20)

H263.ppt

Video Compression Standards - History & Introduction

Emerging H.264 Standard:

Video coding standards ppt

An Overview of High Efficiency Video Codec HEVC (H.265)

Performance Analysis of Various Video Compression Techniques

Encoding at Scale for Live Video Streaming

Iain Richardson: An Introduction to Video Compression

H.264 vs HEVC

The H.265/MPEG-HEVC Standard

H.264 video standard

Barcelona keynote web

A short history of video coding

Dcp

Emerging H.264 Standard: Overview and TMS320DM642- Based ...

Compressed Video Quality

從音樂走向影音服務 - KKBOX 的影音之路奮鬥史 - 序章

E010132529

Spatial Scalable Video Compression Using H.264

Video Compression Technology

Recently uploaded

Internship report on mechanical engineeringmalavadedarshan25

Biology for Computer Engineers Course Handout.pptxDeepakSakkari2

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEslot gacor bisa pakai pulsa

Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis

HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla

What are the advantages and disadvantages of membrane structures.pptxwendy cai

Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR9953056974 Low Rate Call Girls In Saket, Delhi NCR

SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome

Current Transformer Drawing and GTP for MSETCLDeelipZope

Introduction to Multiple Access Protocol.pptxupamatechverse

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha

Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan

ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

Recently uploaded (20)

Internship report on mechanical engineering

Biology for Computer Engineers Course Handout.pptx

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE

Processing & Properties of Floor and Wall Tiles.pptx

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...

HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS

What are the advantages and disadvantages of membrane structures.pptx

Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR

SPICE PARK APR2024 ( 6,793 SPICE Models )

Current Transformer Drawing and GTP for MSETCL

Introduction to Multiple Access Protocol.pptx

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx

Coefficient of Thermal Expansion and their Importance.pptx

ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service

PR-365: Fast object detection in compressed video

1. Fast Object Detection in Compressed Video Hyeongmin Lee Yonsei University PR-365 2022.1.9

3. H.264 Codec

4. H.264 Codec  Codec? [PR-340] 0011010100111...

5. H.264 Codec  Decoding Time Stamp (DTS) Intra Coding: I-Frame compression Inter Coding: P-Frame compression

6. H.264 Codec  Inter Compression 𝑰𝑰𝟏𝟏, 𝑰𝑰𝟐𝟐, 𝑰𝑰𝟑𝟑, 𝑰𝑰𝟒𝟒, … , 𝑰𝑰𝑵𝑵 𝑰𝑰𝟏𝟏, 𝑹𝑹𝟐𝟐, 𝑹𝑹𝟑𝟑, 𝑹𝑹𝟒𝟒, … , 𝑹𝑹𝑵𝑵 𝑭𝑭𝟏𝟏𝟏𝟏, 𝑭𝑭𝟐𝟐𝟐𝟐, 𝑭𝑭𝟑𝟑𝟑𝟑, … , 𝑭𝑭𝑵𝑵−𝟏𝟏,𝑵𝑵

7. H.264 Codec  Traditional Video Compression • H.264 • H.265 (HEVC) 1 𝒙𝒙𝒕𝒕 � 𝒙𝒙𝒕𝒕−𝟏𝟏 Motion Estimation 𝒗𝒗𝒕𝒕 Entropy Coding 11000110101

8. H.264 Codec  Traditional Video Compression • H.264 • H.265 (HEVC) 2 � 𝒙𝒙𝒕𝒕−𝟏𝟏 Warping � 𝒙𝒙𝒕𝒕 𝒗𝒗𝒕𝒕

9. H.264 Codec  Traditional Video Compression • H.264 • H.265 (HEVC) 3 𝒓𝒓𝒕𝒕 = 𝒙𝒙𝒕𝒕 − � 𝒙𝒙𝒕𝒕

10. H.264 Codec  Traditional Video Compression • H.264 • H.265 (HEVC) 4 𝒓𝒓𝒕𝒕 Transform 𝒚𝒚𝒕𝒕 Quantization � 𝒚𝒚𝒕𝒕 Inverse Transform � 𝒓𝒓𝒕𝒕 Entropy Coding 11000110101 Buffer � 𝒙𝒙𝒕𝒕 = � 𝒙𝒙𝒕𝒕 + � 𝒓𝒓𝒕𝒕

11. H.264 Codec  Inter Compression 𝑰𝑰𝟏𝟏, 𝑰𝑰𝟐𝟐, 𝑰𝑰𝟑𝟑, 𝑰𝑰𝟒𝟒, … , 𝑰𝑰𝑵𝑵 𝑰𝑰𝟏𝟏, 𝑹𝑹𝟐𝟐, 𝑹𝑹𝟑𝟑, 𝑹𝑹𝟒𝟒, … , 𝑹𝑹𝑵𝑵 𝑭𝑭𝟏𝟏𝟏𝟏, 𝑭𝑭𝟐𝟐𝟐𝟐, 𝑭𝑭𝟑𝟑𝟑𝟑, … , 𝑭𝑭𝑵𝑵−𝟏𝟏,𝑵𝑵 𝑰𝑰𝟏𝟏, 𝑹𝑹𝟐𝟐, 𝑹𝑹𝟑𝟑, 𝑹𝑹𝟒𝟒, … , 𝑹𝑹𝑵𝑵 𝑭𝑭𝟏𝟏𝟏𝟏, 𝑭𝑭𝟐𝟐𝟐𝟐, 𝑭𝑭𝟑𝟑𝟑𝟑, … , 𝑭𝑭𝑵𝑵−𝟏𝟏,𝑵𝑵 𝑰𝑰𝟏𝟏, 𝑰𝑰𝟐𝟐, 𝑰𝑰𝟑𝟑, 𝑰𝑰𝟒𝟒, … , 𝑰𝑰𝑵𝑵 11000110101 Encoder (Premiere Pro) Decoder (Viewer) [Channel]

12. Fast Object Detection in Compressed Video

13. Fast Object Detection in Compressed Video  Overview

14. Fast Object Detection in Compressed Video  Network Structure Feature Extractor Memory Network Detection Network

15. Fast Object Detection in Compressed Video  Pyramidal Feature Attention ResNet-101

16. Fast Object Detection in Compressed Video  Motion-aided LSTM [Motion-Based Warping] [LSTM]

17. Experiments

18. Experiments  Ablation Study

19. Experiments  Ablation Study – motion alignment

20. Experiments  vs Flownet

21. Experiments  Comparisons

22. Thank You!