Weakly supervised semantic segmentation using web crawled videos

•Download as PPTX, PDF•

1 like•113 views

Cheng-You Lu

Software

Goal
 Problem:
Model overly focuses on discriminative parts
rather than the entire object area
 Solution:
 Retrieving video and generating segmentation
labels from the retrieving video without human
intervention to simulate string supervised for
semantic segmentation

Video
 Estimate shape and extent of object in
video by motion
 Generating segmentation labels

Video
 Temporally ambiguous:
Some frames are not relevant to label
 Spatial ambiguity:
Some motion object not relevant to label

Weakly annotated images
 Image and clean class labels given manually
 Alleviate the ambiguities in video

Achitecture – two part
 Encoder for image classification and
discriminative localization
 Decoder for image segmentation

Section 3.1
: Based on Pre-trained VGG-16 network

Section 3.1
Remove full-connected layer
Place a new convolutional layer after the last
convolutional layer of VGG-16
For better ada-ptation to our task
Global average pooling followed by a fully-
connected layer at bottom

Section 3.2 - Eliminating
irrelevent frames
< Threshold -> Eliminate

Section 3.2 - Eliminating
irrelevent frames
 If more than 5 consecutive frames are
chosen, we consider them as a single
relevant video

Section 3.2 - spatio-temporal
segmentation
Graph-based optimization problem
i-th superpixel of frame t

Section 3.2 - spatio-temporal
segmentation
Estimate a binary label for

Section 3.2 - spatio-temporal
segmentation
By GMM
By Inside outside map

Section 3.2 - spatio-temporal
segmentation

Reference
 Weakly Supervised Semantic Segmentation
using Web-Crawled Videos ,CVPR2017

Similar to Weakly supervised semantic segmentation using web crawled videos

Mining Frequent Events From VideoSteffi Keran Rani J

L0956974IOSR Journals

Paper discussion:Video-to-Video Synthesis (NIPS 2018)Motaz Sabri

C1 mala1 akilaJasline Presilda

AaSeminar_Template.pptxManojGowdaKb

Motion detection in compressed video using macroblock classificationacijjournal

Optimal Repeated Frame Compensation Using Efficient Video CodingIOSR Journals

Explaining video summarization based on the focus of attentionVasileiosMezaris

Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...ijtsrd

2019-06-14:3 - Reti neurali e compressione videouninfoit

Recent advances in content based video copy detection (IEEE)PACE 2.0

International Journal of Computational Engineering Research(IJCER)ijceronline

Secured Data Transmission Using Video Steganographic SchemeIJERA Editor

Video content analysis and retrieval system using video storytelling and inde...IJECEIAES

VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATIONcscpconf

absorption, Cu2+ : glass, emission, excitation, XRDIJERA Editor

Improved Error Detection and Data Recovery Architecture for Motion Estimation...IJERA Editor

Effective Compression of Digital VideoIRJET Journal

A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...Best Jobs

White Paper - Mpeg 4 Toolkit ApproachAmos Kohn

Similar to Weakly supervised semantic segmentation using web crawled videos (20)

Mining Frequent Events From Video

L0956974

Paper discussion:Video-to-Video Synthesis (NIPS 2018)

C1 mala1 akila

AaSeminar_Template.pptx

Motion detection in compressed video using macroblock classification

Optimal Repeated Frame Compensation Using Efficient Video Coding

Explaining video summarization based on the focus of attention

Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...

2019-06-14:3 - Reti neurali e compressione video

Recent advances in content based video copy detection (IEEE)

International Journal of Computational Engineering Research(IJCER)

Secured Data Transmission Using Video Steganographic Scheme

Video content analysis and retrieval system using video storytelling and inde...

VISUAL ATTENTION BASED KEYFRAMES EXTRACTION AND VIDEO SUMMARIZATION

absorption, Cu2+ : glass, emission, excitation, XRD

Improved Error Detection and Data Recovery Architecture for Motion Estimation...

Effective Compression of Digital Video

A Segmentation Based Sequential Pattern Matching for Efficient Video Copy Det...

White Paper - Mpeg 4 Toolkit Approach

Recently uploaded

Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio, Inc.

Effective Strategies for Wix's Scaling challenges - GeeConNatan Silnitsky

Weeding your micro service landscape.pdftimtebeek1

Software Engineering - Introduction + Process Models + Requirements EngineeringPrakhyath Rai

Transformer Neural Network Use Cases with LinksJinanKordab

UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaNeo4j

Rapidoform for Modern Form Building and Insightsrapidoform

BusinessGPT - Security and Governance for Generative AIAGATSoftware

Test Automation Design Patterns_ A Comprehensive Guide.pdfkalichargn70th171

Community is Just as Important as Code by Andrea GouletAndrea Goulet

Abortion Clinic In Johannesburg ](+27832195400*)[ 🏥 Safe Abortion Pills in Jo...Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg

Microsoft365_Dev_Security_2024_05_16.pdfMarkus Moeller

Prompt Engineering - an Art, a Science, or your next Job Title?Maxim Salnikov

[GeeCON2024] How I learned to stop worrying and love the dark silicon apocalypseTomasz Kowalczewski

Workshop - Architecting Innovative Graph Applications- GraphSummit MilanNeo4j

Modern binary build systems - PyCon 2024Henry Schreiner

Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Flutter Agency

Abortion Clinic In Stanger ](+27832195400*)[ 🏥 Safe Abortion Pills In Stanger...Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg

Food Delivery Business App Development Guide 2024Chirag Panchal

[GRCPP] Introduction to concepts (C++20)Dimitrios Platis

Recently uploaded (20)

Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud

Effective Strategies for Wix's Scaling challenges - GeeCon

Weeding your micro service landscape.pdf

Software Engineering - Introduction + Process Models + Requirements Engineering

Transformer Neural Network Use Cases with Links

UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida

Rapidoform for Modern Form Building and Insights

BusinessGPT - Security and Governance for Generative AI

Test Automation Design Patterns_ A Comprehensive Guide.pdf

Community is Just as Important as Code by Andrea Goulet

Abortion Clinic In Johannesburg ](+27832195400*)[ 🏥 Safe Abortion Pills in Jo...

Microsoft365_Dev_Security_2024_05_16.pdf

Prompt Engineering - an Art, a Science, or your next Job Title?

[GeeCON2024] How I learned to stop worrying and love the dark silicon apocalypse

Workshop - Architecting Innovative Graph Applications- GraphSummit Milan

Modern binary build systems - PyCon 2024

Navigation in flutter – how to add stack, tab, and drawer navigators to your ...

Abortion Clinic In Stanger ](+27832195400*)[ 🏥 Safe Abortion Pills In Stanger...

Food Delivery Business App Development Guide 2024

[GRCPP] Introduction to concepts (C++20)

Weakly supervised semantic segmentation using web crawled videos

1. Weakly Supervised Semantic Segmentation using Web-Crawled Videos Author: Seunghoon Hong, Donghun Yeo,Suha Kwak, Honglak Lee, Bohyung Han Publish: CVPR2017

2. Goal  Problem: Model overly focuses on discriminative parts rather than the entire object area  Solution:  Retrieving video and generating segmentation labels from the retrieving video without human intervention to simulate string supervised for semantic segmentation

3. Video  Estimate shape and extent of object in video by motion  Generating segmentation labels

4. Video  Temporally ambiguous: Some frames are not relevant to label  Spatial ambiguity: Some motion object not relevant to label

5. Weakly annotated images  Image and clean class labels given manually  Alleviate the ambiguities in video

6. Achitecture – two part  Encoder for image classification and discriminative localization  Decoder for image segmentation

8. Section 3.1 : Based on Pre-trained VGG-16 network

9. Section 3.1 Remove full-connected layer Place a new convolutional layer after the last convolutional layer of VGG-16 For better ada-ptation to our task Global average pooling followed by a fully- connected layer at bottom

10. Global average pooling

11. Section 3.1

12. Attention - Class Activation Mapping

13. Section 3.2 - Eliminating irrelevent frames < Threshold -> Eliminate

14. Section 3.2 - Eliminating irrelevent frames  If more than 5 consecutive frames are chosen, we consider them as a single relevant video

15. Section 3.2 - spatio-temporal segmentation Graph-based optimization problem i-th superpixel of frame t

16. Section 3.2 - spatio-temporal segmentation Estimate a binary label for

17. Section 3.2 - spatio-temporal segmentation By GMM By Inside outside map

18. Section 3.2 – Attention

19. Section 3.2 - spatio-temporal segmentation

20. Section 3.3 – Learning with video

21. Result

22. Reference  Weakly Supervised Semantic Segmentation using Web-Crawled Videos ,CVPR2017

Weakly supervised semantic segmentation using web crawled videos

Recommended

Recommended

More Related Content

Similar to Weakly supervised semantic segmentation using web crawled videos

Similar to Weakly supervised semantic segmentation using web crawled videos (20)

Recently uploaded

Recently uploaded (20)

Weakly supervised semantic segmentation using web crawled videos