R-FCN.pptx

•Download as PPTX, PDF•

0 likes•117 views

R-FCN is a two-stage object detection network that addresses the translation invariance vs variance dilemma. It uses position-sensitive score maps and RoI pooling to classify objects. Position-sensitive score maps are produced from a convolutional layer and are specialized for different locations within objects. Position-sensitive RoI pooling pools only over the relevant score map for each RoI bin. Bounding box regression is also performed using position-sensitive techniques. R-FCN achieves state-of-the-art object detection performance while being faster than Faster R-CNN since it removes unnecessary RoI pooling layers.

Engineering

R-FCN: Object Detection via Region-based
Fully Convolutional Networks
2022/04/19, Changjin Lee

Introduction
Translation Invariance Translation Variance
DOG
Classification Detection
dilemma

Introduction
● Two-stage object detection networks have two subnetworks
○ Shared fully convolutional subnetwork independent of RoIs
○ RoI-wise subnetwork that does not share computation
● RoI pooling layer is unnaturally inserted to address invariance vs variance dilemma
○ Sacrifices training and testing efficiency since it introduces a considerable number of
region-wise layers -> each RoI goes through classification layer

R-FCN Specialized feature map in detecting top-left
corner of cat
k
k

R-FCN
position-sensitive
score maps
position-sensitive
RoI pooling layer

R-FCN vs Faster R-CNN
Faster R-CNN
R-FCN
conv layer
NO conv layer

Position-sensitive score maps
● Attach a convolutional layer on top of feature map to produce k^2(C+1) position-
sensitive score maps
● For each class C, k^2 feature maps are produced
○ feature map specialized for (top-left, top-middle,...) locations of an object
k
k

Position-sensitive RoI Pooling
● Each RoI rectangle is divided into k x k bins
○ For w x h RoI, each bin has size of (w/k x h/k)
● For each (i, j)th bin, position-sensitive RoI pooling pools only over the (i,j)th score map
RoI
0 1 2
0
1
2
one score map out of k^2(C+1) score
maps
top_left corner of an
ROI
# of pixels in the bin
(123, 245)

Position-sensitive RoI Pooling
-> For each class!
For each class: [0.47, 0.77, 0.18, ….], n= C+1 classes

Voting
softmax
● For each class, obtain average class score
○ Total (C+1) class scores
● Apply softmax to determine the classification result

Bounding box regression
● Aside from the k^2(C+1)-d conv layer, a sibling 4k^2 sibling conv layer for bounding box regression
is appended
○ produce 4k^2-d vector for each RoI
● Then, it’s aggregated into 4-d vector by average voting.
● 4-d vector parameterizes (t_x, t_y, t_w, t_h)

References
[1]https://arxiv.org/abs/1605.06409
[2]https://jonathan-hui.medium.com/understanding-region-based-fully-
convolutional-networks-r-fcn-for-object-detection-828316f07c99

Similar to R-FCN.pptx

Deep image retrieval - learning global representations for image search - ub ...

Universitat de Barcelona

SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

ynxm25hpxp

Localization and classification. Overfeat: class agnostic versu class specific localization, fully convolutional neural networks, greedy merge strategy. Multiobject detection. Region proposal and selective search. R-CNN, Fast R-CNN, Faster R-CNN and YOLO. Image segmentation. Semantic segmentation and transposed convolutions. Instance segmentation and Mask R-CNN. Image captioning. Recurrent Neural Networks (RNNs). Language generation. Long Short Term Memory (LSTMs). DeepImageSent, Show and Tell, and Show, Attend and Tell algorithms.

MLIP - Chapter 5 - Detection, Segmentation, Captioning

Charles Deledalle

Slides by Albert Jimenez about the following paper: Gordo, Albert, Jon Almazan, Jerome Revaud, and Diane Larlus. "Deep Image Retrieval: Learning global representations for image search." arXiv preprint arXiv:1604.01325 (2016). We propose a novel approach for instance-level image retrieval. It produces a global and compact fixed-length representation for each image by aggregating many region-wise descriptors. In contrast to previous works employing pre-trained deep networks as a black box to produce features, our method leverages a deep architecture trained for the specific task of image retrieval. Our contribution is twofold: (i) we introduce a ranking framework to learn convolution and projection weights that are used to build the region features; and (ii) we employ a region proposal network to learn which regions should be pooled to form the final global descriptor. We show that using clean training data is key to the success of our approach. To that aim, we leverage a large scale but noisy landmark dataset and develop an automatic cleaning approach. The proposed architecture produces a global image representation in a single forward pass. Our approach significantly outperforms previous approaches based on global descriptors on standard datasets. It even surpasses most prior works based on costly local descriptor indexing and spatial verification. We intend to release our pre-trained model.

Deep image retrieval learning global representations for image search

Universitat Politècnica de Catalunya

D3L4-objects.pdf

ssusere945ae

Cheatsheet convolutional-neural-networks

Steve Nouri

150807 Fast R-CNN

Junho Cho

[MIRU2018] Global Average Poolingの特性を用いたAttention Branch Network

Hiroshi Fukui

Conditional Image Generation with PixelCNN Decoders

suga93

http://imatge-upc.github.io/telecombcn-2016-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Universitat Politècnica de Catalunya

Image Texture Analysis

lalitxp

B spline surfeces

ramac123

https://telecombcn-dl.github.io/2018-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or image captioning.

Object Detection - Míriam Bellver - UPC Barcelona 2018

Universitat Politècnica de Catalunya

Densebox

冠宇陳

For the full video of this presentation, please visit: https://www.embedded-vision.com/platinum-members/cadence/embedded-vision-training/videos/pages/may-2019-embedded-vision-summit-gadkari For more information about embedded vision, please visit: http://www.embedded-vision.com Shrinivas Gadkari, Design Engineering Director at Cadence, presents the "Fundamentals of Monocular SLAM" tutorial at the May 2019 Embedded Vision Summit. Simultaneous Localization and Mapping (SLAM) refers to a class of algorithms that enables a device with one or more cameras and/or other sensors to create an accurate map of its surroundings, to determine the device’s location relative to its surroundings and to track its path as it moves through this environment. This is a key capability for many new use cases and applications, especially in the domains of augmented reality, virtual reality and mobile robots. Monocular SLAM is a type of SLAM that relies exclusively on a monocular image sequence captured by a moving camera. In this talk, Gadkari introduces the fundamentals of monocular SLAM algorithms, from input images to 3D map. He takes a close look at key components of monocular SLAM algorithms, including Oriented Fast and Oriented Brief (ORB), Fundamental Matrix-based Pose Estimation, stitching together poses using translation estimation and loop closure. He also discusses implementation considerations for these components, including arithmetic precision required to achieve acceptable mapping and tracking accuracy.

"Fundamentals of Monocular SLAM," a Presentation from Cadence

Edge AI and Vision Alliance

The world of loss function

홍배 김

Bouguet's MatLab Camera Calibration Toolbox

Yuji Oyamada

Quasi Cyclic LDPC codes - Algebraic Construction

Eapen Vpp

VHDL and Cordic Algorithim

Subeer Rangra

위성이미지 객체 검출 대회 - 2등

DACON AI 데이콘

Similar to R-FCN.pptx (20)

Deep image retrieval - learning global representations for image search - ub ...

SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

MLIP - Chapter 5 - Detection, Segmentation, Captioning

Deep image retrieval learning global representations for image search

D3L4-objects.pdf

Cheatsheet convolutional-neural-networks

150807 Fast R-CNN

[MIRU2018] Global Average Poolingの特性を用いたAttention Branch Network

Conditional Image Generation with PixelCNN Decoders

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Image Texture Analysis

B spline surfeces

Object Detection - Míriam Bellver - UPC Barcelona 2018

Densebox

"Fundamentals of Monocular SLAM," a Presentation from Cadence

The world of loss function

Bouguet's MatLab Camera Calibration Toolbox

Quasi Cyclic LDPC codes - Algebraic Construction

VHDL and Cordic Algorithim

위성이미지 객체 검출 대회 - 2등

Recently uploaded

AIRCANVAS[1].pdf mini project for btech students

vanyagupta248

DeepFakes presentation : brief idea of DeepFakes

MayuraD1

PE 459 LECTURE 2- natural gas basic concepts and properties

sarkmank1

“HOSTEL MANAGEMENT SYSTEM” is a software developed for managing various activities in the hostel. For the past few years the number of educational institutions are increasing rapidly. Thereby the number of hostels are also increasing for the accommodation of the students studying in this institution. And hence there is a lot of strain on the person who are running the hostel and software’s are not usually used in this context. This particular project deals with the problems on managing a hostel and avoids the problems which occur when carried manually.

Hostel management system project report..pdf

Kamal Acharya

Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx

MuhammadAsimMuhammad6

Work-Permit-Receiver-in-Saudi-Aramco.pptx

JuliansyahHarahap1

Model Call Girl Services in Delhi reach out to us at 🔝 9953056974🔝✔️✔️ Our agency presents a selection of young, charming call girls available for bookings at Oyo Hotels. Experience high-class escort services at pocket-friendly rates, with our female escorts exuding both beauty and a delightful personality, ready to meet your desires. Whether it's Housewives, College girls, Russian girls, Muslim girls, or any other preference, we offer a diverse range of options to cater to your tastes. We provide both in- call and out-call services for your convenience. Our in-call location in Delhi ensures cleanliness, hygiene, and 100% safety, while our out-call services offer doorstep delivery for added ease. We value your time and money, hence we kindly request pic collectors, time-passers, and bargain hunters to refrain from contacting us. Our services feature various packages at competitive rates: One shot: ₹2000/in-call, ₹5000/out-call Two shots with one girl: ₹3500 /in-call, ₱6000/out-call Body to body massage with sex: ₱3000/in-call Full night for one person: ₱7000/in-call, ₱10000/out-call Full night for more than 1 person : Contact us at 🔝 9953056974🔝. for details Operating 24/7, we serve various locations in Delhi, including Green Park, Lajpat Nagar, Saket, and Hauz Khas near metro stations. For premium call girl services in Delhi 🔝 9953056974🔝. Thank you for considering us

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7

9953056974 Low Rate Call Girls In Saket, Delhi NCR

Saudi Arabia [ Abortion pills) Jeddah/riaydh/dammam/+918761049707☎️] cytotec tablets uses abortion pills 💊💊 How effective is the abortion pill? 💊💊 +918761049707) "Abortion pills in Jeddah" how to get cytotec tablets in Riyadh " Abortion pills in dammam*💊💊 The abortion pill is very effective. If you’re taking mifepristone and misoprostol, it depends on how far along the pregnancy is, and how many doses of medicine you take:💊💊 +918761049707) how to buy cytotec pills At 8 weeks pregnant or less, it works about 94-98% of the time. +918761049707[ 💊💊💊 At 8-9 weeks pregnant, it works about 94-96% of the time. +918761049707) At 9-10 weeks pregnant, it works about 91-93% of the time. +918761049707)💊💊 If you take an extra dose of misoprostol, it works about 99% of the time. At 10-11 weeks pregnant, it works about 87% of the time. +918761049707) If you take an extra dose of misoprostol, it works about 98% of the time. In general, taking both mifepristone and+918761049707 misoprostol works a bit better than taking misoprostol only. +918761049707 Taking misoprostol alone works to end the+918761049707 pregnancy about 85-95% of the time — depending on how far along the+918761049707 pregnancy is and how you take the medicine. +918761049707 The abortion pill usually works, but if it doesn’t, you can take more medicine or have an in-clinic abortion. +918761049707 When can I take the abortion pill?+918761049707 In general, you can have a medication abortion up to 77 days (11 weeks)+918761049707 after the first day of your last period. If it’s been 78 days or more since the first day of your last+918761049707period, you can have an in-clinic abortion to end your pregnancy.+918761049707 Why do people choose the abortion pill? Which kind of abortion you choose all depends on your personal+918761049707preference and situation. With+918761049707medication+918761049707 abortion, some people like that you don’t need to have a procedure in a doctor’s office. You can have your medication abortion on your own+918761049707schedule, at home or in another comfortable place that you choose.+918761049707 You get to decide who you want to be with during your abortion, or you can go it alone. Because+918761049707medication abortion is similar to a miscarriage, many people feel like it’s more “natural” and less invasive. And some+918761049707 people may not have an in-clinic abortion provider close by, so abortion pills are more available to+918761049707 them. +918761049707 Your doctor, nurse, or health center staff can help you decide which kind of abortion is best for you. More questions from patients: Saudi Arabia+918761049707 CYTOTEC Misoprostol Tablets. Misoprostol is a medication that can prevent stomach ulcers if you also take NSAID medications. It reduces the amount of acid in your stomach, which protects your stomach lining. The brand name of this medication is Cytotec®️.+918761049707) Unwanted Kit is a combination of two medicines, which is used for

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait

jaanualu31

HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR

KOUSTAV SARKAR

The project Hospital Management system includes registration of patients, storing their details into the system, and also computerized billing in the pharmacy, and labs. The software has the facility to give a unique id for every patient and stores the details of every patient and the staff automatically. It includes a search facility to know the current status of each room. User can search availability of a doctor and the details of a patient using the id. The Hospital Management System can be entered using a username and password. It is accessible either by an administrator or receptionist. Only they can add data into the database. The data can be retrieved easily. The interface is very user-friendly. The data are well protected for personal use and makes the data processing very fast. Hospital Management System is powerful, flexible, and easy to use and is designed and developed to deliver real conceivable benefits to hospitals. Hospital Management System is designed for multispecialty hospitals, to cover a wide range of hospital administration and management processes. It is an integrated end-to-end Hospital Management System that provides relevant information across the hospital to support effective decision making for patient care, hospital administration and critical financial accounting, in a seamless flow. Hospital Management System is a software product suite designed to improve the quality and management of hospital management in the areas of clinical process analysis and activity-based costing. Hospital Management System enables you to develop your organization and improve its effectiveness and quality of work. Managing the key processes efficiently is critical to the success of the hospital helps you manage your processes.

Hospital management system project report.pdf

Kamal Acharya

LECTURE 01 Introduction to Computers Computers in Society Components of a Computer Types of Computers Definition: “A computer is an electronic device that manipulates information, or data. It has the ability to store, retrieve, and process data” The word "computer" is derived from the Latin word "computare," which means "to calculate" Computers are used for Businesses Communication Entertainment Education Medical Field

Computer Lecture 01.pptxIntroduction to Computers

MairaAshraf6

Online Food Ordering System is proposed for simplifies the food ordering process. ThisSystem shows an user interface and update the menu with all available options so that it eases thecustomer work. Customer can choose more than one item to make an order and can view Orderdetails before logging off. The order confirmation is sent to the customer. The order is placed inthe queue and updated in the Database and returned in real time. This system assists the staff togo through the orders in real time and process it efficiently. Online food order system is mainlydesigned primarily function for use in the food delivery industry. This system will allowhotels and restaurants to increase online food ordering such type of business. The customerscan be selected food menu items just few minutes. In the modern food industries allows toquickly and easily delivery on customer place. Restaurant employees then use these ordersthrough an easy to delivery on customer place easy find out navigate graphical interface forefficient processing .

Online food ordering system project report.pdf

Kamal Acharya

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

ssuser89054b

GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE

selvakumar948

Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil

Cara Menggugurkan Kandungan 087776558899

Wadi Rum luxhotel lodge Analysis case study.pptx

NadaHaitham1

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads

Arindam Chakraborty, Ph.D., P.E. (CA, TX)

NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...

Amil baba

Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRLS IN bhubaneswar ESCORT SERVICE❤CALL GIRL No Advance Pay 💋💋 Book Now +91-9777949614 Discover Bhubaneswar Finest Call GirlsY ou're heading to Bhubaneswar for business or pleasure and want to experience the city to the fullest. What better way than in the company of a beautiful, intelligent companion to show you around town? Look no further than Bhubaneswar's finest Call Girls. These lovely ladies know all the hot spots and hidden gems that only locals are privy to. From the top restaurants and bars to the most stunning sights, they'll ensure you make unforgettable memories.

Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...

Call Girls Mumbai

1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf

AldoGarca30

Recently uploaded (20)

AIRCANVAS[1].pdf mini project for btech students

DeepFakes presentation : brief idea of DeepFakes

PE 459 LECTURE 2- natural gas basic concepts and properties

Hostel management system project report..pdf

Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx

Work-Permit-Receiver-in-Saudi-Aramco.pptx

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait

HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR

Hospital management system project report.pdf

Computer Lecture 01.pptxIntroduction to Computers

Online food ordering system project report.pdf

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE

Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil

Wadi Rum luxhotel lodge Analysis case study.pptx

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads

NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...

Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...

1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf

R-FCN.pptx

1. R-FCN: Object Detection via Region-based Fully Convolutional Networks 2022/04/19, Changjin Lee

2. Introduction Translation Invariance Translation Variance DOG Classification Detection dilemma

3. Introduction ● Two-stage object detection networks have two subnetworks ○ Shared fully convolutional subnetwork independent of RoIs ○ RoI-wise subnetwork that does not share computation ● RoI pooling layer is unnaturally inserted to address invariance vs variance dilemma ○ Sacrifices training and testing efficiency since it introduces a considerable number of region-wise layers -> each RoI goes through classification layer

4. R-FCN Specialized feature map in detecting top-left corner of cat k k

5. R-FCN position-sensitive score maps position-sensitive RoI pooling layer

6. R-FCN vs Faster R-CNN Faster R-CNN R-FCN conv layer NO conv layer

7. Position-sensitive score maps ● Attach a convolutional layer on top of feature map to produce k^2(C+1) position- sensitive score maps ● For each class C, k^2 feature maps are produced ○ feature map specialized for (top-left, top-middle,...) locations of an object k k

8. Position-sensitive score maps

9. Position-sensitive RoI Pooling ● Each RoI rectangle is divided into k x k bins ○ For w x h RoI, each bin has size of (w/k x h/k) ● For each (i, j)th bin, position-sensitive RoI pooling pools only over the (i,j)th score map RoI 0 1 2 0 1 2 one score map out of k^2(C+1) score maps top_left corner of an ROI # of pixels in the bin (123, 245)

10. Position-sensitive RoI Pooling -> For each class! For each class: [0.47, 0.77, 0.18, ….], n= C+1 classes

11. Voting softmax ● For each class, obtain average class score ○ Total (C+1) class scores ● Apply softmax to determine the classification result

12. Position-sensitive RoI Pooling

13. Position-sensitive RoI Pooling

14. Bounding box regression ● Aside from the k^2(C+1)-d conv layer, a sibling 4k^2 sibling conv layer for bounding box regression is appended ○ produce 4k^2-d vector for each RoI ● Then, it’s aggregated into 4-d vector by average voting. ● 4-d vector parameterizes (t_x, t_y, t_w, t_h)

15. R-FCN Recap

16. Performances

17. References [1]https://arxiv.org/abs/1605.06409 [2]https://jonathan-hui.medium.com/understanding-region-based-fully- convolutional-networks-r-fcn-for-object-detection-828316f07c99

R-FCN.pptx

Recommended

Recommended

More Related Content

Similar to R-FCN.pptx

Similar to R-FCN.pptx (20)

More from Changjin Lee

More from Changjin Lee (6)

Recently uploaded

Recently uploaded (20)

R-FCN.pptx