PiCANet, Pytorch Implementation (Korean)

Paper implementation and attention map observations

Contents
• The Pixel-wise contextual attention network paper
• PiCANet Architecture
• Structure of the saliency detection network built on PiCANet
• Encoder
• Decoder
• Decoder Cell
• Implementation details
• Interpreting the results
• Limitations and future improvements

Pixel-wise contextual attention network
• A network that represents the contextual information held by each pixel as an attention map

Pixel-wise contextual attention network
• Intuition: other pixels differ in how much they influence the information at a given pixel.
• Goal: learn the context information between pixels.
• Implemented on the basis of soft attention.

• Global attention and local attention

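Global PiCANet Architecture
[Figure: Global PiCANet architecture. Labels: the original feature map; each pixel's attention, i.e. the Attended Contextual Feature (ACF) map with H * W channels; panels (a) and (b)-softmax.]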

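$D = H \times W$
$\alpha^{w,h} = \mathrm{reshape}(\alpha[w, h])$: $(1, D) \to (H, W)$ 2D array
$\alpha^{w,h} \cdot F = \sum_{i,j} \alpha^{w,h}_{i,j} F_{:,i,j}$, where $F$ has $C$ channels
Result shape $(C, 1, 1)$: the attended pixel at $(w, h)$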

[Figure: each pixel's attention (1 channel, after reshape) and the feature map (C channels) are combined by a weighted sum into a single pixel with C channels; there are H * W such pixels.]
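The weighted sum for one target pixel can be sketched in plain Python as follows. This is a toy illustration rather than the deck's PyTorch code: the helper names `softmax` and `global_attend` are hypothetical, and the feature map is flattened to D pixels with C channels each.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a flat list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def global_attend(features, logits):
    """features: D pixels, each a list of C channel values.
    logits: D unnormalised attention scores for ONE target pixel (w, h).
    Returns the attended pixel as a list of C values."""
    alpha = softmax(logits)  # D weights that sum to 1
    channels = len(features[0])
    return [sum(a * f[c] for a, f in zip(alpha, features))
            for c in range(channels)]

# Toy case: H = W = 2, so D = 4 context pixels, each with C = 2 channels.
features = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [4.0, 0.0]]
uniform = global_attend(features, [0.0, 0.0, 0.0, 0.0])
print(uniform)  # equal weights reduce to the per-channel mean
```

Repeating this for every (w, h) gives H * W attended pixels, which is why the attention map needs D = H * W channels.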

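Local PiCANet Architecture
[Figure: Local PiCANet architecture. Labels: each pixel's attention, i.e. the Attended Contextual Feature (ACF) map with _h * _w channels; a partial region of the map; panels (a) and (b)-softmax.]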

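[Figure: each pixel's attention (1 channel, after reshape) and a partial region of the feature map (C channels) are combined by a weighted sum into a single pixel with C channels; the region holds _h * _w pixels. Panels (a) and (b).]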
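The local variant differs only in that the weighted sum runs over a small window around the target pixel. The sketch below is illustrative: `local_attend` is a hypothetical name, the window is a k x k square, and treating pixels outside the map as zeros is an assumption, not something stated in the slides.

```python
import math

def local_attend(feature_map, logits, row, col, k):
    """feature_map[r][c] is a list of C channel values.
    logits holds k*k unnormalised scores for the target pixel (row, col).
    Pixels that fall outside the map are treated as zeros (assumption)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    alpha = [e / total for e in exps]  # k*k weights that sum to 1
    height, width = len(feature_map), len(feature_map[0])
    channels = len(feature_map[0][0])
    out = [0.0] * channels
    half = k // 2
    for i in range(k):
        for j in range(k):
            r, c = row + i - half, col + j - half
            if 0 <= r < height and 0 <= c < width:
                for ch in range(channels):
                    out[ch] += alpha[i * k + j] * feature_map[r][c][ch]
    return out

# 3x3 map with one channel holding the values 1..9.
fmap = [[[1.0], [2.0], [3.0]],
        [[4.0], [5.0], [6.0]],
        [[7.0], [8.0], [9.0]]]
center = local_attend(fmap, [0.0] * 9, row=1, col=1, k=3)
print(center)  # uniform weights over the full window give the mean
```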

Overall Architecture
[Figure: overall architecture, showing the Encoder, the Decoder, and a Decoder Cell.]

Encoder
• VGG16 backbone
• 2 convolution layers
  • These correspond to the original fc-classifier part

Decoder & Decoder Cell
• Consists of 6 Decoder Cells
• A cross-entropy loss is computed for each cell
• The target is resized with max-pooling
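A minimal sketch of how a ground-truth mask could be shrunk with max-pooling so that each Decoder Cell gets a target at its own resolution. The 2x2 window, the halving per level, and the names `max_pool_2x2`, `target_pyramid`, and `binary_cross_entropy` are assumptions for illustration; the slides only state that max-pooling is used for resizing.

```python
import math

def max_pool_2x2(mask):
    """Halve a 2D mask with non-overlapping 2x2 max-pooling."""
    return [
        [max(mask[r][c], mask[r][c + 1], mask[r + 1][c], mask[r + 1][c + 1])
         for c in range(0, len(mask[0]) - 1, 2)]
        for r in range(0, len(mask) - 1, 2)
    ]

def target_pyramid(mask, levels):
    """Return [mask, mask/2, mask/4, ...] with `levels` entries."""
    pyramid = [mask]
    for _ in range(levels - 1):
        pyramid.append(max_pool_2x2(pyramid[-1]))
    return pyramid

def binary_cross_entropy(pred, target, eps=1e-7):
    """Mean binary cross-entropy between two 2D grids of equal size."""
    total, count = 0.0, 0
    for prow, trow in zip(pred, target):
        for p, t in zip(prow, trow):
            p = min(max(p, eps), 1 - eps)  # avoid log(0)
            total -= t * math.log(p) + (1 - t) * math.log(1 - p)
            count += 1
    return total / count

mask = [[0, 0, 0, 0],
        [0, 1, 0, 0],
        [0, 0, 0, 0],
        [0, 0, 0, 0]]
pyramid = target_pyramid(mask, levels=3)
print(pyramid[1])  # the single salient pixel survives pooling
```

Max-pooling keeps a small salient region visible at coarse resolutions, whereas average pooling would dilute it toward zero.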

Decoder & Decoder Cell
[Figure: a Decoder Cell, labeled with Dec^{i+1}, En^i, and Dec^i.]

Result
• -R: ResNet backbone instead of VGG16
• -C: CRF post-processing

Implementation details
• Data Augmentation
• Dilated Filter

Data Augmentation
• Crop Image into 224*224
• Horizontal Mirror Flip
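The two augmentations can be sketched on plain 2D lists as below. Several details are assumptions rather than facts from the slides: the crop position is random, the flip is applied with probability 0.5, and the same transform is applied to the ground-truth mask so that image and mask stay aligned. The function name `augment` is hypothetical.

```python
import random

def augment(image, mask, size=224, rng=None):
    """Crop image and mask to size x size at the same random position,
    then mirror both horizontally with probability 0.5."""
    rng = rng or random.Random()
    top = rng.randint(0, len(image) - size)
    left = rng.randint(0, len(image[0]) - size)

    def crop(grid):
        return [row[left:left + size] for row in grid[top:top + size]]

    image, mask = crop(image), crop(mask)
    if rng.random() < 0.5:
        image = [row[::-1] for row in image]
        mask = [row[::-1] for row in mask]
    return image, mask

# Toy 256x256 "image" whose pixels record their own coordinates.
grid = [[(r, c) for c in range(256)] for r in range(256)]
out_image, out_mask = augment(grid, grid, rng=random.Random(0))
print(len(out_image), len(out_image[0]))
```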

Dilated Convolution
• In Global PiCANet the actual H, W = 28, 28 → D = 28 * 28
• To reduce computation, an ACF map with 10 * 10 channels is generated
• Setting dilation = 3 expands 10 → 28
• (10 - 1) * (3 - 1) + 10 = 28

Dilated Convolution
• In Local PiCANet the actual _h, _w = 13, 13, _d = 13 * 13
• To reduce computation, an ACF map with 7 * 7 channels is generated
• Setting dilation = 2 expands 7 → 13
• (7 - 1) * (2 - 1) + 7 = 13
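The arithmetic on these two slides follows one formula for the span covered by a dilated kernel. A small sketch, with the function name `dilated_extent` chosen here for illustration:

```python
def dilated_extent(kernel_size, dilation):
    """Number of pixels spanned by a kernel of `kernel_size` taps
    when (dilation - 1) gaps are inserted between neighbouring taps."""
    return (kernel_size - 1) * (dilation - 1) + kernel_size

print(dilated_extent(10, 3))  # 28, the Global PiCANet case
print(dilated_extent(7, 2))   # 13, the Local PiCANet case
```

The same value can be written as dilation * (kernel_size - 1) + 1, which is the form commonly used when computing convolution output sizes.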

Implementation
• Implemented in PyTorch (the paper used Caffe)
• Trained with batch size 1 (the paper used batch size 10)
  • (Larger batch sizes caused out-of-memory errors)
• Hyperparameters:
  • learning_rate, beta, momentum, etc.: same as the paper (lr: 0.001, m: 0.9)
• Network:
  • Global/Local PiCANet: implemented as described in the paper
  • Convolution layers in the Decoder Cell:
    • The paper does not state the kernel size, so it was chosen arbitrarily
• Optimization:
  • Gradient clipping was applied to keep the gradients from spiking.
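The slides do not say which form of gradient clipping was used. As one common possibility, the sketch below rescales gradients by their global L2 norm; in PyTorch the corresponding utility is `torch.nn.utils.clip_grad_norm_`. The pure-Python function `clip_grad_norm` and the threshold value here are illustrative assumptions.

```python
import math

def clip_grad_norm(grads, max_norm):
    """Scale a flat list of gradient values so that their L2 norm
    is at most max_norm. Returns (clipped_grads, original_norm)."""
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm <= max_norm:
        return list(grads), total_norm
    scale = max_norm / total_norm
    return [g * scale for g in grads], total_norm

clipped, norm = clip_grad_norm([3.0, 4.0], max_norm=1.0)
print(norm)     # norm before clipping
print(clipped)  # same direction, rescaled to norm 1
```

Clipping by norm preserves the direction of the update, which matters with batch size 1 where a single noisy sample can otherwise dominate a step.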

Result
• Training Loss
• PR_Curve
• F_Measure Score (beta_square = 0.3)
• MAE Score
• Attention Map Viewing Application

Training Result
• 1000 iteration
• 123000 iteration

PR_Curve
The PR curve exported from TensorBoard is overlaid on top of the figure, so it may be inaccurate.
The thick orange curve is our own implementation.

F_Measure Score
Step     Value      Threshold
55000    0.827128   0.754009
123000   0.837655   0.787505
130000   0.825695   0.830526
139000   0.823819   0.728928
159000   0.824045   0.7839
180000   0.833425   0.827174
194000   0.829462   0.786741
218000   0.823438   0.676048
227000   0.823538   0.718029
243000   0.825455   0.728044
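For reference, the F-measure with beta_square = 0.3 can be computed as sketched below. Reading the table's Value and Threshold columns as the best score and the threshold that achieves it is an assumption, as are the function names and the threshold sweep; predictions and targets are flattened to 1D lists.

```python
def f_measure(pred, target, threshold, beta_sq=0.3):
    """F-measure of a saliency map binarised at `threshold`.
    pred: floats in [0, 1]; target: 0/1 ground-truth labels."""
    tp = fp = fn = 0
    for p, t in zip(pred, target):
        hit = p >= threshold
        if hit and t:
            tp += 1
        elif hit:
            fp += 1
        elif t:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)

def best_f_measure(pred, target, steps=100):
    """Sweep thresholds in (0, 1); return (best_score, best_threshold)."""
    candidates = [i / steps for i in range(1, steps)]
    return max((f_measure(pred, target, t), t) for t in candidates)

score = f_measure([0.9, 0.8, 0.2, 0.1], [1, 0, 1, 0], threshold=0.5)
print(score)  # precision and recall are both 0.5 in this toy case
```

With beta_square below 1, precision is weighted more heavily than recall.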

MAE
Step     Value
55000    0.040421
70000    0.040555
86000    0.039739
90000    0.041903
96000    0.041098
106000   0.041002
227000   0.041843
243000   0.041748
247000   0.040996
289000   0.041671
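MAE here is the mean absolute difference between the predicted saliency map and the ground-truth mask, both scaled to [0, 1]. A minimal sketch, with the function name chosen for illustration:

```python
def mean_absolute_error(pred, target):
    """Mean absolute per-pixel difference between two 2D grids."""
    diffs = [abs(p - t)
             for prow, trow in zip(pred, target)
             for p, t in zip(prow, trow)]
    return sum(diffs) / len(diffs)

mae = mean_absolute_error([[0.0, 1.0], [0.5, 0.5]],
                          [[0, 1], [1, 0]])
print(mae)  # two exact pixels and two that are off by 0.5
```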

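Interpreting the attention maps
• A different attention map is generated for each image
• A different attention map is generated for each pixel
• Because the attention maps are non-uniform, we can see that the positional relationship between pixels carries contextual information.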

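Current limitations
1. GPU utilization is very low when training or validating the network.
2. The paper reports training with batch size 10, but this implementation cannot be trained with a batch size larger than 1.
3. It is hard to tell what the attention maps concretely mean.
(It is difficult to confirm how they help saliency detection or image processing.)
4. The model was validated only on the DUTS dataset.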

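Future improvements
1. GPU utilization is very low when training or validating the network.
   1. This may be caused by instability in PyTorch.
   2. Try implementing it with Tensorboard.
2. The paper reports training with batch size 10, but this implementation cannot be trained with a batch size larger than 1.
   1. Possibly the same cause as item 1.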

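Future improvements
3. It is hard to tell what the attention maps concretely mean.
(It is difficult to confirm how they help saliency detection or image processing.)
   1. Apply PiCANet to other problems such as classification and semantic detection.
   2. Design an autoencoder that uses PiCANet and verify whether the original information can be restored from the context information.
4. The model was validated only on the DUTS dataset.
   1. Validate on other datasets as well.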
