12. 2023/9/6 12
TransformerのEncoder, Decoderから出力された特徴に着目
• ドメインクエリを用いて識別器に
大域的な特徴におけるドメイン差の緩和
• トークンを用いて識別器に
局所的な特徴におけるドメイン差の緩和
→両方の観点からドメインの差を埋める
Exploring Sequence Feature Alignment for Domain Adaptive Detection
Transformers (SFA),
Wen Wang+ (University of Science and Technology of China) [ACM International Conference on
Multimedia’21]
13. 2023/9/6 13
Encoder+Decoderの特徴でドメイン適応することは最適ではないと主張
❌ Encoder → ◎ Backbone
Improving Transferability for Domain Adaptive Detection Transformers
(O2 Net),
Kaixiong Gong+ (Beijing Institute of Technology) [ACM International Conference on Multimedia’22]
14. 2023/9/6 14
BackboneとDecoderの出力をドメイン適応に用いる
💡技術の肝は
Backbone (Object-Aware Alignment)
前景に着目されるように重みづけ
Decoder (Optimal Transport based Alignment)
敵対的手法では位置情報が消失
→Wasserstain距離の最小化により位置情報も考慮
Improving Transferability for Domain Adaptive Detection Transformers
(O2 Net),
Kaixiong Gong+ (Beijing Institute of Technology) [ACM International Conference on Multimedia’22]
15. 2023/9/6 15
Backbone (Object-Aware Alignment)
信頼度の高い擬似ラベルで重みを設定
敵対的学習に重みづけ
Improving Transferability for Domain Adaptive Detection Transformers
(O2 Net),
Kaixiong Gong+ (Beijing Institute of Technology) [ACM International Conference on Multimedia’22]
16. 2023/9/6 16
Decoder (Optimal Transport based Alignment)
Sliced Wasserstain距離を用いてDecoderにおけるソースとターゲットの
特徴分布の距離を近づける
Improving Transferability for Domain Adaptive Detection Transformers
(O2 Net),
Kaixiong Gong+ (Beijing Institute of Technology) [ACM International Conference on Multimedia’22]
位置情報を保持したままドメイン差を埋められる
※Wasserstain距離は分布を別の分布に輸送する最小のコストを測る
17. 2023/9/6 17
Cityscapes → Foggy CityscapesのデータセットにおいてSoTA
Improving Transferability for Domain Adaptive Detection Transformers
(O2 Net),
Kaixiong Gong+ (Beijing Institute of Technology) [ACM International Conference on Multimedia’22]
18. 2023/9/6 18
Cityscapes → Foggy CityscapesのデータセットにおいてSoTA
Improving Transferability for Domain Adaptive Detection Transformers
(O2 Net),
Kaixiong Gong+ (Beijing Institute of Technology) [ACM International Conference on Multimedia’22]
21. 2023/9/6 21
Cascading Alignment for Unsupervised Domain-Adaptive DETR with
Improved DeNoising Anchor Boxes (CA-DINO),
Huantong Geng+ (Nanjing University of Information Science and Technology) [MDPI’22]
Backbone
AEDD (Attention Enhanced Double Discriminator)
によりドメイン不変な特徴
CBAM [ECCV’18]
(Convolutional block attention module)
による強いドメイン識別器を導入
22. 2023/9/6 22
Cascading Alignment for Unsupervised Domain-Adaptive DETR with
Improved DeNoising Anchor Boxes (CA-DINO),
Huantong Geng+ (Nanjing University of Information Science and Technology) [MDPI’22]
Transformer
WROT (Weak Restraints on Category-Level Token)
𝑧:EncoderとDecoderの出力を平坦化
Frobenius normを計算し最小化
23. 2023/9/6 23
Cascading Alignment for Unsupervised Domain-Adaptive DETR with
Improved DeNoising Anchor Boxes (CA-DINO),
Huantong Geng+ (Nanjing University of Information Science and Technology) [MDPI’22]
Cityscapes → Foggy CityscapesのデータセットでSoTA