More Related Content More from harmonylab (20) Differential Networks for Visual Question Answering2. 1
1論文概要
• Author
• Chenfei Wu, Jinlai Liu, Xiaojie Wang, Ruifan Li
• Center for Intelligence Science and Technology
Beijing University of Posts and Telecommunications
• AAAI2019 (2019/1/27 ~ 2019/2/1)
• URL
• https://www.aaai.org/Papers/AAAI/2019/AAAI-WuC.76.pdf
• FCNに工夫を加えたDifferential Networksを提案し,VQAタスクに適用
4. 3
3VQA2.0のデータセットの例
How many stories is the building tall?
Answer 1: 5
Answer 2: 5
Answer 3: 5
Answer 4: 5
Answer 5: 6
Answer 6: 6
Answer 7: 5
Answer 8: 5
Answer 9: 4
Answer 10: 5
答え
5. 4
4VQA2.0のデータセットの例
What is on the pillow?
Answer 1: pillow sham
Answer 2: frills
Answer 3: pillow case
Answer 4: pillowcases
Answer 5: nothing
Answer 6: gray
Answer 7: note
Answer 8: cover
Answer 9: paper
Answer 10: sign
答え
8. 7
7Differential Fusion Model for VQA
DNを利用したVQAを解くためのモデル
Data Embedding ・ Differential Fusion ・ Decision Making の3段階からなる
attentionベースのモデル: 画像のどこに注目すればよいかを考慮するモデル