[DL輪読会]Exploiting Cyclic Symmetry in Convolutional Neural Networks

•Download as PPTX, PDF•

0 likes•971 views

Deep Learning JP

2017/2/28 Deep Learning JP: http://deeplearning.jp/seminar-2/

2017/2/16
DL輪読会
医学系研究科 D2
山口亮平

Exploiting Cyclic Symmetry in
Convolutional Neural Networks
• Deep Mindの論文
• Sander Dileman(ゲント大学の博士課程→google)。博士
課程のとき、Kaggle2015のプランクトンコンテストで
優勝。
• ICML16で発表、引用数12(2017/2/17現在)
• http://benanne.github.io/2015/03/17/plankton.html
• 回転に対するロバスト性をネットワーク内部で再現し
た論文。

Motivation
<CNN・・・平行移動に対して強い>
• Conv層・・・平行移動に対してequivariant
• Pooling層・・・平行移動に対してinvariant
→
①rotationに対してもinvariantを保ちたい。
ネットワークにそのinvariance/equivariantな性質を組み入れたい。
(data augmentationでは一般化されているかどうか不明なため)
②パラメーター共有を普通のCNNに比べてさらに進めることで、
overfittingリスクを減らせる。

Motivation
<CNN・・・平行移動に対して強い>
• Conv層・・・平行移動に対してequivariant
• Pooling層・・・平行移動に対してinvariant
→
①rotationに対してもinvariantを保ちたい。ネットワー
クにそのinvariance/equivariantな性質を入れたい。
②パラメーター共有を普通のCNNに比べてさらに進める
ことで、overfittingリスクを減らせる。

limitation
★Rotation：今回は90度の整数倍の回転に絞ってい
る。
（30度などの鋭角回転を入れると、画像的な補間
が必要となり、さらに計算量が増えるから、とい
う理由で除外している）
★鏡面に関して対象な画像(dihedral symmetry)に関
しても、実装は容易に可能だ、としている。

基本的な考え方
画像を4方向に回転させたものを作成、同じ数のフィルタ
を使いfeature mapを4倍作成できる

提案手法
• 以下の3種類の層を組み込むことを提案
cyclic slicing, cyclic pooling, cyclic rolling
（cyclic stackingは実際には使っていない）

①cyclic slicing, cyclic pooling
http://benanne.github.io/2015/03/17/plankton.html

http://benanne.github.io/2015/03/17/plankton.html

②cyclic rolling
Ｔ（ｘ
）

実験
＜多クラス分類＞
• プランクトンデータセット
（121クラス、95*95pixel、3037valid/27299train）
• 銀河写真データセット
（121クラス、95*95pixel、6157valid/55421train）
＜領域抽出＞
• マサチューセッツの航空写真
（80*80pixel,137train/4valid/10test）
Baseline CNNをコントロールとし、cyclic slicingなどを挿入すること
でどれだけ性能が向上したかを示した。

• Adam使用、プランクトンのみweight decayも併
用
• Data augmentationは、ベースラインCNNにも、
筆者らの提案した手法にも、どちらにも使用し
た

実験その①（cyclic slice/pool）

実験その②（cyclic rolling）

<rollingの挿入の仕方>
• Roll all(convの後にすべて挿入)
• Roll dense(dense layerの後にのみ挿入)
• その後に1/2,1/4とついているのは、フィルタの
数をその倍率に縮小した、という意味。

参考
• http://benanne.github.io/2015/03/17/plankton.ht
ml
• http://icml.cc/2016/reviews/871.txt

Recommended

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

Deep Learning JP

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

Deep Learning JP

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

Deep Learning JP

【DL輪読会】事前学習用データセットについて

【DL輪読会】事前学習用データセットについて

【DL輪読会】事前学習用データセットについて

Deep Learning JP

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

Deep Learning JP

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

Deep Learning JP

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

Deep Learning JP

【DL輪読会】マルチモーダル LLM

【DL輪読会】マルチモーダル LLM

【DL輪読会】マルチモーダル LLM

Deep Learning JP

Recommended

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

Deep Learning JP

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

Deep Learning JP

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

Deep Learning JP

【DL輪読会】事前学習用データセットについて

【DL輪読会】事前学習用データセットについて

【DL輪読会】事前学習用データセットについて

Deep Learning JP

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

Deep Learning JP

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

Deep Learning JP

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

Deep Learning JP

【DL輪読会】マルチモーダル LLM

【DL輪読会】マルチモーダル LLM

【DL輪読会】マルチモーダル LLM

Deep Learning JP

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

Deep Learning JP

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

Deep Learning JP

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Can Neural Network Memorization Be Localized?

Deep Learning JP

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】Hopfield network　関連研究について

Deep Learning JP

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

Deep Learning JP

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

Deep Learning JP

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

Deep Learning JP

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

Deep Learning JP

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

Deep Learning JP

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

Deep Learning JP

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

Deep Learning JP

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

Deep Learning JP

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

Deep Learning JP

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

Deep Learning JP

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

Deep Learning JP

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

Deep Learning JP

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】マルチモーダル基盤モデル

Deep Learning JP

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

Deep Learning JP

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

Deep Learning JP

【DL輪読会】大量API・ツールの扱いに特化したLLM

【DL輪読会】大量API・ツールの扱いに特化したLLM

【DL輪読会】大量API・ツールの扱いに特化したLLM

Deep Learning JP

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

NTT DATA Technology & Innovation

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

More Related Content

More from Deep Learning JP

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

Deep Learning JP

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

Deep Learning JP

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Can Neural Network Memorization Be Localized?

Deep Learning JP

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】Hopfield network　関連研究について

Deep Learning JP

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

Deep Learning JP

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

Deep Learning JP

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

Deep Learning JP

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

Deep Learning JP

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

Deep Learning JP

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

Deep Learning JP

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

Deep Learning JP

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

Deep Learning JP

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

Deep Learning JP

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

Deep Learning JP

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

Deep Learning JP

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

Deep Learning JP

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】マルチモーダル基盤モデル

Deep Learning JP

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

Deep Learning JP

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

Deep Learning JP

【DL輪読会】大量API・ツールの扱いに特化したLLM

【DL輪読会】大量API・ツールの扱いに特化したLLM

【DL輪読会】大量API・ツールの扱いに特化したLLM

Deep Learning JP

More from Deep Learning JP (20)

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】VIP: Towards Universal Visual Reward and Representation via Value-Impl...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】Deep Transformers without Shortcuts: Modifying Self-attention for Fait...

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】マルチモーダル基盤モデル

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】TrOCR: Transformer-based Optical Character Recognition with Pre-traine...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】HyperDiffusion: Generating Implicit Neural Fields withWeight-Space Dif...

【DL輪読会】大量API・ツールの扱いに特化したLLM

【DL輪読会】大量API・ツールの扱いに特化したLLM

【DL輪読会】大量API・ツールの扱いに特化したLLM

Recently uploaded

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

NTT DATA Technology & Innovation

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

2024年度_サイバーエージェント_新卒研修「データベースの歴史」.pptx

2024年度_サイバーエージェント_新卒研修「データベースの歴史」.pptx

2024年度_サイバーエージェント_新卒研修「データベースの歴史」.pptx

YouTube nnabla channelの次の動画で利用したスライドです。【AI論文解説】Consistency ModelとRectified Flow https://youtu.be/3IKCrAPe55k Consistency ModelとRectified Flowに関連する以下の論文を紹介しています。 Consistency models（解説編Part1で紹介） - “Consistency Models,” ICML 2023. - “Improved Techniques for Training Consistency Models,” ICLR 2024. - “Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion,” ICLR 2024. Rectified flow（解説編Part2で紹介） - “Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow,” ICLR 2023. - “InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation,” ICLR 2024.

【AI論文解説】Consistency ModelとRectified Flow

【AI論文解説】Consistency ModelとRectified Flow

【AI論文解説】Consistency ModelとRectified Flow

Sony - Neural Network Libraries

20240523_IoTLT_vol111_kitazaki_v1___.pdf

20240523_IoTLT_vol111_kitazaki_v1___.pdf

20240523_IoTLT_vol111_kitazaki_v1___.pdf

Ayachika Kitazaki

MPAなWebフレームワーク、Astroの紹介 (その2) 2024/05/24の勉強会で発表されたものです。

MPAなWebフレームワーク、Astroの紹介 (その2) 2024/05/24の勉強会で発表されたものです。

MPAなWebフレームワーク、Astroの紹介 (その2) 2024/05/24の勉強会で発表されたものです。

iPride Co., Ltd.

2024年5月25日Serverless Meetup大阪　アプリケーションをどこで動かすべきなのか.pptx

2024年5月25日Serverless Meetup大阪　アプリケーションをどこで動かすべきなのか.pptx

2024年5月25日Serverless Meetup大阪　アプリケーションをどこで動かすべきなのか.pptx

論文紹介: Offline Q-Learning on diverse Multi-Task data both scales and generalizes

論文紹介: Offline Q-Learning on diverse Multi-Task data both scales and generalizes

論文紹介: Offline Q-Learning on diverse Multi-Task data both scales and generalizes

LoRaWAN 4チャンネル電流センサー・コンバーター CS01-LB 日本語マニュアル

LoRaWAN 4チャンネル電流センサー・コンバーター CS01-LB 日本語マニュアル

LoRaWAN 4チャンネル電流センサー・コンバーター CS01-LB 日本語マニュアル

CRI Japan, Inc.

Amazon Cognitoで実装するパスキー（Security-JAWS【第33回】勉強会）

Amazon Cognitoで実装するパスキー（Security-JAWS【第33回】勉強会）

Amazon Cognitoで実装するパスキー（Security-JAWS【第33回】勉強会）

Recently uploaded (10)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

YugabyteDB適用に向けた取り組みと隠れた魅力 (DSS Asia 2024 発表資料)

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

論文紹介: Exploiting semantic segmentation to boost reinforcement learning in vid...

2024年度_サイバーエージェント_新卒研修「データベースの歴史」.pptx

2024年度_サイバーエージェント_新卒研修「データベースの歴史」.pptx

2024年度_サイバーエージェント_新卒研修「データベースの歴史」.pptx

【AI論文解説】Consistency ModelとRectified Flow

【AI論文解説】Consistency ModelとRectified Flow

【AI論文解説】Consistency ModelとRectified Flow

20240523_IoTLT_vol111_kitazaki_v1___.pdf

20240523_IoTLT_vol111_kitazaki_v1___.pdf

20240523_IoTLT_vol111_kitazaki_v1___.pdf

MPAなWebフレームワーク、Astroの紹介 (その2) 2024/05/24の勉強会で発表されたものです。

MPAなWebフレームワーク、Astroの紹介 (その2) 2024/05/24の勉強会で発表されたものです。

MPAなWebフレームワーク、Astroの紹介 (その2) 2024/05/24の勉強会で発表されたものです。

2024年5月25日Serverless Meetup大阪　アプリケーションをどこで動かすべきなのか.pptx

2024年5月25日Serverless Meetup大阪　アプリケーションをどこで動かすべきなのか.pptx

2024年5月25日Serverless Meetup大阪　アプリケーションをどこで動かすべきなのか.pptx

論文紹介: Offline Q-Learning on diverse Multi-Task data both scales and generalizes

論文紹介: Offline Q-Learning on diverse Multi-Task data both scales and generalizes

論文紹介: Offline Q-Learning on diverse Multi-Task data both scales and generalizes

LoRaWAN 4チャンネル電流センサー・コンバーター CS01-LB 日本語マニュアル

LoRaWAN 4チャンネル電流センサー・コンバーター CS01-LB 日本語マニュアル

LoRaWAN 4チャンネル電流センサー・コンバーター CS01-LB 日本語マニュアル

Amazon Cognitoで実装するパスキー（Security-JAWS【第33回】勉強会）

Amazon Cognitoで実装するパスキー（Security-JAWS【第33回】勉強会）

Amazon Cognitoで実装するパスキー（Security-JAWS【第33回】勉強会）

[DL輪読会]Exploiting Cyclic Symmetry in Convolutional Neural Networks

1. 2017/2/16 DL輪読会医学系研究科 D2 山口亮平

2. Exploiting Cyclic Symmetry in Convolutional Neural Networks • Deep Mindの論文 • Sander Dileman(ゲント大学の博士課程→google)。博士課程のとき、Kaggle2015のプランクトンコンテストで優勝。 • ICML16で発表、引用数12(2017/2/17現在) • http://benanne.github.io/2015/03/17/plankton.html • 回転に対するロバスト性をネットワーク内部で再現した論文。

3. Motivation <CNN・・・平行移動に対して強い> • Conv層・・・平行移動に対してequivariant • Pooling層・・・平行移動に対してinvariant → ①rotationに対してもinvariantを保ちたい。ネットワークにそのinvariance/equivariantな性質を組み入れたい。 (data augmentationでは一般化されているかどうか不明なため) ②パラメーター共有を普通のCNNに比べてさらに進めることで、 overfittingリスクを減らせる。

4. Motivation <CNN・・・平行移動に対して強い> • Conv層・・・平行移動に対してequivariant • Pooling層・・・平行移動に対してinvariant → ①rotationに対してもinvariantを保ちたい。ネットワークにそのinvariance/equivariantな性質を入れたい。 ②パラメーター共有を普通のCNNに比べてさらに進めることで、overfittingリスクを減らせる。

5. limitation ★Rotation：今回は90度の整数倍の回転に絞っている。（30度などの鋭角回転を入れると、画像的な補間が必要となり、さらに計算量が増えるから、という理由で除外している） ★鏡面に関して対象な画像(dihedral symmetry)に関しても、実装は容易に可能だ、としている。

6. 基本的な考え方画像を4方向に回転させたものを作成、同じ数のフィルタを使いfeature mapを4倍作成できる

7. 提案手法 • 以下の3種類の層を組み込むことを提案 cyclic slicing, cyclic pooling, cyclic rolling （cyclic stackingは実際には使っていない）

8. ①cyclic slicing, cyclic pooling http://benanne.github.io/2015/03/17/plankton.html

9. http://benanne.github.io/2015/03/17/plankton.html

10. ②cyclic rolling Ｔ（ｘ）

11. 実験＜多クラス分類＞ • プランクトンデータセット（121クラス、95*95pixel、3037valid/27299train） • 銀河写真データセット（121クラス、95*95pixel、6157valid/55421train）＜領域抽出＞ • マサチューセッツの航空写真（80*80pixel,137train/4valid/10test） Baseline CNNをコントロールとし、cyclic slicingなどを挿入することでどれだけ性能が向上したかを示した。

12. • Adam使用、プランクトンのみweight decayも併用 • Data augmentationは、ベースラインCNNにも、筆者らの提案した手法にも、どちらにも使用した

13.

14. 実験その①（cyclic slice/pool）

15. 実験その②（cyclic rolling）

16. <rollingの挿入の仕方> • Roll all(convの後にすべて挿入) • Roll dense(dense layerの後にのみ挿入) • その後に1/2,1/4とついているのは、フィルタの数をその倍率に縮小した、という意味。

17.

18. 参考 • http://benanne.github.io/2015/03/17/plankton.ht ml • http://icml.cc/2016/reviews/871.txt