SlideShare a Scribd company logo
1 of 15
Architecture Design Strategies
ICSL Seminar
김범준
2020. 06. 05
1/15
 Motivation
 Neural Network의 Architecture Design은 결과 향상에 매우 중요
 Vanilla CNN: [[Conv + ReLU] x N + MaxPool] x M + [FC + ReLU] x K
 Properties of Architecture Design
 Hard to Understand
 How (O), Why (X)
 No Math
 But Important
2/15
 Introduction
 Architecture Design Strategies
 Hand-Designed Architectures
 Neural Architecture Search
Architecture Authors Publication Cite
ResNet He et al. CVPR 2016 46987
Wide Residual Networks Zagoruyko et al. BMVC 2016 2020
ResNeXt Xie et al. CVPR 2017 2192
SENet Hu et al. TPAMI 2019 2998
DenseNet Huang et al. CVPR 2017 8973
NAS Zoph et al. ICLR 2016 1508
ENAS Pham et al. ICML 2018 584
RandWire Xie et al. ICCV 2019 74
3/15
 ResNet (He et al., CVPR 2016)
 Vanilla CNN에서 일정 수준 이상 Deep할 때 성능 저하 확인
 Deeper Network는 [Identity Layers + Shallow Network]로 구성 가능
 Shallow Network는 Deeper Network의 Subspace
20 Conv Layers
𝑓20(𝑥)
36 Identity Layers
𝐼(𝑥)
[56 Conv Layers could be]
4/15
 ResNet (He et al., CVPR 2016)
 일부 Layer에서 Identity Layer가 Optimal할 가능성 고려
 Identity Mapping이 나오기 쉬운 Operation을 Design
 이때 Conv를 Identity Mapping으로 만드는 것보다, Residual을 Zero로 만드는 것이 유리
 Skip Connection을 갖는 Residual Block을 제안
5/15
 ResNet (He et al., CVPR 2016)
 Residual Learning을 통해 110 Layers까지 성공적으로 수렴
6/15
 Wide Residual Networks (Zagoruyko et al., BMVC 2016)
 Wide Residual Networks: ResNet을 𝑘배 Wide한 구조로 변형한 모델
 (Deeper Architecture) ≈ (Wider Architecture)
 GPU 병렬처리 특성상 같은 조건(Accuracy)에서 Deep보다 Wide에서 연산이 빠름이 보고됨
7/15
 ResNeXt (Xie et al., CVPR 2017)
 Branch 구조 제안
 Depth나 Width를 높이는 것보다 Branch 수(Cardinality)를 높이는 Architecture가 효율적임을 보고
8/15
 SENet (Hu et al., TPAMI 2019)
 Channel-wise Attention: Feature Map에서 중요한 Channel, 중요하지 않은 Channel을 조절
 SE Module이 추가적인 연산량이 적은 것에 비해, 성능 향상에 효과적임이 보고
* SENet: Squeeze-and-Excitation Networks
9/15
 DenseNet (Huang et al., CVPR 2017)
 Dense Block: 단위 Block 내의 모든 Layer에 Skip Connection 적용
 Architecture Design에서 Skip Connection이 중요함을 재확인
10/15
 NAS (Zoph et al., ICLR 2016), ENAS (Pham et al., ICML 2018)
 Hand-Designed Architecture: 비직관적, 휴리스틱에 의존
 Neural Architecture Search: Architecture를 스스로 Search하는 강화 학습 진행
 Search에 많은 연산량, 시간, GPU 요구
11/15
 RandWire (Xie et al., ICCV 2019)
 다양한 Wiring이 성능 향상에 중요
 보다 자유도가 높은 Wiring을 Random Graph를 통해 구성하여 Architecture 설정
 유사한 조건(FLOPs, # Param)에서 ResNet/ResNeXt보다 높은 성능 보고
* FLOPs: FLoating point Operations Per Second, 초당 부동소수점 연산
12/15
 Architecture Design Strategies
 Various Skip Connections
 Large Neural Network: Depth, Width, and Cardinality
 Complex Architecture
 Neural Architecture Search
 Performance Comparison
13/15
 Visualizing the Loss Landscape of Neural Nets (Li et al., NeurIPS 2018)
 Weight에 대한 Loss Landscape을 시각화
 Skip Connection 구조가 Smooth Loss Landscape 제공  Easier Optimization
 Limitation: Why Smooth?
14/15
 Deep Double Descent (Nakkiran et al., ICLR 2020)
 Neural Network가 Large할 수록 성능이 좋아질 수도, 아닐 수도 있다.
 Training Time, Dataset Size 등의 조건에 따라 다른 성질 확인
 CNN, ResNet 등 다양한 실험에서 해당 현상 확인
15/15

More Related Content

Recently uploaded

데이터 분석 문제 해결을 위한 나의 JMP 활용법
데이터 분석 문제 해결을 위한 나의 JMP 활용법데이터 분석 문제 해결을 위한 나의 JMP 활용법
데이터 분석 문제 해결을 위한 나의 JMP 활용법JMP Korea
 
JMP가 걸어온 여정, 새로운 도약 JMP 18!
JMP가 걸어온 여정, 새로운 도약 JMP 18!JMP가 걸어온 여정, 새로운 도약 JMP 18!
JMP가 걸어온 여정, 새로운 도약 JMP 18!JMP Korea
 
JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개
JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개
JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개JMP Korea
 
JMP를 활용한 전자/반도체 산업 Yield Enhancement Methodology
JMP를 활용한 전자/반도체 산업 Yield Enhancement MethodologyJMP를 활용한 전자/반도체 산업 Yield Enhancement Methodology
JMP를 활용한 전자/반도체 산업 Yield Enhancement MethodologyJMP Korea
 
실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석
실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석
실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석JMP Korea
 
공학 관점에서 바라본 JMP 머신러닝 최적화
공학 관점에서 바라본 JMP 머신러닝 최적화공학 관점에서 바라본 JMP 머신러닝 최적화
공학 관점에서 바라본 JMP 머신러닝 최적화JMP Korea
 
JMP를 활용한 가속열화 분석 사례
JMP를 활용한 가속열화 분석 사례JMP를 활용한 가속열화 분석 사례
JMP를 활용한 가속열화 분석 사례JMP Korea
 

Recently uploaded (7)

데이터 분석 문제 해결을 위한 나의 JMP 활용법
데이터 분석 문제 해결을 위한 나의 JMP 활용법데이터 분석 문제 해결을 위한 나의 JMP 활용법
데이터 분석 문제 해결을 위한 나의 JMP 활용법
 
JMP가 걸어온 여정, 새로운 도약 JMP 18!
JMP가 걸어온 여정, 새로운 도약 JMP 18!JMP가 걸어온 여정, 새로운 도약 JMP 18!
JMP가 걸어온 여정, 새로운 도약 JMP 18!
 
JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개
JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개
JMP 기능의 확장 및 내재화의 핵심 JMP-Python 소개
 
JMP를 활용한 전자/반도체 산업 Yield Enhancement Methodology
JMP를 활용한 전자/반도체 산업 Yield Enhancement MethodologyJMP를 활용한 전자/반도체 산업 Yield Enhancement Methodology
JMP를 활용한 전자/반도체 산업 Yield Enhancement Methodology
 
실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석
실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석
실험 설계의 평가 방법: Custom Design을 중심으로 반응인자 최적화 및 Criteria 해석
 
공학 관점에서 바라본 JMP 머신러닝 최적화
공학 관점에서 바라본 JMP 머신러닝 최적화공학 관점에서 바라본 JMP 머신러닝 최적화
공학 관점에서 바라본 JMP 머신러닝 최적화
 
JMP를 활용한 가속열화 분석 사례
JMP를 활용한 가속열화 분석 사례JMP를 활용한 가속열화 분석 사례
JMP를 활용한 가속열화 분석 사례
 

Featured

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 

Featured (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Architecture Design Strategies

  • 1. Architecture Design Strategies ICSL Seminar 김범준 2020. 06. 05 1/15
  • 2.  Motivation  Neural Network의 Architecture Design은 결과 향상에 매우 중요  Vanilla CNN: [[Conv + ReLU] x N + MaxPool] x M + [FC + ReLU] x K  Properties of Architecture Design  Hard to Understand  How (O), Why (X)  No Math  But Important 2/15
  • 3.  Introduction  Architecture Design Strategies  Hand-Designed Architectures  Neural Architecture Search Architecture Authors Publication Cite ResNet He et al. CVPR 2016 46987 Wide Residual Networks Zagoruyko et al. BMVC 2016 2020 ResNeXt Xie et al. CVPR 2017 2192 SENet Hu et al. TPAMI 2019 2998 DenseNet Huang et al. CVPR 2017 8973 NAS Zoph et al. ICLR 2016 1508 ENAS Pham et al. ICML 2018 584 RandWire Xie et al. ICCV 2019 74 3/15
  • 4.  ResNet (He et al., CVPR 2016)  Vanilla CNN에서 일정 수준 이상 Deep할 때 성능 저하 확인  Deeper Network는 [Identity Layers + Shallow Network]로 구성 가능  Shallow Network는 Deeper Network의 Subspace 20 Conv Layers 𝑓20(𝑥) 36 Identity Layers 𝐼(𝑥) [56 Conv Layers could be] 4/15
  • 5.  ResNet (He et al., CVPR 2016)  일부 Layer에서 Identity Layer가 Optimal할 가능성 고려  Identity Mapping이 나오기 쉬운 Operation을 Design  이때 Conv를 Identity Mapping으로 만드는 것보다, Residual을 Zero로 만드는 것이 유리  Skip Connection을 갖는 Residual Block을 제안 5/15
  • 6.  ResNet (He et al., CVPR 2016)  Residual Learning을 통해 110 Layers까지 성공적으로 수렴 6/15
  • 7.  Wide Residual Networks (Zagoruyko et al., BMVC 2016)  Wide Residual Networks: ResNet을 𝑘배 Wide한 구조로 변형한 모델  (Deeper Architecture) ≈ (Wider Architecture)  GPU 병렬처리 특성상 같은 조건(Accuracy)에서 Deep보다 Wide에서 연산이 빠름이 보고됨 7/15
  • 8.  ResNeXt (Xie et al., CVPR 2017)  Branch 구조 제안  Depth나 Width를 높이는 것보다 Branch 수(Cardinality)를 높이는 Architecture가 효율적임을 보고 8/15
  • 9.  SENet (Hu et al., TPAMI 2019)  Channel-wise Attention: Feature Map에서 중요한 Channel, 중요하지 않은 Channel을 조절  SE Module이 추가적인 연산량이 적은 것에 비해, 성능 향상에 효과적임이 보고 * SENet: Squeeze-and-Excitation Networks 9/15
  • 10.  DenseNet (Huang et al., CVPR 2017)  Dense Block: 단위 Block 내의 모든 Layer에 Skip Connection 적용  Architecture Design에서 Skip Connection이 중요함을 재확인 10/15
  • 11.  NAS (Zoph et al., ICLR 2016), ENAS (Pham et al., ICML 2018)  Hand-Designed Architecture: 비직관적, 휴리스틱에 의존  Neural Architecture Search: Architecture를 스스로 Search하는 강화 학습 진행  Search에 많은 연산량, 시간, GPU 요구 11/15
  • 12.  RandWire (Xie et al., ICCV 2019)  다양한 Wiring이 성능 향상에 중요  보다 자유도가 높은 Wiring을 Random Graph를 통해 구성하여 Architecture 설정  유사한 조건(FLOPs, # Param)에서 ResNet/ResNeXt보다 높은 성능 보고 * FLOPs: FLoating point Operations Per Second, 초당 부동소수점 연산 12/15
  • 13.  Architecture Design Strategies  Various Skip Connections  Large Neural Network: Depth, Width, and Cardinality  Complex Architecture  Neural Architecture Search  Performance Comparison 13/15
  • 14.  Visualizing the Loss Landscape of Neural Nets (Li et al., NeurIPS 2018)  Weight에 대한 Loss Landscape을 시각화  Skip Connection 구조가 Smooth Loss Landscape 제공  Easier Optimization  Limitation: Why Smooth? 14/15
  • 15.  Deep Double Descent (Nakkiran et al., ICLR 2020)  Neural Network가 Large할 수록 성능이 좋아질 수도, 아닐 수도 있다.  Training Time, Dataset Size 등의 조건에 따라 다른 성질 확인  CNN, ResNet 등 다양한 실험에서 해당 현상 확인 15/15