PR-105: MnasNet: Platform-Aware Neural Architecture Search for Mobile

- Title: MnasNet: Platform-Aware Neural Architecture Search for Mobile
- Paper: https://arxiv.org/abs/1807.11626
- YouTube: https://youtu.be/4uDZxefPd-I
- Presenter: Taekmin Kim, http://github.com/tantara

MnasNet: Platform-Aware Neural Architecture Search for Mobile
M. Tan, B. Chen, R. Pang, V. Vasudevan, Quoc V. Le
Google Brain, Google Inc.
PR105
Taekmin Kim
Sep 30, 2018

AutoML
- Neural Architecture Search (NAS, 2016)
  - PR-017: https://www.youtube.com/watch?v=XP3vyVrrt3Q
- Efficient Neural Architecture Search (ENAS, 2018)
  - PR-069: https://www.youtube.com/watch?v=fbCcJaSQPPA
- Neural Optimizer Search (2017)
- More detail in github.com/hibayesian/awesome-automl-papers
*NAS Paper

Problem
- Focus only on accuracy
  - CNN: e.g. MNIST, ImageNet
  - RNN: e.g. Penn Treebank
- On mobile, latency is considered via an inaccurate proxy
  - e.g. FLOPS, MACs
Intuition
- Incorporate latency information
● FLOPS: FLoating-point OPerations; as a model-cost proxy this is an operation count (often written FLOPs), not the per-second hardware rate
● MACs: the number of Multiply-Accumulates
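To make the proxy concrete, here is a minimal sketch of how MACs are counted for one standard convolution layer. The layer shape is only an illustrative assumption (a 3x3 convolution from 3 to 32 channels with a 112x112 output), and the function names are hypothetical. The count depends only on the layer shape, not on the device, which is why it can only approximate real latency.

```python
def conv2d_macs(out_h, out_w, c_in, c_out, kernel):
    """Multiply-accumulates of a standard kernel x kernel convolution.

    Each of the out_h * out_w * c_out output values needs
    c_in * kernel * kernel multiply-accumulates.
    """
    return out_h * out_w * c_in * c_out * kernel * kernel


def macs_to_flops(macs):
    """Common convention: one MAC = one multiply + one add = 2 FLOPs."""
    return 2 * macs


# Illustrative (assumed) layer shape, not taken from the slides.
macs = conv2d_macs(out_h=112, out_w=112, c_in=3, c_out=32, kernel=3)
print("MACs:", macs, "FLOPs:", macs_to_flops(macs))
```

Two layers with the same MAC count can still run at different speeds on a phone, because memory access patterns and kernel implementations are not part of this count.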

github.com/tensorflow/models/tree/master/research/slim/nets/mobilenet

Recap: Neural Architecture Search
- Reward
*NAS Paper

Experiments
- Classification: ImageNet
- Detection: COCO
- Training
  - Sample architectures from the controller RNN
  - Train candidates on ImageNet for fewer epochs
  - Evaluate each model on the 50K validation set, and use the accuracy as the RL reward
  - Additionally, measure the latency on a Google Pixel 2
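The training steps above can be sketched as a toy search loop. This is only an illustration under loud assumptions: a random sampler stands in for the controller RNN, and `proxy_accuracy` and `measure_latency_ms` are hypothetical stand-ins for short ImageNet training and on-device timing. The target latency and exponent values are also assumed. The one-line reward follows the accuracy-times-latency-factor form described in the MnasNet paper.

```python
import random

TARGET_LATENCY_MS = 80.0  # assumed latency target T
W = -0.07                 # assumed latency exponent (negative: slower is penalized)


def sample_architecture(rng):
    """Stand-in for the controller RNN: (kernel size, width) for each of 7 blocks."""
    return [(rng.choice([3, 5]), rng.choice([16, 24, 40])) for _ in range(7)]


def proxy_accuracy(arch):
    """Toy stand-in for 'train a few epochs, then evaluate on the validation set'."""
    capacity = sum(k * c for k, c in arch)
    return min(0.80, 0.50 + 0.0003 * capacity)


def measure_latency_ms(arch):
    """Toy stand-in for timing the model on a real phone."""
    return 20.0 + 0.08 * sum(k * k * c / 10 for k, c in arch)


def reward(acc, latency_ms):
    """Accuracy scaled by a latency factor relative to the target."""
    return acc * (latency_ms / TARGET_LATENCY_MS) ** W


def search(num_samples=50, seed=0):
    rng = random.Random(seed)
    best = None
    for _ in range(num_samples):
        arch = sample_architecture(rng)
        r = reward(proxy_accuracy(arch), measure_latency_ms(arch))
        if best is None or r > best[0]:
            best = (r, arch)
        # A real controller would be updated from r here (policy gradient);
        # this sketch only keeps the best sample.
    return best


best_reward, best_arch = search()
print(best_reward, best_arch)
```

The loop structure is the point here: sample, evaluate accuracy cheaply, measure latency, and feed a single scalar back to the sampler.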

Summary
Automated NAS for designing resource-constrained mobile models

1. MnasNet: Platform-Aware Neural Architecture Search for Mobile. M. Tan, B. Chen, R. Pang, V. Vasudevan, Quoc V. Le. Google Brain, Google Inc. PR105, Taekmin Kim, Sep 30, 2018

2. AutoML? (Automated Machine Learning)

4. AutoML - Neural Architecture Search (NAS, 2016) - PR-017: https://www.youtube.com/watch?v=XP3vyVrrt3Q - Efficient Neural Architecture Search (ENAS, 2018) - PR-069: https://www.youtube.com/watch?v=fbCcJaSQPPA - Neural Optimizer Search (2017) - More detail in github.com/hibayesian/awesome-automl-papers *NAS Paper

5. Problem - Focus only on accuracy - CNN: e.g. MNIST, ImageNet - RNN: e.g. Penn Treebank

6. Problem - Focus only on accuracy - CNN: e.g. MNIST, ImageNet - RNN: e.g. Penn Treebank Intuition - Incorporate latency information

7. Problem - Focus only on accuracy - CNN: e.g. MNIST, ImageNet - RNN: e.g. Penn Treebank - On mobile, latency is considered via an inaccurate proxy - e.g. FLOPS, MACs Intuition - Incorporate latency information ● FLOPS: the number of FLoating-point OPerations (a count, not a per-second rate) ● MACs: the number of Multiply-Accumulates

8. github.com/tensorflow/models/tree/master/research/slim/nets/mobilenet

11. Problem - Focus only on accuracy - CNN: e.g. MNIST, ImageNet - RNN: e.g. Penn Treebank - On mobile, latency is considered via an inaccurate proxy - e.g. FLOPS, MACs Intuition - Incorporate latency information - Directly measure real-world latency on a mobile device - e.g. Google Pixel 2 ● FLOPS: the number of FLoating-point OPerations (a count, not a per-second rate) ● MACs: the number of Multiply-Accumulates

12. Recap: Neural Architecture Search - Reward *NAS Paper

13. Proposed Method #1: Reward - Reward

14. Proposed Method #1: Reward - Reward

16. Reward

17. Proposed Method #2: Search Space

18. Recap: ENAS

19. Experiments - Classification: ImageNet - Detection: COCO

20. Experiments - Classification: ImageNet - Detection: COCO - Training - Sample architectures from the controller RNN - Train candidates on ImageNet for fewer epochs - Evaluate each model on the 50K validation set, and use the accuracy as the RL reward - Additionally, measure the latency on a Google Pixel 2

21. Results

22. Results: Classification

23. Results: Classification

24. Results

25. Results

26. Results

27. MnasNet, MobileNet V2, MobileNet V1

28. Results: Detection

29. Summary - Automated NAS for designing resource-constrained mobile models

Editor's Notes

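Introduce the paper title: the goal is to automatically find networks suited to mobile environments. Introduce the authors, who include MobileNetV2 authors.
What is AutoML? Simply put, anything that automates machine learning. Currently, research on searching for network architectures is the dominant trend.
Our experience: deep learning = data + compute (typified by GPUs), but in most cases expert know-how is also required, which is one reason papers are hard to reproduce. Jeff Dean's view: replace expert know-how with more compute. Google has been pushing this lately.
Related papers: Google is leading. The PR series has also reviewed the NAS and ENAS papers. The work covers both architectures and optimizers; see the link for details. Basic structure: in principle, you could sample everything by brute force and pick the best. Because brute force produces far too many candidates, a controller is designed to sample more intelligently, based on an RNN plus RL. NAS demonstrated that this approach works.
Drawback of AutoML: it produces the best-performing network every time, but it only cares about accuracy. Training speed is also considered these days, but our target is mobile, where accuracy and computation speed matter together.
So let's design an AutoML method that targets computation speed.
However, "computation speed" as used in research is ambiguous. FLOPS and MACs are metrics that are hard to compare objectively in the first place.
The best-known mobile network, MobileNetV2: MACs vs. latency.
The fewer the MACs, the lower the latency, i.e. the faster the model.
But the ratio is not constant.
The problem so far is a practical one, so a solution is really hard to find. So let's use the speed measured on the actual target device as the benchmark.
NAS recap. Controller: a network sampler. The sampled network is trained on MNIST or ImageNet, and its accuracy is fed back as the reward to update the RNN via policy gradient.
To take latency into account as well, the simple approach is to set an upper bound on latency and maximize accuracy only over the networks within that bound. On reflection, the authors found that this makes it hard to reach an optimum.
Design a reward that combines accuracy and latency. Higher accuracy is rewarded; slower models, i.e. larger latency values, are penalized (because w is negative). The value of w is itself a hyperparameter, so even though this is AutoML, some manual tuning remains. The penalty strength differs depending on whether the latency meets the target.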
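The combined reward described in this note can be sketched as follows. The form ACC(m) x [LAT(m)/T]^w, with w switching between alpha and beta around the target T, follows the MnasNet paper; the function name, argument names, and default values are assumptions for illustration.

```python
def mnas_reward(acc, latency_ms, target_ms, alpha=-0.07, beta=-0.07):
    """Reward = acc * (latency / target) ** w.

    w = alpha when the latency meets the target, beta otherwise,
    so the penalty strength can differ on each side of the target.
    """
    w = alpha if latency_ms <= target_ms else beta
    return acc * (latency_ms / target_ms) ** w


# Same accuracy, three latencies around an assumed 80 ms target.
for latency in (40.0, 80.0, 160.0):
    print(latency, mnas_reward(acc=0.75, latency_ms=latency, target_ms=80.0))
```

With negative exponents, a model exactly on target keeps its accuracy as the reward, a slower model is scaled down, and a faster model is scaled up slightly.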
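In the figure, this is the existing NAS/ENAS setup with the red box added.
So how should latency be penalized? In the first case, only slow models are penalized. In the second case, alpha and beta are both -0.07, a value chosen from the MobileNet authors' empirical observation that doubling the latency typically buys about a 5% relative accuracy gain. Could even alpha and beta be determined automatically?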
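The value -0.07 can be reproduced with a short calculation. The sketch below assumes the argument given in the MnasNet paper: a model with latency l and accuracy a should receive the same reward as a model with latency 2l and accuracy 1.05a. It also compares the latency factor of the two settings; the hard-constraint values alpha = 0, beta = -1 are taken from the paper, and the helper name is hypothetical.

```python
import math

# Equal reward for (l, a) and (2l, 1.05a):
#   a * (l/T)**beta == 1.05 * a * (2l/T)**beta   =>   2**beta == 1 / 1.05
beta = -math.log(1.05) / math.log(2)
print(round(beta, 4))  # -0.0704


def latency_factor(ratio, alpha, beta):
    """Multiplier applied to accuracy, where ratio = latency / target."""
    w = alpha if ratio <= 1.0 else beta
    return ratio ** w


for ratio in (0.5, 1.0, 2.0):
    hard = latency_factor(ratio, alpha=0.0, beta=-1.0)     # penalizes only slow models
    soft = latency_factor(ratio, alpha=-0.07, beta=-0.07)  # smooth trade-off on both sides
    print(ratio, hard, soft)
```

The hard setting ignores any speed-up below the target and sharply punishes overshoot, while the soft setting trades accuracy against latency smoothly on both sides.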
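Even in AutoML, a human still has to draw the big picture. This addresses the training-speed problem: search-space design is a major concern in AutoML, both for training speed and for sampling good networks. MobileNet experience went into this as well, and knowing MobileNet makes the paper easier to follow. The network is divided into 7 blocks, and the layers inside each block are defined. Within a block, layers of the same form are repeated several times. The RNN controller samples each layer from the ops shown on the right.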
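A block-wise search space like the one in this note can be sketched as below. The option lists are assumptions loosely based on the MnasNet paper, not the authors' exact space, and every name here is hypothetical. The slide image with the actual ops is not part of this text, so treat the lists as placeholders.

```python
import random

NUM_BLOCKS = 7  # the talk describes 7 blocks

# Per-block choices (assumed, illustrative option lists).
BLOCK_CHOICES = {
    "conv_op": ["conv", "depthwise_conv", "mbconv"],
    "kernel_size": [3, 5],
    "se_ratio": [0.0, 0.25],
    "skip_op": ["none", "identity", "pool"],
    "num_layers_delta": [-1, 0, 1],
    "filter_multiplier": [0.75, 1.0, 1.25],
}


def sample_block(rng):
    """One decision per option list; a random stand-in for the RNN controller."""
    return {name: rng.choice(options) for name, options in BLOCK_CHOICES.items()}


def sample_network(rng):
    """Blocks are sampled independently, so different blocks can differ."""
    return [sample_block(rng) for _ in range(NUM_BLOCKS)]


def expand_block(block, base_layers=3):
    """Within a block, the same layer is repeated; base_layers is an assumed default."""
    n = max(1, base_layers + block["num_layers_delta"])
    return [dict(block) for _ in range(n)]


net = sample_network(random.Random(0))
for i, block in enumerate(net):
    print(i, len(expand_block(block)), block)
```

Sampling one layer type per block, rather than one per layer, keeps the search space small while still letting early and late blocks use different ops.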
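ENAS also has cells, a concept similar to blocks, but those cells look extremely complex even at a glance, which makes extensive search difficult.
There are two main experiments. These are also the two tasks Google officially publishes in its repo, which looks entirely strategic.
One point worth noting about AutoML training: each candidate is trained only very briefly on ImageNet, and that accuracy is what gets fed back.
Blue is the MnasNet result, which is naturally better. The red box marks the most basic configuration.
In classification, at the same speed, MnasNet is 2% more accurate.
At the same accuracy, it is 1.5x faster.
The network MnasNet found uses 3x3 depthwise separable and 1x1 convolutions, skip connections, and inverted residual blocks, though this is partly by design. One notable point: it uses many 5x5 kernels, which is a trend in AutoML.
As we know, two 3x3 convolutions have the same receptive field as one 5x5 at lower computational cost. With depthwise separable convolutions, however, a 5x5 is faster under certain conditions.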
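The "certain conditions" can be checked with a small cost model. The sketch assumes the multiply-add count for a depthwise separable convolution, H * W * M * (k*k + N) for M input and N output channels, and takes M = N for simplicity. The function names and the 14x14 feature-map size are illustrative assumptions.

```python
def separable_cost(h, w, c_in, c_out, k):
    """Multiply-adds of a k x k depthwise conv followed by a 1x1 pointwise conv."""
    return h * w * c_in * (k * k + c_out)


def one_5x5(h, w, n):
    return separable_cost(h, w, n, n, 5)


def two_3x3(h, w, n):
    return 2 * separable_cost(h, w, n, n, 3)


for n in (4, 7, 8, 32):
    print(n, one_5x5(14, 14, n), two_3x3(14, 14, n))
```

Under this model the comparison reduces to 25 + N versus 18 + 2N, so a single 5x5 separable convolution is cheaper than two stacked 3x3 ones once the channel count N exceeds 7. Multiply-adds are still only a proxy, so on-device timing remains the final check.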
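A more interesting analysis: (d) is the MobileNetV2 layer and (f) is the MobileNetV1 layer. A network built from only one of them is worse in speed or accuracy than the mixed configuration.
The results are not identical to the paper's, but under the same training conditions MnasNet does outperform V2.
In detection, too, it is naturally faster and better.
Summary: NAS for mobile environments; a reward designed to account for latency; latency measured on a real device.
As Jeff Dean said, Google already offers a Cloud ML service, and internally they say that models built with AutoML are the best. So how are we supposed to make a living!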
