Sim-to-Real Transfer in Deep Reinforcement Learning

•Download as PPTX, PDF•

0 likes•76 views

atulshah16

Technology

Deep Reinforcement Learning (DRL)
Fundatmentals
Deep Reinforcement Learning is an effective way to train robots to adapt to real world as it overcomes the
problem of data source sample inefficiency and the cost of collection.
It provides potentially infinite source of data as the agent explores the environment and exploits the
knowledge learned from its exploration.

Sim-to-Real Transfer
• Transferring of policies learned during training phase by robot to that
in real-world environment.
• There is a remarkable degradation in performance observed in
transitioning from simulated environment to real world.
• Learning via exploration in DRL is cost effective but the differences
between simulations and real-world scenarios pose challenges for the
process of learning.

Methods for Sim-to-Real Transfer
• Zero Shot Transfer
An extreme example of domain adaptation in which agent is exposed to unseen test samples which were not
available during training phase. Agent is expected to predict classes using meta representation of classes.
• System identification
Represent physical system via mathematical model and precisely calibrate the simulator
• Domain Randomization
Randomize the simulated environment so as to generalize the data distribution as in real world.
Visual Randomization and Dynamics Randomization.

Methods for Sim-to-Real Transfer
• Domain Adaptation Methods
To transfer knowledge from source domain to target which has limited data, we unify source and target feature
spaces.
• Learning with disturbances
Introduce perturbations in the simulation to minimize mismatches between simulation and real-world
environment.
• Simulation environments
Carefully calibrated simulation environments to introduce realism. E.g Gazebo, Unity3D, and PyBullet or
MuJoCo.

Challenges
• Domain Randomizations: Hard to determine what and how the randomizations
work for the simulations.
• Domain Adaptations: Feature space of source and target domains may not be
easily unified.

Conclusion
• A need to add more realism to the simulation environment to have a successful
sim-to-real transfer of knowledge.
• Domain randomization and domain adaptation are most commonly used
methods.
• Policy distillation for multi-task learning while meta learning for variety of tasks
can be utilized.
• This field has provided opportunities for future research in the domain of
transferring knowledge.

What's hot

Federated learningMindos Cheng

Machine Learning on AWSAmazon Web Services

Explainable AI (XAI) - A Perspective Saurabh Kaushik

Deep Learning Explained: The future of Artificial Intelligence and Smart Netw...Melanie Swan

Poisoning attacks on Federated Learning based IoT Intrusion Detection SystemSai Kiran Kadam

Big Data Stockholm v 7 | "Federated Machine Learning for Collaborative and Se...Dataconomy Media

Supervised and unsupervised learningAmAn Singh

Meta learning tutorialJoaquin Vanschoren

Cloud computing and artificial intelligenceFurqan Haider

Federated Machine Learning FrameworkAnup kumar

Adversarial Attacks on A.I. Systems — NextCon, Jan 2019anant90

Deep Learning Applications to Satellite Imageryrlewis48

Master's Thesis PresentationWajdi Khattel

Uncertainty Quantification in AIFlorian Wilhelm

Data Quality for Machine Learning TasksHima Patel

Multi-Layer PerceptronsESCOM

Computational learning theoryswapnac12

Explainable Machine Learning (Explainable ML)Hayim Makabee

Federated learning in briefShashi Perera

Explainable AI is not yet Understandable AIepsilon_tud

What's hot (20)

Federated learning

Machine Learning on AWS

Explainable AI (XAI) - A Perspective

Deep Learning Explained: The future of Artificial Intelligence and Smart Netw...

Poisoning attacks on Federated Learning based IoT Intrusion Detection System

Big Data Stockholm v 7 | "Federated Machine Learning for Collaborative and Se...

Supervised and unsupervised learning

Meta learning tutorial

Cloud computing and artificial intelligence

Federated Machine Learning Framework

Adversarial Attacks on A.I. Systems — NextCon, Jan 2019

Deep Learning Applications to Satellite Imagery

Master's Thesis Presentation

Uncertainty Quantification in AI

Data Quality for Machine Learning Tasks

Multi-Layer Perceptrons

Computational learning theory

Explainable Machine Learning (Explainable ML)

Federated learning in brief

Explainable AI is not yet Understandable AI

Similar to Sim-to-Real Transfer in Deep Reinforcement Learning

The deep bootstrap 논문 리뷰Seonghoon Jung

Presentation File of paper "Leveraging Normalization Layer in Adapters With P...dyyjkd

Transfer Learning for Improving Model Predictions in Robotic SystemsPooyan Jamshidi

Preliminary Exam SlidesDebasmit Das

How useful is self-supervised pretraining for Visual tasks?Seunghyun Hwang

Moving object detection in complex sceneKumar Mayank

Fcv rep darrellzukun

Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...gabrielesisinna

Graph Matching Unsupervised Domain Adaptation Debasmit Das

“DNN Training Data: How to Know What You Need and How to Get It,” a Presentat...Edge AI and Vision Alliance

Indoor scene understanding for autonomous agentsVarun Bhaseen

Deep Learning in Limited Resource EnvironmentsOguzVuruskaner

PhD Defense SlidesDebasmit Das

[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...Ziyuan Zhao

Computer modelling and simulationstangytangling

PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee

GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...Konstantinos Demertzis

Preference learning for guiding the tree searches in continuous POMDPs (CoRL ...Jisu Han

MACHINE LEARNING YEAR DL SECOND PART.pptxNAGARAJANS68

Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...AzarulIkhwan

Similar to Sim-to-Real Transfer in Deep Reinforcement Learning (20)

The deep bootstrap 논문 리뷰

Presentation File of paper "Leveraging Normalization Layer in Adapters With P...

Transfer Learning for Improving Model Predictions in Robotic Systems

Preliminary Exam Slides

How useful is self-supervised pretraining for Visual tasks?

Moving object detection in complex scene

Fcv rep darrell

Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...

Graph Matching Unsupervised Domain Adaptation

“DNN Training Data: How to Know What You Need and How to Get It,” a Presentat...

Indoor scene understanding for autonomous agents

Deep Learning in Limited Resource Environments

PhD Defense Slides

[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...

Computer modelling and simulations

PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...

GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...

Preference learning for guiding the tree searches in continuous POMDPs (CoRL ...

MACHINE LEARNING YEAR DL SECOND PART.pptx

Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...

Recently uploaded

Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

Pigging Solutions Piggable Sweeping ElbowsPigging Solutions

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

How to Remove Document Management Hurdles with X-Docs?XfilesPro

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Slack Application Development 101 Slidespraypatel2

Understanding the Laravel MVC ArchitecturePixlogix Infotech

AI as an Interface for Commercial BuildingsMemoori

Recently uploaded (20)

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Azure Monitor & Application Insight to monitor Infrastructure & Application

Human Factors of XR: Using Human Factors to Design XR Systems

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Breaking the Kubernetes Kill Chain: Host Path Mount

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

Presentation on how to chat with PDF using ChatGPT code interpreter

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

Pigging Solutions Piggable Sweeping Elbows

Injustice - Developers Among Us (SciFiDevCon 2024)

How to Remove Document Management Hurdles with X-Docs?

The 7 Things I Know About Cyber Security After 25 Years | April 2024

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

A Domino Admins Adventures (Engage 2024)

Slack Application Development 101 Slides

Understanding the Laravel MVC Architecture

AI as an Interface for Commercial Buildings

Sim-to-Real Transfer in Deep Reinforcement Learning

1. Sim-to-Real Transfer in Deep Reinforcement Learning Student ID: 014530243 Name: Atul Shah

2. Deep Reinforcement Learning (DRL) Fundatmentals Deep Reinforcement Learning is an effective way to train robots to adapt to real world as it overcomes the problem of data source sample inefficiency and the cost of collection. It provides potentially infinite source of data as the agent explores the environment and exploits the knowledge learned from its exploration.

3. Sim-to-Real Transfer • Transferring of policies learned during training phase by robot to that in real-world environment. • There is a remarkable degradation in performance observed in transitioning from simulated environment to real world. • Learning via exploration in DRL is cost effective but the differences between simulations and real-world scenarios pose challenges for the process of learning.

4. Sim-to-Real and related fields

5. Methods for Sim-to-Real Transfer • Zero Shot Transfer An extreme example of domain adaptation in which agent is exposed to unseen test samples which were not available during training phase. Agent is expected to predict classes using meta representation of classes. • System identification Represent physical system via mathematical model and precisely calibrate the simulator • Domain Randomization Randomize the simulated environment so as to generalize the data distribution as in real world. Visual Randomization and Dynamics Randomization.

6. Methods for Sim-to-Real Transfer • Domain Adaptation Methods To transfer knowledge from source domain to target which has limited data, we unify source and target feature spaces. • Learning with disturbances Introduce perturbations in the simulation to minimize mismatches between simulation and real-world environment. • Simulation environments Carefully calibrated simulation environments to introduce realism. E.g Gazebo, Unity3D, and PyBullet or MuJoCo.

7. Domain Randomization Overview

8. Challenges • Domain Randomizations: Hard to determine what and how the randomizations work for the simulations. • Domain Adaptations: Feature space of source and target domains may not be easily unified.

9. Conclusion • A need to add more realism to the simulation environment to have a successful sim-to-real transfer of knowledge. • Domain randomization and domain adaptation are most commonly used methods. • Policy distillation for multi-task learning while meta learning for variety of tasks can be utilized. • This field has provided opportunities for future research in the domain of transferring knowledge.

Sim-to-Real Transfer in Deep Reinforcement Learning

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Sim-to-Real Transfer in Deep Reinforcement Learning

Similar to Sim-to-Real Transfer in Deep Reinforcement Learning (20)

Recently uploaded

Recently uploaded (20)

Sim-to-Real Transfer in Deep Reinforcement Learning