AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents

•

1 like•775 views

1. Unity ML-Agents is a platform for training intelligent agents to make good decisions in complex 3D simulated environments built with Unity. 2. It allows training agents using reinforcement learning algorithms like PPO through Python and then embedding the trained models back into Unity for inference. 3. The workflow involves creating a Unity environment with agents, brains, and rewards; training agents using the Python API; and embedding the trained model back into Unity for deployment. This enables training agents in photo-realistic 3D simulations.

Technology

AI NEXTCon Seattle ‘18
1/17-20th | Seattle
#ainextcon
http://aisea18.xnextcon.com

Bringing Machine Learning to the Unity
Creation Engine with ML-Agents
Arthur Juliani

About Unity
Leading global game industry platform
• 16 Billion Downloads in 2016 – up
31%
• 2.6 Billion Unique Devices
• 700 Million Gamers
• 38% of top 1000 free mobile games

ML Training Platforms
VizDoom Malmo Atari
PyBulletDeepMind Lab SCII LE

Visual Complexity Cognitive ComplexityPhysical Complexity
ML Training Environments

Unity ML-Agents Workflow
Create Environment Train Agents Embed Agents

Create Environment (Unity)
1. Create Scene
2. Add Academy, Brain(s), and
Agent(s)
3. Define Observations, Actions,
and Rewards
4. Build Executable

Create Environment (Unity)
Perceive & Act
Decide
Coordinate

Agents
• Agents are GameObjects within the Unity scene.
• They perceive the environment via observations, take actions,
and optionally receive rewards.
• Each agent is linked to a brain, which makes decisions for the
agent.

Brains
• Player – Actions are decided by user input through keyboard or
gamepad.
• Heuristic – Actions are decided by C# script using state input.
• External – Actions are decided using Tensorflow via Python
interface.
• Internal – Actions are decided using Tensorflow model
embedded into project.

Unity ML-Agents Workflow
Build Environment Train Agents Embed Agents

Train Agents (Python)
• Launch an environment from python with
env = UnityEnvironment(“my_environment”)
• Interact with gym-like interface:
env.reset()
env.step()
env.close()

Train Agents (Python)
• Proximal Policy Optimization (PPO)
algorithm included by default.
• Works with Continuous and Discrete Control,
plus image and/or vector inputs.
• Supports multiple-agents per brain within a
scene (and multiple brains coming in v0.3!)
• Monitor progress with TensorBoard.

Embed Agents (Unity)
• Once a model is trained, it can be
exported into the Unity project.
• Simply drop .bytes file into Unity
project, and use it in corresponding
Brain with “Internal” mode.
• Support for Mac, Windows, Linux,
iOS, and Android.

Agents
• Learning environments are populated with agents, which are the
actors within the scene.
• Each actor contains functions which define their state variables,
observation cameras, action effects, and reward function.
• Agents are linked to game objects within a scene.

Brains
• Each brain corresponds to a policy for deciding actions within a
learning environment.
• Many agents can be linked to a single brain.
• Brains define the state and action space for their linked agents.
• Multiple brains can be within single environment.
• Four different types of brain.

Academy
• Only one per scene/environment.
• Responsibilities:
• Configures engine for Training or Inference
• Coordinates all Agents and Brains
• Contains set of custom “reset parameters” which can be used for
Curriculum Learning.

Twelve Agents, One Brain,
Independent Rewards

Four Agents, Two Brains:
Competitive Multi-Agent

Multi-Stage Soccer Training
Defense
Train one brain with
negative reward for ball
entering their goal
Offense
Train one brain with
positive reward for ball
entering opponents goal
Combined
Train both brains together
to play against opponent
team

Multi-Agent Scenarios
• Competitive Multi-Agent
Multiple agents, multiple brains, inverse rewards
• Cooperative Multi-Agent
Multiple agents, multiple brains, shared rewards
• Ecosystem Simulations
Multiple agents, multiple brains, independent rewards

What's hot

Game Development with Unitydavidluzgouveia

Game Project / Working with UnityPetri Lankoski

Introduction-to-Unity.pptGravityboi

FlutterHimanshu Singh

Unity3D ProgrammingMichael Ivanov

Practical QML - Key Navigation, Dynamic Language and Theme ChangeBurkhard Stubert

Introduction to QtPuja Pramudya

Java class 6Edureka!

Android - Android Intent TypesVibrant Technologies & Computers

Flutter presentation.pptxFalgunSorathiya

2-Agents- Artificial IntelligenceMhd Sb

ASP.Net Technologies Part-1Vasudev Sharma

Introduction to UnityUniversity of Auckland

Android application-componentLy Haza

Android lifecycleKumar

Android UInationalmobileapps

Network programming in JavaTushar B Kute

Artificial intelligence and video gamesSimple_Harsh

Unity - Game EngineGeeks Anonymes

Intelligent web applicationsPriti Srinivas Sajja

What's hot (20)

Game Development with Unity

Game Project / Working with Unity

Introduction-to-Unity.ppt

Flutter

Unity3D Programming

Practical QML - Key Navigation, Dynamic Language and Theme Change

Introduction to Qt

Java class 6

Android - Android Intent Types

Flutter presentation.pptx

2-Agents- Artificial Intelligence

ASP.Net Technologies Part-1

Introduction to Unity

Android application-component

Android lifecycle

Android UI

Network programming in Java

Artificial intelligence and video games

Unity - Game Engine

Intelligent web applications

Similar to AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents

How to generate game character behaviors using AI and ML - Unite CopenhagenUnity Technologies

MASRitesh Nayak

unity gaming programing basics for students pptKathiriyaParthiv

The Evolution Of Eclipse 1. 1 )Patty Buckley

Deep Learning with CNTKAshish Jaiman

Autonomous Machines with Project BonsaiIvo Andreev

Unity3d Game Development - CreatiosoftCreatioSoft

01 foundationsankit_ppt

Biological organism simulation using procedural growth "Organimo 1.0"Devyani Singh

Android Mobile App Development basics PPTnithya697634

Yocto projectUniversity of Texas at Dallas

AI for Everyone: Master the BasicsStutty Srivastava

OpenAI Gym & UniverseEntrepreneur / Startup

Agent-based System - IntroductionBambang Purnomosidi D. P.

Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...Skelton Thatcher Consulting Ltd

DSD-INT 2014 - OpenMI Symposium - Federated modelling of Critical Infrastruct...Deltares

AI Dominoes ProjectAdil Gasimov

AI on the EdgeJared Rhodes

BYOD: Build Your First VR Experience with Unreal EngineMichael Sheyahshe

Similar to AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents (20)

How to generate game character behaviors using AI and ML - Unite Copenhagen

MAS

unity gaming programing basics for students ppt

The Evolution Of Eclipse 1. 1 )

Deep Learning with CNTK

Autonomous Machines with Project Bonsai

Unity3d Game Development - Creatiosoft

01 foundations

Biological organism simulation using procedural growth "Organimo 1.0"

Android Mobile App Development basics PPT

Yocto project

AI for Everyone: Master the Basics

OpenAI Gym & Universe

Agent-based System - Introduction

Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...

DSD-INT 2014 - OpenMI Symposium - Federated modelling of Critical Infrastruct...

AI Dominoes Project

AI on the Edge

BYOD: Build Your First VR Experience with Unreal Engine

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity

APIForce Zurich 5 April Automation LPDGMarianaLemus7

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge

Key Features Of Token Development (1).pptxLBM Solutions

Vulnerability_Management_GRC_by Sohang Sengupta.pptxnull - The Open Security Community

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

costume and set research powerpoint presentationphoebematthew05

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

Pigging Solutions in Pet Food ManufacturingPigging Solutions

"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro

WordPress Websites for Engineers: Elevate Your Brandgvaughan

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web

APIForce Zurich 5 April Automation LPDG

SQL Database Design For Developers at php[tek] 2024

Designing IA for AI - Information Architecture Conference 2024

Key Features Of Token Development (1).pptx

Vulnerability_Management_GRC_by Sohang Sengupta.pptx

Human Factors of XR: Using Human Factors to Design XR Systems

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

DMCC Future of Trade Web3 - Special Edition

costume and set research powerpoint presentation

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...

My Hashitalk Indonesia April 2024 Presentation

Advanced Test Driven-Development @ php[tek] 2024

Pigging Solutions in Pet Food Manufacturing

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn

Streamlining Python Development: A Guide to a Modern Project Setup

Unraveling Multimodality with Large Language Models.pdf

WordPress Websites for Engineers: Elevate Your Brand

SIP trunking in Janus @ Kamailio World 2024

Unblocking The Main Thread Solving ANRs and Frozen Frames

AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents

1. AI NEXTCon Seattle ‘18 1/17-20th | Seattle #ainextcon http://aisea18.xnextcon.com

2. Bringing Machine Learning to the Unity Creation Engine with ML-Agents Arthur Juliani

3. About Unity Leading global game industry platform • 16 Billion Downloads in 2016 – up 31% • 2.6 Billion Unique Devices • 700 Million Gamers • 38% of top 1000 free mobile games

4. Machine Learning

5. Machine Learning And Games?

10. Yet another ML training platform?

11. ML Training Platforms VizDoom Malmo Atari PyBulletDeepMind Lab SCII LE

12. Visual Complexity Cognitive ComplexityPhysical Complexity ML Training Environments

13. The Unity Ecosystem

14. Unity ML-Agents Workflow Create Environment Train Agents Embed Agents

15. Unity ML-Agents Workflow Create Environment Train Agents Embed Agents

16. Create Environment (Unity) 1. Create Scene 2. Add Academy, Brain(s), and Agent(s) 3. Define Observations, Actions, and Rewards 4. Build Executable

17. Create Environment (Unity) Perceive & Act Decide Coordinate

18. Agents • Agents are GameObjects within the Unity scene. • They perceive the environment via observations, take actions, and optionally receive rewards. • Each agent is linked to a brain, which makes decisions for the agent.

19. Brains • Player – Actions are decided by user input through keyboard or gamepad. • Heuristic – Actions are decided by C# script using state input. • External – Actions are decided using Tensorflow via Python interface. • Internal – Actions are decided using Tensorflow model embedded into project.

20. Unity ML-Agents Workflow Build Environment Train Agents Embed Agents

21. Agent Training Process

22. Train Agents (Python) • Launch an environment from python with env = UnityEnvironment(“my_environment”) • Interact with gym-like interface: env.reset() env.step() env.close()

23. Train Agents (Python) • Proximal Policy Optimization (PPO) algorithm included by default. • Works with Continuous and Discrete Control, plus image and/or vector inputs. • Supports multiple-agents per brain within a scene (and multiple brains coming in v0.3!) • Monitor progress with TensorBoard.

24. Unity ML-Agents Workflow Build Environment Train Agents Embed Agents

25. Embed Agents (Unity) • Once a model is trained, it can be exported into the Unity project. • Simply drop .bytes file into Unity project, and use it in corresponding Brain with “Internal” mode. • Support for Mac, Windows, Linux, iOS, and Android.

26. Agents • Learning environments are populated with agents, which are the actors within the scene. • Each actor contains functions which define their state variables, observation cameras, action effects, and reward function. • Agents are linked to game objects within a scene.

27. Brains • Each brain corresponds to a policy for deciding actions within a learning environment. • Many agents can be linked to a single brain. • Brains define the state and action space for their linked agents. • Multiple brains can be within single environment. • Four different types of brain.

28. Academy • Only one per scene/environment. • Responsibilities: • Configures engine for Training or Inference • Coordinates all Agents and Brains • Contains set of custom “reset parameters” which can be used for Curriculum Learning.

29. Learning Scenarios

30. One Agent, One Brain

31.

32. Twelve Agents, One Brain, Independent Rewards

33.

34. Two Agents, One Brain, Inverse Rewards

35.

36. Physical Manipulation Reacher Crawler

37. Four Agents, Two Brains: Competitive Multi-Agent

38. Multi-Stage Soccer Training Defense Train one brain with negative reward for ball entering their goal Offense Train one brain with positive reward for ball entering opponents goal Combined Train both brains together to play against opponent team

39.

40. Multi-Agent Scenarios • Competitive Multi-Agent Multiple agents, multiple brains, inverse rewards • Cooperative Multi-Agent Multiple agents, multiple brains, shared rewards • Ecosystem Simulations Multiple agents, multiple brains, independent rewards

AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents

Similar to AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents (20)

More from Bill Liu

More from Bill Liu (20)

Recently uploaded

Recently uploaded (20)

AI NEXTCon Seattle '18: Bringing Machine Learning to the Unity Creation Engine with ML-Agents