How games are driving advances in AI research - Unite Copenhagen 2019

Driving AI Research
Using Games
Jeffrey Shih
Lead Product Manager, AI
Unity Technologies
@shihzy
Dr. Matt Crosby
Postdoctoral Researcher
Leverhulme Centre for the
Future of Intelligence
Imperial College London
@MaCroPhilosophy
Benjamin Beyret
Researcher
Leverhulme Centre for the
Future of Intelligence
Imperial College London
@BenBeyret

Video games have been used to drive artificial intelligence research
(for a long time)

Video games
A few examples
● Atari 2600 Games - DeepMind, OpenAI
● Doom - ViZDoom from Poznan University of Technology
● Quake 3 - DeepMind
● Minecraft - Microsoft Project Malmo
● StarCraft II - DeepMind / Blizzard
● Dota 2 - OpenAI Five

Video games
Some novel approaches in AI
● Deep Q-Network (DQN) paper published by DeepMind
● Proximal Policy Optimization (PPO) paper published by OpenAI
● New AI systems and approaches to beating top players in StarCraft II and Dota 2

Demis Hassabis - co-founder and CEO of DeepMind
"As a former video game designer myself, I couldn’t be more excited to be collaborating with Unity, creating virtual environments for developing and testing the kind of smart, flexible algorithms we need to tackle real-world problems."

AI-Based Challenges Using Unity
The Animal-AI Olympics

History of the Obstacle Tower Challenge
— February 18, 2019: Launched 1st qualifying round on AIcrowd with Google Cloud Platform as a co-sponsor. Over $100K in prizes.
— Qualifying round participation: 2000+ entries from 350+ teams
— May 15, 2019: 2nd round launched
— August 7, 2019: Winners announced and Obstacle Tower open-sourced

Obstacle Tower Environment
Research and benchmark areas
— Vision
– High-fidelity 3D visuals
– Real-time lighting/shadows
— Control
– Platforming puzzles
— Planning
– Complex floor layouts
— Generalization
– Procedural floors, rooms, and visuals

Different Floors and Themes

Obstacle Tower Environment
Procedurally generated design
— Each episode a new tower
– Each tower filled with 25 floors
– Each floor filled with rooms
– Each room filled with obstacles and puzzles!
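
The tower, floor, room hierarchy described above can be sketched as seeded procedural generation. This is an illustration only, not Obstacle Tower's actual generator: the `generate_tower` function, the room and content counts, and the content names are all hypothetical. Only the 25 floors per tower comes from the slide.

```python
import random
from dataclasses import dataclass, field

# Hypothetical content types; the slides do not enumerate the real ones.
CONTENT_TYPES = ["obstacle", "puzzle"]

@dataclass
class Room:
    contents: list

@dataclass
class Floor:
    rooms: list

@dataclass
class Tower:
    seed: int
    floors: list = field(default_factory=list)

def generate_tower(seed, num_floors=25, max_rooms=4):
    """Build one tower from a seed; the same seed always gives the same tower."""
    rng = random.Random(seed)
    floors = []
    for _ in range(num_floors):
        rooms = []
        for _ in range(rng.randint(1, max_rooms)):
            # Each room gets one to three randomly chosen obstacles or puzzles.
            contents = [rng.choice(CONTENT_TYPES) for _ in range(rng.randint(1, 3))]
            rooms.append(Room(contents))
        floors.append(Floor(rooms))
    return Tower(seed, floors)

# A new seed per episode gives a new tower per episode.
tower = generate_tower(seed=0)
print(len(tower.floors))  # 25
```

Drawing every random choice from a single seeded generator means a seed identifies a tower, so an agent can be trained on one set of seeds and tested for generalization on another.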

Learnings...
Winner: Alex Nichol
— Classifier for object identification
— Imitation Learning
— PPO (Proximal Policy Optimization) for fine-tuning behavior
2nd: Gianni & Miha
— PPO with modifications
— Sampling algorithm
— 10 billion steps sampled
3rd: Songbin Choi
— Standard PPO
— Human play experience added
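
All three top entries built on PPO. As a reminder of what that algorithm optimizes, here is a minimal sketch of the clipped surrogate objective from the PPO paper, in plain Python. It is not any team's code. The function name is hypothetical, and `clip_eps=0.2` is an illustrative default rather than a value taken from the slides.

```python
def ppo_clipped_objective(ratios, advantages, clip_eps=0.2):
    """Mean over the batch of min(r * A, clip(r, 1 - eps, 1 + eps) * A).

    ratios:     new_policy_prob / old_policy_prob for each sampled action
    advantages: advantage estimate for each sampled action
    """
    total = 0.0
    for r, a in zip(ratios, advantages):
        # Keep the probability ratio within [1 - eps, 1 + eps].
        clipped_r = max(1.0 - clip_eps, min(r, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(r * a, clipped_r * a)
    return total / len(ratios)

# The objective is maximized during training; a loss would be its negative.
print(ppo_clipped_objective([1.5, 0.5], [1.0, -1.0]))
```

The clipping removes the incentive to move the new policy far from the one that collected the data, which is what lets PPO reuse each batch of samples for several gradient steps.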

The Animal-AI Olympics
Download it today
github.com/beyretb/AnimalAI-Olympics
animalaiolympics.com

Animal Cognition Tasks
Can we build AI systems to do this?
— No!
— It’s way too hard.
Can we build a research pathway?
— Yes!
— Using Unity ML-Agents

Robot Corvids
— Detect that food is inside the containers
— Detect that the containers have open tops
— Detect objects that could help
— Successfully grip an object
— Notice different substances
— Not to mention anything required to actually solve the problem!

Design Principles
— Simple tasks
— Simple affordances
— Abstract from animals

The Animal-AI Environment: Possibilities
— Goal is to get food
— Simple building blocks can be combined into complex structures and experiments.

Use example: Maze Navigation
Test animals’ navigation skills
Claude Shannon

Experimentation design

Step 1: Design training

In practice
[Diagram: AGENT, ENVIRONMENT, ACTION, OBSERVATIONS, REWARD, LESSON]
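
The agent and environment exchange shown in the diagram can be sketched as a plain interaction loop. Everything here is a toy stand-in, not the Animal-AI or ML-Agents API: `CorridorEnv`, `RandomAgent`, and `run_lesson` are hypothetical names. Treating a "lesson" as one arena configuration in a sequence of increasingly hard ones is an assumption, since the slides do not define the term.

```python
import random

class CorridorEnv:
    """Toy stand-in for an arena: the agent walks a 1-D corridor toward food."""

    def __init__(self, length):
        self.length = length  # the food sits at the far end
        self.position = 0

    def reset(self):
        self.position = 0
        return self.position  # observation: the agent's position

    def step(self, action):
        # action is -1 (step back) or +1 (step forward)
        self.position = max(0, min(self.length, self.position + action))
        done = self.position == self.length
        reward = 1.0 if done else -0.01  # small time penalty until the food is reached
        return self.position, reward, done

class RandomAgent:
    """Placeholder policy; a learning agent would use observations and rewards."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)

    def act(self, observation):
        return self.rng.choice([-1, 1])

def run_lesson(agent, env, max_steps=1000):
    """Run one episode in one arena and return the total reward."""
    observation = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = agent.act(observation)               # agent -> environment: ACTION
        observation, reward, done = env.step(action)  # environment -> agent: OBSERVATIONS, REWARD
        total_reward += reward
        if done:
            break
    return total_reward

# Hypothetical curriculum: each lesson is a longer corridor than the last.
agent = RandomAgent(seed=0)
for length in [2, 4, 8]:
    print(length, run_lesson(agent, CorridorEnv(length)))
```

Keeping the loop independent of the arena means the same agent code can be run against any configuration, which is what allows training on one set of arenas and testing on another.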

Step 3: Enjoy

10 categories, 300 tests
1. Food
2. Preferences
3. Obstacles
4. Avoidance
5. Spatial Reasoning
6. Generalisation
7. Internal Models
8. Object Permanence
9. Advanced Preferences
10. Causal Reasoning

Competition Results
— 50+ teams
— 25,000+ correct solutions
— One month left to go!

Conclusions
— A gaming-style environment allows us to build an arena with simulated physics and visual inputs.
— Many tests used on animals in the real world can be translated to our environment.
— This provides a research framework for building AIs with animal-like skills.
— A crucial step towards Artificial General Intelligence.

Learn more!
Additional Information
(Point Camera)
Chat with us, demos and more!
Visit us at the Games AI Kiosk on
the Expo Floor!
https://tinyurl.com/unite-airesearch
