SlideShare a Scribd company logo
1 of 1
Download to read offline
Learning to Recognize Distance to Stop Signs Using
the Virtual World of Grand Theft Auto 5
Artur Filipowicz, Jeremiah Liu, Alain Kornhauser
Operations Research and Financial Engineering
Princeton University
The Problem
Detect the presence of a stop sign and determine the distance to it based on an
image.
Related Works
• Method employs both single-image and multi-view analysis for a 97% traffic sign
classification rate (1).
• A neural network trained to perform traffic sign classification on images obtains an
accuracy of 95% (2).
• (1) uses multiple views to locate 95% of signs within 3 m. of the real location at 2
images per second.
• Algorithms in (3) can determine traffic sign position with error between 0.2 m and
1.6 m within the range of 7 m to 25 m from the stop sign.
The Proposed Solution
• Generate a dataset of synthetic autonomically labeled driving scenes using the
video game Grand Theft Auto 5.
• Train a convolutional neural network (CNN) with an architecture from (4) which
runs at 10 images per second.
Image: 𝑿∈ℝ
Output: 𝒀﹦[Ⅱ , 𝒅 ]
Network: ⨍(𝑿)→𝒀
𝒉∗𝒘∗𝒄
Stop Stop
STOP ⅡStop
𝒅Stop
Camera
Vehicle
Grand Theft Auto 5
• Video game with a rich road environment
• Vehicles, pedestrians and animals
• Road network of bridges, tunnels, freeways, and intersections
• Urban, suburban, rural, desert and woodland environments
• 14 weather conditions
• Lighting conditions for 24 hours of the day
Figure 1: Diagram of the problem.
Figure 2: Driving scene in different weather and lighting
conditions generated from Grand Theft Auto 5.
Figure 3: A stop sign 10 meters away in Grand Theft Auto
5 (left) and the real world (right).
• Using Grand Theft Auto 5, we’ve generated a dataset of 1.4 million images with
and without stop signs in different light and weather conditions and locations.
Result
• The convolutional neural network is accurate on game images as well as on real
world images.
Table 1: Performance on images from Grand Theft Auto 5
Range Accuracy
False Negative
Rate
False Positive
Rate
Mean AE (m) Median AE (m)
Mean AE (m)
when correct
Median AE (m)
when correct
0m - 10m 0.961 0.039 na 2.2 0.9 1.2 0.8
10m - 20m 0.949 0.051 na 3.3 1.7 2.4 1.6
20m - 30m 0.798 0.202 na 4.7 3.4 3.1 2.7
30m - 40m 0.440 0.560 na 3.4 2.6 3.1 2.1
> 40m 0.944 na 0.056 1.8 0.2 0.9 0.2
na = not applicable
Table 2: Performance on 200 real world images.
Range Accuracy
False Negative
Rate
False Positive
Rate
Mean AE (m) Median AE (m)
Mean AE (m)
when correct
Median AE (m)
when correct
0m - 10m 1.000 0.000 na 9.1 8.9 9.1 8.9
10m - 20m 0.750 0.25 na 16.4 16.3 15.1 15.7
20m - 30m 0.968 0.032 na 10.0 10.9 9.8 10.8
30m - 40m 0.687 0.313 na 5.9 4.2 7.5 6.1
> 40m 1.000 na 0.000 0.9 0.4 0.9 0.4
na = not applicable
Conclusions
• Virtual environment enables creative data collection methods
• CNN can detect 95.5% of the stops signs within 20 meters with an average error
in distance of 1.2m to 2.4m on video game data
• Need for research on real world adaptation
• Need for research on optimal use of simulators
References
• [1] Radu Timofte, Karel Zimmermann, and Luc Van Gool. Multi-view traffic sign detection, recognition, and 3d
localisation. Machine Vision and Applications, 25(3):633–647, 2014.
• [2] Arturo De La Escalera, Luis E Moreno, Miguel Angel Salichs, and José María Armingol. Road traffic sign
detection and classification. IEEE transactions on industrial electronics, 44(6):848–859, 1997.
• [3] André Welzel, Andreas Auerswald, and Gerd Wanielik. Accurate camera-based traffic sign localization. In 17th
International IEEE Conference on Intelligent Transportation Systems (ITSC), pages 445–450. IEEE, 2014.
• [4] Chenyi Chen, Ari Seff, Alain Kornhauser, and Jianxiong Xiao. Deepdriving: Learning affordance for direct
perception in autonomous driving. In Proceedings of the IEEE International Conference on Computer Vision,
pages 2722–2730, 2015.
Acknowledgment:
We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Tesla K40 GPU used for this research.

More Related Content

What's hot

Motion capture technology
Motion capture technologyMotion capture technology
Motion capture technology
Anvesh Ranga
 
Vision Based Traffic Surveillance System
Vision Based Traffic Surveillance SystemVision Based Traffic Surveillance System
Vision Based Traffic Surveillance System
maheshwaraneee
 
Photogrametry_3D_Modelling[1]
Photogrametry_3D_Modelling[1]Photogrametry_3D_Modelling[1]
Photogrametry_3D_Modelling[1]
Joachim Nkendeys
 

What's hot (20)

Motion Capturing Technology
Motion Capturing TechnologyMotion Capturing Technology
Motion Capturing Technology
 
THE 3D MODELLING USING FRAME CAMERAS AND PANORAMIC CAMERA
THE 3D MODELLING USING FRAME CAMERAS AND PANORAMIC CAMERATHE 3D MODELLING USING FRAME CAMERAS AND PANORAMIC CAMERA
THE 3D MODELLING USING FRAME CAMERAS AND PANORAMIC CAMERA
 
Final project report
Final project reportFinal project report
Final project report
 
AUTO LANDING PROCESS FOR AUTONOMOUS FLYING ROBOT BY USING IMAGE PROCESSING BA...
AUTO LANDING PROCESS FOR AUTONOMOUS FLYING ROBOT BY USING IMAGE PROCESSING BA...AUTO LANDING PROCESS FOR AUTONOMOUS FLYING ROBOT BY USING IMAGE PROCESSING BA...
AUTO LANDING PROCESS FOR AUTONOMOUS FLYING ROBOT BY USING IMAGE PROCESSING BA...
 
Advantages and Disadvantages of Motion Capture
Advantages and Disadvantages of Motion CaptureAdvantages and Disadvantages of Motion Capture
Advantages and Disadvantages of Motion Capture
 
Motion capture technology
Motion capture technologyMotion capture technology
Motion capture technology
 
Motion capture technology
Motion capture technologyMotion capture technology
Motion capture technology
 
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
 
DimEye Corp Presents Revolutionary VLS (Video Laser Scan) at SS IMMR 2013
DimEye Corp Presents Revolutionary VLS (Video Laser Scan) at SS IMMR 2013DimEye Corp Presents Revolutionary VLS (Video Laser Scan) at SS IMMR 2013
DimEye Corp Presents Revolutionary VLS (Video Laser Scan) at SS IMMR 2013
 
Vision Based Traffic Surveillance System
Vision Based Traffic Surveillance SystemVision Based Traffic Surveillance System
Vision Based Traffic Surveillance System
 
Team SNUSAT ARLISS 2015
Team SNUSAT ARLISS 2015Team SNUSAT ARLISS 2015
Team SNUSAT ARLISS 2015
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
 
Introduction to motion capture
Introduction to motion captureIntroduction to motion capture
Introduction to motion capture
 
Garbage Box: 3D Reconstruction and 4D Deformation Measurement
Garbage Box: 3D Reconstruction and 4D Deformation MeasurementGarbage Box: 3D Reconstruction and 4D Deformation Measurement
Garbage Box: 3D Reconstruction and 4D Deformation Measurement
 
Photogrametry_3D_Modelling[1]
Photogrametry_3D_Modelling[1]Photogrametry_3D_Modelling[1]
Photogrametry_3D_Modelling[1]
 
Motion capture technology
Motion capture technologyMotion capture technology
Motion capture technology
 
5 27pres
5 27pres5 27pres
5 27pres
 
Web odm workflow pres
Web odm workflow presWeb odm workflow pres
Web odm workflow pres
 
4 d motion / for orthodontists by Almuzian
4 d motion / for orthodontists by Almuzian4 d motion / for orthodontists by Almuzian
4 d motion / for orthodontists by Almuzian
 
Space mouse[1]
Space mouse[1]Space mouse[1]
Space mouse[1]
 

Similar to Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand Theft Auto 5 (Poster)

Similar to Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand Theft Auto 5 (Poster) (20)

ROBOTOR AN AUTONOMOUS VEHICLE FOR TARGET DETECTION AND SHOOTING
ROBOTOR AN AUTONOMOUS VEHICLE FOR TARGET DETECTION AND SHOOTINGROBOTOR AN AUTONOMOUS VEHICLE FOR TARGET DETECTION AND SHOOTING
ROBOTOR AN AUTONOMOUS VEHICLE FOR TARGET DETECTION AND SHOOTING
 
THE ACCURACY OF MOBILE PHONE CAMERA INSTEAD OF HIGH RESOLUTION CAMERA IN DIGI...
THE ACCURACY OF MOBILE PHONE CAMERA INSTEAD OF HIGH RESOLUTION CAMERA IN DIGI...THE ACCURACY OF MOBILE PHONE CAMERA INSTEAD OF HIGH RESOLUTION CAMERA IN DIGI...
THE ACCURACY OF MOBILE PHONE CAMERA INSTEAD OF HIGH RESOLUTION CAMERA IN DIGI...
 
The accuracy of mobile phone camera instead of high resolution camera in digi...
The accuracy of mobile phone camera instead of high resolution camera in digi...The accuracy of mobile phone camera instead of high resolution camera in digi...
The accuracy of mobile phone camera instead of high resolution camera in digi...
 
30120140506012 2
30120140506012 230120140506012 2
30120140506012 2
 
30120140506012 2
30120140506012 230120140506012 2
30120140506012 2
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
 
“Next-generation Computer Vision Methods for Automated Navigation of Unmanned...
“Next-generation Computer Vision Methods for Automated Navigation of Unmanned...“Next-generation Computer Vision Methods for Automated Navigation of Unmanned...
“Next-generation Computer Vision Methods for Automated Navigation of Unmanned...
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
 
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillance
 
Utilization f LiDAR and IKONOS for Security Hotspot Analysis based on Realism...
Utilization f LiDAR and IKONOS for Security Hotspot Analysis based on Realism...Utilization f LiDAR and IKONOS for Security Hotspot Analysis based on Realism...
Utilization f LiDAR and IKONOS for Security Hotspot Analysis based on Realism...
 
Multi-Objective Convolutional Neural Networks for Robot Localisation and 3D P...
Multi-Objective Convolutional Neural Networks for Robot Localisation and 3D P...Multi-Objective Convolutional Neural Networks for Robot Localisation and 3D P...
Multi-Objective Convolutional Neural Networks for Robot Localisation and 3D P...
 
Concealed Weapon Detection
Concealed Weapon Detection Concealed Weapon Detection
Concealed Weapon Detection
 
2009- Smal Size Virtual Slides and 4M microscopes
2009- Smal Size Virtual Slides and 4M microscopes2009- Smal Size Virtual Slides and 4M microscopes
2009- Smal Size Virtual Slides and 4M microscopes
 
Thesis_Khan_2073209.pptx
Thesis_Khan_2073209.pptxThesis_Khan_2073209.pptx
Thesis_Khan_2073209.pptx
 
Intelligent indoor mobile robot navigation using stereo vision
Intelligent indoor mobile robot navigation using stereo visionIntelligent indoor mobile robot navigation using stereo vision
Intelligent indoor mobile robot navigation using stereo vision
 
Traffic Violation Detection Using Multiple Trajectories of Vehicles
Traffic Violation Detection Using Multiple Trajectories of VehiclesTraffic Violation Detection Using Multiple Trajectories of Vehicles
Traffic Violation Detection Using Multiple Trajectories of Vehicles
 
IRJET - Real-Time Analysis of Video Surveillance using Machine Learning a...
IRJET -  	  Real-Time Analysis of Video Surveillance using Machine Learning a...IRJET -  	  Real-Time Analysis of Video Surveillance using Machine Learning a...
IRJET - Real-Time Analysis of Video Surveillance using Machine Learning a...
 
SSII2018企画: センシングデバイスの多様化と空間モデリングの未来
SSII2018企画: センシングデバイスの多様化と空間モデリングの未来SSII2018企画: センシングデバイスの多様化と空間モデリングの未来
SSII2018企画: センシングデバイスの多様化と空間モデリングの未来
 
H028038042
H028038042H028038042
H028038042
 

More from Artur Filipowicz

Incorporating Learning Strategies in Training of Deep Neural Networks for Au...
Incorporating Learning Strategies in Training of Deep Neural  Networks for Au...Incorporating Learning Strategies in Training of Deep Neural  Networks for Au...
Incorporating Learning Strategies in Training of Deep Neural Networks for Au...
Artur Filipowicz
 
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Artur Filipowicz
 
Video Games for Autonomous Driving
Video Games for Autonomous DrivingVideo Games for Autonomous Driving
Video Games for Autonomous Driving
Artur Filipowicz
 

More from Artur Filipowicz (10)

Smart Safety for Commercial Vehicles (ENG)
Smart Safety for Commercial Vehicles (ENG)Smart Safety for Commercial Vehicles (ENG)
Smart Safety for Commercial Vehicles (ENG)
 
Smart Safety for Commercial Vehicles (中文)
Smart Safety for Commercial Vehicles (中文)Smart Safety for Commercial Vehicles (中文)
Smart Safety for Commercial Vehicles (中文)
 
Incorporating Learning Strategies in Training of Deep Neural Networks for Au...
Incorporating Learning Strategies in Training of Deep Neural  Networks for Au...Incorporating Learning Strategies in Training of Deep Neural  Networks for Au...
Incorporating Learning Strategies in Training of Deep Neural Networks for Au...
 
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
 
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
Virtual Environments as Driving Schools for Deep Learning Vision-Based Sensor...
 
Direct Perception for Congestion Scene Detection Using TensorFlow
Direct Perception for Congestion Scene Detection Using TensorFlowDirect Perception for Congestion Scene Detection Using TensorFlow
Direct Perception for Congestion Scene Detection Using TensorFlow
 
Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand...
Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand...Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand...
Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand...
 
Filtering of Frequency Components for Privacy Preserving Facial Recognition
Filtering of Frequency Components for Privacy Preserving Facial RecognitionFiltering of Frequency Components for Privacy Preserving Facial Recognition
Filtering of Frequency Components for Privacy Preserving Facial Recognition
 
Desensitized RDCA Subspaces for Compressive Privacy in Machine Learning
Desensitized RDCA Subspaces for Compressive Privacy in Machine LearningDesensitized RDCA Subspaces for Compressive Privacy in Machine Learning
Desensitized RDCA Subspaces for Compressive Privacy in Machine Learning
 
Video Games for Autonomous Driving
Video Games for Autonomous DrivingVideo Games for Autonomous Driving
Video Games for Autonomous Driving
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand Theft Auto 5 (Poster)

  • 1. Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand Theft Auto 5 Artur Filipowicz, Jeremiah Liu, Alain Kornhauser Operations Research and Financial Engineering Princeton University The Problem Detect the presence of a stop sign and determine the distance to it based on an image. Related Works • Method employs both single-image and multi-view analysis for a 97% traffic sign classification rate (1). • A neural network trained to perform traffic sign classification on images obtains an accuracy of 95% (2). • (1) uses multiple views to locate 95% of signs within 3 m. of the real location at 2 images per second. • Algorithms in (3) can determine traffic sign position with error between 0.2 m and 1.6 m within the range of 7 m to 25 m from the stop sign. The Proposed Solution • Generate a dataset of synthetic autonomically labeled driving scenes using the video game Grand Theft Auto 5. • Train a convolutional neural network (CNN) with an architecture from (4) which runs at 10 images per second. Image: 𝑿∈ℝ Output: 𝒀﹦[Ⅱ , 𝒅 ] Network: ⨍(𝑿)→𝒀 𝒉∗𝒘∗𝒄 Stop Stop STOP ⅡStop 𝒅Stop Camera Vehicle Grand Theft Auto 5 • Video game with a rich road environment • Vehicles, pedestrians and animals • Road network of bridges, tunnels, freeways, and intersections • Urban, suburban, rural, desert and woodland environments • 14 weather conditions • Lighting conditions for 24 hours of the day Figure 1: Diagram of the problem. Figure 2: Driving scene in different weather and lighting conditions generated from Grand Theft Auto 5. Figure 3: A stop sign 10 meters away in Grand Theft Auto 5 (left) and the real world (right). • Using Grand Theft Auto 5, we’ve generated a dataset of 1.4 million images with and without stop signs in different light and weather conditions and locations. Result • The convolutional neural network is accurate on game images as well as on real world images. Table 1: Performance on images from Grand Theft Auto 5 Range Accuracy False Negative Rate False Positive Rate Mean AE (m) Median AE (m) Mean AE (m) when correct Median AE (m) when correct 0m - 10m 0.961 0.039 na 2.2 0.9 1.2 0.8 10m - 20m 0.949 0.051 na 3.3 1.7 2.4 1.6 20m - 30m 0.798 0.202 na 4.7 3.4 3.1 2.7 30m - 40m 0.440 0.560 na 3.4 2.6 3.1 2.1 > 40m 0.944 na 0.056 1.8 0.2 0.9 0.2 na = not applicable Table 2: Performance on 200 real world images. Range Accuracy False Negative Rate False Positive Rate Mean AE (m) Median AE (m) Mean AE (m) when correct Median AE (m) when correct 0m - 10m 1.000 0.000 na 9.1 8.9 9.1 8.9 10m - 20m 0.750 0.25 na 16.4 16.3 15.1 15.7 20m - 30m 0.968 0.032 na 10.0 10.9 9.8 10.8 30m - 40m 0.687 0.313 na 5.9 4.2 7.5 6.1 > 40m 1.000 na 0.000 0.9 0.4 0.9 0.4 na = not applicable Conclusions • Virtual environment enables creative data collection methods • CNN can detect 95.5% of the stops signs within 20 meters with an average error in distance of 1.2m to 2.4m on video game data • Need for research on real world adaptation • Need for research on optimal use of simulators References • [1] Radu Timofte, Karel Zimmermann, and Luc Van Gool. Multi-view traffic sign detection, recognition, and 3d localisation. Machine Vision and Applications, 25(3):633–647, 2014. • [2] Arturo De La Escalera, Luis E Moreno, Miguel Angel Salichs, and José María Armingol. Road traffic sign detection and classification. IEEE transactions on industrial electronics, 44(6):848–859, 1997. • [3] André Welzel, Andreas Auerswald, and Gerd Wanielik. Accurate camera-based traffic sign localization. In 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pages 445–450. IEEE, 2014. • [4] Chenyi Chen, Ari Seff, Alain Kornhauser, and Jianxiong Xiao. Deepdriving: Learning affordance for direct perception in autonomous driving. In Proceedings of the IEEE International Conference on Computer Vision, pages 2722–2730, 2015. Acknowledgment: We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Tesla K40 GPU used for this research.