SlideShare a Scribd company logo
Smart Traffic Lights that Learn !
Multi-Agent Reinforcement Learning Integrated Network
of Adaptive Traffic Signal Controllers
M A R L I N
Samah El-Tantawy, Ph.D. Post Doctoral Fellow, Dept of Civil Engineering
Baher Abdulhai, Ph.D., P.Eng. Director, ITS Centre and Testbed, Dept of Civil Engineering
Hossam Abdelgawad, Ph.D., P.Eng. Manager of ITS Centre and Testbed
ACGM 2013- Intelligent Transport for Smart Cities
Outline
2
1. In a Nutshell
2. Theory in Brief
Reinforcement Learning and Game Theory
3. Applications
City of Toronto Testbed
4. Hardware in the Loop Testing
Approach
Integration with PEEK ATC-1000
Next Steps
Q&A
In a Nutshell
3
 Grand objective
 Intersections "talk to each other",
 Each is affected by what is happening upstream
 Each affects what is happening downstream –
 Whole network control in one shot from a grand brain is the dream
 Issue
 Intractable theoretically,
 Too complex practically,
 Requires massive and very expensive communication
 Solution
 Decentralized,
 Self learning: agents learn to control their local intersection, and
 Game theory based: agents learn to collaborate
What is MARLIN?
4
 Artificial-intelligence-based control
software
 Enables traffic lights to self-learn
and self-collaborate with
neighbouring traffic lights
 Cuts down motorists’ delay, fuel
consumption and the negative
environmental effects of congestion
 Easier to operate (self learning)
 Less expensive communication if
even necessary (less costly)
MARLIN-ATSC: Level 4
Evolution of “Adaptive” Signal Control
Level 0
• Fixed-Time
and
Actuated
Control
• TRANSYT
• 1969, UK
Level 1
• Centralized
Control, Off-
line
Optimization
• SCATS
• 1979,
Australia
• >50
installations
worldwide
Level 2
• Centralized
Control, On-line
Optimization
• SCOOT
• 1981, UK
• >170
installations
worldwide
Level 3
• Distributed
Control, Model-
Based
• OPAC, RHODES
• 1992, USA
• 5 installations in
USA
Level 4
• Distributed
Self-Learning
Control
• MARLIN-ATSC
• 2011, Canada
5
Issues with Leading ATSC Technologies?
• Expensive
• Not scalable
• Not robust
Centralized
• Relying on an accurate traffic modelling
framework
• the accuracy of which is questionable
Model-Based
• Increasing the complexity of the system
exponentially with the increase in the number of
intersections/controllers
Curse of
Dimensionality
• Requiring highly skilled labour to operate due to
their complexity.
Human
Intervention
Requirements
6
Why is MARLIN Different?
7
MARLIN
Self-Learning
Decentralized
Model-Free
CoordinatedScalable
Pattern
Sensitive
Generic
Human Intervention Requirements
Centralized
Inefficient Coordination
Model-Based
Curse of Dimensionality
Prediction
Requirement
Specific Design
Learning the Control Law:
Reinforcement Learning Architecture
8
Environment
RL Architecture
Agent
State RewardAction
Goal: Optimal Control law = mapping between states and actions
)],(),(max[),(),( 111 kkkkk
a
kkkkkkk
asQasQrasQasQ  

),(maxarg 11
asQa kk
a
k 
 Balancing exploration and exploitation
Q a1 a2
s1 -10 -5
s2 -3 -15
Q Table
RL-based ATSC Architecture
RL Software
Agent
State
(Queue Lengths)
Reward
(Delay Savings)
Action
(Extend
/Switch)
Traffic Simulation
Environment
9
10
 Each agent plays a game with each adjacent
intersection in its neighborhood
I5 I6I4
I2 I3I1
I8 I9I7
I5 I6I4
I2 I3I1
I8 I9I7
I5 I6I4
I2 I3I1
I8 I9I7
Example for Intermediate Intersection
(4 Games )
Example for Edge Intersection
( 3 Games)
Example for Corner Intersection
( 2 Games)
MARLIN- ATSC: Coordination Principle
MARLIN-ATSC: (a) Independent Mode, (b) Integrated Mode
MARLIN-ATSC Available Modes
Queue Length 1
Delay 1Extend 1
Queue Length 2
Delay 2Extend 2
MARLIN-ATSC
Queue Length 1
Delay 1Extend 1
Queue Length 2
Delay 2Extend 2
(a)
(b)
11
Large-Scale Application
Network-Wide MOE in the Normal Scenario
12
System
MOE
BC
%
Improvments
MARL-TI Vs.
BC
%
Improvments
MARLIN-IC Vs.
BC
% Improvments
MARLIN-IC Vs.
MARL-TI
Average Intersection
Delay (sec/veh)
35.27 27% 38% 14%
Throughput (veh) 23084 3% 6% 3%
Avg Queue Length (veh) 8.66 24% 32% 11%
Std. Avg. Queue Length
(veh)
2.12 23% 31% 10%
Avg. Link Delay (sec) 9.45 10% 47% 41%
Large-Scale Application
% Improvement in Average Delay
13
MARLIN-IC vs BC
% Improvement
Area 1
Area 2
Area 3
Large-Scale Application
Average Route Travel Time for Selected Routes
14
0
1
2
3
4
5
6
7
8
1 2 3 4 5 6 7 8 9 10 11 12
AverageTravelTime(min)
Time Interval (5 min)
Gardiner EB
BC MARL-TI MARLIN-IC
Freeway
Large-Scale Application
Average Route Travel Time for Selected Routes
15
0
2
4
6
8
10
12
14
16
18
20
1 2 3 4 5 6 7 8 9 10 11 12
AverageTravelTime(min)
Time Interval (5 min)
LakeShore EB to Spadina NB
BC MARL-TI MARLIN-IC
MajorArterial
Controller Interface Device(CID)
RS485 to USB
Traffic Signal Controller
RS485 -
SDLC protocol
USB -
SDLC protocol
Industrial Computer
Ethernet -
NTCIP protocol
Paramics
Modeller
MARLIN-HILS Architecture
16
HILS Setup: Demo
17
Conclusion
18
 MARLIN state of the art gen4+
 Thoroughly developed and tested
 Patent Pending Status
 On going:
 HILS & PEEK ATC-1000 Integration
 Potential Field Operation Test
 Productization
 From TSP to People Priority (PSP)
Samah El-Tantawy samah.el.tantawy@utoronto.ca
Baher Abdulhai baher.abdulhai@utoronto.ca
Hossam Abdelgawad h.abdel.gawad@utoronto.ca
ACGM 2013- Intelligent Transport for Smart Cities
Smart Traffic Lights that Learn !
Multi-Agent Reinforcement Learning Integrated Network
of Adaptive Traffic Signal Controllers
M A R L I N
Samah ElTantawy, Ph.D. Post Doctoral Fellow, Dept of Civil Engineering
Baher Abdulhai, Ph.D., P.Eng. Director, ITS Centre and Testbed, Dept of Civil Engineering
Hossam Abdelgawad, Ph.D., P.Eng. Manager of ITS Centre and Testbed
ACGM 2013- Intelligent Transport for Smart Cities

More Related Content

What's hot

Smart City: Many Applications and Devices
Smart City: Many Applications and DevicesSmart City: Many Applications and Devices
Smart City: Many Applications and Devices
Eurotech
 
IoT Healthcare
IoT HealthcareIoT Healthcare
IoT Healthcare
neha kumari
 
Introduction to IOT & Smart City
Introduction to IOT & Smart CityIntroduction to IOT & Smart City
Introduction to IOT & Smart City
Dr. Mazlan Abbas
 
Neural style transfer
Neural style transferNeural style transfer
Neural style transfer
Keshan Sodimana
 
Image Restoration for 3D Computer Vision
Image Restoration for 3D Computer VisionImage Restoration for 3D Computer Vision
Image Restoration for 3D Computer Vision
PetteriTeikariPhD
 
The Role of Internet-of-Things (IoT) in Healthcare
The Role of Internet-of-Things (IoT) in HealthcareThe Role of Internet-of-Things (IoT) in Healthcare
The Role of Internet-of-Things (IoT) in Healthcare
Luís Rita
 
IOT in SMART Cities
IOT in SMART CitiesIOT in SMART Cities
IOT in SMART Cities
Dr. Shivananda Koteshwar
 
1.arithmetic & logical operations
1.arithmetic & logical operations1.arithmetic & logical operations
1.arithmetic & logical operations
mukesh bhardwaj
 
fog computing ppt
fog computing ppt fog computing ppt
fog computing ppt
sravya raju
 
Data Augmentation
Data AugmentationData Augmentation
Data Augmentation
Md Tajul Islam
 
Home automation using IoT
Home automation using IoTHome automation using IoT
Home automation using IoT
Athira_1993
 
Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120
Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120
Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120
TELKOMNIKA JOURNAL
 
Attendance Management System using Face Recognition
Attendance Management System using Face RecognitionAttendance Management System using Face Recognition
Attendance Management System using Face Recognition
NanditaDutta4
 
lecun-01.ppt
lecun-01.pptlecun-01.ppt
lecun-01.ppt
VenkyChinna8
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep Learning
Wei Yang
 
Deep learning-for-pose-estimation-wyang-defense
Deep learning-for-pose-estimation-wyang-defenseDeep learning-for-pose-estimation-wyang-defense
Deep learning-for-pose-estimation-wyang-defense
Wei Yang
 
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET Journal
 
Smart Cities, Smart Cars, Smart Living
Smart Cities, Smart Cars, Smart LivingSmart Cities, Smart Cars, Smart Living
Smart Cities, Smart Cars, Smart Living
Marie-Paule Odini
 
Edge computing
Edge computingEdge computing
Edge computing
AbhayDhupar
 
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
Amazon Web Services
 

What's hot (20)

Smart City: Many Applications and Devices
Smart City: Many Applications and DevicesSmart City: Many Applications and Devices
Smart City: Many Applications and Devices
 
IoT Healthcare
IoT HealthcareIoT Healthcare
IoT Healthcare
 
Introduction to IOT & Smart City
Introduction to IOT & Smart CityIntroduction to IOT & Smart City
Introduction to IOT & Smart City
 
Neural style transfer
Neural style transferNeural style transfer
Neural style transfer
 
Image Restoration for 3D Computer Vision
Image Restoration for 3D Computer VisionImage Restoration for 3D Computer Vision
Image Restoration for 3D Computer Vision
 
The Role of Internet-of-Things (IoT) in Healthcare
The Role of Internet-of-Things (IoT) in HealthcareThe Role of Internet-of-Things (IoT) in Healthcare
The Role of Internet-of-Things (IoT) in Healthcare
 
IOT in SMART Cities
IOT in SMART CitiesIOT in SMART Cities
IOT in SMART Cities
 
1.arithmetic & logical operations
1.arithmetic & logical operations1.arithmetic & logical operations
1.arithmetic & logical operations
 
fog computing ppt
fog computing ppt fog computing ppt
fog computing ppt
 
Data Augmentation
Data AugmentationData Augmentation
Data Augmentation
 
Home automation using IoT
Home automation using IoTHome automation using IoT
Home automation using IoT
 
Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120
Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120
Glasses for the blind using ping ultrasonic, ATMEGA8535 and ISD25120
 
Attendance Management System using Face Recognition
Attendance Management System using Face RecognitionAttendance Management System using Face Recognition
Attendance Management System using Face Recognition
 
lecun-01.ppt
lecun-01.pptlecun-01.ppt
lecun-01.ppt
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep Learning
 
Deep learning-for-pose-estimation-wyang-defense
Deep learning-for-pose-estimation-wyang-defenseDeep learning-for-pose-estimation-wyang-defense
Deep learning-for-pose-estimation-wyang-defense
 
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning Algorithm
 
Smart Cities, Smart Cars, Smart Living
Smart Cities, Smart Cars, Smart LivingSmart Cities, Smart Cars, Smart Living
Smart Cities, Smart Cars, Smart Living
 
Edge computing
Edge computingEdge computing
Edge computing
 
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
 

Viewers also liked

Smart Mobility by Optimizing the Traffic Lights: A new Tool for Traffic Cont...
Smart Mobility by Optimizing the Traffic Lights:  A new Tool for Traffic Cont...Smart Mobility by Optimizing the Traffic Lights:  A new Tool for Traffic Cont...
Smart Mobility by Optimizing the Traffic Lights: A new Tool for Traffic Cont...
Yesnier Bravo
 
Dream jabalpur
Dream jabalpurDream jabalpur
Dream jabalpur
Sakshi Goswami
 
Ijarcet vol-2-issue-7-2292-2296
Ijarcet vol-2-issue-7-2292-2296Ijarcet vol-2-issue-7-2292-2296
Ijarcet vol-2-issue-7-2292-2296Editor IJARCET
 
Smart Traffic Congestion Control System
Smart Traffic Congestion Control SystemSmart Traffic Congestion Control System
Smart Traffic Congestion Control System
Muzzamil Shaikh
 
Antony Johnstone
Antony JohnstoneAntony Johnstone
Antony Johnstone
JumpingJaq
 
The Future Development of Traffic Signals and the Impact of Autonomous Vehicles
The Future Development of Traffic Signals and the Impact of Autonomous VehiclesThe Future Development of Traffic Signals and the Impact of Autonomous Vehicles
The Future Development of Traffic Signals and the Impact of Autonomous Vehicles
JumpingJaq
 
Smart city presentation
Smart city presentationSmart city presentation
Smart city presentation
Neeraj Mandloi IAS
 
Traffic Signals Safety and Efficiency Project – Stage Two Implementation
Traffic Signals Safety and Efficiency Project – Stage Two ImplementationTraffic Signals Safety and Efficiency Project – Stage Two Implementation
Traffic Signals Safety and Efficiency Project – Stage Two Implementation
JumpingJaq
 
Suzhou bridges
Suzhou bridgesSuzhou bridges
Suzhou bridges
Pei Zhenquan
 
Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...
Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...
Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...
Daniel H. Stolfi
 
Smartcity Evolution
Smartcity Evolution Smartcity Evolution
Smartcity Evolution
Deepal Bhatnagar
 
Intelligent Transportation System
Intelligent Transportation SystemIntelligent Transportation System
Intelligent Transportation System
Jin-Hyeok Yang
 
Smart traffic management system
Smart traffic management systemSmart traffic management system
Smart traffic management systemChirag Dalal
 
Traffic signal
Traffic signalTraffic signal
Traffic signal
aloknitb
 
Smart Traffic Light Controller
Smart Traffic Light ControllerSmart Traffic Light Controller
Smart Traffic Light Controller
Himanshi_Sharma
 
Workshop 4 5 fei nhan dien nha tai tro tiem nang
Workshop 4 5 fei nhan dien nha tai tro tiem nangWorkshop 4 5 fei nhan dien nha tai tro tiem nang
Workshop 4 5 fei nhan dien nha tai tro tiem nangHaphan116
 

Viewers also liked (20)

SCADA Security in CDIC 2009
SCADA Security in CDIC 2009SCADA Security in CDIC 2009
SCADA Security in CDIC 2009
 
Smart Mobility by Optimizing the Traffic Lights: A new Tool for Traffic Cont...
Smart Mobility by Optimizing the Traffic Lights:  A new Tool for Traffic Cont...Smart Mobility by Optimizing the Traffic Lights:  A new Tool for Traffic Cont...
Smart Mobility by Optimizing the Traffic Lights: A new Tool for Traffic Cont...
 
Dream jabalpur
Dream jabalpurDream jabalpur
Dream jabalpur
 
Ijarcet vol-2-issue-7-2292-2296
Ijarcet vol-2-issue-7-2292-2296Ijarcet vol-2-issue-7-2292-2296
Ijarcet vol-2-issue-7-2292-2296
 
Smart Traffic Congestion Control System
Smart Traffic Congestion Control SystemSmart Traffic Congestion Control System
Smart Traffic Congestion Control System
 
Antony Johnstone
Antony JohnstoneAntony Johnstone
Antony Johnstone
 
The Future Development of Traffic Signals and the Impact of Autonomous Vehicles
The Future Development of Traffic Signals and the Impact of Autonomous VehiclesThe Future Development of Traffic Signals and the Impact of Autonomous Vehicles
The Future Development of Traffic Signals and the Impact of Autonomous Vehicles
 
Smart city presentation
Smart city presentationSmart city presentation
Smart city presentation
 
Adaptive Signals 101
Adaptive Signals 101Adaptive Signals 101
Adaptive Signals 101
 
Traffic Signals Safety and Efficiency Project – Stage Two Implementation
Traffic Signals Safety and Efficiency Project – Stage Two ImplementationTraffic Signals Safety and Efficiency Project – Stage Two Implementation
Traffic Signals Safety and Efficiency Project – Stage Two Implementation
 
Adaptive Signals 101
Adaptive Signals 101Adaptive Signals 101
Adaptive Signals 101
 
Suzhou bridges
Suzhou bridgesSuzhou bridges
Suzhou bridges
 
Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...
Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...
Un Algoritmo Evolutivo para la Reducción de Tiempos de Viaje y Emisiones Util...
 
Smartcity Evolution
Smartcity Evolution Smartcity Evolution
Smartcity Evolution
 
Intelligent Transportation System
Intelligent Transportation SystemIntelligent Transportation System
Intelligent Transportation System
 
Smart traffic management system
Smart traffic management systemSmart traffic management system
Smart traffic management system
 
Traffic signal
Traffic signalTraffic signal
Traffic signal
 
Urban traffic management system assignment 2
Urban traffic management system assignment 2Urban traffic management system assignment 2
Urban traffic management system assignment 2
 
Smart Traffic Light Controller
Smart Traffic Light ControllerSmart Traffic Light Controller
Smart Traffic Light Controller
 
Workshop 4 5 fei nhan dien nha tai tro tiem nang
Workshop 4 5 fei nhan dien nha tai tro tiem nangWorkshop 4 5 fei nhan dien nha tai tro tiem nang
Workshop 4 5 fei nhan dien nha tai tro tiem nang
 

Similar to Smart Traffic Lights that Learn ! Multi-Agent Reinforcement Learning Integrated Network of Adaptive Traffic Signal Controllers

Report on Enviorment Panel Monitoring
Report on Enviorment Panel MonitoringReport on Enviorment Panel Monitoring
Report on Enviorment Panel Monitoring
Mohammed Irshad S K
 
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing AlgorithmHigh Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
IJSRD
 
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing AlgorithmHigh Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
IJSRD
 
Lan network with Redundancy
Lan network with RedundancyLan network with Redundancy
Lan network with Redundancy
Santanu Mukhopadhyay
 
Embedded system for traffic light control
Embedded system for traffic light controlEmbedded system for traffic light control
Embedded system for traffic light control
Madhu Prasad
 
VANETS Vehicular Adhoc NETworkS
VANETS Vehicular Adhoc NETworkSVANETS Vehicular Adhoc NETworkS
VANETS Vehicular Adhoc NETworkS
Sridhar Raghavan
 
Network Planning & Design: An Art or a Science?
Network Planning & Design: An Art or a Science?Network Planning & Design: An Art or a Science?
Network Planning & Design: An Art or a Science?
Vishal Sharma, Ph.D.
 
Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)	Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)
Benyamin Moadab
 
Carrier Strategies for Backbone Traffic Engineering and QoS
Carrier Strategies for Backbone Traffic Engineering and QoSCarrier Strategies for Backbone Traffic Engineering and QoS
Carrier Strategies for Backbone Traffic Engineering and QoS
Vishal Sharma, Ph.D.
 
Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...
Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...
Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...
ADVA
 
Performance Evolution of LTE Cellular Technology
Performance Evolution of LTE Cellular TechnologyPerformance Evolution of LTE Cellular Technology
Performance Evolution of LTE Cellular Technology
IRJET Journal
 
Profile tulasi v1.1
Profile tulasi v1.1Profile tulasi v1.1
Profile tulasi v1.1
Sivanesan Tulasidas
 
Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...
Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...
Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...IAEME Publication
 
Critical Information Infrastructure Systems Worldwide
Critical Information Infrastructure Systems WorldwideCritical Information Infrastructure Systems Worldwide
Critical Information Infrastructure Systems Worldwide
Angela Hays
 
Wang lab introduction_1204_2018
Wang lab introduction_1204_2018Wang lab introduction_1204_2018
Wang lab introduction_1204_2018
俊廷 陳
 
Arpan pal ncccs
Arpan pal ncccsArpan pal ncccs
Arpan pal ncccsArpan Pal
 
Preprint-IC3I2022 - 14-16 Dec 2022.pdf
Preprint-IC3I2022 - 14-16 Dec 2022.pdfPreprint-IC3I2022 - 14-16 Dec 2022.pdf
Preprint-IC3I2022 - 14-16 Dec 2022.pdf
Christo Ananth
 
Internet of Things - structured approach to the physical plant network - Rock...
Internet of Things - structured approach to the physical plant network - Rock...Internet of Things - structured approach to the physical plant network - Rock...
Internet of Things - structured approach to the physical plant network - Rock...
Carotek
 
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
cscpconf
 
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
cscpconf
 

Similar to Smart Traffic Lights that Learn ! Multi-Agent Reinforcement Learning Integrated Network of Adaptive Traffic Signal Controllers (20)

Report on Enviorment Panel Monitoring
Report on Enviorment Panel MonitoringReport on Enviorment Panel Monitoring
Report on Enviorment Panel Monitoring
 
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing AlgorithmHigh Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
 
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing AlgorithmHigh Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
High Fault Coverage For On Chip Network Using Priority Based Routing Algorithm
 
Lan network with Redundancy
Lan network with RedundancyLan network with Redundancy
Lan network with Redundancy
 
Embedded system for traffic light control
Embedded system for traffic light controlEmbedded system for traffic light control
Embedded system for traffic light control
 
VANETS Vehicular Adhoc NETworkS
VANETS Vehicular Adhoc NETworkSVANETS Vehicular Adhoc NETworkS
VANETS Vehicular Adhoc NETworkS
 
Network Planning & Design: An Art or a Science?
Network Planning & Design: An Art or a Science?Network Planning & Design: An Art or a Science?
Network Planning & Design: An Art or a Science?
 
Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)	Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)
 
Carrier Strategies for Backbone Traffic Engineering and QoS
Carrier Strategies for Backbone Traffic Engineering and QoSCarrier Strategies for Backbone Traffic Engineering and QoS
Carrier Strategies for Backbone Traffic Engineering and QoS
 
Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...
Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...
Orchestration of Ethernet Services in Software-Defined and Flexible Heterogen...
 
Performance Evolution of LTE Cellular Technology
Performance Evolution of LTE Cellular TechnologyPerformance Evolution of LTE Cellular Technology
Performance Evolution of LTE Cellular Technology
 
Profile tulasi v1.1
Profile tulasi v1.1Profile tulasi v1.1
Profile tulasi v1.1
 
Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...
Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...
Dynamic cluster based adaptive gateway discovery mechanisms in an integrated ...
 
Critical Information Infrastructure Systems Worldwide
Critical Information Infrastructure Systems WorldwideCritical Information Infrastructure Systems Worldwide
Critical Information Infrastructure Systems Worldwide
 
Wang lab introduction_1204_2018
Wang lab introduction_1204_2018Wang lab introduction_1204_2018
Wang lab introduction_1204_2018
 
Arpan pal ncccs
Arpan pal ncccsArpan pal ncccs
Arpan pal ncccs
 
Preprint-IC3I2022 - 14-16 Dec 2022.pdf
Preprint-IC3I2022 - 14-16 Dec 2022.pdfPreprint-IC3I2022 - 14-16 Dec 2022.pdf
Preprint-IC3I2022 - 14-16 Dec 2022.pdf
 
Internet of Things - structured approach to the physical plant network - Rock...
Internet of Things - structured approach to the physical plant network - Rock...Internet of Things - structured approach to the physical plant network - Rock...
Internet of Things - structured approach to the physical plant network - Rock...
 
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
 
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
GLOBAL FRONTRUNNER ROUTING ALGORITHM (GFRA) FOR V2V COMMUNICATION IN VANETS
 

Recently uploaded

Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 

Recently uploaded (20)

Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 

Smart Traffic Lights that Learn ! Multi-Agent Reinforcement Learning Integrated Network of Adaptive Traffic Signal Controllers

  • 1. Smart Traffic Lights that Learn ! Multi-Agent Reinforcement Learning Integrated Network of Adaptive Traffic Signal Controllers M A R L I N Samah El-Tantawy, Ph.D. Post Doctoral Fellow, Dept of Civil Engineering Baher Abdulhai, Ph.D., P.Eng. Director, ITS Centre and Testbed, Dept of Civil Engineering Hossam Abdelgawad, Ph.D., P.Eng. Manager of ITS Centre and Testbed ACGM 2013- Intelligent Transport for Smart Cities
  • 2. Outline 2 1. In a Nutshell 2. Theory in Brief Reinforcement Learning and Game Theory 3. Applications City of Toronto Testbed 4. Hardware in the Loop Testing Approach Integration with PEEK ATC-1000 Next Steps Q&A
  • 3. In a Nutshell 3  Grand objective  Intersections "talk to each other",  Each is affected by what is happening upstream  Each affects what is happening downstream –  Whole network control in one shot from a grand brain is the dream  Issue  Intractable theoretically,  Too complex practically,  Requires massive and very expensive communication  Solution  Decentralized,  Self learning: agents learn to control their local intersection, and  Game theory based: agents learn to collaborate
  • 4. What is MARLIN? 4  Artificial-intelligence-based control software  Enables traffic lights to self-learn and self-collaborate with neighbouring traffic lights  Cuts down motorists’ delay, fuel consumption and the negative environmental effects of congestion  Easier to operate (self learning)  Less expensive communication if even necessary (less costly)
  • 5. MARLIN-ATSC: Level 4 Evolution of “Adaptive” Signal Control Level 0 • Fixed-Time and Actuated Control • TRANSYT • 1969, UK Level 1 • Centralized Control, Off- line Optimization • SCATS • 1979, Australia • >50 installations worldwide Level 2 • Centralized Control, On-line Optimization • SCOOT • 1981, UK • >170 installations worldwide Level 3 • Distributed Control, Model- Based • OPAC, RHODES • 1992, USA • 5 installations in USA Level 4 • Distributed Self-Learning Control • MARLIN-ATSC • 2011, Canada 5
  • 6. Issues with Leading ATSC Technologies? • Expensive • Not scalable • Not robust Centralized • Relying on an accurate traffic modelling framework • the accuracy of which is questionable Model-Based • Increasing the complexity of the system exponentially with the increase in the number of intersections/controllers Curse of Dimensionality • Requiring highly skilled labour to operate due to their complexity. Human Intervention Requirements 6
  • 7. Why is MARLIN Different? 7 MARLIN Self-Learning Decentralized Model-Free CoordinatedScalable Pattern Sensitive Generic Human Intervention Requirements Centralized Inefficient Coordination Model-Based Curse of Dimensionality Prediction Requirement Specific Design
  • 8. Learning the Control Law: Reinforcement Learning Architecture 8 Environment RL Architecture Agent State RewardAction Goal: Optimal Control law = mapping between states and actions )],(),(max[),(),( 111 kkkkk a kkkkkkk asQasQrasQasQ    ),(maxarg 11 asQa kk a k   Balancing exploration and exploitation Q a1 a2 s1 -10 -5 s2 -3 -15 Q Table
  • 9. RL-based ATSC Architecture RL Software Agent State (Queue Lengths) Reward (Delay Savings) Action (Extend /Switch) Traffic Simulation Environment 9
  • 10. 10  Each agent plays a game with each adjacent intersection in its neighborhood I5 I6I4 I2 I3I1 I8 I9I7 I5 I6I4 I2 I3I1 I8 I9I7 I5 I6I4 I2 I3I1 I8 I9I7 Example for Intermediate Intersection (4 Games ) Example for Edge Intersection ( 3 Games) Example for Corner Intersection ( 2 Games) MARLIN- ATSC: Coordination Principle
  • 11. MARLIN-ATSC: (a) Independent Mode, (b) Integrated Mode MARLIN-ATSC Available Modes Queue Length 1 Delay 1Extend 1 Queue Length 2 Delay 2Extend 2 MARLIN-ATSC Queue Length 1 Delay 1Extend 1 Queue Length 2 Delay 2Extend 2 (a) (b) 11
  • 12. Large-Scale Application Network-Wide MOE in the Normal Scenario 12 System MOE BC % Improvments MARL-TI Vs. BC % Improvments MARLIN-IC Vs. BC % Improvments MARLIN-IC Vs. MARL-TI Average Intersection Delay (sec/veh) 35.27 27% 38% 14% Throughput (veh) 23084 3% 6% 3% Avg Queue Length (veh) 8.66 24% 32% 11% Std. Avg. Queue Length (veh) 2.12 23% 31% 10% Avg. Link Delay (sec) 9.45 10% 47% 41%
  • 13. Large-Scale Application % Improvement in Average Delay 13 MARLIN-IC vs BC % Improvement Area 1 Area 2 Area 3
  • 14. Large-Scale Application Average Route Travel Time for Selected Routes 14 0 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8 9 10 11 12 AverageTravelTime(min) Time Interval (5 min) Gardiner EB BC MARL-TI MARLIN-IC Freeway
  • 15. Large-Scale Application Average Route Travel Time for Selected Routes 15 0 2 4 6 8 10 12 14 16 18 20 1 2 3 4 5 6 7 8 9 10 11 12 AverageTravelTime(min) Time Interval (5 min) LakeShore EB to Spadina NB BC MARL-TI MARLIN-IC MajorArterial
  • 16. Controller Interface Device(CID) RS485 to USB Traffic Signal Controller RS485 - SDLC protocol USB - SDLC protocol Industrial Computer Ethernet - NTCIP protocol Paramics Modeller MARLIN-HILS Architecture 16
  • 18. Conclusion 18  MARLIN state of the art gen4+  Thoroughly developed and tested  Patent Pending Status  On going:  HILS & PEEK ATC-1000 Integration  Potential Field Operation Test  Productization  From TSP to People Priority (PSP)
  • 19. Samah El-Tantawy samah.el.tantawy@utoronto.ca Baher Abdulhai baher.abdulhai@utoronto.ca Hossam Abdelgawad h.abdel.gawad@utoronto.ca ACGM 2013- Intelligent Transport for Smart Cities
  • 20. Smart Traffic Lights that Learn ! Multi-Agent Reinforcement Learning Integrated Network of Adaptive Traffic Signal Controllers M A R L I N Samah ElTantawy, Ph.D. Post Doctoral Fellow, Dept of Civil Engineering Baher Abdulhai, Ph.D., P.Eng. Director, ITS Centre and Testbed, Dept of Civil Engineering Hossam Abdelgawad, Ph.D., P.Eng. Manager of ITS Centre and Testbed ACGM 2013- Intelligent Transport for Smart Cities