SlideShare a Scribd company logo
1 of 13
Download to read offline
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
ICWT 2022 The 8th International
Conference on Wireless and
Telematics
Yogyakarta, Indonesia | 21-22 July 2022
Mobile Robot Navigation in
Dynamic Environments using
Reinforcement Learning
Fauzi Abdul Rohim*, Sony Sumaryo*, Syamsul Rizal*, Eki Ahmad Zaki
Hamidi**
(*electrical engineering Telkom University Bandung,
**electrical engineering UIN Sunan Gunung Djati Bandung)
083923, Fauzi , Mobile Robot Navigation
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
2
Outline
• I INTRODUCTION
• II METHOD
• II.1 Behavior Based Robot (BBR)
• II.2 Reinforcement Learning
• II.2 Q-Learning
• II.3 Design of Q-Learning Algorithm
• III EXPERIMENTAL RESULT AND DISCUSSION
• IV CONCLUSION
• QnA
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
3
INTRODUCTION
• Data from robot sensors can be mapped and used by robots for navigation
and movement planning.
• All modules that represent one behaviors work together [1].
• The main attention is paid to two approaches to coordination mechanisms,
namely competitive/arbiter and cooperative/command fusion.
• A mobile robot must have high navigation abilities before it can perform
other jobs such as carrying items or performing mapping activities.
• The navigation system’s use of non-learning algorithms allows the mobile
robot to plan and follow a path to its destination, but the algorithm isn’t
good enough to provide local planning skills in an unfamiliar environment,
where the robot is still stuck in the implementation area.
• One of the most powerful methods for solving navigating issues in
complicated and unknown surroundings is machine learning.
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
4
INTRODUCTION
• One of the most suitable and widely used RL learning methods for
autonomous robot applications is RL with the Q-Learning algorithm
type.
• Therefore, to extend the Q- Learning algorithm related to continuous
state and action, combines Q-Learning.
• To use reinforcement study, several approaches are possible.
• The first consists of manually discrete issues to obtain state and action
space; which can be used directly by the algorithm using table Q [3].
• The results obtained show a substantial improvement of the robot’s
behavior and learning speed.
• Q-Learning is a representation of a machine learning algorithm based on
a driven learning process, in which an agent learns and discovers via
experience the pattern of action that will provide him with the best
longterm reward.
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
5
II.1 Behavior-based Robot (BBR)
• The robot will be controlled directly by a
low level controller (P controller) which
functions to control the movement of the
robot by sending perceptual signals
(sensors) to the high level controller.
• Then the signal becomes a stimulus input
that will activate certain behaviors
(behavior coordination section).
• The output of the behavior coordination is
given to the low level controller for
controlling the robot.
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
6
II.2 Reinforcement Learning
- Reinforcement learning is a kind of middle
ground between supervised learning and
unsupervised learning, where the algorithm is in
a way told only how well it works, not knowing
what the exact output of the system should be.
- Q-Learning is competitive learning
where there are agents with the
ability to learn to act optimally by
evaluating the consequences of
their actions.
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
7
II.3 Design of Q-Learning Algorithm
1) Discretization of state space and action:
• In order to improve the control system based on the
closed loop controller, the artificial intelligence
algorithm described, the Q learning algorithm, will
also be designed.
2) Defining the Reward Function:
• Defining the Reward Function: The next step in
design- ing the algorithm is to define the reward
function needed to determine the Q-value and fill in
the given Q-table.
3) Policy Determination:
• Policy Determination: To select the best action, the
Q- Learning algorithm requires a search strategy to
determine the actions taken by the agent as a
function of the agent’s state and environment, which
is called Policy ).
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
8
IV EXPERIMENTAL RESULT AND
DISCUSSION
The training process is carried out in 200
epochs, where the maximum number of agent
actions during one epoch is 300.
Since the final training process will involve a
larger number of epochs and a greater number
of steps per epoch, the optimal value of the
parameters will be chosen = 0.5 to ensure safer
algorithm convergence in the long run.
Finally, the final process of training agents in a simulated environment is
carried out with the parameters and hyperparameters selected based on the
previous analysis, but this time the training is extended to a larger number of
epochs and a greater number of steps per epoch.
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
9
EXPERIMENTAL RESULT AND
DISCUSSION
083923, Fauzi, Mobile Robot Navigation,
Fig. 4. Average number of agent steps through epochs for a
parameter ,Accumulated agent reward through epochs for a
parameter, Accumulated agent reward through epochs for a
parameter,Average number of agent steps through epochs for
a parameter
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
10
083923, Fauzi, Mobile Robot Navigation,
Fig. 6. Parameters of epoch search strategies,
Accumulated agent reward through epochs for
different search strategies, The average number
of agent steps through epochs for different search
strategies
Fig. 5. Accumulated agent reward through epochs, final
training, Average number of agent steps through epochs,
final training, Q-table at the end of the training algorithm
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
11
CONCLUSION
• After several simulations, the training resulted in convergence and the
final Q-table that will be used in the implementation of the algorithm.
• the application of artificial intelligence and machine learning algorithms
in the field of mobile robotics and autonomous driving.
• One of the most popular incentive learning algorithms has been
implemented, the Q-learning algorithm, which is based on agent
learning from experience.
• a conventional control algorithm based on a closed loop controller
which ensures that the robot reaches a certain position and orientation.
• The controller is implemented and tested in a simulated environment,
where its parameters are adjusted.
• After several simulations, the training resulted in convergence and the
final Q-table that will be used in the implementation of the algorithm.
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
12
QnA
• Q :
• A :
083923, Fauzi, Mobile Robot Navigation,
ICWT 2022
The 8th
International Conference on
Wireless and Telematics
Yogyakarta, Indonesia | 21-22 July 2022
13
End of Presentation
Thank You
083923, Fauzi, Mobile Robot Navigation,

More Related Content

Similar to mobilerobot_icwt [Autosaved].pdf

Student Attendance Management System by Using Fingerprint Reader
Student Attendance Management System by Using Fingerprint ReaderStudent Attendance Management System by Using Fingerprint Reader
Student Attendance Management System by Using Fingerprint ReaderIRJET Journal
 
Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...
Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...
Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...ShwetonKedia
 
Module 5_BETCK105H_Introduction to IoT.pdf
Module 5_BETCK105H_Introduction to IoT.pdfModule 5_BETCK105H_Introduction to IoT.pdf
Module 5_BETCK105H_Introduction to IoT.pdfRoopaDNDandally
 
LEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORK
LEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORKLEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORK
LEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORKcscpconf
 
Map-Based Attendance Monitoring System
Map-Based Attendance Monitoring SystemMap-Based Attendance Monitoring System
Map-Based Attendance Monitoring SystemIRJET Journal
 
Object and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually ImpairedObject and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually ImpairedIRJET Journal
 
AN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEM
AN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEMAN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEM
AN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEMIRJET Journal
 
Efficient and secure real-time mobile robots cooperation using visual servoing
Efficient and secure real-time mobile robots cooperation using visual servoing Efficient and secure real-time mobile robots cooperation using visual servoing
Efficient and secure real-time mobile robots cooperation using visual servoing IJECEIAES
 
Pothole Detection Using ML and DL Algorithms
Pothole Detection Using ML and DL AlgorithmsPothole Detection Using ML and DL Algorithms
Pothole Detection Using ML and DL AlgorithmsIRJET Journal
 
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETIRJET Journal
 
Internet of Things: Current Trends & Future Directions in Malaysia
Internet of Things: Current Trends & Future Directions in MalaysiaInternet of Things: Current Trends & Future Directions in Malaysia
Internet of Things: Current Trends & Future Directions in MalaysiaRosdiadee Nordin
 
Arduino Based Hand Gesture Controlled Robot
Arduino Based Hand Gesture Controlled RobotArduino Based Hand Gesture Controlled Robot
Arduino Based Hand Gesture Controlled RobotIRJET Journal
 
IRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android App
IRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android AppIRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android App
IRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android AppIRJET Journal
 
Machine Learning in IOT
Machine Learning in IOTMachine Learning in IOT
Machine Learning in IOTArikumarKS2
 
IRJET- A System for Indoor Navigation: GuideIt - Personal Assistant
IRJET-  	  A System for Indoor Navigation: GuideIt - Personal AssistantIRJET-  	  A System for Indoor Navigation: GuideIt - Personal Assistant
IRJET- A System for Indoor Navigation: GuideIt - Personal AssistantIRJET Journal
 
POTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHM
POTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHMPOTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHM
POTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHMIRJET Journal
 
IRJET- IoT based Classroom Automation System
IRJET-  	  IoT based Classroom Automation SystemIRJET-  	  IoT based Classroom Automation System
IRJET- IoT based Classroom Automation SystemIRJET Journal
 
Software engineering model based smart indoor localization system using deep-...
Software engineering model based smart indoor localization system using deep-...Software engineering model based smart indoor localization system using deep-...
Software engineering model based smart indoor localization system using deep-...TELKOMNIKA JOURNAL
 

Similar to mobilerobot_icwt [Autosaved].pdf (20)

Student Attendance Management System by Using Fingerprint Reader
Student Attendance Management System by Using Fingerprint ReaderStudent Attendance Management System by Using Fingerprint Reader
Student Attendance Management System by Using Fingerprint Reader
 
Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...
Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...
Autonomous Campus Tour Guide Robot by using Ultrasonic Range Sensors and QR c...
 
Module 5_BETCK105H_Introduction to IoT.pdf
Module 5_BETCK105H_Introduction to IoT.pdfModule 5_BETCK105H_Introduction to IoT.pdf
Module 5_BETCK105H_Introduction to IoT.pdf
 
LEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORK
LEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORKLEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORK
LEARNING OF ROBOT NAVIGATION TASKS BY PROBABILISTIC NEURAL NETWORK
 
Map-Based Attendance Monitoring System
Map-Based Attendance Monitoring SystemMap-Based Attendance Monitoring System
Map-Based Attendance Monitoring System
 
review_ppt.pptx
review_ppt.pptxreview_ppt.pptx
review_ppt.pptx
 
Object and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually ImpairedObject and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually Impaired
 
AN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEM
AN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEMAN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEM
AN ANDROID APPLICATION FOR CAMPUS INFORMATION SYSTEM
 
Efficient and secure real-time mobile robots cooperation using visual servoing
Efficient and secure real-time mobile robots cooperation using visual servoing Efficient and secure real-time mobile robots cooperation using visual servoing
Efficient and secure real-time mobile robots cooperation using visual servoing
 
Pothole Detection Using ML and DL Algorithms
Pothole Detection Using ML and DL AlgorithmsPothole Detection Using ML and DL Algorithms
Pothole Detection Using ML and DL Algorithms
 
Report
ReportReport
Report
 
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
 
Internet of Things: Current Trends & Future Directions in Malaysia
Internet of Things: Current Trends & Future Directions in MalaysiaInternet of Things: Current Trends & Future Directions in Malaysia
Internet of Things: Current Trends & Future Directions in Malaysia
 
Arduino Based Hand Gesture Controlled Robot
Arduino Based Hand Gesture Controlled RobotArduino Based Hand Gesture Controlled Robot
Arduino Based Hand Gesture Controlled Robot
 
IRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android App
IRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android AppIRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android App
IRJET- Smart IoT based Bridge Monitoring and Damage Detection using Android App
 
Machine Learning in IOT
Machine Learning in IOTMachine Learning in IOT
Machine Learning in IOT
 
IRJET- A System for Indoor Navigation: GuideIt - Personal Assistant
IRJET-  	  A System for Indoor Navigation: GuideIt - Personal AssistantIRJET-  	  A System for Indoor Navigation: GuideIt - Personal Assistant
IRJET- A System for Indoor Navigation: GuideIt - Personal Assistant
 
POTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHM
POTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHMPOTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHM
POTHOLE DETECTION SYSTEM USING YOLO v4 ALGORITHM
 
IRJET- IoT based Classroom Automation System
IRJET-  	  IoT based Classroom Automation SystemIRJET-  	  IoT based Classroom Automation System
IRJET- IoT based Classroom Automation System
 
Software engineering model based smart indoor localization system using deep-...
Software engineering model based smart indoor localization system using deep-...Software engineering model based smart indoor localization system using deep-...
Software engineering model based smart indoor localization system using deep-...
 

Recently uploaded

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

mobilerobot_icwt [Autosaved].pdf

  • 1. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 Mobile Robot Navigation in Dynamic Environments using Reinforcement Learning Fauzi Abdul Rohim*, Sony Sumaryo*, Syamsul Rizal*, Eki Ahmad Zaki Hamidi** (*electrical engineering Telkom University Bandung, **electrical engineering UIN Sunan Gunung Djati Bandung) 083923, Fauzi , Mobile Robot Navigation
  • 2. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 2 Outline • I INTRODUCTION • II METHOD • II.1 Behavior Based Robot (BBR) • II.2 Reinforcement Learning • II.2 Q-Learning • II.3 Design of Q-Learning Algorithm • III EXPERIMENTAL RESULT AND DISCUSSION • IV CONCLUSION • QnA 083923, Fauzi, Mobile Robot Navigation,
  • 3. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 3 INTRODUCTION • Data from robot sensors can be mapped and used by robots for navigation and movement planning. • All modules that represent one behaviors work together [1]. • The main attention is paid to two approaches to coordination mechanisms, namely competitive/arbiter and cooperative/command fusion. • A mobile robot must have high navigation abilities before it can perform other jobs such as carrying items or performing mapping activities. • The navigation system’s use of non-learning algorithms allows the mobile robot to plan and follow a path to its destination, but the algorithm isn’t good enough to provide local planning skills in an unfamiliar environment, where the robot is still stuck in the implementation area. • One of the most powerful methods for solving navigating issues in complicated and unknown surroundings is machine learning. 083923, Fauzi, Mobile Robot Navigation,
  • 4. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 4 INTRODUCTION • One of the most suitable and widely used RL learning methods for autonomous robot applications is RL with the Q-Learning algorithm type. • Therefore, to extend the Q- Learning algorithm related to continuous state and action, combines Q-Learning. • To use reinforcement study, several approaches are possible. • The first consists of manually discrete issues to obtain state and action space; which can be used directly by the algorithm using table Q [3]. • The results obtained show a substantial improvement of the robot’s behavior and learning speed. • Q-Learning is a representation of a machine learning algorithm based on a driven learning process, in which an agent learns and discovers via experience the pattern of action that will provide him with the best longterm reward. 083923, Fauzi, Mobile Robot Navigation,
  • 5. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 5 II.1 Behavior-based Robot (BBR) • The robot will be controlled directly by a low level controller (P controller) which functions to control the movement of the robot by sending perceptual signals (sensors) to the high level controller. • Then the signal becomes a stimulus input that will activate certain behaviors (behavior coordination section). • The output of the behavior coordination is given to the low level controller for controlling the robot. 083923, Fauzi, Mobile Robot Navigation,
  • 6. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 6 II.2 Reinforcement Learning - Reinforcement learning is a kind of middle ground between supervised learning and unsupervised learning, where the algorithm is in a way told only how well it works, not knowing what the exact output of the system should be. - Q-Learning is competitive learning where there are agents with the ability to learn to act optimally by evaluating the consequences of their actions. 083923, Fauzi, Mobile Robot Navigation,
  • 7. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 7 II.3 Design of Q-Learning Algorithm 1) Discretization of state space and action: • In order to improve the control system based on the closed loop controller, the artificial intelligence algorithm described, the Q learning algorithm, will also be designed. 2) Defining the Reward Function: • Defining the Reward Function: The next step in design- ing the algorithm is to define the reward function needed to determine the Q-value and fill in the given Q-table. 3) Policy Determination: • Policy Determination: To select the best action, the Q- Learning algorithm requires a search strategy to determine the actions taken by the agent as a function of the agent’s state and environment, which is called Policy ). 083923, Fauzi, Mobile Robot Navigation,
  • 8. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 8 IV EXPERIMENTAL RESULT AND DISCUSSION The training process is carried out in 200 epochs, where the maximum number of agent actions during one epoch is 300. Since the final training process will involve a larger number of epochs and a greater number of steps per epoch, the optimal value of the parameters will be chosen = 0.5 to ensure safer algorithm convergence in the long run. Finally, the final process of training agents in a simulated environment is carried out with the parameters and hyperparameters selected based on the previous analysis, but this time the training is extended to a larger number of epochs and a greater number of steps per epoch. 083923, Fauzi, Mobile Robot Navigation,
  • 9. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 9 EXPERIMENTAL RESULT AND DISCUSSION 083923, Fauzi, Mobile Robot Navigation, Fig. 4. Average number of agent steps through epochs for a parameter ,Accumulated agent reward through epochs for a parameter, Accumulated agent reward through epochs for a parameter,Average number of agent steps through epochs for a parameter
  • 10. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 10 083923, Fauzi, Mobile Robot Navigation, Fig. 6. Parameters of epoch search strategies, Accumulated agent reward through epochs for different search strategies, The average number of agent steps through epochs for different search strategies Fig. 5. Accumulated agent reward through epochs, final training, Average number of agent steps through epochs, final training, Q-table at the end of the training algorithm
  • 11. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 11 CONCLUSION • After several simulations, the training resulted in convergence and the final Q-table that will be used in the implementation of the algorithm. • the application of artificial intelligence and machine learning algorithms in the field of mobile robotics and autonomous driving. • One of the most popular incentive learning algorithms has been implemented, the Q-learning algorithm, which is based on agent learning from experience. • a conventional control algorithm based on a closed loop controller which ensures that the robot reaches a certain position and orientation. • The controller is implemented and tested in a simulated environment, where its parameters are adjusted. • After several simulations, the training resulted in convergence and the final Q-table that will be used in the implementation of the algorithm. 083923, Fauzi, Mobile Robot Navigation,
  • 12. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 12 QnA • Q : • A : 083923, Fauzi, Mobile Robot Navigation,
  • 13. ICWT 2022 The 8th International Conference on Wireless and Telematics Yogyakarta, Indonesia | 21-22 July 2022 13 End of Presentation Thank You 083923, Fauzi, Mobile Robot Navigation,