SlideShare a Scribd company logo
Deep Reinforcement Learning for
Real-Time Assembly Planning in
Robot-Based Prefabricated
Construction
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
Presenter
Haider Ali
Usama Hassan (399898)
INTRODUCTION
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● Prefabricated Construction - Efficiency, Cost, Quality and Reduced
Waste.
● Despite intro of construction 4.0 - Traditional methods still dominate on-
site assembly processes.
● Challenges in Real time assembly planning methods:
❖ Difficult to model
❖ Safety and Quality control
❖ Obstacle Free assembly paths
❖ Construction scenario
❖ Uncertainties in assembly process
● Advances in DRL provides a solution to these optimization and control
problems.
LITERATURE REVIEW
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● BIM and IOT based: Management and monitoring of on-site assembly processes.
● Motion Planning Algorithms: Robot-based assembly process of hospitalization
facilities using prefab components and BIM based prototypes.
● Discrete Event Processing Models: Discrete event processing models integrated with
the BIM based tool to simulate the construction process.
● Geometric Reasoning based: A component-based automated assembly method for
prefabricated buildings
● DRL based Robotic Manipulator Control
● DRL Applications : Games, Transportation, Navigation, Healthcare, Constructions, etc.
PROPOSED METHODOLOGY
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● The framework of DRL for assembly
planning is depicted here.
● It contributes to the following issues:
❖ Flexible decision making between
on-site planning.
❖ Mitigate scenario complexities,
dynamics and uncertainties.
❖ A set of benchmark for assembly
planning in robot based
prefabricated construction.
DRL MODEL FOR ASSEMBLING PLANNING
States and Actions
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● Construction environment states represented as a tensor of shape
W * L * d, where W denotes width and L denotes length.
● Each cube's features and status encoded using a 6-tuple of integer values
within the tensor, represented by variable d.
● The tuple includes height of the component on the yard, number of transit
steps of the building component, height of the target position, etc
● Actions defined as operations on a component by a crane
● Six possible directions for executing an action: forward, backward, left,
right, up, and down
DRL MODEL FOR ASSEMBLING PLANNING
Rewards
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● Designing rewards as a critical factor for learning success and efficiency
● Transforming the time factor into the number of steps for assembly
completion in the simulation environment.
● Minimizing the number of component actions by providing a basic reward
for each step taken by the agent.
● Imposing a threshold on the maximum number of steps for individual
component construction to avoid over-exploration.
● Utilizing a progressive reward mechanism where the agent receives larger
rewards as more components are successfully built.
DRL MODEL FOR ASSEMBLING PLANNING
Rewards
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● The reward function is summarized where
Rc represents the rewards of a
component, i is the index and t is the total
steps of the component.
● The constant ci represents the serial no.
during assembly process and epsilon is the
step threshold for each component.
RECONFIGURABLE SIMULATOR FOR ASSEMBLY
OPERATIONS
Scenario Description
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● Prefabricated construction involves assembling prefab components
according to a construction plan, following three main steps: component
transfer, connection node positioning, and component connection.
● Robots are essential in facilitating prefab component assembly by
identifying component states and performing corresponding tasks.
● Component states, including initial, in transit, arrived, and assembled,
represent different stages of the assembly process.
● Simulator design considers safety requirements (avoiding collisions,
maintaining distance) and assembly requirements (order of assembly,
orientation adjustment). Six actions simulate component movement in 3D
space.
RECONFIGURABLE SIMULATOR FOR ASSEMBLY
OPERATIONS
Simulator Development
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● Developed a new simulation
environment for assembly planning
using DRL.
● Utilizes 3D grids and simplified BIM
model for prefabricated construction.
● Simulates construction tasks,
assembly processes, and evaluates
DRL performance.
● Developed with pygame and
pyOpenGL, using 1x1x1 cubes as
basic units.
TRANSFORMING THE SCENES INTO SIMULATION
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
EXPERIMENTAL CASE STUDIES
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
Four environment simulations were set up ranging from simple to more complex
RESULTS AND ANALYSIS
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
MEAN REWARDS
± STANDARD ERRORS
OVER 15 RANDOM
SEEDS IN THE LAST 100
EPISODES ON ALL
ENVIRONMENTS.
THE BEST RESULTS
ARE IN BOLD
RESULTS AND ANALYSIS
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
Comparison between RL
algorithms in all
environments with
different scenarios
CONCLUSION AND FUTURE WORK
Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
● RL Integration: This paper explores the use of RL in construction planning, leveraging its
ability to learn optimal strategies through interaction with a simulation environment.
● Simplified Simulation Environment: A simplified environment is developed to simulate
robot-based construction processes, allowing for the evaluation of RL algorithms and their
performance in planning tasks.
● Performance Benchmarks: Four construction scenarios and environments are
considered, ranging from simple to complex, providing benchmarks to assess the
effectiveness of RL-based controllers. DQN and DDQN algorithms show superior
performance.
● Future Directions: Future work focuses on enhancing realism by incorporating 4D
planning software and real BIM models. Additionally, performance of DQN and DDQN
algorithms will be optimized using a short-sighted lean planner for improved scalability.

More Related Content

Similar to Deep_Reinforcment_learning_Presentation.pptx

Suresh resume
Suresh resumeSuresh resume
Suresh resume
suresh ravichandran
 
6.0 CURT CII - Digital Transformation of Construction Industry
6.0 CURT CII  - Digital Transformation of Construction Industry6.0 CURT CII  - Digital Transformation of Construction Industry
6.0 CURT CII - Digital Transformation of Construction Industry
CCT International
 
IRJET- Design, Development and Manufacturing of Dedicated Fixture for the...
IRJET-  	  Design, Development and Manufacturing of Dedicated Fixture for the...IRJET-  	  Design, Development and Manufacturing of Dedicated Fixture for the...
IRJET- Design, Development and Manufacturing of Dedicated Fixture for the...
IRJET Journal
 
Mechatronics engineer
Mechatronics engineerMechatronics engineer
Mechatronics engineer
Samuel Narcisse
 
022 B Construction Croductivity-WRITE SYSTEM
022 B Construction Croductivity-WRITE SYSTEM 022 B Construction Croductivity-WRITE SYSTEM
022 B Construction Croductivity-WRITE SYSTEM
Dr Fereidoun Dejahang
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
drdej19
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
DrFereydounDejahang
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
Dr Fereidoun Dejahang
 
bim and project controls
 bim and project controls bim and project controls
bim and project controls
Omar Selim
 
Keeping code clean
Keeping code cleanKeeping code clean
Keeping code clean
Brett Child
 
Computational steering Interactive Design-through-Analysis for Simulation Sci...
Computational steering Interactive Design-through-Analysis for Simulation Sci...Computational steering Interactive Design-through-Analysis for Simulation Sci...
Computational steering Interactive Design-through-Analysis for Simulation Sci...
SURFevents
 
Simulation of robotic positions and programming
Simulation of robotic positions and programmingSimulation of robotic positions and programming
Simulation of robotic positions and programming
Rachit Laharia
 
Resume-Shrikant Kumbhar-2
Resume-Shrikant Kumbhar-2Resume-Shrikant Kumbhar-2
Resume-Shrikant Kumbhar-2
Shrikant Kumbhar
 
Software Presentation Slides
Software Presentation SlidesSoftware Presentation Slides
Software Presentation Slides
Arissa Loh
 
Zuhair Haddad - 2nd Annual BIM Conference ME September 2013
Zuhair Haddad - 2nd Annual BIM Conference ME September 2013Zuhair Haddad - 2nd Annual BIM Conference ME September 2013
Zuhair Haddad - 2nd Annual BIM Conference ME September 2013
CCT International
 
design of g building.pdf
design of g building.pdfdesign of g building.pdf
design of g building.pdf
NikhilMoviechanger
 
REEP[1]
REEP[1]REEP[1]
Prasanth
PrasanthPrasanth
Prasanth
Prasanth K
 
B.Agasthian July16
B.Agasthian July16B.Agasthian July16
B.Agasthian July16
Agasthian Boopathi
 

Similar to Deep_Reinforcment_learning_Presentation.pptx (20)

Suresh resume
Suresh resumeSuresh resume
Suresh resume
 
6.0 CURT CII - Digital Transformation of Construction Industry
6.0 CURT CII  - Digital Transformation of Construction Industry6.0 CURT CII  - Digital Transformation of Construction Industry
6.0 CURT CII - Digital Transformation of Construction Industry
 
IRJET- Design, Development and Manufacturing of Dedicated Fixture for the...
IRJET-  	  Design, Development and Manufacturing of Dedicated Fixture for the...IRJET-  	  Design, Development and Manufacturing of Dedicated Fixture for the...
IRJET- Design, Development and Manufacturing of Dedicated Fixture for the...
 
Mechatronics engineer
Mechatronics engineerMechatronics engineer
Mechatronics engineer
 
022 B Construction Croductivity-WRITE SYSTEM
022 B Construction Croductivity-WRITE SYSTEM 022 B Construction Croductivity-WRITE SYSTEM
022 B Construction Croductivity-WRITE SYSTEM
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
 
bim and project controls
 bim and project controls bim and project controls
bim and project controls
 
Keeping code clean
Keeping code cleanKeeping code clean
Keeping code clean
 
Computational steering Interactive Design-through-Analysis for Simulation Sci...
Computational steering Interactive Design-through-Analysis for Simulation Sci...Computational steering Interactive Design-through-Analysis for Simulation Sci...
Computational steering Interactive Design-through-Analysis for Simulation Sci...
 
Simulation of robotic positions and programming
Simulation of robotic positions and programmingSimulation of robotic positions and programming
Simulation of robotic positions and programming
 
Resume-Shrikant Kumbhar-2
Resume-Shrikant Kumbhar-2Resume-Shrikant Kumbhar-2
Resume-Shrikant Kumbhar-2
 
Software Presentation Slides
Software Presentation SlidesSoftware Presentation Slides
Software Presentation Slides
 
Zuhair Haddad - 2nd Annual BIM Conference ME September 2013
Zuhair Haddad - 2nd Annual BIM Conference ME September 2013Zuhair Haddad - 2nd Annual BIM Conference ME September 2013
Zuhair Haddad - 2nd Annual BIM Conference ME September 2013
 
design of g building.pdf
design of g building.pdfdesign of g building.pdf
design of g building.pdf
 
REEP[1]
REEP[1]REEP[1]
REEP[1]
 
Prasanth
PrasanthPrasanth
Prasanth
 
B.Agasthian July16
B.Agasthian July16B.Agasthian July16
B.Agasthian July16
 

Recently uploaded

Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
AlexanderRichford
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
Ortus Solutions, Corp
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Neo4j
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 

Recently uploaded (20)

Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 

Deep_Reinforcment_learning_Presentation.pptx

  • 1. Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction Presenter Haider Ali Usama Hassan (399898)
  • 2. INTRODUCTION Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● Prefabricated Construction - Efficiency, Cost, Quality and Reduced Waste. ● Despite intro of construction 4.0 - Traditional methods still dominate on- site assembly processes. ● Challenges in Real time assembly planning methods: ❖ Difficult to model ❖ Safety and Quality control ❖ Obstacle Free assembly paths ❖ Construction scenario ❖ Uncertainties in assembly process ● Advances in DRL provides a solution to these optimization and control problems.
  • 3. LITERATURE REVIEW Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● BIM and IOT based: Management and monitoring of on-site assembly processes. ● Motion Planning Algorithms: Robot-based assembly process of hospitalization facilities using prefab components and BIM based prototypes. ● Discrete Event Processing Models: Discrete event processing models integrated with the BIM based tool to simulate the construction process. ● Geometric Reasoning based: A component-based automated assembly method for prefabricated buildings ● DRL based Robotic Manipulator Control ● DRL Applications : Games, Transportation, Navigation, Healthcare, Constructions, etc.
  • 4. PROPOSED METHODOLOGY Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● The framework of DRL for assembly planning is depicted here. ● It contributes to the following issues: ❖ Flexible decision making between on-site planning. ❖ Mitigate scenario complexities, dynamics and uncertainties. ❖ A set of benchmark for assembly planning in robot based prefabricated construction.
  • 5. DRL MODEL FOR ASSEMBLING PLANNING States and Actions Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● Construction environment states represented as a tensor of shape W * L * d, where W denotes width and L denotes length. ● Each cube's features and status encoded using a 6-tuple of integer values within the tensor, represented by variable d. ● The tuple includes height of the component on the yard, number of transit steps of the building component, height of the target position, etc ● Actions defined as operations on a component by a crane ● Six possible directions for executing an action: forward, backward, left, right, up, and down
  • 6. DRL MODEL FOR ASSEMBLING PLANNING Rewards Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● Designing rewards as a critical factor for learning success and efficiency ● Transforming the time factor into the number of steps for assembly completion in the simulation environment. ● Minimizing the number of component actions by providing a basic reward for each step taken by the agent. ● Imposing a threshold on the maximum number of steps for individual component construction to avoid over-exploration. ● Utilizing a progressive reward mechanism where the agent receives larger rewards as more components are successfully built.
  • 7. DRL MODEL FOR ASSEMBLING PLANNING Rewards Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● The reward function is summarized where Rc represents the rewards of a component, i is the index and t is the total steps of the component. ● The constant ci represents the serial no. during assembly process and epsilon is the step threshold for each component.
  • 8. RECONFIGURABLE SIMULATOR FOR ASSEMBLY OPERATIONS Scenario Description Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● Prefabricated construction involves assembling prefab components according to a construction plan, following three main steps: component transfer, connection node positioning, and component connection. ● Robots are essential in facilitating prefab component assembly by identifying component states and performing corresponding tasks. ● Component states, including initial, in transit, arrived, and assembled, represent different stages of the assembly process. ● Simulator design considers safety requirements (avoiding collisions, maintaining distance) and assembly requirements (order of assembly, orientation adjustment). Six actions simulate component movement in 3D space.
  • 9. RECONFIGURABLE SIMULATOR FOR ASSEMBLY OPERATIONS Simulator Development Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● Developed a new simulation environment for assembly planning using DRL. ● Utilizes 3D grids and simplified BIM model for prefabricated construction. ● Simulates construction tasks, assembly processes, and evaluates DRL performance. ● Developed with pygame and pyOpenGL, using 1x1x1 cubes as basic units.
  • 10. TRANSFORMING THE SCENES INTO SIMULATION Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction
  • 11. EXPERIMENTAL CASE STUDIES Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction Four environment simulations were set up ranging from simple to more complex
  • 12. RESULTS AND ANALYSIS Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction MEAN REWARDS ± STANDARD ERRORS OVER 15 RANDOM SEEDS IN THE LAST 100 EPISODES ON ALL ENVIRONMENTS. THE BEST RESULTS ARE IN BOLD
  • 13. RESULTS AND ANALYSIS Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction Comparison between RL algorithms in all environments with different scenarios
  • 14. CONCLUSION AND FUTURE WORK Deep Reinforcement Learning for Real-Time Assembly Planning in Robot-Based Prefabricated Construction ● RL Integration: This paper explores the use of RL in construction planning, leveraging its ability to learn optimal strategies through interaction with a simulation environment. ● Simplified Simulation Environment: A simplified environment is developed to simulate robot-based construction processes, allowing for the evaluation of RL algorithms and their performance in planning tasks. ● Performance Benchmarks: Four construction scenarios and environments are considered, ranging from simple to complex, providing benchmarks to assess the effectiveness of RL-based controllers. DQN and DDQN algorithms show superior performance. ● Future Directions: Future work focuses on enhancing realism by incorporating 4D planning software and real BIM models. Additionally, performance of DQN and DDQN algorithms will be optimized using a short-sighted lean planner for improved scalability.