SlideShare a Scribd company logo
1 of 20
Download to read offline
Adaptation
                          in embodied & situated agents
                                      Author: Claudio Martella
                          Collaborators: Dott. Stefano Nolfi (ISTC - CNR)
                               Prof. N.A. Borghese (AIS Lab - UniMi)

                                          October, 2011


                                                1
Tuesday, October 11, 11
The problem
                It is difficult to build autonomous systems
                through a top-down approach:


                     • the behavior might be too complex for the
                          designer to control
                     • the environment is noisy and not perfect
                     • the world is unpredictable
                                                2
Tuesday, October 11, 11
Evolutionary robotics is a branch of robotics
                       that uses evolutionary methodologies
                  to develop controllers for autonomous robots.

                                        Nolfi, Floreano [2004] - MIT Press




                                        3
Tuesday, October 11, 11
The objective

                            We wanted to analyze the possibility
                                of applying adaptive processes
                               to embodied & situated agents
                                          considering
                          evolutionary, individual and social learning.



                                                4
Tuesday, October 11, 11
E&S agents

                     • Embodied: the agent can exploit the
                          characteristics of the robot (shape,
                          sensors, actuators etc.).
                     • Situated: the solution can exploit the
                          possible interactions that the environments
                          offers.


                                              5
Tuesday, October 11, 11
The methodology
                          E-puck Robot       Simulation




            Problem: categorize 10 objects (Good, Poisonous)
                                         6
Tuesday, October 11, 11
The evolutionary process




                                     7
Tuesday, October 11, 11
1st goal
                   Implement an algorithm for individual learning.

                                  The algorithm should start
                            with one set of candidate parameters
                          and it would modify them by trial & error.

                          Decision: start from Simulated Annealing *


          * "Optimization by Simulated Annealing", Kirkpatrick, S.; Gelatt, C. D.; Vecchi, M. P. (1983) - Science
                                                             8
Tuesday, October 11, 11
Simulated Annealing
                                       Temperature:

                                       It probabilistically accepts
                                        mutations that decrease
                                               the fitness.

                                       The probability decreases
                                              with time.

                                       It allows the algorithm to
                                       jump out of local minima.
                                   9
Tuesday, October 11, 11
Stochasticity in E&S
                                Evaluation depends on
                            the (random) initial conditions:




                                           10
Tuesday, October 11, 11
The intuition
                            Temperature                                  Stochasticity

             0.9                                         0.9

         0.675                                         0.675

           0.45                                         0.45

         0.225                                         0.225

                0                                         0
                    100   200   300   400   500                10   20       30      40   50

         Probability of accepting negative             Probability of accepting negative
          mutations decreases with the                  mutations decreases with the
                 increase of time                         increase of #evaluations
                                                  11
Tuesday, October 11, 11
Contributions
             Substitute external stochasticity with internal:

             •       Remove Temperature

             •       Start with few evaluations and increase with time


                                       Results
             •       Simplifies the algorithm

             •       Better performance (~10% improvement)

             •       Lighter algorithm (~50% less evaluations for us)

                                               12
Tuesday, October 11, 11
2nd goal
                          Implement an algorithm for social learning.

                            The algorithm should take advantage
                           of the interaction with an expert agent
                               to acquire an adaptive solution
                            that is improved and/or in less time.

                   Decision: apply individual learning to imitation.

                                              13
Tuesday, October 11, 11
Why?

                 Social learning should avoid reinventing the wheel.

                 In principle, when guided, learning is faster & safer.

                          It should be the basis for cultural evolution.




                                                14
Tuesday, October 11, 11
How?
               There are simpler forms of social learning:


                • social facilitation
                • contagious behavior
                • stimulus enhancement

                                       15
Tuesday, October 11, 11
How (technically)?




           Fitness function: student should learn to give
           outputs similar to the agent’s, given the same input.
                                     16
Tuesday, October 11, 11
How (technically)?
               Pure imitation brings to under-fitting individuals.
                      We introduced a hybrid approach.


    f it = f itsoc · (1      ↵) + f itind · ↵

                 c
    ↵=           N




                                          17
Tuesday, October 11, 11
Contributions
             •       Modeled social learning with simple form of imitation

             •       Modeled hybrid social-individual learning approach


                                       Results
             •       Performance on the problem is not improved

             •       Adaptive behavior is acquired faster

             •       More agents acquire an adaptive behavior

                                               18
Tuesday, October 11, 11
Intuitive interpretation
          parameters space                         solutions space




                                  Social learning as a method
                          for promising initial parameters selection.
                                  Social learning as a method
                               for jumping out of local maxima.
                                              19
Tuesday, October 11, 11
Questions?



                              20
Tuesday, October 11, 11

More Related Content

Viewers also liked

Representantes comenius1
Representantes comenius1Representantes comenius1
Representantes comenius1Rosafcid
 
Hadoop: A Hands-on Introduction
Hadoop: A Hands-on IntroductionHadoop: A Hands-on Introduction
Hadoop: A Hands-on IntroductionClaudio Martella
 
Greek trivia2
Greek trivia2Greek trivia2
Greek trivia2sbillia
 
Relay for life
Relay for lifeRelay for life
Relay for lifesbillia
 
Disney trivia2
Disney trivia2Disney trivia2
Disney trivia2sbillia
 
Greek trivia
Greek triviaGreek trivia
Greek triviasbillia
 
Pro sports trivia
Pro sports triviaPro sports trivia
Pro sports triviasbillia
 
Giraph at Hadoop Summit 2014
Giraph at Hadoop Summit 2014Giraph at Hadoop Summit 2014
Giraph at Hadoop Summit 2014Claudio Martella
 

Viewers also liked (10)

Laurel's slide show
Laurel's slide showLaurel's slide show
Laurel's slide show
 
Haikus 1
Haikus 1Haikus 1
Haikus 1
 
Representantes comenius1
Representantes comenius1Representantes comenius1
Representantes comenius1
 
Hadoop: A Hands-on Introduction
Hadoop: A Hands-on IntroductionHadoop: A Hands-on Introduction
Hadoop: A Hands-on Introduction
 
Greek trivia2
Greek trivia2Greek trivia2
Greek trivia2
 
Relay for life
Relay for lifeRelay for life
Relay for life
 
Disney trivia2
Disney trivia2Disney trivia2
Disney trivia2
 
Greek trivia
Greek triviaGreek trivia
Greek trivia
 
Pro sports trivia
Pro sports triviaPro sports trivia
Pro sports trivia
 
Giraph at Hadoop Summit 2014
Giraph at Hadoop Summit 2014Giraph at Hadoop Summit 2014
Giraph at Hadoop Summit 2014
 

Similar to Adaptation in Embodied & Situated Agents

Similar to Adaptation in Embodied & Situated Agents (6)

VBPR 1st seminar
VBPR 1st seminarVBPR 1st seminar
VBPR 1st seminar
 
Why do a designed experiment
Why do a designed experimentWhy do a designed experiment
Why do a designed experiment
 
Adapt or Go extinct
Adapt or Go extinctAdapt or Go extinct
Adapt or Go extinct
 
When Tdd Goes Awry
When Tdd Goes AwryWhen Tdd Goes Awry
When Tdd Goes Awry
 
A Cloud Outage Under the Lens of “Profound Knowledge”
A Cloud Outage Under the Lens of  “Profound Knowledge” A Cloud Outage Under the Lens of  “Profound Knowledge”
A Cloud Outage Under the Lens of “Profound Knowledge”
 
Exploratory Testing in an Agile Context
Exploratory Testing in an Agile ContextExploratory Testing in an Agile Context
Exploratory Testing in an Agile Context
 

Recently uploaded

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 

Adaptation in Embodied & Situated Agents

  • 1. Adaptation in embodied & situated agents Author: Claudio Martella Collaborators: Dott. Stefano Nolfi (ISTC - CNR) Prof. N.A. Borghese (AIS Lab - UniMi) October, 2011 1 Tuesday, October 11, 11
  • 2. The problem It is difficult to build autonomous systems through a top-down approach: • the behavior might be too complex for the designer to control • the environment is noisy and not perfect • the world is unpredictable 2 Tuesday, October 11, 11
  • 3. Evolutionary robotics is a branch of robotics that uses evolutionary methodologies to develop controllers for autonomous robots. Nolfi, Floreano [2004] - MIT Press 3 Tuesday, October 11, 11
  • 4. The objective We wanted to analyze the possibility of applying adaptive processes to embodied & situated agents considering evolutionary, individual and social learning. 4 Tuesday, October 11, 11
  • 5. E&S agents • Embodied: the agent can exploit the characteristics of the robot (shape, sensors, actuators etc.). • Situated: the solution can exploit the possible interactions that the environments offers. 5 Tuesday, October 11, 11
  • 6. The methodology E-puck Robot Simulation Problem: categorize 10 objects (Good, Poisonous) 6 Tuesday, October 11, 11
  • 7. The evolutionary process 7 Tuesday, October 11, 11
  • 8. 1st goal Implement an algorithm for individual learning. The algorithm should start with one set of candidate parameters and it would modify them by trial & error. Decision: start from Simulated Annealing * * "Optimization by Simulated Annealing", Kirkpatrick, S.; Gelatt, C. D.; Vecchi, M. P. (1983) - Science 8 Tuesday, October 11, 11
  • 9. Simulated Annealing Temperature: It probabilistically accepts mutations that decrease the fitness. The probability decreases with time. It allows the algorithm to jump out of local minima. 9 Tuesday, October 11, 11
  • 10. Stochasticity in E&S Evaluation depends on the (random) initial conditions: 10 Tuesday, October 11, 11
  • 11. The intuition Temperature Stochasticity 0.9 0.9 0.675 0.675 0.45 0.45 0.225 0.225 0 0 100 200 300 400 500 10 20 30 40 50 Probability of accepting negative Probability of accepting negative mutations decreases with the mutations decreases with the increase of time increase of #evaluations 11 Tuesday, October 11, 11
  • 12. Contributions Substitute external stochasticity with internal: • Remove Temperature • Start with few evaluations and increase with time Results • Simplifies the algorithm • Better performance (~10% improvement) • Lighter algorithm (~50% less evaluations for us) 12 Tuesday, October 11, 11
  • 13. 2nd goal Implement an algorithm for social learning. The algorithm should take advantage of the interaction with an expert agent to acquire an adaptive solution that is improved and/or in less time. Decision: apply individual learning to imitation. 13 Tuesday, October 11, 11
  • 14. Why? Social learning should avoid reinventing the wheel. In principle, when guided, learning is faster & safer. It should be the basis for cultural evolution. 14 Tuesday, October 11, 11
  • 15. How? There are simpler forms of social learning: • social facilitation • contagious behavior • stimulus enhancement 15 Tuesday, October 11, 11
  • 16. How (technically)? Fitness function: student should learn to give outputs similar to the agent’s, given the same input. 16 Tuesday, October 11, 11
  • 17. How (technically)? Pure imitation brings to under-fitting individuals. We introduced a hybrid approach. f it = f itsoc · (1 ↵) + f itind · ↵ c ↵= N 17 Tuesday, October 11, 11
  • 18. Contributions • Modeled social learning with simple form of imitation • Modeled hybrid social-individual learning approach Results • Performance on the problem is not improved • Adaptive behavior is acquired faster • More agents acquire an adaptive behavior 18 Tuesday, October 11, 11
  • 19. Intuitive interpretation parameters space solutions space Social learning as a method for promising initial parameters selection. Social learning as a method for jumping out of local maxima. 19 Tuesday, October 11, 11
  • 20. Questions? 20 Tuesday, October 11, 11