Exploit vs. Explore
Multi-Armed Bandit problem
Image: medium.com/10x-curiosity
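One common way to balance exploitation and exploration in a multi-armed bandit is the epsilon-greedy rule, sketched below. The slides do not name a specific strategy, so the rule itself, the function name `epsilon_greedy_bandit`, the Bernoulli reward model, and all parameter values are illustrative assumptions.

```python
import random


def epsilon_greedy_bandit(win_probs, steps=5000, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit with the epsilon-greedy rule (illustrative sketch).

    win_probs: hypothetical true win probability of each arm (hidden from the agent).
    Returns (estimated value of each arm, number of pulls of each arm).
    """
    rng = random.Random(seed)
    n_arms = len(win_probs)
    estimates = [0.0] * n_arms  # running average reward per arm
    pulls = [0] * n_arms
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm to improve its estimate.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pull the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])
        reward = 1.0 if rng.random() < win_probs[arm] else 0.0
        pulls[arm] += 1
        # Incremental update of the running average.
        estimates[arm] += (reward - estimates[arm]) / pulls[arm]
    return estimates, pulls


estimates, pulls = epsilon_greedy_bandit([0.2, 0.5, 0.8])
```

With `epsilon = 0` the agent only exploits and can lock onto a poor arm; with `epsilon = 1` it only explores and never uses what it has learned.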
MORE ABOUT
ENVIRONMENTS
Types of environments
Fully observable: the agent knows its current state completely.
Partially observable: the agent knows only part of its current state.
Unobservable: the agent knows nothing about its current state.
Single agent vs. multiagent
Episodic vs. sequential
Other types:
+ Deterministic vs. stochastic
+ Discrete vs. continuous
+ Static vs. dynamic
+ …
Learn more from Section 2.3 of (Russell, 2016).
State representations
Source: (Russell, 2016)
MORE ABOUT
AGENTS
PEAS description of agents
Source: (Russell, 2016)
Basic kinds of agents
Simple reflex agents
Source: (Russell, 2016)
Agents that select actions based ONLY on the current percept, ignoring the previous percepts.
For example,
Simple reflex agents do NOT work well in some cases, e.g., in partially observable environments.
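A minimal sketch of a simple reflex agent, assuming the two-square vacuum world used in (Russell, 2016): the percept is a `(location, status)` pair, and the action is chosen from that percept alone. The function name and the string labels are illustrative choices.

```python
def simple_reflex_vacuum_agent(percept):
    """Choose an action from the current percept only; no memory is kept."""
    location, status = percept
    if status == "Dirty":
        return "Suck"
    if location == "A":
        return "Right"
    return "Left"


percepts = [("A", "Dirty"), ("A", "Clean"), ("B", "Clean")]
actions = [simple_reflex_vacuum_agent(p) for p in percepts]
```

Because the agent keeps no history, it keeps moving back and forth even after both squares are clean, and it has no basis for choosing a direction if the location is missing from the percept.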
Model-based reflex agents
Agents that try to "guess" the unobserved part of the environment by using some model of the
environment.
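A model-based variant of the same idea can be sketched as follows, again assuming the two-square vacuum world from (Russell, 2016). The class name `ModelBasedVacuumAgent` and the `NoOp` action are illustrative assumptions; the point is that the agent stores what it believes about squares it cannot currently see.

```python
class ModelBasedVacuumAgent:
    """Reflex agent with an internal model of the two-square world (sketch)."""

    def __init__(self):
        # Believed status of each square; None means "not observed yet".
        self.model = {"A": None, "B": None}

    def act(self, percept):
        location, status = percept
        self.model[location] = status  # update the model from the percept
        if status == "Dirty":
            self.model[location] = "Clean"  # predicted effect of Suck
            return "Suck"
        if all(s == "Clean" for s in self.model.values()):
            return "NoOp"  # the model says nothing is left to do
        return "Right" if location == "A" else "Left"


agent = ModelBasedVacuumAgent()
trace = [agent.act(p) for p in [("A", "Dirty"), ("A", "Clean"), ("B", "Clean")]]
```

Unlike a simple reflex agent, this one stops once its model says both squares are clean.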
Goal-based agents
The agents described so far are reflex agents: they act only according to the state of the environment.
Goal-based agents select actions based on both the state and their goal.
For example, when seeing brake lights of the car in front of it:
+ A reflex agent
+ A goal-based agent, with the goal "not hitting other cars",
The goal-based agent is more flexible.
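The difference can be sketched in code. Everything below is a hypothetical illustration: the state fields (`gap`, `speed`, `lead_speed`), the one-step model `predict_gap`, and the braking effect of 5 units are assumptions, not taken from the slides.

```python
def reflex_driver(percept):
    """Condition-action rule: brake lights ahead -> brake."""
    return "brake" if percept["brake_lights_ahead"] else "keep_speed"


def predict_gap(state, action):
    """Hypothetical one-step model: gap to the car ahead after the action."""
    own_speed = state["speed"] - (5 if action == "brake" else 0)
    return state["gap"] - (own_speed - state["lead_speed"])


def goal_based_driver(state, actions=("keep_speed", "brake")):
    """Pick the first action whose predicted outcome satisfies the goal."""
    for action in actions:
        if predict_gap(state, action) > 0:  # goal: not hitting other cars
            return action
    return "brake"  # fallback when no action is predicted to reach the goal


near = {"gap": 3, "speed": 20, "lead_speed": 15, "brake_lights_ahead": True}
far = dict(near, gap=50)
```

For `near`, both agents brake. For `far`, the reflex agent still brakes because the rule fired, while the goal-based agent keeps its speed because its model predicts that the goal is still met.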
Utility-based agents
Goals just provide a binary distinction between “DONE” and “NOT DONE” states.
Utilities consider HOW WELL it is done.
For example,
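One hypothetical illustration: several routes all satisfy the goal of arriving, but a utility function ranks them by how well they do so. The route data, the weights, and the function names below are assumptions made for this sketch.

```python
def reaches_goal(route):
    """Goal test: binary, only asks whether the destination is reached."""
    return route["arrives"]


def utility(route, time_weight=1.0, toll_weight=0.5):
    """Hypothetical utility: shorter and cheaper trips score higher."""
    return -(time_weight * route["minutes"] + toll_weight * route["toll"])


routes = [
    {"name": "highway", "arrives": True, "minutes": 30, "toll": 40},
    {"name": "city", "arrives": True, "minutes": 45, "toll": 0},
    {"name": "closed_road", "arrives": False, "minutes": 20, "toll": 0},
]

# A goal-based agent cannot distinguish between the feasible routes.
feasible = [r for r in routes if reaches_goal(r)]
# A utility-based agent picks the feasible route with the highest utility.
best = max(feasible, key=utility)
```

Changing the weights changes the ranking, which is how a utility function expresses trade-offs that a binary goal cannot.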
Learning agents
State-of-the-art systems.
A learning agent includes 4 components:
+ Performance element:
+ Critic:
+ Learning element:
+ Problem generator:
Example: A taxi agent
+ It drives using the
+ The critic observes the world and gives feedback to the learning element. For example,
+ Seeing this feedback, the learning element creates a rule to
and gives it to the performance element to use.
+ The problem generator might identify some behaviors in need of improvement and suggest
experiments, such as
SOLVING PROBLEMS BY
SEARCHING
Main reference:
Chapter 3 of Russell, S., & Norvig, P. (2016). Artificial intelligence: a modern approach.
Focus on problem-solving agents
+ A kind of goal-based agent
+ Uses atomic representation for environment state
Assumptions of the environment:
+ Observable:
+ Discrete:
+ Deterministic:
Concepts
Goal:
Solution:
Optimal solution:
Search:
State space:
Level of abstraction:
Problem description using graph
Node:
Edge:
Path:
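A state space can be stored as a weighted graph. The sketch below assumes a small hypothetical graph with nodes `A` to `D`; the dictionary layout and the helper `is_path` are illustrative choices, not the slides' notation.

```python
# Hypothetical state-space graph: each node is a state, each edge is an
# action leading to a neighboring state, labeled with its step cost.
graph = {
    "A": {"B": 4, "C": 2},
    "B": {"D": 5},
    "C": {"B": 1, "D": 8},
    "D": {},
}


def is_path(graph, nodes):
    """True if every consecutive pair of nodes is joined by an edge."""
    return all(b in graph[a] for a, b in zip(nodes, nodes[1:]))


valid = is_path(graph, ["A", "C", "B", "D"])
invalid = is_path(graph, ["A", "D"])
```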
[Figure: map; visible label: BẾN TRE]
Components of a problem
1. Initial state: a state that the agent starts in.
For example,
2. Possible actions: actions that are applicable to the agent at a given state.
Notation: ACTIONS(state s).
For example,
3. Transition model: result of an action in a state.
Notation: RESULT(state s, action a).
For example,
Note:
Successor:
4. Goal test: determines whether a state is a goal state.
For example,
5. Cost: a performance measure (lower is better).
Step cost:
Path cost:
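The five components map naturally onto a small class. This is a sketch under stated assumptions: the class name `RouteProblem`, the method names, and the road map with cities `A` to `D` are hypothetical, and an action is represented simply by the name of the neighboring city to drive to.

```python
class RouteProblem:
    """Search problem on a road map, following the five components above."""

    def __init__(self, initial, goal, roads):
        self.initial = initial  # 1. initial state
        self.goal = goal
        self.roads = roads      # roads[city][neighbor] = distance

    def actions(self, state):
        """2. ACTIONS(s): the neighboring cities reachable from s."""
        return sorted(self.roads[state])

    def result(self, state, action):
        """3. RESULT(s, a): driving to a neighbor puts the agent there."""
        if action not in self.roads[state]:
            raise ValueError(f"{action!r} is not applicable in {state!r}")
        return action

    def goal_test(self, state):
        """4. Goal test: is this state the destination?"""
        return state == self.goal

    def step_cost(self, state, action):
        """5a. Step cost: cost of taking one action in a state."""
        return self.roads[state][action]

    def path_cost(self, actions):
        """5b. Path cost: sum of step costs along a sequence of actions."""
        state, total = self.initial, 0
        for action in actions:
            total += self.step_cost(state, action)
            state = self.result(state, action)
        return total


roads = {"A": {"B": 4, "C": 2}, "B": {"D": 5}, "C": {"B": 1, "D": 8}, "D": {}}
problem = RouteProblem("A", "D", roads)
```

For this map, `problem.path_cost(["C", "B", "D"])` is 2 + 1 + 5 = 8, which is lower than the 4 + 5 = 9 of the more direct-looking `["B", "D"]`.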
Example problem 1
Source: (Russell, 2016)
1. Initial state:
2. Possible actions:
3. Transition model:
4. Goal test:
5. Cost:
Example problem 2
Source: (Russell, 2016)
1. Initial state:
2. Possible actions:
3. Transition model:
4. Goal test:
5. Cost:
Exercise: Route-finding problem
Formulate the problem of finding a route from one city/province to another.
Cities/provinces are shown on the map given in the next slide.
FYI: Route-finding algorithms are used in packet routing on the Internet, airline travel-planning systems, and so on.
1. Initial state:
2. Possible actions:
3. Transition model:
4. Goal test:
5. Cost:
[Figure: map of cities/provinces for the exercise; visible label: BẾN TRE]
Other real-world problems
Touring problems
The traveling salesperson problem (TSP)
A VLSI layout problem
Robot navigation
Automatic assembly sequencing
See Section 3.2.2 in (Russell, 2016) for more details.

AIw04_slide.pptx