SlideShare a Scribd company logo
CSE 412: Artificial IntelligenceCSE 412: Artificial Intelligence
Topic - 2: Intelligent AgentsTopic - 2: Intelligent Agents
Fall 2018Fall 2018
Tajim Md. Niamat Ullah Akhund
Lecturer
Department of Computer Science and Engineering
Daffodil International University
Email: tajim.cse@diu.edu.bd
Agents and EnvironmentsAgents and Environments
Good Behavior: The Concept of RationalityGood Behavior: The Concept of Rationality
The Nature of EnvironmentsThe Nature of Environments
The Structure of AgentsThe Structure of Agents
Topic ContentsTopic Contents
Intelligent AgentIntelligent Agent
An agent
 perceives its environment with sensors
and
 acts upon that environment with its
actuators.
An agent gets percepts one at a time,
and maps this percept sequence to
actions.
3
Majidur RahmanMajidur Rahman
An AgentAn Agent
4
Majidur RahmanMajidur Rahman
Example of an AgentExample of an Agent
Let us consider a vacuum-cleaner agent that is
shown in the figure bellow.
The vacuum-cleaner world has just two locations:
squares A and B.
5
Example of an AgentExample of an Agent
The vacuum agent perceives which square it is in
and whether there is dirt in the square.
It can choose to move left, move right, suck up the
dirt, or do nothing.
One very simple agent function is the following: if
the current square is dirty, then suck; otherwise,
move to the other square.
6
Example of an AgentExample of an Agent
Environment: squares A and B
Percepts: [location, status], e.g. [A, Dirty]
Actions: left, right, suck, and no-op
7
Majidur RahmanMajidur Rahman
function REFLEX-VACUUM-AGENT ([location, status]) return an action
if status == Dirty then return Suck
else if location == A then return Right
else if location == B then return Left
Example of an AgentExample of an Agent
Majidur RahmanMajidur Rahman
Rational AgentRational Agent
 For each possible percept sequence, a rational agent
should select an action (using an agent function) that is
expected to maximize its performance measure, given
the evidence provided by the percept sequence and
whatever built-in prior knowledge the agent has.
 A percept sequence is the complete history of anything
the agent has ever perceived.
 A performance measure is a means of calculating how
well the agent has performed based on the sequence of
percepts that it has received.
 An agent’s prior knowledge of the environment is the
knowledge that the agent designer has given to the
agent before its introduction to the environment.
9
Majidur RahmanMajidur Rahman
Rational AgentRational Agent
 An agent function maps percept sequences to actions.
f : seq(P) →A
 Agent function for vacuum Cleaner example:
10
Majidur RahmanMajidur Rahman
Performance
Measures
Environment
Actuators
Sensors
used by the agent to perform actions
surroundings beyond the control of the agent
used to evaluate how well an agent
solves the task at hand
provide information about the current state of
the environment
 To design a rational agent we must specify its task
environment:
Task EnvironmentsTask Environments
11
Majidur RahmanMajidur Rahman
ExampleExample
PEAS description of the environment for an
automated taxi:
 Performance
Safety, destination, profits, legality, comfort
 Environment
Streets/motorways, other traffic, pedestrians, weather
 Actuators
Steering, accelerator, brake, horn, speaker/display
 Sensors
Video, sonar, speedometer, engine sensors, keyboard,
GPS
12
Majidur RahmanMajidur Rahman
Example…Example…
13
Majidur RahmanMajidur Rahman
Environment TypesEnvironment Types
The environment for an agent may be
 Fully or partially observable
 Deterministic or stochastic
 Episodic or sequential
 Static or dynamic
 Discrete or continuous
 Single or multi-agent
14
Majidur RahmanMajidur Rahman
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
Observable??
Deterministic??
Episodic??
Static??
Discrete??
Single-agent??
Majidur RahmanMajidur Rahman 15
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable??
Deterministic??
Episodic??
Static??
Discrete??
Single-agent??
Fully vs. partially observable: an environment is full observable when
the sensors can detect all aspects that are relevant to the choice of
action.
Majidur RahmanMajidur Rahman 16
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
Deterministic??
Episodic??
Static??
Discrete??
Single-agent??
Fully vs. partially observable: an environment is full observable when
the sensors can detect all aspects that are relevant to the choice of
action.
Majidur RahmanMajidur Rahman 17
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
DeterministicDeterministic??
Episodic??
Static??
Discrete??
Single-agent??
Deterministic vs. stochastic: if the next environment state is
completely determined by the current state the executed action then the
environment is deterministic.
Majidur RahmanMajidur Rahman 18
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
DeterministicDeterministic?? YES NO YES NO
Episodic??
Static??
Discrete??
Single-agent??
Deterministic vs. stochastic: if the next environment state is
completely
determined by the current state the executed action then the
environment is deterministic.
Majidur RahmanMajidur Rahman 19
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
DeterministicDeterministic?? YES NO YES NO
EpisodicEpisodic??
Static??
Discrete??
Single-agent??
Episodic vs. sequential: In an episodic environment the agent’s
experience can be divided into atomic steps where the agents perceives
and then performs a single action. The choice of action depends only on
the episode itself.
Majidur RahmanMajidur Rahman 20
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
Deterministic?? YES NO YES NO
EpisodicEpisodic?? NO NO NO NO
Static??
Discrete??
Single-agent??
Episodic vs. sequential: In an episodic environment the agent’s
experience can be divided into atomic steps where the agents perceives
and then performs a single action. The choice of action depends only on
the episode itself
Majidur RahmanMajidur Rahman 21
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
Observable?? FULL FULL PARTIAL PARTIAL
Deterministic?? YES NO YES NO
Episodic?? NO NO NO NO
Static??
Discrete??
Single-agent??
Static vs. dynamic: If the environment can change while the agent is
choosing an action, the environment is dynamic. Semi-dynamic if the
agent’s performance changes even when the environment remains the
same.
Majidur RahmanMajidur Rahman 22
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
DeterministicDeterministic?? YES NO YES NO
EpisodicEpisodic?? NO NO NO NO
StaticStatic?? YES YES SEMI NO
Discrete??
Single-agent??
Static vs. dynamic: If the environment can change while the agent is choosing
an action, the environment is dynamic. Semi-dynamic if the agent’s performance
changes even when the environment remains the same.
Majidur RahmanMajidur Rahman 23
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
Observable?? FULL FULL PARTIAL PARTIAL
Deterministic?? YES NO YES NO
Episodic?? NO NO NO NO
Static?? YES YES SEMI NO
Discrete??
Single-agent??
Discrete vs. continuous: This distinction can be applied to the state of the
environment, the way time is handled and to the percepts/actions of the agent.
Majidur RahmanMajidur Rahman 24
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
Observable?? FULL FULL PARTIAL PARTIAL
Deterministic?? YES NO YES NO
Episodic?? NO NO NO NO
Static?? YES YES SEMI NO
Discrete?? YES YES YES NO
Single-agent??
Discrete vs. continuous: This distinction can be applied to the state of the
environment, the way time is handled and to the percepts/actions of the agent.
Majidur RahmanMajidur Rahman 25
Environment TypesEnvironment Types
Solitaire Backgammom Internet shopping Taxi
ObservableObservable?? FULL FULL PARTIAL PARTIAL
DeterministicDeterministic???? YES NO YES NO
EpisodicEpisodic???? NO NO NO NO
StaticStatic???? YES YES SEMI NO
DiscreteDiscrete???? YES YES YES NO
Single-agentSingle-agent???? YES NO NO NO
Single vs. multi-agent: Does the environment contain other agents who are
also maximizing some performance measure that depends on the current
agent’s actions?
Majidur RahmanMajidur Rahman 26
Environment TypesEnvironment Types
The simplest environment is
 Fully observable, deterministic, episodic,
static, discrete and single-agent.
Most real situations are:
 Partially observable, stochastic, sequential,
dynamic, continuous and multi-agent.
Majidur RahmanMajidur Rahman 27
Agent TypesAgent Types
Four basic kinds of agent that embody the
principles underlying almost all intelligent systems:
 Simple reflex agents;
 Model-based reflex agents;
 Goal-based agents; and
 Utility-based agents.
28
Majidur RahmanMajidur Rahman
Agent Types: Simple ReflexAgent Types: Simple Reflex
 Select action on the basis of only
the current percept,
e.g. the vacuum-agent
 Large reduction in possible
percept/action situations
 Implemented through
condition-action rules
if dirty then suck
 Example:
– Agent: Mail sorting robot
– Environment: Conveyor belt of
letters
– Rule: e.g.
city = Edin → put Scotland bag
29
Majidur RahmanMajidur Rahman
Agent Types: Model-BasedAgent Types: Model-Based
 To tackle partially observable
environments
 Maintain internal state that
depends on percept history
 Over time update state using
world knowledge
 How does the world change
independently
 How do actions affect the world
 Example:
– Agent: Robot vacuum cleaner
– Environment: Dirty room, furniture
– Model: Map of room, which areas
already cleaned 30
Majidur RahmanMajidur Rahman
Agent Types: Goal-BasedAgent Types: Goal-Based
 The agent needs a goal to know
which situations are desirable
 Difficulties arise when long
sequences of actions are required
to find the goal
 Typically investigated in search
and planning research
 Major difference: future is taken
into account
 Example:
– Agent: Robot maid
– Environment: House and people
– Goal: Clean clothes, tidy room,
table laid, etc 31
Majidur RahmanMajidur Rahman
32
Agent Types: Utility-BasedAgent Types: Utility-Based
 Certain goals can be reached in
different ways
 Some are better: have a higher
utility
 Utility function maps a (sequence
of) state(s) onto a real number
 Improves on goals:
 Selecting between conflicting
goals
 Selecting between several goals
based on likelihood of success
and importance of goals
 Example:
– Agent: Mars Lander
– Environment: The surface of
mars with obstacles
– Utility: Shortest and obstacle-free
path Majidur RahmanMajidur Rahman
THANKS…
Majidur RahmanMajidur Rahman
********** ********** If You Need Me ********** **********
Mail: tajim.cse@diu.edu.bd
Website: https://www.tajimiitju.blogspot.com
ORCID: https://orcid.org/0000-0002-2834-1507
LinkedIn: https://www.linkedin.com/in/tajimiitju
ResearchGate: https://www.researchgate.net/profile/Tajim_Md_Niamat_Ullah_Akhund
YouTube: https://www.youtube.com/tajimiitju?sub_confirmation=1
SlideShare: https://www.slideshare.net/TajimMdNiamatUllahAk
Facebook: https://www.facebook.com/tajim.mohammad
GitHub: https://github.com/tajimiitju
Google+: https://plus.google.com/+tajimiitju
Gmail: tajim.mohammad.3@gmail.com
Twitter: https://twitter.com/Tajim53
Thank you
Tajim Md. Niamat Ullah AkhundTajim Md. Niamat Ullah Akhund

More Related Content

What's hot

Agent architectures
Agent architecturesAgent architectures
Agent architectures
Antonio Moreno
 
Chapter 2 intelligent agents
Chapter 2 intelligent agentsChapter 2 intelligent agents
Chapter 2 intelligent agents
LukasJohnny
 
AI
AIAI
Forms of learning in ai
Forms of learning in aiForms of learning in ai
Forms of learning in ai
Robert Antony
 
An introduction to reinforcement learning
An introduction to reinforcement learningAn introduction to reinforcement learning
An introduction to reinforcement learning
Subrat Panda, PhD
 
1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx
Suvamvlogs
 
Agents_AI.ppt
Agents_AI.pptAgents_AI.ppt
Agents_AI.ppt
sandeep54552
 
Agents and environments
Agents and environmentsAgents and environments
Agents and environments
Megha Sharma
 
Unit1: Introduction to AI
Unit1: Introduction to AIUnit1: Introduction to AI
Unit1: Introduction to AI
Tekendra Nath Yogi
 
Production system in ai
Production system in aiProduction system in ai
Production system in ai
sabin kafle
 
Reflex and model based agents
Reflex and model based agentsReflex and model based agents
Reflex and model based agents
Megha Sharma
 
Intelligent Agents
Intelligent Agents Intelligent Agents
Intelligent Agents
Amar Jukuntla
 
Lecture 4- Agent types
Lecture 4- Agent typesLecture 4- Agent types
Lecture 4- Agent types
Antonio Moreno
 
AI Lecture 4 (informed search and exploration)
AI Lecture 4 (informed search and exploration)AI Lecture 4 (informed search and exploration)
AI Lecture 4 (informed search and exploration)
Tajim Md. Niamat Ullah Akhund
 
Agents in Artificial intelligence
Agents in Artificial intelligence Agents in Artificial intelligence
Agents in Artificial intelligence
Lalit Birla
 
Multi-agent systems
Multi-agent systemsMulti-agent systems
Multi-agent systemsR A Akerkar
 
I. Alpha-Beta Pruning in ai
I. Alpha-Beta Pruning in aiI. Alpha-Beta Pruning in ai
I. Alpha-Beta Pruning in ai
vikas dhakane
 
peas description of task environment with different types of properties
 peas description of task environment with different types of properties peas description of task environment with different types of properties
peas description of task environment with different types of properties
monircse2
 
Hill climbing algorithm
Hill climbing algorithmHill climbing algorithm
Hill climbing algorithm
Dr. C.V. Suresh Babu
 
Types of environment
Types of environmentTypes of environment
Types of environment
Megha Sharma
 

What's hot (20)

Agent architectures
Agent architecturesAgent architectures
Agent architectures
 
Chapter 2 intelligent agents
Chapter 2 intelligent agentsChapter 2 intelligent agents
Chapter 2 intelligent agents
 
AI
AIAI
AI
 
Forms of learning in ai
Forms of learning in aiForms of learning in ai
Forms of learning in ai
 
An introduction to reinforcement learning
An introduction to reinforcement learningAn introduction to reinforcement learning
An introduction to reinforcement learning
 
1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx
 
Agents_AI.ppt
Agents_AI.pptAgents_AI.ppt
Agents_AI.ppt
 
Agents and environments
Agents and environmentsAgents and environments
Agents and environments
 
Unit1: Introduction to AI
Unit1: Introduction to AIUnit1: Introduction to AI
Unit1: Introduction to AI
 
Production system in ai
Production system in aiProduction system in ai
Production system in ai
 
Reflex and model based agents
Reflex and model based agentsReflex and model based agents
Reflex and model based agents
 
Intelligent Agents
Intelligent Agents Intelligent Agents
Intelligent Agents
 
Lecture 4- Agent types
Lecture 4- Agent typesLecture 4- Agent types
Lecture 4- Agent types
 
AI Lecture 4 (informed search and exploration)
AI Lecture 4 (informed search and exploration)AI Lecture 4 (informed search and exploration)
AI Lecture 4 (informed search and exploration)
 
Agents in Artificial intelligence
Agents in Artificial intelligence Agents in Artificial intelligence
Agents in Artificial intelligence
 
Multi-agent systems
Multi-agent systemsMulti-agent systems
Multi-agent systems
 
I. Alpha-Beta Pruning in ai
I. Alpha-Beta Pruning in aiI. Alpha-Beta Pruning in ai
I. Alpha-Beta Pruning in ai
 
peas description of task environment with different types of properties
 peas description of task environment with different types of properties peas description of task environment with different types of properties
peas description of task environment with different types of properties
 
Hill climbing algorithm
Hill climbing algorithmHill climbing algorithm
Hill climbing algorithm
 
Types of environment
Types of environmentTypes of environment
Types of environment
 

Similar to AI Lecture 2 (intelligent agents)

topic-2intelligentagents-190426041219.pdf
topic-2intelligentagents-190426041219.pdftopic-2intelligentagents-190426041219.pdf
topic-2intelligentagents-190426041219.pdf
ShivareddyGangam
 
agents in ai ppt
agents in ai pptagents in ai ppt
agents in ai ppt
Prasanth633635
 
Artificial Intelligent Agents
Artificial Intelligent AgentsArtificial Intelligent Agents
Artificial Intelligent Agents
Dr. Mazhar Ali Dootio
 
AI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial IntelligenceAI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial Intelligence
Kirti Verma
 
ai-slides-1233566181695672-2 (1).pdf
ai-slides-1233566181695672-2 (1).pdfai-slides-1233566181695672-2 (1).pdf
ai-slides-1233566181695672-2 (1).pdf
ShivareddyGangam
 
introduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.pptintroduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.ppt
dejene3
 
chapter2-(Intelligent Agents).ppt
chapter2-(Intelligent Agents).pptchapter2-(Intelligent Agents).ppt
chapter2-(Intelligent Agents).ppt
ssuser99ca78
 
Intelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent ActsIntelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent Acts
Sheetal Jain
 
www.funFruit.com Intelligent Agent www.funFruit.co.docx
www.funFruit.com Intelligent Agent www.funFruit.co.docxwww.funFruit.com Intelligent Agent www.funFruit.co.docx
www.funFruit.com Intelligent Agent www.funFruit.co.docx
ericbrooks84875
 
chapter2.pptx
chapter2.pptxchapter2.pptx
chapter2.pptx
jpradha86
 
Introduction To Artificial Intelligence
Introduction To Artificial IntelligenceIntroduction To Artificial Intelligence
Introduction To Artificial Intelligence
NeHal VeRma
 
Introduction To Artificial Intelligence
Introduction To Artificial IntelligenceIntroduction To Artificial Intelligence
Introduction To Artificial IntelligenceNeHal VeRma
 
AI_02_Intelligent Agents.pptx
AI_02_Intelligent Agents.pptxAI_02_Intelligent Agents.pptx
AI_02_Intelligent Agents.pptx
Yousef Aburawi
 
Artificial intelligence introduction
Artificial intelligence introductionArtificial intelligence introduction
Artificial intelligence introduction
melchismel
 
Lecture 2 Agents.pptx
Lecture 2 Agents.pptxLecture 2 Agents.pptx
Lecture 2 Agents.pptx
AndrewKuziwakwasheMu
 
Unit 1.ppt
Unit 1.pptUnit 1.ppt
Unit 1.ppt
BaskarChelladurai
 
AI_Ch2.pptx
AI_Ch2.pptxAI_Ch2.pptx
AI_Ch2.pptx
qwtadhsaber
 
Intelligent Agents
Intelligent AgentsIntelligent Agents
Intelligent Agents
Pankaj Debbarma
 
Artificial Intelligence Course of BIT Unit 2
Artificial Intelligence Course of BIT Unit 2Artificial Intelligence Course of BIT Unit 2
Artificial Intelligence Course of BIT Unit 2
Home
 
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsJarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsPalGov
 

Similar to AI Lecture 2 (intelligent agents) (20)

topic-2intelligentagents-190426041219.pdf
topic-2intelligentagents-190426041219.pdftopic-2intelligentagents-190426041219.pdf
topic-2intelligentagents-190426041219.pdf
 
agents in ai ppt
agents in ai pptagents in ai ppt
agents in ai ppt
 
Artificial Intelligent Agents
Artificial Intelligent AgentsArtificial Intelligent Agents
Artificial Intelligent Agents
 
AI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial IntelligenceAI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial Intelligence
 
ai-slides-1233566181695672-2 (1).pdf
ai-slides-1233566181695672-2 (1).pdfai-slides-1233566181695672-2 (1).pdf
ai-slides-1233566181695672-2 (1).pdf
 
introduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.pptintroduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.ppt
 
chapter2-(Intelligent Agents).ppt
chapter2-(Intelligent Agents).pptchapter2-(Intelligent Agents).ppt
chapter2-(Intelligent Agents).ppt
 
Intelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent ActsIntelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent Acts
 
www.funFruit.com Intelligent Agent www.funFruit.co.docx
www.funFruit.com Intelligent Agent www.funFruit.co.docxwww.funFruit.com Intelligent Agent www.funFruit.co.docx
www.funFruit.com Intelligent Agent www.funFruit.co.docx
 
chapter2.pptx
chapter2.pptxchapter2.pptx
chapter2.pptx
 
Introduction To Artificial Intelligence
Introduction To Artificial IntelligenceIntroduction To Artificial Intelligence
Introduction To Artificial Intelligence
 
Introduction To Artificial Intelligence
Introduction To Artificial IntelligenceIntroduction To Artificial Intelligence
Introduction To Artificial Intelligence
 
AI_02_Intelligent Agents.pptx
AI_02_Intelligent Agents.pptxAI_02_Intelligent Agents.pptx
AI_02_Intelligent Agents.pptx
 
Artificial intelligence introduction
Artificial intelligence introductionArtificial intelligence introduction
Artificial intelligence introduction
 
Lecture 2 Agents.pptx
Lecture 2 Agents.pptxLecture 2 Agents.pptx
Lecture 2 Agents.pptx
 
Unit 1.ppt
Unit 1.pptUnit 1.ppt
Unit 1.ppt
 
AI_Ch2.pptx
AI_Ch2.pptxAI_Ch2.pptx
AI_Ch2.pptx
 
Intelligent Agents
Intelligent AgentsIntelligent Agents
Intelligent Agents
 
Artificial Intelligence Course of BIT Unit 2
Artificial Intelligence Course of BIT Unit 2Artificial Intelligence Course of BIT Unit 2
Artificial Intelligence Course of BIT Unit 2
 
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsJarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
 

More from Tajim Md. Niamat Ullah Akhund

AI Lecture 7 (uncertainty)
AI Lecture 7 (uncertainty)AI Lecture 7 (uncertainty)
AI Lecture 7 (uncertainty)
Tajim Md. Niamat Ullah Akhund
 
AI Lecture 6 (logical agents)
AI Lecture 6 (logical agents)AI Lecture 6 (logical agents)
AI Lecture 6 (logical agents)
Tajim Md. Niamat Ullah Akhund
 
AI Lecture 5 (game playing)
AI Lecture 5 (game playing)AI Lecture 5 (game playing)
AI Lecture 5 (game playing)
Tajim Md. Niamat Ullah Akhund
 
AI Lecture 3 (solving problems by searching)
AI Lecture 3 (solving problems by searching)AI Lecture 3 (solving problems by searching)
AI Lecture 3 (solving problems by searching)
Tajim Md. Niamat Ullah Akhund
 
AI Lecture 1 (introduction)
AI Lecture 1 (introduction)AI Lecture 1 (introduction)
AI Lecture 1 (introduction)
Tajim Md. Niamat Ullah Akhund
 
Course outline (cse 412 - artificial intelligence)
Course outline (cse   412 - artificial intelligence)Course outline (cse   412 - artificial intelligence)
Course outline (cse 412 - artificial intelligence)
Tajim Md. Niamat Ullah Akhund
 
Artificial intelligence LAB 1 overview & intelligent systems
Artificial intelligence LAB 1   overview & intelligent systemsArtificial intelligence LAB 1   overview & intelligent systems
Artificial intelligence LAB 1 overview & intelligent systems
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 9 – simpson 1,3 and trapezoidal method@taj
Matlab lecture 9 – simpson 1,3 and trapezoidal method@tajMatlab lecture 9 – simpson 1,3 and trapezoidal method@taj
Matlab lecture 9 – simpson 1,3 and trapezoidal method@taj
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 8 – newton's forward and backword interpolation@taj copy
Matlab lecture 8 – newton's forward and backword interpolation@taj   copyMatlab lecture 8 – newton's forward and backword interpolation@taj   copy
Matlab lecture 8 – newton's forward and backword interpolation@taj copy
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 7 – regula falsi or false position method@taj
Matlab lecture 7 – regula falsi or false position method@tajMatlab lecture 7 – regula falsi or false position method@taj
Matlab lecture 7 – regula falsi or false position method@taj
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 6 – newton raphson method@taj copy
Matlab lecture 6 – newton raphson method@taj   copyMatlab lecture 6 – newton raphson method@taj   copy
Matlab lecture 6 – newton raphson method@taj copy
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 5 bisection method@taj
Matlab lecture 5  bisection method@tajMatlab lecture 5  bisection method@taj
Matlab lecture 5 bisection method@taj
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 4 loops@taj
Matlab lecture 4  loops@tajMatlab lecture 4  loops@taj
Matlab lecture 4 loops@taj
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 3 – commands, the m files, data types, bitwise op, set@taj
Matlab lecture 3 – commands, the m files, data types, bitwise op, set@tajMatlab lecture 3 – commands, the m files, data types, bitwise op, set@taj
Matlab lecture 3 – commands, the m files, data types, bitwise op, set@taj
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 2 matlab basic syntax & variables @taj
Matlab lecture 2   matlab basic syntax & variables @tajMatlab lecture 2   matlab basic syntax & variables @taj
Matlab lecture 2 matlab basic syntax & variables @taj
Tajim Md. Niamat Ullah Akhund
 
Matlab lecture 1 - installation of matlab, introduction and course outline@taj
Matlab lecture 1 - installation of matlab, introduction and course outline@tajMatlab lecture 1 - installation of matlab, introduction and course outline@taj
Matlab lecture 1 - installation of matlab, introduction and course outline@taj
Tajim Md. Niamat Ullah Akhund
 
Circuit lab 10 verification of norton's theorem@taj
Circuit lab 10  verification of norton's theorem@tajCircuit lab 10  verification of norton's theorem@taj
Circuit lab 10 verification of norton's theorem@taj
Tajim Md. Niamat Ullah Akhund
 
Circuit lab 9 verification of maximum power transfer theorem@taj
Circuit lab 9  verification of maximum power transfer theorem@tajCircuit lab 9  verification of maximum power transfer theorem@taj
Circuit lab 9 verification of maximum power transfer theorem@taj
Tajim Md. Niamat Ullah Akhund
 
Circuit lab 8 verification of thevenin's theorem@taj
Circuit lab 8  verification of thevenin's theorem@tajCircuit lab 8  verification of thevenin's theorem@taj
Circuit lab 8 verification of thevenin's theorem@taj
Tajim Md. Niamat Ullah Akhund
 
Circuit lab 7 verification of superposition theorem@taj
Circuit lab 7  verification of superposition theorem@tajCircuit lab 7  verification of superposition theorem@taj
Circuit lab 7 verification of superposition theorem@taj
Tajim Md. Niamat Ullah Akhund
 

More from Tajim Md. Niamat Ullah Akhund (20)

AI Lecture 7 (uncertainty)
AI Lecture 7 (uncertainty)AI Lecture 7 (uncertainty)
AI Lecture 7 (uncertainty)
 
AI Lecture 6 (logical agents)
AI Lecture 6 (logical agents)AI Lecture 6 (logical agents)
AI Lecture 6 (logical agents)
 
AI Lecture 5 (game playing)
AI Lecture 5 (game playing)AI Lecture 5 (game playing)
AI Lecture 5 (game playing)
 
AI Lecture 3 (solving problems by searching)
AI Lecture 3 (solving problems by searching)AI Lecture 3 (solving problems by searching)
AI Lecture 3 (solving problems by searching)
 
AI Lecture 1 (introduction)
AI Lecture 1 (introduction)AI Lecture 1 (introduction)
AI Lecture 1 (introduction)
 
Course outline (cse 412 - artificial intelligence)
Course outline (cse   412 - artificial intelligence)Course outline (cse   412 - artificial intelligence)
Course outline (cse 412 - artificial intelligence)
 
Artificial intelligence LAB 1 overview & intelligent systems
Artificial intelligence LAB 1   overview & intelligent systemsArtificial intelligence LAB 1   overview & intelligent systems
Artificial intelligence LAB 1 overview & intelligent systems
 
Matlab lecture 9 – simpson 1,3 and trapezoidal method@taj
Matlab lecture 9 – simpson 1,3 and trapezoidal method@tajMatlab lecture 9 – simpson 1,3 and trapezoidal method@taj
Matlab lecture 9 – simpson 1,3 and trapezoidal method@taj
 
Matlab lecture 8 – newton's forward and backword interpolation@taj copy
Matlab lecture 8 – newton's forward and backword interpolation@taj   copyMatlab lecture 8 – newton's forward and backword interpolation@taj   copy
Matlab lecture 8 – newton's forward and backword interpolation@taj copy
 
Matlab lecture 7 – regula falsi or false position method@taj
Matlab lecture 7 – regula falsi or false position method@tajMatlab lecture 7 – regula falsi or false position method@taj
Matlab lecture 7 – regula falsi or false position method@taj
 
Matlab lecture 6 – newton raphson method@taj copy
Matlab lecture 6 – newton raphson method@taj   copyMatlab lecture 6 – newton raphson method@taj   copy
Matlab lecture 6 – newton raphson method@taj copy
 
Matlab lecture 5 bisection method@taj
Matlab lecture 5  bisection method@tajMatlab lecture 5  bisection method@taj
Matlab lecture 5 bisection method@taj
 
Matlab lecture 4 loops@taj
Matlab lecture 4  loops@tajMatlab lecture 4  loops@taj
Matlab lecture 4 loops@taj
 
Matlab lecture 3 – commands, the m files, data types, bitwise op, set@taj
Matlab lecture 3 – commands, the m files, data types, bitwise op, set@tajMatlab lecture 3 – commands, the m files, data types, bitwise op, set@taj
Matlab lecture 3 – commands, the m files, data types, bitwise op, set@taj
 
Matlab lecture 2 matlab basic syntax & variables @taj
Matlab lecture 2   matlab basic syntax & variables @tajMatlab lecture 2   matlab basic syntax & variables @taj
Matlab lecture 2 matlab basic syntax & variables @taj
 
Matlab lecture 1 - installation of matlab, introduction and course outline@taj
Matlab lecture 1 - installation of matlab, introduction and course outline@tajMatlab lecture 1 - installation of matlab, introduction and course outline@taj
Matlab lecture 1 - installation of matlab, introduction and course outline@taj
 
Circuit lab 10 verification of norton's theorem@taj
Circuit lab 10  verification of norton's theorem@tajCircuit lab 10  verification of norton's theorem@taj
Circuit lab 10 verification of norton's theorem@taj
 
Circuit lab 9 verification of maximum power transfer theorem@taj
Circuit lab 9  verification of maximum power transfer theorem@tajCircuit lab 9  verification of maximum power transfer theorem@taj
Circuit lab 9 verification of maximum power transfer theorem@taj
 
Circuit lab 8 verification of thevenin's theorem@taj
Circuit lab 8  verification of thevenin's theorem@tajCircuit lab 8  verification of thevenin's theorem@taj
Circuit lab 8 verification of thevenin's theorem@taj
 
Circuit lab 7 verification of superposition theorem@taj
Circuit lab 7  verification of superposition theorem@tajCircuit lab 7  verification of superposition theorem@taj
Circuit lab 7 verification of superposition theorem@taj
 

Recently uploaded

road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 

Recently uploaded (20)

road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 

AI Lecture 2 (intelligent agents)

  • 1. CSE 412: Artificial IntelligenceCSE 412: Artificial Intelligence Topic - 2: Intelligent AgentsTopic - 2: Intelligent Agents Fall 2018Fall 2018 Tajim Md. Niamat Ullah Akhund Lecturer Department of Computer Science and Engineering Daffodil International University Email: tajim.cse@diu.edu.bd
  • 2. Agents and EnvironmentsAgents and Environments Good Behavior: The Concept of RationalityGood Behavior: The Concept of Rationality The Nature of EnvironmentsThe Nature of Environments The Structure of AgentsThe Structure of Agents Topic ContentsTopic Contents
  • 3. Intelligent AgentIntelligent Agent An agent  perceives its environment with sensors and  acts upon that environment with its actuators. An agent gets percepts one at a time, and maps this percept sequence to actions. 3 Majidur RahmanMajidur Rahman
  • 4. An AgentAn Agent 4 Majidur RahmanMajidur Rahman
  • 5. Example of an AgentExample of an Agent Let us consider a vacuum-cleaner agent that is shown in the figure bellow. The vacuum-cleaner world has just two locations: squares A and B. 5
  • 6. Example of an AgentExample of an Agent The vacuum agent perceives which square it is in and whether there is dirt in the square. It can choose to move left, move right, suck up the dirt, or do nothing. One very simple agent function is the following: if the current square is dirty, then suck; otherwise, move to the other square. 6
  • 7. Example of an AgentExample of an Agent Environment: squares A and B Percepts: [location, status], e.g. [A, Dirty] Actions: left, right, suck, and no-op 7 Majidur RahmanMajidur Rahman
  • 8. function REFLEX-VACUUM-AGENT ([location, status]) return an action if status == Dirty then return Suck else if location == A then return Right else if location == B then return Left Example of an AgentExample of an Agent Majidur RahmanMajidur Rahman
  • 9. Rational AgentRational Agent  For each possible percept sequence, a rational agent should select an action (using an agent function) that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in prior knowledge the agent has.  A percept sequence is the complete history of anything the agent has ever perceived.  A performance measure is a means of calculating how well the agent has performed based on the sequence of percepts that it has received.  An agent’s prior knowledge of the environment is the knowledge that the agent designer has given to the agent before its introduction to the environment. 9 Majidur RahmanMajidur Rahman
  • 10. Rational AgentRational Agent  An agent function maps percept sequences to actions. f : seq(P) →A  Agent function for vacuum Cleaner example: 10 Majidur RahmanMajidur Rahman
  • 11. Performance Measures Environment Actuators Sensors used by the agent to perform actions surroundings beyond the control of the agent used to evaluate how well an agent solves the task at hand provide information about the current state of the environment  To design a rational agent we must specify its task environment: Task EnvironmentsTask Environments 11 Majidur RahmanMajidur Rahman
  • 12. ExampleExample PEAS description of the environment for an automated taxi:  Performance Safety, destination, profits, legality, comfort  Environment Streets/motorways, other traffic, pedestrians, weather  Actuators Steering, accelerator, brake, horn, speaker/display  Sensors Video, sonar, speedometer, engine sensors, keyboard, GPS 12 Majidur RahmanMajidur Rahman
  • 14. Environment TypesEnvironment Types The environment for an agent may be  Fully or partially observable  Deterministic or stochastic  Episodic or sequential  Static or dynamic  Discrete or continuous  Single or multi-agent 14 Majidur RahmanMajidur Rahman
  • 15. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi Observable?? Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? Majidur RahmanMajidur Rahman 15
  • 16. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? Fully vs. partially observable: an environment is full observable when the sensors can detect all aspects that are relevant to the choice of action. Majidur RahmanMajidur Rahman 16
  • 17. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? Fully vs. partially observable: an environment is full observable when the sensors can detect all aspects that are relevant to the choice of action. Majidur RahmanMajidur Rahman 17
  • 18. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL DeterministicDeterministic?? Episodic?? Static?? Discrete?? Single-agent?? Deterministic vs. stochastic: if the next environment state is completely determined by the current state the executed action then the environment is deterministic. Majidur RahmanMajidur Rahman 18
  • 19. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL DeterministicDeterministic?? YES NO YES NO Episodic?? Static?? Discrete?? Single-agent?? Deterministic vs. stochastic: if the next environment state is completely determined by the current state the executed action then the environment is deterministic. Majidur RahmanMajidur Rahman 19
  • 20. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL DeterministicDeterministic?? YES NO YES NO EpisodicEpisodic?? Static?? Discrete?? Single-agent?? Episodic vs. sequential: In an episodic environment the agent’s experience can be divided into atomic steps where the agents perceives and then performs a single action. The choice of action depends only on the episode itself. Majidur RahmanMajidur Rahman 20
  • 21. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL Deterministic?? YES NO YES NO EpisodicEpisodic?? NO NO NO NO Static?? Discrete?? Single-agent?? Episodic vs. sequential: In an episodic environment the agent’s experience can be divided into atomic steps where the agents perceives and then performs a single action. The choice of action depends only on the episode itself Majidur RahmanMajidur Rahman 21
  • 22. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi Observable?? FULL FULL PARTIAL PARTIAL Deterministic?? YES NO YES NO Episodic?? NO NO NO NO Static?? Discrete?? Single-agent?? Static vs. dynamic: If the environment can change while the agent is choosing an action, the environment is dynamic. Semi-dynamic if the agent’s performance changes even when the environment remains the same. Majidur RahmanMajidur Rahman 22
  • 23. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL DeterministicDeterministic?? YES NO YES NO EpisodicEpisodic?? NO NO NO NO StaticStatic?? YES YES SEMI NO Discrete?? Single-agent?? Static vs. dynamic: If the environment can change while the agent is choosing an action, the environment is dynamic. Semi-dynamic if the agent’s performance changes even when the environment remains the same. Majidur RahmanMajidur Rahman 23
  • 24. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi Observable?? FULL FULL PARTIAL PARTIAL Deterministic?? YES NO YES NO Episodic?? NO NO NO NO Static?? YES YES SEMI NO Discrete?? Single-agent?? Discrete vs. continuous: This distinction can be applied to the state of the environment, the way time is handled and to the percepts/actions of the agent. Majidur RahmanMajidur Rahman 24
  • 25. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi Observable?? FULL FULL PARTIAL PARTIAL Deterministic?? YES NO YES NO Episodic?? NO NO NO NO Static?? YES YES SEMI NO Discrete?? YES YES YES NO Single-agent?? Discrete vs. continuous: This distinction can be applied to the state of the environment, the way time is handled and to the percepts/actions of the agent. Majidur RahmanMajidur Rahman 25
  • 26. Environment TypesEnvironment Types Solitaire Backgammom Internet shopping Taxi ObservableObservable?? FULL FULL PARTIAL PARTIAL DeterministicDeterministic???? YES NO YES NO EpisodicEpisodic???? NO NO NO NO StaticStatic???? YES YES SEMI NO DiscreteDiscrete???? YES YES YES NO Single-agentSingle-agent???? YES NO NO NO Single vs. multi-agent: Does the environment contain other agents who are also maximizing some performance measure that depends on the current agent’s actions? Majidur RahmanMajidur Rahman 26
  • 27. Environment TypesEnvironment Types The simplest environment is  Fully observable, deterministic, episodic, static, discrete and single-agent. Most real situations are:  Partially observable, stochastic, sequential, dynamic, continuous and multi-agent. Majidur RahmanMajidur Rahman 27
  • 28. Agent TypesAgent Types Four basic kinds of agent that embody the principles underlying almost all intelligent systems:  Simple reflex agents;  Model-based reflex agents;  Goal-based agents; and  Utility-based agents. 28 Majidur RahmanMajidur Rahman
  • 29. Agent Types: Simple ReflexAgent Types: Simple Reflex  Select action on the basis of only the current percept, e.g. the vacuum-agent  Large reduction in possible percept/action situations  Implemented through condition-action rules if dirty then suck  Example: – Agent: Mail sorting robot – Environment: Conveyor belt of letters – Rule: e.g. city = Edin → put Scotland bag 29 Majidur RahmanMajidur Rahman
  • 30. Agent Types: Model-BasedAgent Types: Model-Based  To tackle partially observable environments  Maintain internal state that depends on percept history  Over time update state using world knowledge  How does the world change independently  How do actions affect the world  Example: – Agent: Robot vacuum cleaner – Environment: Dirty room, furniture – Model: Map of room, which areas already cleaned 30 Majidur RahmanMajidur Rahman
  • 31. Agent Types: Goal-BasedAgent Types: Goal-Based  The agent needs a goal to know which situations are desirable  Difficulties arise when long sequences of actions are required to find the goal  Typically investigated in search and planning research  Major difference: future is taken into account  Example: – Agent: Robot maid – Environment: House and people – Goal: Clean clothes, tidy room, table laid, etc 31 Majidur RahmanMajidur Rahman
  • 32. 32 Agent Types: Utility-BasedAgent Types: Utility-Based  Certain goals can be reached in different ways  Some are better: have a higher utility  Utility function maps a (sequence of) state(s) onto a real number  Improves on goals:  Selecting between conflicting goals  Selecting between several goals based on likelihood of success and importance of goals  Example: – Agent: Mars Lander – Environment: The surface of mars with obstacles – Utility: Shortest and obstacle-free path Majidur RahmanMajidur Rahman
  • 34. ********** ********** If You Need Me ********** ********** Mail: tajim.cse@diu.edu.bd Website: https://www.tajimiitju.blogspot.com ORCID: https://orcid.org/0000-0002-2834-1507 LinkedIn: https://www.linkedin.com/in/tajimiitju ResearchGate: https://www.researchgate.net/profile/Tajim_Md_Niamat_Ullah_Akhund YouTube: https://www.youtube.com/tajimiitju?sub_confirmation=1 SlideShare: https://www.slideshare.net/TajimMdNiamatUllahAk Facebook: https://www.facebook.com/tajim.mohammad GitHub: https://github.com/tajimiitju Google+: https://plus.google.com/+tajimiitju Gmail: tajim.mohammad.3@gmail.com Twitter: https://twitter.com/Tajim53 Thank you Tajim Md. Niamat Ullah AkhundTajim Md. Niamat Ullah Akhund

Editor's Notes

  1. Volledige observeerbare omgevingen zijn handig omdat de agent geen Interne staat moet bijhouden over de omgeving Partieel observeerbare omgevingen door noise of niet goed afgestelde sensoren of ontbrekende informatie
  2. Als de omgeving gedeeltelijk observeerbaar is kan deze stochastisch lijken. Dus het is beter om te kijken of de omgebing determinitisch of stochastisch is vanuit het standpunt van de agent. Als de omgeving deterministisch is behalve de acties van de andere agenten dan is de omgeving strategisch.
  3. If the agent performance score changes ahen time passes, the environment is semi-dynamic.
  4. If the agent performance score changes ahen time passes, the environment is semi-dynamic.
  5. If the agent performance score changes ahen time passes, the environment is semi-dynamic.
  6. If the agent performance score changes ahen time passes, the environment is semi-dynamic.
  7. If the agent performance score changes ahen time passes, the environment is semi-dynamic.