4. Types of environments
Fully observable: the agent knows its current state completely
Partially observable: the agent knows only part of its current state
Unobservable: the agent knows nothing about its current state.
7. Other types:
Deterministic vs. stochastic
Discrete vs. continuous
Static vs. dynamic
…
Learn more from Section 2.3 of (Russell, 2016)
13. Simple reflex agents
Source: (Russell, 2016)
Agents that select actions based ONLY on the current percept, ignoring the rest of the percept history.
For example, the vacuum agent whose decision depends only on its current location and on whether that location contains dirt (Russell, 2016).
14. Simple reflex agents do NOT work well in some cases, e.g., in partially observable environments.
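Below is a minimal Python sketch of a simple reflex agent for the two-location vacuum world of (Russell, 2016); the percept format and rule set here are illustrative assumptions.

def simple_reflex_vacuum_agent(percept):
    # Condition-action rules applied to the CURRENT percept only.
    location, status = percept          # assumed percept: ("A", "Dirty")
    if status == "Dirty":
        return "Suck"
    if location == "A":
        return "Right"
    return "Left"

print(simple_reflex_vacuum_agent(("A", "Dirty")))   # -> Suck

Because the agent keeps no memory, it fails whenever the current percept alone does not reveal the state, which is exactly the partially observable case noted above.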
15. Model-based reflex agents
Agents that try to "guess" the unobserved part of the environment by maintaining an internal model of the environment.
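A minimal sketch of this idea, loosely following the model-based reflex agent pseudocode in (Russell, 2016); update_state and the rule set are placeholder assumptions.

class ModelBasedReflexAgent:
    def __init__(self, update_state, rules):
        self.state = {}                    # internal "best guess" of the world
        self.last_action = None
        self.update_state = update_state   # folds percept + last action into the guess
        self.rules = rules                 # list of (condition, action) pairs

    def __call__(self, percept):
        # Revise the guessed state using the model, then fire the first matching rule.
        self.state = self.update_state(self.state, self.last_action, percept)
        for condition, action in self.rules:
            if condition(self.state):
                self.last_action = action
                return action
        return None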
16. Goal-based agents
The agents above are reflex agents: they act only according to the state of the environment.
Goal-based agents select actions based on both the state and their goal.
For example, when seeing the brake lights of the car in front of it:
+ A reflex agent brakes immediately, following a built-in rule.
+ A goal-based agent, with the goal "not hitting other cars", reasons that braking will help it achieve that goal.
The goal-based agent is more flexible: its goal and knowledge can be updated without rewriting every rule.
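The contrast can be sketched in code (all names and state fields here are illustrative assumptions):

def reflex_driver(percept):
    # The reaction is fixed at design time.
    return "Brake" if percept["brake_lights_ahead"] else "Drive"

def goal_based_driver(state, actions, result, goal_satisfied):
    # Looks ahead: picks an action whose PREDICTED result satisfies
    # the goal, e.g., "not hitting other cars".
    for action in actions(state):
        if goal_satisfied(result(state, action)):
            return action

Changing the goal only means passing a different goal_satisfied function; the reflex driver would need its rule rewritten.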
17. Utility-based agents
Goals provide only a binary distinction between "DONE" and "NOT DONE" states.
Utilities measure HOW WELL the goal is achieved.
For example, many routes can get a taxi to its destination, but some are quicker, safer, or cheaper than others (Russell, 2016).
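A one-line sketch of the decision rule (the helper functions are assumed):

def utility_based_agent(state, actions, result, utility):
    # Choose the action whose predicted outcome scores highest,
    # not merely one that reaches a goal state.
    return max(actions(state), key=lambda a: utility(result(state, a)))

With, for instance, utility(s) = -travel_time(s), the quickest of the goal-reaching routes wins.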
19. Example: A taxi agent as a learning agent
+ It drives using the performance element, which selects its driving actions.
+ The critic observes the world and gives feedback to the learning element. For example, after the taxi makes a quick left turn across three lanes, the critic observes the angry reactions of other drivers.
+ Seeing this feedback, the learning element creates a rule marking the turn as a bad action and gives it to the performance element to use.
+ The problem generator might identify some behaviors in need of improvement and suggest experiments, such as trying out the brakes on different road surfaces under different conditions.
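A structural sketch of how these four components could fit together (all behavior here is a placeholder assumption):

class LearningAgent:
    def __init__(self, performance_element, critic, learning_element,
                 problem_generator):
        self.performance_element = performance_element  # selects driving actions
        self.critic = critic                            # judges behavior against a standard
        self.learning_element = learning_element        # turns feedback into new rules
        self.problem_generator = problem_generator      # proposes informative experiments

    def step(self, percept):
        feedback = self.critic(percept)
        for rule in self.learning_element(feedback):
            self.performance_element.add_rule(rule)     # improve future behavior
        experiment = self.problem_generator(percept)
        # Run a suggested experiment if any; otherwise act normally.
        return experiment or self.performance_element.act(percept)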
20. SOLVING PROBLEMS BY SEARCHING
Main reference:
Chapter 3 of Russell, S., & Norvig, P. (2016). Artificial Intelligence: A Modern Approach.
21. Focus on problem-solving agents
A kind of goal-based agent
Uses an atomic representation for the environment state
Assumptions of the environment:
Observable: the agent always knows the current state.
Discrete: at any given state, there are only finitely many actions to choose from.
Deterministic: each action has exactly one outcome.
25. Components of a problem
1. Initial state: the state that the agent starts in.
For example, In(Arad) in the Romania route-finding problem of (Russell, 2016).
2. Possible actions: actions that are applicable to the agent at a given state.
Notation: ACTIONS(state s).
For example, ACTIONS(In(Arad)) = {Go(Sibiu), Go(Timisoara), Go(Zerind)}.
3. Transition model: result of an action in a state.
Notation: RESULT(state s, action a).
For example, RESULT(In(Arad), Go(Zerind)) = In(Zerind).
Note: a successor is any state reachable from a given state by applying a single action.
26. 4. Goal test: determines whether a state is a goal state.
For example, in the Romania problem, the goal test checks whether the current state is In(Bucharest).
5. Cost: a performance measure (lower is better).
Step cost: the cost c(s, a, s') of taking action a in state s to reach state s'.
Path cost: the sum of the step costs along the path.
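The five components map naturally onto a small Python skeleton (a sketch, with unit step cost assumed as the default):

class Problem:
    def __init__(self, initial_state, goal_states):
        self.initial_state = initial_state     # component 1
        self.goal_states = goal_states

    def actions(self, s):                      # component 2: ACTIONS(s)
        raise NotImplementedError

    def result(self, s, a):                    # component 3: RESULT(s, a)
        raise NotImplementedError

    def goal_test(self, s):                    # component 4
        return s in self.goal_states

    def step_cost(self, s, a, s2):             # component 5, per step
        return 1                               # assumed default: unit cost

    def path_cost(self, steps):                # steps = [(s, a, s'), ...]
        return sum(self.step_cost(s, a, s2) for (s, a, s2) in steps)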
31. Exercise: Route-finding problem
Formulate the problem of finding routes to go from one city/province to another; a sketch of one possible formulation follows the component list below.
Cities/provinces are shown on the map given in the next slide.
FYI: Route-finding algorithms are used in package routing on the Internet, airline travel-planning systems and so on.
1. Initial state:
2. Possible actions:
3. Transition model:
4. Goal test:
5. Cost:
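For reference, one possible formulation, reusing the Problem skeleton sketched earlier on a tiny made-up map (the city names and distances are hypothetical, not those on the slide):

class RouteFinding(Problem):
    ROADS = {                                  # adjacency: city -> {neighbor: km}
        "CityA": {"CityB": 75, "CityC": 118},
        "CityB": {"CityA": 75, "CityC": 71},
        "CityC": {"CityA": 118, "CityB": 71},
    }

    def actions(self, s):                      # drive to any adjacent city
        return [("Go", c) for c in self.ROADS[s]]

    def result(self, s, a):
        return a[1]                            # ("Go", "CityB") -> "CityB"

    def step_cost(self, s, a, s2):
        return self.ROADS[s][s2]               # road distance as the step cost

problem = RouteFinding("CityA", {"CityC"})
print(problem.goal_test(problem.result("CityA", ("Go", "CityC"))))  # -> True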
33. Other real-world problems
Touring problems
The traveling salesperson problem (TSP)
A VLSI layout problem
Robot navigation
Automatic assembly sequencing
See Section 3.2.2 in (Russell, 2016) for more details.