The best results so far for MineSweeper on small board sizes can be found at http://hal.inria.fr/hal-00712417.
Two BibTeX references below:
@article{10.1109/TAAI.2011.55,
  author = {Adrien Cou{\"e}toux and Mario Milone and Olivier Teytaud},
  title = {Consistent Belief State Estimation, with Application to Mines},
  journal = {Technologies and Applications of Artificial Intelligence, International Conference on},
  isbn = {978-0-7695-4601-8},
  year = {2011},
  pages = {280-285},
  doi = {10.1109/TAAI.2011.55},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
And the UCT performance on MineSweeper on small boards:
@inproceedings{sebag:hal-00712417,
hal_id = {hal-00712417},
url = {http://hal.inria.fr/hal-00712417},
title = {{Combining Myopic Optimization and Tree Search: Application to MineSweeper}},
author = {Sebag, Mich{\`e}le and Teytaud, Olivier},
abstract = {{Many reactive planning tasks are tackled by optimization combined with a shrinking horizon at each time step: the problem is simplified to a non-reactive (myopic) optimization problem, based on the available information at the current time step and an estimate of future behavior, and then solved; the simplified problem is updated at each time step thanks to new information. This is in particular suitable when fast off-the-shelf components are available for the simplified problem - optimality stricto sensu is not possible, but good results are obtained at a reasonable computational cost for highly intractable problems. As machines get more powerful, however, it makes sense to go beyond the inherent limitations of this approach. Yet, a brute-force solving of the complete problem is often impossible; we here propose a methodology for embedding a solver inside a consistent reactive planning solver. Our methodology consists in embedding the solver in an Upper-Confidence-Tree algorithm, both in the nodes and as a Monte-Carlo simulator. We show the mathematical consistency of the approach, and then we apply it to a classical success of the myopic approach: the MineSweeper game.}},
language = {English},
affiliation = {Laboratoire de Recherche en Informatique - LRI , TAO - INRIA Saclay - Ile de France},
booktitle = {{LION6, Learning and Intelligent Optimization}},
pages = {in press (14 pages, long paper)},
address = {Paris, France},
audience = {international},
year = {2012},
pdf = {http://hal.inria.fr/hal-00712417/PDF/mines2.pdf},
}
2. MINESWEEPER
A. Couëtoux, O. Teytaud
TAO, Inria, Lri, U-Psud, Umr Cnrs 8623 + OASE, NUTN
Sometimes we work on (visibly) serious stuff.
3. MINESWEEPER
But I think the best challenge for proving that we have good algorithms is games.
4. And a great challenge is MineSweeper!
Yes, I'm serious!
21. Probability of a mine?
- Top: 33%
- Middle: 33%
- Bottom: 33%
==> so all moves equivalent?
==> NOOOOO!!!
25. MineSweeper approaches
- exact MDP: very expensive; 4x4 solved.
- CSP: the main approach.
- (unknown) state: x(i) = 1 if there is a mine at location i
- each visible location is a constraint:
  if location 15 shows 4, then
  x(04)+x(05)+x(06)+x(14)+x(16)+x(24)+x(25)+x(26) = 4.
- find all solutions X1, X2, X3, ..., XN
- P(mine in j) = sum_i X_i(j) / N
- play j such that P(mine in j) is minimal
- break ties at random.
MDP = Markov Decision Process
CSP = Constraint Satisfaction Problem
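The CSP procedure on this slide (enumerate all consistent mine placements, then play the cell with the lowest mine probability) can be sketched in a few lines. The 3x3 board, the single revealed cell, and the mine count below are toy assumptions for illustration, not the setup from the paper:

```python
# Minimal sketch of the CSP probability computation, on an assumed 3x3 toy board.
from itertools import combinations

# Cells are indexed 0..8; suppose the corner cell 0 is revealed and shows "1",
# and we know the board holds exactly 2 mines (illustrative assumption).
REVEALED = {0: 1}   # revealed cell -> displayed neighbour-mine count
N_MINES = 2

def neighbours(i):
    r, c = divmod(i, 3)
    return [3 * rr + cc
            for rr in range(max(0, r - 1), min(3, r + 2))
            for cc in range(max(0, c - 1), min(3, c + 2))
            if (rr, cc) != (r, c)]

hidden = [i for i in range(9) if i not in REVEALED]

# Find all solutions X1..XN: mine placements satisfying every visible count.
solutions = []
for mines in combinations(hidden, N_MINES):
    ms = set(mines)
    if all(sum(n in ms for n in neighbours(i)) == k for i, k in REVEALED.items()):
        solutions.append(ms)

# P(mine in j) = sum_i X_i(j) / N, then play the argmin.
# (Ties here fall to dict order; the slide breaks them at random.)
probs = {j: sum(j in s for s in solutions) / len(solutions) for j in hidden}
best = min(probs, key=probs.get)
```

With these assumptions the neighbours of the revealed corner carry probability 1/3 while the other hidden cells carry 1/5, so `best` is one of the non-adjacent cells.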
26. CSP
- is very fast
- but it is not optimal,
- because of positions like the one shown here, where CSP plays randomly!
Also for the initial move: don't play the first move randomly! (sometimes an opening book is used)
27. Why not UCT?
- looks like a stupid idea at first sight
- cannot compete with CSP in terms of speed
- but at least UCT is consistent: given sufficient time, it plays optimally.
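As a reminder of what UCT involves, here is a minimal, generic sketch of the UCB1 selection rule at its core. The class and names are illustrative assumptions, not the implementation from the paper:

```python
# Generic sketch of UCB1 action selection, the rule UCT applies at each tree node.
import math

class Node:
    def __init__(self, actions):
        self.visits = 0
        # action -> [visit count, total reward]
        self.stats = {a: [0, 0.0] for a in actions}

    def select(self, c=math.sqrt(2)):
        # UCB1: mean reward plus an exploration bonus that shrinks as an
        # action is tried more often; unvisited actions are tried first.
        def ucb(a):
            n, w = self.stats[a]
            if n == 0:
                return float("inf")
            return w / n + c * math.sqrt(math.log(self.visits) / n)
        return max(self.stats, key=ucb)

    def update(self, action, reward):
        # Backpropagation step: record the simulated reward for the action.
        self.visits += 1
        self.stats[action][0] += 1
        self.stats[action][1] += reward
```

Repeatedly calling `select`, simulating a reward, and calling `update` makes the node's choice concentrate on the best action, which is the consistency property the slide refers to.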
40. What do I need for implementing UCT?
A complete generative model: given a state and an action, I must be able to simulate the possible transitions.
State S, Action a: (S,a) ==> S'
Example: given the state below, and the action “top left”, what are the possible next states?
45. Can you please forgive me for that?
I've been lazy: I have just implemented the rejection algorithm.
46. Rejection algorithm:
1- randomly draw the mines
2- if it's ok, return the new observation
3- otherwise, go back to 1.
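The three steps above can be sketched as follows; the 3x3 toy board, the single revealed cell, and the helper names are assumptions for illustration:

```python
# Sketch of the rejection algorithm: keep drawing random mine placements
# until one matches every revealed count, then return it as the sampled state.
import random

SIZE, N_MINES = 3, 2          # illustrative toy board
REVEALED = {0: 1}             # cell 0 is revealed and shows 1 neighbouring mine

def neighbours(i):
    r, c = divmod(i, SIZE)
    return [SIZE * rr + cc
            for rr in range(max(0, r - 1), min(SIZE, r + 2))
            for cc in range(max(0, c - 1), min(SIZE, c + 2))
            if (rr, cc) != (r, c)]

def sample_state():
    hidden = [i for i in range(SIZE * SIZE) if i not in REVEALED]
    while True:
        # 1- randomly draw the mines
        mines = set(random.sample(hidden, N_MINES))
        # 2- if it's ok (all revealed counts match), return the drawn state
        if all(sum(n in mines for n in neighbours(i)) == k
               for i, k in REVEALED.items()):
            return mines
        # 3- otherwise, go back to 1

state = sample_state()
```

Each accepted draw is a hidden state consistent with the observations, which is exactly the generative model UCT needs for simulating transitions; the cost is that many draws are rejected when the constraints are tight.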
47. (Being lazy is good: I could write a second paper with a better algorithm :-) )
(using CSP for this!)
48. An example showing that the initial move matters (and our algorithm finds it!).
3x3, 7 mines: the optimal move is anything but the center.
Optimal winning rate: 25%.
Optimal winning rate with a uniformly random initial move: 17/72.
(yes, we get a 1/72 improvement!)
49. 15 mines on a 5x5 board with the GnoMine rule (i.e. the initial move is a 0):
Optimal success rate = 100%!!!!!
Play the center, and you win (well, you have to work...).
50. UCT vs CSP + opening book (play corners)
in the Windows mode
51. Probability of a mine?
- Top: 33%
- Middle: 33%
- Bottom: 33%
Top or bottom: 66% chance of winning!
Middle: 33%!
52. CONCLUSIONS
- MineSweeper is not dead! ==> still a challenge
- When you have a myopic solver (i.e. one which neglects long-term effects, as is often the case in industry!) ==> combine it with UCT
- More to come: results on big boards are still far from optimal
53. Thanks for your attention!
9 mines. What is the optimal move?