Tags
artificial intelligence
reinforcement learning
machine learning
model-free methods
lstdq
lspi
tile coding
fourier-cosine basis
polynomial approximation
value function approximation
semi-gradient descent
stochastic gradient descent
tabular methods
simple black jack
monte carlo
sarsa
q learning
temporal difference learning
canonical maze
bellman equations
markov decision process
dynamic programming
See more
Presentations
(3)Personal Information
Organization / Workplace
Helsinki Finland
Tags
artificial intelligence
reinforcement learning
machine learning
model-free methods
lstdq
lspi
tile coding
fourier-cosine basis
polynomial approximation
value function approximation
semi-gradient descent
stochastic gradient descent
tabular methods
simple black jack
monte carlo
sarsa
q learning
temporal difference learning
canonical maze
bellman equations
markov decision process
dynamic programming
See more