Juantomás García gave a talk on machine learning pipelines for developing AI that can learn to play and solve the 1980s video game "The Abbey of the Crime". He discussed gathering game data, exploring different reinforcement learning strategies, and developing a simple neural network model with policy and value networks to determine moves and rewards. He described his current pipeline that moves raw game data through processing steps using technologies like Kubernetes, PubSub, training jobs, and model storage. The talk encouraged attendees to collaborate on the open source project on GitHub and join the AbadIA Slack channel.
3. Who I am
Juantomás García ( 0-)
•Chief Envisioning Officer @ Sngular
•GDEx2 (Google Developer Expert) for cloud and Machine Learning
•#AbadIA Cheer Leader
Others
•Co-Author of the first Spanish free software book “La Pastilla Roja”
•Former President of Hispalinux (Spanish Linux User Group)
•Organizer of the Machine Learning Spain and GDG Cloud Madrid.
7. First 8-bit RPG in pseudo 3D (2.5D)
It was at 1987 and this game is a kind
of legend in the video games world.
Based in Umberto Eco book “In the
name of the rose”
Do you know the game?
THE GAME
11. We recollect a lot of information
- Game Info (timestamps, rewards, bonus, obsequium)
- Games moves (state, action, reward, new_state)
- Checkpoints (to restore the game at an interesting time)
- ML Models (for recovering good models o just make a benchmark)
GATHERING INFORMATION
12. A game
server with
REST API
An openAI
Gym
Enought
hardware
resources
So what’s the next step
SO WE HAVE
Tons of
games
data
14. A RL agent is a program that interacts with an environment, in our
case a OpenAI gym for AbadIA, and learn from observations and
rewards.
CREATE A RL AGENT
65. Takeaways
• Don’t Over Engineering
• It’s all about data
• Marios Cap
• Serverless
• RF is simple, powerful and
Easy
• Lots of tools to use.
66. Questions?
This talk have a free questions lifetime warranty: If you have any questions or concerns
about this talk, feel free to contact me anytime.
Selfie Time: If you like the talk just smile while I take
the selfie ;-)
We’re Hiring, Sngular People
twitter: @juantomas
juantomas.garcia@sngular.com