15. Which image is of a real person?
Credit: Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative adversarial networks
31. Google DeepMind's Deep Q-learning playing Atari Breakout. Video: Two Minute Papers
A visualisation of the AlphaStar agent during game two of the match against MaNa. Image credit: deepmind.com
AlphaStar
35. 39
Image: analyticsinsight.net
“…an unsupervised method can
recommend materials for functional
applications several years before their
discovery.”
For example, using only abstracts
published before 2009, the algorithm was
able to spot the best thermoelectric
materials today which were only
recommended in the literature several
years later, e.g., CuGaTe2 in 2012, ReS2 in
2016 and CdIn2Te4 in 2017.
Tshitoyan, V., Dagdelen, J., Weston, L., Dunn, A., Rong, Z.,
Kononova, O., ... & Jain, A. (2019). Unsupervised word
embeddings capture latent knowledge from materials
science literature. Nature, 571(7763), 95.
41. Course’s content
1. Python for Data Science (optional)
2. Ethics in Data Analytics
3. Artificial Intelligence: Fundamental concepts and approaches
4. Introduction to Reinforcement Learning
45
42. References
Russell, S. J., & Norvig, P. (2020). Artificial Intelligence: A Modern Approach.
4th edition. Pearson.
Microsoft Professional Program in Artificial Intelligence. (2019, June 20).
Retrieved from https://www.edx.org/microsoft-professional-program-
artificial-intelligence
Lapan, M. (2020). Deep Reinforcement Learning Hands-On. Packt Publishing
Ltd.
Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction.
MIT press.
Swaroop, C. H. (2013). A Byte of Python. 46
43. Course forum and materials
Q&A, give feedback:
https://quang-tran-ute.mn.co/topics
Email:
quangtn@hcmute.edu.vn
Course materials:
https://drive.google.com/drive/folders/1ASYldrTiErJodpeTijjNxP1FDCuISPiD
47
45. Bonus points
Answer questions, exercises during class meetings:
+ up to 0.5 point/one answer
Help other students by answering their questions on the forum:
+ up to 0.5 point/one answer
At the end of class sessions, random students submit their work on the
current week’s exercises:
+ up to 1 point: if all exercises are done correctly
- up to 0.5 point: if no exercise is done
- up to 1 point: if student is absent
49
Editor's Notes
Video source: https://www.youtube.com/watch?v=neB1S0UPJFw
2017
Video source: https://www.youtube.com/watch?v=V-blj3PQ7jI
Both images are AI-generated photos by NVIDIA’s new work with generative adversarial networks (GANs):
Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative adversarial networks.
Source: https://www.youtube.com/watch?v=AmUC4m6w1wo
+ See more: Fake videos of real people -- and how to spot them | Supasorn Suwajanakorn
https://www.youtube.com/watch?v=o2DDU4g0PRo
+ Supasorn page: https://www.supasorn.com/
1: Rey's Theme, by John Williams
2: Among The Stars, by AIVA
1: First Step, by Hans Zimmer
2: Op. 36 for Symphonic Orchestra in F# minor, by AIVA
3: Genesis, by AIVA
Ideas for this part is based on
Man Vs. Machine: The 6 Greatest AI Challenges To Showcase The Power Of Artificial Intelligence. (n.d.). Retrieved from https://www.linkedin.com/pulse/man-vs-machine-6-greatest-ai-challenges-showcase-power-bernard-marr
“World Champion chess player Garry Kasparov competed against artificial intelligence twice. In the first chess match-up between machine (IBM Deep Blue) and man (Kasparov) in 1996 Kasparov won. The next year, Deep Blue was victorious.”
Source: https://www.linkedin.com/pulse/man-vs-machine-6-greatest-ai-challenges-showcase-power-bernard-marr
“In 2011, IBM Watson took on Ken Jennings and Brad Rutter, two of the most successful contestants of the game show Jeopardy who had collectively won $5 million during their reigns as Jeopardy champions. Watson won! To prepare for the competition, Watson played 100 games against past winners.”
Source: https://www.linkedin.com/pulse/man-vs-machine-6-greatest-ai-challenges-showcase-power-bernard-marr
Video: https://www.youtube.com/watch?v=P18EdAKuC1U
“The breakthroughs in computer vision and speech recognition allowed the innovators at DeepMind Technologies to develop a convolutional neural network for reinforcement learning to enable a machine to master several Atari games using only raw pixels as input and in a few games have better results than humans.”
Source: https://www.linkedin.com/pulse/man-vs-machine-6-greatest-ai-challenges-showcase-power-bernard-marr
Video: https://www.youtube.com/watch?v=V1eYniJ0Rnk
“AlphaGo Zero, the latest evolution of AlphaGo, the first computer program to defeat a world champion at the ancient Chinese game of Go (cờ vây). Zero is even more powerful and is arguably the strongest Go player in history.
Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting from completely random play. In doing so, it quickly surpassed human level of play and defeated the previously published champion-defeating version of AlphaGo by 100 games to 0.
It is able to do this by using a novel form of reinforcement learning, in which AlphaGo Zero becomes its own teacher. The system starts off with a neural network that knows nothing about the game of Go. It then plays games against itself, by combining this neural network with a powerful search algorithm. As it plays, the neural network is tuned and updated to predict moves, as well as the eventual winner of the games.”
Source: https://deepmind.com/blog/alphago-zero-learning-scratch/
Video: https://www.youtube.com/watch?v=V1eYniJ0Rnk
“In a series of test matches held on 19 December, AlphaStar decisively beat Team Liquid’s Grzegorz "MaNa" Komincz, one of the world’s strongest professional StarCraft players, 5-0, following a successful benchmark match against his team-mate Dario “TLO” Wünsch.”
Source: https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii/
“AlphaStar was ranked above 99.8% of active players on Battle.net, and achieved a Grandmaster level for all three StarCraft II races: Protoss, Terran, and Zerg. We expect these methods could be applied to many other domains.”
https://deepmind.com/blog/article/AlphaStar-Grandmaster-level-in-StarCraft-II-using-multi-agent-reinforcement-learning
“Project Debater’s opponent is 2016 World Debating Championships Grand Finalist and 2012 European Debate Champion Harish Natarajan. Harish holds the world record for most debate competition victories. Each side has only 15 minutes to prep for the debate, during which they prepare arguments for and against a given thesis statement. Both sides will then present a four-minute opening statement, a four-minute rebuttal, and a two-minute summary.”
Source: https://www.ibm.com/blogs/research/2019/02/ai-debate-think-2019/
Topic: We should subsidize preschools. Harish won (by voting of audience).
Watch the debate: https://www.youtube.com/watch?v=m3u-1yttrVw
Controlling Assistive Robots with Learned Latent Actions
[Control grasp action in 1 degree of freedom]
Standford Univ. 2019
Further readings:
+ Skandhas (or Five aggregates): They are the five factors that constitute and explain a sentient being’s person and personality, https://en.wikipedia.org/wiki/Skandha
+ Dhammapada: All that we are is the result of what we have thought: it is founded on our thoughts, it is made up of our thoughts.