SlideShare a Scribd company logo
Artificial Neural Networks
recent breakthroughs and
applications
Armando Vieira
Summary
● What went wrong with traditional AI?
● Probabilistic machines - a new paradigm
● Why Deep Learning is different and why it matters
● Applications
○ NLP
○ Image
○ Drug discovery
○ Weather forecasting
● Large generative models
○ GTP3
○ LAMP
○ GLAMP
● Is this intelligence?
Rule based programs
Solve numerical equations
Accurate
Fast
Pure logic
Why “old” AI failed?
Can’t deal with unstructured data
Get stuck with exceptions to rules
Not scalable
All actions have to be programmed
Rigid, can’t learn
The vodka was good but the
meat was rotten
Blind and insane
English - Russian - English
“The spirit was willing but the flesh
was weak“
“Out of sight out of mind”
Summary
Symbolic thinking may be the
supreme form of human intelligence
but there are other ways to build
“smart” machines
The associative paradigm
Learning by associations
No true or false but probabilities
No symbols but distributions
Pattern matching
Trained by examples not hard coded
No CPU or memory, just signals flowing through a mesh of connections
The Deep Learning Revolution
Better than humans?
What made the difference?
Bigger is better
Train
Deploy
JOHN
“JOHN”
MARY
“MARY”
MARY
“JULIE”
MARY
“MARY”
PAUL
“PAUL”
MARY
“MARY”
How DL works?
How to make the right choice?
POSITIVE MITOSES
FALSE POSITIVE MITOSES TRUE POSITIVE MITOSES
Janowczyk A, Madabhushi A
Accuracy in medical imagery
Applications
Alpha Fold
stateof.ai 2021
Deep learning models can learn drug-protein binding relationships from a small number of empirical experiments
in order to help prioritise which areas of vast chemical spaces to virtually screen.
Accelerating high-throughput virtual drug screening with model-guided search
● Structure-based drug discovery searches for drugs that bind a protein
of interest whose 3D structure is available. This process, referred to
as “docking”, can be run virtually using simulations. However, with
databases of small molecule chemicals exploding past billions of
records, virtually screening all combinations becomes
computationally and commercially intractable.
● A solution is to train a model on a sample of drug-protein
interactions with empirically determined docking scores.
● This model can be used to virtually score a library of interest,
followed by docking the top scoring drug candidates. These results
are used to update the model with active learning. With several
iterations, model-guided search ultimately generates hits faster.
#stateofai | 26
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
DreamerV2 is the first model-based RL agent trained on a single GPU to surpass human level performance on 55
popular tasks of the Atari benchmark.The agent learns behaviors purely within the latent space of a world model
trained from pixels, which makes these behaviors more generalisable to solving future tasks more efficiently.
Superhuman world models for Atari, but on a budget
● DreamerV2 vastly outperforms other RL agents trained with the same computational budget, across all
performance aggregation metrics.
#stateofai | 29
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
RL agents have shown impressive performance on challenging individual tasks. But can they generalize to tasks
they never trained on? DeepMind trained RL agents on 3.4M tasks across a diverse set of 700k games in a 3D
simulated environment, and show they can generalize to radically different games without additional training.
Zero-shot generalisation in reinforcement learning
● The researchers created XLand, a vast controllable environment, which
allows them to dynamically adapt both how the agents train and,
crucially, the games on which they train.
● The distribution of games is learned using a hyperparameter optimization
technique called Population Based Training. It allows them to find the
games which have the right level of difficulty given the agents’ behaviour.
This ensures the agents build evermore general capabilities.
● As training progresses, the agents exhibit heuristic behaviours such as
experimenting, changing the state of the world, and cooperation, which
are uncharacteristic of usual RL agents. These learned behaviours allow
them to generalize to hand-designed held-out tasks, a first in RL research.
Figure: Examples of XLand environments.
Figure: Test metrics progress during training.
#stateofai | 30
Introduction | Research | Talent | Industry | Politics | Predictions
Big models
Size matters
Some BIG generative models
GTP3
Language model
Next word prediction
DALLE 2
Image and text
Diffusion model
GLAMP / PALM
NLP and text
understanding
OpenAI OpenAI Google
PALM
Humans vs machines
Not everyone agrees. “Artificial intelligence
programs lack consciousness and
self-awareness,” researcher Gwern Branwen
wrote in his article about GPT-3. “They will
never be able to have a sense of humor. They
will never be able to appreciate art, or beauty,
or love. They will never feel lonely. They will
never have empathy for other people, for
animals, for the environment. They will never
enjoy music or fall in love, or cry at the drop
of a hat.”
Chain of thought
Explaining jokes
Continuous learning
The rise of diffusion models
CLIP: Learning self-supervised representations of text and images
DALLE-2
A dinosaur dressing a
suit is looking at the
mirror
“Honeybees wearing
welding helmets while
welding a futuristic giant
steel honeycomb, digital
art.”
Medieval biblical
scroll about
Darwinian Evolution
Image processing
What has been solved
Identification of objects
Automatic subtitles generation
Image segmentation, Depth
NLP
Writing coherent text
Explainability
Generative models
High quality synthetic data
Conditional text to image models
Reinforcement Learning
“Almost zero shot” learning
Learning by observing: replication
Object manipulation
Video
Object tracking and Identification
Pose estimation
Science
Drug discovery
Weather forecasting
Physics Informed Networks
IMAGE
What’s still a challenge
Zero shot learning
Video
Self driving cars
Generative models
Spatio-Temporal data
Reinforcement learning
Self exploration without goals
NLP
Keep coherence on long texts
Understanding meaning
Science
Discover new laws
Deductive thinking
Future
Beyond the present paradigma
BEYOND GRADIENT DESCENDENT
● Gradient based algorithms are continuous but nature is discrete
● Learning can gradual but also through sharp transitions - paradigms
● Recursivity hard to model with GD
FROM BLACK-BLOXES TO CONJECTURE MACHINES
● ANN are induction machines, but knownledge can also be deductive
● At the moment we are brute-forcing learning with big models and data
● Hard tp generalize with a single example

More Related Content

Similar to AI - history and recent breakthroughs

The Incredible Disappearing Data Scientist
The Incredible Disappearing Data ScientistThe Incredible Disappearing Data Scientist
The Incredible Disappearing Data Scientist
Rebecca Bilbro
 
Sippin: A Mobile Application Case Study presented at Techfest Louisville
Sippin: A Mobile Application Case Study presented at Techfest LouisvilleSippin: A Mobile Application Case Study presented at Techfest Louisville
Sippin: A Mobile Application Case Study presented at Techfest Louisville
Dawn Yankeelov
 
The Need for Deep Learning Transparency
The Need for Deep Learning TransparencyThe Need for Deep Learning Transparency
The Need for Deep Learning Transparency
inside-BigData.com
 
Ai lecture1 final
Ai lecture1 finalAi lecture1 final
Ai lecture1 final
Shivam Agrawal
 
Inverse Modeling for Cognitive Science "in the Wild"
Inverse Modeling for Cognitive Science "in the Wild"Inverse Modeling for Cognitive Science "in the Wild"
Inverse Modeling for Cognitive Science "in the Wild"
Aalto University
 
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech | Entendiendo tus propios modelos predictivosKeepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech
 
How to choose the right AI model for your application?
How to choose the right AI model for your application?How to choose the right AI model for your application?
How to choose the right AI model for your application?
Benjaminlapid1
 
Basics of Soft Computing
Basics of Soft  Computing Basics of Soft  Computing
Basics of Soft Computing
Sangeetha Rajesh
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018
HJ van Veen
 
AI - Artificial Intelligence - Implications for Libraries
AI - Artificial Intelligence - Implications for LibrariesAI - Artificial Intelligence - Implications for Libraries
AI - Artificial Intelligence - Implications for Libraries
Brian Pichman
 
Artificial Intelligence power point presentation document
Artificial Intelligence power point presentation documentArtificial Intelligence power point presentation document
Artificial Intelligence power point presentation document
David Raj Kanthi
 
Introduction to ml
Introduction to mlIntroduction to ml
Introduction to ml
Girija Muscut
 
[PR12] understanding deep learning requires rethinking generalization
[PR12] understanding deep learning requires rethinking generalization[PR12] understanding deep learning requires rethinking generalization
[PR12] understanding deep learning requires rethinking generalization
JaeJun Yoo
 
Machine learning
 Machine learning Machine learning
Machine learning
Siddharth Kar
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
Xavier Amatriain
 
(Ch#1) artificial intelligence
(Ch#1) artificial intelligence(Ch#1) artificial intelligence
(Ch#1) artificial intelligence
Noor Ul Hudda Memon
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
Luca Bianchi
 
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
Numenta
 
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Sri Ambati
 
The current state of prediction in neuroimaging
The current state of prediction in neuroimagingThe current state of prediction in neuroimaging
The current state of prediction in neuroimaging
SaigeRutherford
 

Similar to AI - history and recent breakthroughs (20)

The Incredible Disappearing Data Scientist
The Incredible Disappearing Data ScientistThe Incredible Disappearing Data Scientist
The Incredible Disappearing Data Scientist
 
Sippin: A Mobile Application Case Study presented at Techfest Louisville
Sippin: A Mobile Application Case Study presented at Techfest LouisvilleSippin: A Mobile Application Case Study presented at Techfest Louisville
Sippin: A Mobile Application Case Study presented at Techfest Louisville
 
The Need for Deep Learning Transparency
The Need for Deep Learning TransparencyThe Need for Deep Learning Transparency
The Need for Deep Learning Transparency
 
Ai lecture1 final
Ai lecture1 finalAi lecture1 final
Ai lecture1 final
 
Inverse Modeling for Cognitive Science "in the Wild"
Inverse Modeling for Cognitive Science "in the Wild"Inverse Modeling for Cognitive Science "in the Wild"
Inverse Modeling for Cognitive Science "in the Wild"
 
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech | Entendiendo tus propios modelos predictivosKeepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech | Entendiendo tus propios modelos predictivos
 
How to choose the right AI model for your application?
How to choose the right AI model for your application?How to choose the right AI model for your application?
How to choose the right AI model for your application?
 
Basics of Soft Computing
Basics of Soft  Computing Basics of Soft  Computing
Basics of Soft Computing
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018
 
AI - Artificial Intelligence - Implications for Libraries
AI - Artificial Intelligence - Implications for LibrariesAI - Artificial Intelligence - Implications for Libraries
AI - Artificial Intelligence - Implications for Libraries
 
Artificial Intelligence power point presentation document
Artificial Intelligence power point presentation documentArtificial Intelligence power point presentation document
Artificial Intelligence power point presentation document
 
Introduction to ml
Introduction to mlIntroduction to ml
Introduction to ml
 
[PR12] understanding deep learning requires rethinking generalization
[PR12] understanding deep learning requires rethinking generalization[PR12] understanding deep learning requires rethinking generalization
[PR12] understanding deep learning requires rethinking generalization
 
Machine learning
 Machine learning Machine learning
Machine learning
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
 
(Ch#1) artificial intelligence
(Ch#1) artificial intelligence(Ch#1) artificial intelligence
(Ch#1) artificial intelligence
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
 
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
 
The current state of prediction in neuroimaging
The current state of prediction in neuroimagingThe current state of prediction in neuroimaging
The current state of prediction in neuroimaging
 

Recently uploaded

Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 

Recently uploaded (20)

Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 

AI - history and recent breakthroughs

  • 1. Artificial Neural Networks recent breakthroughs and applications Armando Vieira
  • 2. Summary ● What went wrong with traditional AI? ● Probabilistic machines - a new paradigm ● Why Deep Learning is different and why it matters ● Applications ○ NLP ○ Image ○ Drug discovery ○ Weather forecasting ● Large generative models ○ GTP3 ○ LAMP ○ GLAMP ● Is this intelligence?
  • 3. Rule based programs Solve numerical equations Accurate Fast Pure logic Why “old” AI failed? Can’t deal with unstructured data Get stuck with exceptions to rules Not scalable All actions have to be programmed Rigid, can’t learn
  • 4. The vodka was good but the meat was rotten Blind and insane English - Russian - English “The spirit was willing but the flesh was weak“ “Out of sight out of mind”
  • 6. Symbolic thinking may be the supreme form of human intelligence but there are other ways to build “smart” machines
  • 7. The associative paradigm Learning by associations No true or false but probabilities No symbols but distributions Pattern matching Trained by examples not hard coded No CPU or memory, just signals flowing through a mesh of connections
  • 8. The Deep Learning Revolution
  • 10.
  • 11. What made the difference?
  • 14. How to make the right choice? POSITIVE MITOSES FALSE POSITIVE MITOSES TRUE POSITIVE MITOSES Janowczyk A, Madabhushi A
  • 18.
  • 19. stateof.ai 2021 Deep learning models can learn drug-protein binding relationships from a small number of empirical experiments in order to help prioritise which areas of vast chemical spaces to virtually screen. Accelerating high-throughput virtual drug screening with model-guided search ● Structure-based drug discovery searches for drugs that bind a protein of interest whose 3D structure is available. This process, referred to as “docking”, can be run virtually using simulations. However, with databases of small molecule chemicals exploding past billions of records, virtually screening all combinations becomes computationally and commercially intractable. ● A solution is to train a model on a sample of drug-protein interactions with empirically determined docking scores. ● This model can be used to virtually score a library of interest, followed by docking the top scoring drug candidates. These results are used to update the model with active learning. With several iterations, model-guided search ultimately generates hits faster. #stateofai | 26 Introduction | Research | Talent | Industry | Politics | Predictions
  • 20.
  • 21.
  • 22. stateof.ai 2021 DreamerV2 is the first model-based RL agent trained on a single GPU to surpass human level performance on 55 popular tasks of the Atari benchmark.The agent learns behaviors purely within the latent space of a world model trained from pixels, which makes these behaviors more generalisable to solving future tasks more efficiently. Superhuman world models for Atari, but on a budget ● DreamerV2 vastly outperforms other RL agents trained with the same computational budget, across all performance aggregation metrics. #stateofai | 29 Introduction | Research | Talent | Industry | Politics | Predictions
  • 23. stateof.ai 2021 RL agents have shown impressive performance on challenging individual tasks. But can they generalize to tasks they never trained on? DeepMind trained RL agents on 3.4M tasks across a diverse set of 700k games in a 3D simulated environment, and show they can generalize to radically different games without additional training. Zero-shot generalisation in reinforcement learning ● The researchers created XLand, a vast controllable environment, which allows them to dynamically adapt both how the agents train and, crucially, the games on which they train. ● The distribution of games is learned using a hyperparameter optimization technique called Population Based Training. It allows them to find the games which have the right level of difficulty given the agents’ behaviour. This ensures the agents build evermore general capabilities. ● As training progresses, the agents exhibit heuristic behaviours such as experimenting, changing the state of the world, and cooperation, which are uncharacteristic of usual RL agents. These learned behaviours allow them to generalize to hand-designed held-out tasks, a first in RL research. Figure: Examples of XLand environments. Figure: Test metrics progress during training. #stateofai | 30 Introduction | Research | Talent | Industry | Politics | Predictions
  • 25. Some BIG generative models GTP3 Language model Next word prediction DALLE 2 Image and text Diffusion model GLAMP / PALM NLP and text understanding OpenAI OpenAI Google
  • 26. PALM
  • 27. Humans vs machines Not everyone agrees. “Artificial intelligence programs lack consciousness and self-awareness,” researcher Gwern Branwen wrote in his article about GPT-3. “They will never be able to have a sense of humor. They will never be able to appreciate art, or beauty, or love. They will never feel lonely. They will never have empathy for other people, for animals, for the environment. They will never enjoy music or fall in love, or cry at the drop of a hat.”
  • 30.
  • 31.
  • 33. The rise of diffusion models
  • 34.
  • 35. CLIP: Learning self-supervised representations of text and images
  • 37. A dinosaur dressing a suit is looking at the mirror
  • 38. “Honeybees wearing welding helmets while welding a futuristic giant steel honeycomb, digital art.”
  • 40. Image processing What has been solved Identification of objects Automatic subtitles generation Image segmentation, Depth NLP Writing coherent text Explainability Generative models High quality synthetic data Conditional text to image models Reinforcement Learning “Almost zero shot” learning Learning by observing: replication Object manipulation Video Object tracking and Identification Pose estimation Science Drug discovery Weather forecasting Physics Informed Networks
  • 41. IMAGE What’s still a challenge Zero shot learning Video Self driving cars Generative models Spatio-Temporal data Reinforcement learning Self exploration without goals NLP Keep coherence on long texts Understanding meaning Science Discover new laws Deductive thinking
  • 43. Beyond the present paradigma BEYOND GRADIENT DESCENDENT ● Gradient based algorithms are continuous but nature is discrete ● Learning can gradual but also through sharp transitions - paradigms ● Recursivity hard to model with GD FROM BLACK-BLOXES TO CONJECTURE MACHINES ● ANN are induction machines, but knownledge can also be deductive ● At the moment we are brute-forcing learning with big models and data ● Hard tp generalize with a single example