SlideShare a Scribd company logo
Brandon  Allgood,  PhD
Chief  Technology  Officer
allg00d
How  AI  can  be  implemented  in  
Key  Areas  of  Failure  along  the  
R&D  Pipeline
Why  now?
2
Source:  ANDREESSEN  HOROWITZ
AI  in  the  hype  cycle
3
We  tend  to  overestimate  the  effect  of  a  technology  in  the  short  run
and  underestimate  the  effect  in  the  long  run  – Amara’s  Law
What  is  AI?
ML  systems  are  function  (model)  generators
4
AI
General  AI Narrow  AI
Machine  
Learning
Unsupervised  
Learning
Clustering
Anomaly  
detection
PCA
Supervised  
Learning
Linear  
regression
SVM
Random  
Forest
Deep    
Learning
Reasoning  
Systems
…
…
…
Expert  
Systems
Areas  of  largest  impact
5
Common  characteristics
Data  is  cheap  (there  is  a  lot  of  labeled  data)
Training  sets  look  a  lot  like  space  being  predicted
Training  set  features  are  obvious
Very  constrained  environment
6
Data  in  pharma
Data  is  expensive  (small  amounts  of  labeled  data)
7
Data  in  pharma
Data  is  expensive  (small  amounts  of  labeled  data)
Data  is  imbalanced  (very  few  actives)
8
Active
Inactive
Healthy
Sick
Data  in  pharma
Data  is  expensive  (small  amounts  of  labeled  data)
Data  is  imbalanced  (very  few  actives)
Models  are  often  used  to  extrapolate,  not  interpolate
9
Data  in  pharma
Data  is  expensive  (small  amounts  of  labeled  data)
Data  is  imbalanced  (very  few  actives)
Models  are  often  used  to  extrapolate,  not  interpolate
Biological  processes  are  not  fully  understood
Right  features  are  unclear
Avoid  assumptions  about  the  biology
10
Data  in  pharma
Data  is  expensive  (small  amounts  of  labeled  data)
Data  is  imbalanced  (very  few  actives)
Models  are  often  used  to  extrapolate,  not  interpolate
Biological  processes  are  not  fully  understood
Right  features  are  unclear
Avoid  assumptions  about  the  biology
Processes  and  phenomena  span  orders  of  magnitude
11
Data  in  pharma
Data  is  expensive  (small  amounts  of  labeled  data)
Data  is  imbalanced  (very  few  actives)
Models  are  often  used  to  extrapolate,  not  interpolate
Biological  processes  are  not  fully  understood
Right  features  are  unclear
Avoid  assumptions  about  the  biology
Processes  and  phenomena  span  orders  of  magnitude
12
Need  to  develop  domain  specific  approaches
Enabling  data-­driven  research
Availability  and  discoverability  over  organization
Proper  structure  is  hard  to  anticipate
Organization  will  arise,  if  the  data  is  accessible
Storage  is  cheap,  keep  many  copies
13
Enabling  data-­driven  research
Availability  and  discoverability  over  organization
Proper  structure  is  hard  to  anticipate
Organization  will  arise,  if  the  data  is  accessible
Storage  is  cheap,  keep  many  copies
Co-­locate  the  data  with  the  compute
Data  gravity  – Moving  data  is  now  often  the  bottle  neck
14
Enabling  data-­driven  research
Availability  and  discoverability  over  organization
Proper  structure  is  hard  to  anticipate
Organization  will  arise,  if  the  data  is  accessible
Storage  is  cheap,  keep  many  copies
Co-­locate  the  data  with  the  compute
Data  gravity  – Moving  data  is  now  often  the  bottle  neck
Compute  should  ideally  be  unlimited  and  elastic
Use  the  public  cloud!
Security  posture  doesn’t  need  to  be  compromised,  a  new  skill  
set  is  required
15
Data  sharing
To  assume  the  AI  revolution  in  pharma  will  happen  in  
your  organization  is  not  realistic
Data  needs  to  reach  those  that  can  make  the  next  
breakthroughs  in  algorithm  design
It  is  time  to  start  sharing  data  through  data  partnerships
ATOM  collaboration  (GSK,  UCSF,  NCI,  LLNL,  others)
To  be  announced:  European  partnership  including  J&J
16
Data’s  value  lies  in  the  insights  extract  from  it
Not  your  grandma’s  CADD
CADD  tools  help  scientists  build  mental  models
AI  can  ingest  more  data  &  build  more  complex  models
AI  models  are  however  narrowly  focused  and  require  a  
human/AI  partnership  to  be  successful
Med-­chemists  can  now  focus  on  creating  enriched  spaces
Model  predictions  need  to  be  double  checked
Must  remain  vigilant  to  avoid  imparting  bias
17
Deploying  AI  systems  in  the  same  way  as  CADD  
will  highly  limit  their  effectiveness
Strengths  of  AI
Ingest  all  available  data  &  build  more  complex  models
Models  can  be  deployed  at  scale  to  evaluate  orders  of  
magnitude  more  options
Models  can  be  continually  retrained  as  new  data  
becomes  available
AI  systems  don’t  sleep
AI  is  naïve  (of  chemistry,  biology,  psychology,  etc)
18
Weaknesses  of  AI
Still  need  experts  to  build  AI  models
Need  some  data  to  start
Still  narrow,  so  training  &  output  need  to  be  scrutinized
AI  models  can  be  black  boxes
19
These  are  all  addressable  in  the  long  term
Impact  on  pharma  R&D
Omics  analysis
Literature  connections
Example:  Open  Targets
20
Target  ID Discovery Pre-­clinical Clinical Beyond
Source:  Enrico  Ferrero,  PhD,  Associate  GSK  Fellow
Impact  on  pharma  R&D
Hit  ID
Numerate  platform
AI  models  built  with  as  few  as  8  active  compounds
Identify  compounds  from  libraries  in  novel  scaffolds  (30%-­70%  hit  rate)
Screen  10M  purchasables  <  1  hour  or  128M  synthesizables  <  1  day
Including  active  learning  à predicts  most  informative  compounds
Lead  ID  &  optimization
Numerate  platform
Includes  all  available  data  from  all  available  source
Search  and  rank  ~10B  novel  compound  libraries  in  ~2  weeks
Drive  efficacy  in  fewer  design  cycles
Include  any  number  of  off-­target  or  ADME  models  of  concern
21
Target  ID Discovery Pre-­clinical Clinical Beyond
Impact  on  pharma  R&D
ADME  /  toxicity
Numerate  platform
ADME  suite  – Includes  models  for  all  standard  preclinical  in  vitro  
assays,  including,  stability,  permeability,  and  protein  binding
ToxTool – Includes  over  6,600  MoA  models  and  4,000  GO  based  meta-­
models  (including  pathway  context)
Other  work:  DL  applied  to  the  Merck  Kaggle challenge
22
Target  ID Discovery Pre-­clinical Clinical Beyond
Impact  on  pharma  R&D
Trial  design
Biomarker  development
Cohort  selection
Patient  recruitment
Patient compliance
Smart  data  collection
Anomaly  detection
Basic  analytics
23
Target  ID Discovery Pre-­clinical Clinical Beyond
Impact  on  pharma  R&D
Repurposing/Repositioning
Drug-­Omics  analysis
Literature  analysis
Patients
Preventative  care
Personalized  medicine
24
Target  ID Discovery Pre-­clinical Clinical Beyond
Strategic  Value  of  AI  in  pharma  R&D
Faster  drug  discovery
Increased  safety
Lower  failure  rates
Faster  regulatory  approval
Speed  to  market
Cost  efficiency
Improved  patient  engagement
Better  treatment  outcomes
New  growth  opportunities
25
Target  ID Discovery Pre-­clinical Clinical Beyond
Conclusions
Believe  the  hype,  just  don’t  get  ahead  of  yourself
Data  needs  to  be  made  more  available
Life  scientists  and  computer  scientists  need  to  come  
together  to  develop  domain  specific  methods
The  culture  and  approach  to  using  AI  cannot  be  the  
same  as  to  CADD  tools
Need  to  be  robust,  beyond  the  hype
26
Fourth  industrial  revolution?
Time  to  begin  retraining
The  ability  to  work  with  AI  and  understand  its  capability  &
potential  needs  to  be  pervasive
27

More Related Content

What's hot

2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
Pistoia Alliance
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
Pistoia Alliance
 
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Hellmuth Broda
 
ELRIG Event Biocity Scotland May19
ELRIG Event Biocity Scotland May19ELRIG Event Biocity Scotland May19
ELRIG Event Biocity Scotland May19
Angelo Pugliese
 
Running A Better Clinical Trial
Running A Better Clinical TrialRunning A Better Clinical Trial
Running A Better Clinical Trial
David Levin
 
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_SymptomsCentral_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Tiffany Nealy
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
Pistoia Alliance
 
Genpro Life Sciences
Genpro Life SciencesGenpro Life Sciences
Genpro Life Sciences
Anoop Ambika
 
Moving from programmer to statistician
Moving from programmer to statisticianMoving from programmer to statistician
Moving from programmer to statistician
ray4hz
 
Analytics in Pharmaceutical Industry
Analytics in Pharmaceutical IndustryAnalytics in Pharmaceutical Industry
Analytics in Pharmaceutical Industry
International School of Engineering
 
Filling the gaps in translational research
Filling the gaps in translational researchFilling the gaps in translational research
Filling the gaps in translational research
Paul Agapow
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINAL
John Koch
 
Machine Learning in Healthcare: A Case Study
Machine Learning in Healthcare: A Case StudyMachine Learning in Healthcare: A Case Study
Machine Learning in Healthcare: A Case Study
AlgoAnalytics Financial Consultancy Pvt. Ltd.
 
Machine Learning and Multi Drug Resistant(MDR) Infections case study
Machine Learning and Multi Drug Resistant(MDR) Infections case studyMachine Learning and Multi Drug Resistant(MDR) Infections case study
Machine Learning and Multi Drug Resistant(MDR) Infections case study
AlgoAnalytics Financial Consultancy Pvt. Ltd.
 
Data Integrity Validation Keynote Address Boston August 2016
Data Integrity Validation Keynote Address Boston August 2016Data Integrity Validation Keynote Address Boston August 2016
Data Integrity Validation Keynote Address Boston August 2016
Ajaz Hussain
 
Pharos: Putting targets in context
Pharos: Putting targets in contextPharos: Putting targets in context
Pharos: Putting targets in context
Rajarshi Guha
 
Pharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomePharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark Genome
Rajarshi Guha
 
AlogoAnalytics Company Presentation
AlogoAnalytics Company PresentationAlogoAnalytics Company Presentation
AlogoAnalytics Company Presentation
AlgoAnalytics Financial Consultancy Pvt. Ltd.
 
How to Create a Big Data Culture in Pharma
How to Create a Big Data Culture in PharmaHow to Create a Big Data Culture in Pharma
How to Create a Big Data Culture in Pharma
Chris Waller
 
Raghav_CDM-Exp_4.5yrs
Raghav_CDM-Exp_4.5yrsRaghav_CDM-Exp_4.5yrs
Raghav_CDM-Exp_4.5yrs
Raghavendra S
 

What's hot (20)

2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
 
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
 
ELRIG Event Biocity Scotland May19
ELRIG Event Biocity Scotland May19ELRIG Event Biocity Scotland May19
ELRIG Event Biocity Scotland May19
 
Running A Better Clinical Trial
Running A Better Clinical TrialRunning A Better Clinical Trial
Running A Better Clinical Trial
 
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_SymptomsCentral_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
 
Genpro Life Sciences
Genpro Life SciencesGenpro Life Sciences
Genpro Life Sciences
 
Moving from programmer to statistician
Moving from programmer to statisticianMoving from programmer to statistician
Moving from programmer to statistician
 
Analytics in Pharmaceutical Industry
Analytics in Pharmaceutical IndustryAnalytics in Pharmaceutical Industry
Analytics in Pharmaceutical Industry
 
Filling the gaps in translational research
Filling the gaps in translational researchFilling the gaps in translational research
Filling the gaps in translational research
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINAL
 
Machine Learning in Healthcare: A Case Study
Machine Learning in Healthcare: A Case StudyMachine Learning in Healthcare: A Case Study
Machine Learning in Healthcare: A Case Study
 
Machine Learning and Multi Drug Resistant(MDR) Infections case study
Machine Learning and Multi Drug Resistant(MDR) Infections case studyMachine Learning and Multi Drug Resistant(MDR) Infections case study
Machine Learning and Multi Drug Resistant(MDR) Infections case study
 
Data Integrity Validation Keynote Address Boston August 2016
Data Integrity Validation Keynote Address Boston August 2016Data Integrity Validation Keynote Address Boston August 2016
Data Integrity Validation Keynote Address Boston August 2016
 
Pharos: Putting targets in context
Pharos: Putting targets in contextPharos: Putting targets in context
Pharos: Putting targets in context
 
Pharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomePharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark Genome
 
AlogoAnalytics Company Presentation
AlogoAnalytics Company PresentationAlogoAnalytics Company Presentation
AlogoAnalytics Company Presentation
 
How to Create a Big Data Culture in Pharma
How to Create a Big Data Culture in PharmaHow to Create a Big Data Culture in Pharma
How to Create a Big Data Culture in Pharma
 
Raghav_CDM-Exp_4.5yrs
Raghav_CDM-Exp_4.5yrsRaghav_CDM-Exp_4.5yrs
Raghav_CDM-Exp_4.5yrs
 

Similar to AI Pharma Summit Keynote Boston 7-26-17

Transforming Big Data into Big Value
Transforming Big Data into Big ValueTransforming Big Data into Big Value
Transforming Big Data into Big Value
Thomas Kelly, PMP
 
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
Nick Brown
 
ai in clinical trails.pptx
ai in clinical trails.pptxai in clinical trails.pptx
ai in clinical trails.pptx
RajdeepMaji3
 
aiinclinicaltrails-221008052225-c7ed8a95.pdf
aiinclinicaltrails-221008052225-c7ed8a95.pdfaiinclinicaltrails-221008052225-c7ed8a95.pdf
aiinclinicaltrails-221008052225-c7ed8a95.pdf
MartaHC1
 
"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D
Grid Dynamics
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Tom Plasterer
 
Practical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial IntelligencePractical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial Intelligence
Al Dossetter
 
AI applications in life sciences - drug development
AI applications in life sciences - drug developmentAI applications in life sciences - drug development
AI applications in life sciences - drug development
Jayanthi Repalli, PhD
 
AI in Healthcare
AI in HealthcareAI in Healthcare
AI in Healthcare
Paul Agapow
 
Big data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simpleBig data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simple
Hadas Jacoby
 
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdfApplications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
ArunPrasad880048
 
Five critical lessons you should learn from the IBM Watson misfire slideshare
Five critical lessons you should learn from the IBM Watson misfire   slideshareFive critical lessons you should learn from the IBM Watson misfire   slideshare
Five critical lessons you should learn from the IBM Watson misfire slideshare
Ruth Smith
 
Big Data in Disease Management
Big Data in Disease ManagementBig Data in Disease Management
Big Data in Disease Management
InterpretOmics
 
Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...
Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...
Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...
Pistoia Alliance
 
Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020
Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020
Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020
Sean Ekins
 
Faster and Cheaper Clinical Trials: The Benefit of Synthetic Data
Faster and Cheaper Clinical Trials: The Benefit of Synthetic DataFaster and Cheaper Clinical Trials: The Benefit of Synthetic Data
Faster and Cheaper Clinical Trials: The Benefit of Synthetic Data
accenture
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
Paul Agapow
 
Computer aided drug designing (CADD)
Computer aided drug designing (CADD)Computer aided drug designing (CADD)
Computer aided drug designing (CADD)
Aakshay Subramaniam
 
Data & Technology in Clinical Trials
Data & Technology in Clinical TrialsData & Technology in Clinical Trials
Data & Technology in Clinical Trials
Nassim Azzi, MBA
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
Pratiksha Chandragirivar
 

Similar to AI Pharma Summit Keynote Boston 7-26-17 (20)

Transforming Big Data into Big Value
Transforming Big Data into Big ValueTransforming Big Data into Big Value
Transforming Big Data into Big Value
 
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
 
ai in clinical trails.pptx
ai in clinical trails.pptxai in clinical trails.pptx
ai in clinical trails.pptx
 
aiinclinicaltrails-221008052225-c7ed8a95.pdf
aiinclinicaltrails-221008052225-c7ed8a95.pdfaiinclinicaltrails-221008052225-c7ed8a95.pdf
aiinclinicaltrails-221008052225-c7ed8a95.pdf
 
"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
 
Practical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial IntelligencePractical Drug Discovery using Explainable Artificial Intelligence
Practical Drug Discovery using Explainable Artificial Intelligence
 
AI applications in life sciences - drug development
AI applications in life sciences - drug developmentAI applications in life sciences - drug development
AI applications in life sciences - drug development
 
AI in Healthcare
AI in HealthcareAI in Healthcare
AI in Healthcare
 
Big data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simpleBig data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simple
 
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdfApplications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
Applications-of-AI-in-Drug-Discovery-and-Development-PreScouter.pdf
 
Five critical lessons you should learn from the IBM Watson misfire slideshare
Five critical lessons you should learn from the IBM Watson misfire   slideshareFive critical lessons you should learn from the IBM Watson misfire   slideshare
Five critical lessons you should learn from the IBM Watson misfire slideshare
 
Big Data in Disease Management
Big Data in Disease ManagementBig Data in Disease Management
Big Data in Disease Management
 
Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...
Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...
Collaborative Drug Discovery -- Life Science Collaboration & Virtualization: ...
 
Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020
Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020
Pistoia Pharma-in-a box: A vision for virtualized pharma in 2020
 
Faster and Cheaper Clinical Trials: The Benefit of Synthetic Data
Faster and Cheaper Clinical Trials: The Benefit of Synthetic DataFaster and Cheaper Clinical Trials: The Benefit of Synthetic Data
Faster and Cheaper Clinical Trials: The Benefit of Synthetic Data
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
 
Computer aided drug designing (CADD)
Computer aided drug designing (CADD)Computer aided drug designing (CADD)
Computer aided drug designing (CADD)
 
Data & Technology in Clinical Trials
Data & Technology in Clinical TrialsData & Technology in Clinical Trials
Data & Technology in Clinical Trials
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 

Recently uploaded

5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
maazsz111
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 

Recently uploaded (20)

5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 

AI Pharma Summit Keynote Boston 7-26-17

  • 1. Brandon  Allgood,  PhD Chief  Technology  Officer allg00d How  AI  can  be  implemented  in   Key  Areas  of  Failure  along  the   R&D  Pipeline
  • 3. AI  in  the  hype  cycle 3 We  tend  to  overestimate  the  effect  of  a  technology  in  the  short  run and  underestimate  the  effect  in  the  long  run  – Amara’s  Law
  • 4. What  is  AI? ML  systems  are  function  (model)  generators 4 AI General  AI Narrow  AI Machine   Learning Unsupervised   Learning Clustering Anomaly   detection PCA Supervised   Learning Linear   regression SVM Random   Forest Deep     Learning Reasoning   Systems … … … Expert   Systems
  • 5. Areas  of  largest  impact 5
  • 6. Common  characteristics Data  is  cheap  (there  is  a  lot  of  labeled  data) Training  sets  look  a  lot  like  space  being  predicted Training  set  features  are  obvious Very  constrained  environment 6
  • 7. Data  in  pharma Data  is  expensive  (small  amounts  of  labeled  data) 7
  • 8. Data  in  pharma Data  is  expensive  (small  amounts  of  labeled  data) Data  is  imbalanced  (very  few  actives) 8 Active Inactive Healthy Sick
  • 9. Data  in  pharma Data  is  expensive  (small  amounts  of  labeled  data) Data  is  imbalanced  (very  few  actives) Models  are  often  used  to  extrapolate,  not  interpolate 9
  • 10. Data  in  pharma Data  is  expensive  (small  amounts  of  labeled  data) Data  is  imbalanced  (very  few  actives) Models  are  often  used  to  extrapolate,  not  interpolate Biological  processes  are  not  fully  understood Right  features  are  unclear Avoid  assumptions  about  the  biology 10
  • 11. Data  in  pharma Data  is  expensive  (small  amounts  of  labeled  data) Data  is  imbalanced  (very  few  actives) Models  are  often  used  to  extrapolate,  not  interpolate Biological  processes  are  not  fully  understood Right  features  are  unclear Avoid  assumptions  about  the  biology Processes  and  phenomena  span  orders  of  magnitude 11
  • 12. Data  in  pharma Data  is  expensive  (small  amounts  of  labeled  data) Data  is  imbalanced  (very  few  actives) Models  are  often  used  to  extrapolate,  not  interpolate Biological  processes  are  not  fully  understood Right  features  are  unclear Avoid  assumptions  about  the  biology Processes  and  phenomena  span  orders  of  magnitude 12 Need  to  develop  domain  specific  approaches
  • 13. Enabling  data-­driven  research Availability  and  discoverability  over  organization Proper  structure  is  hard  to  anticipate Organization  will  arise,  if  the  data  is  accessible Storage  is  cheap,  keep  many  copies 13
  • 14. Enabling  data-­driven  research Availability  and  discoverability  over  organization Proper  structure  is  hard  to  anticipate Organization  will  arise,  if  the  data  is  accessible Storage  is  cheap,  keep  many  copies Co-­locate  the  data  with  the  compute Data  gravity  – Moving  data  is  now  often  the  bottle  neck 14
  • 15. Enabling  data-­driven  research Availability  and  discoverability  over  organization Proper  structure  is  hard  to  anticipate Organization  will  arise,  if  the  data  is  accessible Storage  is  cheap,  keep  many  copies Co-­locate  the  data  with  the  compute Data  gravity  – Moving  data  is  now  often  the  bottle  neck Compute  should  ideally  be  unlimited  and  elastic Use  the  public  cloud! Security  posture  doesn’t  need  to  be  compromised,  a  new  skill   set  is  required 15
  • 16. Data  sharing To  assume  the  AI  revolution  in  pharma  will  happen  in   your  organization  is  not  realistic Data  needs  to  reach  those  that  can  make  the  next   breakthroughs  in  algorithm  design It  is  time  to  start  sharing  data  through  data  partnerships ATOM  collaboration  (GSK,  UCSF,  NCI,  LLNL,  others) To  be  announced:  European  partnership  including  J&J 16 Data’s  value  lies  in  the  insights  extract  from  it
  • 17. Not  your  grandma’s  CADD CADD  tools  help  scientists  build  mental  models AI  can  ingest  more  data  &  build  more  complex  models AI  models  are  however  narrowly  focused  and  require  a   human/AI  partnership  to  be  successful Med-­chemists  can  now  focus  on  creating  enriched  spaces Model  predictions  need  to  be  double  checked Must  remain  vigilant  to  avoid  imparting  bias 17 Deploying  AI  systems  in  the  same  way  as  CADD   will  highly  limit  their  effectiveness
  • 18. Strengths  of  AI Ingest  all  available  data  &  build  more  complex  models Models  can  be  deployed  at  scale  to  evaluate  orders  of   magnitude  more  options Models  can  be  continually  retrained  as  new  data   becomes  available AI  systems  don’t  sleep AI  is  naïve  (of  chemistry,  biology,  psychology,  etc) 18
  • 19. Weaknesses  of  AI Still  need  experts  to  build  AI  models Need  some  data  to  start Still  narrow,  so  training  &  output  need  to  be  scrutinized AI  models  can  be  black  boxes 19 These  are  all  addressable  in  the  long  term
  • 20. Impact  on  pharma  R&D Omics  analysis Literature  connections Example:  Open  Targets 20 Target  ID Discovery Pre-­clinical Clinical Beyond Source:  Enrico  Ferrero,  PhD,  Associate  GSK  Fellow
  • 21. Impact  on  pharma  R&D Hit  ID Numerate  platform AI  models  built  with  as  few  as  8  active  compounds Identify  compounds  from  libraries  in  novel  scaffolds  (30%-­70%  hit  rate) Screen  10M  purchasables  <  1  hour  or  128M  synthesizables  <  1  day Including  active  learning  à predicts  most  informative  compounds Lead  ID  &  optimization Numerate  platform Includes  all  available  data  from  all  available  source Search  and  rank  ~10B  novel  compound  libraries  in  ~2  weeks Drive  efficacy  in  fewer  design  cycles Include  any  number  of  off-­target  or  ADME  models  of  concern 21 Target  ID Discovery Pre-­clinical Clinical Beyond
  • 22. Impact  on  pharma  R&D ADME  /  toxicity Numerate  platform ADME  suite  – Includes  models  for  all  standard  preclinical  in  vitro   assays,  including,  stability,  permeability,  and  protein  binding ToxTool – Includes  over  6,600  MoA  models  and  4,000  GO  based  meta-­ models  (including  pathway  context) Other  work:  DL  applied  to  the  Merck  Kaggle challenge 22 Target  ID Discovery Pre-­clinical Clinical Beyond
  • 23. Impact  on  pharma  R&D Trial  design Biomarker  development Cohort  selection Patient  recruitment Patient compliance Smart  data  collection Anomaly  detection Basic  analytics 23 Target  ID Discovery Pre-­clinical Clinical Beyond
  • 24. Impact  on  pharma  R&D Repurposing/Repositioning Drug-­Omics  analysis Literature  analysis Patients Preventative  care Personalized  medicine 24 Target  ID Discovery Pre-­clinical Clinical Beyond
  • 25. Strategic  Value  of  AI  in  pharma  R&D Faster  drug  discovery Increased  safety Lower  failure  rates Faster  regulatory  approval Speed  to  market Cost  efficiency Improved  patient  engagement Better  treatment  outcomes New  growth  opportunities 25 Target  ID Discovery Pre-­clinical Clinical Beyond
  • 26. Conclusions Believe  the  hype,  just  don’t  get  ahead  of  yourself Data  needs  to  be  made  more  available Life  scientists  and  computer  scientists  need  to  come   together  to  develop  domain  specific  methods The  culture  and  approach  to  using  AI  cannot  be  the   same  as  to  CADD  tools Need  to  be  robust,  beyond  the  hype 26
  • 27. Fourth  industrial  revolution? Time  to  begin  retraining The  ability  to  work  with  AI  and  understand  its  capability  & potential  needs  to  be  pervasive 27