http://pistoiaalliance.org @PistoiaAlliance
Pistoia Alliance HELM Project
- What About the Big Guys?
The emerging HELM standard for macromolecular
representation
Domain Lead – Sergio Rotstein
Business Technology, Pfizer
What is a “Biomolecule”?
• Peptides
• Therapeutic Proteins
• ADCs
• Antibodies
• Vaccines
• ASOs
• siRNAs
For our purposes, anything that is not a small molecule is a biomolecule
Goal
• Eliminate biomolecule penalty
• Make these entities first-class citizens of the Informatics tool portfolio
So what’s the problem?
[Figure: chemical structure drawings and a diagram labeled “Small Molecules”, “Biomolecules”, “Sequences”, “Small Molecule Tools”, “Sequence-Based Tools” and “GAP”.]
“Fit-for-Purpose” Structure Representation
We need to enable the
representation, manipulation and
visualization of each molecule type in
a way that is appropriate for its size
and complexity
Fit for Purpose: “Monomer” Level
• While you could draw out an oligonucleotide like this:
• The representation is likely more intuitive / practical:
Fit for Purpose: Sequence Level
• But even the monomer level representation would not scale well to
proteins with hundreds of amino acids. Larger molecules require a
more sequence-oriented representation:
Fit for Purpose: Component Level
• For multi-component structures such as antibody-drug
conjugates, component-level representations are required to enable
each component to be dealt with separately.
[Figure: an antibody-drug conjugate drawn as a “collapsed” antibody (Ab) joined to an expanded drug.]
Hierarchical Editing Language for Macromolecules
– Hierarchical – Amenable to the various “levels”
• Complex Polymer ⇒ Simple Polymer ⇒ Monomer ⇒ Atom
– Extensible
• Allowing addition of new biopolymer types
– (Reasonably) comprehensive
• e.g. Allowing representation of oligonucleotide
hybridization
– Canonicalizable
• Facilitating uniqueness checking
– (Somewhat) human-readable
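The hierarchy listed above can be pictured as nested data. The following is a minimal Python sketch, not the HELM data model itself: the class names, field names and the toy "Ab" / "Drug" components are all hypothetical, and the atom level is left out because it would come from each monomer's chemical structure.

```python
from dataclasses import dataclass, field


@dataclass
class SimplePolymer:
    polymer_id: str      # hypothetical component label, e.g. "Ab"
    monomers: list[str]  # monomer codes in sequence order


@dataclass
class ComplexPolymer:
    components: list[SimplePolymer] = field(default_factory=list)

    def component_view(self) -> list[str]:
        """Collapsed view: one label per simple polymer."""
        return [sp.polymer_id for sp in self.components]

    def monomer_view(self) -> list[str]:
        """Expanded view: every monomer code of every component, in order."""
        return [m for sp in self.components for m in sp.monomers]


# Toy example only: the sequences are placeholders, not real molecules.
conjugate = ComplexPolymer([
    SimplePolymer("Ab", ["A", "R", "G"]),
    SimplePolymer("Drug", ["dF"]),
])
print(conjugate.component_view())  # ['Ab', 'Drug']
print(conjugate.monomer_view())    # ['A', 'R', 'G', 'dF']
```

The same object answers questions at the component level or the monomer level, which is the point of a hierarchical notation.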
HELM Example: Simple polymer
• HELM notation: A.R.G.[dF].C.K.[ahA].E.D.A
– Non-natural amino acid codes are enclosed in square
brackets
• Natural equivalent: ARGFCKXEDA
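As a minimal sketch of how the simple-polymer string above relates to its natural equivalent, the Python below splits the notation on periods, strips the square brackets, and maps each monomer to a one-letter code. The `NATURAL_ANALOG` table and the function names are hypothetical and cover only the two non-natural codes on this slide; in practice this information would presumably come from the monomer database. Splitting on "." also assumes that no monomer code itself contains a period.

```python
# Hypothetical lookup: non-natural monomer code -> natural one-letter analog.
# Only the two codes from the slide are listed; "X" stands for "no natural analog".
NATURAL_ANALOG = {"dF": "F", "ahA": "X"}


def split_monomers(simple_polymer: str) -> list[str]:
    """Split 'A.R.G.[dF]' into ['A', 'R', 'G', 'dF'], dropping the brackets."""
    monomers = []
    for token in simple_polymer.split("."):
        if token.startswith("[") and token.endswith("]"):
            token = token[1:-1]
        monomers.append(token)
    return monomers


def natural_equivalent(simple_polymer: str) -> str:
    """Return the one-letter sequence; unlisted multi-letter codes become 'X'."""
    letters = []
    for monomer in split_monomers(simple_polymer):
        if len(monomer) == 1:
            letters.append(monomer)  # natural amino acid, already one letter
        else:
            letters.append(NATURAL_ANALOG.get(monomer, "X"))
    return "".join(letters)


print(natural_equivalent("A.R.G.[dF].C.K.[ahA].E.D.A"))  # ARGFCKXEDA
```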
HELM Example: Complex Polymer
Monomer Database
• Each monomer used in the notation needs to be predefined in a
monomer database
• The database includes the chemical structure of the monomer and
a description of all acceptable attachment points
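A rough Python sketch of what such a database entry and the "must be predefined" check could look like is given below. It is an illustration under stated assumptions, not the actual HELM monomer database schema: the `Monomer` fields, the "R1" / "R2" attachment point labels and the two-entry `MONOMER_DB` are hypothetical, while the SMILES strings are ordinary ones for L-alanine and glycine.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Monomer:
    code: str                          # symbol used in the notation, e.g. "A"
    smiles: str                        # chemical structure of the monomer
    attachment_points: tuple[str, ...] # labels of the acceptable attachment points


# Hypothetical two-entry database; the R1/R2 labels are an assumed naming scheme.
MONOMER_DB = {
    "A": Monomer("A", "C[C@H](N)C(=O)O", ("R1", "R2")),  # L-alanine
    "G": Monomer("G", "NCC(=O)O", ("R1", "R2")),         # glycine
}


def undefined_monomers(codes, db=MONOMER_DB):
    """Return, in order, the codes that are not predefined in the database."""
    return [code for code in codes if code not in db]


print(undefined_monomers(["A", "G", "dF"]))  # ['dF']
```

A notation would only be accepted when this check returns an empty list, which is how the predefinition requirement above could be enforced.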
J. Chem. Inf. Model 2012, 52, 2796-2806


Editor's Notes

  1. Paper will soon be posted on the upcoming HELM web site.