SlideShare a Scribd company logo
1 of 27
Biologics information in PubChem
Jian Zhang*, Paul Thiessen, Tiejun Cheng, Ben Shoemaker, Evan Bolton,
Noel O'Boyle, Roger Sayle
2019 Fall ACS National Meeting and Expo, San Diego
Biologics: definitions..
Biologic(s) - Biological products:
 a wide range of products such as vaccines, blood and blood
components, tissues ..
 can be composed of sugars, proteins, lipids, or nucleic acids or
complex combinations of these substances
 isolated from a variety of natural sources - human, animal, or
microorganism ...
 produced by biotechnology methods and other cutting-edge
technologies...
 used to treat a variety of medical conditions for which no other
treatments are available...
Biologics: large to small
• Biologics – large molecules, can be composed of sugars, proteins, or
nucleic acids or complex combinations of these substances.
• Information of sugars, proteins, nucleic acids, peptides are important for
biologic studies.
Example: vaccination – virus or bacteria antigen to stimulate the body to
produce antibody
Biologics: large to small
• Biologics extension – small biopolymers (oligomers, or repeat units) play
an import role for biologic studies.
• Small molecules – atoms < 1000 … biopolymers: components of protein,
glycans, nucleotides…
• Extended (PubChem) definition:
Structure contains recognized biopolymers monomers (glycan, lipids,
amino-acid, nucleotide… )
Outline
• PubChem brief
• Biologic information in PubChem - line notations from Sugar &
Splice
• Data accessing and retrieving
• NCBI Glycans
• Summary
PubChem brief ..
• An open chemistry database
• A public chemical information repository
• A chemical information hub
Contents: Chemicals structure, depictions and
notations, properties, drug information, food
additives, safety, toxicity, target, pathways,
bioactivities, literature, patents, and more ..
PubChem brief ..
• PubChem keep growing ..
Data Collection Live Item Count
Compounds 95,753,185
Substances 234,916,398
BioAssays 1,340,534
Bioactivities 265,373,498
Gene Targets 58,029
Protein Targets 17,847
Taxonomy Targets 3,746
Literature 29,876,654
Patents 3,142,716
Biologics in PubChem
Extended (PubChem) definition:
structure contains recognized biopolymers,
monomers (glycan, amino-acid, nucleotide,
lipids …)
Biologics in PubChem - examples
AZT: CID 35370
CID 1083 CID 138798156 CID 138802788
CID 3687 CID 10008322
Biologics in PubChem – Contents
PubChem biologic description:
• Condensed IUPAC
• Sequence
• PLN (protein line notation)
• HELM (Hierarchical Editing Language for Macromolecules)
• LINUCS (LInear Notation for Unique description of Carbohydrate
Sequences)
• SNFG depictions
• IUPAC
Biologics in PubChem – examples proc with Sugar&Splice
AZT: CID 35370
CID 1083 CID 138798156 CID 138802788
CID 3687 CID 10008322
Biologics in PubChem –
structure can be very complex
E.g. Teriparatide
(Teriparatide is a recombinant human
parathyroid hormone analogue that is
used to treat osteoporosis in women or
men with a high risk for bone fracture)
https://pubchem.ncbi.nlm.nih.gov/comp
ound/Teriparatide
Sugar & Slice – Generate line notations for biologics
Teriparatide
Biologics in PubChem – saccharides example
G(M1)-Oligosaccharide:
https://pubchem.ncbi.nlm.nih.gov/compound/G(M1)-
Oligosaccharide
Biologics in PubChem – peptide lipids example
(2S,3S)-3-methyl-2-[[(2S)-2-[[2-[[2-
(tetradecanoylamino)acetyl]amino]ace
tyl]amino]propanoyl]amino]pentanoic
acid
CID 138810998
https://pubchem.ncbi.nlm.nih.gov/com
pound/138810998
Line notations from “Sugar and Splice” simplified the
complex structure information which is readable for both
human and computer.
Biologics in PubChem – browse and download
PubChem classification browser: Compound TOC tree
https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72
1) Start from the PubChem homepage
2) Click the “browse” data to launch the PubChem classification
browser
3) At the dropdown menu, choose “PubChem”, then “PubChem
Compound TOC”
Biologics in PubChem – browse and download
Biologics in PubChem – data accessing
• Website:
1. Text search (Google, PubChem .. )
2. PubChem structure search
• Programmatic: Pug_view API
Format Heading: Biologic Description
CID
https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/6918011/
XML?heading=Biologic%20Description
Biologics in PubChem – data accessing
https://pubchem.ncbi.nlm.nih.gov/compound/6918011#section=
Biologic-Description
Example: CID 6918011 - Lanreotide
Biologics in PubChem – data accessing
Format Heading: Biologic Description
CID
https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/6918011/
XML?heading=Biologic%20Description
Pug_view API for the same compound:
NCBI Glycans
NCBI glycans website was created in 2016 as a joint project from
PubChem and the Glycan Informatics Advisory Group (globally).
• Glycan information resource summary
• Definition for the carbohydrate monomer depictions
• Links to various resource tools.
NCBI Glycans – https://www.ncbi.nlm.nih.gov/glycans/
The main page contains a brief introduction, links to other pages
and external resources
Symbol Nomenclature for Glycans (SNFG)
NCBI Glycans – https://www.ncbi.nlm.nih.gov/glycans/snfg.html
• The SNFG (symbol nomenclature for glycans) page provides
carbohydrate monomer depictions, useful resource links, and
SNFG examples.
NCBI Glycans – SNFG examples
NCBI Glycans – https://www.ncbi.nlm.nih.gov/glycans/snfg.html
• Monosaccharide monomer abbreviations and names.
Summary
• PubChem provides biologic information for more than 1.5
million compounds.
• The line notation created using “Sugar and Splice” simplified the
complex structure information which is readable for both human
and computer.
• The biologic information in PubChem can be accessed and
retrieved in both website and program.
• The NCBI glycans website provides a great resource for glycan
studies.
Thank you. This research was supported by the Intramural Research
Program of the NIH, National Library of Medicine.
Evan Bolton
Asta Gindulyte
Ben Shoemaker
Paul Thiessen
Siqian He
Bo Yu
Jie Chen
Tiejun Cheng
Jane He
Sunghwan Kim
Leon Li
Leonid Zaslavsky
Collaborators
Noel O'Boyle, NextMove Software
Roger Sayle, NextMove Software
The Glycan Informatics Advisory Group (GlyAG)

More Related Content

Similar to Biologics information in PubChem

Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolJesminBinti
 
PubChem and Big Data Chemistry
PubChem and Big Data ChemistryPubChem and Big Data Chemistry
PubChem and Big Data ChemistrySunghwan Kim
 
PepTalk the protein science week 2015
PepTalk the protein science week 2015PepTalk the protein science week 2015
PepTalk the protein science week 2015Nicole Proulx
 
SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by Pranjali ...
SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by  Pranjali ...SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by  Pranjali ...
SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by Pranjali ...pranjali bhadane
 
PubChem: A Public Chemical Information Resource for Big Data Chemistry
PubChem: A Public Chemical Information Resource for Big Data ChemistryPubChem: A Public Chemical Information Resource for Big Data Chemistry
PubChem: A Public Chemical Information Resource for Big Data ChemistrySunghwan Kim
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Syed Muhammad Ali Hasnain
 
10th Annual Bioassays and Bioanalytical Method Development Conference Report ...
10th Annual Bioassays and Bioanalytical Method Development Conference Report ...10th Annual Bioassays and Bioanalytical Method Development Conference Report ...
10th Annual Bioassays and Bioanalytical Method Development Conference Report ...Doranelly (Dolly) Koltchev
 
Exploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural productsExploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural productsSunghwan Kim
 
Pbio Investor Presentation Global Online Growth Conference - Oct 2016
Pbio Investor Presentation Global Online Growth Conference - Oct 2016Pbio Investor Presentation Global Online Growth Conference - Oct 2016
Pbio Investor Presentation Global Online Growth Conference - Oct 2016RedChip Companies, Inc.
 
Big data in metabolism
Big data in metabolismBig data in metabolism
Big data in metabolismAlichy Sowmya
 
Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to PHARMACOLOGY
 
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuKAUSHAL SAHU
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxAbelPhilipJoseph
 
Pressure Biosciences Investor Presentation
Pressure Biosciences Investor PresentationPressure Biosciences Investor Presentation
Pressure Biosciences Investor PresentationRedChip Companies, Inc.
 
Synthetic Biology.pptx
Synthetic Biology.pptxSynthetic Biology.pptx
Synthetic Biology.pptxShubNoor4
 
Sample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchSample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchQIAGEN
 

Similar to Biologics information in PubChem (20)

Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
PubChem and Big Data Chemistry
PubChem and Big Data ChemistryPubChem and Big Data Chemistry
PubChem and Big Data Chemistry
 
PepTalk the protein science week 2015
PepTalk the protein science week 2015PepTalk the protein science week 2015
PepTalk the protein science week 2015
 
SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by Pranjali ...
SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by  Pranjali ...SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by  Pranjali ...
SYNTHETIC BIOLOGY: Putting engineering into biology | Presented by Pranjali ...
 
PubChem: A Public Chemical Information Resource for Big Data Chemistry
PubChem: A Public Chemical Information Resource for Big Data ChemistryPubChem: A Public Chemical Information Resource for Big Data Chemistry
PubChem: A Public Chemical Information Resource for Big Data Chemistry
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
 
Pubchem
PubchemPubchem
Pubchem
 
DRUG discovery
DRUG discoveryDRUG discovery
DRUG discovery
 
10th Annual Bioassays and Bioanalytical Method Development Conference Report ...
10th Annual Bioassays and Bioanalytical Method Development Conference Report ...10th Annual Bioassays and Bioanalytical Method Development Conference Report ...
10th Annual Bioassays and Bioanalytical Method Development Conference Report ...
 
Exploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural productsExploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural products
 
Pbio Investor Presentation Global Online Growth Conference - Oct 2016
Pbio Investor Presentation Global Online Growth Conference - Oct 2016Pbio Investor Presentation Global Online Growth Conference - Oct 2016
Pbio Investor Presentation Global Online Growth Conference - Oct 2016
 
Big data in metabolism
Big data in metabolismBig data in metabolism
Big data in metabolism
 
BIOINFORMATICS.pptx
BIOINFORMATICS.pptxBIOINFORMATICS.pptx
BIOINFORMATICS.pptx
 
Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020
 
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahu
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptx
 
Pressure Biosciences Investor Presentation
Pressure Biosciences Investor PresentationPressure Biosciences Investor Presentation
Pressure Biosciences Investor Presentation
 
Synthetic Biology.pptx
Synthetic Biology.pptxSynthetic Biology.pptx
Synthetic Biology.pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Sample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome ResearchSample Prep Solutions for Microbiome Research
Sample Prep Solutions for Microbiome Research
 

More from Jian Zhang

Chemical Information for Household Products
Chemical Information for Household ProductsChemical Information for Household Products
Chemical Information for Household ProductsJian Zhang
 
Chemical safety information in PubChem
Chemical safety information in PubChemChemical safety information in PubChem
Chemical safety information in PubChemJian Zhang
 
Publishing chemical data in public data repository
Publishing chemical data in public data repository Publishing chemical data in public data repository
Publishing chemical data in public data repository Jian Zhang
 
Green chemistry and open data
Green chemistry and open dataGreen chemistry and open data
Green chemistry and open dataJian Zhang
 
Environmental chemical information in PubChem
Environmental chemical information in PubChem Environmental chemical information in PubChem
Environmental chemical information in PubChem Jian Zhang
 

More from Jian Zhang (6)

Chemical Information for Household Products
Chemical Information for Household ProductsChemical Information for Household Products
Chemical Information for Household Products
 
PubChem LCSS
PubChem LCSSPubChem LCSS
PubChem LCSS
 
Chemical safety information in PubChem
Chemical safety information in PubChemChemical safety information in PubChem
Chemical safety information in PubChem
 
Publishing chemical data in public data repository
Publishing chemical data in public data repository Publishing chemical data in public data repository
Publishing chemical data in public data repository
 
Green chemistry and open data
Green chemistry and open dataGreen chemistry and open data
Green chemistry and open data
 
Environmental chemical information in PubChem
Environmental chemical information in PubChem Environmental chemical information in PubChem
Environmental chemical information in PubChem
 

Recently uploaded

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 

Recently uploaded (20)

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 

Biologics information in PubChem

  • 1. Biologics information in PubChem Jian Zhang*, Paul Thiessen, Tiejun Cheng, Ben Shoemaker, Evan Bolton, Noel O'Boyle, Roger Sayle 2019 Fall ACS National Meeting and Expo, San Diego
  • 2. Biologics: definitions.. Biologic(s) - Biological products:  a wide range of products such as vaccines, blood and blood components, tissues ..  can be composed of sugars, proteins, lipids, or nucleic acids or complex combinations of these substances  isolated from a variety of natural sources - human, animal, or microorganism ...  produced by biotechnology methods and other cutting-edge technologies...  used to treat a variety of medical conditions for which no other treatments are available...
  • 3. Biologics: large to small • Biologics – large molecules, can be composed of sugars, proteins, or nucleic acids or complex combinations of these substances. • Information of sugars, proteins, nucleic acids, peptides are important for biologic studies. Example: vaccination – virus or bacteria antigen to stimulate the body to produce antibody
  • 4. Biologics: large to small • Biologics extension – small biopolymers (oligomers, or repeat units) play an import role for biologic studies. • Small molecules – atoms < 1000 … biopolymers: components of protein, glycans, nucleotides… • Extended (PubChem) definition: Structure contains recognized biopolymers monomers (glycan, lipids, amino-acid, nucleotide… )
  • 5. Outline • PubChem brief • Biologic information in PubChem - line notations from Sugar & Splice • Data accessing and retrieving • NCBI Glycans • Summary
  • 6. PubChem brief .. • An open chemistry database • A public chemical information repository • A chemical information hub Contents: Chemicals structure, depictions and notations, properties, drug information, food additives, safety, toxicity, target, pathways, bioactivities, literature, patents, and more ..
  • 7. PubChem brief .. • PubChem keep growing .. Data Collection Live Item Count Compounds 95,753,185 Substances 234,916,398 BioAssays 1,340,534 Bioactivities 265,373,498 Gene Targets 58,029 Protein Targets 17,847 Taxonomy Targets 3,746 Literature 29,876,654 Patents 3,142,716
  • 8. Biologics in PubChem Extended (PubChem) definition: structure contains recognized biopolymers, monomers (glycan, amino-acid, nucleotide, lipids …)
  • 9. Biologics in PubChem - examples AZT: CID 35370 CID 1083 CID 138798156 CID 138802788 CID 3687 CID 10008322
  • 10. Biologics in PubChem – Contents PubChem biologic description: • Condensed IUPAC • Sequence • PLN (protein line notation) • HELM (Hierarchical Editing Language for Macromolecules) • LINUCS (LInear Notation for Unique description of Carbohydrate Sequences) • SNFG depictions • IUPAC
  • 11. Biologics in PubChem – examples proc with Sugar&Splice AZT: CID 35370 CID 1083 CID 138798156 CID 138802788 CID 3687 CID 10008322
  • 12. Biologics in PubChem – structure can be very complex E.g. Teriparatide (Teriparatide is a recombinant human parathyroid hormone analogue that is used to treat osteoporosis in women or men with a high risk for bone fracture) https://pubchem.ncbi.nlm.nih.gov/comp ound/Teriparatide
  • 13. Sugar & Slice – Generate line notations for biologics Teriparatide
  • 14. Biologics in PubChem – saccharides example G(M1)-Oligosaccharide: https://pubchem.ncbi.nlm.nih.gov/compound/G(M1)- Oligosaccharide
  • 15. Biologics in PubChem – peptide lipids example (2S,3S)-3-methyl-2-[[(2S)-2-[[2-[[2- (tetradecanoylamino)acetyl]amino]ace tyl]amino]propanoyl]amino]pentanoic acid CID 138810998 https://pubchem.ncbi.nlm.nih.gov/com pound/138810998 Line notations from “Sugar and Splice” simplified the complex structure information which is readable for both human and computer.
  • 16. Biologics in PubChem – browse and download PubChem classification browser: Compound TOC tree https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72
  • 17. 1) Start from the PubChem homepage 2) Click the “browse” data to launch the PubChem classification browser 3) At the dropdown menu, choose “PubChem”, then “PubChem Compound TOC” Biologics in PubChem – browse and download
  • 18. Biologics in PubChem – data accessing • Website: 1. Text search (Google, PubChem .. ) 2. PubChem structure search • Programmatic: Pug_view API Format Heading: Biologic Description CID https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/6918011/ XML?heading=Biologic%20Description
  • 19. Biologics in PubChem – data accessing https://pubchem.ncbi.nlm.nih.gov/compound/6918011#section= Biologic-Description Example: CID 6918011 - Lanreotide
  • 20. Biologics in PubChem – data accessing Format Heading: Biologic Description CID https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/6918011/ XML?heading=Biologic%20Description Pug_view API for the same compound:
  • 21. NCBI Glycans NCBI glycans website was created in 2016 as a joint project from PubChem and the Glycan Informatics Advisory Group (globally). • Glycan information resource summary • Definition for the carbohydrate monomer depictions • Links to various resource tools.
  • 22. NCBI Glycans – https://www.ncbi.nlm.nih.gov/glycans/ The main page contains a brief introduction, links to other pages and external resources Symbol Nomenclature for Glycans (SNFG)
  • 23. NCBI Glycans – https://www.ncbi.nlm.nih.gov/glycans/snfg.html • The SNFG (symbol nomenclature for glycans) page provides carbohydrate monomer depictions, useful resource links, and SNFG examples.
  • 24. NCBI Glycans – SNFG examples
  • 25. NCBI Glycans – https://www.ncbi.nlm.nih.gov/glycans/snfg.html • Monosaccharide monomer abbreviations and names.
  • 26. Summary • PubChem provides biologic information for more than 1.5 million compounds. • The line notation created using “Sugar and Splice” simplified the complex structure information which is readable for both human and computer. • The biologic information in PubChem can be accessed and retrieved in both website and program. • The NCBI glycans website provides a great resource for glycan studies.
  • 27. Thank you. This research was supported by the Intramural Research Program of the NIH, National Library of Medicine. Evan Bolton Asta Gindulyte Ben Shoemaker Paul Thiessen Siqian He Bo Yu Jie Chen Tiejun Cheng Jane He Sunghwan Kim Leon Li Leonid Zaslavsky Collaborators Noel O'Boyle, NextMove Software Roger Sayle, NextMove Software The Glycan Informatics Advisory Group (GlyAG)