PubChem as a Resource for
Chemical Information Education
ACS Fall 2020 Virtual Meeting
August 20, 2020
Sunghwan Kim, Ph.D., M.Sc.
PubChem (https://pubchem.ncbi.nlm.nih.gov)
- Public chemical database.
- Developed and maintained by the U.S. National Institutes of Health.
- Contains various chemical entities:
  • Small molecules
  • siRNAs & miRNAs
  • Carbohydrates
  • Lipids
  • Peptides
  • Chemically modified macromolecules
  • ……
- Collects chemical information from 750+ data sources and disseminates it to the public free of charge.
- 103 million unique chemical structures.
- Crosslinks to many other databases.
- Search, analysis, download, and visualization tools.
- A key resource in many areas:
  • Cheminformatics
  • Chemical biology
  • Medicinal chemistry
  • Drug discovery
PubChem Usage Statistics
[Figure: Unique monthly users (millions) by month, 2016-2020. Source: Google Analytics]
More than 4.3 million unique users per month (Apr. 2020)
Top 5 Chemistry Websites
1. acs.org
2. rsc.org
3. sigmaaldrich.com
4. pubchem.ncbi.nlm.nih.gov
5. cas.org
Source: https://www.alexa.com/topsites/category/Top/Science/Chemistry
PubChem is the only public website among them.
Users by Age (April 2020)
[Figure: Percentage of users by age group (18-24, 25-34, 35-44, 45-54, 55-64, 65+)]
~36% of PubChem users are between 18 and 24 (likely to be college students).
- Popularity:
  Many young people are already using PubChem.
- Sustainability:
  It is sixteen years old and not going away soon.
- Zero-cost (to students):
  U.S. taxpayers have already paid for it.
PubChem has strong potential as an education resource,
especially for small organizations like:
  • primarily undergraduate institutions (PUIs)
  • community colleges (CCs)
What about R1 universities with large endowments?
- They are likely to have access to proprietary databases, but these are:
  • Primarily used for research.
  • Inconvenient to access off campus.
  • No longer available to students once they graduate.
- Most students will eventually rely on public resources.
- Students need training and education opportunities while in school.
Exploring Chemical Information in PubChem
1. Search by chemical name
2. Search by chemical structure
3. Search by gene/protein name
4. PubChem Periodic Table and Element pages
5. Programmatic access
4. PubChem Periodic Table and Element pages
Kim et al., Chem. Teacher International, 2020. doi:10.1515/cti-2020-0006
[Figure: Ionization energy (eV) vs. atomic number (0-100), with labels on the noble gases (He, Ne, Ar, Kr, Xe, Rn) and the alkali metals (Li, Na, K, Rb, Cs, Fr)]
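The periodic trend in the ionization-energy plot can be reproduced with a few lines of code. The sketch below is only an illustration: it hard-codes rounded literature values of the first ionization energy for the first 18 elements (an assumption made so the example runs offline; the full table can be downloaded from the PubChem Periodic Table), and the function name is hypothetical.

```python
# First ionization energies in eV (rounded literature values, hard-coded
# here as an assumption so the example needs no network access).
# Each entry maps symbol -> (atomic number, period, ionization energy).
ELEMENTS = {
    "H": (1, 1, 13.598), "He": (2, 1, 24.587),
    "Li": (3, 2, 5.392), "Be": (4, 2, 9.323), "B": (5, 2, 8.298),
    "C": (6, 2, 11.260), "N": (7, 2, 14.534), "O": (8, 2, 13.618),
    "F": (9, 2, 17.423), "Ne": (10, 2, 21.565),
    "Na": (11, 3, 5.139), "Mg": (12, 3, 7.646), "Al": (13, 3, 5.986),
    "Si": (14, 3, 8.152), "P": (15, 3, 10.487), "S": (16, 3, 10.360),
    "Cl": (17, 3, 12.968), "Ar": (18, 3, 15.760),
}


def extremes_by_period(elements):
    """Return {period: (symbol with lowest IE, symbol with highest IE)}."""
    by_period = {}
    for symbol, (_, period, energy) in elements.items():
        by_period.setdefault(period, []).append((energy, symbol))
    # min()/max() compare the (energy, symbol) tuples by energy first.
    return {
        period: (min(rows)[1], max(rows)[1])
        for period, rows in sorted(by_period.items())
    }


trend = extremes_by_period(ELEMENTS)
for period, (lowest, highest) in trend.items():
    print(f"Period {period}: lowest = {lowest}, highest = {highest}")
```

Within each period the highest value belongs to the noble gas, and in periods 2 and 3 the lowest belongs to the alkali metal, which matches the labeled peaks and valleys in the figure.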
5. Programmatic access
Why should students learn programmatic access?
- PubChem users have very diverse backgrounds and interests.
- PubChem’s web interfaces are optimized to perform commonly requested tasks interactively.
- Everything you can do with PubChem through the web browser can be automated through PubChem’s programmatic interfaces.
- Programmatic access enables much more complicated and specialized tasks that cannot be done through the web browser.
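As a rough illustration of automating a chemical name search, the sketch below builds PUG-REST request URLs for a list of names and extracts compound identifiers (CIDs) from a response. The URL pattern follows the public PUG-REST documentation as I understand it; the helper names and the sample response are illustrative assumptions, and no network request is made.

```python
import json
from urllib.parse import quote

PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def name_to_cids_url(name):
    """Build a PUG-REST URL that asks for the CIDs matching a chemical name."""
    # Percent-encode the name so that spaces and symbols are URL-safe.
    return f"{PUG_REST}/compound/name/{quote(name, safe='')}/cids/JSON"


def parse_cids(response_text):
    """Extract the CID list from a PUG-REST IdentifierList JSON response."""
    data = json.loads(response_text)
    return data.get("IdentifierList", {}).get("CID", [])


# Illustrative response body (hand-written, not fetched from PubChem).
sample_response = '{"IdentifierList": {"CID": [2244]}}'

urls = [name_to_cids_url(n) for n in ["aspirin", "acetic acid"]]
cids = parse_cids(sample_response)
```

In a real script, each URL would be fetched with an HTTP client and the response text passed to `parse_cids`, so the same two functions scale from one name to thousands.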
- Programming skills are essential for:
  • automating routine tasks
  • processing and analyzing large data sets
- These are important skills for students pursuing STEM careers in the age of big data.
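One routine task that benefits from automation is retrieving a property for many compounds at once. The sketch below splits a long list of CIDs into batches and builds one PUG-REST property URL per batch. The batch size of 100 and the helper names are arbitrary assumptions for illustration, and the CIDs are placeholders.

```python
from typing import Iterator, List

PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def chunked(items: List[int], size: int) -> Iterator[List[int]]:
    """Yield successive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def batch_property_urls(cids: List[int], prop: str, size: int = 100) -> List[str]:
    """Build one PUG-REST property URL per batch of CIDs."""
    urls = []
    for batch in chunked(cids, size):
        # PUG-REST accepts a comma-separated list of CIDs in the URL path.
        cid_list = ",".join(str(cid) for cid in batch)
        urls.append(f"{PUG_REST}/compound/cid/{cid_list}/property/{prop}/CSV")
    return urls


# 250 placeholder CIDs split into batches of 100: 3 requests instead of 250.
example_urls = batch_property_urls(list(range(1, 251)), "MolecularWeight")
```

When the URLs are actually fetched, a short pause between requests is advisable so that the script stays within PubChem's request-rate limits.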
Programmatic Access to PubChem
- Multiple programmatic access routes.
- Two major programmatic access methods:
  • PUG-REST (primarily for computed properties).
    Kim et al., Nucleic Acids Res. 2018, 46(W1):W563-570.
    https://pubchemdocs.ncbi.nlm.nih.gov/pug-rest
  • PUG-View (primarily for text information).
    Kim et al., J. Cheminform. 2019, 11:56.
    https://pubchemdocs.ncbi.nlm.nih.gov/pug-view
- Jupyter Notebooks containing sample code (in Python/R) are freely available at LibreTexts:
  https://chem.libretexts.org/link?143689
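To make the difference between the two interfaces concrete, the sketch below builds one request URL for each and parses a PUG-REST property response. The URL patterns reflect the public documentation as I understand it; the function names and the sample JSON are illustrative assumptions, and nothing is fetched from the network.

```python
import json

PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"
PUG_VIEW = "https://pubchem.ncbi.nlm.nih.gov/rest/pug_view"


def pug_rest_property_url(cid, properties):
    """PUG-REST: computed properties for one compound, returned as JSON."""
    return f"{PUG_REST}/compound/cid/{cid}/property/{','.join(properties)}/JSON"


def pug_view_compound_url(cid):
    """PUG-View: the annotated (mostly textual) record for one compound."""
    return f"{PUG_VIEW}/data/compound/{cid}/JSON"


def first_property_row(response_text):
    """Return the first row of a PUG-REST PropertyTable response as a dict."""
    return json.loads(response_text)["PropertyTable"]["Properties"][0]


# Hand-written sample in the PropertyTable shape (aspirin, CID 2244).
sample = '{"PropertyTable": {"Properties": [{"CID": 2244, "MolecularFormula": "C9H8O4"}]}}'

rest_url = pug_rest_property_url(2244, ["MolecularFormula", "MolecularWeight"])
view_url = pug_view_compound_url(2244)
row = first_property_row(sample)
```

PUG-REST suits tabular, per-compound values, whereas PUG-View returns a nested document of headings and annotations that usually needs to be walked section by section.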
Cheminformatics On-Line Chemistry Course (OLCC)
Kim et al., J. Chem. Educ., 2020, submitted.
Cheminformatics OLCC
Unique challenges to teaching cheminformatics
- Cheminformatics is not an established chemistry field.
- Chemistry + Informatics + Computer Science + Library Science + Pharmaceutical Science + ……
- Few faculty members have cheminformatics expertise.
- No textbook is suitable for undergraduate chemistry students.
The Cheminformatics OLCC addresses these issues!
[Diagram: how the Cheminformatics OLCC works]
- Cheminformatics experts prepare online reading materials & homework problem sets (course website).
- Course instructors and students run the course using the course materials at multiple schools (face-to-face meetings).
- Online discussion among experts, instructors, & students through the website.
The course was offered three times:
- Fall 2015: 36 students from 4 schools
- Spring 2017: 47 students from 9 schools
- Fall 2019: 23 students from 5 schools
All course materials are available at:
- CCCE website (http://olcc.ccce.divched.org)
- LibreTexts (https://libretexts.org), a free online textbook site
Many of the course materials cover PubChem data, tools, and services.
PubChem-related topics in Cheminformatics OLCC
- Critical assessment of chemical information
- Chemical representations (e.g., InChI and SMILES)
  • As alternatives to chemical name queries
  • For chemical data exchange/integration/sharing
- Search by chemical name
- Search by chemical structure
  • Identity search
  • 2-D/3-D similarity search
  • Substructure/superstructure search
  • Molecular formula search
- Structure clustering and structure-activity relationship analysis
- Automation of chemical data retrieval through computer code
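Several of these topics combine naturally: a SMILES string can stand in for a chemical name as the query of a structure search. The sketch below builds PUG-REST URLs for identity, 2-D similarity, and substructure searches. The operation names (`fastidentity`, `fastsimilarity_2d`, `fastsubstructure`) and the `Threshold` option are taken from the PUG-REST documentation as I recall it and should be checked there before use; the function names are hypothetical.

```python
from urllib.parse import quote

PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def _smiles_search_url(operation, smiles):
    """Shared URL builder; the SMILES is percent-encoded for use in a path."""
    return f"{PUG_REST}/compound/{operation}/smiles/{quote(smiles, safe='')}/cids/JSON"


def identity_search_url(smiles):
    """Find compounds with the same structure as the query."""
    return _smiles_search_url("fastidentity", smiles)


def similarity_search_url(smiles, threshold=90):
    """Find compounds whose 2-D similarity to the query meets the threshold (%)."""
    return _smiles_search_url("fastsimilarity_2d", smiles) + f"?Threshold={threshold}"


def substructure_search_url(smiles):
    """Find compounds that contain the query as a substructure."""
    return _smiles_search_url("fastsubstructure", smiles)


# Ethanol (CCO) and formaldehyde (C=O) as small example queries.
ethanol_identity = identity_search_url("CCO")
ethanol_similar = similarity_search_url("CCO", threshold=95)
carbonyl_sub = substructure_search_url("C=O")
```

Percent-encoding matters because SMILES routinely contains characters such as `=`, `#`, and `/` that have special meaning in URLs.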
Summary
- Many PubChem users are likely to be college students.
- PubChem has strong potential as a resource for chemical information training because of its:
  • popularity
  • sustainability
  • low cost
- PubChem supports various use cases beyond simple chemical name search:
  • Search by chemical structure
  • Search by gene/protein name
  • PubChem Periodic Table and Element pages
  • Programmatic access
- PubChem works with the chemical education community to provide chemical information training for students.
- Please reach out to us if you are interested in collaborating.
Acknowledgements
- The PubChem Team:
  Evan Bolton
  Jie Chen
  Tiejun Cheng
  Asta Gindulyte
  Jia He
  Siqian He
  Qingliang (Leon) Li
  Benjamin Shoemaker
  Paul Thiessen
  Olga Pujolras
  Bo Yu
  Leonid Zaslavsky
  Jian (Jeff) Zhang
  Zhi (Leon) Sun
- PubChem users, depositors, and collaborators
- Funded by the National Library of Medicine
Thank you!
Questions?
Sunghwan Kim, Ph.D., M.Sc.
Email: sunghwan.kim@nih.gov
SlideShare: https://www.slideshare.net/SunghwanKim95/presentations

 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 

PubChem as a resource for chemical information education

  • 1. PubChem as a Resource for Chemical Information Education. ACS Fall 2020 Virtual Meeting, August 20, 2020. Sunghwan Kim, Ph.D., M.Sc.
  • 2. PubChem (https://pubchem.ncbi.nlm.nih.gov): Public chemical database. Developed and maintained by the U.S. National Institutes of Health. Contains various chemical entities: small molecules, siRNAs and miRNAs, carbohydrates, lipids, peptides, chemically modified macromolecules, and others.
  • 3. PubChem (https://pubchem.ncbi.nlm.nih.gov): Collects chemical information from 750+ data sources and disseminates it to the public free of charge. 103 million unique chemical structures. Crosslinks to many other databases. Search, analysis, download, and visualization tools. A key resource in many areas: cheminformatics, chemical biology, medicinal chemistry, and drug discovery.
  • 4. PubChem Usage Statistics. [Figure: unique monthly users (millions) by month, 2016 to 2020.] >4.3 million unique users per month (Apr. 2020). Source: Google Analytics.
  • 5. PubChem Usage Statistics. Top 5 chemistry websites: (1) acs.org, (2) rsc.org, (3) sigmaaldrich.com, (4) pubchem.ncbi.nlm.nih.gov, (5) cas.org. PubChem is the only public website among them. Source: https://www.alexa.com/topsites/category/Top/Science/Chemistry
  • 6. PubChem Usage Statistics. [Figure: users by age (April 2020), percentage of users in the age groups 18-24, 25-34, 35-44, 45-54, 55-64, and 65+.] ~36% of PubChem users are between 18 and 24 (likely to be college students).
  • 7. Popularity: Many young people are already using PubChem. Sustainability: It is sixteen years old and not going away soon. Zero cost (to students): U.S. taxpayers have already paid for it.
  • 8. Popularity: Many young people are already using PubChem. Sustainability: It is sixteen years old and not going away soon. Zero cost (to students): U.S. taxpayers have already paid for it. Strong potential as an education resource, especially for small organizations such as primarily undergraduate institutions (PUIs) and community colleges (CCs).
  • 9. How about R1 universities with large endowments? They are likely to have access to proprietary databases, but these are primarily used for research, off-campus access is inconvenient, and students lose access when they graduate. Most students will eventually rely on public resources, so they need training and education opportunities while in school.
  • 10. Exploring Chemical Information in PubChem: (1) Search by chemical name; (2) Search by chemical structure; (3) Search by gene/protein name; (4) PubChem Periodic Table and Element pages; (5) Programmatic access.
  • 34. Kim et al., Chem. Teacher International, 2020. doi:10.1515/cti-2020-0006
  • 39. [Figure: ionization energy (eV) plotted against atomic number (0 to 100), with the noble gases (He, Ne, Ar, Kr, Xe, Rn) and alkali metals (Li, Na, K, Rb, Cs, Fr) labeled.] Kim et al., Chem. Teacher International, 2020. doi:10.1515/cti-2020-0006
  • 41. Why should students learn programmatic access? PubChem users have very diverse backgrounds and interests. PubChem’s web interfaces are optimized to perform commonly requested tasks interactively. Everything you can do with PubChem through the web browser can be automated through PubChem’s programmatic interfaces. Programmatic access also enables much more complicated and specialized tasks that cannot be done through the web browser.
  • 42. Why should students learn programmatic access? Programming skills are essential for automating routine tasks and for processing and analyzing large data sets. These are important skills for students pursuing STEM careers in the age of big data.
  • 43. Programmatic Access to PubChem: There are multiple programmatic access routes, with two major methods: PUG-REST (primarily for computed properties; Kim et al., Nucleic Acids Res. 2018, 46(W1):W563-570; https://pubchemdocs.ncbi.nlm.nih.gov/pug-rest) and PUG-View (primarily for text information; Kim et al., J. Cheminform. 2019, 11:56; https://pubchemdocs.ncbi.nlm.nih.gov/pug-view). Jupyter Notebooks containing sample code (in Python/R) are freely available at LibreTexts: https://chem.libretexts.org/link?143689
  • 44. Cheminformatics On-Line Chemistry Course (OLCC). Kim et al., J. Chem. Educ., 2020, submitted.
  • 45. Cheminformatics OLCC. Unique challenges to teaching cheminformatics: Cheminformatics is not an established chemistry field. It combines chemistry, informatics, computer science, library science, pharmaceutical science, and more. Few faculty members have cheminformatics expertise. No textbook is suitable for undergraduate chemistry students.
  • 46. Cheminformatics OLCC. Unique challenges to teaching cheminformatics: Cheminformatics is not an established chemistry field. It combines chemistry, informatics, computer science, library science, pharmaceutical science, and more. Few faculty members have cheminformatics expertise. No textbook is suitable for undergraduate chemistry students. The Cheminformatics OLCC addresses these issues!
  • 47. Cheminformatics OLCC. Cheminformatics experts prepare online reading materials and homework problem sets for the course website.
  • 48. Cheminformatics OLCC. Cheminformatics experts prepare online reading materials and homework problem sets for the course website. Course instructors and students at multiple schools run the course using the course materials.
  • 49. Cheminformatics OLCC. Cheminformatics experts prepare online reading materials and homework problem sets for the course website. Course instructors and students at multiple schools run the course using the course materials, with face-to-face meetings. Experts, instructors, and students hold online discussions through the website.
  • 50. Cheminformatics OLCC. It was offered three times: Fall 2015 (36 students from 4 schools), Spring 2017 (47 students from 9 schools), and Fall 2019 (23 students from 5 schools). All course materials are available at the CCCE website (http://olcc.ccce.divched.org) and at LibreTexts (https://libretexts.org), a free online textbook site. Many of the course materials cover PubChem data, tools, and services.
  • 51. Cheminformatics OLCC. PubChem-related topics in the Cheminformatics OLCC: critical assessment of chemical information; chemical representations (e.g., InChI and SMILES), as alternatives to chemical name queries and for chemical data exchange, integration, and sharing; search by chemical name; search by chemical structure (identity search, 2-D/3-D similarity search, substructure/superstructure search, molecular formula search); structure clustering and structure-activity relationship analysis; automation of chemical data retrieval through computer code.
  • 52. Summary. Many PubChem users are likely to be college students. PubChem has strong potential as a resource for chemical information training because of its popularity, sustainability, and low cost.
  • 53. Summary. PubChem supports various use cases beyond simple chemical name search: search by chemical structure, search by gene/protein name, PubChem Periodic Table and Element pages, and programmatic access.
  • 54. Summary. PubChem works with the chemical education community to provide chemical information training for students. Please reach out to us for collaboration if you are interested.
  • 55. Acknowledgements: Evan Bolton, Jie Chen, Tiejun Cheng, Asta Gindulyte, Jia He, Siqian He, Qingliang (Leon) Li, Benjamin Shoemaker, Paul Thiessen, Olga Pujolras, Bo Yu, Leonid Zaslavsky, Jian (Jeff) Zhang, Zhi (Leon) Sun; the PubChem Team; PubChem users, depositors, and collaborators. Funded by the National Library of Medicine.
  • 56. Thank you! Questions? Sunghwan Kim, Ph.D., M.Sc. Email: sunghwan.kim@nih.gov. SlideShare: https://www.slideshare.net/SunghwanKim95/presentations
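
Slide 43 names PUG-REST as the main route for retrieving computed properties, and slide 51 lists SMILES-based structure searches as course topics. The sketch below is one possible way a student might build such requests in Python; it is not taken from the talk or from the LibreTexts notebooks. The function names (`property_url`, `structure_search_url`, `parse_properties`, `fetch_properties`) are hypothetical helpers introduced here for illustration, and the URL layout and response shape follow the PUG-REST documentation as understood at the time of writing, so they should be checked against the current documentation before use.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base address of the PUG-REST service.
PUG_REST_BASE = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def property_url(name, properties):
    """Build a URL asking for computed properties of a compound given by name."""
    props = ",".join(properties)
    return f"{PUG_REST_BASE}/compound/name/{quote(name, safe='')}/property/{props}/JSON"


def structure_search_url(smiles, search_type="fastidentity"):
    """Build a URL asking for the CIDs returned by a SMILES-based search.

    search_type is assumed to be one of the PUG-REST fast search operations,
    for example "fastidentity", "fastsimilarity_2d", or "fastsubstructure".
    """
    encoded = quote(smiles, safe="")  # percent-encode characters such as "(" and "="
    return f"{PUG_REST_BASE}/compound/{search_type}/smiles/{encoded}/cids/JSON"


def parse_properties(payload):
    """Return the list of per-compound property records from a decoded response."""
    return payload["PropertyTable"]["Properties"]


def fetch_properties(name, properties, timeout=30):
    """Send the request and return the property records (requires network access)."""
    with urlopen(property_url(name, properties), timeout=timeout) as response:
        return parse_properties(json.load(response))
```

For example, `fetch_properties("aspirin", ["MolecularFormula", "MolecularWeight"])` would send a single request and return a list of dictionaries, one per matching compound. SMILES strings containing characters such as "/" or "#" may need to be sent as a URL argument or in a POST body rather than in the path; the PUG-REST documentation describes those options.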