SlideShare a Scribd company logo
ChemSpider – Building an Online Database of Open Spectra
Antony J. Williams and Valery Tkachenko
ChemSpider, Royal Society of Chemistry, info@chemspider.com
Introduction
ChemSpider is an online database of
almost 25 million unique chemical
compounds sourced from almost 400
different sources including government
laboratories, chemical vendors, public
resources and publications. Developed
with the intention of building community
for chemists ChemSpider allows its users
to deposit data including structures,
properties, links to external resources and
various forms of spectral data. Over the
past three years ChemSpider has
aggregated almost 3000 high quality NMR
spectra and continues to expand as the
community deposits additional data.
Free Resources of NMR Data
There have been a number of efforts in
recent years to aggregate and make
available NMR data to the community. A
number of commercial databases exist of
structures with associated NMR
assignments. These include databases
from ACD/Labs,Wiley/Chemical
Concepts, Bio-Rad and others. The
NMRShift database [1] from EBI contains
over 35,000 structures with multinuclear
assignments extracted from over 40,000
spectra. However, databases of NMR
spectral curves are less common and
generally limited to those made available
by academics and especially those from
groups generating metabonomics data
(for example, the BMRB [2] and DrugBank
[3]). One component of the ChemSpider
project is to gather, host and make
available a structure searchable database
of spectral data: 1D/2D NMR, IR, Raman
and Mass Spectrometry.
Sources of Spectral Data
While efforts have been made to harvest
NMR data from various websites, with
permission, the majority of the data are
deposited by users of ChemSpider.
Submission of spectra in the form of
JCAMP-DX (for 1D spectra) or
images/PDF (for 1D or 2D spectra) are
supported. In order to deposit a
spectrum a user simply searches
ChemSpider for the associated structure
and uploads the JCAMP-DX or image
form of the spectrum and it is available
to the community immediately.
Community-based curators will validate
and annotate the data as appropriate to
ensure that only the highest quality data
are available on the database.
Figure 1: The spectrum upload page
Spectral Visualization
1D NMR data are viewed inside the
JSpecView spectral display applet.
Zooming, scrolling and integration of the
data are possible. 2D NMR spectra are
viewed as images only at present.
Figure 2: The JSpecView Applet
Data Downloads
The majority of spectral data are
licensed as Open Data allowing it to be
downloaded and reused in
presentations, lesson plans and for
teaching purposes. A web service is
available to allow the data to be
integrated into other systems.
The Spectral Game
Using a ChemSpider web service as a
basis Bradley et al. [3] have set up an
online game to learn how to interpret
NMR spectra. The game is at
www.spectralgame.com and has been
played by thousands of students
around the world.
Figure 3: The Spectral Game
A Call to Action
The ChemSpider database grows only
as a result of participation by the
community. We encourage you to
deposit and share your non-proprietary
spectral data with the community to
benefit chemists and spectroscopists
everywhere. Feel free to contact us at
info@chemspider.com to assist you in
depositing data. Whether it is an
individual spectrum or many hundreds
we are interested in hosting your data.
Future Directions
We intend to continue to grow the
spectral database by encouraging
further depositions from the
community. A 2D NMR Spectral
Game is available as a proof of
concept at
http://spectralgame.com/2d/. We have
recently introduced a beta version of
NMR prediction integrated to the
NMRShiftDB prediction algorithms.
Conclusions
ChemSpider is an online structure
database allowing the community to
participate in the deposition of
additional data. A growing NMR
spectral curve data collection is
available to download and use but
users are encouraged to contribute
back to the database and share their
own data with the community. In this
way a major reference source of
Open NMR data can be provided.
References
1)NMRShiftDB:
http://www.ebi.ac.uk/nmrshiftdb/
2)Biological Magnetic Resonance
Bank: http://www.bmrb.wisc.edu/
3)DrugBank: http://www.drugbank.ca/
4)The JSpecView Project: an Open
Source Java viewer and converter for
JCAMP-DX, and XML spectral data
files, R.J. Lancashire,
http://www.journal.chemistrycentral.co
m/content/1/1/31
5) The Spectral Game: leveraging
Open Data and crowdsourcing for
education, J.C. Bradley et al.,
http://www.jcheminf.com/content/1/1/9
www.chemspider.com

More Related Content

Similar to Chem Spider Building An Online Database Of Open Spectra

ChemSpider, how a free community resource of data can support teaching NMR sp...
ChemSpider, how a free community resource of data can support teaching NMR sp...ChemSpider, how a free community resource of data can support teaching NMR sp...
ChemSpider, how a free community resource of data can support teaching NMR sp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Teaching NMR spectroscopy using online resources from rsc
Teaching NMR spectroscopy using online resources from rscTeaching NMR spectroscopy using online resources from rsc
Building a Community Resource of Open Spectral Data
Building a Community Resource of Open Spectral DataBuilding a Community Resource of Open Spectral Data
RSC ChemSpider as an environment for teaching and sharing chemistry
RSC ChemSpider as an environment for teaching and sharing chemistryRSC ChemSpider as an environment for teaching and sharing chemistry
RSC ChemSpider as an environment for teaching and sharing chemistry
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P MAURYA
 
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...
Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...
Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure verification and elucidation using the ChemSpider database
Structure verification and elucidation using the ChemSpider databaseStructure verification and elucidation using the ChemSpider database
Structure verification and elucidation using the ChemSpider database
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Microarrays Databases.pptx
Microarrays Databases.pptxMicroarrays Databases.pptx
Microarrays Databases.pptx
Muzzamilahmed14
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider Crawling Across the Web of Chemistry Using ChemSpider
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them..."Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
Tom Moritz
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care?
Robert Grossman
 
RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...
Kamel Mansouri
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...
Valery Tkachenko
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Ken Karapetyan
 

Similar to Chem Spider Building An Online Database Of Open Spectra (20)

ChemSpider, how a free community resource of data can support teaching NMR sp...
ChemSpider, how a free community resource of data can support teaching NMR sp...ChemSpider, how a free community resource of data can support teaching NMR sp...
ChemSpider, how a free community resource of data can support teaching NMR sp...
 
Teaching NMR spectroscopy using online resources from rsc
Teaching NMR spectroscopy using online resources from rscTeaching NMR spectroscopy using online resources from rsc
Teaching NMR spectroscopy using online resources from rsc
 
Building a Community Resource of Open Spectral Data
Building a Community Resource of Open Spectral DataBuilding a Community Resource of Open Spectral Data
Building a Community Resource of Open Spectral Data
 
RSC ChemSpider as an environment for teaching and sharing chemistry
RSC ChemSpider as an environment for teaching and sharing chemistryRSC ChemSpider as an environment for teaching and sharing chemistry
RSC ChemSpider as an environment for teaching and sharing chemistry
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
 
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
 
Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...
Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...
Validating the ChemSpider Open Spectral Database NMR Collection using ACD/Lab...
 
Structure verification and elucidation using the ChemSpider database
Structure verification and elucidation using the ChemSpider databaseStructure verification and elucidation using the ChemSpider database
Structure verification and elucidation using the ChemSpider database
 
Microarrays Databases.pptx
Microarrays Databases.pptxMicroarrays Databases.pptx
Microarrays Databases.pptx
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
 
Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider
 
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them..."Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care?
 
RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...
 
Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
 
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
 
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
 

Recently uploaded

The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 

Recently uploaded (20)

The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 

Chem Spider Building An Online Database Of Open Spectra

  • 1. ChemSpider – Building an Online Database of Open Spectra Antony J. Williams and Valery Tkachenko ChemSpider, Royal Society of Chemistry, info@chemspider.com Introduction ChemSpider is an online database of almost 25 million unique chemical compounds sourced from almost 400 different sources including government laboratories, chemical vendors, public resources and publications. Developed with the intention of building community for chemists ChemSpider allows its users to deposit data including structures, properties, links to external resources and various forms of spectral data. Over the past three years ChemSpider has aggregated almost 3000 high quality NMR spectra and continues to expand as the community deposits additional data. Free Resources of NMR Data There have been a number of efforts in recent years to aggregate and make available NMR data to the community. A number of commercial databases exist of structures with associated NMR assignments. These include databases from ACD/Labs,Wiley/Chemical Concepts, Bio-Rad and others. The NMRShift database [1] from EBI contains over 35,000 structures with multinuclear assignments extracted from over 40,000 spectra. However, databases of NMR spectral curves are less common and generally limited to those made available by academics and especially those from groups generating metabonomics data (for example, the BMRB [2] and DrugBank [3]). One component of the ChemSpider project is to gather, host and make available a structure searchable database of spectral data: 1D/2D NMR, IR, Raman and Mass Spectrometry. Sources of Spectral Data While efforts have been made to harvest NMR data from various websites, with permission, the majority of the data are deposited by users of ChemSpider. Submission of spectra in the form of JCAMP-DX (for 1D spectra) or images/PDF (for 1D or 2D spectra) are supported. In order to deposit a spectrum a user simply searches ChemSpider for the associated structure and uploads the JCAMP-DX or image form of the spectrum and it is available to the community immediately. Community-based curators will validate and annotate the data as appropriate to ensure that only the highest quality data are available on the database. Figure 1: The spectrum upload page Spectral Visualization 1D NMR data are viewed inside the JSpecView spectral display applet. Zooming, scrolling and integration of the data are possible. 2D NMR spectra are viewed as images only at present. Figure 2: The JSpecView Applet Data Downloads The majority of spectral data are licensed as Open Data allowing it to be downloaded and reused in presentations, lesson plans and for teaching purposes. A web service is available to allow the data to be integrated into other systems. The Spectral Game Using a ChemSpider web service as a basis Bradley et al. [3] have set up an online game to learn how to interpret NMR spectra. The game is at www.spectralgame.com and has been played by thousands of students around the world. Figure 3: The Spectral Game A Call to Action The ChemSpider database grows only as a result of participation by the community. We encourage you to deposit and share your non-proprietary spectral data with the community to benefit chemists and spectroscopists everywhere. Feel free to contact us at info@chemspider.com to assist you in depositing data. Whether it is an individual spectrum or many hundreds we are interested in hosting your data. Future Directions We intend to continue to grow the spectral database by encouraging further depositions from the community. A 2D NMR Spectral Game is available as a proof of concept at http://spectralgame.com/2d/. We have recently introduced a beta version of NMR prediction integrated to the NMRShiftDB prediction algorithms. Conclusions ChemSpider is an online structure database allowing the community to participate in the deposition of additional data. A growing NMR spectral curve data collection is available to download and use but users are encouraged to contribute back to the database and share their own data with the community. In this way a major reference source of Open NMR data can be provided. References 1)NMRShiftDB: http://www.ebi.ac.uk/nmrshiftdb/ 2)Biological Magnetic Resonance Bank: http://www.bmrb.wisc.edu/ 3)DrugBank: http://www.drugbank.ca/ 4)The JSpecView Project: an Open Source Java viewer and converter for JCAMP-DX, and XML spectral data files, R.J. Lancashire, http://www.journal.chemistrycentral.co m/content/1/1/31 5) The Spectral Game: leveraging Open Data and crowdsourcing for education, J.C. Bradley et al., http://www.jcheminf.com/content/1/1/9 www.chemspider.com