SlideShare a Scribd company logo
Looking closer at the failed samples provides some insight to the nature of the failure.  ID #3 failed specifically because of an unassigned 2D peak appearing at 17.8 ppm and 2.2 ppm.  It was determined that this peak was due to the aromatic methyl group in the structure.  The reason for a lack of consistency between the predicted and experimental chemical shifts in this case were due to a slow rotation around the N-CO bond.  As a result, this rotamer produces an experimental spectrum that looks like a mixture.  The software was unable to accurately predict this mixture of forms based on the experimental conditions, and as a result, the predicted spectrum did not match the experimental and the sample was flagged for manual analysis.  Figure 2.   ID #3 was a false negative as the software was unable to assign the methyl group (highlighted in blue) due to slow rotation around the N-CO bond.  The presence of this rotamer resulted in an inconsistency between the experimental and predicted chemical shifts.  The software also flagged the sample ID #7 to be considered for closer inspection.  In this particular case, the issue with this spectrum was determined to be based on the presence of a mixture.  Based on some of the spectral features in both the  1 H and HSQC–DEPT spectra, it is believed that some of the product had converted to an alcohol resulting in a mixture of both the brominated and hydroxylated products.  As a result, the software correctly identified this spectrum as not being consistent with the proposed structure and it was flagged for manual analysis. Figure 3.   ID #7 was flagged as an ambiguous result.  Spectral features in the  1 H and HSQC–DEPT dataset suggest a mixture between the two compounds shown above.  The final sample in the blind test set (ID #10) was also flagged by the software.  In this particular case, the software was unable to identify two protons in the experimental spectrum because the software correctly set a dark region over a large water peak.  Unfortunately, the creation of this dark region resulted in the exclusion of an important multiplet in the experimental spectrum that was in close proximity to the water peak.  Because of this dark region, the software was unable to confirm a match between the spectrum and structure. Table 1.   The results of the 19 Aldrich datasets.  For this dataset, 13 of the 19 datasets (69%) were automatically evaluated by the software.  The software was unable to confirm the proposed structure in Figure 1 (ID #8).  Upon closer inspection it was observed that the software failed because the experimental peak located at 4.74 ppm corresponding to atom #11 in the proposed structure had an integration value that was too low to assign correctly.  As a result, the software flagged this result as ambiguous.  It was later observed that the low integration value could be due to enol formation.  A longer relaxation delay may have more adequately prepared the experimental spectrum for automatic evaluation by the software.  Figure 1.   ID #8 was a false negative because the integration value for the multiplet at 4.74 ppm was too low.  A second compound, ID #11, was rejected by the software as well.  Upon closer inspection of the experimental data, it was determined that the purity of this compound was not sufficient and that the sample contained several different components.  In these two particular examples, the software did a good job of flagging the two problem spectra that required a closer look.  Results of the Blind Test After the previous results and settings had been agreed upon, a blind test set of 10 compounds was run through the system in the exact same fashion.  No changes to the processing or verification parameters were made.  The results of this test are shown in table 2.  Table 2.   The results of the 10 blind Aldrich datasets.  For this dataset, 7 of the 10 datasets (70%) were automatically evaluated by the software.  Ryan Sasaki, Brent Lefebvre, Antony J. Williams, and Sergey Golotvin Advanced Chemistry Development, Inc. Toronto, ON, Canada 110 Yonge Street, 14 th  floor, Toronto Ontario, Canada  M5C 1T4 Tel: (416) 368-3435 Fax: (416) 368-5596 Toll Free: 1-800-304-3988 Email: info@acdlabs.com   Validating Automated Structure Confirmation in a Blind Study Figure 6 .  ID #10 failed due to the close proximity of an important multiplet to an intense water peak in the experimental spectrum.  CONCLUSIONS The goal of this study was to evaluate a fully automated NMR processing and structure verification workflow for a blind test set of compounds.  The processing and evaluation settings for a typical group of samples was set up using a pilot set of 19 compounds.  Once these settings were adjusted, they were used to automatically process and evaluate a blind set of 10 compounds that were prepared under the same conditions.  The results revealed that this completely automated system could reduce the interpretation workload of a spectroscopist by up to 90% if problems with rotomers and impurities are filtered out before the NMR Verification step, up to 70% when these problem samples are left in.  This study highlighted several examples where datasets were flagged by the software for closer inspection by a spectroscopist.  These particular examples illustrate the software’s discrimination ability that help reduce the risk of false positives.  The results of this blind study suggested that a fully automated processing and interpretation system can perform sufficiently in an industrial environment.  ACKNOWLEDGEMENTS The authors would like to acknowledge Dr. Timothy D. Spitzer and Randy D. Rutkowske of GlaxoSmithKline for providing us with NMR data for the compounds in this study.  REFERENCES 1. Automated Structure Verification Based on  1 H NMR Prediction,  Sergey S. Golotvin, Eugene Vodopianov, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer .  Magn. Reson. Chem.  2006; 44: 524-538.  2. Automated Evaluation of a Chemical Structure with Only 1D  1 H and 2D  1 H– 13 C HSQC,  Sergey S. Golotvin, Eugene Vodopianov, Rostislav Pol, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer  .  ENC Poster  2006.   INTRODUCTION In previous work, we have presented several findings on the automated evaluation of chemical structures using  1 H,  13 C, and 2D NMR verification algorithms. 1–2   These studies have shown that these systems have performed extremely well through numerous challenges.  The current study focuses not only on the performance of the verification algorithms but also on the automated preparation of experimental data through a blind test.  This study was designed to prove that such a system would hold up in an industrial environment without any human intervention.  This study consisted of two distinct sets of structures and spectra.  The first contained 19 spectra sets (each dataset contained 1D  1 H and 2D HSQC spectra) that were provided ahead of time for adjustment of processing settings and options.  This step was necessary to identify the best software settings based on the instrument and data collection practices for the laboratory where the samples were prepared and run.  Once the first set was run through the system and results of the verification procedure obtained, the second, blind test, was performed on 10 distinct datasets (with chemical structures) that were not available to the software or the software operators in advance.  The details and results of these two tests are presented here, along with a comprehensive look at the structures that could not be confirmed.  Setting Up Ideal Processing and Evaluation Parameters In order to have a system that can run without human intervention, automated processing and structure verification procedures (macros) must be created in the software to perform these tasks.  The raw 1D and 2D NMR datasets for 19 Aldrich compounds were first evaluated using ACD/Labs’ standard macros.  These settings proved to be non-sufficient as the datasets contained several abnormally broad water peaks and low signal-to-noise ratios.  These macros were then modified to exclude these water peaks and set more stringent peak picking guidelines to combat the S/N issues.  The second attempt was improved but had some issues with the referencing in one of the 2D datasets.  In addition, the 1D spectra were not well-resolved, resulting in an inaccurate evaluation of some multiplets.  These issues were rectified by decreasing the line broadening setting in the software by a factor of 10.  Following this modification, the settings were then deemed to be sufficient.  Results of the First Test An explanation of the combined verification algorithms used to evaluate spectrum-to-structure matches have been previously reported. 2   Following the modification of ACD/Labs’ standard macros explained in the previous section, the raw data of the 19 Aldrich compounds were fully processed and evaluated automatically.  The results revealed that the software was able to successfully evaluate 13 of the 19 datasets provided.  In other words, for this particular dataset, 69% of the samples were automatically evaluated by software without any human intervention.  The remaining 6 samples would require manual analysis by an NMR Spectroscopist as the software had flagged them as being either inconsistent or incorrect (Table 1).   Of these 6 samples, it was concluded that 4 of the false negatives were a result of algorithm errors that have been fixed in Version 10 of the software (ID# 6, 12, 13, and 15).  The other two ambiguous results require a closer look to be explained.

More Related Content

What's hot

Selected ion flow tube MS - Online quantitative VOC analysis
Selected ion flow tube MS - Online quantitative VOC analysisSelected ion flow tube MS - Online quantitative VOC analysis
Selected ion flow tube MS - Online quantitative VOC analysis
IS-X
 
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Shimadzu Scientific Instruments
 
Archana B lc ppt
Archana B lc pptArchana B lc ppt
Archana B lc ppt
Dipak Shetty
 
LC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminantsLC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminants
SCIEX
 
Ionization Techniques in LC-MS
Ionization Techniques in LC-MSIonization Techniques in LC-MS
Practical Implementation of the New Elemental Impurities Guidelines May 2015
Practical Implementation of the New Elemental Impurities Guidelines May 2015Practical Implementation of the New Elemental Impurities Guidelines May 2015
Practical Implementation of the New Elemental Impurities Guidelines May 2015
SGS
 
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
IS-X
 
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...SCIEX
 
DETECTORS IN HPLC
DETECTORS IN HPLCDETECTORS IN HPLC
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Chromatography & Mass Spectrometry Solutions
 
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
pharmaindexing
 
Case Study: Sterilization of Surgical Instruments
Case Study: Sterilization of Surgical InstrumentsCase Study: Sterilization of Surgical Instruments
Case Study: Sterilization of Surgical Instruments
Jordi Labs
 
Evap 135-cxp gfn2017-evap-poster-170615
Evap 135-cxp gfn2017-evap-poster-170615Evap 135-cxp gfn2017-evap-poster-170615
Evap 135-cxp gfn2017-evap-poster-170615
Rana Tayyarah
 
Validation and method development of Apixaban A research project.
Validation and method development of Apixaban A research project.Validation and method development of Apixaban A research project.
Validation and method development of Apixaban A research project.
Bhavana Gundavarapu
 
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up ApproachMALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
Shimadzu Scientific Instruments
 
High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...SCIEX
 
Refocus: Analysis of malodor
Refocus: Analysis of malodorRefocus: Analysis of malodor
Refocus: Analysis of malodor
IS-X
 
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Bhavana Gundavarapu
 
LCMS
LCMSLCMS

What's hot (20)

Selected ion flow tube MS - Online quantitative VOC analysis
Selected ion flow tube MS - Online quantitative VOC analysisSelected ion flow tube MS - Online quantitative VOC analysis
Selected ion flow tube MS - Online quantitative VOC analysis
 
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
 
Archana B lc ppt
Archana B lc pptArchana B lc ppt
Archana B lc ppt
 
LC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminantsLC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminants
 
Ionization Techniques in LC-MS
Ionization Techniques in LC-MSIonization Techniques in LC-MS
Ionization Techniques in LC-MS
 
Practical Implementation of the New Elemental Impurities Guidelines May 2015
Practical Implementation of the New Elemental Impurities Guidelines May 2015Practical Implementation of the New Elemental Impurities Guidelines May 2015
Practical Implementation of the New Elemental Impurities Guidelines May 2015
 
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
 
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
 
DETECTORS IN HPLC
DETECTORS IN HPLCDETECTORS IN HPLC
DETECTORS IN HPLC
 
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
 
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
 
Case Study: Sterilization of Surgical Instruments
Case Study: Sterilization of Surgical InstrumentsCase Study: Sterilization of Surgical Instruments
Case Study: Sterilization of Surgical Instruments
 
Evap 135-cxp gfn2017-evap-poster-170615
Evap 135-cxp gfn2017-evap-poster-170615Evap 135-cxp gfn2017-evap-poster-170615
Evap 135-cxp gfn2017-evap-poster-170615
 
Validation and method development of Apixaban A research project.
Validation and method development of Apixaban A research project.Validation and method development of Apixaban A research project.
Validation and method development of Apixaban A research project.
 
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up ApproachMALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
 
High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...
 
Refocus: Analysis of malodor
Refocus: Analysis of malodorRefocus: Analysis of malodor
Refocus: Analysis of malodor
 
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
 
LCMS
LCMSLCMS
LCMS
 
HPLC Method Development
HPLC Method DevelopmentHPLC Method Development
HPLC Method Development
 

Similar to Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study

The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Event 32
Event 32Event 32
Event 32
etheruoh
 
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
Simone Brogi
 
Automated well test analysis ii using ‘well test auto’
Automated well test analysis ii using ‘well test auto’Automated well test analysis ii using ‘well test auto’
Automated well test analysis ii using ‘well test auto’
Alexander Decker
 
IRJET- Detection and Identification of Artificially Ripened Fruits using ...
IRJET-  	  Detection and Identification of Artificially Ripened Fruits using ...IRJET-  	  Detection and Identification of Artificially Ripened Fruits using ...
IRJET- Detection and Identification of Artificially Ripened Fruits using ...
IRJET Journal
 
Neural network-based pH and coagulation adjustment system in water treatment
Neural network-based pH and coagulation adjustment system in water treatmentNeural network-based pH and coagulation adjustment system in water treatment
Neural network-based pH and coagulation adjustment system in water treatment
IAESIJAI
 
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
ecij
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
Kamel Mansouri
 
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
A360105
A360105A360105
A360105
IJERA Editor
 
2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai
Sirris
 
Cw33592595
Cw33592595Cw33592595
Cw33592595
IJERA Editor
 
Cw33592595
Cw33592595Cw33592595
Cw33592595
IJERA Editor
 
High Sensitivity Sanger Sequencing for Minor Variant Detection
High Sensitivity Sanger Sequencing for Minor Variant DetectionHigh Sensitivity Sanger Sequencing for Minor Variant Detection
High Sensitivity Sanger Sequencing for Minor Variant Detection
Thermo Fisher Scientific
 
Comparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINALComparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINALJessica Sitko
 
A Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
A Path-Oriented Automatic Random Testing Based on Double Constraint PropagationA Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
A Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
ijseajournal
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Sean Ekins
 
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET Journal
 
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
Jessica Navarro
 

Similar to Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study (20)

The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
 
Event 32
Event 32Event 32
Event 32
 
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
 
Automated well test analysis ii using ‘well test auto’
Automated well test analysis ii using ‘well test auto’Automated well test analysis ii using ‘well test auto’
Automated well test analysis ii using ‘well test auto’
 
IRJET- Detection and Identification of Artificially Ripened Fruits using ...
IRJET-  	  Detection and Identification of Artificially Ripened Fruits using ...IRJET-  	  Detection and Identification of Artificially Ripened Fruits using ...
IRJET- Detection and Identification of Artificially Ripened Fruits using ...
 
Neural network-based pH and coagulation adjustment system in water treatment
Neural network-based pH and coagulation adjustment system in water treatmentNeural network-based pH and coagulation adjustment system in water treatment
Neural network-based pH and coagulation adjustment system in water treatment
 
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
 
A360105
A360105A360105
A360105
 
2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai
 
Cw33592595
Cw33592595Cw33592595
Cw33592595
 
Cw33592595
Cw33592595Cw33592595
Cw33592595
 
High Sensitivity Sanger Sequencing for Minor Variant Detection
High Sensitivity Sanger Sequencing for Minor Variant DetectionHigh Sensitivity Sanger Sequencing for Minor Variant Detection
High Sensitivity Sanger Sequencing for Minor Variant Detection
 
Comparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINALComparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINAL
 
EMSDVici
EMSDViciEMSDVici
EMSDVici
 
A Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
A Path-Oriented Automatic Random Testing Based on Double Constraint PropagationA Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
A Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
 
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
 
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
 

Recently uploaded

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 

Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study

  • 1. Looking closer at the failed samples provides some insight to the nature of the failure. ID #3 failed specifically because of an unassigned 2D peak appearing at 17.8 ppm and 2.2 ppm. It was determined that this peak was due to the aromatic methyl group in the structure. The reason for a lack of consistency between the predicted and experimental chemical shifts in this case were due to a slow rotation around the N-CO bond. As a result, this rotamer produces an experimental spectrum that looks like a mixture. The software was unable to accurately predict this mixture of forms based on the experimental conditions, and as a result, the predicted spectrum did not match the experimental and the sample was flagged for manual analysis. Figure 2. ID #3 was a false negative as the software was unable to assign the methyl group (highlighted in blue) due to slow rotation around the N-CO bond. The presence of this rotamer resulted in an inconsistency between the experimental and predicted chemical shifts. The software also flagged the sample ID #7 to be considered for closer inspection. In this particular case, the issue with this spectrum was determined to be based on the presence of a mixture. Based on some of the spectral features in both the 1 H and HSQC–DEPT spectra, it is believed that some of the product had converted to an alcohol resulting in a mixture of both the brominated and hydroxylated products. As a result, the software correctly identified this spectrum as not being consistent with the proposed structure and it was flagged for manual analysis. Figure 3. ID #7 was flagged as an ambiguous result. Spectral features in the 1 H and HSQC–DEPT dataset suggest a mixture between the two compounds shown above. The final sample in the blind test set (ID #10) was also flagged by the software. In this particular case, the software was unable to identify two protons in the experimental spectrum because the software correctly set a dark region over a large water peak. Unfortunately, the creation of this dark region resulted in the exclusion of an important multiplet in the experimental spectrum that was in close proximity to the water peak. Because of this dark region, the software was unable to confirm a match between the spectrum and structure. Table 1. The results of the 19 Aldrich datasets. For this dataset, 13 of the 19 datasets (69%) were automatically evaluated by the software. The software was unable to confirm the proposed structure in Figure 1 (ID #8). Upon closer inspection it was observed that the software failed because the experimental peak located at 4.74 ppm corresponding to atom #11 in the proposed structure had an integration value that was too low to assign correctly. As a result, the software flagged this result as ambiguous. It was later observed that the low integration value could be due to enol formation. A longer relaxation delay may have more adequately prepared the experimental spectrum for automatic evaluation by the software. Figure 1. ID #8 was a false negative because the integration value for the multiplet at 4.74 ppm was too low. A second compound, ID #11, was rejected by the software as well. Upon closer inspection of the experimental data, it was determined that the purity of this compound was not sufficient and that the sample contained several different components. In these two particular examples, the software did a good job of flagging the two problem spectra that required a closer look. Results of the Blind Test After the previous results and settings had been agreed upon, a blind test set of 10 compounds was run through the system in the exact same fashion. No changes to the processing or verification parameters were made. The results of this test are shown in table 2. Table 2. The results of the 10 blind Aldrich datasets. For this dataset, 7 of the 10 datasets (70%) were automatically evaluated by the software. Ryan Sasaki, Brent Lefebvre, Antony J. Williams, and Sergey Golotvin Advanced Chemistry Development, Inc. Toronto, ON, Canada 110 Yonge Street, 14 th floor, Toronto Ontario, Canada M5C 1T4 Tel: (416) 368-3435 Fax: (416) 368-5596 Toll Free: 1-800-304-3988 Email: info@acdlabs.com Validating Automated Structure Confirmation in a Blind Study Figure 6 . ID #10 failed due to the close proximity of an important multiplet to an intense water peak in the experimental spectrum. CONCLUSIONS The goal of this study was to evaluate a fully automated NMR processing and structure verification workflow for a blind test set of compounds. The processing and evaluation settings for a typical group of samples was set up using a pilot set of 19 compounds. Once these settings were adjusted, they were used to automatically process and evaluate a blind set of 10 compounds that were prepared under the same conditions. The results revealed that this completely automated system could reduce the interpretation workload of a spectroscopist by up to 90% if problems with rotomers and impurities are filtered out before the NMR Verification step, up to 70% when these problem samples are left in. This study highlighted several examples where datasets were flagged by the software for closer inspection by a spectroscopist. These particular examples illustrate the software’s discrimination ability that help reduce the risk of false positives. The results of this blind study suggested that a fully automated processing and interpretation system can perform sufficiently in an industrial environment. ACKNOWLEDGEMENTS The authors would like to acknowledge Dr. Timothy D. Spitzer and Randy D. Rutkowske of GlaxoSmithKline for providing us with NMR data for the compounds in this study. REFERENCES 1. Automated Structure Verification Based on 1 H NMR Prediction, Sergey S. Golotvin, Eugene Vodopianov, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer . Magn. Reson. Chem. 2006; 44: 524-538. 2. Automated Evaluation of a Chemical Structure with Only 1D 1 H and 2D 1 H– 13 C HSQC, Sergey S. Golotvin, Eugene Vodopianov, Rostislav Pol, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer . ENC Poster 2006. INTRODUCTION In previous work, we have presented several findings on the automated evaluation of chemical structures using 1 H, 13 C, and 2D NMR verification algorithms. 1–2 These studies have shown that these systems have performed extremely well through numerous challenges. The current study focuses not only on the performance of the verification algorithms but also on the automated preparation of experimental data through a blind test. This study was designed to prove that such a system would hold up in an industrial environment without any human intervention. This study consisted of two distinct sets of structures and spectra. The first contained 19 spectra sets (each dataset contained 1D 1 H and 2D HSQC spectra) that were provided ahead of time for adjustment of processing settings and options. This step was necessary to identify the best software settings based on the instrument and data collection practices for the laboratory where the samples were prepared and run. Once the first set was run through the system and results of the verification procedure obtained, the second, blind test, was performed on 10 distinct datasets (with chemical structures) that were not available to the software or the software operators in advance. The details and results of these two tests are presented here, along with a comprehensive look at the structures that could not be confirmed. Setting Up Ideal Processing and Evaluation Parameters In order to have a system that can run without human intervention, automated processing and structure verification procedures (macros) must be created in the software to perform these tasks. The raw 1D and 2D NMR datasets for 19 Aldrich compounds were first evaluated using ACD/Labs’ standard macros. These settings proved to be non-sufficient as the datasets contained several abnormally broad water peaks and low signal-to-noise ratios. These macros were then modified to exclude these water peaks and set more stringent peak picking guidelines to combat the S/N issues. The second attempt was improved but had some issues with the referencing in one of the 2D datasets. In addition, the 1D spectra were not well-resolved, resulting in an inaccurate evaluation of some multiplets. These issues were rectified by decreasing the line broadening setting in the software by a factor of 10. Following this modification, the settings were then deemed to be sufficient. Results of the First Test An explanation of the combined verification algorithms used to evaluate spectrum-to-structure matches have been previously reported. 2 Following the modification of ACD/Labs’ standard macros explained in the previous section, the raw data of the 19 Aldrich compounds were fully processed and evaluated automatically. The results revealed that the software was able to successfully evaluate 13 of the 19 datasets provided. In other words, for this particular dataset, 69% of the samples were automatically evaluated by software without any human intervention. The remaining 6 samples would require manual analysis by an NMR Spectroscopist as the software had flagged them as being either inconsistent or incorrect (Table 1). Of these 6 samples, it was concluded that 4 of the false negatives were a result of algorithm errors that have been fixed in Version 10 of the software (ID# 6, 12, 13, and 15). The other two ambiguous results require a closer look to be explained.