SlideShare a Scribd company logo
John May and Roger Sayle
NextMove Software Ltd, Cambridge, UK.
NextMove Software Limited
Innovation Centre (Unit 23)
Cambridge Science Park
Milton Road, Cambridge
England CB4 0EY
www.nextmovesoftware.co.uk
www.nextmovesoftware.com
5. Reactive incompatibles
1. Introduction
7. Acknowledgements
8. Bibliography
3. Identity matching
Reactive incompatibilities representative a more subtle class of hazard where
compounds may have minimal risk in isolation but are dangerous when combined.
Combinations of compounds are encoded by conjunction where all child
expressions must be satisfied for the alert to trigger. An example found by
Casandra in the US patents is:
US20020173655A1 [paragraph:230]
Here dimethylformamide (DMF) and sodium hydride react exothermically in a self-
accelerating reaction. Care is taken when matching sodium hydride to maintain
the fragment grouping.
An extension to specific incompatibles is the ability to encode hierarchical generic
incompatibles[5]:
(DMF + NaH) is child of (amide + metal hydride)
(acetone + H2O2) is child of (ketone + peroxide)
Groupings of conditions can be nested arbitrarily and allow non-binary
combinations (e.g. ammonium nitrate + formamide + H2O).
The authors would like to thank Dana Vanderwall, Ramesh Durvasula, Dong Li and
Carol McNab from Bristol-Myers Squibb for their assistance and feedback.
Reactive Chemical Hazard Alerting
in Pharmaceutical Electronic Lab
Notebooks
6. Summary and future work
Of the many chemical reactions performed by synthetic chemists in the
pharmaceutical industry and academia, some are potentially more hazardous
than others. Fortunately, best practices, compliance, and education helps
ensure that incidents are rare, but as highlighted by the recent explosion and
building evacuation at two UK universities in March 2015, constant vigilance is
necessary to ensure a safe work environment.
Chemical safety information from MSDS/SDS datasheets, Bretherick's
Handbook[1] or the internet, is readily available but the volume of such
information makes it difficult for an experimentalist to identify relevant risks in a
timely manner. Casandra is a server for delivering real time safety warnings of
experimental hazards straight to the pharmaceutical electronic laboratory
notebooks (ELNs)
In Casandra, safety alerts from an in-house or public ruleset are encoded as XML
(Alerting Configuration). The server then builds an expression tree of this ruleset
allowing efficient matching of potential hazards in real time. Expressions can
extend beyond simple name lookup utilising the chemical structure and
quantities of reactants to identify hazards.
Experimental reactions are encoded in XML by an ELN desktop client and
submitted to the server for analysis. Potential hazards are identified, a report is
generated and sent back to the client. The report can contain a concise
description, citation, classification, actions (e.g. SOP), and a highlighted
structure diagram. The ELN client handles the report and chooses whether to
prompt the user for compliance.
Sometimes just the presence of a compound in a reaction may be cause for
caution. Examples may include: toxic, volatile, radioactive, or odorous
compounds. Occurrence of a compound is identified with Casandra through
identity matching.
The configuration encodes a name and optionally a structure. If no structure has
been specified, Casandra can assign one using name-to-structure tools such as
OPSIN[2] or NextMove Software’s LeadMine.
Identity matching uses name, InChI, and canonical SMILES comparison. When no
structure can be assigned name matching is still utilised. Chemical spelling
correction[3] may be used for non-exact name matches. Running the above
configuration in Casandra over 1.2 million reactions extracted from the US
patents[4] highlights the following example:
US20120122891A1 [paragraph:499]
In this particular case the risk may be minimal since the reagent is present in
toluene (rather than being powdered) and only at small amounts (9.3 ml). ELN
desktop clients typically include the quantity of a compound used in an
experiment. The configuration can therefore be modified to only trigger above a
certain threshold.
4. Substructure patterns
Capturing hazards related to specific compounds with the identity matching can
be informative but is limited to known and encoded examples. Compounds that
are primarily hazardous due the presence of a specific functional group can be
matched with more generic patterns. In the previous section the example
reaction also contains an azide which one may want to detect.
As with the identity matching, a quantity threshold may be specified. The
condition can be tuned to only alert when the specified number of occurrences
are found. For example, checking US patents for reactions where one compound
contains four nitro groups finds the following:
US20100081811A1 [paragraph:18]
2. Overview
We have described how Casandra provides the alerting of experimental hazards.
The capturing of alerts in a flexible XML format facilitates exchange and reuse.
Future work includes extending conditions to check theoretical calculated
property predictions, such as maximum heat of decomposition/deflagration,
flashpoint, oxygen balance, radioactive half-life and vapour pressure.
1. Bretherick. Bretherick’s Handbook of reactive chemical hazards
2. Lowe DM et al. Chemical Name to Structure: OPSIN, an Open Source Solution. JCIM.
51(1). 2012
3. Sayle RS et al. Improved Chemical Text Mining of patents with Infinite Dictionaries
and Automatic Spelling Correction. JCIM. 51(3). 2011.
4. Lowe DM. Unleashing over a million reactions into the wild. 2014. Online:
nextmovesoftware.com/blog/2014/02/
5. Johnson LE and Farr JK. CRW 2.0. Process Safety Progress. 27(3). 2008

More Related Content

Similar to Reactive Chemical Hazard Alerting in Pharmaceutical Notebooks

Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
NextMove Software
 
Impact of the temperature and humidity variations on link quality of xm1000 m...
Impact of the temperature and humidity variations on link quality of xm1000 m...Impact of the temperature and humidity variations on link quality of xm1000 m...
Impact of the temperature and humidity variations on link quality of xm1000 m...
ijasuc
 
PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...
PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...
PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...
IAEME Publication
 
Virtual Reaction Service Using Chem Axon Reactor July06
Virtual Reaction Service Using Chem Axon Reactor July06Virtual Reaction Service Using Chem Axon Reactor July06
Virtual Reaction Service Using Chem Axon Reactor July06DanielSButler
 
ADF modeling suite: DFT to MD software for chemistry and materials
ADF modeling suite: DFT to MD software for chemistry and materialsADF modeling suite: DFT to MD software for chemistry and materials
ADF modeling suite: DFT to MD software for chemistry and materials
Software for Chemistry & Materials
 
Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...
Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...
Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...
NextMove Software
 
Ecrice - ChemEd X Data - Barcelona - 2016 talk
Ecrice - ChemEd X Data - Barcelona - 2016 talkEcrice - ChemEd X Data - Barcelona - 2016 talk
Ecrice - ChemEd X Data - Barcelona - 2016 talk
University of Minnesota Rochester
 
Azide safety archibald
Azide safety archibaldAzide safety archibald
Azide safety archibald
Vikas Nigam
 
Paper id 36201531
Paper id 36201531Paper id 36201531
Paper id 36201531IJRAT
 
Extraction, analysis, atom mapping, classification and naming of reactions fr...
Extraction, analysis, atom mapping, classification and naming of reactions fr...Extraction, analysis, atom mapping, classification and naming of reactions fr...
Extraction, analysis, atom mapping, classification and naming of reactions fr...
NextMove Software
 
Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...
Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...
Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...
Hitesh Patel
 
Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...
Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...
Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...Alexander Decker
 
conventional Vs. tactile computing
conventional Vs. tactile computingconventional Vs. tactile computing
conventional Vs. tactile computingharish kumar
 
COMPARISON FREE ENERGY BINDING SITES NEURAMINIDASE
COMPARISON FREE ENERGY BINDING SITES NEURAMINIDASECOMPARISON FREE ENERGY BINDING SITES NEURAMINIDASE
COMPARISON FREE ENERGY BINDING SITES NEURAMINIDASE
ijabjournal
 
Open-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesOpen-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databases
Greg Landrum
 
20090511 Manchester Biochemistry
20090511 Manchester Biochemistry20090511 Manchester Biochemistry
20090511 Manchester Biochemistry
Michel Dumontier
 
Multi sensor data fusion system for enhanced analysis of deterioration in con...
Multi sensor data fusion system for enhanced analysis of deterioration in con...Multi sensor data fusion system for enhanced analysis of deterioration in con...
Multi sensor data fusion system for enhanced analysis of deterioration in con...Sayed Abulhasan Quadri
 
Multilabel Classification by BCH Code and Random Forests
Multilabel Classification by BCH Code and Random ForestsMultilabel Classification by BCH Code and Random Forests
Multilabel Classification by BCH Code and Random Forests
IDES Editor
 
Introduction to Nanotechnology: Part 3
Introduction to Nanotechnology: Part 3Introduction to Nanotechnology: Part 3
Introduction to Nanotechnology: Part 3glennfish
 

Similar to Reactive Chemical Hazard Alerting in Pharmaceutical Notebooks (20)

Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
Extraction, Analysis, Atom Mapping, Classification and Naming of Reactions fr...
 
Impact of the temperature and humidity variations on link quality of xm1000 m...
Impact of the temperature and humidity variations on link quality of xm1000 m...Impact of the temperature and humidity variations on link quality of xm1000 m...
Impact of the temperature and humidity variations on link quality of xm1000 m...
 
PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...
PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...
PERFORMANCE PREDICTION OF AN ADIABATIC SOLAR LIQUID DESICCANT REGENERATOR USI...
 
Virtual Reaction Service Using Chem Axon Reactor July06
Virtual Reaction Service Using Chem Axon Reactor July06Virtual Reaction Service Using Chem Axon Reactor July06
Virtual Reaction Service Using Chem Axon Reactor July06
 
ADF modeling suite: DFT to MD software for chemistry and materials
ADF modeling suite: DFT to MD software for chemistry and materialsADF modeling suite: DFT to MD software for chemistry and materials
ADF modeling suite: DFT to MD software for chemistry and materials
 
Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...
Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...
Efficient Searching and Similarity of Unmapped Reactions: Application to ELN ...
 
Ecrice - ChemEd X Data - Barcelona - 2016 talk
Ecrice - ChemEd X Data - Barcelona - 2016 talkEcrice - ChemEd X Data - Barcelona - 2016 talk
Ecrice - ChemEd X Data - Barcelona - 2016 talk
 
Azide safety archibald
Azide safety archibaldAzide safety archibald
Azide safety archibald
 
Paper id 36201531
Paper id 36201531Paper id 36201531
Paper id 36201531
 
Extraction, analysis, atom mapping, classification and naming of reactions fr...
Extraction, analysis, atom mapping, classification and naming of reactions fr...Extraction, analysis, atom mapping, classification and naming of reactions fr...
Extraction, analysis, atom mapping, classification and naming of reactions fr...
 
Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...
Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...
Synthetically Accessible Virtual Inventory (SAVI) : Reaction generation and h...
 
Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...
Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...
Analysis of electromagnetic (em) radiation shielding and decay chain for nucl...
 
conventional Vs. tactile computing
conventional Vs. tactile computingconventional Vs. tactile computing
conventional Vs. tactile computing
 
COMPARISON FREE ENERGY BINDING SITES NEURAMINIDASE
COMPARISON FREE ENERGY BINDING SITES NEURAMINIDASECOMPARISON FREE ENERGY BINDING SITES NEURAMINIDASE
COMPARISON FREE ENERGY BINDING SITES NEURAMINIDASE
 
Open-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesOpen-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databases
 
Presentation
PresentationPresentation
Presentation
 
20090511 Manchester Biochemistry
20090511 Manchester Biochemistry20090511 Manchester Biochemistry
20090511 Manchester Biochemistry
 
Multi sensor data fusion system for enhanced analysis of deterioration in con...
Multi sensor data fusion system for enhanced analysis of deterioration in con...Multi sensor data fusion system for enhanced analysis of deterioration in con...
Multi sensor data fusion system for enhanced analysis of deterioration in con...
 
Multilabel Classification by BCH Code and Random Forests
Multilabel Classification by BCH Code and Random ForestsMultilabel Classification by BCH Code and Random Forests
Multilabel Classification by BCH Code and Random Forests
 
Introduction to Nanotechnology: Part 3
Introduction to Nanotechnology: Part 3Introduction to Nanotechnology: Part 3
Introduction to Nanotechnology: Part 3
 

More from NextMove Software

DeepSMILES
DeepSMILESDeepSMILES
DeepSMILES
NextMove Software
 
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
NextMove Software
 
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...
NextMove Software
 
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedCINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speed
NextMove Software
 
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESA de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILES
NextMove Software
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
NextMove Software
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
NextMove Software
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule Implementations
NextMove Software
 
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...
NextMove Software
 
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
NextMove Software
 
Recent improvements to the RDKit
Recent improvements to the RDKitRecent improvements to the RDKit
Recent improvements to the RDKit
NextMove Software
 
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...
NextMove Software
 
Digital Chemical Representations
Digital Chemical RepresentationsDigital Chemical Representations
Digital Chemical Representations
NextMove Software
 
Challenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptionsChallenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptions
NextMove Software
 
PubChem as a Biologics Database
PubChem as a Biologics DatabasePubChem as a Biologics Database
PubChem as a Biologics Database
NextMove Software
 
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
NextMove Software
 
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesCINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
NextMove Software
 
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesBuilding on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfiles
NextMove Software
 
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
NextMove Software
 
Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)
NextMove Software
 

More from NextMove Software (20)

DeepSMILES
DeepSMILESDeepSMILES
DeepSMILES
 
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...CINF 170: Regioselectivity: An application of expert systems and ontologies t...
CINF 170: Regioselectivity: An application of expert systems and ontologies t...
 
Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...Building a bridge between human-readable and machine-readable representations...
Building a bridge between human-readable and machine-readable representations...
 
CINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speedCINF 35: Structure searching for patent information: The need for speed
CINF 35: Structure searching for patent information: The need for speed
 
A de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILESA de facto standard or a free-for-all? A benchmark for reading SMILES
A de facto standard or a free-for-all? A benchmark for reading SMILES
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule Implementations
 
Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...Eugene Garfield: the father of chemical text mining and artificial intelligen...
Eugene Garfield: the father of chemical text mining and artificial intelligen...
 
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
 
Recent improvements to the RDKit
Recent improvements to the RDKitRecent improvements to the RDKit
Recent improvements to the RDKit
 
Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...Pharmaceutical industry best practices in lessons learned: ELN implementation...
Pharmaceutical industry best practices in lessons learned: ELN implementation...
 
Digital Chemical Representations
Digital Chemical RepresentationsDigital Chemical Representations
Digital Chemical Representations
 
Challenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptionsChallenges and successes in machine interpretation of Markush descriptions
Challenges and successes in machine interpretation of Markush descriptions
 
PubChem as a Biologics Database
PubChem as a Biologics DatabasePubChem as a Biologics Database
PubChem as a Biologics Database
 
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
CINF 17: Comparing Cahn-Ingold-Prelog Rule Implementations: The need for an o...
 
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction DatabasesCINF 13: Pistachio - Search and Faceting of Large Reaction Databases
CINF 13: Pistachio - Search and Faceting of Large Reaction Databases
 
Building on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfilesBuilding on Sand: Standard InChIs on non-standard molfiles
Building on Sand: Standard InChIs on non-standard molfiles
 
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
Chemical Structure Representation of Inorganic Salts and Mixtures of Gases: A...
 
Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)Advanced grammars for state-of-the-art named entity recognition (NER)
Advanced grammars for state-of-the-art named entity recognition (NER)
 

Recently uploaded

如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Red blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptxRed blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptx
muralinath2
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
frank0071
 
Anemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptxAnemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptx
muralinath2
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
frank0071
 

Recently uploaded (20)

如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Red blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptxRed blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
 
Anemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptxAnemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptx
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
 

Reactive Chemical Hazard Alerting in Pharmaceutical Notebooks

  • 1. John May and Roger Sayle NextMove Software Ltd, Cambridge, UK. NextMove Software Limited Innovation Centre (Unit 23) Cambridge Science Park Milton Road, Cambridge England CB4 0EY www.nextmovesoftware.co.uk www.nextmovesoftware.com 5. Reactive incompatibles 1. Introduction 7. Acknowledgements 8. Bibliography 3. Identity matching Reactive incompatibilities representative a more subtle class of hazard where compounds may have minimal risk in isolation but are dangerous when combined. Combinations of compounds are encoded by conjunction where all child expressions must be satisfied for the alert to trigger. An example found by Casandra in the US patents is: US20020173655A1 [paragraph:230] Here dimethylformamide (DMF) and sodium hydride react exothermically in a self- accelerating reaction. Care is taken when matching sodium hydride to maintain the fragment grouping. An extension to specific incompatibles is the ability to encode hierarchical generic incompatibles[5]: (DMF + NaH) is child of (amide + metal hydride) (acetone + H2O2) is child of (ketone + peroxide) Groupings of conditions can be nested arbitrarily and allow non-binary combinations (e.g. ammonium nitrate + formamide + H2O). The authors would like to thank Dana Vanderwall, Ramesh Durvasula, Dong Li and Carol McNab from Bristol-Myers Squibb for their assistance and feedback. Reactive Chemical Hazard Alerting in Pharmaceutical Electronic Lab Notebooks 6. Summary and future work Of the many chemical reactions performed by synthetic chemists in the pharmaceutical industry and academia, some are potentially more hazardous than others. Fortunately, best practices, compliance, and education helps ensure that incidents are rare, but as highlighted by the recent explosion and building evacuation at two UK universities in March 2015, constant vigilance is necessary to ensure a safe work environment. Chemical safety information from MSDS/SDS datasheets, Bretherick's Handbook[1] or the internet, is readily available but the volume of such information makes it difficult for an experimentalist to identify relevant risks in a timely manner. Casandra is a server for delivering real time safety warnings of experimental hazards straight to the pharmaceutical electronic laboratory notebooks (ELNs) In Casandra, safety alerts from an in-house or public ruleset are encoded as XML (Alerting Configuration). The server then builds an expression tree of this ruleset allowing efficient matching of potential hazards in real time. Expressions can extend beyond simple name lookup utilising the chemical structure and quantities of reactants to identify hazards. Experimental reactions are encoded in XML by an ELN desktop client and submitted to the server for analysis. Potential hazards are identified, a report is generated and sent back to the client. The report can contain a concise description, citation, classification, actions (e.g. SOP), and a highlighted structure diagram. The ELN client handles the report and chooses whether to prompt the user for compliance. Sometimes just the presence of a compound in a reaction may be cause for caution. Examples may include: toxic, volatile, radioactive, or odorous compounds. Occurrence of a compound is identified with Casandra through identity matching. The configuration encodes a name and optionally a structure. If no structure has been specified, Casandra can assign one using name-to-structure tools such as OPSIN[2] or NextMove Software’s LeadMine. Identity matching uses name, InChI, and canonical SMILES comparison. When no structure can be assigned name matching is still utilised. Chemical spelling correction[3] may be used for non-exact name matches. Running the above configuration in Casandra over 1.2 million reactions extracted from the US patents[4] highlights the following example: US20120122891A1 [paragraph:499] In this particular case the risk may be minimal since the reagent is present in toluene (rather than being powdered) and only at small amounts (9.3 ml). ELN desktop clients typically include the quantity of a compound used in an experiment. The configuration can therefore be modified to only trigger above a certain threshold. 4. Substructure patterns Capturing hazards related to specific compounds with the identity matching can be informative but is limited to known and encoded examples. Compounds that are primarily hazardous due the presence of a specific functional group can be matched with more generic patterns. In the previous section the example reaction also contains an azide which one may want to detect. As with the identity matching, a quantity threshold may be specified. The condition can be tuned to only alert when the specified number of occurrences are found. For example, checking US patents for reactions where one compound contains four nitro groups finds the following: US20100081811A1 [paragraph:18] 2. Overview We have described how Casandra provides the alerting of experimental hazards. The capturing of alerts in a flexible XML format facilitates exchange and reuse. Future work includes extending conditions to check theoretical calculated property predictions, such as maximum heat of decomposition/deflagration, flashpoint, oxygen balance, radioactive half-life and vapour pressure. 1. Bretherick. Bretherick’s Handbook of reactive chemical hazards 2. Lowe DM et al. Chemical Name to Structure: OPSIN, an Open Source Solution. JCIM. 51(1). 2012 3. Sayle RS et al. Improved Chemical Text Mining of patents with Infinite Dictionaries and Automatic Spelling Correction. JCIM. 51(3). 2011. 4. Lowe DM. Unleashing over a million reactions into the wild. 2014. Online: nextmovesoftware.com/blog/2014/02/ 5. Johnson LE and Farr JK. CRW 2.0. Process Safety Progress. 27(3). 2008