SlideShare a Scribd company logo
1 of 57
Download to read offline
How to use data to design and optimize reactions?
⎯⎯⎯⎯⎯⎯ A quick introduction to work from Sigman lab
Speaker: Kosuke Higashida (Sawamura-G)
Slides: Ichigaku Takigawa (Takigawa-G)
ICReDD Fusion Research Seminar, Apr 27 2020.
Today's talk
Introduce papers from Matthew S. Sigman's group that 

considers how to use data to design and optimize catalysts.
Matthew S Sigman
Dept. Chemistry

The University of Utah
Organic Synthesis &
Asymmetric Catalysis
MS Sigman, KC Harper, EN Bess, A Milo

The development of multidimensional analysis tools
for asymmetric catalysis and beyond.

Accounts of Chemical Research 2016 49 (6), 1292-1301.
summarized the following studies from the lab.
1. Harper & Sigman, PNAS 2011
2. Harper & Sigman, Science 2011
• Werner, Mei, Burckle & Sigman, Science 2012
3. Milo, Bess & Sigman, Nature 2014
• Harper, Vilardi & Sigman, JACS 2013
• Bess, Bischoff, Sigman, PNAS 2014
4. Niemeyer, Milo, Hickey & Sigman, Nat Chem 2016;

Milo, Neel, Toste & Sigman, Science 2015
Our consistent theme today
• Chemically, it's about how to rationally design and improve
asymmetric catalysis ( , 不对称合成).
• Chemoinformatically, it's about QSAR (Quantitative Structure-
Activity Relationship) and descriptors.
• Statistically/Mathematically, it's about an application of linear
regression (design of experiments).
How to grasp "empirical" trends in experiments, and how
to effectively use them to develop new catalytic reactions?
Outline
1. Harper & Sigman, PNAS. 2011

Proof of Concept (PoC)

Consider "Design of Experiments (DoE)" (Information Science)
2. Harper & Sigman, Science. 2011

De novo Applications of 1
• Werner, Mei, Burckle & Sigman, Science 2012
3. Milo, Bess & Sigman, Nature. 2014

"More sophisticated descriptors" (Computational Science)
• Harper, Vilardi & Sigman, JACS, 2013
• Bess, Bischoff, Sigman, PNAS, 2014
4. Niemeyer, Milo, Hickey & Sigman, Nat Chem 2016;

Milo, Neel, Toste & Sigman, Science 2015

Establish a data-intensive workflow for "the study of reaction
mechanisms"
0. Quick Introduction to Asymmetric Synthesis
Asymmetric synthesis (Enantioselective synthesis)
• A molecule is called "chiral" if it cannot be
superposed on its mirror.
• A chiral molecule exists in two stereoisomers
that are mirror images of each other, called
"enantiomers".
• They have the same chemical properties,
except when reacting with other chiral
compounds.
• They also have the same physical
properties, except that they often have
opposite optical activities.
• A chiral molecule must have at least one
"chiral center" or "stereocenter".
A reaction that produces the stereoisomeric (enantiomeric or
diastereoisomeric) products in unequal amounts.
A key process in modern chemistry, particularly for pharmaceuticals!
"enantiomers"
A starting example
0.1 equiv. CrCl3
0.1 equiv. Chiral Ligand
2 equiv. Mn(0)
0.1 equiv. Triethylamine
2 equiv. SiMe3Cl
Nozaki-Hiyama-Kishi (NHK) allylation reaction
THF, RT, 20h
R
How to read this?
0.1 equiv. CrCl3
0.1 equiv. Chiral Ligand
Nozaki-Hiyama-Kishi (NHK) allylation reaction
Substrates
(Reactants)
Product
THF, RT, 20h
Catalyst Ligand (binds to Cr)
Reagents
Solvent
Reaction Temperature (RT = Room Temperature)
Reaction Time
Starting material
( )
2 equiv. Mn(0)
0.1 equiv. Triethylamine
2 equiv. SiMe3Cl
Reaction partner
A starting example
0.1 equiv. CrCl3
0.1 equiv. Chiral Ligand
Nozaki-Hiyama-Kishi (NHK) allylation reaction
THF, RT, 20h
an activated allyl group is added to carbonyl group
Phenyl-group
Asymmetric center
R
2 equiv. Mn(0)
0.1 equiv. Triethylamine
2 equiv. SiMe3Cl
A starting example
0.1 equiv. CrCl3
0.1 equiv. Chiral Ligand
Nozaki-Hiyama-Kishi (NHK) allylation reaction
THF, RT, 20h
an activated allyl group is added to carbonyl group
Phenyl-group
or
SR
Enantiomer ratio: e.r.
Enantiomeric Excess: e.e.
AR : AS
R
2 equiv. Mn(0)
0.1 equiv. Triethylamine
2 equiv. SiMe3Cl
Our problem for now
What "substituent" for can
improve the enantioselectivity,
preferably for a wide range of substrates?
"Ligands" ( )
iPr Bn
( )
Groups for ?

"Substituents"
• Ph
• tBu
• CF3
• CH(CH3)2
• OiPr
• Cl
• CH3
• CHO
• CO2Me
• COMe
• COOH
• OEt
• NMe2
• OMe
• NH2
• NO2
• OH
• F
• H= isopropyl group = benzyl group
tBu
= tertiary butyl 

group
Me
= methyl
group
Et
= ethyl
group
Sigman's approach
1. Use high-throughput assays to rapidly evaluate
many reaction conditions and catalyst structures.
2. Investigate the reaction mechanism to inform the
design of better-performing catalysts.

(by computing various transition-state structures)
Two common tactics of many labs (including Sigman's lab)
The newly proposed approach of Sigman's lab
Simultaneously optimize reaction performance and
interrogate mechanistic features of reactions by
relating empirical results of carefully designed datasets
to physical organic descriptors.
Backstory before the new approach
Sigman & Miller, J. Org. Chem. 2009, 74, 20, 7633-7643
They observed 1. Linear Free Energy Relationship (LFER):

the enantioselectivity and the size of the proline
substituent G (measured by Taft/Charton steric
parameters) were positively correlated.
2. A break in the LFER when larger G groups were
employed.
[ ]‡ means 

a transition state unclear
multistep
reactions
Their two assumptions for the break of LFER
Their focus on outliers: "results that appear at first glance to be
peculiar or poor are considerably more interesting than ones
that follow obvious or intuitive trends."
Their two assumptions for the break of LFER
Their focus on outliers: "results that appear at first glance to be
peculiar or poor are considerably more interesting than ones
that follow obvious or intuitive trends."
1. The poor extrapolation to larger substituents may be due to
any change in the mechanisms of asymmetric catalysis
2. The Taft/Charton parameters implemented do not precisely
describe the observed steric effect.
These two assumptions for the break of LFER are the starting
point of their newly developed approach.
Their two assumptions for the break of LFER
Their focus on outliers: "results that appear at first glance to be
peculiar or poor are considerably more interesting than ones
that follow obvious or intuitive trends."
1. The poor extrapolation to larger substituents may be due to
any change in the mechanisms of asymmetric catalysis
2. The Taft/Charton parameters implemented do not precisely
describe the observed steric effect.
These two assumptions for the break of LFER are the starting
point of their newly developed approach.
Design of experiments? Data-intensive methods to probe it?
More sophisticated parameters? Computational parameters?
First study 1/4 Harper & Sigman, PNAS 2011.
Initially 25 ligands synthesized

acc. the commercial availability and
the ease of synthesis.
First study 1/4 Harper & Sigman, PNAS 2011.
Initially 25 ligands synthesized

acc. the commercial availability and
the ease of synthesis.
LFER development by fitting
a linear regression model.
enantioselectivity
enantiomeric
ratio
gas constant reaction
temperature
First study 1/4 Harper & Sigman, PNAS 2011.
Initially 25 ligands synthesized

acc. the commercial availability and
the ease of synthesis.
... resulted in only moderate predictive power and did little to
inform catalyst design.
"How many and what distribution of data points would provide a
meaningful model for prediction?" (from the initial review of the work)
LFER development by fitting
a linear regression model.
enantioselectivity
enantiomeric
ratio
gas constant reaction
temperature
First study 2/4
Apply the principles of "Design of Experiments (DoE)" using physical organic
parameters.
"Briefly, the precepts of DoE involve evenly spreading experiments to
the sensitivity limits of a system using continuous variables (e.g.
temperature and concentration)."
Harper & Sigman, PNAS 2011.
Technical note 1/3: Design of experiments (DoE)
We have many variations, but in short...
(1) Define coordinates

with great care.
(2) Examine/consider

the limit of possible ranges.
(3) Cover the region evenly 

with interacting points.
"32 factorial design"

(full factorial design

with 3 levels of 2 factors)
Consult a statistician when descriptors have
strong correlation or clear interdependency
If you have more variables, you can also
consider a "fractional" design...

(a carefully chosen subset of experiments)
better
simplex
lattice
design
central
composite
design
Box-Behnken
design
face-centered
central composite
design
Latin hypercube
design
Sphere packing
design
Uniform design Maximum
entropy design
Technical note 2/3: Least-squares regression
"Predictors"

(descriptors)
"Response"
(target)
Regression equation for prediction
that minimizes the sum of (error-bar height)
simple matrix multiplication
2
error-bar
fitting a plane!
Technical note 3/3: Least-squares regression
Regression equation for prediction
Exactly same calculation!
fitting a surface!
called "response surface" models (RSMs) in DoE
If we set like this...
all calculated from X and Y

(composite variables)
First study 3/4
1. Extrapolation is subject to greater uncertainty and error than interpolation.
2. The predictive power of any statistical model is dependent upon the
variation in the distribution of data points used to develop the model.
Harper & Sigman, PNAS 2011.
Their two key tenets
First study 3/4
1. Extrapolation is subject to greater uncertainty and error than interpolation.
2. The predictive power of any statistical model is dependent upon the
variation in the distribution of data points used to develop the model.
Harper & Sigman, PNAS 2011.
Their two key tenets
Carefully designed and evenly spaced ligand space according to the
range of substituent Charton values that are synthetically accessible!
A 3 x 3 library defined
First study 3/4 Harper & Sigman, PNAS 2011.
H Me iPr tBu
Me
tBu
CEt3
The response surface model fitted
to 3x3 design by linear regression
2 replicates for each point in 3x3
Not reproducible area
First study 3/4 Harper & Sigman, PNAS 2011.
Et
The response surface model fitted
to 3x3 design by linear regression
2 replicates for each point in 3x3
H Me iPr tBu
Me
tBu
CEt3
Second study 1/4
They admitted that ...
they were aware of the likely best catalyst structure through extensive
trial-and-error screening in the development of these systems, and the
identity of the optimal predicted catalyst(s) was not surprising.
Harper & Sigman, Science 2011.
Second study 1/4
They admitted that ...
they were aware of the likely best catalyst structure through extensive
trial-and-error screening in the development of these systems, and the
identity of the optimal predicted catalyst(s) was not surprising.
De novo application of this approach
They applied this to an NHK-type propargylation
reaction that had been abondoned years earlier in
the group because of their inability, for more than 2
years, to achieve effective asymmetric catalysis....
Harper & Sigman, Science 2011.
Second study 1/4
They admitted that ...
they were aware of the likely best catalyst structure through extensive
trial-and-error screening in the development of these systems, and the
identity of the optimal predicted catalyst(s) was not surprising.
De novo application of this approach
They applied this to an NHK-type propargylation
reaction that had been abondoned years earlier in
the group because of their inability, for more than 2
years, to achieve effective asymmetric catalysis....
It turned out 

the best predicted one was

limited to 50% ee...
Harper & Sigman, Science 2011.
Second study 2/4
By screening only 9 ligands, they could confidently reach an informed
decision that a change in catalyst structure was necessary to proceed
with catalyst optimization.
Harper & Sigman, Science 2011.
Second study 2/4
By screening only 9 ligands, they could confidently reach an informed
decision that a change in catalyst structure was necessary to proceed
with catalyst optimization.
Several observations for redesigning the catalyst.
1. oxazoline unit did not bias the
reaction to a great extent
2. a Hammett correlation between the
enantioselectivity and the substrate
structure was observed. (A significant
electronic contribution from the substrate
impacts the enantioselectivity)
oxazoline
Harper & Sigman, Science 2011.
Second study 3/4
These two observations implied ...
The new catalyst design should incorporate a readily modifiable group
whose electronics could be systematically perturbed to override the
innate electronic bias of the substrate.
Harper & Sigman, Science 2011.
Second study 3/4
These two observations implied ...
The new catalyst design should incorporate a readily modifiable group
whose electronics could be systematically perturbed to override the
innate electronic bias of the substrate.
A 3 x 3 library, replacing the oxazoline with a quinoline, with one dimension
manipulated electronically (E) and the other sterically (S).
oxazoline
quinoline
pyridine
Harper & Sigman, Science 2011.
Side note 1/2: LFERs and Hammett equation
"The Hammett equation (and its extended forms) has been one of the
most widely used means for the study and interpretation of organic
reactions and their mechanisms." (Hansch, Leo & Taft, Chem Rev 1991)
Side note 1/2: LFERs and Hammett equation
"The Hammett equation (and its extended forms) has been one of the
most widely used means for the study and interpretation of organic
reactions and their mechanisms." (Hansch, Leo & Taft, Chem Rev 1991)
For any two reactions with two aromatic reactants only differing in the
type of substituent , the change in free energy of activation is
proportional to the change in Gibbs free energy. (i.e. a "linear" relationship)
equilibrium
constant with
that with = H
(reference)
substituent constant
(depends only on )
reaction constant (depends on
reactions but independent from )
"parameter"

values for
Side note 1/2: LFERs and Hammett equation
"The Hammett equation (and its extended forms) has been one of the
most widely used means for the study and interpretation of organic
reactions and their mechanisms." (Hansch, Leo & Taft, Chem Rev 1991)
• This is an empirical trend, not follow from any chemical theory 

(thus also long critisized from the theoretical side?).
• A surprise was, this equation also holds for reaction rates.
• This triggered many other useful variations as linear free-energy
relationships (LFER) as well as nonlinear ones such as quadratic.
For any two reactions with two aromatic reactants only differing in the
type of substituent , the change in free energy of activation is
proportional to the change in Gibbs free energy. (i.e. a "linear" relationship)
equilibrium
constant with
that with = H
(reference)
substituent constant
(depends only on )
reaction constant (depends on
reactions but independent from )
"parameter"

values for
Side note 2/2: LFER is generalized to QSAR/QSPR
LFERs or prediction by empirical trends in data also yielded 

a long history of QSAR/QSPR in Chemoinformatics 

since Taft/Hammett and Hansch-Fujita, etc....
: Hansch (2014)
Second study 4/4
This allowed them to confidently identify a catalyst that performed well
for a wide range of subtrates with differential electronic features.
The same approach is also applied to identify highly enantioselective
catalysts for the Pd-catalyzed redox-relay Heck reactions of alkenols 

(Werner, Mei, Burckle & Sigman, Science 2012, 338, 1455-1458).
Hammett parameters (σ)
Charton parameters (ν)
electronical effect (E)
sterical effect (S)
Harper & Sigman, Science 2011.
Toward parameter (re-)considerations
It is critical to use appropriate parameters (descriptors) for characteristically
describing the properties of substituents...
For example, Taft/Charton values have been historically controversial
because of the limited generality... (They arose from a particular mechanism)
Charton values
NHK allylation Desymmetrization of bisphenols
Worked. Failed.
Harper, Bess & Sigman, Nat Chem 2012.
Sterimol Harper, Bess & Sigman, Nat Chem 2012.
(Rediscovery of) Sterimol parameters
• Develop in 70's (Verloop & Tipker, 1977)
• Three parameters (L, B1, B5)
• Computationally derived.
Worked. Worked.
Third study 1/2 Milo, Bess & Sigman, Nature 2014.
New computationally derived parameters: 

Use IR vibrations as parameters to describe simultaneous
steric and electronic perturbations to structure.
To this point, a key problem in the history of LFERs has avoided:

the inability to analyze substituents with changes to both size
and electronic nature.
(Use computational science!)
Third study 2/2
To this point, a key problem in the history of LFERs has avoided:

the inability to analyze substituents with changes to both size
and electronic nature.
1. Perform a geometry minimization and frequency calculation of starting
materials
2. Compute vibrations as well as other parameters, including Sterimol
values, NBO charges, and geometric factors such as torsion angles.
3. Abridge the parameter set to a statistically manageable number.
4. Perform and evaluate a systematic iterative process to establish and
improve statistical predictive models.
New computationally derived parameters: 

IR(Infrared)-spectroscopy-based parameters to describe
simultaneous steric and electronic perturbations to structure.
Milo, Bess & Sigman, Nature 2014.
Applications of this to evaluate the substrate scope
This approach is intended to use in catalyst optimization, but can
be applied to different challenges such as "substrate scope".
1. Harper, Vilardi & Sigman, JACS 2013 

The catalyst and substrate structures were jointly analyzed to predict
the "best" catalyst/substrate combinations.
2. Bess, Bischoff, Sigman, PNAS 2014 

A modest-sized library of substrates was designed to build a model for
predicting the performance of even unreported substrates, and
investigated a Rh-catalyzed asymmetric transfer hydrogeneation of
ketones.
Returned to their original pursuit: reaction mechanisms
They hypothesized
their predictive models (multidimensional correlations) would
encompass mechanism-rich information.
However, inferring mechanistic information becomes a daunting task

because
1. the models become increasingly complex.
2. the models contain compounded parameters that are not
representative of one distinct structural characteristic.
A data-intensive approach to mechanistic elucidation requires
philoshophical changes: One has to overcome the entrenched
dogma that low-enantioselectivity measurements are "bad results".
Every value obtained in the course of data collection is useful and
can serve to reveal structural information regarding the mechanism!
4th/5th studies
Niemeyer, Milo, Hickey & Sigman, Nat Chem 2016

Milo, Neel, Toste & Sigman, Science 2015
1. Niemeyer, Milo, Hickey & Sigman, Nat Chem, 2016 

Parameter selection is not intuitive and an effective parameter set is
statistically obtained for phosphine ligands that served to distinguish
between two limiting mechanistic regimes in the site-selective oxidative
addition in a Suzuki reaction.
2. Milo, Neel, Toste & Sigman, Science 2015 

Synthetically reasonable and mechanistically informative catalysts,
substrates, or reagents are virtually predicted, and then prepared and
tested experimentally in the context of chiral phase-transfer catalysis.
This tactic requires the design and preparation of a systematic and
synthetically modular library of catalysts, substrates, and reagents.
The experimental design defines the effects that can be explained
by the produced models.
Collaborative and further studies
1. Neel, Milo, Sigman, Toste, JACS, 2016 

Chiral phosphoric acid-catalyzed fluorination of allylic alcohols
2. Bess, DeLuca, Tindall, Oderinde, Roizen, Du Bois & Sigman,
JACS, 2014 

Rh-catalyzed C-H amination reaction
3. Bess, Guptill, Davies & Sigman, Chem Sci, 2015 

a carbene C-H insertion reaction
4. Mougel, Santiago, Zhizhko, Bess, Varga, Frater, Sigman, Copéret,
JACS, 2015

Turnover frequency (TOF) of solid-supported W-based catalysts for
olefin metathesis
5. Zhang, Santiago, Crawford & Sigman, JACS, 2015

ligand optimization for Pd-catalyzed enantioselective redox-relay Heck
reactions
6. Zhang, Santiago, Kou & Sigman, JACS, 2015

evaluation of solvent effects for Pd-catalyzed enantioselective redox-
relay Heck reactions
Lessons we learned
1. The design of experiments (DoE) is crucial to obtain causal
insights on the target reactions.
2. We can computationally derive the physical organic chemical
parameters to describe the mechanistic features of reactions.
3. "In brief, one needs a reliable and accurate assay for the
output of interest, a reasonable spread of such outputs to
facilitate statistical analysis, and a reasonable level of
modularity in the process under study."
Deconstructing rather complex relationships still requires
profound physical organic chemistry expertise, the use
of various classical tools in this field, and creativity and
intuition to understand sophisticated processes.
After 2016: The 2019 Nature paper and beyond
1 page summary of the 2019 Nature paper
Method: Forward stepwise linear regression of Matlab statistics toolbox (since Nature 2014?)
Give up DoE and go more towards a "ML/AI"-like approach for out-of-sample prediction.
nucleophile
imine starting
material
amine
product
chiral
phosphoric
acid catalyst
• 367 reactions 

(manually curated from 17 papers)
• 313 parameters from every components

(Steric, DFT-derived, 2D chemoinfo
descriptors, conditions)
Proof-of-concept with a reaction class with different
catalyst chemotypes, substrate structural types.
out-of-sample prediction
Note: this is predicted
vs measured plot

(not a fitted line)
Lessons we learned
1. The design of experiments (DoE) is crucial to obtain causal
insights on the target reactions.
2. We can computationally derive the physical organic chemical
parameters to describe the mechanistic features of reactions.
3. "In brief, one needs a reliable and accurate assay for the
output of interest, a reasonable spread of such outputs to
facilitate statistical analysis, and a reasonable level of
modularity in the process under study."
Deconstructing rather complex relationships still requires
profound physical organic chemistry expertise, the use
of various classical tools in this field, and creativity and
intuition to understand sophisticated processes.
References
References
References
References
References
References

More Related Content

What's hot

機械学習は化学研究の"経験と勘"を合理化できるか?
機械学習は化学研究の"経験と勘"を合理化できるか?機械学習は化学研究の"経験と勘"を合理化できるか?
機械学習は化学研究の"経験と勘"を合理化できるか?Ichigaku Takigawa
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
Machine Learning in Chemistry: Part II
Machine Learning in Chemistry: Part IIMachine Learning in Chemistry: Part II
Machine Learning in Chemistry: Part IIJon Paul Janet
 
Machine Learning in Chemistry: Part I
Machine Learning in Chemistry: Part IMachine Learning in Chemistry: Part I
Machine Learning in Chemistry: Part IJon Paul Janet
 
Case Study: Caltech 'Orchid' Fundamental Research Project
Case Study: Caltech 'Orchid' Fundamental Research ProjectCase Study: Caltech 'Orchid' Fundamental Research Project
Case Study: Caltech 'Orchid' Fundamental Research ProjectSociotechnical Roundtable
 
A Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge SystemsA Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge Systemsaimsnist
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Designaimsnist
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...aimsnist
 
Scikit-learn : Machine Learning in Python
Scikit-learn : Machine Learning in PythonScikit-learn : Machine Learning in Python
Scikit-learn : Machine Learning in PythonAjay Ohri
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...aimsnist
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...aimsnist
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML modelaimsnist
 
The potential role of ai in the minimisation and mitigation of project delay
The potential role of ai in the minimisation and mitigation of project delayThe potential role of ai in the minimisation and mitigation of project delay
The potential role of ai in the minimisation and mitigation of project delayPieter Rautenbach
 
人工知能の基本問題:これまでとこれから
人工知能の基本問題:これまでとこれから人工知能の基本問題:これまでとこれから
人工知能の基本問題:これまでとこれからIchigaku Takigawa
 
Nature Inspired Metaheuristic Algorithms
Nature Inspired Metaheuristic AlgorithmsNature Inspired Metaheuristic Algorithms
Nature Inspired Metaheuristic AlgorithmsIRJET Journal
 
Resource Allocation Using Metaheuristic Search
Resource Allocation Using Metaheuristic SearchResource Allocation Using Metaheuristic Search
Resource Allocation Using Metaheuristic Searchcsandit
 
Biology-Derived Algorithms in Engineering Optimization
Biology-Derived Algorithms in Engineering OptimizationBiology-Derived Algorithms in Engineering Optimization
Biology-Derived Algorithms in Engineering OptimizationXin-She Yang
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"ieee_cis_cyprus
 

What's hot (20)

機械学習は化学研究の"経験と勘"を合理化できるか?
機械学習は化学研究の"経験と勘"を合理化できるか?機械学習は化学研究の"経験と勘"を合理化できるか?
機械学習は化学研究の"経験と勘"を合理化できるか?
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Machine Learning in Chemistry: Part II
Machine Learning in Chemistry: Part IIMachine Learning in Chemistry: Part II
Machine Learning in Chemistry: Part II
 
Machine Learning in Chemistry: Part I
Machine Learning in Chemistry: Part IMachine Learning in Chemistry: Part I
Machine Learning in Chemistry: Part I
 
Case Study: Caltech 'Orchid' Fundamental Research Project
Case Study: Caltech 'Orchid' Fundamental Research ProjectCase Study: Caltech 'Orchid' Fundamental Research Project
Case Study: Caltech 'Orchid' Fundamental Research Project
 
A Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge SystemsA Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge Systems
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
 
Scikit-learn : Machine Learning in Python
Scikit-learn : Machine Learning in PythonScikit-learn : Machine Learning in Python
Scikit-learn : Machine Learning in Python
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model
 
The potential role of ai in the minimisation and mitigation of project delay
The potential role of ai in the minimisation and mitigation of project delayThe potential role of ai in the minimisation and mitigation of project delay
The potential role of ai in the minimisation and mitigation of project delay
 
人工知能の基本問題:これまでとこれから
人工知能の基本問題:これまでとこれから人工知能の基本問題:これまでとこれから
人工知能の基本問題:これまでとこれから
 
Nature Inspired Metaheuristic Algorithms
Nature Inspired Metaheuristic AlgorithmsNature Inspired Metaheuristic Algorithms
Nature Inspired Metaheuristic Algorithms
 
Resource Allocation Using Metaheuristic Search
Resource Allocation Using Metaheuristic SearchResource Allocation Using Metaheuristic Search
Resource Allocation Using Metaheuristic Search
 
Presentation1
Presentation1Presentation1
Presentation1
 
Biology-Derived Algorithms in Engineering Optimization
Biology-Derived Algorithms in Engineering OptimizationBiology-Derived Algorithms in Engineering Optimization
Biology-Derived Algorithms in Engineering Optimization
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"
 

Similar to How to use data to design and optimize reaction? A quick introduction to work from Sigman lab

Mass Spectrometry Applications and spectral interpretation: Basics
Mass Spectrometry Applications and spectral interpretation: BasicsMass Spectrometry Applications and spectral interpretation: Basics
Mass Spectrometry Applications and spectral interpretation: BasicsShreekant Deshpande
 
Trends in Computer Science and Information Technology
Trends in Computer Science and Information TechnologyTrends in Computer Science and Information Technology
Trends in Computer Science and Information Technologypeertechzpublication
 
Computational Organic Chemistry
Computational Organic ChemistryComputational Organic Chemistry
Computational Organic ChemistryIsamu Katsuyama
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework helpalicalland
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework helpgaryantione
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework helppowellabril
 
Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...
Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...
Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...Richard West
 
IRJET- Structural Analysis of Viscoelastic Materials: Review Paper
IRJET- Structural Analysis of Viscoelastic Materials: Review PaperIRJET- Structural Analysis of Viscoelastic Materials: Review Paper
IRJET- Structural Analysis of Viscoelastic Materials: Review PaperIRJET Journal
 
2015 New trans-stilbene derivatives with large TPA values
2015 New trans-stilbene derivatives with large TPA values2015 New trans-stilbene derivatives with large TPA values
2015 New trans-stilbene derivatives with large TPA valuesvarun Kundi
 
Quantum pharmacology. Basics
Quantum pharmacology. BasicsQuantum pharmacology. Basics
Quantum pharmacology. BasicsMobiliuz
 
Quantative Structure-Activity Relationships (QSAR)
Quantative Structure-Activity Relationships (QSAR)Quantative Structure-Activity Relationships (QSAR)
Quantative Structure-Activity Relationships (QSAR)Atai Rabby
 
Open-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesOpen-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesGreg Landrum
 
A guide to molecular mechanics and quantum chemical calculations
A guide to molecular mechanics and quantum chemical calculationsA guide to molecular mechanics and quantum chemical calculations
A guide to molecular mechanics and quantum chemical calculationsSapna Jha
 
Molecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryMolecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryLee Larcombe
 

Similar to How to use data to design and optimize reaction? A quick introduction to work from Sigman lab (20)

Mass Spectrometry Applications and spectral interpretation: Basics
Mass Spectrometry Applications and spectral interpretation: BasicsMass Spectrometry Applications and spectral interpretation: Basics
Mass Spectrometry Applications and spectral interpretation: Basics
 
Trends in Computer Science and Information Technology
Trends in Computer Science and Information TechnologyTrends in Computer Science and Information Technology
Trends in Computer Science and Information Technology
 
Computational Organic Chemistry
Computational Organic ChemistryComputational Organic Chemistry
Computational Organic Chemistry
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework help
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework help
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework help
 
Chemistry homework help
Chemistry homework helpChemistry homework help
Chemistry homework help
 
Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...
Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...
Incorporation of Linear Scaling Relations into Automatic Mechanism Generation...
 
Retrosynth
RetrosynthRetrosynth
Retrosynth
 
A systematic approach for the generation and verification of structural hypot...
A systematic approach for the generation and verification of structural hypot...A systematic approach for the generation and verification of structural hypot...
A systematic approach for the generation and verification of structural hypot...
 
IRJET- Structural Analysis of Viscoelastic Materials: Review Paper
IRJET- Structural Analysis of Viscoelastic Materials: Review PaperIRJET- Structural Analysis of Viscoelastic Materials: Review Paper
IRJET- Structural Analysis of Viscoelastic Materials: Review Paper
 
2015 New trans-stilbene derivatives with large TPA values
2015 New trans-stilbene derivatives with large TPA values2015 New trans-stilbene derivatives with large TPA values
2015 New trans-stilbene derivatives with large TPA values
 
Quantum pharmacology. Basics
Quantum pharmacology. BasicsQuantum pharmacology. Basics
Quantum pharmacology. Basics
 
Quantative Structure-Activity Relationships (QSAR)
Quantative Structure-Activity Relationships (QSAR)Quantative Structure-Activity Relationships (QSAR)
Quantative Structure-Activity Relationships (QSAR)
 
Open-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesOpen-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databases
 
A guide to molecular mechanics and quantum chemical calculations
A guide to molecular mechanics and quantum chemical calculationsA guide to molecular mechanics and quantum chemical calculations
A guide to molecular mechanics and quantum chemical calculations
 
Applications of Computational Quantum Chemistry
Applications of Computational Quantum ChemistryApplications of Computational Quantum Chemistry
Applications of Computational Quantum Chemistry
 
Poster_Jun 2014
Poster_Jun 2014Poster_Jun 2014
Poster_Jun 2014
 
Molecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryMolecular modelling for in silico drug discovery
Molecular modelling for in silico drug discovery
 
CADD
CADDCADD
CADD
 

More from Ichigaku Takigawa

データ社会を生きる技術
〜機械学習の夢と現実〜
データ社会を生きる技術
〜機械学習の夢と現実〜データ社会を生きる技術
〜機械学習の夢と現実〜
データ社会を生きる技術
〜機械学習の夢と現実〜Ichigaku Takigawa
 
機械学習を科学研究で使うとは?
機械学習を科学研究で使うとは?機械学習を科学研究で使うとは?
機械学習を科学研究で使うとは?Ichigaku Takigawa
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesIchigaku Takigawa
 
Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...
Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...
Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...Ichigaku Takigawa
 
機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開
機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開
機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開Ichigaku Takigawa
 
機械学習と機械発見:自然科学研究におけるデータ利活用の再考
機械学習と機械発見:自然科学研究におけるデータ利活用の再考機械学習と機械発見:自然科学研究におけるデータ利活用の再考
機械学習と機械発見:自然科学研究におけるデータ利活用の再考Ichigaku Takigawa
 
小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜
小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜
小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜Ichigaku Takigawa
 
"データ化"する化学と情報技術・人工知能・データサイエンス
"データ化"する化学と情報技術・人工知能・データサイエンス"データ化"する化学と情報技術・人工知能・データサイエンス
"データ化"する化学と情報技術・人工知能・データサイエンスIchigaku Takigawa
 
自然科学における機械学習と機械発見
自然科学における機械学習と機械発見自然科学における機械学習と機械発見
自然科学における機械学習と機械発見Ichigaku Takigawa
 
幾何と機械学習: A Short Intro
幾何と機械学習: A Short Intro幾何と機械学習: A Short Intro
幾何と機械学習: A Short IntroIchigaku Takigawa
 
決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割
決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割
決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割Ichigaku Takigawa
 
機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと
機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと
機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいことIchigaku Takigawa
 
自己紹介:機械学習・機械発見とデータ中心的自然科学
自己紹介:機械学習・機械発見とデータ中心的自然科学自己紹介:機械学習・機械発見とデータ中心的自然科学
自己紹介:機械学習・機械発見とデータ中心的自然科学Ichigaku Takigawa
 
機械学習・機械発見から見るデータ中心型化学の野望と憂鬱
機械学習・機械発見から見るデータ中心型化学の野望と憂鬱機械学習・機械発見から見るデータ中心型化学の野望と憂鬱
機械学習・機械発見から見るデータ中心型化学の野望と憂鬱Ichigaku Takigawa
 
Machine Learning for Molecular Graph Representations and Geometries
Machine Learning for Molecular Graph Representations and GeometriesMachine Learning for Molecular Graph Representations and Geometries
Machine Learning for Molecular Graph Representations and GeometriesIchigaku Takigawa
 
(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから
(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから
(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれからIchigaku Takigawa
 
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)Ichigaku Takigawa
 
(2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから
(2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから (2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから
(2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから Ichigaku Takigawa
 
帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)
帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)
帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)Ichigaku Takigawa
 

More from Ichigaku Takigawa (20)

機械学習と自動微分
機械学習と自動微分機械学習と自動微分
機械学習と自動微分
 
データ社会を生きる技術
〜機械学習の夢と現実〜
データ社会を生きる技術
〜機械学習の夢と現実〜データ社会を生きる技術
〜機械学習の夢と現実〜
データ社会を生きる技術
〜機械学習の夢と現実〜
 
機械学習を科学研究で使うとは?
機械学習を科学研究で使うとは?機械学習を科学研究で使うとは?
機械学習を科学研究で使うとは?
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree Ensembles
 
Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...
Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...
Exploring Practices in Machine Learning and Machine Discovery for Heterogeneo...
 
機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開
機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開
機械学習と機械発見:自然科学融合が誘起するデータ科学の新展開
 
機械学習と機械発見:自然科学研究におけるデータ利活用の再考
機械学習と機械発見:自然科学研究におけるデータ利活用の再考機械学習と機械発見:自然科学研究におけるデータ利活用の再考
機械学習と機械発見:自然科学研究におけるデータ利活用の再考
 
小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜
小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜
小1にルービックキューブを教えてみた 〜群論スポーツの教育とパターン認知〜
 
"データ化"する化学と情報技術・人工知能・データサイエンス
"データ化"する化学と情報技術・人工知能・データサイエンス"データ化"する化学と情報技術・人工知能・データサイエンス
"データ化"する化学と情報技術・人工知能・データサイエンス
 
自然科学における機械学習と機械発見
自然科学における機械学習と機械発見自然科学における機械学習と機械発見
自然科学における機械学習と機械発見
 
幾何と機械学習: A Short Intro
幾何と機械学習: A Short Intro幾何と機械学習: A Short Intro
幾何と機械学習: A Short Intro
 
決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割
決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割
決定森回帰の信頼区間推定, Benign Overfitting, 多変量木とReLUネットの入力空間分割
 
機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと
機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと
機械学習を自然現象の理解・発見に使いたい人に知っておいてほしいこと
 
自己紹介:機械学習・機械発見とデータ中心的自然科学
自己紹介:機械学習・機械発見とデータ中心的自然科学自己紹介:機械学習・機械発見とデータ中心的自然科学
自己紹介:機械学習・機械発見とデータ中心的自然科学
 
機械学習・機械発見から見るデータ中心型化学の野望と憂鬱
機械学習・機械発見から見るデータ中心型化学の野望と憂鬱機械学習・機械発見から見るデータ中心型化学の野望と憂鬱
機械学習・機械発見から見るデータ中心型化学の野望と憂鬱
 
Machine Learning for Molecular Graph Representations and Geometries
Machine Learning for Molecular Graph Representations and GeometriesMachine Learning for Molecular Graph Representations and Geometries
Machine Learning for Molecular Graph Representations and Geometries
 
(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから
(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから
(2021.11) 機械学習と機械発見:データ中心型の化学・材料科学の教訓とこれから
 
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
 
(2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから
(2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから (2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから
(2021.10) 機械学習と機械発見 データ中心型の化学・材料科学の教訓とこれから
 
帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)
帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)
帰納バイアスと分子の組合せ的表現・幾何的表現 (本発表)
 

Recently uploaded

insect anatomy and insect body wall and their physiology
insect anatomy and insect body wall and their  physiologyinsect anatomy and insect body wall and their  physiology
insect anatomy and insect body wall and their physiologyDrAnita Sharma
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett SquareIsiahStephanRadaza
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsCharlene Llagas
 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10ROLANARIBATO3
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Temporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of MasticationTemporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of Masticationvidulajaib
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Welcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayWelcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayZachary Labe
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 

Recently uploaded (20)

insect anatomy and insect body wall and their physiology
insect anatomy and insect body wall and their  physiologyinsect anatomy and insect body wall and their  physiology
insect anatomy and insect body wall and their physiology
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett Square
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of Traits
 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Temporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of MasticationTemporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of Mastication
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
Welcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayWelcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work Day
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 

How to use data to design and optimize reaction? A quick introduction to work from Sigman lab

  • 1. How to use data to design and optimize reactions? ⎯⎯⎯⎯⎯⎯ A quick introduction to work from Sigman lab Speaker: Kosuke Higashida (Sawamura-G) Slides: Ichigaku Takigawa (Takigawa-G) ICReDD Fusion Research Seminar, Apr 27 2020.
  • 2. Today's talk Introduce papers from Matthew S. Sigman's group that 
 considers how to use data to design and optimize catalysts. Matthew S Sigman Dept. Chemistry
 The University of Utah Organic Synthesis & Asymmetric Catalysis MS Sigman, KC Harper, EN Bess, A Milo
 The development of multidimensional analysis tools for asymmetric catalysis and beyond.
 Accounts of Chemical Research 2016 49 (6), 1292-1301. summarized the following studies from the lab. 1. Harper & Sigman, PNAS 2011 2. Harper & Sigman, Science 2011 • Werner, Mei, Burckle & Sigman, Science 2012 3. Milo, Bess & Sigman, Nature 2014 • Harper, Vilardi & Sigman, JACS 2013 • Bess, Bischoff, Sigman, PNAS 2014 4. Niemeyer, Milo, Hickey & Sigman, Nat Chem 2016;
 Milo, Neel, Toste & Sigman, Science 2015
  • 3. Our consistent theme today • Chemically, it's about how to rationally design and improve asymmetric catalysis ( , 不对称合成). • Chemoinformatically, it's about QSAR (Quantitative Structure- Activity Relationship) and descriptors. • Statistically/Mathematically, it's about an application of linear regression (design of experiments). How to grasp "empirical" trends in experiments, and how to effectively use them to develop new catalytic reactions?
  • 4. Outline 1. Harper & Sigman, PNAS. 2011
 Proof of Concept (PoC)
 Consider "Design of Experiments (DoE)" (Information Science) 2. Harper & Sigman, Science. 2011
 De novo Applications of 1 • Werner, Mei, Burckle & Sigman, Science 2012 3. Milo, Bess & Sigman, Nature. 2014
 "More sophisticated descriptors" (Computational Science) • Harper, Vilardi & Sigman, JACS, 2013 • Bess, Bischoff, Sigman, PNAS, 2014 4. Niemeyer, Milo, Hickey & Sigman, Nat Chem 2016;
 Milo, Neel, Toste & Sigman, Science 2015
 Establish a data-intensive workflow for "the study of reaction mechanisms" 0. Quick Introduction to Asymmetric Synthesis
  • 5. Asymmetric synthesis (Enantioselective synthesis) • A molecule is called "chiral" if it cannot be superposed on its mirror. • A chiral molecule exists in two stereoisomers that are mirror images of each other, called "enantiomers". • They have the same chemical properties, except when reacting with other chiral compounds. • They also have the same physical properties, except that they often have opposite optical activities. • A chiral molecule must have at least one "chiral center" or "stereocenter". A reaction that produces the stereoisomeric (enantiomeric or diastereoisomeric) products in unequal amounts. A key process in modern chemistry, particularly for pharmaceuticals! "enantiomers"
  • 6. A starting example 0.1 equiv. CrCl3 0.1 equiv. Chiral Ligand 2 equiv. Mn(0) 0.1 equiv. Triethylamine 2 equiv. SiMe3Cl Nozaki-Hiyama-Kishi (NHK) allylation reaction THF, RT, 20h R
  • 7. How to read this? 0.1 equiv. CrCl3 0.1 equiv. Chiral Ligand Nozaki-Hiyama-Kishi (NHK) allylation reaction Substrates (Reactants) Product THF, RT, 20h Catalyst Ligand (binds to Cr) Reagents Solvent Reaction Temperature (RT = Room Temperature) Reaction Time Starting material ( ) 2 equiv. Mn(0) 0.1 equiv. Triethylamine 2 equiv. SiMe3Cl Reaction partner
  • 8. A starting example 0.1 equiv. CrCl3 0.1 equiv. Chiral Ligand Nozaki-Hiyama-Kishi (NHK) allylation reaction THF, RT, 20h an activated allyl group is added to carbonyl group Phenyl-group Asymmetric center R 2 equiv. Mn(0) 0.1 equiv. Triethylamine 2 equiv. SiMe3Cl
  • 9. A starting example 0.1 equiv. CrCl3 0.1 equiv. Chiral Ligand Nozaki-Hiyama-Kishi (NHK) allylation reaction THF, RT, 20h an activated allyl group is added to carbonyl group Phenyl-group or SR Enantiomer ratio: e.r. Enantiomeric Excess: e.e. AR : AS R 2 equiv. Mn(0) 0.1 equiv. Triethylamine 2 equiv. SiMe3Cl
  • 10. Our problem for now What "substituent" for can improve the enantioselectivity, preferably for a wide range of substrates? "Ligands" ( ) iPr Bn ( ) Groups for ?
 "Substituents" • Ph • tBu • CF3 • CH(CH3)2 • OiPr • Cl • CH3 • CHO • CO2Me • COMe • COOH • OEt • NMe2 • OMe • NH2 • NO2 • OH • F • H= isopropyl group = benzyl group tBu = tertiary butyl 
 group Me = methyl group Et = ethyl group
  • 11. Sigman's approach 1. Use high-throughput assays to rapidly evaluate many reaction conditions and catalyst structures. 2. Investigate the reaction mechanism to inform the design of better-performing catalysts.
 (by computing various transition-state structures) Two common tactics of many labs (including Sigman's lab) The newly proposed approach of Sigman's lab Simultaneously optimize reaction performance and interrogate mechanistic features of reactions by relating empirical results of carefully designed datasets to physical organic descriptors.
  • 12. Backstory before the new approach Sigman & Miller, J. Org. Chem. 2009, 74, 20, 7633-7643 They observed 1. Linear Free Energy Relationship (LFER):
 the enantioselectivity and the size of the proline substituent G (measured by Taft/Charton steric parameters) were positively correlated. 2. A break in the LFER when larger G groups were employed. [ ]‡ means 
 a transition state unclear multistep reactions
  • 13. Their two assumptions for the break of LFER Their focus on outliers: "results that appear at first glance to be peculiar or poor are considerably more interesting than ones that follow obvious or intuitive trends."
  • 14. Their two assumptions for the break of LFER Their focus on outliers: "results that appear at first glance to be peculiar or poor are considerably more interesting than ones that follow obvious or intuitive trends." 1. The poor extrapolation to larger substituents may be due to any change in the mechanisms of asymmetric catalysis 2. The Taft/Charton parameters implemented do not precisely describe the observed steric effect. These two assumptions for the break of LFER are the starting point of their newly developed approach.
  • 15. Their two assumptions for the break of LFER Their focus on outliers: "results that appear at first glance to be peculiar or poor are considerably more interesting than ones that follow obvious or intuitive trends." 1. The poor extrapolation to larger substituents may be due to any change in the mechanisms of asymmetric catalysis 2. The Taft/Charton parameters implemented do not precisely describe the observed steric effect. These two assumptions for the break of LFER are the starting point of their newly developed approach. Design of experiments? Data-intensive methods to probe it? More sophisticated parameters? Computational parameters?
  • 16. First study 1/4 Harper & Sigman, PNAS 2011. Initially 25 ligands synthesized
 acc. the commercial availability and the ease of synthesis.
  • 17. First study 1/4 Harper & Sigman, PNAS 2011. Initially 25 ligands synthesized
 acc. the commercial availability and the ease of synthesis. LFER development by fitting a linear regression model. enantioselectivity enantiomeric ratio gas constant reaction temperature
  • 18. First study 1/4 Harper & Sigman, PNAS 2011. Initially 25 ligands synthesized
 acc. the commercial availability and the ease of synthesis. ... resulted in only moderate predictive power and did little to inform catalyst design. "How many and what distribution of data points would provide a meaningful model for prediction?" (from the initial review of the work) LFER development by fitting a linear regression model. enantioselectivity enantiomeric ratio gas constant reaction temperature
  • 19. First study 2/4 Apply the principles of "Design of Experiments (DoE)" using physical organic parameters. "Briefly, the precepts of DoE involve evenly spreading experiments to the sensitivity limits of a system using continuous variables (e.g. temperature and concentration)." Harper & Sigman, PNAS 2011.
  • 20. Technical note 1/3: Design of experiments (DoE) We have many variations, but in short... (1) Define coordinates
 with great care. (2) Examine/consider
 the limit of possible ranges. (3) Cover the region evenly 
 with interacting points. "32 factorial design"
 (full factorial design
 with 3 levels of 2 factors) Consult a statistician when descriptors have strong correlation or clear interdependency If you have more variables, you can also consider a "fractional" design...
 (a carefully chosen subset of experiments) better simplex lattice design central composite design Box-Behnken design face-centered central composite design Latin hypercube design Sphere packing design Uniform design Maximum entropy design
  • 21. Technical note 2/3: Least-squares regression "Predictors"
 (descriptors) "Response" (target) Regression equation for prediction that minimizes the sum of (error-bar height) simple matrix multiplication 2 error-bar fitting a plane!
  • 22. Technical note 3/3: Least-squares regression Regression equation for prediction Exactly same calculation! fitting a surface! called "response surface" models (RSMs) in DoE If we set like this... all calculated from X and Y
 (composite variables)
  • 23. First study 3/4 1. Extrapolation is subject to greater uncertainty and error than interpolation. 2. The predictive power of any statistical model is dependent upon the variation in the distribution of data points used to develop the model. Harper & Sigman, PNAS 2011. Their two key tenets
  • 24. First study 3/4 1. Extrapolation is subject to greater uncertainty and error than interpolation. 2. The predictive power of any statistical model is dependent upon the variation in the distribution of data points used to develop the model. Harper & Sigman, PNAS 2011. Their two key tenets Carefully designed and evenly spaced ligand space according to the range of substituent Charton values that are synthetically accessible! A 3 x 3 library defined
  • 25. First study 3/4 Harper & Sigman, PNAS 2011. H Me iPr tBu Me tBu CEt3 The response surface model fitted to 3x3 design by linear regression 2 replicates for each point in 3x3 Not reproducible area
  • 26. First study 3/4 Harper & Sigman, PNAS 2011. Et The response surface model fitted to 3x3 design by linear regression 2 replicates for each point in 3x3 H Me iPr tBu Me tBu CEt3
  • 27. Second study 1/4 They admitted that ... they were aware of the likely best catalyst structure through extensive trial-and-error screening in the development of these systems, and the identity of the optimal predicted catalyst(s) was not surprising. Harper & Sigman, Science 2011.
  • 28. Second study 1/4 They admitted that ... they were aware of the likely best catalyst structure through extensive trial-and-error screening in the development of these systems, and the identity of the optimal predicted catalyst(s) was not surprising. De novo application of this approach They applied this to an NHK-type propargylation reaction that had been abondoned years earlier in the group because of their inability, for more than 2 years, to achieve effective asymmetric catalysis.... Harper & Sigman, Science 2011.
  • 29. Second study 1/4 They admitted that ... they were aware of the likely best catalyst structure through extensive trial-and-error screening in the development of these systems, and the identity of the optimal predicted catalyst(s) was not surprising. De novo application of this approach They applied this to an NHK-type propargylation reaction that had been abondoned years earlier in the group because of their inability, for more than 2 years, to achieve effective asymmetric catalysis.... It turned out 
 the best predicted one was
 limited to 50% ee... Harper & Sigman, Science 2011.
  • 30. Second study 2/4 By screening only 9 ligands, they could confidently reach an informed decision that a change in catalyst structure was necessary to proceed with catalyst optimization. Harper & Sigman, Science 2011.
  • 31. Second study 2/4 By screening only 9 ligands, they could confidently reach an informed decision that a change in catalyst structure was necessary to proceed with catalyst optimization. Several observations for redesigning the catalyst. 1. oxazoline unit did not bias the reaction to a great extent 2. a Hammett correlation between the enantioselectivity and the substrate structure was observed. (A significant electronic contribution from the substrate impacts the enantioselectivity) oxazoline Harper & Sigman, Science 2011.
  • 32. Second study 3/4 These two observations implied ... The new catalyst design should incorporate a readily modifiable group whose electronics could be systematically perturbed to override the innate electronic bias of the substrate. Harper & Sigman, Science 2011.
  • 33. Second study 3/4 These two observations implied ... The new catalyst design should incorporate a readily modifiable group whose electronics could be systematically perturbed to override the innate electronic bias of the substrate. A 3 x 3 library, replacing the oxazoline with a quinoline, with one dimension manipulated electronically (E) and the other sterically (S). oxazoline quinoline pyridine Harper & Sigman, Science 2011.
  • 34. Side note 1/2: LFERs and Hammett equation "The Hammett equation (and its extended forms) has been one of the most widely used means for the study and interpretation of organic reactions and their mechanisms." (Hansch, Leo & Taft, Chem Rev 1991)
  • 35. Side note 1/2: LFERs and Hammett equation "The Hammett equation (and its extended forms) has been one of the most widely used means for the study and interpretation of organic reactions and their mechanisms." (Hansch, Leo & Taft, Chem Rev 1991) For any two reactions with two aromatic reactants only differing in the type of substituent , the change in free energy of activation is proportional to the change in Gibbs free energy. (i.e. a "linear" relationship) equilibrium constant with that with = H (reference) substituent constant (depends only on ) reaction constant (depends on reactions but independent from ) "parameter"
 values for
  • 36. Side note 1/2: LFERs and Hammett equation "The Hammett equation (and its extended forms) has been one of the most widely used means for the study and interpretation of organic reactions and their mechanisms." (Hansch, Leo & Taft, Chem Rev 1991) • This is an empirical trend, not follow from any chemical theory 
 (thus also long critisized from the theoretical side?). • A surprise was, this equation also holds for reaction rates. • This triggered many other useful variations as linear free-energy relationships (LFER) as well as nonlinear ones such as quadratic. For any two reactions with two aromatic reactants only differing in the type of substituent , the change in free energy of activation is proportional to the change in Gibbs free energy. (i.e. a "linear" relationship) equilibrium constant with that with = H (reference) substituent constant (depends only on ) reaction constant (depends on reactions but independent from ) "parameter"
 values for
  • 37. Side note 2/2: LFER is generalized to QSAR/QSPR LFERs or prediction by empirical trends in data also yielded 
 a long history of QSAR/QSPR in Chemoinformatics 
 since Taft/Hammett and Hansch-Fujita, etc.... : Hansch (2014)
  • 38. Second study 4/4 This allowed them to confidently identify a catalyst that performed well for a wide range of subtrates with differential electronic features. The same approach is also applied to identify highly enantioselective catalysts for the Pd-catalyzed redox-relay Heck reactions of alkenols 
 (Werner, Mei, Burckle & Sigman, Science 2012, 338, 1455-1458). Hammett parameters (σ) Charton parameters (ν) electronical effect (E) sterical effect (S) Harper & Sigman, Science 2011.
  • 39. Toward parameter (re-)considerations It is critical to use appropriate parameters (descriptors) for characteristically describing the properties of substituents... For example, Taft/Charton values have been historically controversial because of the limited generality... (They arose from a particular mechanism)
  • 40. Charton values NHK allylation Desymmetrization of bisphenols Worked. Failed. Harper, Bess & Sigman, Nat Chem 2012.
  • 41. Sterimol Harper, Bess & Sigman, Nat Chem 2012. (Rediscovery of) Sterimol parameters • Develop in 70's (Verloop & Tipker, 1977) • Three parameters (L, B1, B5) • Computationally derived. Worked. Worked.
  • 42. Third study 1/2 Milo, Bess & Sigman, Nature 2014. New computationally derived parameters: 
 Use IR vibrations as parameters to describe simultaneous steric and electronic perturbations to structure. To this point, a key problem in the history of LFERs has avoided:
 the inability to analyze substituents with changes to both size and electronic nature. (Use computational science!)
  • 43. Third study 2/2 To this point, a key problem in the history of LFERs has avoided:
 the inability to analyze substituents with changes to both size and electronic nature. 1. Perform a geometry minimization and frequency calculation of starting materials 2. Compute vibrations as well as other parameters, including Sterimol values, NBO charges, and geometric factors such as torsion angles. 3. Abridge the parameter set to a statistically manageable number. 4. Perform and evaluate a systematic iterative process to establish and improve statistical predictive models. New computationally derived parameters: 
 IR(Infrared)-spectroscopy-based parameters to describe simultaneous steric and electronic perturbations to structure. Milo, Bess & Sigman, Nature 2014.
  • 44. Applications of this to evaluate the substrate scope This approach is intended to use in catalyst optimization, but can be applied to different challenges such as "substrate scope". 1. Harper, Vilardi & Sigman, JACS 2013 
 The catalyst and substrate structures were jointly analyzed to predict the "best" catalyst/substrate combinations. 2. Bess, Bischoff, Sigman, PNAS 2014 
 A modest-sized library of substrates was designed to build a model for predicting the performance of even unreported substrates, and investigated a Rh-catalyzed asymmetric transfer hydrogeneation of ketones.
  • 45. Returned to their original pursuit: reaction mechanisms They hypothesized their predictive models (multidimensional correlations) would encompass mechanism-rich information. However, inferring mechanistic information becomes a daunting task
 because 1. the models become increasingly complex. 2. the models contain compounded parameters that are not representative of one distinct structural characteristic. A data-intensive approach to mechanistic elucidation requires philoshophical changes: One has to overcome the entrenched dogma that low-enantioselectivity measurements are "bad results". Every value obtained in the course of data collection is useful and can serve to reveal structural information regarding the mechanism!
  • 46. 4th/5th studies Niemeyer, Milo, Hickey & Sigman, Nat Chem 2016
 Milo, Neel, Toste & Sigman, Science 2015 1. Niemeyer, Milo, Hickey & Sigman, Nat Chem, 2016 
 Parameter selection is not intuitive and an effective parameter set is statistically obtained for phosphine ligands that served to distinguish between two limiting mechanistic regimes in the site-selective oxidative addition in a Suzuki reaction. 2. Milo, Neel, Toste & Sigman, Science 2015 
 Synthetically reasonable and mechanistically informative catalysts, substrates, or reagents are virtually predicted, and then prepared and tested experimentally in the context of chiral phase-transfer catalysis. This tactic requires the design and preparation of a systematic and synthetically modular library of catalysts, substrates, and reagents. The experimental design defines the effects that can be explained by the produced models.
  • 47. Collaborative and further studies 1. Neel, Milo, Sigman, Toste, JACS, 2016 
 Chiral phosphoric acid-catalyzed fluorination of allylic alcohols 2. Bess, DeLuca, Tindall, Oderinde, Roizen, Du Bois & Sigman, JACS, 2014 
 Rh-catalyzed C-H amination reaction 3. Bess, Guptill, Davies & Sigman, Chem Sci, 2015 
 a carbene C-H insertion reaction 4. Mougel, Santiago, Zhizhko, Bess, Varga, Frater, Sigman, Copéret, JACS, 2015
 Turnover frequency (TOF) of solid-supported W-based catalysts for olefin metathesis 5. Zhang, Santiago, Crawford & Sigman, JACS, 2015
 ligand optimization for Pd-catalyzed enantioselective redox-relay Heck reactions 6. Zhang, Santiago, Kou & Sigman, JACS, 2015
 evaluation of solvent effects for Pd-catalyzed enantioselective redox- relay Heck reactions
  • 48. Lessons we learned 1. The design of experiments (DoE) is crucial to obtain causal insights on the target reactions. 2. We can computationally derive the physical organic chemical parameters to describe the mechanistic features of reactions. 3. "In brief, one needs a reliable and accurate assay for the output of interest, a reasonable spread of such outputs to facilitate statistical analysis, and a reasonable level of modularity in the process under study." Deconstructing rather complex relationships still requires profound physical organic chemistry expertise, the use of various classical tools in this field, and creativity and intuition to understand sophisticated processes.
  • 49. After 2016: The 2019 Nature paper and beyond
  • 50. 1 page summary of the 2019 Nature paper Method: Forward stepwise linear regression of Matlab statistics toolbox (since Nature 2014?) Give up DoE and go more towards a "ML/AI"-like approach for out-of-sample prediction. nucleophile imine starting material amine product chiral phosphoric acid catalyst • 367 reactions 
 (manually curated from 17 papers) • 313 parameters from every components
 (Steric, DFT-derived, 2D chemoinfo descriptors, conditions) Proof-of-concept with a reaction class with different catalyst chemotypes, substrate structural types. out-of-sample prediction Note: this is predicted vs measured plot
 (not a fitted line)
  • 51. Lessons we learned 1. The design of experiments (DoE) is crucial to obtain causal insights on the target reactions. 2. We can computationally derive the physical organic chemical parameters to describe the mechanistic features of reactions. 3. "In brief, one needs a reliable and accurate assay for the output of interest, a reasonable spread of such outputs to facilitate statistical analysis, and a reasonable level of modularity in the process under study." Deconstructing rather complex relationships still requires profound physical organic chemistry expertise, the use of various classical tools in this field, and creativity and intuition to understand sophisticated processes.