Brian DeCost (1), Heshan Yu (2), Xiaohang Zhang (2), Seunghun Lee (2), Yangang Liang (2), Ichiro Takeuchi (2), Jason Hattrick-Simpers (1), A. Gilad Kusne (1)
(1) National Institute of Standards and Technology, (2) University of Maryland, College Park
Autonomous experimental phase diagram acquisition
2018.08.07 -- AIMS 2018 Meeting -- NIST, Gaithersburg MD
Brian DeCost
2
High-Throughput Experimental Materials Collaboratory
https://mgi.nist.gov/htemc
Distributed experimental materials science platform, built on a network of member institutes.
Realization of the HPC paradigm for experimental materials science?
Contact: martin.green@nist.gov
Look for a forthcoming white paper
3
Active clustering for phase diagram acquisition
4
Autonomous science systems
Tabor, Daniel P., et al. "Accelerating the discovery of materials for clean energy in the era of smart automation." Nat. Rev. Mater. 3 (2018): 5-20
https://doi.org/10.1038/s41578-018-0005-z
5
Parallel synthesis, serial characterization
Co-sputtering scheme (Ni, Mn, Al)
3” spread wafer
Phase diagram (Ni, Mn, Al)
Gregoire, J. M., et al. "High-throughput synchrotron X-ray diffraction for combinatorial phase mapping." Journal of synchrotron radiation 21.6 (2014): 1262-1268.
XRD (Bi, Fe, V)
6
Unsupervised phase diagram estimation is hard
Hattrick-Simpers, Jason R., John M. Gregoire, and A. Gilad Kusne.
"Perspective: Composition–structure–property mapping in high-throughput experiments: Turning data into knowledge." APL Materials 4.5 (2016): 053211.
https://doi.org/10.1063/1.4950995
What you really want:
- multi-phase: linear unmixing
- single-phase: invariance to peak shift
- infer the number of regions...
- respect thermodynamics
- leverage archival data
- deal with missing reflections
- fast!
Compromises we can live with
How to discover e.g. line compounds with this approach?
7
Simultaneous phase and property mapping
Kusne, Aaron Gilad, et al. "On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets." Scientific reports 4 (2014): 6367. 10.1038/srep06367
Finding novel rare-earth-free permanent magnets
8
GRENDEL: Iterative piecewise matrix factorization
Alternate between:
- clustering
- matrix factorization
Kusne, Aaron G., et al. "High-throughput determination of structural phase diagram and constituent phases using GRENDEL." Nanotechnology 26.44 (2015): 444002.
Include archival data from
- ICSD
- AFLOW
9
Amdahl's law in materials science
Speedup (innovation) is limited by the serial portion of the process!
parallel synthesis
MnNiGe: 535 'samples'
serial characterization
Lab diffractometer
30min per composition
2 weeks per ternary!
fast (serial) characterization
Synchrotron (SLAC)
30s per composition
4.5 hours per ternary
Exploit the structure of materials data to scale up
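The throughput figures on this slide can be checked with a few lines of arithmetic. The sketch below assumes back-to-back serial measurement with no overhead, and uses the standard Amdahl speedup formula S = 1 / ((1 - p) + p / n); the function names and the 10% serial fraction in the last line are illustrative assumptions, not values from the talk.

```python
def amdahl_speedup(parallel_fraction: float, n_workers: float) -> float:
    """Overall speedup when only `parallel_fraction` of the work runs n_workers times faster."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / n_workers)


def serial_hours(n_samples: int, seconds_per_sample: float) -> float:
    """Wall-clock hours needed to measure n_samples one after another."""
    return n_samples * seconds_per_sample / 3600.0


n_samples = 535                                  # MnNiGe spread, from the slide
lab_hours = serial_hours(n_samples, 30 * 60)     # lab diffractometer, 30 min each
sync_hours = serial_hours(n_samples, 30)         # synchrotron, 30 s each

print(f"lab diffractometer: {lab_hours:.1f} h ({lab_hours / 24:.1f} days)")
print(f"synchrotron:        {sync_hours:.2f} h")
# Hypothetical case: even with unlimited parallelism, 10% serial work caps the speedup near 10x.
print(f"speedup limit with 10% serial work: {amdahl_speedup(0.9, 1e9):.1f}x")
```

Under these assumptions the lab diffractometer needs about 11 days of continuous measurement, the same order as the slide's "2 weeks per ternary", and the synchrotron needs about 4.5 hours.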
10
Autonomous run: cluster, extrapolate, select
[Figure: two ternary diagrams with corners Fe, Fe0.4Pd0.6, Fe0.4Ga0.6]
VO2
11
Metal-insulator transition: VNbO2, VWO2, etc.
[Figure: resistance R (Ω, log scale, 10^2 to 10^5) vs. temperature T (100 to 400 K) for W% of 0.33, 0.96, 1.36, 1.68, 2.27, 2.61, 2.86, and 3.44; c-Al2O3 substrate]
[Figure labels: Mixed, Tetragonal]
We'd like to efficiently determine metal-insulator transition temperatures experimentally in a variety of systems
9mm composition spread chip
Metal-insulator transition temperature decreases with doping
Monoclinic
By Original PNGs by Daniel Mayer, traced in Inkscape by User:Stannered - Crystal stucture
CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=1735636
12
X-ray diffraction for phase diagram determination
- Dense collection
- Laboriously hand labeled by experts
- clustering can help
- scaling to more complex systems is a challenge
- higher temperatures: avoid annealing and diffusion effects
[Figure axes: temperature, W]
13
Active clustering for autonomous XRD phase mapping
Think carefully about modeling to remove researcher degrees of freedom
14
Spectral clustering loosely: like kernel k-means clustering
[Figure: two example XRD patterns (Y), intensity (arb. units, 0 to 100) vs. 2θ (27.00 to 28.50)]
1. Form the cosine similarity covariance matrix
$K_{i,j} = \exp\left(-\frac{d_{\cos}(y_i, y_j)}{2 \sigma_i \sigma_j}\right)$
$\sigma_i$ set to the k'th nearest neighbor distance for $y_i$
2. Compute the eigendecomposition of the kernel matrix
solve $N \lambda a = K a$
3. Perform k-means clustering in the latent space
project data onto the k principal eigenvectors
find cluster centers $\mu_c$: $\min_{C} \sum_{c \in C} \sum_{x \in c} \lVert x - \mu_c \rVert^2$
von Luxburg, Tutorial on Spectral Clustering, arXiv:0711.0189
Zelnik-Manor and Perona, Self-Tuning Spectral Clustering, NIPS 2005
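The three steps on this slide (locally scaled cosine kernel, eigendecomposition, k-means in the latent space) can be sketched in NumPy as below. This is a minimal illustration, not the authors' implementation. The exponent of the kernel did not survive extraction cleanly, so the sketch assumes the local scaling of Zelnik-Manor and Perona, exp(-d^2 / (sigma_i sigma_j)), applied to the cosine distance. The kernel matrix is left uncentered, the k-means routine uses a simple farthest-point initialization, and the synthetic single-peak patterns are made up for the example.

```python
import numpy as np


def cosine_distance(Y):
    """Pairwise cosine distance 1 - cos(y_i, y_j) between the rows of Y."""
    unit = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    return np.clip(1.0 - unit @ unit.T, 0.0, None)


def self_tuning_kernel(D, k_nn=3):
    """Assumed local scaling: K_ij = exp(-D_ij^2 / (sigma_i * sigma_j))."""
    sigma = np.sort(D, axis=1)[:, k_nn]      # column 0 is the point itself
    sigma = np.maximum(sigma, 1e-12)         # guard against duplicate patterns
    return np.exp(-(D ** 2) / np.outer(sigma, sigma))


def latent_projection(K, n_components):
    """Project onto the leading eigenvectors of the (uncentered) kernel matrix."""
    _, eigvecs = np.linalg.eigh(K)           # eigenvalues come back in ascending order
    return K @ eigvecs[:, -n_components:]


def kmeans(X, n_clusters, n_iter=50):
    """Lloyd iterations with a deterministic farthest-point initialization."""
    centers = [X[0]]
    for _ in range(1, n_clusters):
        dist = np.min([np.sum((X - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(X[np.argmax(dist)])
    centers = np.array(centers)
    for _ in range(n_iter):
        sq = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = np.argmin(sq, axis=1)
        for j in range(n_clusters):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels


# Synthetic patterns: one Gaussian peak near 27.5 or 28.0 degrees, slightly shifted.
two_theta = np.linspace(27.0, 28.5, 200)
shifts = np.array([-0.020, -0.012, -0.004, 0.004, 0.012, 0.020])
peaks = np.concatenate([27.5 + shifts, 28.0 + shifts])
Y = np.exp(-0.5 * ((two_theta[None, :] - peaks[:, None]) / 0.05) ** 2)

K = self_tuning_kernel(cosine_distance(Y))
labels = kmeans(latent_projection(K, 2), n_clusters=2)
print(labels)
```

The first six patterns and the last six should fall into separate clusters; which cluster receives which integer label is arbitrary.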
15
Gaussian process classification
A Bayesian non-parametric generalization of logistic regression
$p(y = 1 \mid x) = \sigma(f(x))$
$\sigma(z) = 1 / (1 + \exp(-z))$
[Figure: six panels of y (-1 to 1) vs. X (0.0 to 1.0)]
Bayesian model selection:
gradient-based optimization of the marginal likelihood $p(y \mid X, H_i)$
$K_{i,j} = \exp\left(-|x_j - x_i|^2 / \ell^2\right)$
Multi-class: one-vs-all strategy
$f_* = k(x_*) (K + \sigma^2 I)^{-1} y$
$V[f_*] = k(x_*, x_*) - k_*^T (K + \sigma^2 I)^{-1} k_*$
GP prior on f:
http://gpflow.readthedocs.io
Why you should consider Bayesian non-parametric models:
principled hyperparameter tuning (without CV)
This model knows what it doesn't know!
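The predictive mean and variance on this slide can be evaluated directly in NumPy. The sketch below assumes 1-D inputs, labels coded as -1/+1, and fixed hyperparameters (`lengthscale` and `noise_var` are illustrative values, not tuned by the marginal likelihood). It applies the Gaussian-noise predictive equations to the labels and then squashes the mean with the logistic function; a full GP classifier with a non-Gaussian likelihood, as provided by GPflow, would need approximate inference instead.

```python
import numpy as np


def rbf_kernel(a, b, lengthscale):
    """k(x, x') = exp(-|x - x'|^2 / lengthscale^2) for 1-D inputs."""
    sq = (a[:, None] - b[None, :]) ** 2
    return np.exp(-sq / lengthscale ** 2)


def gp_predict(x_train, y_train, x_test, lengthscale=0.2, noise_var=1e-2):
    """Return the predictive mean and variance of the latent function f at x_test."""
    K = rbf_kernel(x_train, x_train, lengthscale)
    K_star = rbf_kernel(x_train, x_test, lengthscale)    # each column is k_*
    L = np.linalg.cholesky(K + noise_var * np.eye(len(x_train)))
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = K_star.T @ alpha                              # k_*^T (K + s^2 I)^-1 y
    v = np.linalg.solve(L, K_star)
    var = 1.0 - np.sum(v ** 2, axis=0)                   # k(x_*, x_*) = 1 for this kernel
    return mean, var


x_train = np.array([0.1, 0.2, 0.3, 0.7, 0.8, 0.9])
y_train = np.array([-1.0, -1.0, -1.0, 1.0, 1.0, 1.0])   # two classes coded as -1 / +1
x_test = np.array([0.1, 0.5, 0.9])

mean, var = gp_predict(x_train, y_train, x_test)
prob = 1.0 / (1.0 + np.exp(-mean))                       # sigma(f) from the slide
print(mean, var, prob)
```

The variance is largest at x = 0.5, midway between the two labeled groups, which is the sense in which the model "knows what it doesn't know".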
16
Active Gaussian process classification
Kapoor et al. Gaussian Processes for Object Categorization (2010)
DOI: 10.1007/s11263-009-0268-3
Multi-class: one-vs-all strategy
Classification uncertainty: $\arg\min_{x_u \in X_u} \frac{|f_u|}{\sqrt{\Sigma_u + \sigma^2}}$
Margin: $\arg\min_{x_u \in X_u} |y_u - 0.5|$
Variance: $\arg\max_{x_u \in X_u} \Sigma_u$
[Figure labels: monoclinic, tetragonal, two phase]
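The three selection rules on this slide differ only in how they score the unlabeled candidates. The sketch below assumes the predictive mean and variance for each candidate are already available, and that y_u in the margin rule is the predicted class probability obtained from the logistic function; the function name `select_next` and the example numbers are hypothetical.

```python
import numpy as np


def select_next(mean, var, noise_var, strategy="uncertainty"):
    """Return the index of the unlabeled candidate to measure next."""
    mean = np.asarray(mean, dtype=float)
    var = np.asarray(var, dtype=float)
    if strategy == "uncertainty":
        # argmin |f_u| / sqrt(Sigma_u + sigma^2)
        return int(np.argmin(np.abs(mean) / np.sqrt(var + noise_var)))
    if strategy == "margin":
        # argmin |y_u - 0.5|, with y_u taken as the predicted probability (assumption)
        prob = 1.0 / (1.0 + np.exp(-mean))
        return int(np.argmin(np.abs(prob - 0.5)))
    if strategy == "variance":
        # argmax Sigma_u
        return int(np.argmax(var))
    raise ValueError(f"unknown strategy: {strategy}")


# Made-up predictions for four candidate points.
mean = [-0.9, -0.2, 0.3, 0.8]
var = [0.05, 0.02, 0.60, 0.90]

picks = {s: select_next(mean, var, noise_var=0.01, strategy=s)
         for s in ("uncertainty", "margin", "variance")}
print(picks)
```

On these numbers the three rules choose different candidates: the margin rule picks the point whose mean is closest to the decision boundary, the variance rule picks the least-explored point, and the uncertainty rule trades the two off.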
17
Infrastructure for autonomous experiments
Analytics client, XRD client, Diffractometer, Control server
1. GET available data
2. (re)train and predict
3. POST proposed experiment
4. GET command
5. POST new data
$f_* = k_* (K + \sigma_n^2 I)^{-1} y$
$V[f_*] = k(x_*, x_*) - k_*^T (K + \sigma_n^2 I)^{-1} k_*$
[Figure: two maps of temperature (30 to 60) vs. composition (0.6 to 1.0)]
[Figure: XRD pattern, intensity (arb. units, 0 to 50) vs. 2θ (26.0 to 30.0)]
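The five numbered GET/POST steps on the infrastructure slide can be sketched as a loop between two clients and a control server. Everything below is a hypothetical stand-in: the class and method names are invented, the server is an in-memory object rather than an HTTP service, the assignment of steps 1 to 3 to the analytics client and steps 4 and 5 to the XRD client is an assumption, and the `choose` and `measure` callables replace the real model and diffractometer.

```python
from collections import deque


class ControlServer:
    """In-memory stand-in for the control server (a real one would expose HTTP endpoints)."""

    def __init__(self):
        self.data = []            # measured records
        self.commands = deque()   # proposed experiments waiting for the instrument

    def get_data(self):                   # 1. GET available data
        return list(self.data)

    def post_experiment(self, point):     # 3. POST proposed experiment
        self.commands.append(point)

    def get_command(self):                # 4. GET command
        return self.commands.popleft() if self.commands else None

    def post_data(self, record):          # 5. POST new data
        self.data.append(record)


def analytics_step(server, candidates, choose):
    """Fetch the data, pick the next experiment, and post it."""
    records = server.get_data()
    measured = {r["point"] for r in records}
    remaining = [c for c in candidates if c not in measured]
    if remaining:
        # 2. (re)train and predict would happen inside `choose`
        server.post_experiment(choose(records, remaining))


def instrument_step(server, measure):
    """Fetch the next command, run the measurement, and post the result."""
    point = server.get_command()
    if point is not None:
        server.post_data({"point": point, "label": measure(point)})


# Toy run: (composition, temperature) candidates and a fake phase boundary at 40.
candidates = [(0.6, 30), (0.8, 45), (1.0, 60)]
server = ControlServer()
for _ in range(len(candidates)):
    analytics_step(server, candidates, choose=lambda records, remaining: remaining[0])
    instrument_step(server, measure=lambda p: "tetragonal" if p[1] > 40 else "monoclinic")

print(server.data)
```

Keeping the two clients decoupled through the server means either side can be swapped (a different model, a different instrument) without changing the other.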
18
VWO2
monoclinic tetragonal
19
VWO2 clustering performance
monoclinic tetragonal
20
Acknowledgements
Funding sources
NIST
ONR
NRC Postdoctoral Research Associate Program
XRD setup and VNbO2 data collection by Yangang Liang
VWO2 film growth by Xiaohang Zhang
VWO2 setup by Heshan Yu
Ground truth phase labeling by Jason Hattrick-Simpers

More Related Content

What's hot

Combinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials DiscoveryCombinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials Discovery
aimsnist
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
aimsnist
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
KAMAL CHOUDHARY
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
KAMAL CHOUDHARY
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
KAMAL CHOUDHARY
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
aimsnist
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
aimsnist
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
aimsnist
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
aimsnist
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
aimsnist
 
Database of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit SpillageDatabase of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit Spillage
KAMAL CHOUDHARY
 
High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...
KAMAL CHOUDHARY
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discovery
KAMAL CHOUDHARY
 
Morgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distMorgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 dist
ddm314
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
KAMAL CHOUDHARY
 
Automated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAutomated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design Problems
Anubhav Jain
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...
Anubhav Jain
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
Anubhav Jain
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
Anubhav Jain
 
Methods, tools, and examples (Part II): High-throughput computation and machi...
Methods, tools, and examples (Part II): High-throughput computation and machi...Methods, tools, and examples (Part II): High-throughput computation and machi...
Methods, tools, and examples (Part II): High-throughput computation and machi...
Anubhav Jain
 

What's hot (20)

Combinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials DiscoveryCombinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials Discovery
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
 
Database of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit SpillageDatabase of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit Spillage
 
High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discovery
 
Morgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distMorgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 dist
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
 
Automated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAutomated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design Problems
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
 
Methods, tools, and examples (Part II): High-throughput computation and machi...
Methods, tools, and examples (Part II): High-throughput computation and machi...Methods, tools, and examples (Part II): High-throughput computation and machi...
Methods, tools, and examples (Part II): High-throughput computation and machi...
 

Similar to Autonomous experimental phase diagram acquisition

NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
KAMAL CHOUDHARY
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
BrianDeCost
 
qmms_wines.pptx
qmms_wines.pptxqmms_wines.pptx
qmms_wines.pptx
DanielWines1
 
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
The Statistical and Applied Mathematical Sciences Institute
 
A*STAR Webinar on The AI Revolution in Materials Science
A*STAR Webinar on The AI Revolution in Materials ScienceA*STAR Webinar on The AI Revolution in Materials Science
A*STAR Webinar on The AI Revolution in Materials Science
University of California, San Diego
 
The Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data ScienceThe Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data Science
Robert Grossman
 
Accelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial NetworksAccelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial Networks
Michela Paganini
 
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
thanhdowork
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer Chemistry
Preferred Networks
 
Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...
Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...
Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...
PyData
 
Ema 20190124 v1.4_dist
Ema 20190124 v1.4_distEma 20190124 v1.4_dist
Ema 20190124 v1.4_dist
ddm314
 
⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...
⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...
⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...
Victor Asanza
 
The Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems BiologyThe Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems Biology
inside-BigData.com
 
Large Scale Data Clustering: an overview
Large Scale Data Clustering: an overviewLarge Scale Data Clustering: an overview
Large Scale Data Clustering: an overview
Vahid Mirjalili
 
A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...
JuanPabloCarbajal3
 
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
James D.B. Wang, PhD
 
Signal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative FeedbackSignal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative Feedback
Vimalathithan Devaraj
 
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
WonjoongCheon
 
Cyberinfrastructure for Einstein's Equations and Beyond
Cyberinfrastructure for Einstein's Equations and BeyondCyberinfrastructure for Einstein's Equations and Beyond
Cyberinfrastructure for Einstein's Equations and Beyond
University of Illinois at Urbana-Champaign
 

Similar to Autonomous experimental phase diagram acquisition (20)

NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
qmms_wines.pptx
qmms_wines.pptxqmms_wines.pptx
qmms_wines.pptx
 
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
 
A*STAR Webinar on The AI Revolution in Materials Science
A*STAR Webinar on The AI Revolution in Materials ScienceA*STAR Webinar on The AI Revolution in Materials Science
A*STAR Webinar on The AI Revolution in Materials Science
 
The Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data ScienceThe Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data Science
 
Accelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial NetworksAccelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial Networks
 
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer Chemistry
 
Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...
Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...
Enabling Real Time Analysis & Decision Making - A Paradigm Shift for Experime...
 
Ema 20190124 v1.4_dist
Ema 20190124 v1.4_distEma 20190124 v1.4_dist
Ema 20190124 v1.4_dist
 
⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...
⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...
⭐⭐⭐⭐⭐ Device Free Indoor Localization in the 28 GHz band based on machine lea...
 
The Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems BiologyThe Algorithms of Life - Scientific Computing for Systems Biology
The Algorithms of Life - Scientific Computing for Systems Biology
 
Large Scale Data Clustering: an overview
Large Scale Data Clustering: an overviewLarge Scale Data Clustering: an overview
Large Scale Data Clustering: an overview
 
A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...A walk through the intersection between machine learning and mechanistic mode...
A walk through the intersection between machine learning and mechanistic mode...
 
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
 
Oral Defence
Oral DefenceOral Defence
Oral Defence
 
Signal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative FeedbackSignal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative Feedback
 
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
 
Cyberinfrastructure for Einstein's Equations and Beyond
Cyberinfrastructure for Einstein's Equations and BeyondCyberinfrastructure for Einstein's Equations and Beyond
Cyberinfrastructure for Einstein's Equations and Beyond
 

More from aimsnist

Enabling Data Science Methods for Catalyst Design and Discovery
Enabling Data Science Methods for Catalyst Design and DiscoveryEnabling Data Science Methods for Catalyst Design and Discovery
Enabling Data Science Methods for Catalyst Design and Discovery
aimsnist
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AI
aimsnist
 
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses FasterCoupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
aimsnist
 
Classical force fields as physics-based neural networks
Classical force fields as physics-based neural networksClassical force fields as physics-based neural networks
Classical force fields as physics-based neural networks
aimsnist
 
Pathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of MaterialsPathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of Materials
aimsnist
 
Materials Data in Action
Materials Data in ActionMaterials Data in Action
Materials Data in Action
aimsnist
 
Progress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science TextProgress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science Text
aimsnist
 

More from aimsnist (7)

Enabling Data Science Methods for Catalyst Design and Discovery
Enabling Data Science Methods for Catalyst Design and DiscoveryEnabling Data Science Methods for Catalyst Design and Discovery
Enabling Data Science Methods for Catalyst Design and Discovery
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AI
 
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses FasterCoupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
 
Classical force fields as physics-based neural networks
Classical force fields as physics-based neural networksClassical force fields as physics-based neural networks
Classical force fields as physics-based neural networks
 
Pathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of MaterialsPathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of Materials
 
Materials Data in Action
Materials Data in ActionMaterials Data in Action
Materials Data in Action
 
Progress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science TextProgress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science Text
 

Autonomous experimental phase diagram acquisition

  • 1. Brian DeCost (1), Heshan Yu (2), Xiaohang Zhang (2), Seunghun Lee (2), Yangang Liang (2), Ichiro Takeuchi (2), Jason Hattrick-Simpers (1), A. Gilad Kusne (1). (1) National Institute of Standards and Technology, (2) University of Maryland, College Park. Autonomous experimental phase diagram acquisition. 2018.08.07 -- AIMS 2018 Meeting -- NIST, Gaithersburg MD. Brian DeCost
  • 2. High-Throughput Experimental Materials Collaboratory (https://mgi.nist.gov/htemc). Distributed experimental materials science platform, built on a network of member institutes. Realization of the HPC paradigm for experimental materials science? Contact: martin.green@nist.gov. Look for a forthcoming white paper.
  • 3. Active clustering for phase diagram acquisition
  • 4. Autonomous science systems. Tabor, Daniel P., et al. "Accelerating the discovery of materials for clean energy in the era of smart automation." Nat. Rev. Mater. 3 (2018): 5-20. https://doi.org/10.1038/s41578-018-0005-z
  • 5. Parallel synthesis, serial characterization. [Figure labels: co-sputtering scheme with Ni, Mn, and Al; 3” spread wafer; Ni-Al-Mn phase diagram; Bi-Fe-V XRD.] Gregoire, J. M., et al. "High-throughput synchrotron X-ray diffraction for combinatorial phase mapping." Journal of Synchrotron Radiation 21.6 (2014): 1262-1268.
  • 6. Unsupervised phase diagram estimation is hard. Hattrick-Simpers, Jason R., John M. Gregoire, and A. Gilad Kusne. "Perspective: Composition–structure–property mapping in high-throughput experiments: Turning data into knowledge." APL Materials 4.5 (2016): 053211. https://doi.org/10.1063/1.4950995 What you really want: multi-phase: linear unmixing; single-phase: invariance to peak shift; infer the number of regions...; respect thermodynamics; leverage archival data; deal with missing reflections; fast! Compromises we can live with. How to discover e.g. line compounds with this approach?
  • 7. Simultaneous phase and property mapping. Kusne, Aaron Gilad, et al. "On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets." Scientific Reports 4 (2014): 6367. doi:10.1038/srep06367. Finding novel rare-earth-free permanent magnets.
  • 8. GRENDEL: Iterative piecewise matrix factorization. Alternate between clustering and matrix factorization. Include archival data from ICSD and AFLOW. Kusne, Aaron G., et al. "High-throughput determination of structural phase diagram and constituent phases using GRENDEL." Nanotechnology 26.44 (2015): 444002.
  • 9. Amdahl's law in materials science: speedup (innovation) is limited by the serial portion of the process! Parallel synthesis: MnNiGe, 535 'samples'. Serial characterization: lab diffractometer, 30 min per composition, 2 weeks per ternary! Fast (serial) characterization: synchrotron (SLAC), 30 s per composition, 4.5 hours per ternary. Exploit the structure of materials data to scale up.
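The per-ternary times quoted on this slide follow from simple serial arithmetic, which the sketch below reproduces. The sample count and per-composition times come from the slide; the helper names `serial_hours` and `amdahl_speedup` are illustrative and not from the talk, and the speedup function is the textbook form of Amdahl's law rather than anything the authors computed.

```python
# Serial characterization time for one ternary composition spread.
# Sample count and per-composition times are the figures quoted on the slide.
N_SAMPLES = 535


def serial_hours(n_samples: int, seconds_per_sample: float) -> float:
    """Total hours when samples are measured one after another."""
    return n_samples * seconds_per_sample / 3600.0


def amdahl_speedup(serial_fraction: float, parallel_speedup: float) -> float:
    """Textbook Amdahl's law: overall speedup when only the
    non-serial fraction of the work is accelerated."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / parallel_speedup)


lab_hours = serial_hours(N_SAMPLES, 30 * 60)  # lab diffractometer, 30 min each
sync_hours = serial_hours(N_SAMPLES, 30)      # synchrotron, 30 s each

print(f"lab diffractometer: {lab_hours:.1f} h ({lab_hours / 24:.1f} days of beam time)")
print(f"synchrotron: {sync_hours:.2f} h")
# Even with unlimited parallelism elsewhere, a 50% serial share caps the gain near 2x.
print(f"speedup bound at 50% serial: {amdahl_speedup(0.5, 1e12):.3f}")
```

Roughly 11 days of continuous measurement is consistent with the slide's "2 weeks per ternary" once instrument downtime is allowed for.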
  • 10. Autonomous run: cluster, extrapolate, select. [Figure: two ternary diagrams with corners Fe, Fe0.4Pd0.6, and Fe0.4Ga0.6.]
  • 11. VO2 metal-insulator transition: VNbO2, VWO2, etc. We'd like to efficiently determine metal-insulator transition temperatures experimentally in a variety of systems. Metal-insulator transition temperature decreases with doping. [Figure: resistance R (Ω, 10^2 to 10^5) versus temperature T (K, 100 to 400) for W contents of 0.33, 0.96, 1.36, 1.68, 2.27, 2.61, 2.86, and 3.44 W%; 9 mm composition spread chip on a c-Al2O3 substrate; phase labels Monoclinic, Mixed, Tetragonal.] Image credit: By Original PNGs by Daniel Mayer, traced in Inkscape by User:Stannered - Crystal stucture, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=1735636
  • 12. X-ray diffraction for phase diagram determination: dense collection; laboriously hand labeled by experts; clustering can help; scaling to more complex systems is a challenge; higher temperatures: avoid annealing and diffusion effects. [Figure axes: Temperature, W.]
  • 13. Active clustering for autonomous XRD phase mapping. Think carefully about modeling to remove researcher degrees of freedom.
  • 14. Spectral clustering (loosely: like kernel k-means clustering). 1. Form the cosine similarity (covariance) matrix from the spectra Y: $K_{i,j} = \exp\left(-d_{\cos}(y_i, y_j) / (2\sigma_i\sigma_j)\right)$, with $\sigma_i$ set to the k'th nearest neighbor distance for $y_i$. 2. Compute the eigendecomposition of the kernel matrix, $N\lambda a = Ka$, and project the data onto the k principal eigenvectors. 3. Perform k-means clustering in the latent space: find cluster centers $\mu_c$ by solving $\min_C \sum_{c \in C} \sum_{x \in c} \lVert x - \mu_c \rVert^2$. References: von Luxburg, Tutorial on Spectral Clustering, arXiv:0711.0189; Zelnik-Manor and Perona, Self-Tuning Spectral Clustering, NIPS 2005. [Figure: two XRD patterns, intensity (arb. units, 0 to 100) versus 2θ from 27.00 to 28.50.]
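The three steps on this slide can be sketched in a few lines of NumPy. This is a minimal illustration on synthetic two-peak "patterns", not the authors' implementation. Two choices are assumptions made here: the local scale is taken as the square root of the k-th nearest neighbor cosine distance, so that the exponent stays dimensionless, and k-means uses a deterministic farthest-point start. All function names are hypothetical.

```python
import numpy as np


def cosine_distance(Y):
    """Pairwise cosine distance between the rows of Y."""
    unit = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    return np.clip(1.0 - unit @ unit.T, 0.0, 2.0)


def self_tuning_kernel(D, k_nn=3):
    """K_ij = exp(-D_ij / (2 sigma_i sigma_j)) with a per-sample local scale.
    Assumption: sigma_i = sqrt(k_nn-th nearest neighbor cosine distance)."""
    kth = np.sort(D, axis=1)[:, k_nn]  # column 0 is the self-distance
    sigma = np.sqrt(np.maximum(kth, 1e-12))
    return np.exp(-D / (2.0 * np.outer(sigma, sigma)))


def kmeans(X, k, n_iter=50):
    """Lloyd's algorithm with a deterministic farthest-point initialization."""
    centers = [X[0]]
    for _ in range(k - 1):
        d = np.min([np.sum((X - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(X[np.argmax(d)])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        sq = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = np.argmin(sq, axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels


def spectral_cluster(Y, k, k_nn=3):
    K = self_tuning_kernel(cosine_distance(Y), k_nn)  # step 1
    _, eigvecs = np.linalg.eigh(K)                    # step 2, ascending order
    latent = eigvecs[:, -k:]                          # k principal eigenvectors
    return kmeans(latent, k)                          # step 3


# Synthetic data: five noisy patterns with a peak at 27.4 and five at 28.1.
two_theta = np.linspace(27.0, 28.5, 200)
rng = np.random.default_rng(0)


def pattern(center):
    peak = np.exp(-0.5 * ((two_theta - center) / 0.05) ** 2)
    return peak + 0.01 * rng.random(two_theta.size)


Y = np.array([pattern(27.4) for _ in range(5)] + [pattern(28.1) for _ in range(5)])
labels = spectral_cluster(Y, k=2)
print(labels)
```

Many spectral clustering implementations eigendecompose a normalized graph Laplacian instead of the kernel matrix itself; the sketch follows the slide's wording and works on the kernel matrix directly.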
  • 15. Gaussian process classification: a Bayesian non-parametric generalization of logistic regression. $p(y = 1 \mid x) = \sigma(f(x))$, with $\sigma(z) = 1/(1 + \exp(-z))$. GP prior on f, with kernel $K_{i,j} = \exp\left(-|x_j - x_i|^2 / \ell^2\right)$. $f_* = k_*^\top (K + \sigma^2 I)^{-1} y$ and $\mathbb{V}[f_*] = k(x_*, x_*) - k_*^\top (K + \sigma^2 I)^{-1} k_*$, where $k_* = k(x_*)$. Multi-class: one-vs-all strategy. Bayesian model selection: gradient-based optimization of the marginal likelihood $p(y \mid X, H_i)$. http://gpflow.readthedocs.io Why you should consider Bayesian non-parametric models: principled hyperparameter tuning (without CV). This model knows what it doesn't know! [Figure: six panels of y (-1 to 1) versus X (0.0 to 1.0).]
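The predictive mean and variance on this slide can be evaluated directly with NumPy. The sketch below is an illustration under stated assumptions, not the GPflow model used in the talk: it treats the ±1 labels as regression targets, fixes the hyperparameters by hand instead of optimizing the marginal likelihood, and squashes the latent mean through the logistic function without integrating over the latent variance. The names `rbf`, `gp_predict`, and `sigmoid` are hypothetical.

```python
import numpy as np


def rbf(A, B, length_scale=0.2):
    """K_ij = exp(-|a_i - b_j|^2 / l^2) for 1-D inputs."""
    return np.exp(-((A[:, None] - B[None, :]) ** 2) / length_scale ** 2)


def gp_predict(X, y, X_star, noise_var=1e-2, length_scale=0.2):
    """Latent mean k_*^T (K + s^2 I)^-1 y and variance
    k(x_*, x_*) - k_*^T (K + s^2 I)^-1 k_* at each test input."""
    K = rbf(X, X, length_scale) + noise_var * np.eye(len(X))
    k_star = rbf(X, X_star, length_scale)  # shape (n_train, n_test)
    mean = k_star.T @ np.linalg.solve(K, y)
    var = 1.0 - np.sum(k_star * np.linalg.solve(K, k_star), axis=0)  # k(x*, x*) = 1
    return mean, var


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


# Toy 1-D problem: class -1 on the left, class +1 on the right.
X = np.array([0.1, 0.2, 0.3, 0.7, 0.8, 0.9])
y = np.array([-1.0, -1.0, -1.0, 1.0, 1.0, 1.0])
X_star = np.array([0.2, 0.5, 0.8, 2.0])

mean, var = gp_predict(X, y, X_star)
prob = sigmoid(mean)  # crude class probability, ignores the latent variance
for x, m, v, p in zip(X_star, mean, var, prob):
    print(f"x={x:.1f}  mean={m:+.3f}  var={v:.3f}  p(y=1)~{p:.3f}")
```

The last test point lies far from all training data, so its variance returns to the prior value: this is the sense in which the model "knows what it doesn't know".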
  • 16. Active Gaussian process classification. Kapoor et al., Gaussian Processes for Object Categorization (2010), DOI: 10.1007/s11263-009-0268-3. Multi-class: one-vs-all strategy. Selection criteria over the candidate points $X_u$: Classification uncertainty, $\arg\min_{x_u \in X_u} |f_u| / \sqrt{\Sigma_u + \sigma^2}$; Margin, $\arg\min_{x_u \in X_u} |y_u - 0.5|$; Variance, $\arg\max_{x_u \in X_u} \Sigma_u$. [Figure labels: monoclinic, tetragonal, two phase.]
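The three selection criteria on this slide differ only in how they combine the predictive mean and variance, which a small example makes concrete. The numbers below are made up for illustration, and the function names are hypothetical; the point is that the three rules can pick three different candidates from the same predictions.

```python
import numpy as np


def select_by_margin(prob):
    """Candidate whose predicted class probability is closest to 0.5."""
    return int(np.argmin(np.abs(prob - 0.5)))


def select_by_variance(var):
    """Candidate with the largest predictive variance."""
    return int(np.argmax(var))


def select_by_uncertainty(mean, var, noise_var=1e-2):
    """Candidate minimizing |f_u| / sqrt(Sigma_u + sigma^2)."""
    return int(np.argmin(np.abs(mean) / np.sqrt(var + noise_var)))


# Made-up latent means and variances for four candidate experiments.
mean = np.array([-1.2, -0.3, 0.1, 0.9])
var = np.array([0.05, 0.40, 0.02, 0.60])
prob = 1.0 / (1.0 + np.exp(-mean))

print("margin      ->", select_by_margin(prob))
print("variance    ->", select_by_variance(var))
print("uncertainty ->", select_by_uncertainty(mean, var))
```

Here the margin rule picks the point nearest the decision boundary, the variance rule picks the least explored point, and the uncertainty rule trades the two off.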
  • 17. Infrastructure for autonomous experiments. Components: Analytics client, Control server, XRD client, Diffractometer. Loop: 1. GET available data; 2. (re)train and predict; 3. POST proposed experiment; 4. GET command; 5. POST new data. Model: $f_* = k_*^\top (K + \sigma_n^2 I)^{-1} y$, $\mathbb{V}[f_*] = k(x_*, x_*) - k_*^\top (K + \sigma_n^2 I)^{-1} k_*$. [Figures: two maps of temperature (30 to 60) versus composition (0.6 to 1.0); one XRD pattern, intensity (arb. units, 0 to 50) versus 2θ from 26.0 to 30.0.]
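The five-step loop on this slide can be mimicked with an in-memory stand-in for the control server. Everything in the sketch is hypothetical: the class and method names, the record layout, the fake measurement, and the "farthest from measured points" rule that stands in for retraining a model. It only shows how the analytics client and the XRD client take turns through the server, and it makes no real HTTP requests.

```python
class ControlServer:
    """In-memory stand-in for the control server (no real HTTP involved)."""

    def __init__(self, initial_data):
        self.data = list(initial_data)  # (composition, temperature, label) records
        self.queue = []                 # proposed experiments awaiting measurement

    def get_data(self):                 # 1. GET available data
        return list(self.data)

    def post_proposal(self, point):     # 3. POST proposed experiment
        self.queue.append(point)

    def get_command(self):              # 4. GET command
        return self.queue.pop(0) if self.queue else None

    def post_data(self, record):        # 5. POST new data
        self.data.append(record)


def analytics_step(server, candidates):
    """Analytics client: fetch data, pick the next point, post the proposal."""
    measured = [(c, t) for c, t, _ in server.get_data()]

    # 2. Placeholder for "(re)train and predict": score each candidate by its
    # squared distance to the nearest measured point (temperature rescaled).
    def novelty(p):
        return min((p[0] - c) ** 2 + ((p[1] - t) / 100.0) ** 2 for c, t in measured)

    remaining = [p for p in candidates if p not in measured]
    proposal = max(remaining, key=novelty)
    server.post_proposal(proposal)
    return proposal


def xrd_step(server, measure):
    """XRD client: fetch the next command, measure, post the result."""
    point = server.get_command()
    if point is not None:
        server.post_data((point[0], point[1], measure(*point)))


def fake_measure(composition, temperature):
    """Made-up ground truth used only to exercise the loop."""
    return "tetragonal" if temperature > 45 else "monoclinic"


candidates = [(c / 10, t) for c in range(6, 11) for t in (30, 40, 50, 60)]
server = ControlServer([(0.6, 30, "monoclinic")])
for _ in range(5):
    analytics_step(server, candidates)
    xrd_step(server, fake_measure)

for record in server.data:
    print(record)
```

Keeping the two clients decoupled through a shared server means the selection model can be swapped without touching the instrument-side code.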
  • 20. Acknowledgements. Funding sources: NIST, ONR, NRC Postdoctoral Research Associate Program. XRD setup and VNbO2 data collection by Yangang Liang. VWO2 film growth by Xiaohang Zhang. VWO2 setup by Heshan Yu. Ground truth phase labeling by Jason Hattrick-Simpers.