SlideShare a Scribd company logo
Software tools to facilitate
materials science research
Anubhav Jain
Energy Technologies Area
Lawrence Berkeley National Laboratory
Berkeley, CA
S2I2 Workshop, Feb 2017
Slides (already) posted to http://www.slideshare.net/anubhavster
What we work on
•  We don’t develop or
debut the new and
fashionable
computational methods
•  We adopt methods,
standardize the parts
that are ready for mass
reproduction, and
execute them over
thousands of materials
2
Our research interests as materials scientists
3
High-throughput calculations
(each point is a possible battery cathode)
Discovery of new functional materials
(e.g., new bulk thermoelectrics)
A user’s perspective of materials simulation
4
“something”!
Results!
PI!
What	is	the	
GGA-PBE	elas0c	
tensor	of	GaAs?
A user’s perspective of materials simulation
5
“something”!
= student/postdoc!
Results!
PI!
What	is	the	
GGA-PBE	elas0c	
tensor	of	GaAs?	
Input	file	flags	
Queue	format	
how	to	fix	ZPOTRF?
Why this system?
•  It works!
•  Many aspects of running
simulations seem tailor-
made for assigning to
students/postdocs
–  requires specialized
knowledge
–  labor intensive
–  helpful to have a high pain
threshold
•  But there are also
disadvantages…
6
Nicola Marzari’s “Middle
Age Workshop” analogy
Staff specialization can get out of control
Because of the steep learning curve to
computational methods, there is often a single
group member assigned to a technique
7
“Alice knows how to do charged defect calculations.”!
“Bob is the one who can properly converge GW runs.”!
“Olga has all the scripts for phonon calculations.”!
Errors are all too common
Let’s take a look at two alternate universes:
Which universe you are in?
Are you sure? 8
student! has coffee!
copies files from!
previous simulation!
edits 5 lines!
runs simulation,!
delivers report!
student! forgets coffee!
copies files from!
previous simulation!
edits 4 lines!
forgets!
LHFCALC=F!
delivers report, !
looks fine at first, !
in a month you !
discover it was wrong!
1
2
Takes too long to get results
•  Calculations are labor intensive!
–  set up the structure coordinates
–  write input files, double-check all the flags
–  copy to supercomputer
–  submit job to queue
–  deal with supercomputer headaches
–  monitor job
–  fix error jobs, resubmit to queue, wait again
–  repeat process for subsequent calculations in
workflow
–  parse output files to obtain results
–  copy and organize results, e.g., into Excel
9
There is a lot of back-and-forth in the analysis
•  Student/postdoc presents Powerpoint/Excel of the
results
•  PI wants to know certain details or follow up based
on the data, which are missing from the
Powerpoint/Excel
•  Student/postdoc says “I will get back to you”, goes
back to office, re-processes the data, and prepares
a revised report within a few days
•  Repeat…
10
What would be a better way?
11
“something”!
= a computer!
!
Results!
PI!
What	is	the	
GGA-PBE	elas0c	
tensor	of	GaAs?
All past and present knowledge, from
everyone in the group, everyone previously
in the group, and outside collaborators,
about how to run calculations
Reduce specialization
12
Reduce errors and improve efficiency
•  Computers can’t forget to set an input flag
•  Computers (in theory) can create, correct,
submit, parse, and deliver the results of
calculations much faster than even the fastest
student
13
Improve analytics / visualization
•  Excel and Powerpoint
works for a curated view
of the results
•  But online analytics
would allow you to do
things like:
–  view crystal structures on
demand
–  generate the plot you
want
14
So this the vision we want – is it achievable?
15
“something”!
= a computer!
!
Results!
PI!
What	is	the	
GGA-PBE	elas0c	
tensor	of	GaAs?
Yes! – and it is available on Materials Project
16
Input generation
(parameter choice)
Workflow mapping Supercomputer
submission /
monitoring
Error
handling
File Transfer
File Parsing /
DB insertion
Custom material
Submit!
www.materialsproject.org
“Crystal Toolkit”
Anyone can find, edit,
and submit (suggest)
structures
Currently, this feature is available for:
•  structure optimization
•  band structures
•  elastic tensors
Software technologies to enable automatization
17
	
(automatic materials
science workflows)
Custodian	
(calculation error
recovery)
	
(materials analysis
framework)
Base packages
Derived package
	
(workflow framework and
supercomputer interface)
These are all open-source:
•  pymatgen and custodian are led by Prof. Ong group (UC San Diego)
•  Developed in coordination with the Materials Project and Persson group
pymatgen – object-oriented materials analysis
18
www.pymatgen.org!
Ong, S. P., Richards, W. D., Jain, A., Hautier, G., Kocher, M., Cholia, S., Gunter,
D., Chevrier, V. L., Persson, K. a. & Ceder, G. Python Materials Genomics
(pymatgen): A robust, open-source python library for materials analysis.
Comput. Mater. Sci. 68, 314–319 (2013).!
pymatgen – examples of analyses
19
phase diagrams
Pourbaix diagrams
diffusivity from MDband structure analysis
pymatgen - many useful tools made accessible
20
Structure Matcher
analyzes if two periodic
structures are equivalent, even
if they are in different settings
or have minor distortions
= ?!
Order-disorder
resolve partial or mixed
occupancies into a fully
ordered crystal structure
(e.g., mixed oxide-fluoride site
into separate oxygen/fluorine)
Many other tools, such as:
•  Bond-valence sums to determine valence
•  Voronoi coordination as well as 3D coordination polyhedron analysis
•  Automatically find and insert interstitial sites
•  Diffraction pattern modeling
•  Simple cost and materials availability estimators
custodian – fixing job errors
•  Custodian can wrap
around an executable
(e.g., VASP)
–  i.e., run custodian instead of
directly running VASP
•  During execution,
custodian will monitor
output files and detect
errors / problems
–  If so, it can change input files
and rerun the job
–  e.g., if ZPOTRF error
detected, rerun with ISYM=0
–  ever-expanding library of
fixes
21
FireWorks – scientific workflow software
•  FireWorks is an open-source scientific
workflow software
•  Materials Project, JCESR, and other
projects manage their runs with
FireWorks
–  >1 million jobs
–  >100 million CPU-hours
–  multiple computing clusters
•  You can write any workflow
–  e.g., FireWorks is used for graphics
processing, machine learning, document
processing, and protein folding
–  #1 Google hit for “Python workflow
software”, top 5 for general scientific
workflow software
•  Detailed tutorials are available
22
Jain, A., Ong, S. P., Chen, W., Medasani, B., Qu, X., Kocher, M., Brafman, M.,
Petretto, G., Rignanese, G.-M., Hautier, G., Gunter, D. & Persson, K. A.
FireWorks: a dynamic workflow system designed for high-throughput
applications. Concurr. Comput. Pract. Exp. 22, 5037–5059 (2015).!
www.pythonhosted.org/
FireWorks!
FireWorks – screenshot of jobs status
23
Live version at http://fireworks.dash.materialsproject.org
atomate – our newest code (redesigns our older codes)
24
translate PI-style (minimal) specifications into well-
defined FireWorks workflows
(FireWorks handles all the execution and
job management details)
What	is	the	
GGA-PBE	elas0c	
tensor	of	GaAs?
atomate – what’s available?
25
K. Mathew J. Montoya S. DwaraknathA. Faghaninia
•  band structure
•  spin-orbit coupling
•  hybrid functional calcs
•  elastic tensor
•  piezoelectric tensor
•  Raman spectra
•  GIBBS method
•  QH thermal expansion
•  AIMD
•  FEFF method
•  LAMMPS MD
All past and present knowledge, from
everyone in the group, everyone previously
in the group, and outside collaborators,
about how to run calculations
M. Aykol S.P. Ong
Further resources
•  The Github web sites
–  www.github.com/materialsproject
–  www.github.com/hackingmaterials
•  Software carpentry
•  https://software-carpentry.org
26
Needed: better way to learn methods
•  It can take many months, and perhaps even an internship in a
group with relevant expertise, to learn to use a new method
•  Workshops are one way to speed the process
•  However, self-serve ways to learn new methods would be
wonderful
–  e.g., web tutorials that mix together theory and practice
•  Consider: what fraction of people could learn to correctly use
your code/method given only a single web link and no direct
communication with anyone? (they are allowed to find and
use other web resources based on the initial link)
–  Example: https://www.youtube.com/user/MaterialsProject
27
Needed: curation of tools and methods
•  A place to kick-start discovery and learning of
new codes and tools:
–  “Too basic” example: http://materials.sh (Shyue Ping
Ong, UCSD)
–  “Too complex/messy” example: Nanohub
28
Needed: standardizing data *containers*
•  Different codes will have different inputs and
outputs, so obviously data organization will vary
•  But the “container” of the data organization can be
consistent. e.g., you can represent arrays within:
–  JSON
–  YAML
–  XML
–  HDF5
–  but don’t invent your own format to represent an array!
•  Some of these container formats are human-
readable, i.e., easy to edit in a text editor
•  No more “code parses custom input file format to
produce custom output file format”
29
Needed: other ways to improve accuracy
30
DFT band gap = cheap lens Some kind of super
accurate post-Bethe-
Salpeter method
How to improve image quality? Strategy 1
Needed: other ways to improve accuracy
31
Computer algorithms
improve image
How to improve image quality? Strategy 2
Software corrects for cheap lens. e.g.,
distortion, two images to create depth of field
Needed: other ways to improve accuracy
32
correct and mix
cheap/simple
calculations to
improve output
quality
Jain,	A.,	Hau0er,	G.,	Ong,	S.	P.,	
Moore,	C.	J.,	Fischer,	C.	C.,	Persson,	
K.	A.	&	Ceder,	G.	Forma0on	
enthalpies	by	mixing	GGA	and	
GGA+U	calcula0ons.	Phys.	Rev.	B	
84,	45115	(2011).	
!
Needed: other ways to improve accuracy
33
Correcting the DFT is necessary to getting decent phase diagrams
Almost everyone that is practicing new materials
design does some flavor of post-correction (e.g., gas
phase energies)
More effort into comparing, developing, and
validating such methods is needed.
Jain,	A.,	Hau0er,	G.,	Ong,	S.	P.,	
Moore,	C.	J.,	Fischer,	C.	C.,	Persson,	
K.	A.	&	Ceder,	G.	Forma0on	
enthalpies	by	mixing	GGA	and	
GGA+U	calcula0ons.	Phys.	Rev.	B	
84,	45115	(2011).	
!
Questions?
34
Slides (already) posted to http://www.slideshare.net/anubhavster
Some lessons learned (1)
•  In the beginning, strong central coordination from
authority was needed to develop these
–  require that people contribute to common code, e.g.
pymatgen, and not write their own detached scripts
•  Once a code was “established”, less authority was
needed
–  people voluntarily contributed improvements rather than
writing their own code because this benefited them
•  Today the process is almost completely
decentralized
–  culture has changed
–  even for new codes, people rally around it rather than
build independent things
35
Some lessons learned (2)
•  It is helpful to have a strong BDFL (benevolent
dictator for life) for each codebase
•  Requirements for the BDFL:
–  very detail-oriented
–  cares about the code itself, not just the application
–  cares more about the code quality than about offending
teammates, i.e., will not accept poor quality contributions
–  at the same time, able to rally support from people and
convince them to contribute or clean up code
–  willing to work overtime to do things like write detailed
docs, advocate for the code, review commits, etc.
–  derives joy from building and deploying things!
36
Some lessons learned (3)
•  Spending time to do things like improve code-cleanliness, writing
unit tests, writing documentation, etc. is not such a “noble” and
“self-sacrificing” act like people make it out to be
–  I’ve referred my own documentation many times
–  I’ve saved myself from a world of trouble by previously writing unit tests to
detect bugs
–  I’ve been able to write and build large code much faster due to previous
commitments to code cleanliness (and been slowed down in my progress
when I’ve relaxed these constraints)
•  We don’t like to admit this, but a lack of attention to detail in the
past has easily cost us tens of thousands of dollars in wasted
computing and countless labor hours – but some of this is inevitable
with large projects
37
Some lessons learned (4)
•  Computer scientists are useful for staying up to
date in the fast-moving world of software
–  2006: I took a graduate class in databases at MIT; all SQL,
not a single mention of “NoSQL”
–  2011: We are designing the framework for Materials
Project; I have lots of experience with SQL; a computer
scientist casually mentions NoSQL, its growing
prominence, and its potential applicability to our problem
–  2017: We do almost everything in NoSQL
•  Lesson: software moves fast! Much faster than
materials science knowledge or methods. Don’t use
data from 5 years ago to inform your decision.
38

More Related Content

What's hot

Computational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methodsComputational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methods
Anubhav Jain
 
Graphene mobile phone
Graphene mobile phoneGraphene mobile phone
Graphene mobile phoneNalini Anand
 
Computational methods applied to materials modeling
Computational methods applied to materials modelingComputational methods applied to materials modeling
Computational methods applied to materials modeling
cippo1987Ita
 
Graphene
GrapheneGraphene
Graphene
durairaj6893
 
Piezoelectric Materials and Applications
Piezoelectric Materials and ApplicationsPiezoelectric Materials and Applications
Piezoelectric Materials and Applications
John Hudak
 
Graphene
GrapheneGraphene
Graphene
hiteshparmar07
 
Multiferroic
MultiferroicMultiferroic
Multiferroic
.Jayanta Saha
 
Smart material
Smart material   Smart material
Smart material
ashwani singh
 
Nanotechnology
NanotechnologyNanotechnology
Nanotechnology
lakshya bhardwaj
 
Nanotechnology.Opportunities&Challenges
Nanotechnology.Opportunities&ChallengesNanotechnology.Opportunities&Challenges
Nanotechnology.Opportunities&Challengeslusik
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Anubhav Jain
 
The Materials Project: overview and infrastructure
The Materials Project: overview and infrastructureThe Materials Project: overview and infrastructure
The Materials Project: overview and infrastructure
Anubhav Jain
 
Graphene applications
Graphene applicationsGraphene applications
Graphene applications
Perini Kumar
 
Nanotechnology
NanotechnologyNanotechnology
Nanotechnology
jiniraju
 
Nanotechnology & its applications
Nanotechnology & its applicationsNanotechnology & its applications
Nanotechnology & its applications
Prof. A.Balasubramanian
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...
Anubhav Jain
 
Graphene
Graphene   Graphene
Graphene
Sidhu Taran
 
Giant magnetoresistance
Giant magnetoresistanceGiant magnetoresistance
Giant magnetoresistance
jamilahmedawan
 
Nanomaterials
Nanomaterials Nanomaterials
Nanomaterials
G V S R PAVAN KUMAR
 
Synthesis and characterization of nanocomposites
Synthesis and characterization of nanocompositesSynthesis and characterization of nanocomposites
Synthesis and characterization of nanocomposites
sowmya sankaran
 

What's hot (20)

Computational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methodsComputational materials design with high-throughput and machine learning methods
Computational materials design with high-throughput and machine learning methods
 
Graphene mobile phone
Graphene mobile phoneGraphene mobile phone
Graphene mobile phone
 
Computational methods applied to materials modeling
Computational methods applied to materials modelingComputational methods applied to materials modeling
Computational methods applied to materials modeling
 
Graphene
GrapheneGraphene
Graphene
 
Piezoelectric Materials and Applications
Piezoelectric Materials and ApplicationsPiezoelectric Materials and Applications
Piezoelectric Materials and Applications
 
Graphene
GrapheneGraphene
Graphene
 
Multiferroic
MultiferroicMultiferroic
Multiferroic
 
Smart material
Smart material   Smart material
Smart material
 
Nanotechnology
NanotechnologyNanotechnology
Nanotechnology
 
Nanotechnology.Opportunities&Challenges
Nanotechnology.Opportunities&ChallengesNanotechnology.Opportunities&Challenges
Nanotechnology.Opportunities&Challenges
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
 
The Materials Project: overview and infrastructure
The Materials Project: overview and infrastructureThe Materials Project: overview and infrastructure
The Materials Project: overview and infrastructure
 
Graphene applications
Graphene applicationsGraphene applications
Graphene applications
 
Nanotechnology
NanotechnologyNanotechnology
Nanotechnology
 
Nanotechnology & its applications
Nanotechnology & its applicationsNanotechnology & its applications
Nanotechnology & its applications
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...
 
Graphene
Graphene   Graphene
Graphene
 
Giant magnetoresistance
Giant magnetoresistanceGiant magnetoresistance
Giant magnetoresistance
 
Nanomaterials
Nanomaterials Nanomaterials
Nanomaterials
 
Synthesis and characterization of nanocomposites
Synthesis and characterization of nanocompositesSynthesis and characterization of nanocomposites
Synthesis and characterization of nanocomposites
 

Viewers also liked

Combining High-Throughput Computing and Statistical Learning to Develop and U...
Combining High-Throughput Computing and Statistical Learning to Develop and U...Combining High-Throughput Computing and Statistical Learning to Develop and U...
Combining High-Throughput Computing and Statistical Learning to Develop and U...
Anubhav Jain
 
The Materials Project: An Electronic Structure Database for Community-Based M...
The Materials Project: An Electronic Structure Database for Community-Based M...The Materials Project: An Electronic Structure Database for Community-Based M...
The Materials Project: An Electronic Structure Database for Community-Based M...
Anubhav Jain
 
FireWorks overview
FireWorks overviewFireWorks overview
FireWorks overview
Anubhav Jain
 
FireWorks workflow software
FireWorks workflow softwareFireWorks workflow software
FireWorks workflow software
Anubhav Jain
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
Anubhav Jain
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisMarcus Hanwell
 
Application of the Materials Project database and data mining towards the des...
Application of the Materials Project database and data mining towards the des...Application of the Materials Project database and data mining towards the des...
Application of the Materials Project database and data mining towards the des...
Anubhav Jain
 
MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)
MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)
MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)
University of California, San Diego
 
MAVRL Workshop 2014 - pymatgen-db & custodian
MAVRL Workshop 2014 - pymatgen-db & custodianMAVRL Workshop 2014 - pymatgen-db & custodian
MAVRL Workshop 2014 - pymatgen-db & custodian
University of California, San Diego
 
The Materials Project and computational materials discovery
The Materials Project and computational materials discoveryThe Materials Project and computational materials discovery
The Materials Project and computational materials discovery
Anubhav Jain
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
Anubhav Jain
 
Educational Software Categories
Educational Software CategoriesEducational Software Categories
Educational Software Categories
Jeremy Brueck
 
Academic Software
Academic SoftwareAcademic Software
Academic Softwareandy85730
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
Anubhav Jain
 
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Anubhav Jain
 
Exercise and sports science
Exercise and sports scienceExercise and sports science
Exercise and sports scienceJun Red Ramos
 
Basic Exercise physiology
Basic Exercise physiologyBasic Exercise physiology
Basic Exercise physiology
edwinmills39
 
Types of Instructional Software
Types of Instructional SoftwareTypes of Instructional Software
Types of Instructional Software
Alaa Sadik
 

Viewers also liked (18)

Combining High-Throughput Computing and Statistical Learning to Develop and U...
Combining High-Throughput Computing and Statistical Learning to Develop and U...Combining High-Throughput Computing and Statistical Learning to Develop and U...
Combining High-Throughput Computing and Statistical Learning to Develop and U...
 
The Materials Project: An Electronic Structure Database for Community-Based M...
The Materials Project: An Electronic Structure Database for Community-Based M...The Materials Project: An Electronic Structure Database for Community-Based M...
The Materials Project: An Electronic Structure Database for Community-Based M...
 
FireWorks overview
FireWorks overviewFireWorks overview
FireWorks overview
 
FireWorks workflow software
FireWorks workflow softwareFireWorks workflow software
FireWorks workflow software
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
 
Application of the Materials Project database and data mining towards the des...
Application of the Materials Project database and data mining towards the des...Application of the Materials Project database and data mining towards the des...
Application of the Materials Project database and data mining towards the des...
 
MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)
MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)
MAVRL Workshop 2014 - Python Materials Genomics (pymatgen)
 
MAVRL Workshop 2014 - pymatgen-db & custodian
MAVRL Workshop 2014 - pymatgen-db & custodianMAVRL Workshop 2014 - pymatgen-db & custodian
MAVRL Workshop 2014 - pymatgen-db & custodian
 
The Materials Project and computational materials discovery
The Materials Project and computational materials discoveryThe Materials Project and computational materials discovery
The Materials Project and computational materials discovery
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
Educational Software Categories
Educational Software CategoriesEducational Software Categories
Educational Software Categories
 
Academic Software
Academic SoftwareAcademic Software
Academic Software
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
 
Exercise and sports science
Exercise and sports scienceExercise and sports science
Exercise and sports science
 
Basic Exercise physiology
Basic Exercise physiologyBasic Exercise physiology
Basic Exercise physiology
 
Types of Instructional Software
Types of Instructional SoftwareTypes of Instructional Software
Types of Instructional Software
 

Similar to Software tools to facilitate materials science research

Open-source tools for generating and analyzing large materials data sets
Open-source tools for generating and analyzing large materials data setsOpen-source tools for generating and analyzing large materials data sets
Open-source tools for generating and analyzing large materials data sets
Anubhav Jain
 
The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...
Anubhav Jain
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Ilkay Altintas, Ph.D.
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
Anubhav Jain
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in Python
Daniel S. Katz
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACI
Daniel S. Katz
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibility
c.titus.brown
 
Resources for Teaching Undergraduate Computational Physics
Resources for Teaching Undergraduate Computational PhysicsResources for Teaching Undergraduate Computational Physics
Resources for Teaching Undergraduate Computational Physics
Amdeselassie Amde
 
Confessions of an Interdisciplinary Researcher: The Case of High Performance ...
Confessions of an Interdisciplinary Researcher: The Case of High Performance ...Confessions of an Interdisciplinary Researcher: The Case of High Performance ...
Confessions of an Interdisciplinary Researcher: The Case of High Performance ...
tiberiusp
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
Anubhav Jain
 
Materials Project computation and database infrastructure
Materials Project computation and database infrastructureMaterials Project computation and database infrastructure
Materials Project computation and database infrastructure
Anubhav Jain
 
Atomate: a high-level interface to generate, execute, and analyze computation...
Atomate: a high-level interface to generate, execute, and analyze computation...Atomate: a high-level interface to generate, execute, and analyze computation...
Atomate: a high-level interface to generate, execute, and analyze computation...
Anubhav Jain
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
David De Roure
 
ExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine LearningExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine Learning
inside-BigData.com
 
Reproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesReproducibility: 10 Simple Rules
Reproducibility: 10 Simple Rules
Annika Eriksson
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Anubhav Jain
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021
Gérard Dupont
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna Workflows
Andrea Wiggins
 
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Richard Zijdeman
 
Data legend dh_benelux_2017.key
Data legend dh_benelux_2017.keyData legend dh_benelux_2017.key
Data legend dh_benelux_2017.key
Richard Zijdeman
 

Similar to Software tools to facilitate materials science research (20)

Open-source tools for generating and analyzing large materials data sets
Open-source tools for generating and analyzing large materials data setsOpen-source tools for generating and analyzing large materials data sets
Open-source tools for generating and analyzing large materials data sets
 
The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in Python
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACI
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibility
 
Resources for Teaching Undergraduate Computational Physics
Resources for Teaching Undergraduate Computational PhysicsResources for Teaching Undergraduate Computational Physics
Resources for Teaching Undergraduate Computational Physics
 
Confessions of an Interdisciplinary Researcher: The Case of High Performance ...
Confessions of an Interdisciplinary Researcher: The Case of High Performance ...Confessions of an Interdisciplinary Researcher: The Case of High Performance ...
Confessions of an Interdisciplinary Researcher: The Case of High Performance ...
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
 
Materials Project computation and database infrastructure
Materials Project computation and database infrastructureMaterials Project computation and database infrastructure
Materials Project computation and database infrastructure
 
Atomate: a high-level interface to generate, execute, and analyze computation...
Atomate: a high-level interface to generate, execute, and analyze computation...Atomate: a high-level interface to generate, execute, and analyze computation...
Atomate: a high-level interface to generate, execute, and analyze computation...
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
ExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine LearningExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine Learning
 
Reproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesReproducibility: 10 Simple Rules
Reproducibility: 10 Simple Rules
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna Workflows
 
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
 
Data legend dh_benelux_2017.key
Data legend dh_benelux_2017.keyData legend dh_benelux_2017.key
Data legend dh_benelux_2017.key
 

More from Anubhav Jain

Discovering advanced materials for energy applications: theory, high-throughp...
Discovering advanced materials for energy applications: theory, high-throughp...Discovering advanced materials for energy applications: theory, high-throughp...
Discovering advanced materials for energy applications: theory, high-throughp...
Anubhav Jain
 
Applications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
Anubhav Jain
 
An AI-driven closed-loop facility for materials synthesis
An AI-driven closed-loop facility for materials synthesisAn AI-driven closed-loop facility for materials synthesis
An AI-driven closed-loop facility for materials synthesis
Anubhav Jain
 
Best practices for DuraMat software dissemination
Best practices for DuraMat software disseminationBest practices for DuraMat software dissemination
Best practices for DuraMat software dissemination
Anubhav Jain
 
Best practices for DuraMat software dissemination
Best practices for DuraMat software disseminationBest practices for DuraMat software dissemination
Best practices for DuraMat software dissemination
Anubhav Jain
 
Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...
Anubhav Jain
 
Natural Language Processing for Data Extraction and Synthesizability Predicti...
Natural Language Processing for Data Extraction and Synthesizability Predicti...Natural Language Processing for Data Extraction and Synthesizability Predicti...
Natural Language Processing for Data Extraction and Synthesizability Predicti...
Anubhav Jain
 
Machine Learning for Catalyst Design
Machine Learning for Catalyst DesignMachine Learning for Catalyst Design
Machine Learning for Catalyst Design
Anubhav Jain
 
Natural language processing for extracting synthesis recipes and applications...
Natural language processing for extracting synthesis recipes and applications...Natural language processing for extracting synthesis recipes and applications...
Natural language processing for extracting synthesis recipes and applications...
Anubhav Jain
 
Accelerating New Materials Design with Supercomputing and Machine Learning
Accelerating New Materials Design with Supercomputing and Machine LearningAccelerating New Materials Design with Supercomputing and Machine Learning
Accelerating New Materials Design with Supercomputing and Machine Learning
Anubhav Jain
 
DuraMat CO1 Central Data Resource: How it started, how it’s going …
DuraMat CO1 Central Data Resource: How it started, how it’s going …DuraMat CO1 Central Data Resource: How it started, how it’s going …
DuraMat CO1 Central Data Resource: How it started, how it’s going …
Anubhav Jain
 
The Materials Project
The Materials ProjectThe Materials Project
The Materials Project
Anubhav Jain
 
Evaluating Chemical Composition and Crystal Structure Representations using t...
Evaluating Chemical Composition and Crystal Structure Representations using t...Evaluating Chemical Composition and Crystal Structure Representations using t...
Evaluating Chemical Composition and Crystal Structure Representations using t...
Anubhav Jain
 
Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...
Anubhav Jain
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials Project
Anubhav Jain
 
The Materials Project: Applications to energy storage and functional materia...
The Materials Project: Applications to energy storage and functional materia...The Materials Project: Applications to energy storage and functional materia...
The Materials Project: Applications to energy storage and functional materia...
Anubhav Jain
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
Anubhav Jain
 
Machine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst DesignMachine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst Design
Anubhav Jain
 
Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials Design
Anubhav Jain
 
Assessing Factors Underpinning PV Degradation through Data Analysis
Assessing Factors Underpinning PV Degradation through Data AnalysisAssessing Factors Underpinning PV Degradation through Data Analysis
Assessing Factors Underpinning PV Degradation through Data Analysis
Anubhav Jain
 

More from Anubhav Jain (20)

Discovering advanced materials for energy applications: theory, high-throughp...
Discovering advanced materials for energy applications: theory, high-throughp...Discovering advanced materials for energy applications: theory, high-throughp...
Discovering advanced materials for energy applications: theory, high-throughp...
 
Applications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
 
An AI-driven closed-loop facility for materials synthesis
An AI-driven closed-loop facility for materials synthesisAn AI-driven closed-loop facility for materials synthesis
An AI-driven closed-loop facility for materials synthesis
 
Best practices for DuraMat software dissemination
Best practices for DuraMat software disseminationBest practices for DuraMat software dissemination
Best practices for DuraMat software dissemination
 
Best practices for DuraMat software dissemination
Best practices for DuraMat software disseminationBest practices for DuraMat software dissemination
Best practices for DuraMat software dissemination
 
Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...
 
Natural Language Processing for Data Extraction and Synthesizability Predicti...
Natural Language Processing for Data Extraction and Synthesizability Predicti...Natural Language Processing for Data Extraction and Synthesizability Predicti...
Natural Language Processing for Data Extraction and Synthesizability Predicti...
 
Machine Learning for Catalyst Design
Machine Learning for Catalyst DesignMachine Learning for Catalyst Design
Machine Learning for Catalyst Design
 
Natural language processing for extracting synthesis recipes and applications...
Natural language processing for extracting synthesis recipes and applications...Natural language processing for extracting synthesis recipes and applications...
Natural language processing for extracting synthesis recipes and applications...
 
Accelerating New Materials Design with Supercomputing and Machine Learning
Accelerating New Materials Design with Supercomputing and Machine LearningAccelerating New Materials Design with Supercomputing and Machine Learning
Accelerating New Materials Design with Supercomputing and Machine Learning
 
DuraMat CO1 Central Data Resource: How it started, how it’s going …
DuraMat CO1 Central Data Resource: How it started, how it’s going …DuraMat CO1 Central Data Resource: How it started, how it’s going …
DuraMat CO1 Central Data Resource: How it started, how it’s going …
 
The Materials Project
The Materials ProjectThe Materials Project
The Materials Project
 
Evaluating Chemical Composition and Crystal Structure Representations using t...
Evaluating Chemical Composition and Crystal Structure Representations using t...Evaluating Chemical Composition and Crystal Structure Representations using t...
Evaluating Chemical Composition and Crystal Structure Representations using t...
 
Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials Project
 
The Materials Project: Applications to energy storage and functional materia...
The Materials Project: Applications to energy storage and functional materia...The Materials Project: Applications to energy storage and functional materia...
The Materials Project: Applications to energy storage and functional materia...
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
 
Machine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst DesignMachine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst Design
 
Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials Design
 
Assessing Factors Underpinning PV Degradation through Data Analysis
Assessing Factors Underpinning PV Degradation through Data AnalysisAssessing Factors Underpinning PV Degradation through Data Analysis
Assessing Factors Underpinning PV Degradation through Data Analysis
 

Recently uploaded

general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
ronaldlakony0
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
alishadewangan1
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 

Recently uploaded (20)

general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 

Software tools to facilitate materials science research

  • 1. Software tools to facilitate materials science research Anubhav Jain Energy Technologies Area Lawrence Berkeley National Laboratory Berkeley, CA S2I2 Workshop, Feb 2017 Slides (already) posted to http://www.slideshare.net/anubhavster
  • 2. What we work on •  We don’t develop or debut the new and fashionable computational methods •  We adopt methods, standardize the parts that are ready for mass reproduction, and execute them over thousands of materials 2
  • 3. Our research interests as materials scientists 3 High-throughput calculations (each point is a possible battery cathode) Discovery of new functional materials (e.g., new bulk thermoelectrics)
  • 4. A user’s perspective of materials simulation 4 “something”! Results! PI! What is the GGA-PBE elas0c tensor of GaAs?
  • 5. A user’s perspective of materials simulation 5 “something”! = student/postdoc! Results! PI! What is the GGA-PBE elas0c tensor of GaAs? Input file flags Queue format how to fix ZPOTRF?
  • 6. Why this system? •  It works! •  Many aspects of running simulations seem tailor- made for assigning to students/postdocs –  requires specialized knowledge –  labor intensive –  helpful to have a high pain threshold •  But there are also disadvantages… 6 Nicola Marzari’s “Middle Age Workshop” analogy
  • 7. Staff specialization can get out of control Because of the steep learning curve to computational methods, there is often a single group member assigned to a technique 7 “Alice knows how to do charged defect calculations.”! “Bob is the one who can properly converge GW runs.”! “Olga has all the scripts for phonon calculations.”!
  • 8. Errors are all too common Let’s take a look at two alternate universes: Which universe you are in? Are you sure? 8 student! has coffee! copies files from! previous simulation! edits 5 lines! runs simulation,! delivers report! student! forgets coffee! copies files from! previous simulation! edits 4 lines! forgets! LHFCALC=F! delivers report, ! looks fine at first, ! in a month you ! discover it was wrong! 1 2
  • 9. Takes too long to get results •  Calculations are labor intensive! –  set up the structure coordinates –  write input files, double-check all the flags –  copy to supercomputer –  submit job to queue –  deal with supercomputer headaches –  monitor job –  fix error jobs, resubmit to queue, wait again –  repeat process for subsequent calculations in workflow –  parse output files to obtain results –  copy and organize results, e.g., into Excel 9
  • 10. There is a lot of back-and-forth in the analysis •  Student/postdoc presents Powerpoint/Excel of the results •  PI wants to know certain details or follow up based on the data, which are missing from the Powerpoint/Excel •  Student/postdoc says “I will get back to you”, goes back to office, re-processes the data, and prepares a revised report within a few days •  Repeat… 10
  • 11. What would be a better way? 11 “something”! = a computer! ! Results! PI! What is the GGA-PBE elas0c tensor of GaAs?
  • 12. All past and present knowledge, from everyone in the group, everyone previously in the group, and outside collaborators, about how to run calculations Reduce specialization 12
  • 13. Reduce errors and improve efficiency •  Computers can’t forget to set an input flag •  Computers (in theory) can create, correct, submit, parse, and deliver the results of calculations much faster than even the fastest student 13
  • 14. Improve analytics / visualization •  Excel and Powerpoint works for a curated view of the results •  But online analytics would allow you to do things like: –  view crystal structures on demand –  generate the plot you want 14
  • 15. So this the vision we want – is it achievable? 15 “something”! = a computer! ! Results! PI! What is the GGA-PBE elas0c tensor of GaAs?
  • 16. Yes! – and it is available on Materials Project 16 Input generation (parameter choice) Workflow mapping Supercomputer submission / monitoring Error handling File Transfer File Parsing / DB insertion Custom material Submit! www.materialsproject.org “Crystal Toolkit” Anyone can find, edit, and submit (suggest) structures Currently, this feature is available for: •  structure optimization •  band structures •  elastic tensors
  • 17. Software technologies to enable automatization 17 (automatic materials science workflows) Custodian (calculation error recovery) (materials analysis framework) Base packages Derived package (workflow framework and supercomputer interface) These are all open-source: •  pymatgen and custodian are led by Prof. Ong group (UC San Diego) •  Developed in coordination with the Materials Project and Persson group
  • 18. pymatgen – object-oriented materials analysis 18 www.pymatgen.org! Ong, S. P., Richards, W. D., Jain, A., Hautier, G., Kocher, M., Cholia, S., Gunter, D., Chevrier, V. L., Persson, K. a. & Ceder, G. Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).!
  • 19. pymatgen – examples of analyses 19 phase diagrams Pourbaix diagrams diffusivity from MDband structure analysis
  • 20. pymatgen - many useful tools made accessible 20 Structure Matcher analyzes if two periodic structures are equivalent, even if they are in different settings or have minor distortions = ?! Order-disorder resolve partial or mixed occupancies into a fully ordered crystal structure (e.g., mixed oxide-fluoride site into separate oxygen/fluorine) Many other tools, such as: •  Bond-valence sums to determine valence •  Voronoi coordination as well as 3D coordination polyhedron analysis •  Automatically find and insert interstitial sites •  Diffraction pattern modeling •  Simple cost and materials availability estimators
  • 21. custodian – fixing job errors •  Custodian can wrap around an executable (e.g., VASP) –  i.e., run custodian instead of directly running VASP •  During execution, custodian will monitor output files and detect errors / problems –  If so, it can change input files and rerun the job –  e.g., if ZPOTRF error detected, rerun with ISYM=0 –  ever-expanding library of fixes 21
  • 22. FireWorks – scientific workflow software •  FireWorks is an open-source scientific workflow software •  Materials Project, JCESR, and other projects manage their runs with FireWorks –  >1 million jobs –  >100 million CPU-hours –  multiple computing clusters •  You can write any workflow –  e.g., FireWorks is used for graphics processing, machine learning, document processing, and protein folding –  #1 Google hit for “Python workflow software”, top 5 for general scientific workflow software •  Detailed tutorials are available 22 Jain, A., Ong, S. P., Chen, W., Medasani, B., Qu, X., Kocher, M., Brafman, M., Petretto, G., Rignanese, G.-M., Hautier, G., Gunter, D. & Persson, K. A. FireWorks: a dynamic workflow system designed for high-throughput applications. Concurr. Comput. Pract. Exp. 22, 5037–5059 (2015).! www.pythonhosted.org/ FireWorks!
  • 23. FireWorks – screenshot of jobs status 23 Live version at http://fireworks.dash.materialsproject.org
  • 24. atomate – our newest code (redesigns our older codes) 24 translate PI-style (minimal) specifications into well- defined FireWorks workflows (FireWorks handles all the execution and job management details) What is the GGA-PBE elas0c tensor of GaAs?
  • 25. atomate – what’s available? 25 K. Mathew J. Montoya S. DwaraknathA. Faghaninia •  band structure •  spin-orbit coupling •  hybrid functional calcs •  elastic tensor •  piezoelectric tensor •  Raman spectra •  GIBBS method •  QH thermal expansion •  AIMD •  FEFF method •  LAMMPS MD All past and present knowledge, from everyone in the group, everyone previously in the group, and outside collaborators, about how to run calculations M. Aykol S.P. Ong
  • 26. Further resources •  The Github web sites –  www.github.com/materialsproject –  www.github.com/hackingmaterials •  Software carpentry •  https://software-carpentry.org 26
  • 27. Needed: better way to learn methods •  It can take many months, and perhaps even an internship in a group with relevant expertise, to learn to use a new method •  Workshops are one way to speed the process •  However, self-serve ways to learn new methods would be wonderful –  e.g., web tutorials that mix together theory and practice •  Consider: what fraction of people could learn to correctly use your code/method given only a single web link and no direct communication with anyone? (they are allowed to find and use other web resources based on the initial link) –  Example: https://www.youtube.com/user/MaterialsProject 27
  • 28. Needed: curation of tools and methods •  A place to kick-start discovery and learning of new codes and tools: –  “Too basic” example: http://materials.sh (Shyue Ping Ong, UCSD) –  “Too complex/messy” example: Nanohub 28
  • 29. Needed: standardizing data *containers* •  Different codes will have different inputs and outputs, so obviously data organization will vary •  But the “container” of the data organization can be consistent. e.g., you can represent arrays within: –  JSON –  YAML –  XML –  HDF5 –  but don’t invent your own format to represent an array! •  Some of these container formats are human- readable, i.e., easy to edit in a text editor •  No more “code parses custom input file format to produce custom output file format” 29
  • 30. Needed: other ways to improve accuracy 30 DFT band gap = cheap lens Some kind of super accurate post-Bethe- Salpeter method How to improve image quality? Strategy 1
  • 31. Needed: other ways to improve accuracy 31 Computer algorithms improve image How to improve image quality? Strategy 2 Software corrects for cheap lens. e.g., distortion, two images to create depth of field
  • 32. Needed: other ways to improve accuracy 32 correct and mix cheap/simple calculations to improve output quality Jain, A., Hau0er, G., Ong, S. P., Moore, C. J., Fischer, C. C., Persson, K. A. & Ceder, G. Forma0on enthalpies by mixing GGA and GGA+U calcula0ons. Phys. Rev. B 84, 45115 (2011). !
  • 33. Needed: other ways to improve accuracy 33 Correcting the DFT is necessary to getting decent phase diagrams Almost everyone that is practicing new materials design does some flavor of post-correction (e.g., gas phase energies) More effort into comparing, developing, and validating such methods is needed. Jain, A., Hau0er, G., Ong, S. P., Moore, C. J., Fischer, C. C., Persson, K. A. & Ceder, G. Forma0on enthalpies by mixing GGA and GGA+U calcula0ons. Phys. Rev. B 84, 45115 (2011). !
  • 34. Questions? 34 Slides (already) posted to http://www.slideshare.net/anubhavster
  • 35. Some lessons learned (1) •  In the beginning, strong central coordination from authority was needed to develop these –  require that people contribute to common code, e.g. pymatgen, and not write their own detached scripts •  Once a code was “established”, less authority was needed –  people voluntarily contributed improvements rather than writing their own code because this benefited them •  Today the process is almost completely decentralized –  culture has changed –  even for new codes, people rally around it rather than build independent things 35
  • 36. Some lessons learned (2) •  It is helpful to have a strong BDFL (benevolent dictator for life) for each codebase •  Requirements for the BDFL: –  very detail-oriented –  cares about the code itself, not just the application –  cares more about the code quality than about offending teammates, i.e., will not accept poor quality contributions –  at the same time, able to rally support from people and convince them to contribute or clean up code –  willing to work overtime to do things like write detailed docs, advocate for the code, review commits, etc. –  derives joy from building and deploying things! 36
  • 37. Some lessons learned (3) •  Spending time to do things like improve code-cleanliness, writing unit tests, writing documentation, etc. is not such a “noble” and “self-sacrificing” act like people make it out to be –  I’ve referred my own documentation many times –  I’ve saved myself from a world of trouble by previously writing unit tests to detect bugs –  I’ve been able to write and build large code much faster due to previous commitments to code cleanliness (and been slowed down in my progress when I’ve relaxed these constraints) •  We don’t like to admit this, but a lack of attention to detail in the past has easily cost us tens of thousands of dollars in wasted computing and countless labor hours – but some of this is inevitable with large projects 37
  • 38. Some lessons learned (4) •  Computer scientists are useful for staying up to date in the fast-moving world of software –  2006: I took a graduate class in databases at MIT; all SQL, not a single mention of “NoSQL” –  2011: We are designing the framework for Materials Project; I have lots of experience with SQL; a computer scientist casually mentions NoSQL, its growing prominence, and its potential applicability to our problem –  2017: We do almost everything in NoSQL •  Lesson: software moves fast! Much faster than materials science knowledge or methods. Don’t use data from 5 years ago to inform your decision. 38