SlideShare a Scribd company logo
1 of 20
Data Science Solutions by Materials Scientists

The Early Case Studies
Tony Fast
Materials Data Analyst
Materials Informatics for
Engineering Design

Woodruff School of
Mechanical Engineering
Georgia Institute
of Technology

*Any MINED shield is a link to a resource.
An Archival and Self Describing Data Format using HDF5
Data and Metadata stored in one file, Support in many languages, and Ideal
support for high-dimensional data

*MXADataModel – Archival Data Format – ONR/DARPA Dynamic 3-D Digital Structures Program
HDF5 - The little zip file that could…

One Dataset – 1.6GB – 4 Experiments –with 160 Datasets each
…..no long term value.
Volume Variety Velocity = Big Data
Materials Science
Polymer - MD

Titanium

Jacobs -GaTech

Frasier -OSU

Martensitic Steel

Gumbusch

SiC/SiC

Ritchie- LLNL

Bamboo

Wegst - Dartmouth
Al-Cu Solidification

Voorhees - NW

The velocity that data is generated will rise and the
speed that it will be analyzed in will decrease.
β-Titanium
REDUCED OUTPUT:
Grain size
Grain Faces
Number of Grains
Mean Curvature
Nearest Grain Analysis

10 micron resolution with 4300 Grains

Compare with empirical models

Materials Science is a Big Data domain, but it is not treated that way.

Rowenhorst, Lewis, Spanos, Acta Mat, 2010
Harvard Clean Energy
Project Database

AFLOW, Curtarolo Group

Example Databases
STRUCTURE
INFORMATICS
WORKFLOW

INTELLIGENT DESIGN OF
EXPERIMENTS

PHYSICS BASED MODELS
SIMULATION EXPERIMENT
MICROSTRUCTURE (MATERIAL)
SIGNAL PROCESSING
ADVANCED & OBJECTIVE
STATISTICAL ENCODING

DATA SCIENCE MODULES

INNOVATION ACCOUNTING

Microstructure Informatics is a scalable,
data-driven system to mine structureproperty/processing connections from
experimental and simulation materials
science information; structure being the
independent variable. The system is
agnostic to material system and length
scale, objectively quantifiable, and
rapidly iterates in less cycles for both
materials improvement and discovery.
DATA SCIENCE
MODULES

Property

Microstructure
Material Structure

Processing

Data science modules are machine learning and statistical
tools
to
extract
rich
bi-directional
structureproperty/processing linkages from encodings of materials &
microstructure datasets. Mining modules create structure
taxonomies, homogenization and localization relationships,
ground truth comparison between simulation and experiment,
materials discovery, and materials improvement.
ADVANCED & OBJECTIVE
STATISTICAL ENCODING

THE MICROSTRUCTURE IS A SAMPLE
IN AN IMMENSE STATISTICAL POPULATION.

α-β Titanium
SPATIAL

STATISTICS
Statistical correlations between random points in space/time which
reveal systematic patterns in the microstructure. Contains the original
μS within a translation & inversion. An objective encoding for most
materials datasets.

t

h'
msh ms+t
ft hh' = å
Dt
s

t

t
The fidelity of the spatial statistics are impacted by how the
material structure is parameterized as a signal.

CURRENT APPLICATIONS
metals, polymers, fuel cells, cmc, md, & a bunch of other things

TYPES OF SIGNALS
sparse, experimental, simulation, heterogeneous, surface, bulk
Objective Microstructure Classification of α-β Titanium
Images StatisticsMine with Principal Component Analysis
Mechanical Deformation
of Polymer Chains

Molecular Dynamics
of Aluminum Atoms
Bottom-up Homogenization Relationships

model

GDL

MPL

simulation

X-CTFinite Element ModelingStatistics
Regression to connect the statistics with diffusivity values from FEM
Meta-modeling with Materials Knowledge Systems
Top-down localization relationships

FEM
ε=5e-4

ps = åå a m
h
t

t

h
s+t

h

The MKS design filters that capture the effect of the local arrangement of
the microstructure on the response. The filters are learned from physics
based models and can only be as accurate as the model never better.
Meta-modeling with Materials Knowledge Systems

Any Model

Top-down localization relationships

INPUT

OUTPUT

Control

ps = åå a m
h
t

t

h
s+t

h

The MKS design filters that capture the effect of the local arrangement of
the microstructure on the response. The filters are learned from physics
based models and can only be as accurate as the model never better.
Top-Down Localization Relationships for High Contrast Composites

The MKS is a scalable, parallel meta-model that learns from physics based
models to enable rapid simulation at a cost in accuracy.
N2 vs. Nlog(N) complexity
It learns top-down localization relationships to extra extreme value events and
enables multiscale integration.

OTHER APPLICATIONS
Spinodal Decomposition, Grain Coarsening,
Thermo-mechanical, Polycrystalline
Objective parametric descriptors and data science enable integration
of bi-direction structure-property/processing linkages.

Structure-Property
Homogenization

Structure-Processing MKS
Processing History

Structure-Property
Localization
Data enables bidirectional S-P/P, multiscale integration, and higher throughput

CORE TECHNOLOGIES TO FUEL THE DATA AGE OF MATERIALS SCIENCE
Open Access, Open Source Software, Scalable Databases, High-Statistical
Throughput Simulation and Experiment, Image Segmentation, Machine
Learning, Scalable Databases, Metadata Integration, Mobile Technology,
Visualization, High Performance Computing,
Cyberinfrastructure/Collaboratories, Collaboration & Sharing
Selected Links
Any shield in this presentation is a link

HDF5
HDFView
MXADataModel
Curtarolo Group
AFLOW
Harvard Clean Energy Project
Serial Sectioned Titanium
MATIN
Materials Genome Initiative

http://www.hdfgroup.org/HDF5/whatishdf5.html
http://www.hdfgroup.org/hdf-java-html/hdfview/
http://mxa.web.cmu.edu/Background.html
http://www.mems.duke.edu/faculty/stefano-curtarolo
http://materials.duke.edu/apool.html
http://www.molecularspace.org/
https://cosmicweb.mse.iastate.edu/wiki/pages/viewpage.action?pageId=753830
http://www.materials.gatech.edu/matin
http://www.whitehouse.gov/mgi

More Related Content

What's hot

When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
Technical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisTechnical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisGeorgios M. GIAKOUMIS
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryIan Foster
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsAnubhav Jain
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Anubhav Jain
 
Green wsn optimization of energy use
Green wsn  optimization of energy useGreen wsn  optimization of energy use
Green wsn optimization of energy useijfcstjournal
 
Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...JPINFOTECH JAYAPRAKASH
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesIRJET Journal
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Designaimsnist
 
Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Anubhav Jain
 
Microstructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningMicrostructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningPFHub PFHub
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataJoel Saltz
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryKAMAL CHOUDHARY
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Anubhav Jain
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Anubhav Jain
 
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Anubhav Jain
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...ijcsit
 
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Anubhav Jain
 

What's hot (20)

When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Technical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisTechnical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesis
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials Informatics
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
Green wsn optimization of energy use
Green wsn  optimization of energy useGreen wsn  optimization of energy use
Green wsn optimization of energy use
 
Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...
 
Microstructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningMicrostructural Analysis and Machine Learning
Microstructural Analysis and Machine Learning
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discovery
 
PPT
PPTPPT
PPT
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
 
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
 
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
 

Similar to Data Science Solutions by Materials Scientists: The Early Case Studies

Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...Dr. Pankaj Dhussa
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applicationsaimsnist
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsAnubhav Jain
 
Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Liza Charalambous
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningKAMAL CHOUDHARY
 
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...remAYDOAN3
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningAdvanced-Concepts-Team
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Manuel Martín
 
Energy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCEnergy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCpaperpublications3
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogcemarpierc
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstracttsysglobalsolutions
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...KAMAL CHOUDHARY
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...Anubhav Jain
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningAnubhav Jain
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesManjulaPatel
 
A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...ieeepondy
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...IJECEIAES
 

Similar to Data Science Solutions by Materials Scientists: The Early Case Studies (20)

Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
 
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine Learning
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?
 
Energy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCEnergy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LC
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogce
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
SamNola_July2016-s
SamNola_July2016-sSamNola_July2016-s
SamNola_July2016-s
 
A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...
 

More from Tony Fast

The internet killed the lab notebook
The internet killed the lab notebookThe internet killed the lab notebook
The internet killed the lab notebookTony Fast
 
Github for Research Science
Github for Research ScienceGithub for Research Science
Github for Research ScienceTony Fast
 
The Materials Data Scientist
The Materials Data ScientistThe Materials Data Scientist
The Materials Data ScientistTony Fast
 
An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsTony Fast
 
Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Tony Fast
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataTony Fast
 
Microstructure Informatics
Microstructure InformaticsMicrostructure Informatics
Microstructure InformaticsTony Fast
 
Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Tony Fast
 
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyHigher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyTony Fast
 
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Tony Fast
 

More from Tony Fast (10)

The internet killed the lab notebook
The internet killed the lab notebookThe internet killed the lab notebook
The internet killed the lab notebook
 
Github for Research Science
Github for Research ScienceGithub for Research Science
Github for Research Science
 
The Materials Data Scientist
The Materials Data ScientistThe Materials Data Scientist
The Materials Data Scientist
 
An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial Statistics
 
Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
 
Microstructure Informatics
Microstructure InformaticsMicrostructure Informatics
Microstructure Informatics
 
Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach
 
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyHigher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
 
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Data Science Solutions by Materials Scientists: The Early Case Studies

  • 1. Data Science Solutions by Materials Scientists The Early Case Studies Tony Fast Materials Data Analyst Materials Informatics for Engineering Design Woodruff School of Mechanical Engineering Georgia Institute of Technology *Any MINED shield is a link to a resource.
  • 2. An Archival and Self Describing Data Format using HDF5 Data and Metadata stored in one file, Support in many languages, and Ideal support for high-dimensional data *MXADataModel – Archival Data Format – ONR/DARPA Dynamic 3-D Digital Structures Program
  • 3. HDF5 - The little zip file that could… One Dataset – 1.6GB – 4 Experiments –with 160 Datasets each …..no long term value.
  • 4. Volume Variety Velocity = Big Data Materials Science Polymer - MD Titanium Jacobs -GaTech Frasier -OSU Martensitic Steel Gumbusch SiC/SiC Ritchie- LLNL Bamboo Wegst - Dartmouth Al-Cu Solidification Voorhees - NW The velocity that data is generated will rise and the speed that it will be analyzed in will decrease.
  • 5. β-Titanium REDUCED OUTPUT: Grain size Grain Faces Number of Grains Mean Curvature Nearest Grain Analysis 10 micron resolution with 4300 Grains Compare with empirical models Materials Science is a Big Data domain, but it is not treated that way. Rowenhorst, Lewis, Spanos, Acta Mat, 2010
  • 6. Harvard Clean Energy Project Database AFLOW, Curtarolo Group Example Databases
  • 7. STRUCTURE INFORMATICS WORKFLOW INTELLIGENT DESIGN OF EXPERIMENTS PHYSICS BASED MODELS SIMULATION EXPERIMENT MICROSTRUCTURE (MATERIAL) SIGNAL PROCESSING ADVANCED & OBJECTIVE STATISTICAL ENCODING DATA SCIENCE MODULES INNOVATION ACCOUNTING Microstructure Informatics is a scalable, data-driven system to mine structureproperty/processing connections from experimental and simulation materials science information; structure being the independent variable. The system is agnostic to material system and length scale, objectively quantifiable, and rapidly iterates in less cycles for both materials improvement and discovery.
  • 8. DATA SCIENCE MODULES Property Microstructure Material Structure Processing Data science modules are machine learning and statistical tools to extract rich bi-directional structureproperty/processing linkages from encodings of materials & microstructure datasets. Mining modules create structure taxonomies, homogenization and localization relationships, ground truth comparison between simulation and experiment, materials discovery, and materials improvement.
  • 9. ADVANCED & OBJECTIVE STATISTICAL ENCODING THE MICROSTRUCTURE IS A SAMPLE IN AN IMMENSE STATISTICAL POPULATION. α-β Titanium
  • 10. SPATIAL STATISTICS Statistical correlations between random points in space/time which reveal systematic patterns in the microstructure. Contains the original μS within a translation & inversion. An objective encoding for most materials datasets. t h' msh ms+t ft hh' = å Dt s t t
  • 11. The fidelity of the spatial statistics are impacted by how the material structure is parameterized as a signal. CURRENT APPLICATIONS metals, polymers, fuel cells, cmc, md, & a bunch of other things TYPES OF SIGNALS sparse, experimental, simulation, heterogeneous, surface, bulk
  • 12. Objective Microstructure Classification of α-β Titanium Images StatisticsMine with Principal Component Analysis
  • 13. Mechanical Deformation of Polymer Chains Molecular Dynamics of Aluminum Atoms
  • 14. Bottom-up Homogenization Relationships model GDL MPL simulation X-CTFinite Element ModelingStatistics Regression to connect the statistics with diffusivity values from FEM
  • 15. Meta-modeling with Materials Knowledge Systems Top-down localization relationships FEM ε=5e-4 ps = åå a m h t t h s+t h The MKS design filters that capture the effect of the local arrangement of the microstructure on the response. The filters are learned from physics based models and can only be as accurate as the model never better.
  • 16. Meta-modeling with Materials Knowledge Systems Any Model Top-down localization relationships INPUT OUTPUT Control ps = åå a m h t t h s+t h The MKS design filters that capture the effect of the local arrangement of the microstructure on the response. The filters are learned from physics based models and can only be as accurate as the model never better.
  • 17. Top-Down Localization Relationships for High Contrast Composites The MKS is a scalable, parallel meta-model that learns from physics based models to enable rapid simulation at a cost in accuracy. N2 vs. Nlog(N) complexity It learns top-down localization relationships to extra extreme value events and enables multiscale integration. OTHER APPLICATIONS Spinodal Decomposition, Grain Coarsening, Thermo-mechanical, Polycrystalline
  • 18. Objective parametric descriptors and data science enable integration of bi-direction structure-property/processing linkages. Structure-Property Homogenization Structure-Processing MKS Processing History Structure-Property Localization
  • 19. Data enables bidirectional S-P/P, multiscale integration, and higher throughput CORE TECHNOLOGIES TO FUEL THE DATA AGE OF MATERIALS SCIENCE Open Access, Open Source Software, Scalable Databases, High-Statistical Throughput Simulation and Experiment, Image Segmentation, Machine Learning, Scalable Databases, Metadata Integration, Mobile Technology, Visualization, High Performance Computing, Cyberinfrastructure/Collaboratories, Collaboration & Sharing
  • 20. Selected Links Any shield in this presentation is a link HDF5 HDFView MXADataModel Curtarolo Group AFLOW Harvard Clean Energy Project Serial Sectioned Titanium MATIN Materials Genome Initiative http://www.hdfgroup.org/HDF5/whatishdf5.html http://www.hdfgroup.org/hdf-java-html/hdfview/ http://mxa.web.cmu.edu/Background.html http://www.mems.duke.edu/faculty/stefano-curtarolo http://materials.duke.edu/apool.html http://www.molecularspace.org/ https://cosmicweb.mse.iastate.edu/wiki/pages/viewpage.action?pageId=753830 http://www.materials.gatech.edu/matin http://www.whitehouse.gov/mgi