SlideShare a Scribd company logo
1 of 20
Data Science Solutions by Materials Scientists

The Early Case Studies
Tony Fast
Materials Data Analyst
Materials Informatics for
Engineering Design

Woodruff School of
Mechanical Engineering
Georgia Institute
of Technology

*Any MINED shield is a link to a resource.
An Archival and Self Describing Data Format using HDF5
Data and Metadata stored in one file, Support in many languages, and Ideal
support for high-dimensional data

*MXADataModel – Archival Data Format – ONR/DARPA Dynamic 3-D Digital Structures Program
HDF5 - The little zip file that could…

One Dataset – 1.6GB – 4 Experiments –with 160 Datasets each
…..no long term value.
Volume Variety Velocity = Big Data
Materials Science
Polymer - MD

Titanium

Jacobs -GaTech

Frasier -OSU

Martensitic Steel

Gumbusch

SiC/SiC

Ritchie- LLNL

Bamboo

Wegst - Dartmouth
Al-Cu Solidification

Voorhees - NW

The velocity that data is generated will rise and the
speed that it will be analyzed in will decrease.
β-Titanium
REDUCED OUTPUT:
Grain size
Grain Faces
Number of Grains
Mean Curvature
Nearest Grain Analysis

10 micron resolution with 4300 Grains

Compare with empirical models

Materials Science is a Big Data domain, but it is not treated that way.

Rowenhorst, Lewis, Spanos, Acta Mat, 2010
Harvard Clean Energy
Project Database

AFLOW, Curtarolo Group

Example Databases
STRUCTURE
INFORMATICS
WORKFLOW

INTELLIGENT DESIGN OF
EXPERIMENTS

PHYSICS BASED MODELS
SIMULATION EXPERIMENT
MICROSTRUCTURE (MATERIAL)
SIGNAL PROCESSING
ADVANCED & OBJECTIVE
STATISTICAL ENCODING

DATA SCIENCE MODULES

INNOVATION ACCOUNTING

Microstructure Informatics is a scalable,
data-driven system to mine structureproperty/processing connections from
experimental and simulation materials
science information; structure being the
independent variable. The system is
agnostic to material system and length
scale, objectively quantifiable, and
rapidly iterates in less cycles for both
materials improvement and discovery.
DATA SCIENCE
MODULES

Property

Microstructure
Material Structure

Processing

Data science modules are machine learning and statistical
tools
to
extract
rich
bi-directional
structureproperty/processing linkages from encodings of materials &
microstructure datasets. Mining modules create structure
taxonomies, homogenization and localization relationships,
ground truth comparison between simulation and experiment,
materials discovery, and materials improvement.
ADVANCED & OBJECTIVE
STATISTICAL ENCODING

THE MICROSTRUCTURE IS A SAMPLE
IN AN IMMENSE STATISTICAL POPULATION.

α-β Titanium
SPATIAL

STATISTICS
Statistical correlations between random points in space/time which
reveal systematic patterns in the microstructure. Contains the original
μS within a translation & inversion. An objective encoding for most
materials datasets.

t

h'
msh ms+t
ft hh' = å
Dt
s

t

t
The fidelity of the spatial statistics are impacted by how the
material structure is parameterized as a signal.

CURRENT APPLICATIONS
metals, polymers, fuel cells, cmc, md, & a bunch of other things

TYPES OF SIGNALS
sparse, experimental, simulation, heterogeneous, surface, bulk
Objective Microstructure Classification of α-β Titanium
Images StatisticsMine with Principal Component Analysis
Mechanical Deformation
of Polymer Chains

Molecular Dynamics
of Aluminum Atoms
Bottom-up Homogenization Relationships

model

GDL

MPL

simulation

X-CTFinite Element ModelingStatistics
Regression to connect the statistics with diffusivity values from FEM
Meta-modeling with Materials Knowledge Systems
Top-down localization relationships

FEM
ε=5e-4

ps = åå a m
h
t

t

h
s+t

h

The MKS design filters that capture the effect of the local arrangement of
the microstructure on the response. The filters are learned from physics
based models and can only be as accurate as the model never better.
Meta-modeling with Materials Knowledge Systems

Any Model

Top-down localization relationships

INPUT

OUTPUT

Control

ps = åå a m
h
t

t

h
s+t

h

The MKS design filters that capture the effect of the local arrangement of
the microstructure on the response. The filters are learned from physics
based models and can only be as accurate as the model never better.
Top-Down Localization Relationships for High Contrast Composites

The MKS is a scalable, parallel meta-model that learns from physics based
models to enable rapid simulation at a cost in accuracy.
N2 vs. Nlog(N) complexity
It learns top-down localization relationships to extra extreme value events and
enables multiscale integration.

OTHER APPLICATIONS
Spinodal Decomposition, Grain Coarsening,
Thermo-mechanical, Polycrystalline
Objective parametric descriptors and data science enable integration
of bi-direction structure-property/processing linkages.

Structure-Property
Homogenization

Structure-Processing MKS
Processing History

Structure-Property
Localization
Data enables bidirectional S-P/P, multiscale integration, and higher throughput

CORE TECHNOLOGIES TO FUEL THE DATA AGE OF MATERIALS SCIENCE
Open Access, Open Source Software, Scalable Databases, High-Statistical
Throughput Simulation and Experiment, Image Segmentation, Machine
Learning, Scalable Databases, Metadata Integration, Mobile Technology,
Visualization, High Performance Computing,
Cyberinfrastructure/Collaboratories, Collaboration & Sharing
Selected Links
Any shield in this presentation is a link

HDF5
HDFView
MXADataModel
Curtarolo Group
AFLOW
Harvard Clean Energy Project
Serial Sectioned Titanium
MATIN
Materials Genome Initiative

http://www.hdfgroup.org/HDF5/whatishdf5.html
http://www.hdfgroup.org/hdf-java-html/hdfview/
http://mxa.web.cmu.edu/Background.html
http://www.mems.duke.edu/faculty/stefano-curtarolo
http://materials.duke.edu/apool.html
http://www.molecularspace.org/
https://cosmicweb.mse.iastate.edu/wiki/pages/viewpage.action?pageId=753830
http://www.materials.gatech.edu/matin
http://www.whitehouse.gov/mgi

More Related Content

What's hot

When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
Technical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisTechnical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisGeorgios M. GIAKOUMIS
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryIan Foster
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsAnubhav Jain
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Anubhav Jain
 
Green wsn optimization of energy use
Green wsn  optimization of energy useGreen wsn  optimization of energy use
Green wsn optimization of energy useijfcstjournal
 
Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...JPINFOTECH JAYAPRAKASH
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesIRJET Journal
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Designaimsnist
 
Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Anubhav Jain
 
Microstructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningMicrostructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningPFHub PFHub
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataJoel Saltz
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryKAMAL CHOUDHARY
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Anubhav Jain
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Anubhav Jain
 
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Anubhav Jain
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...ijcsit
 
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Anubhav Jain
 

What's hot (20)

When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Technical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisTechnical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesis
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials Informatics
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
Green wsn optimization of energy use
Green wsn  optimization of energy useGreen wsn  optimization of energy use
Green wsn optimization of energy use
 
Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...
 
Microstructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningMicrostructural Analysis and Machine Learning
Microstructural Analysis and Machine Learning
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discovery
 
PPT
PPTPPT
PPT
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
 
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
 
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
 

Similar to Data Science Solutions by Materials Scientists: The Early Case Studies

Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...Dr. Pankaj Dhussa
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applicationsaimsnist
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsAnubhav Jain
 
Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Liza Charalambous
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningKAMAL CHOUDHARY
 
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...remAYDOAN3
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningAdvanced-Concepts-Team
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Manuel Martín
 
Energy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCEnergy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCpaperpublications3
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogcemarpierc
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstracttsysglobalsolutions
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...KAMAL CHOUDHARY
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...Anubhav Jain
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningAnubhav Jain
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesManjulaPatel
 
A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...ieeepondy
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...IJECEIAES
 

Similar to Data Science Solutions by Materials Scientists: The Early Case Studies (20)

Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
 
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine Learning
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?
 
Energy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCEnergy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LC
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogce
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
SamNola_July2016-s
SamNola_July2016-sSamNola_July2016-s
SamNola_July2016-s
 
A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...
 

More from Tony Fast

The internet killed the lab notebook
The internet killed the lab notebookThe internet killed the lab notebook
The internet killed the lab notebookTony Fast
 
Github for Research Science
Github for Research ScienceGithub for Research Science
Github for Research ScienceTony Fast
 
The Materials Data Scientist
The Materials Data ScientistThe Materials Data Scientist
The Materials Data ScientistTony Fast
 
An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsTony Fast
 
Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Tony Fast
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataTony Fast
 
Microstructure Informatics
Microstructure InformaticsMicrostructure Informatics
Microstructure InformaticsTony Fast
 
Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Tony Fast
 
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyHigher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyTony Fast
 
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Tony Fast
 

More from Tony Fast (10)

The internet killed the lab notebook
The internet killed the lab notebookThe internet killed the lab notebook
The internet killed the lab notebook
 
Github for Research Science
Github for Research ScienceGithub for Research Science
Github for Research Science
 
The Materials Data Scientist
The Materials Data ScientistThe Materials Data Scientist
The Materials Data Scientist
 
An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial Statistics
 
Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
 
Microstructure Informatics
Microstructure InformaticsMicrostructure Informatics
Microstructure Informatics
 
Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach
 
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyHigher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
 
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
 

Recently uploaded

Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Data Science Solutions by Materials Scientists: The Early Case Studies

  • 1. Data Science Solutions by Materials Scientists The Early Case Studies Tony Fast Materials Data Analyst Materials Informatics for Engineering Design Woodruff School of Mechanical Engineering Georgia Institute of Technology *Any MINED shield is a link to a resource.
  • 2. An Archival and Self Describing Data Format using HDF5 Data and Metadata stored in one file, Support in many languages, and Ideal support for high-dimensional data *MXADataModel – Archival Data Format – ONR/DARPA Dynamic 3-D Digital Structures Program
  • 3. HDF5 - The little zip file that could… One Dataset – 1.6GB – 4 Experiments –with 160 Datasets each …..no long term value.
  • 4. Volume Variety Velocity = Big Data Materials Science Polymer - MD Titanium Jacobs -GaTech Frasier -OSU Martensitic Steel Gumbusch SiC/SiC Ritchie- LLNL Bamboo Wegst - Dartmouth Al-Cu Solidification Voorhees - NW The velocity that data is generated will rise and the speed that it will be analyzed in will decrease.
  • 5. β-Titanium REDUCED OUTPUT: Grain size Grain Faces Number of Grains Mean Curvature Nearest Grain Analysis 10 micron resolution with 4300 Grains Compare with empirical models Materials Science is a Big Data domain, but it is not treated that way. Rowenhorst, Lewis, Spanos, Acta Mat, 2010
  • 6. Harvard Clean Energy Project Database AFLOW, Curtarolo Group Example Databases
  • 7. STRUCTURE INFORMATICS WORKFLOW INTELLIGENT DESIGN OF EXPERIMENTS PHYSICS BASED MODELS SIMULATION EXPERIMENT MICROSTRUCTURE (MATERIAL) SIGNAL PROCESSING ADVANCED & OBJECTIVE STATISTICAL ENCODING DATA SCIENCE MODULES INNOVATION ACCOUNTING Microstructure Informatics is a scalable, data-driven system to mine structureproperty/processing connections from experimental and simulation materials science information; structure being the independent variable. The system is agnostic to material system and length scale, objectively quantifiable, and rapidly iterates in less cycles for both materials improvement and discovery.
  • 8. DATA SCIENCE MODULES Property Microstructure Material Structure Processing Data science modules are machine learning and statistical tools to extract rich bi-directional structureproperty/processing linkages from encodings of materials & microstructure datasets. Mining modules create structure taxonomies, homogenization and localization relationships, ground truth comparison between simulation and experiment, materials discovery, and materials improvement.
  • 9. ADVANCED & OBJECTIVE STATISTICAL ENCODING THE MICROSTRUCTURE IS A SAMPLE IN AN IMMENSE STATISTICAL POPULATION. α-β Titanium
  • 10. SPATIAL STATISTICS Statistical correlations between random points in space/time which reveal systematic patterns in the microstructure. Contains the original μS within a translation & inversion. An objective encoding for most materials datasets. t h' msh ms+t ft hh' = å Dt s t t
  • 11. The fidelity of the spatial statistics are impacted by how the material structure is parameterized as a signal. CURRENT APPLICATIONS metals, polymers, fuel cells, cmc, md, & a bunch of other things TYPES OF SIGNALS sparse, experimental, simulation, heterogeneous, surface, bulk
  • 12. Objective Microstructure Classification of α-β Titanium Images StatisticsMine with Principal Component Analysis
  • 13. Mechanical Deformation of Polymer Chains Molecular Dynamics of Aluminum Atoms
  • 14. Bottom-up Homogenization Relationships model GDL MPL simulation X-CTFinite Element ModelingStatistics Regression to connect the statistics with diffusivity values from FEM
  • 15. Meta-modeling with Materials Knowledge Systems Top-down localization relationships FEM ε=5e-4 ps = åå a m h t t h s+t h The MKS design filters that capture the effect of the local arrangement of the microstructure on the response. The filters are learned from physics based models and can only be as accurate as the model never better.
  • 16. Meta-modeling with Materials Knowledge Systems Any Model Top-down localization relationships INPUT OUTPUT Control ps = åå a m h t t h s+t h The MKS design filters that capture the effect of the local arrangement of the microstructure on the response. The filters are learned from physics based models and can only be as accurate as the model never better.
  • 17. Top-Down Localization Relationships for High Contrast Composites The MKS is a scalable, parallel meta-model that learns from physics based models to enable rapid simulation at a cost in accuracy. N2 vs. Nlog(N) complexity It learns top-down localization relationships to extra extreme value events and enables multiscale integration. OTHER APPLICATIONS Spinodal Decomposition, Grain Coarsening, Thermo-mechanical, Polycrystalline
  • 18. Objective parametric descriptors and data science enable integration of bi-direction structure-property/processing linkages. Structure-Property Homogenization Structure-Processing MKS Processing History Structure-Property Localization
  • 19. Data enables bidirectional S-P/P, multiscale integration, and higher throughput CORE TECHNOLOGIES TO FUEL THE DATA AGE OF MATERIALS SCIENCE Open Access, Open Source Software, Scalable Databases, High-Statistical Throughput Simulation and Experiment, Image Segmentation, Machine Learning, Scalable Databases, Metadata Integration, Mobile Technology, Visualization, High Performance Computing, Cyberinfrastructure/Collaboratories, Collaboration & Sharing
  • 20. Selected Links Any shield in this presentation is a link HDF5 HDFView MXADataModel Curtarolo Group AFLOW Harvard Clean Energy Project Serial Sectioned Titanium MATIN Materials Genome Initiative http://www.hdfgroup.org/HDF5/whatishdf5.html http://www.hdfgroup.org/hdf-java-html/hdfview/ http://mxa.web.cmu.edu/Background.html http://www.mems.duke.edu/faculty/stefano-curtarolo http://materials.duke.edu/apool.html http://www.molecularspace.org/ https://cosmicweb.mse.iastate.edu/wiki/pages/viewpage.action?pageId=753830 http://www.materials.gatech.edu/matin http://www.whitehouse.gov/mgi