SlideShare a Scribd company logo
Data Science Solutions by Materials Scientists

The Early Case Studies
Tony Fast
Materials Data Analyst
Materials Informatics for
Engineering Design

Woodruff School of
Mechanical Engineering
Georgia Institute
of Technology

*Any MINED shield is a link to a resource.
An Archival and Self Describing Data Format using HDF5
Data and Metadata stored in one file, Support in many languages, and Ideal
support for high-dimensional data

*MXADataModel – Archival Data Format – ONR/DARPA Dynamic 3-D Digital Structures Program
HDF5 - The little zip file that could…

One Dataset – 1.6GB – 4 Experiments –with 160 Datasets each
…..no long term value.
Volume Variety Velocity = Big Data
Materials Science
Polymer - MD

Titanium

Jacobs -GaTech

Frasier -OSU

Martensitic Steel

Gumbusch

SiC/SiC

Ritchie- LLNL

Bamboo

Wegst - Dartmouth
Al-Cu Solidification

Voorhees - NW

The velocity that data is generated will rise and the
speed that it will be analyzed in will decrease.
β-Titanium
REDUCED OUTPUT:
Grain size
Grain Faces
Number of Grains
Mean Curvature
Nearest Grain Analysis

10 micron resolution with 4300 Grains

Compare with empirical models

Materials Science is a Big Data domain, but it is not treated that way.

Rowenhorst, Lewis, Spanos, Acta Mat, 2010
Harvard Clean Energy
Project Database

AFLOW, Curtarolo Group

Example Databases
STRUCTURE
INFORMATICS
WORKFLOW

INTELLIGENT DESIGN OF
EXPERIMENTS

PHYSICS BASED MODELS
SIMULATION EXPERIMENT
MICROSTRUCTURE (MATERIAL)
SIGNAL PROCESSING
ADVANCED & OBJECTIVE
STATISTICAL ENCODING

DATA SCIENCE MODULES

INNOVATION ACCOUNTING

Microstructure Informatics is a scalable,
data-driven system to mine structureproperty/processing connections from
experimental and simulation materials
science information; structure being the
independent variable. The system is
agnostic to material system and length
scale, objectively quantifiable, and
rapidly iterates in less cycles for both
materials improvement and discovery.
DATA SCIENCE
MODULES

Property

Microstructure
Material Structure

Processing

Data science modules are machine learning and statistical
tools
to
extract
rich
bi-directional
structureproperty/processing linkages from encodings of materials &
microstructure datasets. Mining modules create structure
taxonomies, homogenization and localization relationships,
ground truth comparison between simulation and experiment,
materials discovery, and materials improvement.
ADVANCED & OBJECTIVE
STATISTICAL ENCODING

THE MICROSTRUCTURE IS A SAMPLE
IN AN IMMENSE STATISTICAL POPULATION.

α-β Titanium
SPATIAL

STATISTICS
Statistical correlations between random points in space/time which
reveal systematic patterns in the microstructure. Contains the original
μS within a translation & inversion. An objective encoding for most
materials datasets.

t

h'
msh ms+t
ft hh' = å
Dt
s

t

t
The fidelity of the spatial statistics are impacted by how the
material structure is parameterized as a signal.

CURRENT APPLICATIONS
metals, polymers, fuel cells, cmc, md, & a bunch of other things

TYPES OF SIGNALS
sparse, experimental, simulation, heterogeneous, surface, bulk
Objective Microstructure Classification of α-β Titanium
Images StatisticsMine with Principal Component Analysis
Mechanical Deformation
of Polymer Chains

Molecular Dynamics
of Aluminum Atoms
Bottom-up Homogenization Relationships

model

GDL

MPL

simulation

X-CTFinite Element ModelingStatistics
Regression to connect the statistics with diffusivity values from FEM
Meta-modeling with Materials Knowledge Systems
Top-down localization relationships

FEM
ε=5e-4

ps = åå a m
h
t

t

h
s+t

h

The MKS design filters that capture the effect of the local arrangement of
the microstructure on the response. The filters are learned from physics
based models and can only be as accurate as the model never better.
Meta-modeling with Materials Knowledge Systems

Any Model

Top-down localization relationships

INPUT

OUTPUT

Control

ps = åå a m
h
t

t

h
s+t

h

The MKS design filters that capture the effect of the local arrangement of
the microstructure on the response. The filters are learned from physics
based models and can only be as accurate as the model never better.
Top-Down Localization Relationships for High Contrast Composites

The MKS is a scalable, parallel meta-model that learns from physics based
models to enable rapid simulation at a cost in accuracy.
N2 vs. Nlog(N) complexity
It learns top-down localization relationships to extra extreme value events and
enables multiscale integration.

OTHER APPLICATIONS
Spinodal Decomposition, Grain Coarsening,
Thermo-mechanical, Polycrystalline
Objective parametric descriptors and data science enable integration
of bi-direction structure-property/processing linkages.

Structure-Property
Homogenization

Structure-Processing MKS
Processing History

Structure-Property
Localization
Data enables bidirectional S-P/P, multiscale integration, and higher throughput

CORE TECHNOLOGIES TO FUEL THE DATA AGE OF MATERIALS SCIENCE
Open Access, Open Source Software, Scalable Databases, High-Statistical
Throughput Simulation and Experiment, Image Segmentation, Machine
Learning, Scalable Databases, Metadata Integration, Mobile Technology,
Visualization, High Performance Computing,
Cyberinfrastructure/Collaboratories, Collaboration & Sharing
Selected Links
Any shield in this presentation is a link

HDF5
HDFView
MXADataModel
Curtarolo Group
AFLOW
Harvard Clean Energy Project
Serial Sectioned Titanium
MATIN
Materials Genome Initiative

http://www.hdfgroup.org/HDF5/whatishdf5.html
http://www.hdfgroup.org/hdf-java-html/hdfview/
http://mxa.web.cmu.edu/Background.html
http://www.mems.duke.edu/faculty/stefano-curtarolo
http://materials.duke.edu/apool.html
http://www.molecularspace.org/
https://cosmicweb.mse.iastate.edu/wiki/pages/viewpage.action?pageId=753830
http://www.materials.gatech.edu/matin
http://www.whitehouse.gov/mgi

More Related Content

What's hot

When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
aimsnist
 
Technical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisTechnical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisGeorgios M. GIAKOUMIS
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
Ian Foster
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials Informatics
Anubhav Jain
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Anubhav Jain
 
Green wsn optimization of energy use
Green wsn  optimization of energy useGreen wsn  optimization of energy use
Green wsn optimization of energy use
ijfcstjournal
 
Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...
JPINFOTECH JAYAPRAKASH
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
IRJET Journal
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
The Statistical and Applied Mathematical Sciences Institute
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
aimsnist
 
Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...
Anubhav Jain
 
Microstructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningMicrostructural Analysis and Machine Learning
Microstructural Analysis and Machine Learning
PFHub PFHub
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
Joel Saltz
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discovery
KAMAL CHOUDHARY
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
Anubhav Jain
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
Anubhav Jain
 
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Anubhav Jain
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
ijcsit
 
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Anubhav Jain
 

What's hot (20)

When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Technical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesisTechnical University of Crete_giakoumisDiplomaThesis
Technical University of Crete_giakoumisDiplomaThesis
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials Informatics
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
Green wsn optimization of energy use
Green wsn  optimization of energy useGreen wsn  optimization of energy use
Green wsn optimization of energy use
 
Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...Distributed processing of probabilistic top k queries in wireless sensor netw...
Distributed processing of probabilistic top k queries in wireless sensor netw...
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...Discovering advanced materials for energy applications by mining the scientif...
Discovering advanced materials for energy applications by mining the scientif...
 
Microstructural Analysis and Machine Learning
Microstructural Analysis and Machine LearningMicrostructural Analysis and Machine Learning
Microstructural Analysis and Machine Learning
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
 
Computational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discoveryComputational Database for 3D and 2D materials to accelerate discovery
Computational Database for 3D and 2D materials to accelerate discovery
 
PPT
PPTPPT
PPT
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
 
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
Accelerated Materials Discovery Using Theory, Optimization, and Natural Langu...
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
 
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
 

Similar to Data Science Solutions by Materials Scientists: The Early Case Studies

Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
Dr. Pankaj Dhussa
 
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
The Statistical and Applied Mathematical Sciences Institute
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
aimsnist
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Anubhav Jain
 
Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Liza Charalambous
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
KAMAL CHOUDHARY
 
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
remAYDOAN3
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine Learning
Advanced-Concepts-Team
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?
Manuel Martín
 
Energy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCEnergy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LC
paperpublications3
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogce
marpierc
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
tsysglobalsolutions
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
KAMAL CHOUDHARY
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
Anubhav Jain
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
Anubhav Jain
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
ManjulaPatel
 
A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...
ieeepondy
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...
IJECEIAES
 

Similar to Data Science Solutions by Materials Scientists: The Early Case Studies (20)

Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
NASA Multiscale Analysis Tool (NASMAT) Robust, Integrated, Physics-based, Non...
 
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
MUMS Opening Workshop - Materials Innovation Driven by Data and Knowledge Sys...
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...Digging deeper into data processing with emphasis on computational and micros...
Digging deeper into data processing with emphasis on computational and micros...
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
Advanced Intelligent Systems - 2020 - Sha - Artificial Intelligence to Power ...
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine Learning
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?
 
Energy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LCEnergy Efficient Optimal Paths Using PDORP-LC
Energy Efficient Optimal Paths Using PDORP-LC
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogce
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
 
The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
SamNola_July2016-s
SamNola_July2016-sSamNola_July2016-s
SamNola_July2016-s
 
A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...A hierarchical tensor based approach to compressing, updating and querying ge...
A hierarchical tensor based approach to compressing, updating and querying ge...
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...
 

More from Tony Fast

The internet killed the lab notebook
The internet killed the lab notebookThe internet killed the lab notebook
The internet killed the lab notebook
Tony Fast
 
Github for Research Science
Github for Research ScienceGithub for Research Science
Github for Research Science
Tony Fast
 
The Materials Data Scientist
The Materials Data ScientistThe Materials Data Scientist
The Materials Data Scientist
Tony Fast
 
An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsTony Fast
 
Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...
Tony Fast
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
Tony Fast
 
Microstructure Informatics
Microstructure InformaticsMicrostructure Informatics
Microstructure Informatics
Tony Fast
 
Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach
Tony Fast
 
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyHigher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Tony Fast
 
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Tony Fast
 

More from Tony Fast (10)

The internet killed the lab notebook
The internet killed the lab notebookThe internet killed the lab notebook
The internet killed the lab notebook
 
Github for Research Science
Github for Research ScienceGithub for Research Science
Github for Research Science
 
The Materials Data Scientist
The Materials Data ScientistThe Materials Data Scientist
The Materials Data Scientist
 
An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial Statistics
 
Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...Spatially resolved pair correlation functions for structure processing taxono...
Spatially resolved pair correlation functions for structure processing taxono...
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
 
Microstructure Informatics
Microstructure InformaticsMicrostructure Informatics
Microstructure Informatics
 
Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach Higher-Order Localization Relationships Using the MKS Approach
Higher-Order Localization Relationships Using the MKS Approach
 
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials TaxonomyHigher-Order Microstructure Statistics for Next Generation Materials Taxonomy
Higher-Order Microstructure Statistics for Next Generation Materials Taxonomy
 
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
Novel and Enhanced Structure-Property-Processing Relationships with Microstru...
 

Recently uploaded

JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 

Recently uploaded (20)

JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 

Data Science Solutions by Materials Scientists: The Early Case Studies

  • 1. Data Science Solutions by Materials Scientists The Early Case Studies Tony Fast Materials Data Analyst Materials Informatics for Engineering Design Woodruff School of Mechanical Engineering Georgia Institute of Technology *Any MINED shield is a link to a resource.
  • 2. An Archival and Self Describing Data Format using HDF5 Data and Metadata stored in one file, Support in many languages, and Ideal support for high-dimensional data *MXADataModel – Archival Data Format – ONR/DARPA Dynamic 3-D Digital Structures Program
  • 3. HDF5 - The little zip file that could… One Dataset – 1.6GB – 4 Experiments –with 160 Datasets each …..no long term value.
  • 4. Volume Variety Velocity = Big Data Materials Science Polymer - MD Titanium Jacobs -GaTech Frasier -OSU Martensitic Steel Gumbusch SiC/SiC Ritchie- LLNL Bamboo Wegst - Dartmouth Al-Cu Solidification Voorhees - NW The velocity that data is generated will rise and the speed that it will be analyzed in will decrease.
  • 5. β-Titanium REDUCED OUTPUT: Grain size Grain Faces Number of Grains Mean Curvature Nearest Grain Analysis 10 micron resolution with 4300 Grains Compare with empirical models Materials Science is a Big Data domain, but it is not treated that way. Rowenhorst, Lewis, Spanos, Acta Mat, 2010
  • 6. Harvard Clean Energy Project Database AFLOW, Curtarolo Group Example Databases
  • 7. STRUCTURE INFORMATICS WORKFLOW INTELLIGENT DESIGN OF EXPERIMENTS PHYSICS BASED MODELS SIMULATION EXPERIMENT MICROSTRUCTURE (MATERIAL) SIGNAL PROCESSING ADVANCED & OBJECTIVE STATISTICAL ENCODING DATA SCIENCE MODULES INNOVATION ACCOUNTING Microstructure Informatics is a scalable, data-driven system to mine structureproperty/processing connections from experimental and simulation materials science information; structure being the independent variable. The system is agnostic to material system and length scale, objectively quantifiable, and rapidly iterates in less cycles for both materials improvement and discovery.
  • 8. DATA SCIENCE MODULES Property Microstructure Material Structure Processing Data science modules are machine learning and statistical tools to extract rich bi-directional structureproperty/processing linkages from encodings of materials & microstructure datasets. Mining modules create structure taxonomies, homogenization and localization relationships, ground truth comparison between simulation and experiment, materials discovery, and materials improvement.
  • 9. ADVANCED & OBJECTIVE STATISTICAL ENCODING THE MICROSTRUCTURE IS A SAMPLE IN AN IMMENSE STATISTICAL POPULATION. α-β Titanium
  • 10. SPATIAL STATISTICS Statistical correlations between random points in space/time which reveal systematic patterns in the microstructure. Contains the original μS within a translation & inversion. An objective encoding for most materials datasets. t h' msh ms+t ft hh' = å Dt s t t
  • 11. The fidelity of the spatial statistics are impacted by how the material structure is parameterized as a signal. CURRENT APPLICATIONS metals, polymers, fuel cells, cmc, md, & a bunch of other things TYPES OF SIGNALS sparse, experimental, simulation, heterogeneous, surface, bulk
  • 12. Objective Microstructure Classification of α-β Titanium Images StatisticsMine with Principal Component Analysis
  • 13. Mechanical Deformation of Polymer Chains Molecular Dynamics of Aluminum Atoms
  • 14. Bottom-up Homogenization Relationships model GDL MPL simulation X-CTFinite Element ModelingStatistics Regression to connect the statistics with diffusivity values from FEM
  • 15. Meta-modeling with Materials Knowledge Systems Top-down localization relationships FEM ε=5e-4 ps = åå a m h t t h s+t h The MKS design filters that capture the effect of the local arrangement of the microstructure on the response. The filters are learned from physics based models and can only be as accurate as the model never better.
  • 16. Meta-modeling with Materials Knowledge Systems Any Model Top-down localization relationships INPUT OUTPUT Control ps = åå a m h t t h s+t h The MKS design filters that capture the effect of the local arrangement of the microstructure on the response. The filters are learned from physics based models and can only be as accurate as the model never better.
  • 17. Top-Down Localization Relationships for High Contrast Composites The MKS is a scalable, parallel meta-model that learns from physics based models to enable rapid simulation at a cost in accuracy. N2 vs. Nlog(N) complexity It learns top-down localization relationships to extra extreme value events and enables multiscale integration. OTHER APPLICATIONS Spinodal Decomposition, Grain Coarsening, Thermo-mechanical, Polycrystalline
  • 18. Objective parametric descriptors and data science enable integration of bi-direction structure-property/processing linkages. Structure-Property Homogenization Structure-Processing MKS Processing History Structure-Property Localization
  • 19. Data enables bidirectional S-P/P, multiscale integration, and higher throughput CORE TECHNOLOGIES TO FUEL THE DATA AGE OF MATERIALS SCIENCE Open Access, Open Source Software, Scalable Databases, High-Statistical Throughput Simulation and Experiment, Image Segmentation, Machine Learning, Scalable Databases, Metadata Integration, Mobile Technology, Visualization, High Performance Computing, Cyberinfrastructure/Collaboratories, Collaboration & Sharing
  • 20. Selected Links Any shield in this presentation is a link HDF5 HDFView MXADataModel Curtarolo Group AFLOW Harvard Clean Energy Project Serial Sectioned Titanium MATIN Materials Genome Initiative http://www.hdfgroup.org/HDF5/whatishdf5.html http://www.hdfgroup.org/hdf-java-html/hdfview/ http://mxa.web.cmu.edu/Background.html http://www.mems.duke.edu/faculty/stefano-curtarolo http://materials.duke.edu/apool.html http://www.molecularspace.org/ https://cosmicweb.mse.iastate.edu/wiki/pages/viewpage.action?pageId=753830 http://www.materials.gatech.edu/matin http://www.whitehouse.gov/mgi