SlideShare a Scribd company logo
1 of 90
Download to read offline
Novel Unsupervised Machine Learning Methods for Extraction of
Features Characterizing Datasets and Models
Velimir V. Vesselinov (monty) (vvv@lanl.gov)
Earth and Environmental Sciences Division, Los Alamos National Laboratory, NM, USA
LA-UR-18-30987
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why unsupervised Machine Learning (ML)?
Supervised ML: requires prior categorization (knowledge) of the processed data
Example: Recognize images of cats and dogs after extensive training; but cannot
recognize horses if not trained
Cannot find something that we do not already know
Unsupervised ML: extracts hidden features (signals) in the processed data without any
prior information (exploratory analysis for data-driven science)
Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses,
etc.); without prior information or training
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why unsupervised Machine Learning (ML)?
Supervised ML: requires prior categorization (knowledge) of the processed data
Example: Recognize images of cats and dogs after extensive training; but cannot
recognize horses if not trained
Cannot find something that we do not already know
Unsupervised ML: extracts hidden features (signals) in the processed data without any
prior information (exploratory analysis for data-driven science)
Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses,
etc.); without prior information or training
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why unsupervised Machine Learning (ML)?
Supervised ML: requires prior categorization (knowledge) of the processed data
Example: Recognize images of cats and dogs after extensive training; but cannot
recognize horses if not trained
Cannot find something that we do not already know
Unsupervised ML: extracts hidden features (signals) in the processed data without any
prior information (exploratory analysis for data-driven science)
Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses,
etc.); without prior information or training
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why unsupervised Machine Learning (ML)?
Supervised ML: requires prior categorization (knowledge) of the processed data
Example: Recognize images of cats and dogs after extensive training; but cannot
recognize horses if not trained
Cannot find something that we do not already know
Unsupervised ML: extracts hidden features (signals) in the processed data without any
prior information (exploratory analysis for data-driven science)
Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses,
etc.); without prior information or training
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why unsupervised Machine Learning (ML)?
Supervised ML: requires prior categorization (knowledge) of the processed data
Example: Recognize images of cats and dogs after extensive training; but cannot
recognize horses if not trained
Cannot find something that we do not already know
Unsupervised ML: extracts hidden features (signals) in the processed data without any
prior information (exploratory analysis for data-driven science)
Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses,
etc.); without prior information or training
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why not supervised Machine Learning (ML)
Supervised ML
can introduce subjectivity (through the labeling process)
does not provide insights why horses are different than dogs / cats
cannot make predictions
requires huge training (labeled) datasets
is impacted by “adversarial examples”
⇒ major limitations of the supervised methods
for data-analytics and data-driven science applications
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Supervised Machine Learning (xkcd)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Why nonnegativity?
Nonnegativity constraints provide meaningful and interpretable results (+sparsity)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Unsupervised Machine Learning Applications: Exploratory Analysis
Data Analytics: Identify signals (features) in datasets (latent variables)
Feature extraction (FE):
Blind source separation (BSS)
Detection of disruptions / anomalies
Image recognition
Discover unknown dependencies and phenomena
Guide development of physics / reduced-order models representing the data
Model Analytics/Diagnostics: Identify processes (features) in model outputs
Separate processes (inseparable during modeling)
Model reduction (development of reduced-order models)
Identify dependencies between model inputs and outputs
Discover unknown dependencies and phenomena (deterministic/stochastic modeling)
Coupled Data/Model Analytics:
Simultaneous analyses of data and model outputs (data/model fusion)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Nonnegative Matrix/Tensor Factorization
We have developed a series of novel unsupervised Machine Learning (ML) methods and
computational techniques
Our methods are based in matrix/tensor factorization coupled with custom k-means
clustering and nonnegativity/sparsity constraints:
NMFk: Nonnegative Matrix Factorization
NTFk: Nonnegative Tensor Factorization
NMFk/NTFk are capable to efficently process large datasets (GB/TB’s) utilizing GPU’s &
TPU’s (TensorFlow, PyTorch, MXNet)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker tensor factorizations (3D case): Tucker-3
≈
X
[K, M, N]
G
[k, m, n]
H
[k, K]
W
[n, N]
V
[m, M]
X ≈ G ⊗ H ⊗ W ⊗ V
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker tensor factorizations (3D case): Tucker-2 (three possible alternatives)
≈
X
[K, M, N]
G
[k, M, n]
H
[k, K]
W
[n, N]
X ≈ G ⊗ H ⊗ W
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker tensor factorizations (3D case): Tucker-1 (three possible alternatives)
≈
X
[K, M, N]
G
[k, M, N]
H
[k, K]
X ≈ G ⊗ H
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker tensor factorizations (3D case): Tucker-3
≈
X
[K, M, N]
G
[k, m, n]
H
[k, K]
W
[n, N]
V
[m, M]
X ≈ G ⊗ H ⊗ W ⊗ V
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
≈
X
[K, M, N]
G
[k, m, n]
H
[k, K]
W
[n, N]
V
[m, M]
X ≈ G ⊗ H ⊗ W ⊗ V
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
≈
t
y
x
t
y
x
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
V =


t
t2
tanh(t − 10) + 1


W =




x
x3
ex
sin(x) + 1




H =






y
y4
ln(y)
sin(2y) + 1
cos(y) + 1






(12 elements)
X = G ⊗ H ⊗ W ⊗ V
= xyt + xy4
t + xtln(y)+
xt(sin(2y) + 1) + x3
yt+
xt(cos(y) + 1) + ytex
+
yt(sin(x) + 1)+
xyt2
+ xy(1 + tanh(t − 10))+
t2
ex
(sin(2y) + 1)+
(1 + tanh(t − 10))(sin(x) + 1)
(cos(y) + 1)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
← Truth vs Predictions →
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
← Truth vs Predictions →
V =


t
t2
tanh(t − 10) + 1


W =




x
x3
ex
sin(x) + 1




H =






y
y4
ln(y)
sin(2y) + 1
cos(y) + 1






Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
M = G ⊗ H ⊗ W
M1 = xy + xy4
+ xlog(y + 1)+
x(sin(2y) + 1) + yex
x(cos(y) + 1) + x3
y+
y(sin(x) + 1)
M2 = xy+
ex
(cos(z) + 1)
M3 = xy+
(sin(x) + 1)(cos(y) + 1)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example II
(50 × 50 × 50) tensor
50 columns in x
50 rows in y
50 time frames
‘ones‘ swimming in a sea of ‘zeros‘
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example II
Factorizing all 3 dimensions (50 × 50 × 50) → (6 × 44 × 48)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Tucker Tensor Decomposition: Example II
6 groups of swimmers (x); 44 lanes occupied (y); 48 time frames (first/last empty)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NMFk / NTFk Challenges
Identifying the number of unknown features:
resolved using custom k-means clustering and sparsity constraints on the core tensor
number of features identified based on the reconstruction quality (e.g., Frobenius norm) and
cluster Silhouettes
Solving a non-unique optimization problem:
addressed through multistarts, regularization and nonnegativity constraints
applying diverse optimization techniques (Multiplicative/Alternating Least Squares
algorithms, NLopt, Ipopt, Gurobi, MOSEK, GLPK, Clp, Cbc, ...)
accounting for the physics
Processing Big Data:
GPU’s / TPU’s / Distributed computing
Account for data sparsity and structure
Nonnegative Tensor Trains
Dealing with Noisy Data:
Random noise impacts accuracy but its accountable
Systematic noise is identified as separate signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NMFk / NTFk Performance
4GB Tensor (1000 × 1000 × 1000)
Framework Execution time (seconds)
MATLAB 2634
NumPy 881
MXNet 644
PyTorch 121
TensorFlow 119
Julia 109
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NMFk / NTFk Scalability
108 109 1010 1011
Tensor Size (bytes)
101
102
103
104
time(s)
3D Tensor
perfect scaling
256 CPUs
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NMFk / NTFk Analyses
Field Data:
Characterization of groundwater contaminant sources
US Climate data
Geothermal data
Seismic data
Lab Data:
X-ray Spectroscopy
UV Fluorescence Spectroscopy
Microbial population analyses
Operational Data:
LANSCE: Los Alamos Neutron Accelerator
Hydrocarbon (oil/gas) production
Model Data:
Reactive mixing A + B → C
Phase separation of co-polymers
Molecular Dynamics of proteins
EU Climate modeling (Helmholtz Institute, Germany)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NMFk / NTFk Publications
Stanev, Vesselinov, Kusne, Antoszewski, Takeuchi, Alexandrov, Unsupervised Phase
Mapping of X-ray Diffraction Data by Nonnegative Matrix Factorization Integrated with
Custom Clustering, Nature Computational Materials, 2018.
Vesselinov, Munuduru, Karra, O’Maley, Alexandrov, Unsupervised Machine Learning
Based on Non-Negative Tensor Factorization for Analyzing Reactive-Mixing, Journal of
Computational Physics, (in review), 2018.
Vesselinov, O’Malley, Alexandrov, Nonnegative Tensor Factorization for Contaminant
Source Identification, Journal of Contaminant Hydrology, (n review), 2018.
O’Malley, Vesselinov, Alexandrov, Alexandrov, Nonnegative/binary matrix factorization
with a D-Wave quantum annealer, PLOS ONE, (accepted), 2018.
Vesselinov, O’Malley, Alexandrov, Contaminant source identification using
semi-supervised machine learning, Journal of Contaminant Hydrology,
10.1016/j.jconhyd.2017.11.002, 2017.
Alexandrov, Vesselinov, Blind source separation for groundwater level analysis based on
nonnegative matrix factorization, WRR, 10.1002/2013WR015037, 2014.
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Reactive mixing
A BBarrier
A + B → C
> 2000 simulations of C concentrations in time/space with varying
model inputs representing reactive mixing (5 input model parameters)
NTFk identifies physics processes impacting C concentrations and
their relationship to model inputs
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Reactive mixing: Flow fields
κfL = 1 κfL = 2 κfL = 3 κfL = 4 κfL = 5
v0 = 1:0
v0 = 10−1
v0 = 10−2
v0 = 10−3
v0 = 10−4
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Reactive mixing: NTFk results
NTFk extracts the dominant time/space features (processes /
vortices) and compresses the model outputs
Compression: > 200GB → ∼ 70MB (ratio ∼ 3000
Here, (1000 × 81 × 81) → (3 × 12 × 13) (time × rows × columns)
Advection Dispersion Diffusion
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Reactive mixing: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Reactive mixing: NTFk results (κf L impacts)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: air temperature
fluctuations in the air temperature [◦
C]
(424 × 412 × 365)
(columns × rows × days)
NTFk extracts spatial and temporal
footprints of dominant features:
storm signal
winter seasonal signal
summer seasonal signal
....
Data compression: ∼ 4GB → ∼ 0.5MB
Compression ratio: ∼ 8000
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: air temperature fluctuations represented by 3 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: air temperature fluctuations represented by 3 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: water-table depth
fluctuations in the water-table depth [m]
(424 × 412 × 365)
(columns × rows × days)
NTFk extracts spatial and temporal
footprints of dominant signals
spring snowmelt signal
summer rainfall signal
seasonal signal
....
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: water-table fluctuations represented by 3 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: water-table fluctuations represented by 3 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 2 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 3 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 4 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 5 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 6 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 7 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 8 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 9 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: Water-table fluctuations represented by 10 signals
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Climate model of Europe: analyze all model outputs (>40) simultaneously
Find interconnections among model outputs
Evaluate impacts of different model setups
Find dominant processes impacting model predictions
(e.g., climate impacts on groundwater resources, impacts of subsurface processes on
atmospheric conditions)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
US Climate data: precipitation
NOAA data set of observed monthly
averaged precipitation with spatial
resolution of 6km (1/16◦
)
fluctuations in monthly averaged
precipitation [inches]
(928 × 614 × 768)
(columns × rows × months)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
US precipitation: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
US precipitation: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
US Climate data: temperature
NOAA data set of observed maximum
monthly temperatures with spatial
resolution of 6km (1/16◦
)
fluctuations in maximum monthly
temperatures [◦
C]
(928 × 614 × 768)
(columns × rows × months)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
US maximum monthly temperatures: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
US maximum monthly temperatures: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
ML for extracting contaminant plumes
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
ML for extracting contaminant plumes
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
ML for extracting contaminant plumes
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Geochemistry: LANL hydrogeochemical dataset
R-44#1
R-28 R-42
(18 × 8 × 12) tensor (wells × species × years)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Geochemistry: Nonnegative Tensor Factorization based on Tucker-1 decomposition
X: data tensor
W: source (groundwater type) matrix
(unknown)
G: mixing tensor (unknown)
M: number of observation points (wells)
N: number of observation times (e.g,
2001, 2002, ..., 2017)
K: number of geochemical species
observed (e.g., Cr6+
, SO2+
4 , NO−
3 , etc.)
k: number of unknown groundwater
types mixed at each well
Constraints:
all tensor/matrix elements ≥ 0
k
i=1
Gi,j,t = 1 ∀j, t
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NTFk analysis estimated 7 groundwater types
Sources Cr Cl−
ClO4
3
H NO3 Ca Mg SO4
(µg/L) (mg/L) (µg/L) (pCi/L) (mg/L) (mg/L) (mg/L) (mg/L)
S1 2970.00 63.00 0.00 0.00 14.00 73.00 25.00 170.00
S5 21.00 51.00 0.00 950.00 2.40 67.00 15.00 50.00
S6 1.50 64.00 0.00 0.00 2.80 51.00 10.00 68.00
S2 0.79 0.35 14.00 0.00 0.50 5.30 1.70 0.60
S4 0.50 0.14 0.00 0.00 10.00 21.00 5.00 10.00
S3 (B) 0.25 3.60 0.00 0.00 0.01 41.00 11.00 0.06
S7 (B) 0.10 0.03 0.00 0.00 0.01 0.40 0.80 0.90
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NTFk estimated concentrations at various wells
R-28 R-42 R-44#1 R-45#1 R-11 R-1 R-50#1
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NTFk estimated time-dependent mixing of 7 groundwater types at various wells
R-28
R-42
R-44#1
R-45#1
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NTFk identified sources (groundwater types) Jan-Dec 2016
Source 7: (background)
Source 1: Cr, Cl
−
, NO3, Ca, Mg, and SO4
Source 3: Cl
−
, Ca, Mg (background)
Source 2: ClO4
Source 6: Cl
−
, Ca, Mg, and SO4
Source 4: NO3
Source 5:
3
H, Cr, Cl
−
, Ca, Mg, and SO4
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
NTFk analysis of LANL hydrogeochemical datasets
(18 × 8 × 12) tensor
(wells × species × years)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Geysers Geothermal Field
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Geysers Geothermal Field Seismic Events
470,263 seismic events have been
identified between 2003 and 2016
spatial and temporal properties converted
to a tensor
(50 × 34 × 4818)
(columns × rows × days)
NTFk extracts spatial and temporal
footprints of dominant features:
EGS injection starts on November 6th,
2011
....
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Seismic Events: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Seismic Events: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Seismic Events: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Seismic Events: NTFk results
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Quantum Machine Learning using D-Wave
Performed unsupervised feature extraction using
Non-negative Binary Matrix Factorization (NBMF)
Coupling quantum (D-Wave) and classical computing
Analyzed the faces from the original NMF paper published in
Nature (Lee & Seung, 1999)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Polymer-chain folding
polymer transitions between different
states
NTFk extracts phase-transition stages
(201 × 64 × 64 × 3) → (5 × 12 × 12 × 1)
(state × rows × columns × phases)
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
« Play/Pause
Summary
Developed novel unsupervised ML methods and
computational tools based on Nonnegative
Factorization (Matrices/Tensors)
Our ML methods have been used to solve various
real-world problems (brought breakthrough
discoveries related to human cancer research)
Our ML work already contributes to program
development at LANL in mission-critical areas
Our goal is to extend the ML methods and tools to
solve big (> terabyte-scale) high-dimensional (> 3)
data
Modeldata
Sensor data Experimentaldata
Robust Unsupervised
Machine Learning
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Machine Learning (ML) Algorithms / Codes developed by our team
NMFk + ShiftNMFk + GreenNMFk
NTFk
NBMF: Quantum machine learning using D-Wave
MADS: Model-Analyses & Decision Support
open-source, version-controlled, high-performance computational framework
http://mads.lanl.gov http://madsjulia.github.io/Mads.jl
Blind Source Separation examples:
http://madsjulia.github.io/Mads.jl/Examples/blind_source_separation
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
Fluorescence Spectroscopy
5 cuvettes (samples) with unknown mixtures of 3 unknown
compounds
Emissions and excitations measured (5 × 250 × 200 tensor)
NTFk identifies the mixing ratios of the compounds and their
characteristic spectra
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: Problem
Numerous independent variables ... knobs controlling the accelerator performance
15 provided / analyzed
Numerous dependent variables ... observations representing the accelerator
performance
8 provided / analyzed
312,326 * 23 = 7,183,498 ≈ 100 MB (per month)
Full monthly dataset ≈ 2 GB; For 30 years ≈ 1 TB;
Goal #1: ML a reduced-order model to simulate the performance (physics model does
not exist and cannot be built)
Goal #2: Use ML to control the knobs to optimize the performance
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: Independent variables = 15
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: Dependent variables = 8
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: NMFk estimated signal #1 based on the dependent variables
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: NMFk estimated signals #2 based on the dependent variables
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: NMFk Estimated signal #3 based on the dependent variables
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: NMFk reconstruction of one of the dependent variables
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: NMFk estimated signal #3 vs one of the independent variables
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
LANSCE: Reduced-order (SVR) Model
Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE

More Related Content

Similar to Novel Machine Learning Methods for Extraction of Features Characterizing Complex Datasets and Models

Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...
Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...
Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...
Eric Williams
 
CS221: HMM and Particle Filters
CS221: HMM and Particle FiltersCS221: HMM and Particle Filters
CS221: HMM and Particle Filters
zukun
 
Bp219 04-13-2011
Bp219 04-13-2011Bp219 04-13-2011
Bp219 04-13-2011
waddling
 
Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning
Shenghui Wang
 
(Talk in Powerpoint Format)
(Talk in Powerpoint Format)(Talk in Powerpoint Format)
(Talk in Powerpoint Format)
butest
 
Isat review
Isat reviewIsat review
Isat review
mcopher
 

Similar to Novel Machine Learning Methods for Extraction of Features Characterizing Complex Datasets and Models (20)

ppt
pptppt
ppt
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
 
ch10-graphs2.pptx
ch10-graphs2.pptxch10-graphs2.pptx
ch10-graphs2.pptx
 
prototypes-AMALEA.pdf
prototypes-AMALEA.pdfprototypes-AMALEA.pdf
prototypes-AMALEA.pdf
 
Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...
Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...
Search for Excited Randall-Sundrum Gravitons from Warped Extra Dimensions wit...
 
CS221: HMM and Particle Filters
CS221: HMM and Particle FiltersCS221: HMM and Particle Filters
CS221: HMM and Particle Filters
 
Materials Modelling: From theory to solar cells (Lecture 1)
Materials Modelling: From theory to solar cells  (Lecture 1)Materials Modelling: From theory to solar cells  (Lecture 1)
Materials Modelling: From theory to solar cells (Lecture 1)
 
Imaging the Unseen: Taking the First Picture of a Black Hole
Imaging the Unseen: Taking the First Picture of a Black HoleImaging the Unseen: Taking the First Picture of a Black Hole
Imaging the Unseen: Taking the First Picture of a Black Hole
 
New Insights and Applications of Eco-Finance Networks and Collaborative Games
New Insights and Applications of Eco-Finance Networks and Collaborative GamesNew Insights and Applications of Eco-Finance Networks and Collaborative Games
New Insights and Applications of Eco-Finance Networks and Collaborative Games
 
DIMENSIONAL ANALYSIS (Lecture notes 08)
DIMENSIONAL ANALYSIS (Lecture notes 08)DIMENSIONAL ANALYSIS (Lecture notes 08)
DIMENSIONAL ANALYSIS (Lecture notes 08)
 
Bp219 04-13-2011
Bp219 04-13-2011Bp219 04-13-2011
Bp219 04-13-2011
 
Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning
 
CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...
CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...
CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...
 
(Talk in Powerpoint Format)
(Talk in Powerpoint Format)(Talk in Powerpoint Format)
(Talk in Powerpoint Format)
 
JAISTサマースクール2016「脳を知るための理論」講義01 Single neuron models
JAISTサマースクール2016「脳を知るための理論」講義01 Single neuron modelsJAISTサマースクール2016「脳を知るための理論」講義01 Single neuron models
JAISTサマースクール2016「脳を知るための理論」講義01 Single neuron models
 
Isat review
Isat reviewIsat review
Isat review
 
Detector Simulation for HEP
Detector Simulation for HEPDetector Simulation for HEP
Detector Simulation for HEP
 
ChemE_2200_lecture_T1.ppt weryuiutewryyuuu
ChemE_2200_lecture_T1.ppt weryuiutewryyuuuChemE_2200_lecture_T1.ppt weryuiutewryyuuu
ChemE_2200_lecture_T1.ppt weryuiutewryyuuu
 
Data Science Education: Needs & Opportunities in Astronomy
Data Science Education: Needs & Opportunities in AstronomyData Science Education: Needs & Opportunities in Astronomy
Data Science Education: Needs & Opportunities in Astronomy
 

More from Velimir (monty) Vesselinov

More from Velimir (monty) Vesselinov (13)

Data and Model-Driven Decision Support for Environmental Management of a Chro...
Data and Model-Driven Decision Support for Environmental Management of a Chro...Data and Model-Driven Decision Support for Environmental Management of a Chro...
Data and Model-Driven Decision Support for Environmental Management of a Chro...
 
Environmental Management Modeling Activities at Los Alamos National Laborator...
Environmental Management Modeling Activities at Los Alamos National Laborator...Environmental Management Modeling Activities at Los Alamos National Laborator...
Environmental Management Modeling Activities at Los Alamos National Laborator...
 
GNI: Coupling Model Analysis Tools and High-Performance Subsurface Flow and T...
GNI: Coupling Model Analysis Tools and High-Performance Subsurface Flow and T...GNI: Coupling Model Analysis Tools and High-Performance Subsurface Flow and T...
GNI: Coupling Model Analysis Tools and High-Performance Subsurface Flow and T...
 
Tomographic inverse estimation of aquifer properties based on pressure varia...
Tomographic inverse estimation of aquifer properties based on  pressure varia...Tomographic inverse estimation of aquifer properties based on  pressure varia...
Tomographic inverse estimation of aquifer properties based on pressure varia...
 
Uncertainties in Transient Capture-Zone Estimates
Uncertainties in Transient Capture-Zone EstimatesUncertainties in Transient Capture-Zone Estimates
Uncertainties in Transient Capture-Zone Estimates
 
Model-driven decision support for monitoring network design based on analysis...
Model-driven decision support for monitoring network design based on analysis...Model-driven decision support for monitoring network design based on analysis...
Model-driven decision support for monitoring network design based on analysis...
 
Decision Analyses Related to a Chromium Plume at Los Alamos National Laboratory
Decision Analyses Related to a Chromium Plume at Los Alamos National LaboratoryDecision Analyses Related to a Chromium Plume at Los Alamos National Laboratory
Decision Analyses Related to a Chromium Plume at Los Alamos National Laboratory
 
Model-free Source Identification
Model-free Source IdentificationModel-free Source Identification
Model-free Source Identification
 
Decision Support for Environmental Management of a Chromium Plume at Los Alam...
Decision Support for Environmental Management of a Chromium Plume at Los Alam...Decision Support for Environmental Management of a Chromium Plume at Los Alam...
Decision Support for Environmental Management of a Chromium Plume at Los Alam...
 
Model Analyses of Complex Systems Behavior using MADS,
Model Analyses of Complex Systems Behavior using MADS,Model Analyses of Complex Systems Behavior using MADS,
Model Analyses of Complex Systems Behavior using MADS,
 
Reduced Order Models for Decision Analysis and Upscaling of Aquifer Heterogen...
Reduced Order Models for Decision Analysis and Upscaling of Aquifer Heterogen...Reduced Order Models for Decision Analysis and Upscaling of Aquifer Heterogen...
Reduced Order Models for Decision Analysis and Upscaling of Aquifer Heterogen...
 
ZEM: Integrated Framework for Real-Time Data and Model Analyses for Robust En...
ZEM: Integrated Framework for Real-Time Data and Model Analyses for Robust En...ZEM: Integrated Framework for Real-Time Data and Model Analyses for Robust En...
ZEM: Integrated Framework for Real-Time Data and Model Analyses for Robust En...
 
Decision Analyses for Groundwater Remediation
Decision Analyses for Groundwater RemediationDecision Analyses for Groundwater Remediation
Decision Analyses for Groundwater Remediation
 

Recently uploaded

Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 

Recently uploaded (20)

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 

Novel Machine Learning Methods for Extraction of Features Characterizing Complex Datasets and Models

  • 1. Novel Unsupervised Machine Learning Methods for Extraction of Features Characterizing Datasets and Models Velimir V. Vesselinov (monty) (vvv@lanl.gov) Earth and Environmental Sciences Division, Los Alamos National Laboratory, NM, USA LA-UR-18-30987 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 2. Why unsupervised Machine Learning (ML)? Supervised ML: requires prior categorization (knowledge) of the processed data Example: Recognize images of cats and dogs after extensive training; but cannot recognize horses if not trained Cannot find something that we do not already know Unsupervised ML: extracts hidden features (signals) in the processed data without any prior information (exploratory analysis for data-driven science) Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses, etc.); without prior information or training Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 3. Why unsupervised Machine Learning (ML)? Supervised ML: requires prior categorization (knowledge) of the processed data Example: Recognize images of cats and dogs after extensive training; but cannot recognize horses if not trained Cannot find something that we do not already know Unsupervised ML: extracts hidden features (signals) in the processed data without any prior information (exploratory analysis for data-driven science) Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses, etc.); without prior information or training Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 4. Why unsupervised Machine Learning (ML)? Supervised ML: requires prior categorization (knowledge) of the processed data Example: Recognize images of cats and dogs after extensive training; but cannot recognize horses if not trained Cannot find something that we do not already know Unsupervised ML: extracts hidden features (signals) in the processed data without any prior information (exploratory analysis for data-driven science) Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses, etc.); without prior information or training Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 5. Why unsupervised Machine Learning (ML)? Supervised ML: requires prior categorization (knowledge) of the processed data Example: Recognize images of cats and dogs after extensive training; but cannot recognize horses if not trained Cannot find something that we do not already know Unsupervised ML: extracts hidden features (signals) in the processed data without any prior information (exploratory analysis for data-driven science) Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses, etc.); without prior information or training Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 6. Why unsupervised Machine Learning (ML)? Supervised ML: requires prior categorization (knowledge) of the processed data Example: Recognize images of cats and dogs after extensive training; but cannot recognize horses if not trained Cannot find something that we do not already know Unsupervised ML: extracts hidden features (signals) in the processed data without any prior information (exploratory analysis for data-driven science) Example: Identify features that distinguish images of animals (e.g., cats, dogs, horses, etc.); without prior information or training Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 7. Why not supervised Machine Learning (ML) Supervised ML can introduce subjectivity (through the labeling process) does not provide insights why horses are different than dogs / cats cannot make predictions requires huge training (labeled) datasets is impacted by “adversarial examples” ⇒ major limitations of the supervised methods for data-analytics and data-driven science applications Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 8. Supervised Machine Learning (xkcd) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 9. Why nonnegativity? Nonnegativity constraints provide meaningful and interpretable results (+sparsity) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 10. Unsupervised Machine Learning Applications: Exploratory Analysis Data Analytics: Identify signals (features) in datasets (latent variables) Feature extraction (FE): Blind source separation (BSS) Detection of disruptions / anomalies Image recognition Discover unknown dependencies and phenomena Guide development of physics / reduced-order models representing the data Model Analytics/Diagnostics: Identify processes (features) in model outputs Separate processes (inseparable during modeling) Model reduction (development of reduced-order models) Identify dependencies between model inputs and outputs Discover unknown dependencies and phenomena (deterministic/stochastic modeling) Coupled Data/Model Analytics: Simultaneous analyses of data and model outputs (data/model fusion) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 11. Nonnegative Matrix/Tensor Factorization We have developed a series of novel unsupervised Machine Learning (ML) methods and computational techniques Our methods are based in matrix/tensor factorization coupled with custom k-means clustering and nonnegativity/sparsity constraints: NMFk: Nonnegative Matrix Factorization NTFk: Nonnegative Tensor Factorization NMFk/NTFk are capable to efficently process large datasets (GB/TB’s) utilizing GPU’s & TPU’s (TensorFlow, PyTorch, MXNet) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 12. Tucker tensor factorizations (3D case): Tucker-3 ≈ X [K, M, N] G [k, m, n] H [k, K] W [n, N] V [m, M] X ≈ G ⊗ H ⊗ W ⊗ V Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 13. Tucker tensor factorizations (3D case): Tucker-2 (three possible alternatives) ≈ X [K, M, N] G [k, M, n] H [k, K] W [n, N] X ≈ G ⊗ H ⊗ W Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 14. Tucker tensor factorizations (3D case): Tucker-1 (three possible alternatives) ≈ X [K, M, N] G [k, M, N] H [k, K] X ≈ G ⊗ H Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 15. Tucker tensor factorizations (3D case): Tucker-3 ≈ X [K, M, N] G [k, m, n] H [k, K] W [n, N] V [m, M] X ≈ G ⊗ H ⊗ W ⊗ V Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 16. Tucker Tensor Decomposition: Example ≈ X [K, M, N] G [k, m, n] H [k, K] W [n, N] V [m, M] X ≈ G ⊗ H ⊗ W ⊗ V Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 17. Tucker Tensor Decomposition: Example ≈ t y x t y x Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 18. Tucker Tensor Decomposition: Example V =   t t2 tanh(t − 10) + 1   W =     x x3 ex sin(x) + 1     H =       y y4 ln(y) sin(2y) + 1 cos(y) + 1       (12 elements) X = G ⊗ H ⊗ W ⊗ V = xyt + xy4 t + xtln(y)+ xt(sin(2y) + 1) + x3 yt+ xt(cos(y) + 1) + ytex + yt(sin(x) + 1)+ xyt2 + xy(1 + tanh(t − 10))+ t2 ex (sin(2y) + 1)+ (1 + tanh(t − 10))(sin(x) + 1) (cos(y) + 1) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 19. Tucker Tensor Decomposition: Example Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 20. Tucker Tensor Decomposition: Example ← Truth vs Predictions → Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 21. Tucker Tensor Decomposition: Example ← Truth vs Predictions → V =   t t2 tanh(t − 10) + 1   W =     x x3 ex sin(x) + 1     H =       y y4 ln(y) sin(2y) + 1 cos(y) + 1       Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 22. Tucker Tensor Decomposition: Example Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 23. Tucker Tensor Decomposition: Example M = G ⊗ H ⊗ W M1 = xy + xy4 + xlog(y + 1)+ x(sin(2y) + 1) + yex x(cos(y) + 1) + x3 y+ y(sin(x) + 1) M2 = xy+ ex (cos(z) + 1) M3 = xy+ (sin(x) + 1)(cos(y) + 1) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 24. Tucker Tensor Decomposition: Example Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 25. Tucker Tensor Decomposition: Example II (50 × 50 × 50) tensor 50 columns in x 50 rows in y 50 time frames ‘ones‘ swimming in a sea of ‘zeros‘ Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 26. Tucker Tensor Decomposition: Example II Factorizing all 3 dimensions (50 × 50 × 50) → (6 × 44 × 48) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 27. Tucker Tensor Decomposition: Example II 6 groups of swimmers (x); 44 lanes occupied (y); 48 time frames (first/last empty) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 28. NMFk / NTFk Challenges Identifying the number of unknown features: resolved using custom k-means clustering and sparsity constraints on the core tensor number of features identified based on the reconstruction quality (e.g., Frobenius norm) and cluster Silhouettes Solving a non-unique optimization problem: addressed through multistarts, regularization and nonnegativity constraints applying diverse optimization techniques (Multiplicative/Alternating Least Squares algorithms, NLopt, Ipopt, Gurobi, MOSEK, GLPK, Clp, Cbc, ...) accounting for the physics Processing Big Data: GPU’s / TPU’s / Distributed computing Account for data sparsity and structure Nonnegative Tensor Trains Dealing with Noisy Data: Random noise impacts accuracy but its accountable Systematic noise is identified as separate signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 29. NMFk / NTFk Performance 4GB Tensor (1000 × 1000 × 1000) Framework Execution time (seconds) MATLAB 2634 NumPy 881 MXNet 644 PyTorch 121 TensorFlow 119 Julia 109 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 30. NMFk / NTFk Scalability 108 109 1010 1011 Tensor Size (bytes) 101 102 103 104 time(s) 3D Tensor perfect scaling 256 CPUs Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 31. NMFk / NTFk Analyses Field Data: Characterization of groundwater contaminant sources US Climate data Geothermal data Seismic data Lab Data: X-ray Spectroscopy UV Fluorescence Spectroscopy Microbial population analyses Operational Data: LANSCE: Los Alamos Neutron Accelerator Hydrocarbon (oil/gas) production Model Data: Reactive mixing A + B → C Phase separation of co-polymers Molecular Dynamics of proteins EU Climate modeling (Helmholtz Institute, Germany) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 32. NMFk / NTFk Publications Stanev, Vesselinov, Kusne, Antoszewski, Takeuchi, Alexandrov, Unsupervised Phase Mapping of X-ray Diffraction Data by Nonnegative Matrix Factorization Integrated with Custom Clustering, Nature Computational Materials, 2018. Vesselinov, Munuduru, Karra, O’Maley, Alexandrov, Unsupervised Machine Learning Based on Non-Negative Tensor Factorization for Analyzing Reactive-Mixing, Journal of Computational Physics, (in review), 2018. Vesselinov, O’Malley, Alexandrov, Nonnegative Tensor Factorization for Contaminant Source Identification, Journal of Contaminant Hydrology, (n review), 2018. O’Malley, Vesselinov, Alexandrov, Alexandrov, Nonnegative/binary matrix factorization with a D-Wave quantum annealer, PLOS ONE, (accepted), 2018. Vesselinov, O’Malley, Alexandrov, Contaminant source identification using semi-supervised machine learning, Journal of Contaminant Hydrology, 10.1016/j.jconhyd.2017.11.002, 2017. Alexandrov, Vesselinov, Blind source separation for groundwater level analysis based on nonnegative matrix factorization, WRR, 10.1002/2013WR015037, 2014. Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 33. Reactive mixing A BBarrier A + B → C > 2000 simulations of C concentrations in time/space with varying model inputs representing reactive mixing (5 input model parameters) NTFk identifies physics processes impacting C concentrations and their relationship to model inputs Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 34. Reactive mixing: Flow fields κfL = 1 κfL = 2 κfL = 3 κfL = 4 κfL = 5 v0 = 1:0 v0 = 10−1 v0 = 10−2 v0 = 10−3 v0 = 10−4 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 35. Reactive mixing: NTFk results NTFk extracts the dominant time/space features (processes / vortices) and compresses the model outputs Compression: > 200GB → ∼ 70MB (ratio ∼ 3000 Here, (1000 × 81 × 81) → (3 × 12 × 13) (time × rows × columns) Advection Dispersion Diffusion Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 36. Reactive mixing: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 37. Reactive mixing: NTFk results (κf L impacts) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 38. Climate model of Europe: air temperature fluctuations in the air temperature [◦ C] (424 × 412 × 365) (columns × rows × days) NTFk extracts spatial and temporal footprints of dominant features: storm signal winter seasonal signal summer seasonal signal .... Data compression: ∼ 4GB → ∼ 0.5MB Compression ratio: ∼ 8000 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 39. Climate model of Europe: air temperature fluctuations represented by 3 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 40. Climate model of Europe: air temperature fluctuations represented by 3 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 41. Climate model of Europe: water-table depth fluctuations in the water-table depth [m] (424 × 412 × 365) (columns × rows × days) NTFk extracts spatial and temporal footprints of dominant signals spring snowmelt signal summer rainfall signal seasonal signal .... Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 42. Climate model of Europe: water-table fluctuations represented by 3 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 43. Climate model of Europe: water-table fluctuations represented by 3 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 44. Climate model of Europe: Water-table fluctuations represented by 2 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 45. Climate model of Europe: Water-table fluctuations represented by 3 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 46. Climate model of Europe: Water-table fluctuations represented by 4 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 47. Climate model of Europe: Water-table fluctuations represented by 5 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 48. Climate model of Europe: Water-table fluctuations represented by 6 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 49. Climate model of Europe: Water-table fluctuations represented by 7 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 50. Climate model of Europe: Water-table fluctuations represented by 8 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 51. Climate model of Europe: Water-table fluctuations represented by 9 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 52. Climate model of Europe: Water-table fluctuations represented by 10 signals Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 53. Climate model of Europe: analyze all model outputs (>40) simultaneously Find interconnections among model outputs Evaluate impacts of different model setups Find dominant processes impacting model predictions (e.g., climate impacts on groundwater resources, impacts of subsurface processes on atmospheric conditions) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 54. US Climate data: precipitation NOAA data set of observed monthly averaged precipitation with spatial resolution of 6km (1/16◦ ) fluctuations in monthly averaged precipitation [inches] (928 × 614 × 768) (columns × rows × months) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 55. US precipitation: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 56. US precipitation: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 57. US Climate data: temperature NOAA data set of observed maximum monthly temperatures with spatial resolution of 6km (1/16◦ ) fluctuations in maximum monthly temperatures [◦ C] (928 × 614 × 768) (columns × rows × months) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 58. US maximum monthly temperatures: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 59. US maximum monthly temperatures: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 60. ML for extracting contaminant plumes Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 61. ML for extracting contaminant plumes Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 62. ML for extracting contaminant plumes Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 63. Geochemistry: LANL hydrogeochemical dataset R-44#1 R-28 R-42 (18 × 8 × 12) tensor (wells × species × years) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 64. Geochemistry: Nonnegative Tensor Factorization based on Tucker-1 decomposition X: data tensor W: source (groundwater type) matrix (unknown) G: mixing tensor (unknown) M: number of observation points (wells) N: number of observation times (e.g, 2001, 2002, ..., 2017) K: number of geochemical species observed (e.g., Cr6+ , SO2+ 4 , NO− 3 , etc.) k: number of unknown groundwater types mixed at each well Constraints: all tensor/matrix elements ≥ 0 k i=1 Gi,j,t = 1 ∀j, t Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 65. NTFk analysis estimated 7 groundwater types Sources Cr Cl− ClO4 3 H NO3 Ca Mg SO4 (µg/L) (mg/L) (µg/L) (pCi/L) (mg/L) (mg/L) (mg/L) (mg/L) S1 2970.00 63.00 0.00 0.00 14.00 73.00 25.00 170.00 S5 21.00 51.00 0.00 950.00 2.40 67.00 15.00 50.00 S6 1.50 64.00 0.00 0.00 2.80 51.00 10.00 68.00 S2 0.79 0.35 14.00 0.00 0.50 5.30 1.70 0.60 S4 0.50 0.14 0.00 0.00 10.00 21.00 5.00 10.00 S3 (B) 0.25 3.60 0.00 0.00 0.01 41.00 11.00 0.06 S7 (B) 0.10 0.03 0.00 0.00 0.01 0.40 0.80 0.90 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 66. NTFk estimated concentrations at various wells R-28 R-42 R-44#1 R-45#1 R-11 R-1 R-50#1 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 67. NTFk estimated time-dependent mixing of 7 groundwater types at various wells R-28 R-42 R-44#1 R-45#1 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 68. NTFk identified sources (groundwater types) Jan-Dec 2016 Source 7: (background) Source 1: Cr, Cl − , NO3, Ca, Mg, and SO4 Source 3: Cl − , Ca, Mg (background) Source 2: ClO4 Source 6: Cl − , Ca, Mg, and SO4 Source 4: NO3 Source 5: 3 H, Cr, Cl − , Ca, Mg, and SO4 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 69. NTFk analysis of LANL hydrogeochemical datasets (18 × 8 × 12) tensor (wells × species × years) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 70. Geysers Geothermal Field Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 71. Geysers Geothermal Field Seismic Events 470,263 seismic events have been identified between 2003 and 2016 spatial and temporal properties converted to a tensor (50 × 34 × 4818) (columns × rows × days) NTFk extracts spatial and temporal footprints of dominant features: EGS injection starts on November 6th, 2011 .... Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 72. Seismic Events: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 73. Seismic Events: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 74. Seismic Events: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 75. Seismic Events: NTFk results Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 76. Quantum Machine Learning using D-Wave Performed unsupervised feature extraction using Non-negative Binary Matrix Factorization (NBMF) Coupling quantum (D-Wave) and classical computing Analyzed the faces from the original NMF paper published in Nature (Lee & Seung, 1999) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 77. Polymer-chain folding polymer transitions between different states NTFk extracts phase-transition stages (201 × 64 × 64 × 3) → (5 × 12 × 12 × 1) (state × rows × columns × phases) Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE « Play/Pause
  • 78. Summary Developed novel unsupervised ML methods and computational tools based on Nonnegative Factorization (Matrices/Tensors) Our ML methods have been used to solve various real-world problems (brought breakthrough discoveries related to human cancer research) Our ML work already contributes to program development at LANL in mission-critical areas Our goal is to extend the ML methods and tools to solve big (> terabyte-scale) high-dimensional (> 3) data Modeldata Sensor data Experimentaldata Robust Unsupervised Machine Learning Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 79. Machine Learning (ML) Algorithms / Codes developed by our team NMFk + ShiftNMFk + GreenNMFk NTFk NBMF: Quantum machine learning using D-Wave MADS: Model-Analyses & Decision Support open-source, version-controlled, high-performance computational framework http://mads.lanl.gov http://madsjulia.github.io/Mads.jl Blind Source Separation examples: http://madsjulia.github.io/Mads.jl/Examples/blind_source_separation Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 80. Fluorescence Spectroscopy 5 cuvettes (samples) with unknown mixtures of 3 unknown compounds Emissions and excitations measured (5 × 250 × 200 tensor) NTFk identifies the mixing ratios of the compounds and their characteristic spectra Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 81. LANSCE Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 82. LANSCE: Problem Numerous independent variables ... knobs controlling the accelerator performance 15 provided / analyzed Numerous dependent variables ... observations representing the accelerator performance 8 provided / analyzed 312,326 * 23 = 7,183,498 ≈ 100 MB (per month) Full monthly dataset ≈ 2 GB; For 30 years ≈ 1 TB; Goal #1: ML a reduced-order model to simulate the performance (physics model does not exist and cannot be built) Goal #2: Use ML to control the knobs to optimize the performance Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 83. LANSCE: Independent variables = 15 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 84. LANSCE: Dependent variables = 8 Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 85. LANSCE: NMFk estimated signal #1 based on the dependent variables Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 86. LANSCE: NMFk estimated signals #2 based on the dependent variables Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 87. LANSCE: NMFk Estimated signal #3 based on the dependent variables Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 88. LANSCE: NMFk reconstruction of one of the dependent variables Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 89. LANSCE: NMFk estimated signal #3 vs one of the independent variables Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE
  • 90. LANSCE: Reduced-order (SVR) Model Unsup ML Studies Mixing Climate EU Climate US Geochem Geysers Quantum Polymer Summary UV LANSCE