SlideShare a Scribd company logo
1 of 45
Download to read offline
Self-organizing Network for Variable Clustering
and Predictive Analytics
Hui Yang, PhD
杨 徽
Associate Professor
Complex Systems Monitoring, Modeling and Control Lab
The Pennsylvania State University
University Park, PA 16802
November 25, 2017
Outline
1 Introduction
2 Clustering
3 Self-organizing Variable Clustering
4 Case Studies - Theoretical and Practical
5 Conclusions and Future Directions
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Research Roadmap
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 3 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Research Projects
Manufacturing Processes, Precision Machining
Publications: Pattern Recognition, IEEE Transactions, Chaos, IIE Transactions
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 4 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Research Projects
Electro-mechanical Processes, Biomanufacturing
Publications: IEEE Transactions, Physical Review, Physiological Measurements
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 5 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Predictive Analytics - Manufacturing
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 6 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Predictive Analytics - Healthcare
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 7 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Big Data
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 8 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Big Data
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 9 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Challenges - Curse of Dimensionality
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 10 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Challenges - Dimensionality Reduction
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 11 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Challenges - Variable Redundancy
Least square estimate: ˆβ = (X’X)−1
X y and var (β) = σ2 (X’X)−1
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 12 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
State of The Art
Variable Selection - Relevancy
Generalized linear models, shrinkage methods, best-subset selection,
ridge regression, LASSO, least angle regression and elastic net
Relevancy between predictors and response variables, while do not
explicitly consider redundancy among predictors.
Variable Clustering - Redundancy
Redundancy measures - linear correlation or mutual information
Linear correlation nonlinear interdependences among variables
Mutual information characterizes linear and nonlinear correlation, but
requires the stationarity assumption
Latent-variable methods - oblique principal component clustering
(linear projections of variables)
Need to fill the gap
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 13 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Research Objectives
Self-organizing Network for Variable Clustering
Network Theory
Nodes ⇐= Variables
Edge weight ⇐= Redundancy measure
Adjacency matrix ⇐= Redundancy matrix
Network community ⇐= Variable cluster
Self-organizing Variable Clustering
Redundancy measures - Nonlinear coupling analysis
Measure nonlinear interdependence structures among variables
Network embedding
Embed variables as nodes in a complex network
Self-organization
Nonlinear-coupling forces move nodes to derive network topology
Community detection
Variables are clustered as sub-network communities in the space
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 14 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Review of Clustering
Data Clustering vs. Variable Clustering
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 15 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Hierarchical Clustering: Agglomerative vs. Divisive
Dissimilarity measure - symmetrical matrix
Cluster Distance - single linkage, complete linkage, group average
Variable correlation - linear correlation, mutual information
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 16 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Oblique principal component analysis
Principal component analysis
The first two PCs, eigenvectors (or loading matrix in factor analysis)
Oblique rotation
Oblique rotation of eigenvectors, Z, to obtain the B
B = ZΩ
max
Ω
p
i=1


q
j=1
b4
ij −


q
j=1
b2
ij


2

Cluster assignment
Calculate the linear correlation between all variables and rotated
components, and then assign each variable to one of two clusters
based on the higher squared correlation.
Recursive partition
Repeat the binary split for each cluster.
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 17 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Demo - Nonlinear Correlation
Cluster 1 (5 variables: 1∼5)
x1, |x1| , x2
1, x3
1, x4
1
x1 ∼ N(0, 1)
Cluster 2 (5 variables: 6∼10)
x2, |x2| , x2
2, x3
2, x4
2
x2 ∼ N(0, 1)
Cluster 3 (5 variables: 11∼15)
x3, x3(t + 3), x3(t + 5), x3(t + 7), x3(t + 9)
x3(n + 1) = 3.8 × x3(n) × (1 − x3(n)), logistic map
Cluster 4 (5 variables: 16∼20)
x4, x4(t + 10), x4(t + 20), x4(t + 30), x4(t + 40)
x4(n) = 1.905x4(n − 1) − 0.4x4(n − 2) + 0.7ε + 0.3x2
lorenz
x4 is a second-order autoregressive variable that is nonlinearly coupled
with the x component of nonlinear Lorenz system
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 18 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Demo - Results
Hierarchical Clustering and Oblique PCA Clustering - Failed
Nonlinear and asymmetric interdependence structures
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 19 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Research Methodology
Nonlinear Variable - State Space Reconstruction
Takens’ embedding theorem
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 20 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Research Methodology
Nonlinear Interdependence
Cross recurrences of two variables in the state space
ˆIx1x2 =
rm (x2) − dm (x2 | x1)
rm (x2) − dm (x2) m
rm (x2) is the average distance from x2(m) to k randomly chosen
x2(i), rm (x2) = 1
k
k
i=1 (x2(m) − x2(i))2
dm (x2 | x1) is the average conditional distance from x2(m) to k
samples of x2(i) whose indices i ∈ {n1, · · · , nk} are from the
recurrence set Γ (x1(m)) of the variable x1
dm (x2 | x1) =
1
k
i∈Γ(x1(m))
(x2(m) − x2(i))2
dm (x2) is the average distance from x2(m) to k nearest neighbors of
x2(m) in the state space
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 21 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Network Topology
Nonlinear Interdependence Matrix ⇒ Network Topology
From the nonlinear interrelationship of variables, derive the topological
structures for network and identify sub-network communities.
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 22 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Self-Organizing Variable Clustering
Spring-Electrical Model
Nodes − electrically charged particles
Edges − springs between nodes
The repulsive force exists between any pair of nodes
fr(i, j) = −
1
s(i) − s(j) 2
×
1
eα|ˆIx1x2 |
The attractive force exists only between nodes that have a relation of
nonlinear interdependence
fa(i, j) = s(i) − s(j) 2
× eγ|ˆIx1x2 |, ˆIx1x2
= 0
The combined force at a node i: f(i, s, α, γ)
i=j
−
(s(i) − s(j))
s(i) − s(j) 3
×
1
eα|ˆIx1x2 |
+
i↔j
s(i) − s(j) × (s(i) − s(j)) × eγ|ˆIx1x2 |
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 23 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Self-organizing Variable Clustering
Minimal energy network: s∗ = arg mins i f(i, s, α, γ)2
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 24 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Predictive Modeling with Grouped Variables
Gram-Schmidt orthonormalization
For variables x1, · · · , xk in one cluster
v1 = x1, w1 = v1
v1
v2 = x2 − x2, w1 w1, w2 = v2
v2
· · ·
vn = xn − n−1
i=1 xn, wi wi, wn = vn
vn
Group elastic-net model
min
β
−
n
i=1
[yi log (hβ (w, i)) + (1 − yi) log (1 − hβ (w, i))]
hβ (w, i) =
1
1 + exp − β0 + M
m=1
Km
k=1 wk(i)βk
s.t.
M
m=1
Km
k=1
αβ2
mk + (1 − α) |βmk| ≤ λ
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 25 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Simulation Experiments
20 variables + b-spline basis expansion =⇒ 60 variables
Response variable ⇐= a sparse set of predictor variables
hβ (x) =
1
1 + exp − β0 + 20×ps
i=1 xiβi + cε
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 26 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Experimental Results
Table I. Averages and standard deviation of prediction errors in the
experimental study with 100 replications
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 27 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Case Study 1 - Nonlinear Recurrence Network
Poincar´e Recurrence Theorem
Let T be a measure-preserving transformation of a probability space
(X, P) and let A ⊂ X be a measurable set. Then, for any natural
number N ∈ N, the trajectory will eventually reappear at
neighborhood A of former states:
Pr({x ∈ A|{Tn
(x)}n≥N ⊂ XA}) = 0
(a) Stamping Machine, from Dr. Jianjun Shi (b) Biological System
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 28 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Recurrence Plot
Recurrence dynamics of nonlinear and nonstationary systems
R(i, j) = Θ(ε − x(i) − x(j) )
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 29 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Structures in Recurrence Plots
Small-scale structures: single dots, diagonal and vertical lines
Large-scale structures: homogenous, periodic and disrupted patterns
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 30 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Recurrence Networks
K-nearest Neighbor Network [Small, 2008]
Directed network
Each node is connected to k nearest nodes in the network
A fixed number of neighbors
Recurrence Network [Marwan, 2008]
Undirected network
Each node may have a different number of links in the network
A fixed size of the neighborhood
Other Approaches
Transition networks [Nicolis, 2005], cycle networks [Zhang, 2006],
correlation networks [Yang, 2008], Visibility graphs [Lacasa, 2008].
Donner, Marwan et. al proposes recurrence networks for a unifying
framework to transform nonlinear time series into complex networks.
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 31 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
K−Nearest Neighbor Networks [Small, 2008]
Given a time series: X = {x1, x2, . . . , xN }
Sate space reconstruction: xi = xi, xi+τ , . . . , xi+τ(m−1)
A node xi is connected to its k nearest neighbors, but excluding the
nodes in the same strand of the trajectory.
Network representation:
Ai,j =
1, |j − i| > ∆t & j ∈ {k nearest neighbors of i}
0, otherwise
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 32 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Recurrence Networks [Marwan, 2008]
Given a time series: X = {x1, x2, . . . , xN }
Sate space reconstruction: xi = xi, xi+τ , . . . , xi+τ(m−1)
The recurrences are treated as links in the network.
The adjacency matrixA is obtained from the recurrence matrix by
removing the diagonal identities:
Ai,j = Ri,j − Ii,j
Ri,j = Θ(ε − xi − xj )
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 33 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Self-organizing Network
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 34 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
KNN Network vs. Recurrence Network
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 35 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Case Study 2
Model-Driven Predictive Analytics of Space-time Vectorcardiogram Signals
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 36 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
VCG Modeling
Multiscale basis function modeling of 3D VCG signals
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 37 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Model Parameters
Figure: The KS statistics for model-driven parametric features.
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 38 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Variable Clustering
Figure: (a) Nonlinear interdependence matrix; (b) Self-organized clustering of
model-based parametric features.
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 39 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Performance Comparison
Figure: Averages and standard deviation of prediction errors in the real-world case study
that extract a sparse set of model parameters from VCG signals for the identification of
myocardial infarctions.
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 40 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Summary
Challenges
Complex Systems =⇒ Advanced Sensing =⇒ Big Data
Large amounts of variables =⇒ curse of dimensionality and redundancy
Nonlinear and asymmetric interdependence =⇒ predictive analytics
Methodology - Self-organizing Variable Clustering
Nonlinear coupling analysis of variables
Network embedding of variables
Self-organizing derivation of network topology
Network community detection and predictive modeling
Results
Simulation experiments: outperform traditional clustering algorithms
VCG study =⇒ an average sensitivity of 96.80% and an average
specificity of 92.62% in the identification of myocardial infarctions
Broad applications
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 41 / 45
References
G. Liu and H. Yang, “Self-organizing network for group variable selection and predictive
modeling,”Annals of Operations Research, 2017. DOI: 10.1007/s10479-017-2442-2
Y. Chen and H. Yang, “A novel information-theoretic approach for variable clustering and
predictive modeling using Dirichlet process mixtures,”Scientific Reports 6, 38913, 2016.
DOI: www.nature.com/articles/srep38913
H. Yang and G. Liu, “Self-organized topology of recurrence-based complex
network,”Chaos, Vol. 23, No. 4, 043116, 2013. DOI: 10.1063/1.4829877
G. Liu and H. Yang, “Multiscale adaptive basis function modeling of spatiotemporal
cardiac electrical signals,”IEEE Journal of Biomedical and Health Informatics, Vol. 17,
No. 2, p484-492, 2013. DOI: 10.1109/JBHI.2013.2243842
H. Yang, C. Kan, G. Liu and Y. Chen, “Spatiotemporal differentiation of myocardial
infarctions,”IEEE Transactions on Automation Science and Engineering, Vol. 10, No. 4,
p938-947, 2013. DOI: 10.1109/TASE.2013.2263497
G. Liu and H. Yang, “A Self-organizing Method for Predictive Modeling with
Highly-redundant Variables,”Proceedings of the 11th Annual IEEE Conference on
Automation Science and Engineering (CASE), August 24-28, 2015, Gothenberg, Sweden.
DOI: 10.1109/CoASE.2015.7294243
G. Liu and H. Yang, “Model-driven Parametric Monitoring of High-dimensional Nonlinear
Functional Profiles,”Proceedings of the 10th Annual IEEE Conference on Automation
Science and Engineering (CASE), August 18-22, 2014, Taipei, Taiwan. DOI:
10.1109/CoASE.2014.6899408
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Acknowledgements
NSF CAREER Award
NSF CMMI-1454012
NSF IIP-1447289
NSF CMMI-1266331
NSF IOS-1146882
James A. Haley Veterans’ Hospital
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 43 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Contact Information
Hui Yang, PhD
Associate Professor
Complex Systems Monitoring Modeling and Control Laboratory
Harold and Inge Marcus Department of Industrial and Manufacturing
Engineering
The Pennsylvania State University
Tel: (814) 865-7397
Fax: (814) 863-4745
Email: huy25@psu.edu
Web: http://www.personal.psu.edu/huy25/
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 44 / 45
Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions
Questions?
Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 45 / 45

More Related Content

What's hot

Exact Matrix Completion via Convex Optimization Slide (PPT)
Exact Matrix Completion via Convex Optimization Slide (PPT)Exact Matrix Completion via Convex Optimization Slide (PPT)
Exact Matrix Completion via Convex Optimization Slide (PPT)Joonyoung Yi
 
Hybrid Block Method for the Solution of First Order Initial Value Problems of...
Hybrid Block Method for the Solution of First Order Initial Value Problems of...Hybrid Block Method for the Solution of First Order Initial Value Problems of...
Hybrid Block Method for the Solution of First Order Initial Value Problems of...iosrjce
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesDmitrii Ignatov
 
Hyers ulam rassias stability of exponential primitive mapping
Hyers  ulam rassias stability of exponential primitive mappingHyers  ulam rassias stability of exponential primitive mapping
Hyers ulam rassias stability of exponential primitive mappingAlexander Decker
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
Localized methods for diffusions in large graphs
Localized methods for diffusions in large graphsLocalized methods for diffusions in large graphs
Localized methods for diffusions in large graphsDavid Gleich
 
Spacey random walks and higher order Markov chains
Spacey random walks and higher order Markov chainsSpacey random walks and higher order Markov chains
Spacey random walks and higher order Markov chainsDavid Gleich
 
Anti-differentiating Approximation Algorithms: PageRank and MinCut
Anti-differentiating Approximation Algorithms: PageRank and MinCutAnti-differentiating Approximation Algorithms: PageRank and MinCut
Anti-differentiating Approximation Algorithms: PageRank and MinCutDavid Gleich
 
Optimization of positive linear systems via geometric programming
Optimization of positive linear systems via geometric programmingOptimization of positive linear systems via geometric programming
Optimization of positive linear systems via geometric programmingMasaki Ogura
 
Introduction to second gradient theory of elasticity - Arjun Narayanan
Introduction to second gradient theory of elasticity - Arjun NarayananIntroduction to second gradient theory of elasticity - Arjun Narayanan
Introduction to second gradient theory of elasticity - Arjun NarayananArjun Narayanan
 
BIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHM
BIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHMBIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHM
BIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHMijcsa
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsUniversity of Glasgow
 
Engineering Data Science Objectives for Social Network Analysis
Engineering Data Science Objectives for Social Network AnalysisEngineering Data Science Objectives for Social Network Analysis
Engineering Data Science Objectives for Social Network AnalysisDavid Gleich
 
A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)
A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)
A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)IRJET Journal
 
Isoparametric mapping
Isoparametric mappingIsoparametric mapping
Isoparametric mappingLinh Tran
 
Tensor Train decomposition in machine learning
Tensor Train decomposition in machine learningTensor Train decomposition in machine learning
Tensor Train decomposition in machine learningAlexander Novikov
 

What's hot (20)

Exact Matrix Completion via Convex Optimization Slide (PPT)
Exact Matrix Completion via Convex Optimization Slide (PPT)Exact Matrix Completion via Convex Optimization Slide (PPT)
Exact Matrix Completion via Convex Optimization Slide (PPT)
 
Hybrid Block Method for the Solution of First Order Initial Value Problems of...
Hybrid Block Method for the Solution of First Order Initial Value Problems of...Hybrid Block Method for the Solution of First Order Initial Value Problems of...
Hybrid Block Method for the Solution of First Order Initial Value Problems of...
 
WITMSE 2013
WITMSE 2013WITMSE 2013
WITMSE 2013
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequences
 
Hyers ulam rassias stability of exponential primitive mapping
Hyers  ulam rassias stability of exponential primitive mappingHyers  ulam rassias stability of exponential primitive mapping
Hyers ulam rassias stability of exponential primitive mapping
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
Localized methods for diffusions in large graphs
Localized methods for diffusions in large graphsLocalized methods for diffusions in large graphs
Localized methods for diffusions in large graphs
 
Spacey random walks and higher order Markov chains
Spacey random walks and higher order Markov chainsSpacey random walks and higher order Markov chains
Spacey random walks and higher order Markov chains
 
Anti-differentiating Approximation Algorithms: PageRank and MinCut
Anti-differentiating Approximation Algorithms: PageRank and MinCutAnti-differentiating Approximation Algorithms: PageRank and MinCut
Anti-differentiating Approximation Algorithms: PageRank and MinCut
 
Optimization of positive linear systems via geometric programming
Optimization of positive linear systems via geometric programmingOptimization of positive linear systems via geometric programming
Optimization of positive linear systems via geometric programming
 
Polynomial Matrix Decompositions
Polynomial Matrix DecompositionsPolynomial Matrix Decompositions
Polynomial Matrix Decompositions
 
Introduction to second gradient theory of elasticity - Arjun Narayanan
Introduction to second gradient theory of elasticity - Arjun NarayananIntroduction to second gradient theory of elasticity - Arjun Narayanan
Introduction to second gradient theory of elasticity - Arjun Narayanan
 
BIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHM
BIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHMBIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHM
BIN PACKING PROBLEM: A LINEAR CONSTANTSPACE  -APPROXIMATION ALGORITHM
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamics
 
Engineering Data Science Objectives for Social Network Analysis
Engineering Data Science Objectives for Social Network AnalysisEngineering Data Science Objectives for Social Network Analysis
Engineering Data Science Objectives for Social Network Analysis
 
A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)
A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)
A Study on the Root Systems and Dynkin diagrams associated with QHA2(1)
 
Isoparametric mapping
Isoparametric mappingIsoparametric mapping
Isoparametric mapping
 
Estimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample SetsEstimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample Sets
 
PROJECT
PROJECTPROJECT
PROJECT
 
Tensor Train decomposition in machine learning
Tensor Train decomposition in machine learningTensor Train decomposition in machine learning
Tensor Train decomposition in machine learning
 

Similar to Self-organizing Network Clustering Predictive Analytics

Independence_Polynomials
Independence_PolynomialsIndependence_Polynomials
Independence_PolynomialsBailey Ethridge
 
Extreme bound analysis based on correlation coefficient for optimal regressio...
Extreme bound analysis based on correlation coefficient for optimal regressio...Extreme bound analysis based on correlation coefficient for optimal regressio...
Extreme bound analysis based on correlation coefficient for optimal regressio...Loc Nguyen
 
Structured Regularization for conditional Gaussian graphical model
Structured Regularization for conditional Gaussian graphical modelStructured Regularization for conditional Gaussian graphical model
Structured Regularization for conditional Gaussian graphical modelLaboratoire Statistique et génome
 
block-mdp-masters-defense.pdf
block-mdp-masters-defense.pdfblock-mdp-masters-defense.pdf
block-mdp-masters-defense.pdfJunghyun Lee
 
Mining co-expression network
Mining co-expression networkMining co-expression network
Mining co-expression networktuxette
 
Physics-driven Spatiotemporal Regularization for High-dimensional Predictive...
 Physics-driven Spatiotemporal Regularization for High-dimensional Predictive... Physics-driven Spatiotemporal Regularization for High-dimensional Predictive...
Physics-driven Spatiotemporal Regularization for High-dimensional Predictive...Hui Yang
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringKamleshKumar394
 
A Mathematical Programming Approach for Selection of Variables in Cluster Ana...
A Mathematical Programming Approach for Selection of Variables in Cluster Ana...A Mathematical Programming Approach for Selection of Variables in Cluster Ana...
A Mathematical Programming Approach for Selection of Variables in Cluster Ana...IJRES Journal
 
Learning a nonlinear embedding by preserving class neibourhood structure 최종
Learning a nonlinear embedding by preserving class neibourhood structure   최종Learning a nonlinear embedding by preserving class neibourhood structure   최종
Learning a nonlinear embedding by preserving class neibourhood structure 최종WooSung Choi
 
An approximate possibilistic
An approximate possibilisticAn approximate possibilistic
An approximate possibilisticcsandit
 
Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...
Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...
Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...AIST
 
MLHEP 2015: Introductory Lecture #1
MLHEP 2015: Introductory Lecture #1MLHEP 2015: Introductory Lecture #1
MLHEP 2015: Introductory Lecture #1arogozhnikov
 
Basic Concepts of Standard Experimental Designs ( Statistics )
Basic Concepts of Standard Experimental Designs ( Statistics )Basic Concepts of Standard Experimental Designs ( Statistics )
Basic Concepts of Standard Experimental Designs ( Statistics )Hasnat Israq
 
Nonlinear dimension reduction
Nonlinear dimension reductionNonlinear dimension reduction
Nonlinear dimension reductionYan Xu
 

Similar to Self-organizing Network Clustering Predictive Analytics (20)

prototypes-AMALEA.pdf
prototypes-AMALEA.pdfprototypes-AMALEA.pdf
prototypes-AMALEA.pdf
 
Independence_Polynomials
Independence_PolynomialsIndependence_Polynomials
Independence_Polynomials
 
Extreme bound analysis based on correlation coefficient for optimal regressio...
Extreme bound analysis based on correlation coefficient for optimal regressio...Extreme bound analysis based on correlation coefficient for optimal regressio...
Extreme bound analysis based on correlation coefficient for optimal regressio...
 
Structured Regularization for conditional Gaussian graphical model
Structured Regularization for conditional Gaussian graphical modelStructured Regularization for conditional Gaussian graphical model
Structured Regularization for conditional Gaussian graphical model
 
block-mdp-masters-defense.pdf
block-mdp-masters-defense.pdfblock-mdp-masters-defense.pdf
block-mdp-masters-defense.pdf
 
Mining co-expression network
Mining co-expression networkMining co-expression network
Mining co-expression network
 
Physics-driven Spatiotemporal Regularization for High-dimensional Predictive...
 Physics-driven Spatiotemporal Regularization for High-dimensional Predictive... Physics-driven Spatiotemporal Regularization for High-dimensional Predictive...
Physics-driven Spatiotemporal Regularization for High-dimensional Predictive...
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
 
SASA 2016
SASA 2016SASA 2016
SASA 2016
 
A Mathematical Programming Approach for Selection of Variables in Cluster Ana...
A Mathematical Programming Approach for Selection of Variables in Cluster Ana...A Mathematical Programming Approach for Selection of Variables in Cluster Ana...
A Mathematical Programming Approach for Selection of Variables in Cluster Ana...
 
Cs501 cluster analysis
Cs501 cluster analysisCs501 cluster analysis
Cs501 cluster analysis
 
Pattern baysin
Pattern baysinPattern baysin
Pattern baysin
 
An Introduction to Metric Learning for Clustering
An Introduction to Metric Learning for ClusteringAn Introduction to Metric Learning for Clustering
An Introduction to Metric Learning for Clustering
 
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
 
Learning a nonlinear embedding by preserving class neibourhood structure 최종
Learning a nonlinear embedding by preserving class neibourhood structure   최종Learning a nonlinear embedding by preserving class neibourhood structure   최종
Learning a nonlinear embedding by preserving class neibourhood structure 최종
 
An approximate possibilistic
An approximate possibilisticAn approximate possibilistic
An approximate possibilistic
 
Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...
Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...
Andrey V. Savchenko - Sequential Hierarchical Image Recognition based on the ...
 
MLHEP 2015: Introductory Lecture #1
MLHEP 2015: Introductory Lecture #1MLHEP 2015: Introductory Lecture #1
MLHEP 2015: Introductory Lecture #1
 
Basic Concepts of Standard Experimental Designs ( Statistics )
Basic Concepts of Standard Experimental Designs ( Statistics )Basic Concepts of Standard Experimental Designs ( Statistics )
Basic Concepts of Standard Experimental Designs ( Statistics )
 
Nonlinear dimension reduction
Nonlinear dimension reductionNonlinear dimension reduction
Nonlinear dimension reduction
 

Recently uploaded

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 

Recently uploaded (20)

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 

Self-organizing Network Clustering Predictive Analytics

  • 1. Self-organizing Network for Variable Clustering and Predictive Analytics Hui Yang, PhD 杨 徽 Associate Professor Complex Systems Monitoring, Modeling and Control Lab The Pennsylvania State University University Park, PA 16802 November 25, 2017
  • 2. Outline 1 Introduction 2 Clustering 3 Self-organizing Variable Clustering 4 Case Studies - Theoretical and Practical 5 Conclusions and Future Directions
  • 3. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Research Roadmap Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 3 / 45
  • 4. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Research Projects Manufacturing Processes, Precision Machining Publications: Pattern Recognition, IEEE Transactions, Chaos, IIE Transactions Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 4 / 45
  • 5. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Research Projects Electro-mechanical Processes, Biomanufacturing Publications: IEEE Transactions, Physical Review, Physiological Measurements Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 5 / 45
  • 6. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Predictive Analytics - Manufacturing Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 6 / 45
  • 7. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Predictive Analytics - Healthcare Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 7 / 45
  • 8. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Big Data Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 8 / 45
  • 9. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Big Data Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 9 / 45
  • 10. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Challenges - Curse of Dimensionality Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 10 / 45
  • 11. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Challenges - Dimensionality Reduction Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 11 / 45
  • 12. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Challenges - Variable Redundancy Least square estimate: ˆβ = (X’X)−1 X y and var (β) = σ2 (X’X)−1 Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 12 / 45
  • 13. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions State of The Art Variable Selection - Relevancy Generalized linear models, shrinkage methods, best-subset selection, ridge regression, LASSO, least angle regression and elastic net Relevancy between predictors and response variables, while do not explicitly consider redundancy among predictors. Variable Clustering - Redundancy Redundancy measures - linear correlation or mutual information Linear correlation nonlinear interdependences among variables Mutual information characterizes linear and nonlinear correlation, but requires the stationarity assumption Latent-variable methods - oblique principal component clustering (linear projections of variables) Need to fill the gap Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 13 / 45
  • 14. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Research Objectives Self-organizing Network for Variable Clustering Network Theory Nodes ⇐= Variables Edge weight ⇐= Redundancy measure Adjacency matrix ⇐= Redundancy matrix Network community ⇐= Variable cluster Self-organizing Variable Clustering Redundancy measures - Nonlinear coupling analysis Measure nonlinear interdependence structures among variables Network embedding Embed variables as nodes in a complex network Self-organization Nonlinear-coupling forces move nodes to derive network topology Community detection Variables are clustered as sub-network communities in the space Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 14 / 45
  • 15. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Review of Clustering Data Clustering vs. Variable Clustering Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 15 / 45
  • 16. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Hierarchical Clustering: Agglomerative vs. Divisive Dissimilarity measure - symmetrical matrix Cluster Distance - single linkage, complete linkage, group average Variable correlation - linear correlation, mutual information Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 16 / 45
  • 17. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Oblique principal component analysis Principal component analysis The first two PCs, eigenvectors (or loading matrix in factor analysis) Oblique rotation Oblique rotation of eigenvectors, Z, to obtain the B B = ZΩ max Ω p i=1   q j=1 b4 ij −   q j=1 b2 ij   2  Cluster assignment Calculate the linear correlation between all variables and rotated components, and then assign each variable to one of two clusters based on the higher squared correlation. Recursive partition Repeat the binary split for each cluster. Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 17 / 45
  • 18. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Demo - Nonlinear Correlation Cluster 1 (5 variables: 1∼5) x1, |x1| , x2 1, x3 1, x4 1 x1 ∼ N(0, 1) Cluster 2 (5 variables: 6∼10) x2, |x2| , x2 2, x3 2, x4 2 x2 ∼ N(0, 1) Cluster 3 (5 variables: 11∼15) x3, x3(t + 3), x3(t + 5), x3(t + 7), x3(t + 9) x3(n + 1) = 3.8 × x3(n) × (1 − x3(n)), logistic map Cluster 4 (5 variables: 16∼20) x4, x4(t + 10), x4(t + 20), x4(t + 30), x4(t + 40) x4(n) = 1.905x4(n − 1) − 0.4x4(n − 2) + 0.7ε + 0.3x2 lorenz x4 is a second-order autoregressive variable that is nonlinearly coupled with the x component of nonlinear Lorenz system Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 18 / 45
  • 19. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Demo - Results Hierarchical Clustering and Oblique PCA Clustering - Failed Nonlinear and asymmetric interdependence structures Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 19 / 45
  • 20. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Research Methodology Nonlinear Variable - State Space Reconstruction Takens’ embedding theorem Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 20 / 45
  • 21. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Research Methodology Nonlinear Interdependence Cross recurrences of two variables in the state space ˆIx1x2 = rm (x2) − dm (x2 | x1) rm (x2) − dm (x2) m rm (x2) is the average distance from x2(m) to k randomly chosen x2(i), rm (x2) = 1 k k i=1 (x2(m) − x2(i))2 dm (x2 | x1) is the average conditional distance from x2(m) to k samples of x2(i) whose indices i ∈ {n1, · · · , nk} are from the recurrence set Γ (x1(m)) of the variable x1 dm (x2 | x1) = 1 k i∈Γ(x1(m)) (x2(m) − x2(i))2 dm (x2) is the average distance from x2(m) to k nearest neighbors of x2(m) in the state space Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 21 / 45
  • 22. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Network Topology Nonlinear Interdependence Matrix ⇒ Network Topology From the nonlinear interrelationship of variables, derive the topological structures for network and identify sub-network communities. Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 22 / 45
  • 23. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Self-Organizing Variable Clustering Spring-Electrical Model Nodes − electrically charged particles Edges − springs between nodes The repulsive force exists between any pair of nodes fr(i, j) = − 1 s(i) − s(j) 2 × 1 eα|ˆIx1x2 | The attractive force exists only between nodes that have a relation of nonlinear interdependence fa(i, j) = s(i) − s(j) 2 × eγ|ˆIx1x2 |, ˆIx1x2 = 0 The combined force at a node i: f(i, s, α, γ) i=j − (s(i) − s(j)) s(i) − s(j) 3 × 1 eα|ˆIx1x2 | + i↔j s(i) − s(j) × (s(i) − s(j)) × eγ|ˆIx1x2 | Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 23 / 45
  • 24. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Self-organizing Variable Clustering Minimal energy network: s∗ = arg mins i f(i, s, α, γ)2 Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 24 / 45
  • 25. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Predictive Modeling with Grouped Variables Gram-Schmidt orthonormalization For variables x1, · · · , xk in one cluster v1 = x1, w1 = v1 v1 v2 = x2 − x2, w1 w1, w2 = v2 v2 · · · vn = xn − n−1 i=1 xn, wi wi, wn = vn vn Group elastic-net model min β − n i=1 [yi log (hβ (w, i)) + (1 − yi) log (1 − hβ (w, i))] hβ (w, i) = 1 1 + exp − β0 + M m=1 Km k=1 wk(i)βk s.t. M m=1 Km k=1 αβ2 mk + (1 − α) |βmk| ≤ λ Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 25 / 45
  • 26. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Simulation Experiments 20 variables + b-spline basis expansion =⇒ 60 variables Response variable ⇐= a sparse set of predictor variables hβ (x) = 1 1 + exp − β0 + 20×ps i=1 xiβi + cε Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 26 / 45
  • 27. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Experimental Results Table I. Averages and standard deviation of prediction errors in the experimental study with 100 replications Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 27 / 45
  • 28. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Case Study 1 - Nonlinear Recurrence Network Poincar´e Recurrence Theorem Let T be a measure-preserving transformation of a probability space (X, P) and let A ⊂ X be a measurable set. Then, for any natural number N ∈ N, the trajectory will eventually reappear at neighborhood A of former states: Pr({x ∈ A|{Tn (x)}n≥N ⊂ XA}) = 0 (a) Stamping Machine, from Dr. Jianjun Shi (b) Biological System Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 28 / 45
  • 29. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Recurrence Plot Recurrence dynamics of nonlinear and nonstationary systems R(i, j) = Θ(ε − x(i) − x(j) ) Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 29 / 45
  • 30. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Structures in Recurrence Plots Small-scale structures: single dots, diagonal and vertical lines Large-scale structures: homogenous, periodic and disrupted patterns Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 30 / 45
  • 31. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Recurrence Networks K-nearest Neighbor Network [Small, 2008] Directed network Each node is connected to k nearest nodes in the network A fixed number of neighbors Recurrence Network [Marwan, 2008] Undirected network Each node may have a different number of links in the network A fixed size of the neighborhood Other Approaches Transition networks [Nicolis, 2005], cycle networks [Zhang, 2006], correlation networks [Yang, 2008], Visibility graphs [Lacasa, 2008]. Donner, Marwan et. al proposes recurrence networks for a unifying framework to transform nonlinear time series into complex networks. Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 31 / 45
  • 32. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions K−Nearest Neighbor Networks [Small, 2008] Given a time series: X = {x1, x2, . . . , xN } Sate space reconstruction: xi = xi, xi+τ , . . . , xi+τ(m−1) A node xi is connected to its k nearest neighbors, but excluding the nodes in the same strand of the trajectory. Network representation: Ai,j = 1, |j − i| > ∆t & j ∈ {k nearest neighbors of i} 0, otherwise Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 32 / 45
  • 33. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Recurrence Networks [Marwan, 2008] Given a time series: X = {x1, x2, . . . , xN } Sate space reconstruction: xi = xi, xi+τ , . . . , xi+τ(m−1) The recurrences are treated as links in the network. The adjacency matrixA is obtained from the recurrence matrix by removing the diagonal identities: Ai,j = Ri,j − Ii,j Ri,j = Θ(ε − xi − xj ) Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 33 / 45
  • 34. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Self-organizing Network Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 34 / 45
  • 35. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions KNN Network vs. Recurrence Network Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 35 / 45
  • 36. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Case Study 2 Model-Driven Predictive Analytics of Space-time Vectorcardiogram Signals Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 36 / 45
  • 37. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions VCG Modeling Multiscale basis function modeling of 3D VCG signals Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 37 / 45
  • 38. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Model Parameters Figure: The KS statistics for model-driven parametric features. Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 38 / 45
  • 39. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Variable Clustering Figure: (a) Nonlinear interdependence matrix; (b) Self-organized clustering of model-based parametric features. Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 39 / 45
  • 40. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Performance Comparison Figure: Averages and standard deviation of prediction errors in the real-world case study that extract a sparse set of model parameters from VCG signals for the identification of myocardial infarctions. Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 40 / 45
  • 41. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Summary Challenges Complex Systems =⇒ Advanced Sensing =⇒ Big Data Large amounts of variables =⇒ curse of dimensionality and redundancy Nonlinear and asymmetric interdependence =⇒ predictive analytics Methodology - Self-organizing Variable Clustering Nonlinear coupling analysis of variables Network embedding of variables Self-organizing derivation of network topology Network community detection and predictive modeling Results Simulation experiments: outperform traditional clustering algorithms VCG study =⇒ an average sensitivity of 96.80% and an average specificity of 92.62% in the identification of myocardial infarctions Broad applications Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 41 / 45
  • 42. References G. Liu and H. Yang, “Self-organizing network for group variable selection and predictive modeling,”Annals of Operations Research, 2017. DOI: 10.1007/s10479-017-2442-2 Y. Chen and H. Yang, “A novel information-theoretic approach for variable clustering and predictive modeling using Dirichlet process mixtures,”Scientific Reports 6, 38913, 2016. DOI: www.nature.com/articles/srep38913 H. Yang and G. Liu, “Self-organized topology of recurrence-based complex network,”Chaos, Vol. 23, No. 4, 043116, 2013. DOI: 10.1063/1.4829877 G. Liu and H. Yang, “Multiscale adaptive basis function modeling of spatiotemporal cardiac electrical signals,”IEEE Journal of Biomedical and Health Informatics, Vol. 17, No. 2, p484-492, 2013. DOI: 10.1109/JBHI.2013.2243842 H. Yang, C. Kan, G. Liu and Y. Chen, “Spatiotemporal differentiation of myocardial infarctions,”IEEE Transactions on Automation Science and Engineering, Vol. 10, No. 4, p938-947, 2013. DOI: 10.1109/TASE.2013.2263497 G. Liu and H. Yang, “A Self-organizing Method for Predictive Modeling with Highly-redundant Variables,”Proceedings of the 11th Annual IEEE Conference on Automation Science and Engineering (CASE), August 24-28, 2015, Gothenberg, Sweden. DOI: 10.1109/CoASE.2015.7294243 G. Liu and H. Yang, “Model-driven Parametric Monitoring of High-dimensional Nonlinear Functional Profiles,”Proceedings of the 10th Annual IEEE Conference on Automation Science and Engineering (CASE), August 18-22, 2014, Taipei, Taiwan. DOI: 10.1109/CoASE.2014.6899408
  • 43. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Acknowledgements NSF CAREER Award NSF CMMI-1454012 NSF IIP-1447289 NSF CMMI-1266331 NSF IOS-1146882 James A. Haley Veterans’ Hospital Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 43 / 45
  • 44. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Contact Information Hui Yang, PhD Associate Professor Complex Systems Monitoring Modeling and Control Laboratory Harold and Inge Marcus Department of Industrial and Manufacturing Engineering The Pennsylvania State University Tel: (814) 865-7397 Fax: (814) 863-4745 Email: huy25@psu.edu Web: http://www.personal.psu.edu/huy25/ Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 44 / 45
  • 45. Introduction Clustering Self-organizing Variable Clustering Experiments Conclusions Questions? Yang, Hui (Penn State) Self-organizing Analytics November 25, 2017 45 / 45