SlideShare a Scribd company logo
1 of 13
Download to read offline
Independent Component
Analysis (ICA)
Method Review
INDEPENDENT COMPONENT ANALYSIS
DATA
Imagine that you are a weaver, and you have a loom of colorful strings. Each string
represents a unique pattern in the data. With actual data, each of these strings would be
a vector of numbers that can be fit with a linear equation. As we see the strings above,
they are well organized.
MIXED UP DATA
INDEPENDENCE
INDEPENDENT COMPONENT ANALYSIS
Unfortunately, when we collect data in the real world it does not come to us neat and
organized. Our unique strings get mixed up with other strings, and random signal such as
noise. In our example above, a monkey has come along and mixed up our strings. How
do we untangle them?
HOW TO UNMIX?
INDEPENDENT COMPONENT ANALYSIS
We could know something special about each string, maybe a feature like color, and manually
unmix, however it we are dealing with a huge dataset and don’t have a clue about any special
features, we are powerless. This is where ICA comes in. We start with our mixed data and
assume 1) we have mixed up data (our loom) that is 2) comprised of independent signals
MIXED STRINGS
(OBSERVED DATA)
INDEPENDENT COMPONENT ANALYSIS
=
MONKEY
MADNESS
(“MIXING MATRIX”)
X = A SX
ORIGINAL
STRINGS
(ORIGINAL DATA)
X
We start with this mixed up data, X, and we know that it was generated by the monkey applying
some sequence of movements to it (the “monkey madness”). We call this series of
transformations that the monkey applies to the unmixed data, s our mixing matrix. This matrix
would consist of vectors of numbers that, when multiplied with s, produce the observed data X.
INDEPENDENT COMPONENT ANALYSIS
S = A-1 XX
To solve this problem and recover our original strings from the mixed ones, we just need to
solve this equation for s. We know X, so we just need to figure out what the inverse of A is. This
is normally referred to as “W” or the un-mixing matrix. We are going to choose the numbers in
this matrix that maximize the probability of our data.
MIXED STRINGS
(OBSERVED DATA)
=
UN-MIXING
MATRIX, W
ORIGINAL
STRINGS
(ORIGINAL DATA)
X
INDEPENDENT COMPONENT ANALYSIS
S = A-1 XX
What is basically done is that we model the CDF of each signal’s probability as the sigmoid
function because it increases from 0 to 1, the derivative of the sigmoid is the density function,
and then we would iteratively maximize that function until convergence to find the weights, this
inverse matrix (details in next slides!)
MIXED STRINGS
(OBSERVED DATA)
=
UN-MIXING
MATRIX, W
ORIGINAL
STRINGS
(ORIGINAL DATA)
X
Independent Component Analysis
How to find the weights with Maximum Likelihood Estimation?
Suppose that the distribution of each source si is given by a density
ps, and that the joint distribution of the sources s is given by:
this implies the following density on x = As = W1s
All that remains is to specify a density (a CDF) for the individual sources ps. It can’t
be Gaussian, how about sigmoid? (increases from 0 to 1)
CS229 Notes, Andrew Ng, 2012
So we model the CDF for each independent signal with sigmoid, so to get the
probability of the signal at any particular time-point we look at the derivative of the CDF
(the PDF):
So if we want to maximize this probability (find our data), we want to make it as big as
possible. The square matrix W is the parameter in our model, so given a training set,
the log likelihood is given by:
And we want to maximize this in terms of W. It’s useful to know that:
And so a “one at a time” (stochastic gradient ascent rule) is:
This is how we would update our weights until convergence.
Independent Component Analysis
How to find the weights with Maximum Likelihood Estimation?
CS229 Notes, Andrew Ng, 2012
FastICA Modification
“ICA with Reference” is a modification of FastICA
CS229 Notes, Andrew Ng, 2012
Negative entropy is used to measure mutual independence in formula:
1st term: Gaussian variable (wTx), 2nd non-quadratic contrast function
||w||2 = 1 used when maximizing J(y) such that:
If we choose, for the 2nd function G’’’(u) = u3, the update becomes:
“Inspired” by this form of the update, we can impose an additional
constraint that incorporates prior information about the components
so it no longer maximizes just independence, but is also close to the
reference, r:
ICA CAVEATS
Permutation of the original sources is ambiguous
But this doesn’t matter for most applications
Data assumed to be non-Gaussian
If the data is Gaussian, there is an arbitrary rotational component in the
mixing matrix that cannot be determined from the data, so we cannot
recover the original sources
No way to recover scaling of the weights
If a single column of matrix A were scaled by a factor of 2 and the
corresponding source were scaled by a factor of ½, then there is again no
way, given only the x(i)’s, to determine this had happened.
Why can’t the data be Gaussian?
“Suppose we observe some x = As, where A is our mixing matrix. The distribution of x
will also be Gaussian, with zero mean and covariance
E[xxT ] = E[AssTAT ] = AAT
Now, let R be an arbitrary orthogonal (less formally, a rotation/reflection) matrix, so
that RRT = RTR = I, and let A’ = AR. Then if the data had been mixed according to A’
instead of A, we would have instead observed x’ = A’s. The distribution of x’ is
also Gaussian, with zero mean and covariance
E[x’(x’)T ] = E[A’ssT (A’)T ] = E[ARssT (AR)T ] = ARRTAT = AAT
Hence, whether the mixing matrix is A or A’, we would observe data from a N(0;AAT )
distribution. Thus, there is no way to tell if the sources were mixed using A and A’. So,
there is an arbitrary rotational component in the mixing matrix that cannot be
determined from the data, and we cannot recover the original sources.”
CS229 Notes, Andrew Ng, 2012
vsochat@stanford.edu

More Related Content

What's hot

Logistic regression in Machine Learning
Logistic regression in Machine LearningLogistic regression in Machine Learning
Logistic regression in Machine LearningKuppusamy P
 
Kalman filter - Applications in Image processing
Kalman filter - Applications in Image processingKalman filter - Applications in Image processing
Kalman filter - Applications in Image processingRavi Teja
 
K mean-clustering algorithm
K mean-clustering algorithmK mean-clustering algorithm
K mean-clustering algorithmparry prabhu
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisJaclyn Kokx
 
Pca(principal components analysis)
Pca(principal components analysis)Pca(principal components analysis)
Pca(principal components analysis)kalung0313
 
5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...
5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...
5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...Ceni Babaoglu, PhD
 
Support vector machines
Support vector machinesSupport vector machines
Support vector machinesUjjawal
 
Ensemble methods in machine learning
Ensemble methods in machine learningEnsemble methods in machine learning
Ensemble methods in machine learningSANTHOSH RAJA M G
 
ML - Simple Linear Regression
ML - Simple Linear RegressionML - Simple Linear Regression
ML - Simple Linear RegressionAndrew Ferlitsch
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear RegressionAndrew Ferlitsch
 
Birch Algorithm With Solved Example
Birch Algorithm With Solved ExampleBirch Algorithm With Solved Example
Birch Algorithm With Solved Examplekailash shaw
 
Eigen values and eigen vectors engineering
Eigen values and eigen vectors engineeringEigen values and eigen vectors engineering
Eigen values and eigen vectors engineeringshubham211
 
Singular Value Decompostion (SVD)
Singular Value Decompostion (SVD)Singular Value Decompostion (SVD)
Singular Value Decompostion (SVD)Isaac Yowetu
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix FactorizationTatsuya Yokota
 
Radial Basis Function Interpolation
Radial Basis Function InterpolationRadial Basis Function Interpolation
Radial Basis Function InterpolationJesse Bettencourt
 
Artificial Neural Network Lect4 : Single Layer Perceptron Classifiers
Artificial Neural Network Lect4 : Single Layer Perceptron ClassifiersArtificial Neural Network Lect4 : Single Layer Perceptron Classifiers
Artificial Neural Network Lect4 : Single Layer Perceptron ClassifiersMohammed Bennamoun
 
Optimization for Deep Learning
Optimization for Deep LearningOptimization for Deep Learning
Optimization for Deep LearningSebastian Ruder
 

What's hot (20)

Logistic regression in Machine Learning
Logistic regression in Machine LearningLogistic regression in Machine Learning
Logistic regression in Machine Learning
 
Kalman filter - Applications in Image processing
Kalman filter - Applications in Image processingKalman filter - Applications in Image processing
Kalman filter - Applications in Image processing
 
K mean-clustering algorithm
K mean-clustering algorithmK mean-clustering algorithm
K mean-clustering algorithm
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant Analysis
 
Pca(principal components analysis)
Pca(principal components analysis)Pca(principal components analysis)
Pca(principal components analysis)
 
5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...
5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...
5. Linear Algebra for Machine Learning: Singular Value Decomposition and Prin...
 
Support vector machines
Support vector machinesSupport vector machines
Support vector machines
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Ensemble methods in machine learning
Ensemble methods in machine learningEnsemble methods in machine learning
Ensemble methods in machine learning
 
03 Machine Learning Linear Algebra
03 Machine Learning Linear Algebra03 Machine Learning Linear Algebra
03 Machine Learning Linear Algebra
 
ML - Simple Linear Regression
ML - Simple Linear RegressionML - Simple Linear Regression
ML - Simple Linear Regression
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear Regression
 
Birch Algorithm With Solved Example
Birch Algorithm With Solved ExampleBirch Algorithm With Solved Example
Birch Algorithm With Solved Example
 
Eigen values and eigen vectors engineering
Eigen values and eigen vectors engineeringEigen values and eigen vectors engineering
Eigen values and eigen vectors engineering
 
CSC446: Pattern Recognition (LN6)
CSC446: Pattern Recognition (LN6)CSC446: Pattern Recognition (LN6)
CSC446: Pattern Recognition (LN6)
 
Singular Value Decompostion (SVD)
Singular Value Decompostion (SVD)Singular Value Decompostion (SVD)
Singular Value Decompostion (SVD)
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Radial Basis Function Interpolation
Radial Basis Function InterpolationRadial Basis Function Interpolation
Radial Basis Function Interpolation
 
Artificial Neural Network Lect4 : Single Layer Perceptron Classifiers
Artificial Neural Network Lect4 : Single Layer Perceptron ClassifiersArtificial Neural Network Lect4 : Single Layer Perceptron Classifiers
Artificial Neural Network Lect4 : Single Layer Perceptron Classifiers
 
Optimization for Deep Learning
Optimization for Deep LearningOptimization for Deep Learning
Optimization for Deep Learning
 

Viewers also liked

Independent component analysis
Independent component analysisIndependent component analysis
Independent component analysisYstallonne Alves
 
A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...
A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...
A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...CSCJournals
 
A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...
A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...
A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...Sahidul Islam
 
Analysis of EEG data Using ICA and Algorithm Development for Energy Comparison
Analysis of EEG data Using ICA and Algorithm Development for Energy ComparisonAnalysis of EEG data Using ICA and Algorithm Development for Energy Comparison
Analysis of EEG data Using ICA and Algorithm Development for Energy Comparisonijsrd.com
 
音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...
音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...
音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...Daichi Kitamura
 
01 Introduction to Data Mining
01 Introduction to Data Mining01 Introduction to Data Mining
01 Introduction to Data MiningValerii Klymchuk
 
05 Clustering in Data Mining
05 Clustering in Data Mining05 Clustering in Data Mining
05 Clustering in Data MiningValerii Klymchuk
 
Chap8 basic cluster_analysis
Chap8 basic cluster_analysisChap8 basic cluster_analysis
Chap8 basic cluster_analysisguru_prasadg
 
Feature selection concepts and methods
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methodsReza Ramezani
 

Viewers also liked (12)

Independent component analysis
Independent component analysisIndependent component analysis
Independent component analysis
 
ICA Slides
ICA SlidesICA Slides
ICA Slides
 
A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...
A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...
A New Approach to Denoising EEG Signals - Merger of Translation Invariant Wav...
 
A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...
A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...
A Comparative Study between ICA (Independent Component Analysis) and PCA (Pri...
 
Analysis of EEG data Using ICA and Algorithm Development for Energy Comparison
Analysis of EEG data Using ICA and Algorithm Development for Energy ComparisonAnalysis of EEG data Using ICA and Algorithm Development for Energy Comparison
Analysis of EEG data Using ICA and Algorithm Development for Energy Comparison
 
Feature Selection
Feature Selection Feature Selection
Feature Selection
 
Clustering
ClusteringClustering
Clustering
 
音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...
音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...
音響メディア信号処理における独立成分分析の発展と応用, History of independent component analysis for sou...
 
01 Introduction to Data Mining
01 Introduction to Data Mining01 Introduction to Data Mining
01 Introduction to Data Mining
 
05 Clustering in Data Mining
05 Clustering in Data Mining05 Clustering in Data Mining
05 Clustering in Data Mining
 
Chap8 basic cluster_analysis
Chap8 basic cluster_analysisChap8 basic cluster_analysis
Chap8 basic cluster_analysis
 
Feature selection concepts and methods
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methods
 

Similar to ICA Method Review: Untangling Mixed Data Strings

Cs229 notes11
Cs229 notes11Cs229 notes11
Cs229 notes11VuTran231
 
Machine learning (12)
Machine learning (12)Machine learning (12)
Machine learning (12)NYversity
 
Linear_Algebra_final.pdf
Linear_Algebra_final.pdfLinear_Algebra_final.pdf
Linear_Algebra_final.pdfRohitAnand125
 
Machine learning (11)
Machine learning (11)Machine learning (11)
Machine learning (11)NYversity
 
Matrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlpMatrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlpankit_ppt
 
Spectral Clustering Report
Spectral Clustering ReportSpectral Clustering Report
Spectral Clustering ReportMiaolan Xie
 
Machine learning (10)
Machine learning (10)Machine learning (10)
Machine learning (10)NYversity
 
Cs229 notes9
Cs229 notes9Cs229 notes9
Cs229 notes9VuTran231
 
Estimation theory 1
Estimation theory 1Estimation theory 1
Estimation theory 1Gopi Saiteja
 
DL-unit-1.pptx
DL-unit-1.pptxDL-unit-1.pptx
DL-unit-1.pptxMMAHESH29
 
Tutorial on EM algorithm – Part 1
Tutorial on EM algorithm – Part 1Tutorial on EM algorithm – Part 1
Tutorial on EM algorithm – Part 1Loc Nguyen
 
20070823
2007082320070823
20070823neostar
 
Cs229 notes10
Cs229 notes10Cs229 notes10
Cs229 notes10VuTran231
 

Similar to ICA Method Review: Untangling Mixed Data Strings (20)

Cs229 notes11
Cs229 notes11Cs229 notes11
Cs229 notes11
 
Machine learning (12)
Machine learning (12)Machine learning (12)
Machine learning (12)
 
Linear_Algebra_final.pdf
Linear_Algebra_final.pdfLinear_Algebra_final.pdf
Linear_Algebra_final.pdf
 
Machine learning (11)
Machine learning (11)Machine learning (11)
Machine learning (11)
 
Matrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlpMatrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlp
 
overviewPCA
overviewPCAoverviewPCA
overviewPCA
 
Spectral Clustering Report
Spectral Clustering ReportSpectral Clustering Report
Spectral Clustering Report
 
Machine learning (10)
Machine learning (10)Machine learning (10)
Machine learning (10)
 
Cs229 notes9
Cs229 notes9Cs229 notes9
Cs229 notes9
 
RTSP Report
RTSP ReportRTSP Report
RTSP Report
 
Class9_PCA_final.ppt
Class9_PCA_final.pptClass9_PCA_final.ppt
Class9_PCA_final.ppt
 
Ht3613671371
Ht3613671371Ht3613671371
Ht3613671371
 
Ht3613671371
Ht3613671371Ht3613671371
Ht3613671371
 
Estimation theory 1
Estimation theory 1Estimation theory 1
Estimation theory 1
 
DL-unit-1.pptx
DL-unit-1.pptxDL-unit-1.pptx
DL-unit-1.pptx
 
Tutorial on EM algorithm – Part 1
Tutorial on EM algorithm – Part 1Tutorial on EM algorithm – Part 1
Tutorial on EM algorithm – Part 1
 
icarsn
icarsnicarsn
icarsn
 
Lec12
Lec12Lec12
Lec12
 
20070823
2007082320070823
20070823
 
Cs229 notes10
Cs229 notes10Cs229 notes10
Cs229 notes10
 

More from Vanessa S

The Stories We Tell Ourselves
The Stories We Tell OurselvesThe Stories We Tell Ourselves
The Stories We Tell OurselvesVanessa S
 
Singularity Registry HPC
Singularity Registry HPCSingularity Registry HPC
Singularity Registry HPCVanessa S
 
Introduction to Singularity and Data Containers
Introduction to Singularity and Data ContainersIntroduction to Singularity and Data Containers
Introduction to Singularity and Data ContainersVanessa S
 
Research Software Engineering at Stanford University
Research Software Engineering at Stanford UniversityResearch Software Engineering at Stanford University
Research Software Engineering at Stanford UniversityVanessa S
 
Research Software Engineering at Stanford
Research Software Engineering at StanfordResearch Software Engineering at Stanford
Research Software Engineering at StanfordVanessa S
 
Adding An Operator to Airflow: A Contributor Overflow Exception
Adding An Operator to Airflow: A Contributor Overflow ExceptionAdding An Operator to Airflow: A Contributor Overflow Exception
Adding An Operator to Airflow: A Contributor Overflow ExceptionVanessa S
 
The Research Software Encyclopedia
The Research Software EncyclopediaThe Research Software Encyclopedia
The Research Software EncyclopediaVanessa S
 
The Scientific Filesystem
The Scientific FilesystemThe Scientific Filesystem
The Scientific FilesystemVanessa S
 
Singularity Containers for Scientific Compute
Singularity Containers for Scientific ComputeSingularity Containers for Scientific Compute
Singularity Containers for Scientific ComputeVanessa S
 
Laboratory of NeuroGenetics QA (8/2010)
Laboratory of NeuroGenetics QA (8/2010)Laboratory of NeuroGenetics QA (8/2010)
Laboratory of NeuroGenetics QA (8/2010)Vanessa S
 
PEARC17: Reproducibility and Containers: The Perfect Sandwich
PEARC17: Reproducibility and Containers: The Perfect SandwichPEARC17: Reproducibility and Containers: The Perfect Sandwich
PEARC17: Reproducibility and Containers: The Perfect SandwichVanessa S
 
Building Tools for Neuroimaging
Building Tools for NeuroimagingBuilding Tools for Neuroimaging
Building Tools for NeuroimagingVanessa S
 
Brain Maps like Mine
Brain Maps like MineBrain Maps like Mine
Brain Maps like MineVanessa S
 
Qualifying Exam Presentation
Qualifying Exam PresentationQualifying Exam Presentation
Qualifying Exam PresentationVanessa S
 
Pre-Proposal Presentation
Pre-Proposal PresentationPre-Proposal Presentation
Pre-Proposal PresentationVanessa S
 
Subnetworks in Schizophrenia, fMRI
Subnetworks in Schizophrenia, fMRISubnetworks in Schizophrenia, fMRI
Subnetworks in Schizophrenia, fMRIVanessa S
 
Research in Progress April 2014
Research in Progress April 2014Research in Progress April 2014
Research in Progress April 2014Vanessa S
 
Research in Progress Presentation
Research in Progress PresentationResearch in Progress Presentation
Research in Progress PresentationVanessa S
 
Introduction to Machine Learning Lecture
Introduction to Machine Learning LectureIntroduction to Machine Learning Lecture
Introduction to Machine Learning LectureVanessa S
 
Introduction to Neuroimaging Informatics
Introduction to Neuroimaging InformaticsIntroduction to Neuroimaging Informatics
Introduction to Neuroimaging InformaticsVanessa S
 

More from Vanessa S (20)

The Stories We Tell Ourselves
The Stories We Tell OurselvesThe Stories We Tell Ourselves
The Stories We Tell Ourselves
 
Singularity Registry HPC
Singularity Registry HPCSingularity Registry HPC
Singularity Registry HPC
 
Introduction to Singularity and Data Containers
Introduction to Singularity and Data ContainersIntroduction to Singularity and Data Containers
Introduction to Singularity and Data Containers
 
Research Software Engineering at Stanford University
Research Software Engineering at Stanford UniversityResearch Software Engineering at Stanford University
Research Software Engineering at Stanford University
 
Research Software Engineering at Stanford
Research Software Engineering at StanfordResearch Software Engineering at Stanford
Research Software Engineering at Stanford
 
Adding An Operator to Airflow: A Contributor Overflow Exception
Adding An Operator to Airflow: A Contributor Overflow ExceptionAdding An Operator to Airflow: A Contributor Overflow Exception
Adding An Operator to Airflow: A Contributor Overflow Exception
 
The Research Software Encyclopedia
The Research Software EncyclopediaThe Research Software Encyclopedia
The Research Software Encyclopedia
 
The Scientific Filesystem
The Scientific FilesystemThe Scientific Filesystem
The Scientific Filesystem
 
Singularity Containers for Scientific Compute
Singularity Containers for Scientific ComputeSingularity Containers for Scientific Compute
Singularity Containers for Scientific Compute
 
Laboratory of NeuroGenetics QA (8/2010)
Laboratory of NeuroGenetics QA (8/2010)Laboratory of NeuroGenetics QA (8/2010)
Laboratory of NeuroGenetics QA (8/2010)
 
PEARC17: Reproducibility and Containers: The Perfect Sandwich
PEARC17: Reproducibility and Containers: The Perfect SandwichPEARC17: Reproducibility and Containers: The Perfect Sandwich
PEARC17: Reproducibility and Containers: The Perfect Sandwich
 
Building Tools for Neuroimaging
Building Tools for NeuroimagingBuilding Tools for Neuroimaging
Building Tools for Neuroimaging
 
Brain Maps like Mine
Brain Maps like MineBrain Maps like Mine
Brain Maps like Mine
 
Qualifying Exam Presentation
Qualifying Exam PresentationQualifying Exam Presentation
Qualifying Exam Presentation
 
Pre-Proposal Presentation
Pre-Proposal PresentationPre-Proposal Presentation
Pre-Proposal Presentation
 
Subnetworks in Schizophrenia, fMRI
Subnetworks in Schizophrenia, fMRISubnetworks in Schizophrenia, fMRI
Subnetworks in Schizophrenia, fMRI
 
Research in Progress April 2014
Research in Progress April 2014Research in Progress April 2014
Research in Progress April 2014
 
Research in Progress Presentation
Research in Progress PresentationResearch in Progress Presentation
Research in Progress Presentation
 
Introduction to Machine Learning Lecture
Introduction to Machine Learning LectureIntroduction to Machine Learning Lecture
Introduction to Machine Learning Lecture
 
Introduction to Neuroimaging Informatics
Introduction to Neuroimaging InformaticsIntroduction to Neuroimaging Informatics
Introduction to Neuroimaging Informatics
 

Recently uploaded

Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxnoordubaliya2003
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPirithiRaju
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 

Recently uploaded (20)

Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptx
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdf
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 

ICA Method Review: Untangling Mixed Data Strings

  • 2. INDEPENDENT COMPONENT ANALYSIS DATA Imagine that you are a weaver, and you have a loom of colorful strings. Each string represents a unique pattern in the data. With actual data, each of these strings would be a vector of numbers that can be fit with a linear equation. As we see the strings above, they are well organized.
  • 3. MIXED UP DATA INDEPENDENCE INDEPENDENT COMPONENT ANALYSIS Unfortunately, when we collect data in the real world it does not come to us neat and organized. Our unique strings get mixed up with other strings, and random signal such as noise. In our example above, a monkey has come along and mixed up our strings. How do we untangle them?
  • 4. HOW TO UNMIX? INDEPENDENT COMPONENT ANALYSIS We could know something special about each string, maybe a feature like color, and manually unmix, however it we are dealing with a huge dataset and don’t have a clue about any special features, we are powerless. This is where ICA comes in. We start with our mixed data and assume 1) we have mixed up data (our loom) that is 2) comprised of independent signals
  • 5. MIXED STRINGS (OBSERVED DATA) INDEPENDENT COMPONENT ANALYSIS = MONKEY MADNESS (“MIXING MATRIX”) X = A SX ORIGINAL STRINGS (ORIGINAL DATA) X We start with this mixed up data, X, and we know that it was generated by the monkey applying some sequence of movements to it (the “monkey madness”). We call this series of transformations that the monkey applies to the unmixed data, s our mixing matrix. This matrix would consist of vectors of numbers that, when multiplied with s, produce the observed data X.
  • 6. INDEPENDENT COMPONENT ANALYSIS S = A-1 XX To solve this problem and recover our original strings from the mixed ones, we just need to solve this equation for s. We know X, so we just need to figure out what the inverse of A is. This is normally referred to as “W” or the un-mixing matrix. We are going to choose the numbers in this matrix that maximize the probability of our data. MIXED STRINGS (OBSERVED DATA) = UN-MIXING MATRIX, W ORIGINAL STRINGS (ORIGINAL DATA) X
  • 7. INDEPENDENT COMPONENT ANALYSIS S = A-1 XX What is basically done is that we model the CDF of each signal’s probability as the sigmoid function because it increases from 0 to 1, the derivative of the sigmoid is the density function, and then we would iteratively maximize that function until convergence to find the weights, this inverse matrix (details in next slides!) MIXED STRINGS (OBSERVED DATA) = UN-MIXING MATRIX, W ORIGINAL STRINGS (ORIGINAL DATA) X
  • 8. Independent Component Analysis How to find the weights with Maximum Likelihood Estimation? Suppose that the distribution of each source si is given by a density ps, and that the joint distribution of the sources s is given by: this implies the following density on x = As = W1s All that remains is to specify a density (a CDF) for the individual sources ps. It can’t be Gaussian, how about sigmoid? (increases from 0 to 1) CS229 Notes, Andrew Ng, 2012
  • 9. So we model the CDF for each independent signal with sigmoid, so to get the probability of the signal at any particular time-point we look at the derivative of the CDF (the PDF): So if we want to maximize this probability (find our data), we want to make it as big as possible. The square matrix W is the parameter in our model, so given a training set, the log likelihood is given by: And we want to maximize this in terms of W. It’s useful to know that: And so a “one at a time” (stochastic gradient ascent rule) is: This is how we would update our weights until convergence. Independent Component Analysis How to find the weights with Maximum Likelihood Estimation? CS229 Notes, Andrew Ng, 2012
  • 10. FastICA Modification “ICA with Reference” is a modification of FastICA CS229 Notes, Andrew Ng, 2012 Negative entropy is used to measure mutual independence in formula: 1st term: Gaussian variable (wTx), 2nd non-quadratic contrast function ||w||2 = 1 used when maximizing J(y) such that: If we choose, for the 2nd function G’’’(u) = u3, the update becomes: “Inspired” by this form of the update, we can impose an additional constraint that incorporates prior information about the components so it no longer maximizes just independence, but is also close to the reference, r:
  • 11. ICA CAVEATS Permutation of the original sources is ambiguous But this doesn’t matter for most applications Data assumed to be non-Gaussian If the data is Gaussian, there is an arbitrary rotational component in the mixing matrix that cannot be determined from the data, so we cannot recover the original sources No way to recover scaling of the weights If a single column of matrix A were scaled by a factor of 2 and the corresponding source were scaled by a factor of ½, then there is again no way, given only the x(i)’s, to determine this had happened.
  • 12. Why can’t the data be Gaussian? “Suppose we observe some x = As, where A is our mixing matrix. The distribution of x will also be Gaussian, with zero mean and covariance E[xxT ] = E[AssTAT ] = AAT Now, let R be an arbitrary orthogonal (less formally, a rotation/reflection) matrix, so that RRT = RTR = I, and let A’ = AR. Then if the data had been mixed according to A’ instead of A, we would have instead observed x’ = A’s. The distribution of x’ is also Gaussian, with zero mean and covariance E[x’(x’)T ] = E[A’ssT (A’)T ] = E[ARssT (AR)T ] = ARRTAT = AAT Hence, whether the mixing matrix is A or A’, we would observe data from a N(0;AAT ) distribution. Thus, there is no way to tell if the sources were mixed using A and A’. So, there is an arbitrary rotational component in the mixing matrix that cannot be determined from the data, and we cannot recover the original sources.” CS229 Notes, Andrew Ng, 2012