SlideShare a Scribd company logo
1 of 1
Download to read offline
;
Classification of LIBS Spectra using Classificatory Decomposition
Leave-One-Out Approach
Kofi Forson , Kyrhee Powell, and Tomasz Smolinski
Department of Computer and Information Sciences, Delaware State University, Dover DE 19901, USA
What is LIBS?
Laser Induced Breakdown Spectroscopy (LIBS) is a technique that provides
information about the chemical components of a sample through laser ablation.
Different methods have been applied to classify protein LIBS data. Some of
these proteins include Bovine Serum Albumin, Opsteopontin, Leptin, and
Insulin-like Growth Factor II. The classification of these particular proteins can
lead to the detection of diseases, including ovarian cancer.
Classificatory Decomposition
Classificatory Decomposition (CD) is a form of signal decomposition that also
provides classification awareness. It breaks down signals, such as LIBS spectra,
into basis functions. Basis functions are elements that can be found in all of the
original signals, and coefficients are utilized to scale the basis functions to
reconstruct the original signals.
An example of classificatory decomposition involving signals can be seen below:
Figure 1: Example of signal decomposition: A) The original datasets. B) The underlying basis functions. C) The
coefficients for representation of the original signals in the new attribute space.
The main objectives of classificatory decomposition and this experiment are:
• To minimize the reconstruction error of the data;
• To minimize the number of basis functions needed to reconstruct the data;
• To minimize the classification error.
Leave-One-Out Approach
The Leave-One-Out (LOO) approach involves taking a single instance out,
training the classifier on the remaining instances, and then testing the
classification scheme on the instance that was taken out. The process is repeated
n times, where n is the number of instances in the input dataset, and the average
accuracy measure over n trials determines the overall goodness of the classifier.
Methodology
To generate the batches of data used for training the classification scheme, we
employed several Python scripts. One of the essential Python scripts we uti-
lized extracted the LIBS data, created new documents, and input the batches in
the newly created documents with corresponding parameter and test files. In to-
tal, there were 132 batches of LIBS data, on which we trained our classification
scheme.
A fundamental component of our classification scheme and experiment involved
Multi-Objective Evolutionary Algorithms (MOEA), such as end-VEGA,
which we implemented in C++. MOEAs mimic the principles of evolution and
natural selection to optimize multiple objectives simultaneously to arrive at a set
of potential solutions to a given optimization problem.
The figure below explains the evolutionary loop process:
Figure 2: A schematic of an evolutionary algorithm.
Methodology (continued)
After training the classification scheme on the LIBS data, we evaluated the ac-
curacy of our classification scheme by utilizing Weka. Weka is a software work-
bench that incorporates several standard machine learning techniques, which al-
low a user to further analyze a large body of data.
Figure 3: Sample plots for each of the four protein classes.
Our experimental settings consisted of:
• A population size of 20 individuals in the MOEA;
• A generation count of 100;
• 132 batches of LIBS spectra;
• 9 trials testing the accuracy of reconstruction and classification.
Results
When running the test trials on the 132 batches of LIBS spectra, the reconstruc-
tion error was almost always below 1. The average reconstruction error was
around 0.2. Moreover, the reduction on average was about 80 percent.
Furthermore, when classifying the LIBS spectra using Weka, our result was ei-
ther right or wrong. We either got a 0 for an incorrect classification or a 1 for a
correct classification.
Forthcoming Research/Conclusions
In the future, we would like to:
• Train different classification schemes on the LIBS spectra;
• Utilize additional MOEAs in our classification schemes;
• Expand the use of classificatory decomposition to different kinds of data.
Furthermore, we learned that the Leave-One-Out Approach has disadvantages
when it comes to the classification of data, because it is either hit or miss. We re-
ceived a 0 for an incorrect classification or a 1 for a correct classification.
References
• Sparse Correlation Kernel Analysis and Evolutionary Algorithm-Based Model-
ing of the Sensory Activity within the Rats Barrel Cortex
• Evolutionary Algorithms and Rough Sets-Based Hybrid Approach to Classifi-
catory Decomposition of Cortical Evoked Potentials
• Independent Component Analysis-motivated Approach to Classificatory De-
composition of Cortical Evoked Potentials
• Hybridization of Independent Component Analysis, Rough Sets, and Multi-
Objective Evolutionary Algorithms for Classificatory Decomposition of Cor-
tical Evoked Potentials
• Detection of unusual trajectories using multi-objective evolutionary algorithms
and rough sets
Acknowledgments
We acknowledge the Optical Science Center for Applied Research (OSCAR),
the financial support of The National Science Foundation (NSF-CREST grant
No 1242067 and of the National Aeronautics and Space Administration (NASA
URC 5 grant No NNX09AU90A).

More Related Content

Similar to renewed-poster-presentation (12)

CREOSA_FORSON_Kofi
CREOSA_FORSON_KofiCREOSA_FORSON_Kofi
CREOSA_FORSON_Kofi
Kofi Forson
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
DataScienceConferenc1
 
Knowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learningKnowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learning
jaumebp
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
Philip Cheung
 
Metabolomics Society meeting 2011 - presentatie Kees
Metabolomics Society meeting 2011 - presentatie KeesMetabolomics Society meeting 2011 - presentatie Kees
Metabolomics Society meeting 2011 - presentatie Kees
thehyve
 
Autism_risk_factors
Autism_risk_factorsAutism_risk_factors
Autism_risk_factors
Colleen Chen
 
551report.doc
551report.doc551report.doc
551report.doc
butest
 
A novel methodology for constructing rule based naïve bayesian classifiers
A novel methodology for constructing rule based naïve bayesian classifiersA novel methodology for constructing rule based naïve bayesian classifiers
A novel methodology for constructing rule based naïve bayesian classifiers
ijcsit
 

Similar to renewed-poster-presentation (12) (20)

CREOSA_FORSON_Kofi
CREOSA_FORSON_KofiCREOSA_FORSON_Kofi
CREOSA_FORSON_Kofi
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
 
Knowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learningKnowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learning
 
SecondaryStructurePredictionReport
SecondaryStructurePredictionReportSecondaryStructurePredictionReport
SecondaryStructurePredictionReport
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 
Metabolomics Society meeting 2011 - presentatie Kees
Metabolomics Society meeting 2011 - presentatie KeesMetabolomics Society meeting 2011 - presentatie Kees
Metabolomics Society meeting 2011 - presentatie Kees
 
Microarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector MachineMicroarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector Machine
 
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data StreamsNovel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
 
Autism_risk_factors
Autism_risk_factorsAutism_risk_factors
Autism_risk_factors
 
CI_GA_module2_ABC_updatedG.ppt .
CI_GA_module2_ABC_updatedG.ppt           .CI_GA_module2_ABC_updatedG.ppt           .
CI_GA_module2_ABC_updatedG.ppt .
 
May workshop
May workshopMay workshop
May workshop
 
May 15 workshop
May 15  workshopMay 15  workshop
May 15 workshop
 
Lipidomics Bioinformatics Analysis
Lipidomics Bioinformatics AnalysisLipidomics Bioinformatics Analysis
Lipidomics Bioinformatics Analysis
 
551report.doc
551report.doc551report.doc
551report.doc
 
A novel methodology for constructing rule based naïve bayesian classifiers
A novel methodology for constructing rule based naïve bayesian classifiersA novel methodology for constructing rule based naïve bayesian classifiers
A novel methodology for constructing rule based naïve bayesian classifiers
 
2224d_final
2224d_final2224d_final
2224d_final
 
Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...
 
Bioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptxBioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptx
 
25.ranking on data manifold with sink points
25.ranking on data manifold with sink points25.ranking on data manifold with sink points
25.ranking on data manifold with sink points
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
 

renewed-poster-presentation (12)

  • 1. ; Classification of LIBS Spectra using Classificatory Decomposition Leave-One-Out Approach Kofi Forson , Kyrhee Powell, and Tomasz Smolinski Department of Computer and Information Sciences, Delaware State University, Dover DE 19901, USA What is LIBS? Laser Induced Breakdown Spectroscopy (LIBS) is a technique that provides information about the chemical components of a sample through laser ablation. Different methods have been applied to classify protein LIBS data. Some of these proteins include Bovine Serum Albumin, Opsteopontin, Leptin, and Insulin-like Growth Factor II. The classification of these particular proteins can lead to the detection of diseases, including ovarian cancer. Classificatory Decomposition Classificatory Decomposition (CD) is a form of signal decomposition that also provides classification awareness. It breaks down signals, such as LIBS spectra, into basis functions. Basis functions are elements that can be found in all of the original signals, and coefficients are utilized to scale the basis functions to reconstruct the original signals. An example of classificatory decomposition involving signals can be seen below: Figure 1: Example of signal decomposition: A) The original datasets. B) The underlying basis functions. C) The coefficients for representation of the original signals in the new attribute space. The main objectives of classificatory decomposition and this experiment are: • To minimize the reconstruction error of the data; • To minimize the number of basis functions needed to reconstruct the data; • To minimize the classification error. Leave-One-Out Approach The Leave-One-Out (LOO) approach involves taking a single instance out, training the classifier on the remaining instances, and then testing the classification scheme on the instance that was taken out. The process is repeated n times, where n is the number of instances in the input dataset, and the average accuracy measure over n trials determines the overall goodness of the classifier. Methodology To generate the batches of data used for training the classification scheme, we employed several Python scripts. One of the essential Python scripts we uti- lized extracted the LIBS data, created new documents, and input the batches in the newly created documents with corresponding parameter and test files. In to- tal, there were 132 batches of LIBS data, on which we trained our classification scheme. A fundamental component of our classification scheme and experiment involved Multi-Objective Evolutionary Algorithms (MOEA), such as end-VEGA, which we implemented in C++. MOEAs mimic the principles of evolution and natural selection to optimize multiple objectives simultaneously to arrive at a set of potential solutions to a given optimization problem. The figure below explains the evolutionary loop process: Figure 2: A schematic of an evolutionary algorithm. Methodology (continued) After training the classification scheme on the LIBS data, we evaluated the ac- curacy of our classification scheme by utilizing Weka. Weka is a software work- bench that incorporates several standard machine learning techniques, which al- low a user to further analyze a large body of data. Figure 3: Sample plots for each of the four protein classes. Our experimental settings consisted of: • A population size of 20 individuals in the MOEA; • A generation count of 100; • 132 batches of LIBS spectra; • 9 trials testing the accuracy of reconstruction and classification. Results When running the test trials on the 132 batches of LIBS spectra, the reconstruc- tion error was almost always below 1. The average reconstruction error was around 0.2. Moreover, the reduction on average was about 80 percent. Furthermore, when classifying the LIBS spectra using Weka, our result was ei- ther right or wrong. We either got a 0 for an incorrect classification or a 1 for a correct classification. Forthcoming Research/Conclusions In the future, we would like to: • Train different classification schemes on the LIBS spectra; • Utilize additional MOEAs in our classification schemes; • Expand the use of classificatory decomposition to different kinds of data. Furthermore, we learned that the Leave-One-Out Approach has disadvantages when it comes to the classification of data, because it is either hit or miss. We re- ceived a 0 for an incorrect classification or a 1 for a correct classification. References • Sparse Correlation Kernel Analysis and Evolutionary Algorithm-Based Model- ing of the Sensory Activity within the Rats Barrel Cortex • Evolutionary Algorithms and Rough Sets-Based Hybrid Approach to Classifi- catory Decomposition of Cortical Evoked Potentials • Independent Component Analysis-motivated Approach to Classificatory De- composition of Cortical Evoked Potentials • Hybridization of Independent Component Analysis, Rough Sets, and Multi- Objective Evolutionary Algorithms for Classificatory Decomposition of Cor- tical Evoked Potentials • Detection of unusual trajectories using multi-objective evolutionary algorithms and rough sets Acknowledgments We acknowledge the Optical Science Center for Applied Research (OSCAR), the financial support of The National Science Foundation (NSF-CREST grant No 1242067 and of the National Aeronautics and Space Administration (NASA URC 5 grant No NNX09AU90A).