SlideShare a Scribd company logo
Introduction
Methodology
Experimental Results
Conclusions
Dimensionality Reduction and Prediction of the
Protein Macromolecule Dissolution Profile
V. K. Ojha, K. Jackowski, V. Sn´aˇsel and A. Abraham
IT4Innovations
VˇSB - Technical University of Ostrava
Czech Republic
24 June 2014
1 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
The problem
Approach
A Complete overview
Introduction
Problem: Prediction of the dissolution profile of Poly
(Lactic-co-Glycolic Acid) (PLGA) micro- and nanoparticles.
Motivation: PLGA microparticles are important diluents in the
formulation of drugs in the dosage form.
It act as an excipient in drug formation.
It helps dissolution of the drugs, thus increases absorbability
and solubility of drugs.
It helps in pharmaceutical manufacturing process by improving
APIs powder’s flowability and nonstickiness.
2 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
The problem
Approach
A Complete overview
Introduction
Critical Issue: PLGA dissolution prediction is a complex
problem as there are several potential factors influencing
dissolution of PLGA protein particles. Collecting all such
influencing factors leads to three hundred input features in
dataset.
Background: Szlkeket et al. 1 in their article offered a dataset
with three hundred input features divided into four groups,
namely protein descriptor, plasticizer, formulation
characteristics, and emulsifier collected from various literature.
Goal: Dimensionality reduction using feature
selection/extraction and finding a suitable regression model.
1
Szlkek, J., Paclawski, A., Lau, R., Jachowicz, R., Mendyk, A.: Heuristic modeling of macromolecule release
from PLGA microspheres. International journal of nanomedicine 8 (2013) 4601.
3 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
The problem
Approach
A Complete overview
Overview
Dataset
Dimension,Reduction
Feature,Selection Feature,Extraction
Linear Nonlinear
PCA FA ICA kPCA MDS
Prediction,Models,:GPReg,,LReg,,MLP,,SMORegT
Results,of,10,Cross-validation,Sets,
Select:,Dimension,Reduction,Technique Select:,Prediction,model
Figure: A complete overview of the experimental setup
4 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Dimensionality Reduction
Regression Models
Feature Selection
Backward Feature Elimination (BFE) filter is used for feature
elimination.
BFE starts with maximum number feature in hand (in this
case it starts with three hundred features) and eliminate
features one by one in iterative manner.
At each iteration, resulting accuracy of the prediction is
evaluated for all combination of remaining attributes and
subset of attributes with the highest accuracy is propagated to
next iteration.
The subset with the best accuracy is chosen.
5 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Dimensionality Reduction
Regression Models
Feature Extraction
Feature extraction helps in reducing computational overhead which
may incurred due to use of complete input dimension.
Principle Component Analysis (PCA)
Factor Analysis (FA)
Independent Component Analysis (ICA)
Kernel PCA (kPCA)
Multidimensional Scaling (MDS)
6 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Dimensionality Reduction
Regression Models
Regression models
Regression/Prediction model tries to figure out the relationship
between input variables and output variable.
Linear regression (LReg)
Gaussian Process Regression (GPReg)
Multilayer perceptron (MLP)
Sequential Minimal Optimization Regression (SMOReg)
7 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Feature Selection Results
Feature Extraction Results
Experimental results of feature selection technique
15.000
20.000
25.000
30.000
0.000
5.000
10.000
1 5 10 Optimal 300
Number of Selected Features
AverageRMSE
(a)
15.000
20.000
25.000
GPReg
LReg
0.000
5.000
10.000
1 5 10 Optimal 300
LReg
MLP
SMOReg
Number of Selected Features
Variance (b)
Figure: Experimental results of feature selection, comparison between the
regression models. (a) comparison using average RMSE (b) comparison
using variance.
8 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Feature Selection Results
Feature Extraction Results
Experimental results of feature selection technique
Table: Experimental results for 10cv datasets prepared with distinct
random partitions of the complete dataset using feature selection
technique (Identification of regression model) Note. Mean and variance (VAR) is
computed on 10 RMSE obtained.
Regression Reduced Number of Features
Model 1 5 10 Optimal 300
Mean VAR Mean VAR Mean VAR Mean VAR Mean VAR
GPReg 27.474 10.942 17.107 3.989 15.322 3.782 15.709 3.162 16.812 3.551
LReg 26.613 3.232 23.447 3.702 19.979 3.402 17.847 1.634 17.074 2.738
MLP 28.329 7.428 23.113 10.007 20.997 11.365 17.820 8.095 18.571 21.063
SMOReg 26.970 3.307 23.381 2.729 19.526 3.757 17.885 3.321 16.529 2.554
9 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Feature Selection Results
Feature Extraction Results
Experimental results of feature extraction technique
20
25
30
35
0
5
10
15
20
ICA PCA FA kPCA MDS
AverageRMSE
(a)
5
6
7
8
9
GPReg
0
1
2
3
4
5
ICA PCA FA kPCA MDS
LReg
MLP
SMOReg
(b)
Variance
Figure: Experimental results of feature extraction with reduced dimension
30, comparison between the regression models. (a) comparison using
average RMSE (b) comparison using variance.
10 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Feature Selection Results
Feature Extraction Results
Experimental results of feature selection technique
Table: Experimental results for 10cv datasets prepared with distinct
random partitions of the complete dataset using feature selection
technique (Identification of regression model) Note. Mean and variance (VAR) is
computed on 10 RMSE obtained.
Regression Reduced Number of Features
Model ICA PCA FA kPCA MDS
Mean VAR Mean VAR Mean VAR Mean VAR Mean VAR
GPReg 14.826 3.612 16.636 3.160 28.314 3.338 24.955 1.965 28.413 3.155
LReg 17.233 2.340 17.170 2.790 29.970 1.766 25.348 2.048 29.192 2.079
MLP 13.945 2.765 13.590 1.560 31.010 1.825 27.067 4.090 29.925 3.105
SMOReg 17.925 2.875 17.660 1.560 30.257 3.373 25.900 1.700 29.641 2.758
11 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Conclusion
Future Work
Conclusion
Large number of input features predicting the rate of
dissolution is a complex problem.
Feature selection technique let us select most influencing
features among the available features without worsening the
performance.
Features extraction techniques provide a reduced set of new
features which performs better than when considering all the
features together.
We have analysed the performance of GPReg, LReg, MLP and
SMOReg.
Performance of GPReg is best which offers lowest average
RMSE and VAR with 10 selected features.
PCA used to reduce dimension to 30 offered best result using
MLP with lowest average RMSE and VAR.
12 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Conclusion
Future Work
Future Work
Focus on the various types of stochastic feature selection
methods
Exploring different other types of regression models.
Study on making use of ensemble of elementary regression.
Comparison of ensemble methods.
13 / 14 Varun Ojha IBICA 2014
Introduction
Methodology
Experimental Results
Conclusions
Thank You!
varun.kumar.ojha.@vsb.cz
14 / 14 Varun Ojha IBICA 2014

More Related Content

Similar to Dimensionality Reduction and Prediction of the Protein Macromolecule Dissolution Pro le

Thesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risksThesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risks
TuanNguyen1697
 
Deep_Learning__INAF_baroncelli.pdf
Deep_Learning__INAF_baroncelli.pdfDeep_Learning__INAF_baroncelli.pdf
Deep_Learning__INAF_baroncelli.pdf
asdfasdf214078
 
How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
Ann-Marie Roche
 
A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...
A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...
A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...
Tathagata Bandyopadhyay
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern Recognition
Dr. Amarjeet Singh
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
NSEAkure
 
Design of Experimentation, Artificial Neural Network Simulation and Optimizat...
Design of Experimentation, Artificial Neural Network Simulation and Optimizat...Design of Experimentation, Artificial Neural Network Simulation and Optimizat...
Design of Experimentation, Artificial Neural Network Simulation and Optimizat...
IJERA Editor
 
Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)
Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)
Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)
SriSruthiChilukuri
 
Process Capability: Overview
Process Capability: OverviewProcess Capability: Overview
Process Capability: Overview
Matt Hansen
 
Protein functional site prediction using the shotest path graphnew1 2
Protein functional site prediction using the shotest path graphnew1 2Protein functional site prediction using the shotest path graphnew1 2
Protein functional site prediction using the shotest path graphnew1 2
M Beneragama
 
Prediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR Model
Prediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR ModelPrediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR Model
Prediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR Model
IRJET Journal
 
Analytical QbD
Analytical QbDAnalytical QbD
Analytical QbD
Sneha Kadu
 
Analytical QbD
Analytical QbDAnalytical QbD
Analytical QbD
Sneha Kadu
 
Analytical QbD
Analytical QbDAnalytical QbD
Analytical QbD
Sneha Kadu
 
Bt34433436
Bt34433436Bt34433436
Bt34433436
IJERA Editor
 
Combining Cluster Sampling and ACE analysis to improve fault-injection based ...
Combining Cluster Sampling and ACE analysis to improve fault-injection based ...Combining Cluster Sampling and ACE analysis to improve fault-injection based ...
Combining Cluster Sampling and ACE analysis to improve fault-injection based ...
Stefano Di Carlo
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
Valery Tkachenko
 
Failure Prediction Using Interaction between Parallel Links of FA Equipment
Failure Prediction Using Interaction between Parallel Links of FA EquipmentFailure Prediction Using Interaction between Parallel Links of FA Equipment
Failure Prediction Using Interaction between Parallel Links of FA Equipment
MasanoriHaga1
 

Similar to Dimensionality Reduction and Prediction of the Protein Macromolecule Dissolution Pro le (20)

Thesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risksThesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risks
 
Deep_Learning__INAF_baroncelli.pdf
Deep_Learning__INAF_baroncelli.pdfDeep_Learning__INAF_baroncelli.pdf
Deep_Learning__INAF_baroncelli.pdf
 
How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
 
A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...
A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...
A fast Algorithm for Automatic Segmentation of Pancreas Histological Images f...
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern Recognition
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
 
Team 16_Report
Team 16_ReportTeam 16_Report
Team 16_Report
 
Team 16_Report
Team 16_ReportTeam 16_Report
Team 16_Report
 
Design of Experimentation, Artificial Neural Network Simulation and Optimizat...
Design of Experimentation, Artificial Neural Network Simulation and Optimizat...Design of Experimentation, Artificial Neural Network Simulation and Optimizat...
Design of Experimentation, Artificial Neural Network Simulation and Optimizat...
 
Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)
Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)
Synthesizing electrocardiogram (ecg) from photoplethysmogram(ppg)
 
Process Capability: Overview
Process Capability: OverviewProcess Capability: Overview
Process Capability: Overview
 
Protein functional site prediction using the shotest path graphnew1 2
Protein functional site prediction using the shotest path graphnew1 2Protein functional site prediction using the shotest path graphnew1 2
Protein functional site prediction using the shotest path graphnew1 2
 
Prediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR Model
Prediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR ModelPrediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR Model
Prediction of pIC50 Values for the Acetylcholinesterase (AChE) using QSAR Model
 
Analytical QbD
Analytical QbDAnalytical QbD
Analytical QbD
 
Analytical QbD
Analytical QbDAnalytical QbD
Analytical QbD
 
Analytical QbD
Analytical QbDAnalytical QbD
Analytical QbD
 
Bt34433436
Bt34433436Bt34433436
Bt34433436
 
Combining Cluster Sampling and ACE analysis to improve fault-injection based ...
Combining Cluster Sampling and ACE analysis to improve fault-injection based ...Combining Cluster Sampling and ACE analysis to improve fault-injection based ...
Combining Cluster Sampling and ACE analysis to improve fault-injection based ...
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
 
Failure Prediction Using Interaction between Parallel Links of FA Equipment
Failure Prediction Using Interaction between Parallel Links of FA EquipmentFailure Prediction Using Interaction between Parallel Links of FA Equipment
Failure Prediction Using Interaction between Parallel Links of FA Equipment
 

More from Varun Ojha

Chapter 6 Image Processing: Image Enhancement
Chapter 6 Image Processing: Image EnhancementChapter 6 Image Processing: Image Enhancement
Chapter 6 Image Processing: Image Enhancement
Varun Ojha
 
Chapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationChapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier Transformation
Varun Ojha
 
Chapter 4 Image Processing: Image Transformation
Chapter 4 Image Processing: Image TransformationChapter 4 Image Processing: Image Transformation
Chapter 4 Image Processing: Image Transformation
Varun Ojha
 
Chapter 2 Image Processing: Pixel Relation
Chapter 2 Image Processing: Pixel RelationChapter 2 Image Processing: Pixel Relation
Chapter 2 Image Processing: Pixel Relation
Varun Ojha
 
Chapter 3 Image Processing: Basic Transformation
Chapter 3 Image Processing:  Basic TransformationChapter 3 Image Processing:  Basic Transformation
Chapter 3 Image Processing: Basic Transformation
Varun Ojha
 
Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)
Varun Ojha
 
Neural Tree for Estimating the Uniaxial Compressive Strength of Rock Materials
Neural Tree for Estimating the Uniaxial Compressive Strength of Rock MaterialsNeural Tree for Estimating the Uniaxial Compressive Strength of Rock Materials
Neural Tree for Estimating the Uniaxial Compressive Strength of Rock Materials
Varun Ojha
 
Metaheuristic Tuning of Type-II Fuzzy Inference System for Data Mining
Metaheuristic Tuning of Type-II Fuzzy Inference System for Data MiningMetaheuristic Tuning of Type-II Fuzzy Inference System for Data Mining
Metaheuristic Tuning of Type-II Fuzzy Inference System for Data Mining
Varun Ojha
 
A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...
A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...
A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...
Varun Ojha
 
Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...
Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...
Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...
Varun Ojha
 
Simultaneous optimization of neural network weights and active nodes using me...
Simultaneous optimization of neural network weights and active nodes using me...Simultaneous optimization of neural network weights and active nodes using me...
Simultaneous optimization of neural network weights and active nodes using me...
Varun Ojha
 
Design and analysis of algorithm
Design and analysis of algorithmDesign and analysis of algorithm
Design and analysis of algorithm
Varun Ojha
 

More from Varun Ojha (12)

Chapter 6 Image Processing: Image Enhancement
Chapter 6 Image Processing: Image EnhancementChapter 6 Image Processing: Image Enhancement
Chapter 6 Image Processing: Image Enhancement
 
Chapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier TransformationChapter 5 Image Processing: Fourier Transformation
Chapter 5 Image Processing: Fourier Transformation
 
Chapter 4 Image Processing: Image Transformation
Chapter 4 Image Processing: Image TransformationChapter 4 Image Processing: Image Transformation
Chapter 4 Image Processing: Image Transformation
 
Chapter 2 Image Processing: Pixel Relation
Chapter 2 Image Processing: Pixel RelationChapter 2 Image Processing: Pixel Relation
Chapter 2 Image Processing: Pixel Relation
 
Chapter 3 Image Processing: Basic Transformation
Chapter 3 Image Processing:  Basic TransformationChapter 3 Image Processing:  Basic Transformation
Chapter 3 Image Processing: Basic Transformation
 
Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)
 
Neural Tree for Estimating the Uniaxial Compressive Strength of Rock Materials
Neural Tree for Estimating the Uniaxial Compressive Strength of Rock MaterialsNeural Tree for Estimating the Uniaxial Compressive Strength of Rock Materials
Neural Tree for Estimating the Uniaxial Compressive Strength of Rock Materials
 
Metaheuristic Tuning of Type-II Fuzzy Inference System for Data Mining
Metaheuristic Tuning of Type-II Fuzzy Inference System for Data MiningMetaheuristic Tuning of Type-II Fuzzy Inference System for Data Mining
Metaheuristic Tuning of Type-II Fuzzy Inference System for Data Mining
 
A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...
A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...
A Framework of Secured and Bio-Inspired Image Steganography Using Chaotic Enc...
 
Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...
Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...
Ensemble of Heterogeneous Flexible Neural Tree for the approximation and feat...
 
Simultaneous optimization of neural network weights and active nodes using me...
Simultaneous optimization of neural network weights and active nodes using me...Simultaneous optimization of neural network weights and active nodes using me...
Simultaneous optimization of neural network weights and active nodes using me...
 
Design and analysis of algorithm
Design and analysis of algorithmDesign and analysis of algorithm
Design and analysis of algorithm
 

Recently uploaded

一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 

Recently uploaded (20)

一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 

Dimensionality Reduction and Prediction of the Protein Macromolecule Dissolution Pro le

  • 1. Introduction Methodology Experimental Results Conclusions Dimensionality Reduction and Prediction of the Protein Macromolecule Dissolution Profile V. K. Ojha, K. Jackowski, V. Sn´aˇsel and A. Abraham IT4Innovations VˇSB - Technical University of Ostrava Czech Republic 24 June 2014 1 / 14 Varun Ojha IBICA 2014
  • 2. Introduction Methodology Experimental Results Conclusions The problem Approach A Complete overview Introduction Problem: Prediction of the dissolution profile of Poly (Lactic-co-Glycolic Acid) (PLGA) micro- and nanoparticles. Motivation: PLGA microparticles are important diluents in the formulation of drugs in the dosage form. It act as an excipient in drug formation. It helps dissolution of the drugs, thus increases absorbability and solubility of drugs. It helps in pharmaceutical manufacturing process by improving APIs powder’s flowability and nonstickiness. 2 / 14 Varun Ojha IBICA 2014
  • 3. Introduction Methodology Experimental Results Conclusions The problem Approach A Complete overview Introduction Critical Issue: PLGA dissolution prediction is a complex problem as there are several potential factors influencing dissolution of PLGA protein particles. Collecting all such influencing factors leads to three hundred input features in dataset. Background: Szlkeket et al. 1 in their article offered a dataset with three hundred input features divided into four groups, namely protein descriptor, plasticizer, formulation characteristics, and emulsifier collected from various literature. Goal: Dimensionality reduction using feature selection/extraction and finding a suitable regression model. 1 Szlkek, J., Paclawski, A., Lau, R., Jachowicz, R., Mendyk, A.: Heuristic modeling of macromolecule release from PLGA microspheres. International journal of nanomedicine 8 (2013) 4601. 3 / 14 Varun Ojha IBICA 2014
  • 4. Introduction Methodology Experimental Results Conclusions The problem Approach A Complete overview Overview Dataset Dimension,Reduction Feature,Selection Feature,Extraction Linear Nonlinear PCA FA ICA kPCA MDS Prediction,Models,:GPReg,,LReg,,MLP,,SMORegT Results,of,10,Cross-validation,Sets, Select:,Dimension,Reduction,Technique Select:,Prediction,model Figure: A complete overview of the experimental setup 4 / 14 Varun Ojha IBICA 2014
  • 5. Introduction Methodology Experimental Results Conclusions Dimensionality Reduction Regression Models Feature Selection Backward Feature Elimination (BFE) filter is used for feature elimination. BFE starts with maximum number feature in hand (in this case it starts with three hundred features) and eliminate features one by one in iterative manner. At each iteration, resulting accuracy of the prediction is evaluated for all combination of remaining attributes and subset of attributes with the highest accuracy is propagated to next iteration. The subset with the best accuracy is chosen. 5 / 14 Varun Ojha IBICA 2014
  • 6. Introduction Methodology Experimental Results Conclusions Dimensionality Reduction Regression Models Feature Extraction Feature extraction helps in reducing computational overhead which may incurred due to use of complete input dimension. Principle Component Analysis (PCA) Factor Analysis (FA) Independent Component Analysis (ICA) Kernel PCA (kPCA) Multidimensional Scaling (MDS) 6 / 14 Varun Ojha IBICA 2014
  • 7. Introduction Methodology Experimental Results Conclusions Dimensionality Reduction Regression Models Regression models Regression/Prediction model tries to figure out the relationship between input variables and output variable. Linear regression (LReg) Gaussian Process Regression (GPReg) Multilayer perceptron (MLP) Sequential Minimal Optimization Regression (SMOReg) 7 / 14 Varun Ojha IBICA 2014
  • 8. Introduction Methodology Experimental Results Conclusions Feature Selection Results Feature Extraction Results Experimental results of feature selection technique 15.000 20.000 25.000 30.000 0.000 5.000 10.000 1 5 10 Optimal 300 Number of Selected Features AverageRMSE (a) 15.000 20.000 25.000 GPReg LReg 0.000 5.000 10.000 1 5 10 Optimal 300 LReg MLP SMOReg Number of Selected Features Variance (b) Figure: Experimental results of feature selection, comparison between the regression models. (a) comparison using average RMSE (b) comparison using variance. 8 / 14 Varun Ojha IBICA 2014
  • 9. Introduction Methodology Experimental Results Conclusions Feature Selection Results Feature Extraction Results Experimental results of feature selection technique Table: Experimental results for 10cv datasets prepared with distinct random partitions of the complete dataset using feature selection technique (Identification of regression model) Note. Mean and variance (VAR) is computed on 10 RMSE obtained. Regression Reduced Number of Features Model 1 5 10 Optimal 300 Mean VAR Mean VAR Mean VAR Mean VAR Mean VAR GPReg 27.474 10.942 17.107 3.989 15.322 3.782 15.709 3.162 16.812 3.551 LReg 26.613 3.232 23.447 3.702 19.979 3.402 17.847 1.634 17.074 2.738 MLP 28.329 7.428 23.113 10.007 20.997 11.365 17.820 8.095 18.571 21.063 SMOReg 26.970 3.307 23.381 2.729 19.526 3.757 17.885 3.321 16.529 2.554 9 / 14 Varun Ojha IBICA 2014
  • 10. Introduction Methodology Experimental Results Conclusions Feature Selection Results Feature Extraction Results Experimental results of feature extraction technique 20 25 30 35 0 5 10 15 20 ICA PCA FA kPCA MDS AverageRMSE (a) 5 6 7 8 9 GPReg 0 1 2 3 4 5 ICA PCA FA kPCA MDS LReg MLP SMOReg (b) Variance Figure: Experimental results of feature extraction with reduced dimension 30, comparison between the regression models. (a) comparison using average RMSE (b) comparison using variance. 10 / 14 Varun Ojha IBICA 2014
  • 11. Introduction Methodology Experimental Results Conclusions Feature Selection Results Feature Extraction Results Experimental results of feature selection technique Table: Experimental results for 10cv datasets prepared with distinct random partitions of the complete dataset using feature selection technique (Identification of regression model) Note. Mean and variance (VAR) is computed on 10 RMSE obtained. Regression Reduced Number of Features Model ICA PCA FA kPCA MDS Mean VAR Mean VAR Mean VAR Mean VAR Mean VAR GPReg 14.826 3.612 16.636 3.160 28.314 3.338 24.955 1.965 28.413 3.155 LReg 17.233 2.340 17.170 2.790 29.970 1.766 25.348 2.048 29.192 2.079 MLP 13.945 2.765 13.590 1.560 31.010 1.825 27.067 4.090 29.925 3.105 SMOReg 17.925 2.875 17.660 1.560 30.257 3.373 25.900 1.700 29.641 2.758 11 / 14 Varun Ojha IBICA 2014
  • 12. Introduction Methodology Experimental Results Conclusions Conclusion Future Work Conclusion Large number of input features predicting the rate of dissolution is a complex problem. Feature selection technique let us select most influencing features among the available features without worsening the performance. Features extraction techniques provide a reduced set of new features which performs better than when considering all the features together. We have analysed the performance of GPReg, LReg, MLP and SMOReg. Performance of GPReg is best which offers lowest average RMSE and VAR with 10 selected features. PCA used to reduce dimension to 30 offered best result using MLP with lowest average RMSE and VAR. 12 / 14 Varun Ojha IBICA 2014
  • 13. Introduction Methodology Experimental Results Conclusions Conclusion Future Work Future Work Focus on the various types of stochastic feature selection methods Exploring different other types of regression models. Study on making use of ensemble of elementary regression. Comparison of ensemble methods. 13 / 14 Varun Ojha IBICA 2014