SlideShare a Scribd company logo
1 of 3
Download to read offline
Data Collection
E nd Process
A lterna tive Stan d a rd
P roce dure For Regression
Modellin g
Performi n g R o bu st
Re gression with cas e
Resampling
Interpret The Output
Linear Regression
Model
R obust R egression with
m m E stimation
P erforming Weighted
for the Regres sion
A p p lied Fuzzy
Re gression Method to
this Alternative Method
Middle-East Journal of Scientific Research 23 (10): 2559-2571, 2015
ISSN 1990-9233
© IDOSI Publications, 2015
DOI: 10.5829/idosi.mejsr.2015.23.10.22731
Corresponding Author: Wan Muhamad Amir W. Ahmad, School of Dental Sciences,Health Campus, Universiti Sains Malaysia
(USM), 16150 Kubang Kerian, Kelantan, Malaysia.
2569
An Alternative Method for Regression Modelling:
Algorithm for Weighted Robust Regression
Wan Muhamad Amir W. Ahmad, Nor Azlida Aleng, Nurfadhlina Abdul Halim,1 2 2
Syerrina Zakaria, Kasypi Mokhtar, Ruhaya Hasan and Tan Min Pau2 3 1 4
School of Dental Sciences, Health Campus, Universiti Sains Malaysia (USM),1
16150 Kubang Kerian, Kelantan, Malaysia
School of Informatics and Applied Mathematics, School of Maritime Business and Management,2 3
School of Fisheries and Aquaculture Sciences,4
Universiti Malaysia Terengganu (UMT), 21030 Terengganu Malaysia
Abstract: This report supplied a comprehensive method of an alternative fuzzy weighted robust linear
regression as a technique for analysis through SAS algorithm. This alternative method is a manipulation
technique for the small data set and give the researcher an options to launch the analysis even there is not
enough data set.
Key words: Bootstrap Weighted SAS algorithm and robust regression
INTRODUCTION
Thus, this paper provides a road map of the practical
approach for regression modeling and an illustration
using fisheries dataset. The parametric bootstrap method
is recommended for sample size between 50 and 100 for a
reliable performance [1, 2]. A recent approach to analyse
data with missing values in the covariates is weighted
estimating equations and this technique appear to be
highly efficient [3, 4]. Data of this study is a sample which
composed of six variables. Namely variables are as in
Table 1.
Multiple fuzzy with weighted regression technique
was used in the analysis of relationship between
variables. The algorithm is given as follows:
Figure 1 showed the flow chart of an alternative
method of fuzzy weighted regression procedure.
Table 1: Description of Data
Num. Code Explanation of user variables
1. Y Standard Length of the fish
2. X1 Total Length of the fish
3. X2 Fish Weight (g)
4. X3 Turbidity
5. X4 Temperature
6 X5 Dissolved Oxygen (mg/l) Fig. 1: Flow Chart of an Alternative Analysis
Middle-East J. Sci. Res., 23 (10): 2569-2571, 2015
2570
/*Title 'alternatif linear programming with combining /*combining bootstrap technique with fuzzy
robust and bootstrap'*/
Data fish;
Input y x1 x2 x3 x4 x5;
Datalines;
31.00 52.00 2.00 19.70 26.08 5.43
32.00 42.00 2.00 1.420 29.75 3.58
32.00 41.00 2.00 6.100 26.39 7.06
33.00 42.00 1.00 1.420 29.75 3.58
33.00 42.00 2.00 17.80 28.36 6.56
34.00 44.00 2.00 1.420 29.75 3.58
34.00 45.00 10.00 1.730 29.35 3.54
35.00 45.00 1.00 12.63 30.44 4.03
35.00 43.00 1.00 1.420 29.75 3.58
;
run;
Ods rtf style=journal;
/*performing the procedure of modeling linear
regression model*/
Ods graphics on;
Proc reg data=fish;
Model y=x1 x2 x3 x4 x5;
run;
Ods graphics off;
/*robust regression for mm-estimation*/
Ods graphics on;
Proc robustreg method=mm fwls data=fish plots=all;
Model y=x1 x2 x3 x4 x5 / diagnostics itprint;
Output out=resids out=robout r=residual weight=weight
outlier=outlier sr=stdres;
run;
Ods graphics off;
/*using bootstrap with case resampling*/
Proc surveyselect data=fish out=boot1 method=urs
samprate=1 outhits rep=100;
run;
/*robust regression for mm-estimation+bootstrap*/
Ods graphics on;
Proc robustreg method=m(wf=huber(c=1.345))fwls
data=boot1 plots=all;
Model y=x1 x2 x3 x4 x5 / diagnostics itprint;
Output out=resids out=robout r=residual weight=weight
outlier=outlier sr=stdres;
run;
Ods graphics off;
regression*/
Proc optmodel;
Set j= 1..30;
Number y{j}, x1{j}, x2{j}, x3{j}, x4{j}, x5{j};
Read data boot1 into [_n_] y x1 x2 x3 x4 x5;
/*print y x1 x2 x3 x4 x5*/
Print y x1 x2 x3 x4 x5;
Number n init 30; /*total of observation*/
/*decision variable bounded or not bounded*/
Var aw{1..6}>=0; /*these six variables are bounded*/
Var ac{1..6}; /*these six variables are not bounded*/
/*objective function*/
Min z1= aw[1] * n + sum{i in j} x1[i] * aw[2]+sum{i in j}
x2[i] * aw[3]+sum{i in j} x3[i] * aw[4]+sum{i in j} x4[i] *
aw[5]+sum{i in j} x5[i] * aw[6];
/*linear constraints*/
Con c{i in 1..n}:
Ac[1]+x1[i]*ac[2]+x2[i]*ac[3]+x3[i]*ac[4]+x4[i]*ac[5]+x
5[i]*ac[6]-aw[1]-x1[i]*aw[2]-x2[i]*aw[3]-x3[i]*aw[4]-
x4[i]*aw[5]-x5[i]*aw[6]<=y[i];
Con c1{i in 1..n}:
Ac[1]+x1[i]*ac[2]+x2[i]*ac[3]+x3[i]*ac[4]+x4[i]*ac[5]+x
5[i]*ac[6]+aw[1]+x1[i]*aw[2]+x2[i]*aw[3]+x3[i]*aw[4]+x
4[i]*aw[5]+x5[i]*aw[6]>=y[i];
Expand; /*this provides all equations*/
Solve;
Print ac aw;
Quit;
Ods rtf close;
RESULT AND DISCUSSION
This paper explained on how an alternative
programming method of fuzzy bootstrap weighted
regression procedure using SAS software can be
applied for the small sample size which the data is
very difficult to collect. By resampling (using
bootstrap method), it provides the pleminary
comprehensive information and also give the general
overview on how the data behaviour even though
the original data is not enough (small sample size).
Middle-East J. Sci. Res., 23 (10): 2569-2571, 2015
2571
In our case, smaller standard error of the estimate 2. Jung, B.C., M. Jhun and J.W. Lee, 2005. Bootstrap
parameter will tell us how accurate our estimate parameter tests for overdispersion in a zero-inflated Poisson
is likely to be. model. Journal of Biometrics, 61(6): 626-628.
REFERENCES Estimation of regression coe_cients when some
1. Cassel, D.L., 2010. Bootstrap Mania: Re sampling the American Statistical Association, 89: 846-866.
SAS. SAS Global Forum 2010: Statistics and Data 4. Zhao, L.P., S.R. Lipsitz and D. Lew, 1996. Regression
Analysis Paper, pp: 1-11. analysis with missing covariate data using estimating
3. Robin, J.M., A. Rotnitzky and L.P. Zhoa, 1994.
regressors are not always observed. Journal of the
equations. Biometrics, 52: 1165-1182.

More Related Content

Similar to Wan Muhamad Amir et al., 2015

2(c)4-Nurkhairany_Amyra_Mokhtar.pdf
2(c)4-Nurkhairany_Amyra_Mokhtar.pdf2(c)4-Nurkhairany_Amyra_Mokhtar.pdf
2(c)4-Nurkhairany_Amyra_Mokhtar.pdfNurain748010
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methodsjoycemi_la
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methodsjoycemi_la
 
A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...Alexander Decker
 
Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround
 Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround
Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndroundhydrologywebsite1
 
Reliability assessment for a four cylinder diesel engine
Reliability assessment for a four cylinder diesel engineReliability assessment for a four cylinder diesel engine
Reliability assessment for a four cylinder diesel engineeSAT Publishing House
 
Splay Method of Model Acquisition Assessment
Splay Method of Model Acquisition AssessmentSplay Method of Model Acquisition Assessment
Splay Method of Model Acquisition Assessmentijtsrd
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATAorajjournal
 
Multi-Attribute Decision Making with VIKOR Method for Any Purpose Decision
Multi-Attribute Decision Making with VIKOR Method for Any Purpose DecisionMulti-Attribute Decision Making with VIKOR Method for Any Purpose Decision
Multi-Attribute Decision Making with VIKOR Method for Any Purpose DecisionUniversitas Pembangunan Panca Budi
 
Mathematical Understanding in Traffic Flow Modelling
Mathematical Understanding in Traffic Flow ModellingMathematical Understanding in Traffic Flow Modelling
Mathematical Understanding in Traffic Flow ModellingNikhil Chandra Sarkar
 
1 FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx
1  FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx1  FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx
1 FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docxmercysuttle
 
Optimization of a 2D localization system of a moving object based on the prop...
Optimization of a 2D localization system of a moving object based on the prop...Optimization of a 2D localization system of a moving object based on the prop...
Optimization of a 2D localization system of a moving object based on the prop...IRJET Journal
 
Comparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionComparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionIOSRJM
 
eggs_project_interm
eggs_project_intermeggs_project_interm
eggs_project_intermRoopan Verma
 
An overview of statistics management with excel
An overview of statistics management with excelAn overview of statistics management with excel
An overview of statistics management with excelKRISHANACHOUDHARY1
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
07 ch ken black solution
07 ch ken black solution07 ch ken black solution
07 ch ken black solutionKrunal Shah
 
Effects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics Model
Effects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics ModelEffects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics Model
Effects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics ModelIJERA Editor
 

Similar to Wan Muhamad Amir et al., 2015 (20)

2(c)4-Nurkhairany_Amyra_Mokhtar.pdf
2(c)4-Nurkhairany_Amyra_Mokhtar.pdf2(c)4-Nurkhairany_Amyra_Mokhtar.pdf
2(c)4-Nurkhairany_Amyra_Mokhtar.pdf
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
 
A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...
 
Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround
 Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround
Download-manuals-water quality-technicalpapers-within-labaq-cfindings-2ndround
 
Reliability assessment for a four cylinder diesel engine
Reliability assessment for a four cylinder diesel engineReliability assessment for a four cylinder diesel engine
Reliability assessment for a four cylinder diesel engine
 
Splay Method of Model Acquisition Assessment
Splay Method of Model Acquisition AssessmentSplay Method of Model Acquisition Assessment
Splay Method of Model Acquisition Assessment
 
Slides ensae 11bis
Slides ensae 11bisSlides ensae 11bis
Slides ensae 11bis
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
 
Multi-Attribute Decision Making with VIKOR Method for Any Purpose Decision
Multi-Attribute Decision Making with VIKOR Method for Any Purpose DecisionMulti-Attribute Decision Making with VIKOR Method for Any Purpose Decision
Multi-Attribute Decision Making with VIKOR Method for Any Purpose Decision
 
Mathematical Understanding in Traffic Flow Modelling
Mathematical Understanding in Traffic Flow ModellingMathematical Understanding in Traffic Flow Modelling
Mathematical Understanding in Traffic Flow Modelling
 
1 FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx
1  FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx1  FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx
1 FACULTY OF SCIENCE AND ENGINEERING SCHOOL OF COMPUT.docx
 
Optimization of a 2D localization system of a moving object based on the prop...
Optimization of a 2D localization system of a moving object based on the prop...Optimization of a 2D localization system of a moving object based on the prop...
Optimization of a 2D localization system of a moving object based on the prop...
 
Comparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionComparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distribution
 
eggs_project_interm
eggs_project_intermeggs_project_interm
eggs_project_interm
 
An overview of statistics management with excel
An overview of statistics management with excelAn overview of statistics management with excel
An overview of statistics management with excel
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
The Right Way
The Right WayThe Right Way
The Right Way
 
07 ch ken black solution
07 ch ken black solution07 ch ken black solution
07 ch ken black solution
 
Effects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics Model
Effects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics ModelEffects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics Model
Effects of A Simulated Power Cut in AMS on Milk Yield Valued by Statistics Model
 

Wan Muhamad Amir et al., 2015

  • 1. Data Collection E nd Process A lterna tive Stan d a rd P roce dure For Regression Modellin g Performi n g R o bu st Re gression with cas e Resampling Interpret The Output Linear Regression Model R obust R egression with m m E stimation P erforming Weighted for the Regres sion A p p lied Fuzzy Re gression Method to this Alternative Method Middle-East Journal of Scientific Research 23 (10): 2559-2571, 2015 ISSN 1990-9233 © IDOSI Publications, 2015 DOI: 10.5829/idosi.mejsr.2015.23.10.22731 Corresponding Author: Wan Muhamad Amir W. Ahmad, School of Dental Sciences,Health Campus, Universiti Sains Malaysia (USM), 16150 Kubang Kerian, Kelantan, Malaysia. 2569 An Alternative Method for Regression Modelling: Algorithm for Weighted Robust Regression Wan Muhamad Amir W. Ahmad, Nor Azlida Aleng, Nurfadhlina Abdul Halim,1 2 2 Syerrina Zakaria, Kasypi Mokhtar, Ruhaya Hasan and Tan Min Pau2 3 1 4 School of Dental Sciences, Health Campus, Universiti Sains Malaysia (USM),1 16150 Kubang Kerian, Kelantan, Malaysia School of Informatics and Applied Mathematics, School of Maritime Business and Management,2 3 School of Fisheries and Aquaculture Sciences,4 Universiti Malaysia Terengganu (UMT), 21030 Terengganu Malaysia Abstract: This report supplied a comprehensive method of an alternative fuzzy weighted robust linear regression as a technique for analysis through SAS algorithm. This alternative method is a manipulation technique for the small data set and give the researcher an options to launch the analysis even there is not enough data set. Key words: Bootstrap Weighted SAS algorithm and robust regression INTRODUCTION Thus, this paper provides a road map of the practical approach for regression modeling and an illustration using fisheries dataset. The parametric bootstrap method is recommended for sample size between 50 and 100 for a reliable performance [1, 2]. A recent approach to analyse data with missing values in the covariates is weighted estimating equations and this technique appear to be highly efficient [3, 4]. Data of this study is a sample which composed of six variables. Namely variables are as in Table 1. Multiple fuzzy with weighted regression technique was used in the analysis of relationship between variables. The algorithm is given as follows: Figure 1 showed the flow chart of an alternative method of fuzzy weighted regression procedure. Table 1: Description of Data Num. Code Explanation of user variables 1. Y Standard Length of the fish 2. X1 Total Length of the fish 3. X2 Fish Weight (g) 4. X3 Turbidity 5. X4 Temperature 6 X5 Dissolved Oxygen (mg/l) Fig. 1: Flow Chart of an Alternative Analysis
  • 2. Middle-East J. Sci. Res., 23 (10): 2569-2571, 2015 2570 /*Title 'alternatif linear programming with combining /*combining bootstrap technique with fuzzy robust and bootstrap'*/ Data fish; Input y x1 x2 x3 x4 x5; Datalines; 31.00 52.00 2.00 19.70 26.08 5.43 32.00 42.00 2.00 1.420 29.75 3.58 32.00 41.00 2.00 6.100 26.39 7.06 33.00 42.00 1.00 1.420 29.75 3.58 33.00 42.00 2.00 17.80 28.36 6.56 34.00 44.00 2.00 1.420 29.75 3.58 34.00 45.00 10.00 1.730 29.35 3.54 35.00 45.00 1.00 12.63 30.44 4.03 35.00 43.00 1.00 1.420 29.75 3.58 ; run; Ods rtf style=journal; /*performing the procedure of modeling linear regression model*/ Ods graphics on; Proc reg data=fish; Model y=x1 x2 x3 x4 x5; run; Ods graphics off; /*robust regression for mm-estimation*/ Ods graphics on; Proc robustreg method=mm fwls data=fish plots=all; Model y=x1 x2 x3 x4 x5 / diagnostics itprint; Output out=resids out=robout r=residual weight=weight outlier=outlier sr=stdres; run; Ods graphics off; /*using bootstrap with case resampling*/ Proc surveyselect data=fish out=boot1 method=urs samprate=1 outhits rep=100; run; /*robust regression for mm-estimation+bootstrap*/ Ods graphics on; Proc robustreg method=m(wf=huber(c=1.345))fwls data=boot1 plots=all; Model y=x1 x2 x3 x4 x5 / diagnostics itprint; Output out=resids out=robout r=residual weight=weight outlier=outlier sr=stdres; run; Ods graphics off; regression*/ Proc optmodel; Set j= 1..30; Number y{j}, x1{j}, x2{j}, x3{j}, x4{j}, x5{j}; Read data boot1 into [_n_] y x1 x2 x3 x4 x5; /*print y x1 x2 x3 x4 x5*/ Print y x1 x2 x3 x4 x5; Number n init 30; /*total of observation*/ /*decision variable bounded or not bounded*/ Var aw{1..6}>=0; /*these six variables are bounded*/ Var ac{1..6}; /*these six variables are not bounded*/ /*objective function*/ Min z1= aw[1] * n + sum{i in j} x1[i] * aw[2]+sum{i in j} x2[i] * aw[3]+sum{i in j} x3[i] * aw[4]+sum{i in j} x4[i] * aw[5]+sum{i in j} x5[i] * aw[6]; /*linear constraints*/ Con c{i in 1..n}: Ac[1]+x1[i]*ac[2]+x2[i]*ac[3]+x3[i]*ac[4]+x4[i]*ac[5]+x 5[i]*ac[6]-aw[1]-x1[i]*aw[2]-x2[i]*aw[3]-x3[i]*aw[4]- x4[i]*aw[5]-x5[i]*aw[6]<=y[i]; Con c1{i in 1..n}: Ac[1]+x1[i]*ac[2]+x2[i]*ac[3]+x3[i]*ac[4]+x4[i]*ac[5]+x 5[i]*ac[6]+aw[1]+x1[i]*aw[2]+x2[i]*aw[3]+x3[i]*aw[4]+x 4[i]*aw[5]+x5[i]*aw[6]>=y[i]; Expand; /*this provides all equations*/ Solve; Print ac aw; Quit; Ods rtf close; RESULT AND DISCUSSION This paper explained on how an alternative programming method of fuzzy bootstrap weighted regression procedure using SAS software can be applied for the small sample size which the data is very difficult to collect. By resampling (using bootstrap method), it provides the pleminary comprehensive information and also give the general overview on how the data behaviour even though the original data is not enough (small sample size).
  • 3. Middle-East J. Sci. Res., 23 (10): 2569-2571, 2015 2571 In our case, smaller standard error of the estimate 2. Jung, B.C., M. Jhun and J.W. Lee, 2005. Bootstrap parameter will tell us how accurate our estimate parameter tests for overdispersion in a zero-inflated Poisson is likely to be. model. Journal of Biometrics, 61(6): 626-628. REFERENCES Estimation of regression coe_cients when some 1. Cassel, D.L., 2010. Bootstrap Mania: Re sampling the American Statistical Association, 89: 846-866. SAS. SAS Global Forum 2010: Statistics and Data 4. Zhao, L.P., S.R. Lipsitz and D. Lew, 1996. Regression Analysis Paper, pp: 1-11. analysis with missing covariate data using estimating 3. Robin, J.M., A. Rotnitzky and L.P. Zhoa, 1994. regressors are not always observed. Journal of the equations. Biometrics, 52: 1165-1182.