SlideShare a Scribd company logo
1 of 4
Statistical Model to Predict IPO Prices for Semiconductor & Semiconductor
Equipment Industry
Manuel G. Russon, Ph.D.
St. John’s University
Xuanhua “Peter” Yin
St. John’s University
Abstract
In the process of IPO, multiple valuation methods are utilized. Conventionally, issuer valuate
equities using intrinsic and relative valuation methods to determine the value of common equity.
However, now more and more statistical modeling is used in the valuation process. The purpose
of this research is to test multiple variables regarding the performance and book value of firms in
semiconductor & semiconductor equipment industry and create a model to predict IPO price in
the industry. Addition to that, this research will utilize mean absolute percentage error(MAPE) to
determine the degree of fitness rather than R-squared conventionally and test the effect of
dividend in corporation operation in semiconductor & semiconductor equipment industry.
I. Introduction
IPO is a crucial process in a corporation’s life cycle. Through IPO, corporation will be able to
gain the capital it needs for its future growth, and in this process equity pricing is the most
delicate but important procedure. A fair IPO valuation will help the corporation’s success in the
IPO and deliver the capital in need.
Parties, who are interest in the results, are investment bankers, institutional investors and
semiconductor & semiconductor equipment companies.
II. Methodology
+ + ?
P = f(Income Statement, Asset Value, Dividend Rate)
1. Data
The research using cross section data from FactSet in the year 2013. The dependent variable will
be stock price and the independent variables are in three categories, representing the three source
of equity value: book value, profitability and dividend.
i. Book value per(bvps) share will be used, representing book value.
ii. Dividend per sahre(dvps) will be used, representing dividend.
iii. For profitability(isvar), earnings per share(eps), earnings per share forward(epsfd),
earnings per share without extraordinary item(epsxo), cash flow per share(cfps), sales
per share(sps), operating income per share(oincps), and net income per share(nincps),
will be used.
2. Techniques
This research will use histogram, scatter plots and sequence chart to rule out outliers and
descriptive statistics and correlation matrix for multicollinearity. And use regression analysis,
linear and gam model, to come up with the prediction model.
Importantly, this research will use R-squared and MAPE as criteria to select the optimum model.
Conventionally, R-squared, the coefficient of determination is used to evaluation the quality of a
regression model. But R-squared is highly sensitive to outliers, which in a lot of situations affects
the quality of R-squared. Both R-squared and MAPE will be used in the research and conclusion
will be drawn based on both.
The mean absolute percentage error(MAPE), displayed in Eqn. 1 below, also known as mean
absolute percentage deviation(MAPD), is a measure of prediction accuracy of forecasting
method in statistics.
𝑀 =
100
𝑛
∑ |
𝐴 𝑡−𝐹𝑡
𝐴 𝑡
|𝑛
𝑡=1 (1)
Where: At is the actual value
Ft is the predicted value
n is the sample size
3. Model
Eqn 2, 3 and 4 display the functional specification, population regression line and sample
regression line in the estimation of price.
+ + +
Price = f(bvps, dvps, isvar) (2)
Price = α+β1*bvps+ β2*dvps+ β3*isvar (3)
Price = α+b1*bvps+ b2*dvps+ b3*isvar (4)
III. Results
Table 1 displays the descriptive statistics of variables tested.
Table 1 Descriptive Statistics
vars n mean sd
media
n skew
kurtosi
s
price 1
5
9 25.33 17.22 21.75 0.71 -0.69
cfps 2
3
9 1.29 1.24 1.17 0.49 0.08
eps 3
5
7 0.49 1.35 0.49 -0.23 0.84
epsfd 4
5
7 0.47 1.33 0.48 -0.27 0.91
epsxo 5
5
9 0.47 1.31 0.48 -0.27 1.07
bvps 6
4
1 11.48 7.93 9.98 1.97 5.97
sps 8
5
9 8.71 5.73 7.45 1.62 3.95
dvps 9
5
9 23.5 45.08 0 2.12 3.8
oincp
s 10
5
9 0.81 1.42 0.6 0.5 0.02
nincp
s 11
5
8 0.52 1.28 0.52 -0.14 0.45
Table 2 displays correlation matrix of all variables.
Table 2 Correlation Matrix
pric
e
cfp
s eps
epsf
d
epsx
o bvps sps dvps
oincp
s nincps
price 1
0.8
6 0.71 0.71 0.71 0.77
0.6
7 0.49 0.69 0.71
cfps 0.86 1 0.79 0.79 0.79 0.71 0.7 0.55 0.84 0.8
eps 0.71
0.7
9 1 1 1 0.4 0.5 0.45 0.95 1
epsfd 0.71
0.7
9 1 1 1 0.4
0.4
9 0.46 0.95 1
epsx
o 0.71
0.7
9 1 1 1 0.4
0.4
9 0.46 0.95 1
bvps 0.77
0.7
1 0.4 0.4 0.4 1
0.7
7 0.1 0.39 0.4
sps 0.67 0.7 0.5 0.49 0.49 0.77 1 0.05 0.55 0.5
dvps 0.49
0.5
5 0.45 0.46 0.46 0.1
0.0
5 1 0.53 0.46
oincp
s 0.69
0.8
4 0.95 0.95 0.95 0.39
0.5
5 0.53 1 0.96
nincp
s 0.71 0.8 1 1 1 0.4 0.5 0.46 0.96 1
Table 3 displays regression results.
Table 3 RegressionResults
Coefficienta t-stat
Int. income var bvps dvps R-sq Int. income var bvps dvps SumOfRed mape
cfps 3.3 6.4 1.3 0.1 0.82 1.1 3 3.4 1.9 210.1 0.9
eps 8.6 4.3 1.2 0.1 0.74 3 2.9 5.2 3.3 278.8 0.1
epsfd 8.6 4.4 1.2 0.1 0.74 3 2.8 5.2 3.3 279.8 0.1
epsxo 8.6 4.4 1.2 0.1 0.74 3 2.9 5.2 3.3 279.6 0.3
sps 5.5 0.4 1.3 0.2 0.69 1.7 0.7 3.2 4.8 309 0.2
oincps 7.6 3.2 1.2 0.1 0.72 2.6 2.2 5.2 3.1 289.1 0.5
nincps 8.5 4.5 1.2 0.1 0.74 3 2.9 5.3 3.2 279.5 0.1
IV. Conclusions
The research determined the gam model and cash flow per share is the best variable from income
statement are the optimum for semiconductor & semiconductor equipment industry. Another
noticeable result is the t-statistics of dividend per share is significant. Further research can be
conducted with different regression models.

More Related Content

Similar to Statistical Model to Predict IPO Prices for Semiconductor

PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...Hari KC
 
Capstone Project - Nicholas Imholte - Final Draft
Capstone Project - Nicholas Imholte - Final DraftCapstone Project - Nicholas Imholte - Final Draft
Capstone Project - Nicholas Imholte - Final DraftNick Imholte
 
International journal of applied sciences and innovation vol 2015 - no 2 - ...
International journal of applied sciences and innovation   vol 2015 - no 2 - ...International journal of applied sciences and innovation   vol 2015 - no 2 - ...
International journal of applied sciences and innovation vol 2015 - no 2 - ...sophiabelthome
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with RYanchang Zhao
 
Fpe 90min-all
Fpe 90min-allFpe 90min-all
Fpe 90min-allwenchyan
 
CFM Challenge - Course Project
CFM Challenge - Course ProjectCFM Challenge - Course Project
CFM Challenge - Course ProjectKhalilBergaoui
 
2014-mo444-practical-assignment-04-paulo_faria
2014-mo444-practical-assignment-04-paulo_faria2014-mo444-practical-assignment-04-paulo_faria
2014-mo444-practical-assignment-04-paulo_fariaPaulo Faria
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.pptTanyaWadhwani4
 
Regression Analysis of NBA Points Final
Regression Analysis of NBA Points  FinalRegression Analysis of NBA Points  Final
Regression Analysis of NBA Points FinalJohn Michael Croft
 
OPIM 5604 predictive modeling presentation group7
OPIM 5604 predictive modeling presentation group7OPIM 5604 predictive modeling presentation group7
OPIM 5604 predictive modeling presentation group7Shu-Feng Tsao
 
Forecasting Stock Market using Multiple Linear Regression
Forecasting Stock Market using Multiple Linear RegressionForecasting Stock Market using Multiple Linear Regression
Forecasting Stock Market using Multiple Linear Regressionijtsrd
 
X Bar And S Charts Mini Tutorial
X Bar And S Charts Mini TutorialX Bar And S Charts Mini Tutorial
X Bar And S Charts Mini Tutorialahmad bassiouny
 
Statistics project2
Statistics project2Statistics project2
Statistics project2shri1984
 
Monte carlo-simulation
Monte carlo-simulationMonte carlo-simulation
Monte carlo-simulationjaimarbustos
 
7 QC quality control (7 QC) tools for continuous improvement of manufacturing...
7 QC quality control (7 QC) tools for continuous improvement of manufacturing...7 QC quality control (7 QC) tools for continuous improvement of manufacturing...
7 QC quality control (7 QC) tools for continuous improvement of manufacturing...Chandan Sah
 

Similar to Statistical Model to Predict IPO Prices for Semiconductor (20)

PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
 
Capstone Project - Nicholas Imholte - Final Draft
Capstone Project - Nicholas Imholte - Final DraftCapstone Project - Nicholas Imholte - Final Draft
Capstone Project - Nicholas Imholte - Final Draft
 
report
reportreport
report
 
Case Study of Petroleum Consumption With R Code
Case Study of Petroleum Consumption With R CodeCase Study of Petroleum Consumption With R Code
Case Study of Petroleum Consumption With R Code
 
International journal of applied sciences and innovation vol 2015 - no 2 - ...
International journal of applied sciences and innovation   vol 2015 - no 2 - ...International journal of applied sciences and innovation   vol 2015 - no 2 - ...
International journal of applied sciences and innovation vol 2015 - no 2 - ...
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with R
 
Fpe 90min-all
Fpe 90min-allFpe 90min-all
Fpe 90min-all
 
CFM Challenge - Course Project
CFM Challenge - Course ProjectCFM Challenge - Course Project
CFM Challenge - Course Project
 
2014-mo444-practical-assignment-04-paulo_faria
2014-mo444-practical-assignment-04-paulo_faria2014-mo444-practical-assignment-04-paulo_faria
2014-mo444-practical-assignment-04-paulo_faria
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
FSRM 582 Project
FSRM 582 ProjectFSRM 582 Project
FSRM 582 Project
 
Regression Analysis of NBA Points Final
Regression Analysis of NBA Points  FinalRegression Analysis of NBA Points  Final
Regression Analysis of NBA Points Final
 
OPIM 5604 predictive modeling presentation group7
OPIM 5604 predictive modeling presentation group7OPIM 5604 predictive modeling presentation group7
OPIM 5604 predictive modeling presentation group7
 
Forecasting Stock Market using Multiple Linear Regression
Forecasting Stock Market using Multiple Linear RegressionForecasting Stock Market using Multiple Linear Regression
Forecasting Stock Market using Multiple Linear Regression
 
X Bar And S Charts Mini Tutorial
X Bar And S Charts Mini TutorialX Bar And S Charts Mini Tutorial
X Bar And S Charts Mini Tutorial
 
Statistics project2
Statistics project2Statistics project2
Statistics project2
 
Monte carlo-simulation
Monte carlo-simulationMonte carlo-simulation
Monte carlo-simulation
 
Spc training
Spc training Spc training
Spc training
 
7 QC quality control (7 QC) tools for continuous improvement of manufacturing...
7 QC quality control (7 QC) tools for continuous improvement of manufacturing...7 QC quality control (7 QC) tools for continuous improvement of manufacturing...
7 QC quality control (7 QC) tools for continuous improvement of manufacturing...
 
P Chart Tutorial
P Chart TutorialP Chart Tutorial
P Chart Tutorial
 

Statistical Model to Predict IPO Prices for Semiconductor

  • 1. Statistical Model to Predict IPO Prices for Semiconductor & Semiconductor Equipment Industry Manuel G. Russon, Ph.D. St. John’s University Xuanhua “Peter” Yin St. John’s University Abstract In the process of IPO, multiple valuation methods are utilized. Conventionally, issuer valuate equities using intrinsic and relative valuation methods to determine the value of common equity. However, now more and more statistical modeling is used in the valuation process. The purpose of this research is to test multiple variables regarding the performance and book value of firms in semiconductor & semiconductor equipment industry and create a model to predict IPO price in the industry. Addition to that, this research will utilize mean absolute percentage error(MAPE) to determine the degree of fitness rather than R-squared conventionally and test the effect of dividend in corporation operation in semiconductor & semiconductor equipment industry. I. Introduction IPO is a crucial process in a corporation’s life cycle. Through IPO, corporation will be able to gain the capital it needs for its future growth, and in this process equity pricing is the most delicate but important procedure. A fair IPO valuation will help the corporation’s success in the IPO and deliver the capital in need. Parties, who are interest in the results, are investment bankers, institutional investors and semiconductor & semiconductor equipment companies. II. Methodology + + ? P = f(Income Statement, Asset Value, Dividend Rate) 1. Data The research using cross section data from FactSet in the year 2013. The dependent variable will be stock price and the independent variables are in three categories, representing the three source of equity value: book value, profitability and dividend. i. Book value per(bvps) share will be used, representing book value. ii. Dividend per sahre(dvps) will be used, representing dividend.
  • 2. iii. For profitability(isvar), earnings per share(eps), earnings per share forward(epsfd), earnings per share without extraordinary item(epsxo), cash flow per share(cfps), sales per share(sps), operating income per share(oincps), and net income per share(nincps), will be used. 2. Techniques This research will use histogram, scatter plots and sequence chart to rule out outliers and descriptive statistics and correlation matrix for multicollinearity. And use regression analysis, linear and gam model, to come up with the prediction model. Importantly, this research will use R-squared and MAPE as criteria to select the optimum model. Conventionally, R-squared, the coefficient of determination is used to evaluation the quality of a regression model. But R-squared is highly sensitive to outliers, which in a lot of situations affects the quality of R-squared. Both R-squared and MAPE will be used in the research and conclusion will be drawn based on both. The mean absolute percentage error(MAPE), displayed in Eqn. 1 below, also known as mean absolute percentage deviation(MAPD), is a measure of prediction accuracy of forecasting method in statistics. 𝑀 = 100 𝑛 ∑ | 𝐴 𝑡−𝐹𝑡 𝐴 𝑡 |𝑛 𝑡=1 (1) Where: At is the actual value Ft is the predicted value n is the sample size 3. Model Eqn 2, 3 and 4 display the functional specification, population regression line and sample regression line in the estimation of price. + + + Price = f(bvps, dvps, isvar) (2) Price = α+β1*bvps+ β2*dvps+ β3*isvar (3) Price = α+b1*bvps+ b2*dvps+ b3*isvar (4) III. Results Table 1 displays the descriptive statistics of variables tested. Table 1 Descriptive Statistics vars n mean sd media n skew kurtosi s
  • 3. price 1 5 9 25.33 17.22 21.75 0.71 -0.69 cfps 2 3 9 1.29 1.24 1.17 0.49 0.08 eps 3 5 7 0.49 1.35 0.49 -0.23 0.84 epsfd 4 5 7 0.47 1.33 0.48 -0.27 0.91 epsxo 5 5 9 0.47 1.31 0.48 -0.27 1.07 bvps 6 4 1 11.48 7.93 9.98 1.97 5.97 sps 8 5 9 8.71 5.73 7.45 1.62 3.95 dvps 9 5 9 23.5 45.08 0 2.12 3.8 oincp s 10 5 9 0.81 1.42 0.6 0.5 0.02 nincp s 11 5 8 0.52 1.28 0.52 -0.14 0.45 Table 2 displays correlation matrix of all variables. Table 2 Correlation Matrix pric e cfp s eps epsf d epsx o bvps sps dvps oincp s nincps price 1 0.8 6 0.71 0.71 0.71 0.77 0.6 7 0.49 0.69 0.71 cfps 0.86 1 0.79 0.79 0.79 0.71 0.7 0.55 0.84 0.8 eps 0.71 0.7 9 1 1 1 0.4 0.5 0.45 0.95 1 epsfd 0.71 0.7 9 1 1 1 0.4 0.4 9 0.46 0.95 1 epsx o 0.71 0.7 9 1 1 1 0.4 0.4 9 0.46 0.95 1 bvps 0.77 0.7 1 0.4 0.4 0.4 1 0.7 7 0.1 0.39 0.4
  • 4. sps 0.67 0.7 0.5 0.49 0.49 0.77 1 0.05 0.55 0.5 dvps 0.49 0.5 5 0.45 0.46 0.46 0.1 0.0 5 1 0.53 0.46 oincp s 0.69 0.8 4 0.95 0.95 0.95 0.39 0.5 5 0.53 1 0.96 nincp s 0.71 0.8 1 1 1 0.4 0.5 0.46 0.96 1 Table 3 displays regression results. Table 3 RegressionResults Coefficienta t-stat Int. income var bvps dvps R-sq Int. income var bvps dvps SumOfRed mape cfps 3.3 6.4 1.3 0.1 0.82 1.1 3 3.4 1.9 210.1 0.9 eps 8.6 4.3 1.2 0.1 0.74 3 2.9 5.2 3.3 278.8 0.1 epsfd 8.6 4.4 1.2 0.1 0.74 3 2.8 5.2 3.3 279.8 0.1 epsxo 8.6 4.4 1.2 0.1 0.74 3 2.9 5.2 3.3 279.6 0.3 sps 5.5 0.4 1.3 0.2 0.69 1.7 0.7 3.2 4.8 309 0.2 oincps 7.6 3.2 1.2 0.1 0.72 2.6 2.2 5.2 3.1 289.1 0.5 nincps 8.5 4.5 1.2 0.1 0.74 3 2.9 5.3 3.2 279.5 0.1 IV. Conclusions The research determined the gam model and cash flow per share is the best variable from income statement are the optimum for semiconductor & semiconductor equipment industry. Another noticeable result is the t-statistics of dividend per share is significant. Further research can be conducted with different regression models.