SlideShare a Scribd company logo
1 of 9
Download to read offline
International Journal of Statistics and Probability; Vol. 4, No. 2; 2015
ISSN 1927-7032 E-ISSN 1927-7040
Published by Canadian Center of Science and Education
123
Assessing Relative Importance Using RSP Scoring to Generate
Variable Importance Factor (VIF)
Daniel Koh1
1
School of Business, SIM University, Singapore
Correspondence: Daniel Koh, School of Business, SIM University, 461 Clementi Road, 599 491, Singapore. Tel:
65-6248-9746. E-mail: danielkoh005@unisim.edu.sg
Received: March 17, 2015 Accepted: April 8, 2015 Online Published: April 27, 2015
doi:10.5539/ijsp.v4n2p123 URL: http://dx.doi.org/10.5539/ijsp.v4n2p123
Abstract
Previous research has shown that the construction of VIF is challenging. Some researchers have sought to use
orderly contribution of R2
(coefficient of determination) as measurement for relative importance of variable in a
model, while others have sought the standardized parameter estimates b (beta) instead. These contributions have
been proven to be very valuable to the literature. However, there is a lack of study in combining key properties
of variable importance into one composite score. For example, an intuitive understanding of variable importance
is by scoring reliability, significance and power (RSP) of it in the model. Thereafter the RSP scores can be
aggregated together to form a composite score that reflects VIF. In this paper, the author seeks to prove the
usefulness of the DS methodology. DS stands for Driverβ€Ÿs Score and is defined as the relative, practical
importance of a variable based on RSP scoring. An industry data was used to generate DS for practical example
in this paper. This DS is then translated into a 2x6 matrix where level of importance (LxI) is generated. The final
outcome of this paper is to discuss the use of RSP scoring methodology, theoretical and practical use of DS and
the possible future research that entails this paper. DS methodology is new to the existing literature.
Keywords: variable importance, decomposition of variances, RSP Scoring, multiple linear regression
1. Introduction
In recent history, much effort has been given to the study of variable importance factor (VIF). Researchers have
approached this topic in several manners: taking the increase of R2
– coefficient of determination - as the
usefulness of the regressors (Darlington, 1968), squared standardized coefficients and products of standardized
coefficients with marginal correlations (Hoffman, 1960; Hooker & Yule, 1906), LMG method of using sequential
sums of squares from the linear model (Lindeman, Merenda, & Gold, 1980), conditional variable importance for
Random Forest (Strobl, Boulesteix, Kneib, Augustin, & Zeileis, 2008), the averaging method of variance
decomposition (Kruskal, 1987; Chevan & Sutherland, 1991) and proportional marginal variance decomposition.
However, most of the studies are founded on one dimension. Several authors which include Ehrenberg (1990),
Stufken (1992), and Christensen (1992) have dismissed the usefulness and benefits of relative importance
measure. The premise of this dismissal was that the decomposition of coefficient of determination is too
simplistic and it is difficult to tease out relative importance among correlated variables which could potentially
β€œdouble count” in the model. In this paper, the focus is in the independent measurement of relative importance
and the discussion on teasing out interrelatedness between independent variables is deferred.
The decomposition of coefficient of determination becomes a powerful tool when complementary scorings are
given to improve accuracy in understanding relative importance of variables. Hence, this paper seeks to suggest
a new method to assess relative importance of variable by considering reliability, significance and power, to the
end that the final composite scores reflect the intuitive, practical understanding of relative importance better.
Reliability is defined here as the invert of the sum of residual errors between the predicted and actual values.
Significance is defined here as the heterogeneity of groupings with homogeneity in within-group distances or
maximized distances between predicted values with expected mean of dependent variable and minimized
distances between predicted values with actual values. Power is defined here as the positivity of the slope of the
estimates, with greater or steeper slope leading to greater power. The intention is to develop a score that not only
accounts for variance decomposition, but also practical meaningfulness and accuracy in utilizing a variable. It
also accounts for the goodness of a predictor by scoring the standardized parameter estimates of the variable in
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
124
the model. The DS scoring methodology is new to our existing literature today.
2. Residual Errors as First Property
One aspect of a good predictor is its minimized residual error values, which then contribute to a strong
coefficient of determination, R2
. When residual errors are minimized, the chance of making mispredictions
becomes lesser, leading to greater reliability for the predictor. A practical example would be the reliability of age
in understanding income earnings. Between the independent variable age and gender, age is chosen to be the
better predictor for income earnings because gender may not contribute as much between-group residual errors
as compared to the sum of squares of model (SSM) for age. This is particularly true when a meritocratic society
promotes merit for work experiences, of which only age most likely correlates positively with it strongly,
regardless of gender type. Intuitively, the concept of reliability lies on the confidence of which one can get when
the model is put to test. The scores can be decomposed into respondent level, whereby each respondent is given
three scores for RSP, leading up to the final scoring of DS.
Hence, the first function of DS is the invert of residual errors, which is first expressed in the following
mathematical expression for multiple linear regression:
Ε· = π‘Ž0 + π‘Ž1 π‘₯1 + π‘Ž2 π‘₯2 + … + π‘Ž 𝑛 π‘₯ 𝑛 + πœ€
(1)
where π‘₯ ∈ ℝ denotes the regressor, 𝑦̂ ∈ ℝ denotes the dependent variable, π‘Ž denotes the parameter
estimate and πœ€ denotes the error term of the model.
A series of x is fitted into the model, generating a series of predicted values, ŷ. The residual for this model - 𝜁 -
is then expressed as the absolute difference between the actual values of the dependent variable which is denoted
by 𝑦 and the predicted values 𝑦̂.
𝜁 = |𝑦 βˆ’ Ε·|
(2)
The residual is then inverted to form common directionality with two other functions of the RSP framework.
𝜌 =
1
𝜁
=
1
|𝑦 βˆ’ Ε·|
(3)
The residual is then fitted onto the Gaussianβ€Ÿs cumulative distribution function - Ξ¦(𝜌) - , assuming that the
variable is independent and identically distributed (I.I.D) under the Normal distribution, 𝑋 ~ 𝑁(0,1):
Φ(𝜌) =
1
𝜍√2πœ‹
∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌)
2
/ 2𝜍 𝜌
2
π‘‘πœŒ
𝜌
min 𝜌
(4)
where 𝜌 denotes the inverse of residual error, πœ‡ 𝜌 denotes the mean of the residual errors, 𝜍𝜌 denotes the
standard deviation of the residual error.
The first function of DS is improved when 𝜍2
𝜌 is decreased: reliability increases when data are less sparsely
distributed. This cumulative distribution function serves as a score for reliability on the observation or
respondent level. The use of the magnitude of 𝑅2
contributions to assess relative importance was similarly
proposed by Hoffman (1960) and later defended by Pratt (1987).
3. F-Ratios of the Residual Errors as Second Property
While the residual errors are expected to be minimized, the significance of the residual errors is expected to be
maximized. The motivation behind this property is to obtain scores that reflect variableβ€Ÿs distinctiveness among
inter-groups through a study of variance ratio. For example, if a variable πœ† has unique and distinctive πœ…
groupings in understanding income earning levels of a country, the F-Ratios due to πœ† should be greater as
compared to those that have more homogenous groupings.
The residual error in the first function is preferred over observations because the variance of residual errors
(variance study of residual errors) is expected to reflect distinctiveness between groups better if they are truly
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
125
distinctive and unique than the observation values themselves. For example, when income is predicted using Age,
the error between πœ… groupings in πœ† should have distinctive noises. This β€œdistinctive noises” should
characterize their identity as unique groupings in the model. This is true for variables that are categorical, which
will partition the data into kth groupings. Hence, the second function of DS has an inverse relationship with the
first function of DS.
The second function of DS - 𝜌 - is expressed in the following mathematical expression for a linear regression
model:
𝐹𝜌 =
π‘‰π‘Žπ‘Ÿπ‘–π‘Žπ‘›π‘π‘’π‘  𝑑𝑒𝑒 π‘‘π‘œ π‘€π‘œπ‘‘π‘’π‘™
π‘‰π‘Žπ‘Ÿπ‘–π‘Žπ‘›π‘π‘’π‘  𝑑𝑒𝑒 π‘‘π‘œ π‘…π‘’π‘ π‘–π‘‘π‘’π‘Žπ‘™
(5)
𝐹𝜌 =
βˆ‘(𝑦̂ βˆ’ 𝑦̅)2
(𝐾 βˆ’ 1)
βˆ‘(𝑦 βˆ’ 𝑦̂)2
(𝑁 βˆ’ 𝐾)
(6)
𝐹𝜌 =
βˆ‘(𝑦̂ βˆ’ 𝑦̅)2
βˆ‘(𝑦 βˆ’ 𝑦̂)2
(
𝑁 βˆ’ 𝐾
𝐾 βˆ’ 1
*
(7)
where 𝐾 is the number of groupings, 𝑦̂ denotes predicted value of 𝑦, and 𝑦̅ denotes the average value of 𝑦,
N denotes the sample size.
The F-Ratios are then fitted into Fisherβ€Ÿs CDF, which is the integral of the PDF of F-distribution, assuming that
the variables are independent and identically distributed (I.I.D) under the Fisherβ€Ÿs distribution:
Ξ¦Fρ(Fρ,πΎβˆ’1,π‘βˆ’πΎ) = ∫
(
πΎβˆ’1
π‘βˆ’πΎ
)
πΎβˆ’1
2
Fρ
Kβˆ’1
2
βˆ’ 1
(1+
Kβˆ’1
Nβˆ’K
Fρ)
βˆ’
Nβˆ’1
2
[
*
1
2
(π‘βˆ’πΎ)+!*
1
2
(πΎβˆ’1)+!
*
1
2
(π‘βˆ’πΎ)++*
1
2
(πΎβˆ’1)+βˆ’ 1
]
Fρ
0
(8)
Φϱ = 1 βˆ’ ∫
(
𝐾 βˆ’ 1
𝑁 βˆ’ 𝐾
)
πΎβˆ’1
2
Fρ
Kβˆ’1
2
βˆ’ 1
(1 +
K βˆ’ 1
N βˆ’ K
Fρ)
βˆ’
Nβˆ’1
2
[
*
1
2
(𝑁 βˆ’ 𝐾)+ ! *
1
2
(𝐾 βˆ’ 1)+ !
*
1
2
(𝑁 βˆ’ 𝐾)+ + *
1
2
(𝐾 βˆ’ 1)+ βˆ’ 1
]
Fρ
0
(9)
As F-ratio 𝐹𝜌 considers residual errors 𝜁, the Fisherβ€Ÿs CDF Ξ¦Fρ(π‘₯,πΎβˆ’1,π‘βˆ’πΎ) is reversed (1 βˆ’ Ξ¦Fρ(Fρ,πΎβˆ’1,π‘βˆ’πΎ))
to generate significance score Φϱ, with greater residual errors leading to lower probability values.
This arrangement allows the decomposition of F-Ratios to the observation level, where each observation is
assigned an F-Ratio value. This cumulative distribution function which follows the Fisherβ€Ÿs Distribution serves
as a score for Significance on the observation or respondent level.
Significance is an important property in DS. If income earnings are to be understood by gender and dwelling
type of individuals, the latter variable provides greater noises in residual error as the separating groups in the
factor create more β€žnoisesβ€Ÿ than the variable gender. Or the noises between age and gender are significantly
different as the residual errors due to variable Age may contain more noises than the variable gender. If the
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
126
distributions of errors among groupings are similar under the F-distribution, then the factor is less significant for
use as the factor exhibits homogeneity in variances across all groups. Hence, the significance of a variable relies
on the distinctiveness of errors between groups in factors, with homogeneity for within groups, or greater
distances from sample mean, with lower distances from the model.
4. Standardized Regression Coefficients as Third Property
The third and final function of DS is the standardized parameter estimate of the regressor. This is commonly
known as the slope of the curve. In a linear regression, the slope of the curve is observed by the parameter
estimates of the variable π‘Ž of the model. However, the use of unstandardized slope of the curve is not
appropriate when scales of different variables are different. Hence, the power of regressors – or β€žsteepnessβ€Ÿ – is
then observed by the standardized parameter estimate 𝑏 of the model. This standardized estimate reduces the
metrical scale to a common vector across all regressors. It is expressed through the following mathematical
expression:
𝑦̇̂ = 𝑏̇1 πœƒ1 + 𝑏̇2 πœƒ2 + … + 𝑏̇ π‘š πœƒ π‘š + πœ€
(10)
𝑏 π‘š
Μ‡ = 𝑏 π‘š (
𝜍π‘₯ π‘š
𝜍 𝑦
)
(11)
πœƒπ‘– denotes the standardized values of the predictor π‘₯, 𝑦̇̂ denotes the predicted response variable from
standardized predictors, 𝑏̇ π‘š denotes the standardized regression coefficient, 𝑏 π‘š denotes the unstandardized
regression coefficient, 𝜍π‘₯ denotes the standard deviation of the predictor and 𝜍 𝑦 denotes the standard deviation
of the response variable.
The absolute standardized regression coefficient – |𝑏 π‘š
Μ‡ | – is then converted into a ratio out of the sum of all
absolute standardized coefficients, with 𝑛 number of parameters in a model:
πœ‚ 𝑗 =
|𝑏𝑗
Μ‡ |
βˆ‘ |𝑏𝑗
Μ‡ |π‘š
𝑗=1
(12)
where π‘š denotes the total number of parameter estimates in the model, 𝑏𝑗
Μ‡ denotes the parameter estimate of
interest.
While relative importance measure considers reliability and significance, power is still necessary for the
understanding of relative importance of a variable in the model. When a powerful variable has low reliability and
significance, a low DS score is expected. If reliability and significance scores are high, but power score is low,
the DS still remains effective as the consideration that arises from the combined importance of reliability and
significance outweigh the consideration for power. Intuitively, power or standardized regression coefficient is not
the sole consideration of relative importance of a variable in the model but a combination of reliability,
significance and power.
5. Driver’s Score (DS)
Driverβ€Ÿs Score (DS) is an aggregated score of three properties: reliability, significance and power. It reflects the
relative importance of a variable in the model practically. At the observation level, the Driversβ€Ÿ Score (𝛿) is the
geometric mean of these three properties. The mathematical expression is expressed as:
𝛿 = √(
1
𝜍√2πœ‹
∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌)
2
/ 2𝜍 𝜌
2
π‘‘πœŒ
𝜌
min 𝜌
. (1 βˆ’ ∫
(
πΎβˆ’1
π‘βˆ’πΎ
)
πΎβˆ’1
2
Fρ
Kβˆ’1
2
βˆ’ 1
(1+
Kβˆ’1
Nβˆ’K
Fρ)
βˆ’
Nβˆ’1
2
[
*
1
2
(π‘βˆ’πΎ)+!*
1
2
(πΎβˆ’1)+!
*
1
2
(π‘βˆ’πΎ)++*
1
2
(πΎβˆ’1)+βˆ’ 1
]
Fρ
0
) .
|𝑏 𝑗̇ |
βˆ‘ |𝑏 𝑗̇ |π‘š
𝑗=1
)
3
(13)
As observed in the DS equation, the DS methodology is the cube-root of the ratio of three products to the sum of
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
127
all absolute standardized regression coefficient estimates. It is a function of Reliability (Gaussianβ€Ÿs CDF which
has an exponential function), Significance (Fisherβ€Ÿs CDF which has a function of the hypergeometric function
for 𝐹𝜌 by Beta function for 𝐹𝜌) and the absolute standardized regression coefficient of the variable of interest
and an inverse function of the sum of all absolute standardized regression coefficient estimates. The decrease in
other absolute standardized parameter estimates increases DS. This understanding has an important and practical
value: the variable of interest in which it is positioned with other variables is influenced by the mix of the model.
When DS is low, either two of the three properties or all properties are low, resulting to weaker driversβ€Ÿ influence.
When DS is high, either two of the three properties or all properties are high, resulting to stronger driversβ€Ÿ
influence. DS is bounded by a lower limit and an upper limit of 0 and 1 respectively.
At the regressor level, the Driversβ€Ÿ Score (DS) is the arithmetic mean of all the geometric mean of all three
properties at the observation level:
𝛿̅ =
βˆ‘
( √(
1
𝜍√2πœ‹
∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌)
2
/ 2𝜍 𝜌
2
π‘‘πœŒ
𝜌
min 𝜌 .
(
1βˆ’ ∫
(
πΎβˆ’1
π‘βˆ’πΎ
)
πΎβˆ’1
2
Fρ
Kβˆ’1
2
βˆ’ 1
(1+
Kβˆ’1
Nβˆ’K
Fρ)
βˆ’
Nβˆ’1
2
[
*
1
2
(π‘βˆ’πΎ)+!*
1
2
(πΎβˆ’1)+!
*
1
2
(π‘βˆ’πΎ)++*
1
2
(πΎβˆ’1)+βˆ’ 1
]
Fρ
0
)
.|𝑏 𝑗̇ |
)
3
)
βˆ‘ |𝑏 𝑗̇ |π‘š
𝑗=1 .𝑛
(14)
When DS at the regressor level is low, relative influence on the dependent variable in the model is low. When DS
at the regressor level is high, relative influence on the dependent variable in the model is high. DS is bounded by
a lower limit and an upper limit of 0 and 1 respectively.
6. Weighted Drivers’ Score (wDS)
The weighted Driversβ€Ÿ Score (wDS) is a composite score of three properties (RSP), being weighted individually
by their importance in the DS framework. For example, the score for reliability (R) is far more important than
the score for significance (S) and power (P). Hence, the R score is given heavier weights than the rest. The wDS
- 𝛿 𝑀 - can be expressed mathematically by assigning power to the properties:
𝛿 𝑀 =
(
1
𝜍√2πœ‹
∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌)
2
/ 2𝜍 𝜌
2
π‘‘πœŒ
𝜌
min 𝜌 )
w1
.
(
1βˆ’ ∫
(
πΎβˆ’1
π‘βˆ’πΎ
)
πΎβˆ’1
2
Fρ
Kβˆ’1
2
βˆ’ 1
(1+
Kβˆ’1
Nβˆ’K
Fρ)
βˆ’
Nβˆ’1
2
[
*
1
2
(π‘βˆ’πΎ)+!*
1
2
(πΎβˆ’1)+!
*
1
2
(π‘βˆ’πΎ)++*
1
2
(πΎβˆ’1)+βˆ’ 1
]
Fρ
0
)
𝑀2
. |𝑏 𝑗̇ |
𝑀3
(βˆ‘ |𝑏 𝑗̇ |π‘š
𝑗=1 )
𝑀3 ,
𝑀1 + 𝑀2 + 𝑀3 = 1 (15)
At the regressor level, the weighted Driversβ€Ÿ Score which is relative importance of a variable in the model after
adding weights is the arithmetic mean of all three properties at the observation level:
𝛿̅ 𝑀 =
βˆ‘
(
(
1
𝜍√2πœ‹
∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌)
2
/ 2𝜍 𝜌
2
π‘‘πœŒ
𝜌
min 𝜌 )
w1
.
(
1βˆ’ ∫
(
πΎβˆ’1
π‘βˆ’πΎ
)
πΎβˆ’1
2
Fρ
Kβˆ’1
2
βˆ’ 1
(1+
Kβˆ’1
Nβˆ’K
Fρ)
βˆ’
Nβˆ’1
2
[
*
1
2
(π‘βˆ’πΎ)+!*
1
2
(πΎβˆ’1)+!
*
1
2
(π‘βˆ’πΎ)++*
1
2
(πΎβˆ’1)+βˆ’ 1
]
Fρ
0
)
𝑀2
. |𝑏 𝑗̇ |
𝑀3
)
(βˆ‘ |𝑏 𝑗̇ |π‘š
𝑗=1 )
𝑀3
.𝑛
(16)
7. Application of DS
Using data from a Multinational Corporation (MNC) that is based in Singapore (Note 1), the spread of DS of 4
predictors and itsβ€Ÿ response variable are:
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
128
Figure 1. DS of v1 to v4 on dependent variable
Based on insights that were gathered from professionals in the industry, v1 has the greatest impact or driving
force in understanding the dependent variable. This impact or driving force is understood under the framework
of the RSP model. To garner reliable, significant and powerful dependent variable, extra care must be given to
v1.
As it is observed from the chart above (Figure 1), the relationship between DS and its dependent variable takes
on the inverse exponential function. This would explicitly highlight that the observations of RSP is often the
inverse of the dependent variable. The tendency to mispredict is higher when dependent variable or its set of
products gets too complex. In this area, the DS model has intuitively performed well.
Figure 2. Scatter-plots of DS with Dependent Variable
The plots in figure 2 show the area under the curve, which represents the magnitude of DS. V1 has the largest
area under the curve.
8. VIF and Its Decomposition
The relative importance of 𝑗 variable - 𝛿̅ - is the arithmetic mean of summation of all 𝛿 in 𝑗 variable:
0
10
20
30
40
50
0.000 0.100 0.200 0.300 0.400 0.500 0.600 0.700
Driver's Scores
Driver's Scores with Dependent Variable
v1 v2 v3 v4
0
10
20
30
40
50
0.00
0.09
0.12
0.14
0.18
0.20
0.22
0.25
0.28
0.32
0.35
0.39
0.43
0.48
0.54
Driver's Scores (DS)
v1 DS
0
10
20
30
40
50
0.00
0.21
0.20
0.27
0.31
0.39
0.32
0.29
0.27
0.26
0.47
0.32
0.26
0.35
0.40
0.31
Driver's Scores (DS)
v2 DS
0
10
20
30
40
50
0.00
0.09
0.08
0.05
0.12
0.07
0.22
0.21
0.12
0.11
0.16
0.14
0.18
0.12
0.23
0.14
Driver's Scores (DS)
v3 DS
0
10
20
30
40
50
0.03
0.20
0.15
0.23
0.30
0.47
0.38
0.37
0.29
0.29
0.18
0.28
0.14
0.20
0.10
0.14
Driver's Scores (DS)
v4 DS
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
129
𝛿𝑖,𝑗 = √Φ(𝜌 𝑖,𝑗). Ξ¦Fρ(Fρ,πΎβˆ’1,π‘βˆ’πΎ) . πœ‚ 𝑗
3
(17)
𝛿̅𝑗 =
βˆ‘ 𝛿𝑖,𝑗
𝑛
𝑖=1
𝑛
(18)
Figure 3. VIF chart
As observed in the bar chart, the strongest relative importance of factor in the model is v1. This importance is
decomposed into its RSP properties, which is shown in the table below:
Table 1. Average RSP and DS Scores against Four Independent Variables
R S P DS
v1 0.5454 0.2148 0.3540 0.2867
v2 0.5481 0.3276 0.1830 0.2829
v3 0.5447 0.2067 0.0340 0.1363
v4 0.5364 0.1131 0.4290 0.2353
Mean 0.5437 0.2156 0.2500 0.2353
Note. The DS score is the geometric mean of all three RSP scores.
In this example, the use of the standardized parameter estimate – P - as the relative influence factor is misleading
as industry practitioners have clearly identified the importance of v1 rather than v4. This is also reflected in the
DS score. The other use of variance decomposition – R – is also misleading, as it identifies v2 as the factor that
has the strongest relative importance in the model instead of v1. The composite score of three properties clearly
show that variable importance does not solely rely on one aspect or dimension of importance considerations, but
three aspects: reliability, significance and power.
A 2x6 Variable Importance Matrix (VIM) describes the characteristics of DS. A low score is understood as a
score that is below 0.50. A high score is understood as a score that is above 0.50. For example, a mix of DS=0.90
(highest DS score in the model) and R=0.30 is characterized as the most important variable that has low
reliability. From the VIM table, this variable is known as an Unreliably Important variable.
Table 2. Variable Importance Matrix (VIM)
Less Important Most Important
Low Reliability Unreliably Trivial Unreliably Important
High Reliability Reliably Trivial Reliably Important
Low Significance Insignificantly Trivial
Insignificantly
Important
High Significance Significantly Trivial
Significantly
Important
Low Power Powerlessly Trivial
Powerlessly
Important
High Power Powerfully Trivial Powerfully Important
0.29 0.28
0.14
0.24
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
v1 v2 v3 v4
VIF based on DS scoring
VIF
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
130
A classification by levels (CLASS) is used here:
If ALL RSP is low (i.e. 𝑅𝑆𝑃 < 0.50) β‡’ Level 4
If EITHER Two of RSP is low (i.e. 𝑅𝑃/𝑅𝑆/𝑆𝑃 < 0.50) β‡’ Level 3
If ONLY One of RSP is low (i.e. 𝑅/𝑆/𝑃 < 0.50) β‡’ Level 2
If NONE of RSP is low (i.e. 𝑅𝑆𝑃 β‰₯ 0.50) β‡’ Level 1
The CLASS logic is then combined with VIM to generate the final descriptive outcome of VIF for the variable
of interest. For example, v1 is classified as level 3 Importance (L3I) variable. However, DS can be improved if
the CLASS level improves from level 3 to level 1. Ideally, a good and strong driver for response variable is a
level 1 importance (L1I) variable. To achieve such a score, RSP scores that are below 0.50 should be given extra
attention. For example, v1 has high R score but low S and P scores. Hence, measures to create greater
distinctiveness and power could potentially improve the final DS score.
The benefit of this scoring methodology is 1) the flexibility to include categorical variables as a regressor, 2) the
ability to aggregate three important properties of variable from both the theoretical and practical aspects of
analytic, and 3) the simplicity at which an explanation could be given to laypeople.
Categorical variables are treated as dummy variables in the linear regression model and itsβ€Ÿ F-ratios are similarly
calculated by taking the sum of squared error due to model against the sum of squared error due to residual
errors. To assign power to the DS methodology, absolute standardized regression coefficients for each dummy
variable is assigned to each groupings in the factor. Hence, a metrical independent variable has one power
assignment in a factor while the categorical independent variable has multiple power assignments for different
groupings in a single factor.
Theoretically, so long as the variable is normally distributed and conforms to the chi-square distribution, the
usefulness of the DS methodology becomes essential for practical use. A single dimension of the DS
methodology may be taken to generate VIF measurement, but it does not represent the natural and intuitive
considerations of factor importance in practice. Practitioners often need a score that reflects reliability,
significance and power of the predictors and conclude the importance of it thereafter. DS helps to meet the needs
of those who wish to obtain a practical assessment of variable importance.
9. Limitations of DS
DS has several limitations. Firstly, DS takes the absolute standardized regression coefficients as a measurement
for power in a linear model. In cases where nonlinearity is observed, an alternative approach is needed to
substitute this third property of DS. Also, DS does not account for multi-dimensional explanation of variances in
a single vector space. This has implications when it comes to interrelatedness of predictors that could potentially
form interaction terms. When interaction terms are formed, the DS methodology is not able to tease out
interaction effects. Lastly, as categorical variables are converted into dummy variables in a model, the increase in
parameter estimates with constant set of sample size makes the model unstable. Hence, the DS methodology
becomes useful when only key factors that could potentially explain variances better are considered. These
limitations have an impact in practical use when the relationship between predictors and response variable
becomes too complex, resulting to multiple unknown β€žnoisesβ€Ÿ within the model that could affect the whole DS
methodology. The use of DS should be jealously guarded by good and firm understanding of relationship
between variables and the intuitive approach in understanding practical relationships is essential in the DS
methodology.
10. Summary
Managers often ask what drivers or variables influence the outcome of success more significantly over the other
at the respondent level. For example, the main question that managers ask is the drivers which contribute to the
success of an event. Fundamentally, the drivers that contribute mostly to success of an event are obtained by the
correlation coefficient of the variables. However, this methodology falls apart when categorical variables are
used or the requirement to assess additional parameters is apparent. DS is designed to overcome this limitation
by decomposing the DS scores into its three properties – RSP. As the measurement for variable influence is seen
at the observation or respondent level, the aggregation of scores can finally add up to the variable influence
factor (VIF) measurement.
Cost-cutting measures that lead to improved profitability are also possible by using DS. Businesses can utilize
DS to determine which dimension or variable has the greatest importance and pitch marketing efforts on that
dimension with a targeted focus on respondents who have higher RSP scores. If the amount of sales leads is a
www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015
131
stronger or more important factor in determining revenue and the higher RSP scores are found among the
entrepreneurs, then companies can focus their marketing effort on generating leads by targeting the entrepreneurs
first, then the others. Although it is fundamentally true that the outcome of revenue takes prerogative in
marketing effort, the precision of the outcome and the significance of it have a key role to play in understanding
variable importance. Targeted and focused marketing can generate income faster, which could then supplement
additional capital to target the rest of the populations using low-cost marketing efforts.
11. Future Research
Due to the limitation imposed by the DS methodology, more studies can be conducted to understand how
variance decomposition occurs in a multi-dimensional setting in a single vector space. Particularly, studies can
be conducted to examine independent variables that are correlated to each other and when these variables are
categorical, more can be done to examine interaction terms and how it decomposes to the RSP scorings. Finally,
more β€žreal-worldβ€Ÿ data is needed to assess the DS methodology, particularly in RSP scorings. A survey of factor
importance can be distributed to industry practitioners to complement the findings of DS. The results from the
survey can validate the strength of RSP scoring methodology. While the RSP scoring and its DS methodology is
still at its conceptual stage, the use of it can bring many benefits to business practitioners and academic
researchers. In light of the possible use of DS, more industrial report can utilize this RSP scoring method to
generate analytic-based decision makings.
References
Chevan, A., & Sutherland, M. (1991). Hierarchical Partitioning. The American Statistician, 90-96.
http://dx.doi.org/10.2307/2684366
Christensen, R. (1992). Comment on "Hierarchical Partitioning," by A. Chevan and M. Sutherland. The
American Statistician , 74.
Darlington, R. (1968). Multiple Regression in Psychological Research and Practice. Psychological Bulletin ,
161-182. http://dx.doi.org/10.1037/h0025471
Ehrenberg, A. (1990). The Unimportance of Relative Importance. The American Statistician , 260.
Hoffman, P. (1960). The Paramophic Representation of Clinical Judgment. Psychological Bulletin , 116-131.
http://dx.doi.org/10.1037/h0047807
Hooker, R., & Yule, G. U. (1906). Note on Estimating the Relative Influence of Two Variables Upon a Third.
Journal of the Royal Statistical Society , 197-200. http://dx.doi.org/10.2307/2339552
Kruskal, W. (1987). Relative Importance by Averging Over Orderings. The American Statistician, 6-10.
Lindeman, R., Merenda, P., & Gold, R. (1980). Introduction to Bivariate and Multvariate Analysis. Glenview:
Scott, Foresman.
Pratt, J. W. (1987). Dividing the Indivisible: Using Simple Symmetry to Partition Variance Explained.
Proceedings of Second Tampere Conference in Statistics, eds. (pp. 245-260). Finland: T. Pukkila and S.
Puntamen.
Strobl, C., Boulesteix, A., Kneib, T., Augustin, T., & Zeileis, A. (2008). Conditional Variable Importance for
Random Forests. BMC Bioinformatics , 307. http://dx.doi.org/10.1186/1471-2105-9-307
Stufken, J. (1992). On Hierarchical Partitioning. The American Statistician , 70-71.
Note 1: With permission granted from the Multinational Corporation (MNC), under the condition of full
anonymity and non-disclosure.
Copyrights
Copyright for this article is retained by the author(s), with first publication rights granted to the journal.
This is an open-access article distributed under the terms and conditions of the Creative Commons Attribution
license (http://creativecommons.org/licenses/by/3.0/).

More Related Content

What's hot

Linear regression
Linear regression Linear regression
Linear regression Babasab Patil
Β 
Bbs10 ppt ch03
Bbs10 ppt ch03Bbs10 ppt ch03
Bbs10 ppt ch03Anwar Afridi
Β 
Multiple regression in spss
Multiple regression in spssMultiple regression in spss
Multiple regression in spssDr. Ravneet Kaur
Β 
Simple (and Simplistic) Introduction to Econometrics and Linear Regression
Simple (and Simplistic) Introduction to Econometrics and Linear RegressionSimple (and Simplistic) Introduction to Econometrics and Linear Regression
Simple (and Simplistic) Introduction to Econometrics and Linear RegressionPhilip Tiongson
Β 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spssDr Nisha Arora
Β 
4th Grade Glossary
4th Grade Glossary4th Grade Glossary
4th Grade GlossaryIsaac_Schools_5
Β 
3rd Grade Glossary
3rd Grade Glossary3rd Grade Glossary
3rd Grade GlossaryIsaac_Schools_5
Β 
Morse et al 2012
Morse et al 2012Morse et al 2012
Morse et al 2012Brendan Morse
Β 
[M3A4] Data Analysis and Interpretation Specialization
[M3A4] Data Analysis and Interpretation Specialization[M3A4] Data Analysis and Interpretation Specialization
[M3A4] Data Analysis and Interpretation SpecializationAndrea Rubio
Β 
Regression analysis in R
Regression analysis in RRegression analysis in R
Regression analysis in RAlichy Sowmya
Β 
5th Grade Math Glossary
5th Grade Math Glossary5th Grade Math Glossary
5th Grade Math GlossaryIsaac_Schools_5
Β 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAileen Balbido
Β 
Reporting an independent sample t test
Reporting an independent sample t testReporting an independent sample t test
Reporting an independent sample t testKen Plummer
Β 
Lesson 1 07 measures of variation
Lesson 1 07 measures of variationLesson 1 07 measures of variation
Lesson 1 07 measures of variationPerla Pelicano Corpez
Β 
BasicStatistics.pdf
BasicStatistics.pdfBasicStatistics.pdf
BasicStatistics.pdfsweetAI1
Β 
Presentation on Regression Analysis
Presentation on Regression AnalysisPresentation on Regression Analysis
Presentation on Regression AnalysisJ P Verma
Β 

What's hot (17)

Linear regression
Linear regression Linear regression
Linear regression
Β 
Bbs10 ppt ch03
Bbs10 ppt ch03Bbs10 ppt ch03
Bbs10 ppt ch03
Β 
Multiple regression in spss
Multiple regression in spssMultiple regression in spss
Multiple regression in spss
Β 
Simple (and Simplistic) Introduction to Econometrics and Linear Regression
Simple (and Simplistic) Introduction to Econometrics and Linear RegressionSimple (and Simplistic) Introduction to Econometrics and Linear Regression
Simple (and Simplistic) Introduction to Econometrics and Linear Regression
Β 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spss
Β 
4th Grade Glossary
4th Grade Glossary4th Grade Glossary
4th Grade Glossary
Β 
3rd Grade Glossary
3rd Grade Glossary3rd Grade Glossary
3rd Grade Glossary
Β 
Morse et al 2012
Morse et al 2012Morse et al 2012
Morse et al 2012
Β 
[M3A4] Data Analysis and Interpretation Specialization
[M3A4] Data Analysis and Interpretation Specialization[M3A4] Data Analysis and Interpretation Specialization
[M3A4] Data Analysis and Interpretation Specialization
Β 
Regression analysis in R
Regression analysis in RRegression analysis in R
Regression analysis in R
Β 
Correlation
CorrelationCorrelation
Correlation
Β 
5th Grade Math Glossary
5th Grade Math Glossary5th Grade Math Glossary
5th Grade Math Glossary
Β 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Β 
Reporting an independent sample t test
Reporting an independent sample t testReporting an independent sample t test
Reporting an independent sample t test
Β 
Lesson 1 07 measures of variation
Lesson 1 07 measures of variationLesson 1 07 measures of variation
Lesson 1 07 measures of variation
Β 
BasicStatistics.pdf
BasicStatistics.pdfBasicStatistics.pdf
BasicStatistics.pdf
Β 
Presentation on Regression Analysis
Presentation on Regression AnalysisPresentation on Regression Analysis
Presentation on Regression Analysis
Β 

Similar to Assessing relative importance using rsp scoring to generate

Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deren Lei
Β 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfAlemAyahu
Β 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfDr. Radhey Shyam
Β 
Correlation Research Design
Correlation Research DesignCorrelation Research Design
Correlation Research DesignSu Qee
Β 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSScsula its training
Β 
Recommender system
Recommender systemRecommender system
Recommender systemBhumi Patel
Β 
artigo correlação policorica x correlaçãoperson pdf
artigo correlação policorica x correlaçãoperson pdfartigo correlação policorica x correlaçãoperson pdf
artigo correlação policorica x correlaçãoperson pdflarissaxavier60
Β 
8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...
8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...
8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...MFOZOUNI
Β 
Scope and objective of the assignment
Scope and objective of the assignmentScope and objective of the assignment
Scope and objective of the assignmentGourab Chakraborty
Β 
ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC DISTRIBUTION USING MAXIMUM LIKELIH...
ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC  DISTRIBUTION USING MAXIMUM LIKELIH...ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC  DISTRIBUTION USING MAXIMUM LIKELIH...
ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC DISTRIBUTION USING MAXIMUM LIKELIH...BRNSS Publication Hub
Β 
Data Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVAData Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVADerek Kane
Β 
marketing research & applications on SPSS
marketing research & applications on SPSSmarketing research & applications on SPSS
marketing research & applications on SPSSANSHU TIWARI
Β 
Assigning Scores For Ordered Categorical Responses
Assigning Scores For Ordered Categorical ResponsesAssigning Scores For Ordered Categorical Responses
Assigning Scores For Ordered Categorical ResponsesMary Montoya
Β 
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai UniversityMachine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai UniversityMadhav Mishra
Β 
Running head LOGISTIC REGRESSION .docx
Running head LOGISTIC REGRESSION                                 .docxRunning head LOGISTIC REGRESSION                                 .docx
Running head LOGISTIC REGRESSION .docxwlynn1
Β 

Similar to Assessing relative importance using rsp scoring to generate (20)

ESTIMATING R 2 SHRINKAGE IN REGRESSION
ESTIMATING R 2 SHRINKAGE IN REGRESSIONESTIMATING R 2 SHRINKAGE IN REGRESSION
ESTIMATING R 2 SHRINKAGE IN REGRESSION
Β 
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Β 
Measure of Association
Measure of AssociationMeasure of Association
Measure of Association
Β 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdf
Β 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdf
Β 
Correlation Research Design
Correlation Research DesignCorrelation Research Design
Correlation Research Design
Β 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSS
Β 
Recommender system
Recommender systemRecommender system
Recommender system
Β 
artigo correlação policorica x correlaçãoperson pdf
artigo correlação policorica x correlaçãoperson pdfartigo correlação policorica x correlaçãoperson pdf
artigo correlação policorica x correlaçãoperson pdf
Β 
8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...
8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...
8064-an-excel-spreadsheet-and-vba-macro-for-model-selection-and-predictor-imp...
Β 
Scope and objective of the assignment
Scope and objective of the assignmentScope and objective of the assignment
Scope and objective of the assignment
Β 
ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC DISTRIBUTION USING MAXIMUM LIKELIH...
ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC  DISTRIBUTION USING MAXIMUM LIKELIH...ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC  DISTRIBUTION USING MAXIMUM LIKELIH...
ALPHA LOGARITHM TRANSFORMED SEMI LOGISTIC DISTRIBUTION USING MAXIMUM LIKELIH...
Β 
Data Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVAData Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVA
Β 
marketing research & applications on SPSS
marketing research & applications on SPSSmarketing research & applications on SPSS
marketing research & applications on SPSS
Β 
Mixed models
Mixed modelsMixed models
Mixed models
Β 
Discriminant analysis.pptx
Discriminant analysis.pptxDiscriminant analysis.pptx
Discriminant analysis.pptx
Β 
Assigning Scores For Ordered Categorical Responses
Assigning Scores For Ordered Categorical ResponsesAssigning Scores For Ordered Categorical Responses
Assigning Scores For Ordered Categorical Responses
Β 
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai UniversityMachine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
Β 
call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...
Β 
Running head LOGISTIC REGRESSION .docx
Running head LOGISTIC REGRESSION                                 .docxRunning head LOGISTIC REGRESSION                                 .docx
Running head LOGISTIC REGRESSION .docx
Β 

Recently uploaded

Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -INandakishor Bhaurao Deshmukh
Β 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
Β 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
Β 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
Β 
Cytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptxCytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptxVarshiniMK
Β 
Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |aasikanpl
Β 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
Β 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxEran Akiva Sinbar
Β 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
Β 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10ROLANARIBATO3
Β 
Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”soniya singh
Β 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555kikilily0909
Β 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
Β 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
Β 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
Β 
Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.
Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.
Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.aasikanpl
Β 
Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
Β 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
Β 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett SquareIsiahStephanRadaza
Β 

Recently uploaded (20)

Hot Sexy call girls in Moti Nagar,πŸ” 9953056974 πŸ” escort Service
Hot Sexy call girls in  Moti Nagar,πŸ” 9953056974 πŸ” escort ServiceHot Sexy call girls in  Moti Nagar,πŸ” 9953056974 πŸ” escort Service
Hot Sexy call girls in Moti Nagar,πŸ” 9953056974 πŸ” escort Service
Β 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
Β 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Β 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
Β 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
Β 
Cytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptxCytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptx
Β 
Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Lajpat Nagar (Delhi) |
Β 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
Β 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Β 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Β 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10
Β 
Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Munirka Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Β 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555
Β 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Β 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Β 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
Β 
Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.
Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.
Call Girls in Mayapuri Delhi πŸ’―Call Us πŸ”9953322196πŸ” πŸ’―Escort.
Β 
Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 β‰Ό Call Girls In Mukherjee Nagar(Delhi) |
Β 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
Β 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett Square
Β 

Assessing relative importance using rsp scoring to generate

  • 1. International Journal of Statistics and Probability; Vol. 4, No. 2; 2015 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education 123 Assessing Relative Importance Using RSP Scoring to Generate Variable Importance Factor (VIF) Daniel Koh1 1 School of Business, SIM University, Singapore Correspondence: Daniel Koh, School of Business, SIM University, 461 Clementi Road, 599 491, Singapore. Tel: 65-6248-9746. E-mail: danielkoh005@unisim.edu.sg Received: March 17, 2015 Accepted: April 8, 2015 Online Published: April 27, 2015 doi:10.5539/ijsp.v4n2p123 URL: http://dx.doi.org/10.5539/ijsp.v4n2p123 Abstract Previous research has shown that the construction of VIF is challenging. Some researchers have sought to use orderly contribution of R2 (coefficient of determination) as measurement for relative importance of variable in a model, while others have sought the standardized parameter estimates b (beta) instead. These contributions have been proven to be very valuable to the literature. However, there is a lack of study in combining key properties of variable importance into one composite score. For example, an intuitive understanding of variable importance is by scoring reliability, significance and power (RSP) of it in the model. Thereafter the RSP scores can be aggregated together to form a composite score that reflects VIF. In this paper, the author seeks to prove the usefulness of the DS methodology. DS stands for Driverβ€Ÿs Score and is defined as the relative, practical importance of a variable based on RSP scoring. An industry data was used to generate DS for practical example in this paper. This DS is then translated into a 2x6 matrix where level of importance (LxI) is generated. The final outcome of this paper is to discuss the use of RSP scoring methodology, theoretical and practical use of DS and the possible future research that entails this paper. DS methodology is new to the existing literature. Keywords: variable importance, decomposition of variances, RSP Scoring, multiple linear regression 1. Introduction In recent history, much effort has been given to the study of variable importance factor (VIF). Researchers have approached this topic in several manners: taking the increase of R2 – coefficient of determination - as the usefulness of the regressors (Darlington, 1968), squared standardized coefficients and products of standardized coefficients with marginal correlations (Hoffman, 1960; Hooker & Yule, 1906), LMG method of using sequential sums of squares from the linear model (Lindeman, Merenda, & Gold, 1980), conditional variable importance for Random Forest (Strobl, Boulesteix, Kneib, Augustin, & Zeileis, 2008), the averaging method of variance decomposition (Kruskal, 1987; Chevan & Sutherland, 1991) and proportional marginal variance decomposition. However, most of the studies are founded on one dimension. Several authors which include Ehrenberg (1990), Stufken (1992), and Christensen (1992) have dismissed the usefulness and benefits of relative importance measure. The premise of this dismissal was that the decomposition of coefficient of determination is too simplistic and it is difficult to tease out relative importance among correlated variables which could potentially β€œdouble count” in the model. In this paper, the focus is in the independent measurement of relative importance and the discussion on teasing out interrelatedness between independent variables is deferred. The decomposition of coefficient of determination becomes a powerful tool when complementary scorings are given to improve accuracy in understanding relative importance of variables. Hence, this paper seeks to suggest a new method to assess relative importance of variable by considering reliability, significance and power, to the end that the final composite scores reflect the intuitive, practical understanding of relative importance better. Reliability is defined here as the invert of the sum of residual errors between the predicted and actual values. Significance is defined here as the heterogeneity of groupings with homogeneity in within-group distances or maximized distances between predicted values with expected mean of dependent variable and minimized distances between predicted values with actual values. Power is defined here as the positivity of the slope of the estimates, with greater or steeper slope leading to greater power. The intention is to develop a score that not only accounts for variance decomposition, but also practical meaningfulness and accuracy in utilizing a variable. It also accounts for the goodness of a predictor by scoring the standardized parameter estimates of the variable in
  • 2. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 124 the model. The DS scoring methodology is new to our existing literature today. 2. Residual Errors as First Property One aspect of a good predictor is its minimized residual error values, which then contribute to a strong coefficient of determination, R2 . When residual errors are minimized, the chance of making mispredictions becomes lesser, leading to greater reliability for the predictor. A practical example would be the reliability of age in understanding income earnings. Between the independent variable age and gender, age is chosen to be the better predictor for income earnings because gender may not contribute as much between-group residual errors as compared to the sum of squares of model (SSM) for age. This is particularly true when a meritocratic society promotes merit for work experiences, of which only age most likely correlates positively with it strongly, regardless of gender type. Intuitively, the concept of reliability lies on the confidence of which one can get when the model is put to test. The scores can be decomposed into respondent level, whereby each respondent is given three scores for RSP, leading up to the final scoring of DS. Hence, the first function of DS is the invert of residual errors, which is first expressed in the following mathematical expression for multiple linear regression: Ε· = π‘Ž0 + π‘Ž1 π‘₯1 + π‘Ž2 π‘₯2 + … + π‘Ž 𝑛 π‘₯ 𝑛 + πœ€ (1) where π‘₯ ∈ ℝ denotes the regressor, 𝑦̂ ∈ ℝ denotes the dependent variable, π‘Ž denotes the parameter estimate and πœ€ denotes the error term of the model. A series of x is fitted into the model, generating a series of predicted values, Ε·. The residual for this model - 𝜁 - is then expressed as the absolute difference between the actual values of the dependent variable which is denoted by 𝑦 and the predicted values 𝑦̂. 𝜁 = |𝑦 βˆ’ Ε·| (2) The residual is then inverted to form common directionality with two other functions of the RSP framework. 𝜌 = 1 𝜁 = 1 |𝑦 βˆ’ Ε·| (3) The residual is then fitted onto the Gaussianβ€Ÿs cumulative distribution function - Ξ¦(𝜌) - , assuming that the variable is independent and identically distributed (I.I.D) under the Normal distribution, 𝑋 ~ 𝑁(0,1): Ξ¦(𝜌) = 1 𝜍√2πœ‹ ∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌) 2 / 2𝜍 𝜌 2 π‘‘πœŒ 𝜌 min 𝜌 (4) where 𝜌 denotes the inverse of residual error, πœ‡ 𝜌 denotes the mean of the residual errors, 𝜍𝜌 denotes the standard deviation of the residual error. The first function of DS is improved when 𝜍2 𝜌 is decreased: reliability increases when data are less sparsely distributed. This cumulative distribution function serves as a score for reliability on the observation or respondent level. The use of the magnitude of 𝑅2 contributions to assess relative importance was similarly proposed by Hoffman (1960) and later defended by Pratt (1987). 3. F-Ratios of the Residual Errors as Second Property While the residual errors are expected to be minimized, the significance of the residual errors is expected to be maximized. The motivation behind this property is to obtain scores that reflect variableβ€Ÿs distinctiveness among inter-groups through a study of variance ratio. For example, if a variable πœ† has unique and distinctive πœ… groupings in understanding income earning levels of a country, the F-Ratios due to πœ† should be greater as compared to those that have more homogenous groupings. The residual error in the first function is preferred over observations because the variance of residual errors (variance study of residual errors) is expected to reflect distinctiveness between groups better if they are truly
  • 3. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 125 distinctive and unique than the observation values themselves. For example, when income is predicted using Age, the error between πœ… groupings in πœ† should have distinctive noises. This β€œdistinctive noises” should characterize their identity as unique groupings in the model. This is true for variables that are categorical, which will partition the data into kth groupings. Hence, the second function of DS has an inverse relationship with the first function of DS. The second function of DS - 𝜌 - is expressed in the following mathematical expression for a linear regression model: 𝐹𝜌 = π‘‰π‘Žπ‘Ÿπ‘–π‘Žπ‘›π‘π‘’π‘  𝑑𝑒𝑒 π‘‘π‘œ π‘€π‘œπ‘‘π‘’π‘™ π‘‰π‘Žπ‘Ÿπ‘–π‘Žπ‘›π‘π‘’π‘  𝑑𝑒𝑒 π‘‘π‘œ π‘…π‘’π‘ π‘–π‘‘π‘’π‘Žπ‘™ (5) 𝐹𝜌 = βˆ‘(𝑦̂ βˆ’ 𝑦̅)2 (𝐾 βˆ’ 1) βˆ‘(𝑦 βˆ’ 𝑦̂)2 (𝑁 βˆ’ 𝐾) (6) 𝐹𝜌 = βˆ‘(𝑦̂ βˆ’ 𝑦̅)2 βˆ‘(𝑦 βˆ’ 𝑦̂)2 ( 𝑁 βˆ’ 𝐾 𝐾 βˆ’ 1 * (7) where 𝐾 is the number of groupings, 𝑦̂ denotes predicted value of 𝑦, and 𝑦̅ denotes the average value of 𝑦, N denotes the sample size. The F-Ratios are then fitted into Fisherβ€Ÿs CDF, which is the integral of the PDF of F-distribution, assuming that the variables are independent and identically distributed (I.I.D) under the Fisherβ€Ÿs distribution: Ξ¦Fρ(Fρ,πΎβˆ’1,π‘βˆ’πΎ) = ∫ ( πΎβˆ’1 π‘βˆ’πΎ ) πΎβˆ’1 2 Fρ Kβˆ’1 2 βˆ’ 1 (1+ Kβˆ’1 Nβˆ’K Fρ) βˆ’ Nβˆ’1 2 [ * 1 2 (π‘βˆ’πΎ)+!* 1 2 (πΎβˆ’1)+! * 1 2 (π‘βˆ’πΎ)++* 1 2 (πΎβˆ’1)+βˆ’ 1 ] Fρ 0 (8) Φϱ = 1 βˆ’ ∫ ( 𝐾 βˆ’ 1 𝑁 βˆ’ 𝐾 ) πΎβˆ’1 2 Fρ Kβˆ’1 2 βˆ’ 1 (1 + K βˆ’ 1 N βˆ’ K Fρ) βˆ’ Nβˆ’1 2 [ * 1 2 (𝑁 βˆ’ 𝐾)+ ! * 1 2 (𝐾 βˆ’ 1)+ ! * 1 2 (𝑁 βˆ’ 𝐾)+ + * 1 2 (𝐾 βˆ’ 1)+ βˆ’ 1 ] Fρ 0 (9) As F-ratio 𝐹𝜌 considers residual errors 𝜁, the Fisherβ€Ÿs CDF Ξ¦Fρ(π‘₯,πΎβˆ’1,π‘βˆ’πΎ) is reversed (1 βˆ’ Ξ¦Fρ(Fρ,πΎβˆ’1,π‘βˆ’πΎ)) to generate significance score Φϱ, with greater residual errors leading to lower probability values. This arrangement allows the decomposition of F-Ratios to the observation level, where each observation is assigned an F-Ratio value. This cumulative distribution function which follows the Fisherβ€Ÿs Distribution serves as a score for Significance on the observation or respondent level. Significance is an important property in DS. If income earnings are to be understood by gender and dwelling type of individuals, the latter variable provides greater noises in residual error as the separating groups in the factor create more β€žnoisesβ€Ÿ than the variable gender. Or the noises between age and gender are significantly different as the residual errors due to variable Age may contain more noises than the variable gender. If the
  • 4. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 126 distributions of errors among groupings are similar under the F-distribution, then the factor is less significant for use as the factor exhibits homogeneity in variances across all groups. Hence, the significance of a variable relies on the distinctiveness of errors between groups in factors, with homogeneity for within groups, or greater distances from sample mean, with lower distances from the model. 4. Standardized Regression Coefficients as Third Property The third and final function of DS is the standardized parameter estimate of the regressor. This is commonly known as the slope of the curve. In a linear regression, the slope of the curve is observed by the parameter estimates of the variable π‘Ž of the model. However, the use of unstandardized slope of the curve is not appropriate when scales of different variables are different. Hence, the power of regressors – or β€žsteepnessβ€Ÿ – is then observed by the standardized parameter estimate 𝑏 of the model. This standardized estimate reduces the metrical scale to a common vector across all regressors. It is expressed through the following mathematical expression: 𝑦̇̂ = 𝑏̇1 πœƒ1 + 𝑏̇2 πœƒ2 + … + 𝑏̇ π‘š πœƒ π‘š + πœ€ (10) 𝑏 π‘š Μ‡ = 𝑏 π‘š ( 𝜍π‘₯ π‘š 𝜍 𝑦 ) (11) πœƒπ‘– denotes the standardized values of the predictor π‘₯, 𝑦̇̂ denotes the predicted response variable from standardized predictors, 𝑏̇ π‘š denotes the standardized regression coefficient, 𝑏 π‘š denotes the unstandardized regression coefficient, 𝜍π‘₯ denotes the standard deviation of the predictor and 𝜍 𝑦 denotes the standard deviation of the response variable. The absolute standardized regression coefficient – |𝑏 π‘š Μ‡ | – is then converted into a ratio out of the sum of all absolute standardized coefficients, with 𝑛 number of parameters in a model: πœ‚ 𝑗 = |𝑏𝑗 Μ‡ | βˆ‘ |𝑏𝑗 Μ‡ |π‘š 𝑗=1 (12) where π‘š denotes the total number of parameter estimates in the model, 𝑏𝑗 Μ‡ denotes the parameter estimate of interest. While relative importance measure considers reliability and significance, power is still necessary for the understanding of relative importance of a variable in the model. When a powerful variable has low reliability and significance, a low DS score is expected. If reliability and significance scores are high, but power score is low, the DS still remains effective as the consideration that arises from the combined importance of reliability and significance outweigh the consideration for power. Intuitively, power or standardized regression coefficient is not the sole consideration of relative importance of a variable in the model but a combination of reliability, significance and power. 5. Driver’s Score (DS) Driverβ€Ÿs Score (DS) is an aggregated score of three properties: reliability, significance and power. It reflects the relative importance of a variable in the model practically. At the observation level, the Driversβ€Ÿ Score (𝛿) is the geometric mean of these three properties. The mathematical expression is expressed as: 𝛿 = √( 1 𝜍√2πœ‹ ∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌) 2 / 2𝜍 𝜌 2 π‘‘πœŒ 𝜌 min 𝜌 . (1 βˆ’ ∫ ( πΎβˆ’1 π‘βˆ’πΎ ) πΎβˆ’1 2 Fρ Kβˆ’1 2 βˆ’ 1 (1+ Kβˆ’1 Nβˆ’K Fρ) βˆ’ Nβˆ’1 2 [ * 1 2 (π‘βˆ’πΎ)+!* 1 2 (πΎβˆ’1)+! * 1 2 (π‘βˆ’πΎ)++* 1 2 (πΎβˆ’1)+βˆ’ 1 ] Fρ 0 ) . |𝑏 𝑗̇ | βˆ‘ |𝑏 𝑗̇ |π‘š 𝑗=1 ) 3 (13) As observed in the DS equation, the DS methodology is the cube-root of the ratio of three products to the sum of
  • 5. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 127 all absolute standardized regression coefficient estimates. It is a function of Reliability (Gaussianβ€Ÿs CDF which has an exponential function), Significance (Fisherβ€Ÿs CDF which has a function of the hypergeometric function for 𝐹𝜌 by Beta function for 𝐹𝜌) and the absolute standardized regression coefficient of the variable of interest and an inverse function of the sum of all absolute standardized regression coefficient estimates. The decrease in other absolute standardized parameter estimates increases DS. This understanding has an important and practical value: the variable of interest in which it is positioned with other variables is influenced by the mix of the model. When DS is low, either two of the three properties or all properties are low, resulting to weaker driversβ€Ÿ influence. When DS is high, either two of the three properties or all properties are high, resulting to stronger driversβ€Ÿ influence. DS is bounded by a lower limit and an upper limit of 0 and 1 respectively. At the regressor level, the Driversβ€Ÿ Score (DS) is the arithmetic mean of all the geometric mean of all three properties at the observation level: 𝛿̅ = βˆ‘ ( √( 1 𝜍√2πœ‹ ∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌) 2 / 2𝜍 𝜌 2 π‘‘πœŒ 𝜌 min 𝜌 . ( 1βˆ’ ∫ ( πΎβˆ’1 π‘βˆ’πΎ ) πΎβˆ’1 2 Fρ Kβˆ’1 2 βˆ’ 1 (1+ Kβˆ’1 Nβˆ’K Fρ) βˆ’ Nβˆ’1 2 [ * 1 2 (π‘βˆ’πΎ)+!* 1 2 (πΎβˆ’1)+! * 1 2 (π‘βˆ’πΎ)++* 1 2 (πΎβˆ’1)+βˆ’ 1 ] Fρ 0 ) .|𝑏 𝑗̇ | ) 3 ) βˆ‘ |𝑏 𝑗̇ |π‘š 𝑗=1 .𝑛 (14) When DS at the regressor level is low, relative influence on the dependent variable in the model is low. When DS at the regressor level is high, relative influence on the dependent variable in the model is high. DS is bounded by a lower limit and an upper limit of 0 and 1 respectively. 6. Weighted Drivers’ Score (wDS) The weighted Driversβ€Ÿ Score (wDS) is a composite score of three properties (RSP), being weighted individually by their importance in the DS framework. For example, the score for reliability (R) is far more important than the score for significance (S) and power (P). Hence, the R score is given heavier weights than the rest. The wDS - 𝛿 𝑀 - can be expressed mathematically by assigning power to the properties: 𝛿 𝑀 = ( 1 𝜍√2πœ‹ ∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌) 2 / 2𝜍 𝜌 2 π‘‘πœŒ 𝜌 min 𝜌 ) w1 . ( 1βˆ’ ∫ ( πΎβˆ’1 π‘βˆ’πΎ ) πΎβˆ’1 2 Fρ Kβˆ’1 2 βˆ’ 1 (1+ Kβˆ’1 Nβˆ’K Fρ) βˆ’ Nβˆ’1 2 [ * 1 2 (π‘βˆ’πΎ)+!* 1 2 (πΎβˆ’1)+! * 1 2 (π‘βˆ’πΎ)++* 1 2 (πΎβˆ’1)+βˆ’ 1 ] Fρ 0 ) 𝑀2 . |𝑏 𝑗̇ | 𝑀3 (βˆ‘ |𝑏 𝑗̇ |π‘š 𝑗=1 ) 𝑀3 , 𝑀1 + 𝑀2 + 𝑀3 = 1 (15) At the regressor level, the weighted Driversβ€Ÿ Score which is relative importance of a variable in the model after adding weights is the arithmetic mean of all three properties at the observation level: 𝛿̅ 𝑀 = βˆ‘ ( ( 1 𝜍√2πœ‹ ∫ π‘’βˆ’(πœŒβˆ’πœ‡ 𝜌) 2 / 2𝜍 𝜌 2 π‘‘πœŒ 𝜌 min 𝜌 ) w1 . ( 1βˆ’ ∫ ( πΎβˆ’1 π‘βˆ’πΎ ) πΎβˆ’1 2 Fρ Kβˆ’1 2 βˆ’ 1 (1+ Kβˆ’1 Nβˆ’K Fρ) βˆ’ Nβˆ’1 2 [ * 1 2 (π‘βˆ’πΎ)+!* 1 2 (πΎβˆ’1)+! * 1 2 (π‘βˆ’πΎ)++* 1 2 (πΎβˆ’1)+βˆ’ 1 ] Fρ 0 ) 𝑀2 . |𝑏 𝑗̇ | 𝑀3 ) (βˆ‘ |𝑏 𝑗̇ |π‘š 𝑗=1 ) 𝑀3 .𝑛 (16) 7. Application of DS Using data from a Multinational Corporation (MNC) that is based in Singapore (Note 1), the spread of DS of 4 predictors and itsβ€Ÿ response variable are:
  • 6. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 128 Figure 1. DS of v1 to v4 on dependent variable Based on insights that were gathered from professionals in the industry, v1 has the greatest impact or driving force in understanding the dependent variable. This impact or driving force is understood under the framework of the RSP model. To garner reliable, significant and powerful dependent variable, extra care must be given to v1. As it is observed from the chart above (Figure 1), the relationship between DS and its dependent variable takes on the inverse exponential function. This would explicitly highlight that the observations of RSP is often the inverse of the dependent variable. The tendency to mispredict is higher when dependent variable or its set of products gets too complex. In this area, the DS model has intuitively performed well. Figure 2. Scatter-plots of DS with Dependent Variable The plots in figure 2 show the area under the curve, which represents the magnitude of DS. V1 has the largest area under the curve. 8. VIF and Its Decomposition The relative importance of 𝑗 variable - 𝛿̅ - is the arithmetic mean of summation of all 𝛿 in 𝑗 variable: 0 10 20 30 40 50 0.000 0.100 0.200 0.300 0.400 0.500 0.600 0.700 Driver's Scores Driver's Scores with Dependent Variable v1 v2 v3 v4 0 10 20 30 40 50 0.00 0.09 0.12 0.14 0.18 0.20 0.22 0.25 0.28 0.32 0.35 0.39 0.43 0.48 0.54 Driver's Scores (DS) v1 DS 0 10 20 30 40 50 0.00 0.21 0.20 0.27 0.31 0.39 0.32 0.29 0.27 0.26 0.47 0.32 0.26 0.35 0.40 0.31 Driver's Scores (DS) v2 DS 0 10 20 30 40 50 0.00 0.09 0.08 0.05 0.12 0.07 0.22 0.21 0.12 0.11 0.16 0.14 0.18 0.12 0.23 0.14 Driver's Scores (DS) v3 DS 0 10 20 30 40 50 0.03 0.20 0.15 0.23 0.30 0.47 0.38 0.37 0.29 0.29 0.18 0.28 0.14 0.20 0.10 0.14 Driver's Scores (DS) v4 DS
  • 7. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 129 𝛿𝑖,𝑗 = √Φ(𝜌 𝑖,𝑗). Ξ¦Fρ(Fρ,πΎβˆ’1,π‘βˆ’πΎ) . πœ‚ 𝑗 3 (17) 𝛿̅𝑗 = βˆ‘ 𝛿𝑖,𝑗 𝑛 𝑖=1 𝑛 (18) Figure 3. VIF chart As observed in the bar chart, the strongest relative importance of factor in the model is v1. This importance is decomposed into its RSP properties, which is shown in the table below: Table 1. Average RSP and DS Scores against Four Independent Variables R S P DS v1 0.5454 0.2148 0.3540 0.2867 v2 0.5481 0.3276 0.1830 0.2829 v3 0.5447 0.2067 0.0340 0.1363 v4 0.5364 0.1131 0.4290 0.2353 Mean 0.5437 0.2156 0.2500 0.2353 Note. The DS score is the geometric mean of all three RSP scores. In this example, the use of the standardized parameter estimate – P - as the relative influence factor is misleading as industry practitioners have clearly identified the importance of v1 rather than v4. This is also reflected in the DS score. The other use of variance decomposition – R – is also misleading, as it identifies v2 as the factor that has the strongest relative importance in the model instead of v1. The composite score of three properties clearly show that variable importance does not solely rely on one aspect or dimension of importance considerations, but three aspects: reliability, significance and power. A 2x6 Variable Importance Matrix (VIM) describes the characteristics of DS. A low score is understood as a score that is below 0.50. A high score is understood as a score that is above 0.50. For example, a mix of DS=0.90 (highest DS score in the model) and R=0.30 is characterized as the most important variable that has low reliability. From the VIM table, this variable is known as an Unreliably Important variable. Table 2. Variable Importance Matrix (VIM) Less Important Most Important Low Reliability Unreliably Trivial Unreliably Important High Reliability Reliably Trivial Reliably Important Low Significance Insignificantly Trivial Insignificantly Important High Significance Significantly Trivial Significantly Important Low Power Powerlessly Trivial Powerlessly Important High Power Powerfully Trivial Powerfully Important 0.29 0.28 0.14 0.24 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 v1 v2 v3 v4 VIF based on DS scoring VIF
  • 8. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 130 A classification by levels (CLASS) is used here: If ALL RSP is low (i.e. 𝑅𝑆𝑃 < 0.50) β‡’ Level 4 If EITHER Two of RSP is low (i.e. 𝑅𝑃/𝑅𝑆/𝑆𝑃 < 0.50) β‡’ Level 3 If ONLY One of RSP is low (i.e. 𝑅/𝑆/𝑃 < 0.50) β‡’ Level 2 If NONE of RSP is low (i.e. 𝑅𝑆𝑃 β‰₯ 0.50) β‡’ Level 1 The CLASS logic is then combined with VIM to generate the final descriptive outcome of VIF for the variable of interest. For example, v1 is classified as level 3 Importance (L3I) variable. However, DS can be improved if the CLASS level improves from level 3 to level 1. Ideally, a good and strong driver for response variable is a level 1 importance (L1I) variable. To achieve such a score, RSP scores that are below 0.50 should be given extra attention. For example, v1 has high R score but low S and P scores. Hence, measures to create greater distinctiveness and power could potentially improve the final DS score. The benefit of this scoring methodology is 1) the flexibility to include categorical variables as a regressor, 2) the ability to aggregate three important properties of variable from both the theoretical and practical aspects of analytic, and 3) the simplicity at which an explanation could be given to laypeople. Categorical variables are treated as dummy variables in the linear regression model and itsβ€Ÿ F-ratios are similarly calculated by taking the sum of squared error due to model against the sum of squared error due to residual errors. To assign power to the DS methodology, absolute standardized regression coefficients for each dummy variable is assigned to each groupings in the factor. Hence, a metrical independent variable has one power assignment in a factor while the categorical independent variable has multiple power assignments for different groupings in a single factor. Theoretically, so long as the variable is normally distributed and conforms to the chi-square distribution, the usefulness of the DS methodology becomes essential for practical use. A single dimension of the DS methodology may be taken to generate VIF measurement, but it does not represent the natural and intuitive considerations of factor importance in practice. Practitioners often need a score that reflects reliability, significance and power of the predictors and conclude the importance of it thereafter. DS helps to meet the needs of those who wish to obtain a practical assessment of variable importance. 9. Limitations of DS DS has several limitations. Firstly, DS takes the absolute standardized regression coefficients as a measurement for power in a linear model. In cases where nonlinearity is observed, an alternative approach is needed to substitute this third property of DS. Also, DS does not account for multi-dimensional explanation of variances in a single vector space. This has implications when it comes to interrelatedness of predictors that could potentially form interaction terms. When interaction terms are formed, the DS methodology is not able to tease out interaction effects. Lastly, as categorical variables are converted into dummy variables in a model, the increase in parameter estimates with constant set of sample size makes the model unstable. Hence, the DS methodology becomes useful when only key factors that could potentially explain variances better are considered. These limitations have an impact in practical use when the relationship between predictors and response variable becomes too complex, resulting to multiple unknown β€žnoisesβ€Ÿ within the model that could affect the whole DS methodology. The use of DS should be jealously guarded by good and firm understanding of relationship between variables and the intuitive approach in understanding practical relationships is essential in the DS methodology. 10. Summary Managers often ask what drivers or variables influence the outcome of success more significantly over the other at the respondent level. For example, the main question that managers ask is the drivers which contribute to the success of an event. Fundamentally, the drivers that contribute mostly to success of an event are obtained by the correlation coefficient of the variables. However, this methodology falls apart when categorical variables are used or the requirement to assess additional parameters is apparent. DS is designed to overcome this limitation by decomposing the DS scores into its three properties – RSP. As the measurement for variable influence is seen at the observation or respondent level, the aggregation of scores can finally add up to the variable influence factor (VIF) measurement. Cost-cutting measures that lead to improved profitability are also possible by using DS. Businesses can utilize DS to determine which dimension or variable has the greatest importance and pitch marketing efforts on that dimension with a targeted focus on respondents who have higher RSP scores. If the amount of sales leads is a
  • 9. www.ccsenet.org/ijsp International Journal of Statistics and Probability Vol. 4, No. 2; 2015 131 stronger or more important factor in determining revenue and the higher RSP scores are found among the entrepreneurs, then companies can focus their marketing effort on generating leads by targeting the entrepreneurs first, then the others. Although it is fundamentally true that the outcome of revenue takes prerogative in marketing effort, the precision of the outcome and the significance of it have a key role to play in understanding variable importance. Targeted and focused marketing can generate income faster, which could then supplement additional capital to target the rest of the populations using low-cost marketing efforts. 11. Future Research Due to the limitation imposed by the DS methodology, more studies can be conducted to understand how variance decomposition occurs in a multi-dimensional setting in a single vector space. Particularly, studies can be conducted to examine independent variables that are correlated to each other and when these variables are categorical, more can be done to examine interaction terms and how it decomposes to the RSP scorings. Finally, more β€žreal-worldβ€Ÿ data is needed to assess the DS methodology, particularly in RSP scorings. A survey of factor importance can be distributed to industry practitioners to complement the findings of DS. The results from the survey can validate the strength of RSP scoring methodology. While the RSP scoring and its DS methodology is still at its conceptual stage, the use of it can bring many benefits to business practitioners and academic researchers. In light of the possible use of DS, more industrial report can utilize this RSP scoring method to generate analytic-based decision makings. References Chevan, A., & Sutherland, M. (1991). Hierarchical Partitioning. The American Statistician, 90-96. http://dx.doi.org/10.2307/2684366 Christensen, R. (1992). Comment on "Hierarchical Partitioning," by A. Chevan and M. Sutherland. The American Statistician , 74. Darlington, R. (1968). Multiple Regression in Psychological Research and Practice. Psychological Bulletin , 161-182. http://dx.doi.org/10.1037/h0025471 Ehrenberg, A. (1990). The Unimportance of Relative Importance. The American Statistician , 260. Hoffman, P. (1960). The Paramophic Representation of Clinical Judgment. Psychological Bulletin , 116-131. http://dx.doi.org/10.1037/h0047807 Hooker, R., & Yule, G. U. (1906). Note on Estimating the Relative Influence of Two Variables Upon a Third. Journal of the Royal Statistical Society , 197-200. http://dx.doi.org/10.2307/2339552 Kruskal, W. (1987). Relative Importance by Averging Over Orderings. The American Statistician, 6-10. Lindeman, R., Merenda, P., & Gold, R. (1980). Introduction to Bivariate and Multvariate Analysis. Glenview: Scott, Foresman. Pratt, J. W. (1987). Dividing the Indivisible: Using Simple Symmetry to Partition Variance Explained. Proceedings of Second Tampere Conference in Statistics, eds. (pp. 245-260). Finland: T. Pukkila and S. Puntamen. Strobl, C., Boulesteix, A., Kneib, T., Augustin, T., & Zeileis, A. (2008). Conditional Variable Importance for Random Forests. BMC Bioinformatics , 307. http://dx.doi.org/10.1186/1471-2105-9-307 Stufken, J. (1992). On Hierarchical Partitioning. The American Statistician , 70-71. Note 1: With permission granted from the Multinational Corporation (MNC), under the condition of full anonymity and non-disclosure. Copyrights Copyright for this article is retained by the author(s), with first publication rights granted to the journal. This is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).