SlideShare a Scribd company logo
1 of 22
Master the Art of Analytics
A Simplistic Explainer Series For Citizen Data Scientists
Journey Towards Augmented Analytics
Isotonic Regression
Terminologies
Introduction & Example
Standard input/tuning parameters & Sample UI
Sample output UI
Interpretation of Output
Limitations
Business use cases
What is
covered
Terminologies
• Predictors and Target variable:
• Target variable usually denoted by Y, is the variable being predicted and is also
called dependent variable, output variable, response variable or outcome
variable
• Predictor, usually denoted by X, sometimes called an independent or
explanatory variable, is a variable that is being used to predict the
target variable
• Correlation:
• Correlation is a statistical measure that indicates the extent to which two
variables fluctuate together
Terminologies (Continued...)
• Isotonic Constraints/Thresholds:
• These are the data points where we can estimate a set a linear model in
between each of the thresholds to minimize the error regarding the training
data
Terminologies (Continued...)
• Monotonic Constraints:
• These are either increasing or decreasing set of values which are typically
considered to be increasing in case of isotonic regression.
● xi is observed responses and
X=x1, x2,..., xn
• OBJECTIVE:
It is a variant of linear regression and allows
us to build model in piecewise linear manner
i.e., breaking up the problem into few or
many linear segments and perform linear
interpolation of each function.
• BENEFIT:
o Unlike linear regression this model is not
biased and is flexible.
o Helps in multidimensional scaling.
• MODEL:
o Isotonic regression is the technique of
fitting a free-form line to a sequence of
observations such that the fitted line that
lies close to the observations as possible.
● yi is a finite set of real
numbers Y=y1, y2,...,yn
● wi are positive weights
Introduction
Example: Isotonic Regression
Let’s conduct the Isotonic Regression Analysis on Admission Regression Data set on Independent Variables: CGPA, LOR, GRE_Score, TOEFL_Score and
Target Variable: Chance_of_Admit as shown below:
Independent
variables (Xi)
Target
Variable (Y)
Chance_of_admit CGPA LOR GRE_Score TOEFL_Score
0.46 8.0 3.0 308 110
0.64 8.18 3.0 312 98
0.72 8.79 2.5 319 110
0.45 7.46 2.5 290 104
0.57 7.46 2.5 311 98
Model is a an
excellent fit when
Adjusted R-Squared
> 0.7
R-Squared 0.782
Adjusted R-Squared 0.781
R-Squared: It shows the goodness of fit of the model. It
lies between 0 to 1 and closer the value to 1, better the
model.
Adjusted R-Squared: The adjusted R-squared is a
modified version of R-squared that has been adjusted
for the number of predictors in the model. It shows
whether adding additional predictors improve a
regression model or not.
Select the Target Variable
Chance_of_Admit
CGPA
LOR
GRE_Score
TOEFL_Score
Step
1
Select the Predictor Variable
Chance_of_Admit
CGPA
LOR
GRE_Score
TOEFL_Score
Step
2
More than one
predictors can be
selected
Step 3
isotonic = true
(This indicates the default value that is based
on the target variable, typically denoting the
increasing property of the isotonic regression).
By default, these parameters
should be set with the values
mentioned
Step 4
Display the output window containing following:
o Model Summary
o Line Fit Plot
o Residual Versus Fit Plot
Note:
▪ Categorical predictors should be auto detected and converted to dummy/binary variables before applying regression
▪ Decision on selection of predictors depends on business knowledge and the correlation value between the target variable and predictors
Standard Input/Tuning Parameters & Sample UI
Sample Output: 1. Model Summary
● R-Squared: It shows the goodness of fit of the model. It lies
between 0 to 1 and closer the value to 1, better the model.
Root Mean Square Error (RMSE) 0.066
Mean Absolute Error (MAE) 0.048
Mean_Absolute_Percentile_Error (MAPE) 0.0762619
Mean Percentage Error (MPE) -0.0111406
● Adjusted R-Squared: The adjusted R-squared is a modified
version of R-squared that has been adjusted for the number of
predictors in the model. It shows whether adding additional
predictors improve a regression model or not.
R-Squared 0.782
Adjusted R-Squared 0.781
Sample Output: 1. Model Summary (Continued)...
● Root Mean Square Error (RMSE): Square root of the average of squared differences between prediction
and actual observation. It is standard deviation of residual error.
● Mean Absolute Error (MAE): Average of the absolute differences between prediction and actual
observation.
● Mean_Absolute_Percentile_Error (MAPE): Mean Absolute Percentage ratio of residual over actual
observations.
● Mean Percentage Error (MPE): Mean Percentage Error conveys if there is more positive errors than
negative errors or vice-versa based upon its sign.
RMSE, MAE, MAPE and MPE are used to identify the variation in terms of errors from predicted to
actual values.
Lower the values, represent a better fit of the regression model.
Sample Output: 2. Interpretation
CGPA
LOR
GRE_Score
TOEFL_Score
Influencer's Importance
Influencer’s Importance chart is used to show impact of each predictor on target variable.
Target Variable: Chance_of_Admit
Sample Output: 3. Plots
CGPA Predicted Chance_of_Admit
Line Fit Plot Residual versus Fit Plot
Line fit plots are used to check the assumption of
linearity between each Xi & Y
Residual versus fit plot is used to check the
assumption of equal error variances & outliers
The line plot is plotted between Chance_of_Admit against CGPA.
The residual versus Fit plot is plotted between Predicted Chance_of_Admit and Standard Residuals.
Interpretation of Important Model Summary Statistics
RMSE R Squared
RMSE R-Squared
R-Squared: Adjusted R-Squared: RMSE:
• R-squared between 0 to 0.7
represents a model not fit well
and assumptions of normality
and linearity should be checked
for better fitment of a model.
• It shows the goodness of fit of
the model. It lies between 0 to 1
and closer this value to 1, better
the model
• The adjusted R-squared is a
modified version of R-squared
that has been adjusted for the
number of predictors in the
model. It shows whether adding
additional predictors improve a
regression model or not
• If the value is > 0.7, the model
shows a better correlation
between the dependent and
independent variables
• The more the variables, the
lesser is the adjusted R-squared
score
• Square root of the average of
squared differences between
prediction and actual
observation. It is standard
deviation of residual error.
• Lower values of RMSE indicate
a better fit. The value ranges 0
to ∞.
Interpretation of Important Model Summary Statistics
(Continued…)
RMSE R Squared
RMSE R-Squared
MAE: MAPE: MPE:
• Average of the absolute
differences between prediction
and actual observation
• Lower values of MAE indicate a
better fit. The value ranges 0 to
∞
• Like RMSE, it is a negatively
oriented score
• Mean Absolute Percentage
ratio of residual over actual
observations
• Lower the MAPE, better the
performance of the model
• Mean Percentage Error conveys
if there is more positive errors
than negative errors or vice-
versa based upon its sign
• In case of more negative errors
the system underestimates, and
in case of more positive errors
the system overestimates
Interpretation of Plots: Line Fit Plot
• This plot is used to plot the relationship between
each Xi (predictor) & Y (target variable) with Y-on-y
axis and each Xi on x axis
• As shown in the figure 1 in right, as temperature(X)
increases, so does the Yield(Y), hence there is a
linear relationship between X and Y and isotonic
regression is applicable on this data
• If line doesn’t display linearity as shown in figures 2
& 3 in right, then transformation can be applied on
that particular variable before proceeding with
model building
• If data transformation doesn’t help, then either
that variable(Xi) can be dropped from the analysis
or nonlinear model should be chosen depending
on the distribution pattern of scatter plot.
Figure 2
Figure 3
Figure 1 CGPA
Chance_of_Admit
Figure 2
Figure 3
Interpretation of Plots: Residual Versus Fit Plot
• It is the scattered plot of standardized residuals on
Y axis and predicted (fitted) values on X axis
• It is used to detect the unequal residual variances
and outliers in data
• Here are the characteristics of a well-behaved
residual vs. fits plot:
• The residuals should "bounce randomly" around
the 0 line and should roughly form a "horizontal
band" around the 0 line as shown in figure 1. This
suggests that the variances of the error terms are
equal
• No one residual should "stands out" from the
basic random pattern of residuals. This suggests
that there are no outliers.
⮚ Plots shown in
figures 2 & 3 above
depict unequal
error variances,
which is not
desirable for linear
regression analysis
Figure 1
Figure 2
Figure 3
Limitations
Time independent error
( fairly constant over time & lying within certain range)
Time dependent error (decreasing with time)
• Isotonic Regression is limited to predicting
numeric output i.e., dependent variable must be
numeric in nature
• Minimum sample size should be at least 20 cases
per independent variable
• Significant risk of overfitting for larger number of
isotonic constraints/thresholds.
• Isotonic Regression is monotonic and hence it is
not appropriate for fitting distributions that have
left and right tails.
Limitations (Continued…) Figure 1
Figure 2
• It does not fit derivatives, so it will not approximate
smooth curves like most distribution functions. It may be
useful to approximate heuristically the predicted values,
but would not be especially useful for extrapolation
beyond the extreme values of the x-axis data.
• Target/independent variables should be normally
distributed
• A normal distribution is an arrangement of a data set in
which most values cluster in the middle of the range and
the rest taper off symmetrically toward either extreme. It
will look like a bell curve as shown in figure 1 in right
• Outliers(observations lying outside overall pattern of
distribution) in data, both target as well as independent
variables can affect the analysis, hence outliers need to
be removed as shown in Figure 2 in right.
Business Use Case 1
Business Problem: Decide Loan Eligibility based on Applicant’s Annual income, Employment Period,
Debt to Income Ratio etc.
Input Data: Predictor/Independent Variable(s) to determine Applicant’s Loan Eligibility:
• House Ownership Status
• Job Grade
• Employment Length
• Annual Income
• Loan Verification Status
• Debt to Income Ratio
Business Benefit: Loan applicant’s can discover what predictors can lead towards the required loan
amount to be eligible for further proceedings in turn ensuring systematic banking approach and
also assist banks to check the loan eligibility criteria before sanctioning a loan to the applicant.
Business Use Case 2
Business Problem: Predicting diamond prices using basic measurement metrics.
Input Data: Predictor/Independent Variable(s) to determine the price of a Diamond:
• Carat weight of Diamond
• Quality of the Cut
• Diamond Color
• Clarity
• Depth
• The width of the diamond’s table
Business Benefit:
The predictive model will provide details on the pricing of diamonds and enable analysis of the most
prominent factors and trends in the diamond market.
Want to
Learn More?
Get in touch with us @
support@Smarten.com
And Do Checkout the Learning section
on
Smarten.com
September 2021

More Related Content

What's hot

Linear Regression vs Logistic Regression vs Poisson Regression
Linear Regression vs Logistic Regression vs Poisson RegressionLinear Regression vs Logistic Regression vs Poisson Regression
Linear Regression vs Logistic Regression vs Poisson RegressionKamil Bartocha
 
R square vs adjusted r square
R square vs adjusted r squareR square vs adjusted r square
R square vs adjusted r squareAkhilesh Joshi
 
Probabilistic Reasoning
Probabilistic ReasoningProbabilistic Reasoning
Probabilistic ReasoningJunya Tanaka
 
K-Folds Cross Validation Method
K-Folds Cross Validation MethodK-Folds Cross Validation Method
K-Folds Cross Validation MethodSHUBHAM GUPTA
 
Simple correlation & Regression analysis
Simple correlation & Regression analysisSimple correlation & Regression analysis
Simple correlation & Regression analysisAfra Fathima
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis pptDavid Jaison
 
Data Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesData Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesInvenkLearn
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingSumit Sharma
 
Exploratory data analysis project
Exploratory data analysis project Exploratory data analysis project
Exploratory data analysis project BabatundeSogunro
 
Exponential smoothing
Exponential smoothingExponential smoothing
Exponential smoothingJairo Moreno
 
Computing transformations
Computing transformationsComputing transformations
Computing transformationsTarun Gehlot
 
Machine learning and decision trees
Machine learning and decision treesMachine learning and decision trees
Machine learning and decision treesPadma Metta
 
Introduction To Predictive Analytics Part I
Introduction To Predictive Analytics   Part IIntroduction To Predictive Analytics   Part I
Introduction To Predictive Analytics Part Ijayroy
 
Data Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVAData Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVADerek Kane
 
Solving stepwise regression problems
Solving stepwise regression problemsSolving stepwise regression problems
Solving stepwise regression problemsSoma Sinha Roy
 
Unit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VEC
Unit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VECUnit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VEC
Unit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VECsundarKanagaraj1
 

What's hot (20)

Linear Regression vs Logistic Regression vs Poisson Regression
Linear Regression vs Logistic Regression vs Poisson RegressionLinear Regression vs Logistic Regression vs Poisson Regression
Linear Regression vs Logistic Regression vs Poisson Regression
 
Linear regression
Linear regressionLinear regression
Linear regression
 
MANOVA SPSS
MANOVA SPSSMANOVA SPSS
MANOVA SPSS
 
R square vs adjusted r square
R square vs adjusted r squareR square vs adjusted r square
R square vs adjusted r square
 
Probabilistic Reasoning
Probabilistic ReasoningProbabilistic Reasoning
Probabilistic Reasoning
 
K-Folds Cross Validation Method
K-Folds Cross Validation MethodK-Folds Cross Validation Method
K-Folds Cross Validation Method
 
Simple correlation & Regression analysis
Simple correlation & Regression analysisSimple correlation & Regression analysis
Simple correlation & Regression analysis
 
Pca analysis
Pca analysisPca analysis
Pca analysis
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
Data Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesData Analysis - Approach & Techniques
Data Analysis - Approach & Techniques
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Exploratory data analysis project
Exploratory data analysis project Exploratory data analysis project
Exploratory data analysis project
 
Exponential smoothing
Exponential smoothingExponential smoothing
Exponential smoothing
 
Computing transformations
Computing transformationsComputing transformations
Computing transformations
 
Machine learning and decision trees
Machine learning and decision treesMachine learning and decision trees
Machine learning and decision trees
 
Introduction To Predictive Analytics Part I
Introduction To Predictive Analytics   Part IIntroduction To Predictive Analytics   Part I
Introduction To Predictive Analytics Part I
 
Data Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVAData Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVA
 
Bayesian intro
Bayesian introBayesian intro
Bayesian intro
 
Solving stepwise regression problems
Solving stepwise regression problemsSolving stepwise regression problems
Solving stepwise regression problems
 
Unit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VEC
Unit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VECUnit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VEC
Unit IV UNCERTAINITY AND STATISTICAL REASONING in AI K.Sundar,AP/CSE,VEC
 

Similar to What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?

What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...Smarten Augmented Analytics
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...Smarten Augmented Analytics
 
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...Smarten Augmented Analytics
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
 
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxDistribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxmadlynplamondon
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)Abhimanyu Dwivedi
 
Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisShailendra Tomar
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inferenceKemal İnciroğlu
 
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...Smarten Augmented Analytics
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptxarsh260174
 
Regression Analysis Techniques.pptx
Regression Analysis Techniques.pptxRegression Analysis Techniques.pptx
Regression Analysis Techniques.pptxYutaItadori
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.pptTanyaWadhwani4
 
linearregression-1909240jhgg53948.pptx
linearregression-1909240jhgg53948.pptxlinearregression-1909240jhgg53948.pptx
linearregression-1909240jhgg53948.pptxbishalnandi2
 
Prediction of house price using multiple regression
Prediction of house price using multiple regressionPrediction of house price using multiple regression
Prediction of house price using multiple regressionvinovk
 

Similar to What is Isotonic Regression and How Can a Business Utilize it to Analyze Data? (20)

What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
 
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxDistribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
 
Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression Analysis
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
 
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptx
 
Regression Analysis Techniques.pptx
Regression Analysis Techniques.pptxRegression Analysis Techniques.pptx
Regression Analysis Techniques.pptx
 
Ders 2 ols .ppt
Ders 2 ols .pptDers 2 ols .ppt
Ders 2 ols .ppt
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
MF Presentation.pptx
MF Presentation.pptxMF Presentation.pptx
MF Presentation.pptx
 
Simple Linear Regression.pptx
Simple Linear Regression.pptxSimple Linear Regression.pptx
Simple Linear Regression.pptx
 
Simple egression.pptx
Simple egression.pptxSimple egression.pptx
Simple egression.pptx
 
Quantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramQuantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA Program
 
linearregression-1909240jhgg53948.pptx
linearregression-1909240jhgg53948.pptxlinearregression-1909240jhgg53948.pptx
linearregression-1909240jhgg53948.pptx
 
Prediction of house price using multiple regression
Prediction of house price using multiple regressionPrediction of house price using multiple regression
Prediction of house price using multiple regression
 

More from Smarten Augmented Analytics

Crime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenCrime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenSmarten Augmented Analytics
 
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...Smarten Augmented Analytics
 
What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?Smarten Augmented Analytics
 
Students' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenStudents' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenSmarten Augmented Analytics
 
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values  Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values Smarten Augmented Analytics
 
Fraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenFraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenSmarten Augmented Analytics
 
Quality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenQuality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenSmarten Augmented Analytics
 
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenMachine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenSmarten Augmented Analytics
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenSmarten Augmented Analytics
 
Human Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenHuman Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?Smarten Augmented Analytics
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?Smarten Augmented Analytics
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...Smarten Augmented Analytics
 
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...Smarten Augmented Analytics
 
What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...Smarten Augmented Analytics
 
What is the Paired Sample T Test and How is it Beneficial to Business Analysis?
What is the Paired Sample T Test and How is it Beneficial to Business Analysis?What is the Paired Sample T Test and How is it Beneficial to Business Analysis?
What is the Paired Sample T Test and How is it Beneficial to Business Analysis?Smarten Augmented Analytics
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?Smarten Augmented Analytics
 
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...Smarten Augmented Analytics
 

More from Smarten Augmented Analytics (20)

Crime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenCrime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – Smarten
 
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
 
What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?
 
Students' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenStudents' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – Smarten
 
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values  Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
 
Fraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenFraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – Smarten
 
Quality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenQuality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - Smarten
 
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenMachine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - Smarten
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - Smarten
 
Human Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenHuman Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - Smarten
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - Smarten
 
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
 
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
 
What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...
 
What is the Paired Sample T Test and How is it Beneficial to Business Analysis?
What is the Paired Sample T Test and How is it Beneficial to Business Analysis?What is the Paired Sample T Test and How is it Beneficial to Business Analysis?
What is the Paired Sample T Test and How is it Beneficial to Business Analysis?
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
 
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
 

Recently uploaded

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?

  • 1. Master the Art of Analytics A Simplistic Explainer Series For Citizen Data Scientists Journey Towards Augmented Analytics
  • 3. Terminologies Introduction & Example Standard input/tuning parameters & Sample UI Sample output UI Interpretation of Output Limitations Business use cases What is covered
  • 4. Terminologies • Predictors and Target variable: • Target variable usually denoted by Y, is the variable being predicted and is also called dependent variable, output variable, response variable or outcome variable • Predictor, usually denoted by X, sometimes called an independent or explanatory variable, is a variable that is being used to predict the target variable • Correlation: • Correlation is a statistical measure that indicates the extent to which two variables fluctuate together
  • 5. Terminologies (Continued...) • Isotonic Constraints/Thresholds: • These are the data points where we can estimate a set a linear model in between each of the thresholds to minimize the error regarding the training data
  • 6. Terminologies (Continued...) • Monotonic Constraints: • These are either increasing or decreasing set of values which are typically considered to be increasing in case of isotonic regression.
  • 7. ● xi is observed responses and X=x1, x2,..., xn • OBJECTIVE: It is a variant of linear regression and allows us to build model in piecewise linear manner i.e., breaking up the problem into few or many linear segments and perform linear interpolation of each function. • BENEFIT: o Unlike linear regression this model is not biased and is flexible. o Helps in multidimensional scaling. • MODEL: o Isotonic regression is the technique of fitting a free-form line to a sequence of observations such that the fitted line that lies close to the observations as possible. ● yi is a finite set of real numbers Y=y1, y2,...,yn ● wi are positive weights Introduction
  • 8. Example: Isotonic Regression Let’s conduct the Isotonic Regression Analysis on Admission Regression Data set on Independent Variables: CGPA, LOR, GRE_Score, TOEFL_Score and Target Variable: Chance_of_Admit as shown below: Independent variables (Xi) Target Variable (Y) Chance_of_admit CGPA LOR GRE_Score TOEFL_Score 0.46 8.0 3.0 308 110 0.64 8.18 3.0 312 98 0.72 8.79 2.5 319 110 0.45 7.46 2.5 290 104 0.57 7.46 2.5 311 98 Model is a an excellent fit when Adjusted R-Squared > 0.7 R-Squared 0.782 Adjusted R-Squared 0.781 R-Squared: It shows the goodness of fit of the model. It lies between 0 to 1 and closer the value to 1, better the model. Adjusted R-Squared: The adjusted R-squared is a modified version of R-squared that has been adjusted for the number of predictors in the model. It shows whether adding additional predictors improve a regression model or not.
  • 9. Select the Target Variable Chance_of_Admit CGPA LOR GRE_Score TOEFL_Score Step 1 Select the Predictor Variable Chance_of_Admit CGPA LOR GRE_Score TOEFL_Score Step 2 More than one predictors can be selected Step 3 isotonic = true (This indicates the default value that is based on the target variable, typically denoting the increasing property of the isotonic regression). By default, these parameters should be set with the values mentioned Step 4 Display the output window containing following: o Model Summary o Line Fit Plot o Residual Versus Fit Plot Note: ▪ Categorical predictors should be auto detected and converted to dummy/binary variables before applying regression ▪ Decision on selection of predictors depends on business knowledge and the correlation value between the target variable and predictors Standard Input/Tuning Parameters & Sample UI
  • 10. Sample Output: 1. Model Summary ● R-Squared: It shows the goodness of fit of the model. It lies between 0 to 1 and closer the value to 1, better the model. Root Mean Square Error (RMSE) 0.066 Mean Absolute Error (MAE) 0.048 Mean_Absolute_Percentile_Error (MAPE) 0.0762619 Mean Percentage Error (MPE) -0.0111406 ● Adjusted R-Squared: The adjusted R-squared is a modified version of R-squared that has been adjusted for the number of predictors in the model. It shows whether adding additional predictors improve a regression model or not. R-Squared 0.782 Adjusted R-Squared 0.781
  • 11. Sample Output: 1. Model Summary (Continued)... ● Root Mean Square Error (RMSE): Square root of the average of squared differences between prediction and actual observation. It is standard deviation of residual error. ● Mean Absolute Error (MAE): Average of the absolute differences between prediction and actual observation. ● Mean_Absolute_Percentile_Error (MAPE): Mean Absolute Percentage ratio of residual over actual observations. ● Mean Percentage Error (MPE): Mean Percentage Error conveys if there is more positive errors than negative errors or vice-versa based upon its sign. RMSE, MAE, MAPE and MPE are used to identify the variation in terms of errors from predicted to actual values. Lower the values, represent a better fit of the regression model.
  • 12. Sample Output: 2. Interpretation CGPA LOR GRE_Score TOEFL_Score Influencer's Importance Influencer’s Importance chart is used to show impact of each predictor on target variable. Target Variable: Chance_of_Admit
  • 13. Sample Output: 3. Plots CGPA Predicted Chance_of_Admit Line Fit Plot Residual versus Fit Plot Line fit plots are used to check the assumption of linearity between each Xi & Y Residual versus fit plot is used to check the assumption of equal error variances & outliers The line plot is plotted between Chance_of_Admit against CGPA. The residual versus Fit plot is plotted between Predicted Chance_of_Admit and Standard Residuals.
  • 14. Interpretation of Important Model Summary Statistics RMSE R Squared RMSE R-Squared R-Squared: Adjusted R-Squared: RMSE: • R-squared between 0 to 0.7 represents a model not fit well and assumptions of normality and linearity should be checked for better fitment of a model. • It shows the goodness of fit of the model. It lies between 0 to 1 and closer this value to 1, better the model • The adjusted R-squared is a modified version of R-squared that has been adjusted for the number of predictors in the model. It shows whether adding additional predictors improve a regression model or not • If the value is > 0.7, the model shows a better correlation between the dependent and independent variables • The more the variables, the lesser is the adjusted R-squared score • Square root of the average of squared differences between prediction and actual observation. It is standard deviation of residual error. • Lower values of RMSE indicate a better fit. The value ranges 0 to ∞.
  • 15. Interpretation of Important Model Summary Statistics (Continued…) RMSE R Squared RMSE R-Squared MAE: MAPE: MPE: • Average of the absolute differences between prediction and actual observation • Lower values of MAE indicate a better fit. The value ranges 0 to ∞ • Like RMSE, it is a negatively oriented score • Mean Absolute Percentage ratio of residual over actual observations • Lower the MAPE, better the performance of the model • Mean Percentage Error conveys if there is more positive errors than negative errors or vice- versa based upon its sign • In case of more negative errors the system underestimates, and in case of more positive errors the system overestimates
  • 16. Interpretation of Plots: Line Fit Plot • This plot is used to plot the relationship between each Xi (predictor) & Y (target variable) with Y-on-y axis and each Xi on x axis • As shown in the figure 1 in right, as temperature(X) increases, so does the Yield(Y), hence there is a linear relationship between X and Y and isotonic regression is applicable on this data • If line doesn’t display linearity as shown in figures 2 & 3 in right, then transformation can be applied on that particular variable before proceeding with model building • If data transformation doesn’t help, then either that variable(Xi) can be dropped from the analysis or nonlinear model should be chosen depending on the distribution pattern of scatter plot. Figure 2 Figure 3 Figure 1 CGPA Chance_of_Admit Figure 2 Figure 3
  • 17. Interpretation of Plots: Residual Versus Fit Plot • It is the scattered plot of standardized residuals on Y axis and predicted (fitted) values on X axis • It is used to detect the unequal residual variances and outliers in data • Here are the characteristics of a well-behaved residual vs. fits plot: • The residuals should "bounce randomly" around the 0 line and should roughly form a "horizontal band" around the 0 line as shown in figure 1. This suggests that the variances of the error terms are equal • No one residual should "stands out" from the basic random pattern of residuals. This suggests that there are no outliers. ⮚ Plots shown in figures 2 & 3 above depict unequal error variances, which is not desirable for linear regression analysis Figure 1 Figure 2 Figure 3
  • 18. Limitations Time independent error ( fairly constant over time & lying within certain range) Time dependent error (decreasing with time) • Isotonic Regression is limited to predicting numeric output i.e., dependent variable must be numeric in nature • Minimum sample size should be at least 20 cases per independent variable • Significant risk of overfitting for larger number of isotonic constraints/thresholds. • Isotonic Regression is monotonic and hence it is not appropriate for fitting distributions that have left and right tails.
  • 19. Limitations (Continued…) Figure 1 Figure 2 • It does not fit derivatives, so it will not approximate smooth curves like most distribution functions. It may be useful to approximate heuristically the predicted values, but would not be especially useful for extrapolation beyond the extreme values of the x-axis data. • Target/independent variables should be normally distributed • A normal distribution is an arrangement of a data set in which most values cluster in the middle of the range and the rest taper off symmetrically toward either extreme. It will look like a bell curve as shown in figure 1 in right • Outliers(observations lying outside overall pattern of distribution) in data, both target as well as independent variables can affect the analysis, hence outliers need to be removed as shown in Figure 2 in right.
  • 20. Business Use Case 1 Business Problem: Decide Loan Eligibility based on Applicant’s Annual income, Employment Period, Debt to Income Ratio etc. Input Data: Predictor/Independent Variable(s) to determine Applicant’s Loan Eligibility: • House Ownership Status • Job Grade • Employment Length • Annual Income • Loan Verification Status • Debt to Income Ratio Business Benefit: Loan applicant’s can discover what predictors can lead towards the required loan amount to be eligible for further proceedings in turn ensuring systematic banking approach and also assist banks to check the loan eligibility criteria before sanctioning a loan to the applicant.
  • 21. Business Use Case 2 Business Problem: Predicting diamond prices using basic measurement metrics. Input Data: Predictor/Independent Variable(s) to determine the price of a Diamond: • Carat weight of Diamond • Quality of the Cut • Diamond Color • Clarity • Depth • The width of the diamond’s table Business Benefit: The predictive model will provide details on the pricing of diamonds and enable analysis of the most prominent factors and trends in the diamond market.
  • 22. Want to Learn More? Get in touch with us @ support@Smarten.com And Do Checkout the Learning section on Smarten.com September 2021