Master the Art of Analytics
A Simplistic Explainer Series For Citizen Data Scientists
Journey Towards Augmented Analytics
Generalized Linear Regression
(Gaussian Distribution)
What is Covered
Terminologies
Introduction & Example
Standard input/tuning parameters & Sample UI
Sample output UI
Interpretation of Output
Limitations
Business use cases
Terminologies
▪ Target variable, usually denoted by Y, is the variable being predicted. It is also called the dependent variable, output variable, response variable or outcome variable (e.g., the column highlighted in the red box in the table below).
▪ Predictor, sometimes called an independent variable, is a variable used to predict the target variable (e.g., the columns highlighted in the green box in the table below).
Loan amount | Debt to income ratio | Grade | Annual income | Verification status
9000 | 30 | A | 9632 | Not Verified
4800 | 26 | B | 5022 | Not Verified
20000 | 25 | B | 5149 | Not Verified
6000 | 29 | A | 5225 | Verified
3000 | 27 | C | 5344 | Verified
The predictors (green box) are the attributes on which the target variable (red box, i.e., Loan amount) depends.
Terminologies (Continued…)
• Coefficients:
• A coefficient shows the magnitude and the direction of a predictor's impact on the target variable Y, that is, whether the relationship between the independent and the dependent variable is positive or negative.
• A positive coefficient indicates that as the value of the independent variable increases, the mean of the dependent variable also tends to increase.
• A negative coefficient indicates that as the value of the independent variable increases, the mean of the dependent variable tends to decrease.
• P-Value:
• It is used to evaluate whether the corresponding predictor X has a statistically significant impact on the target variable Y.
• At a 95% confidence level (significance level 0.05), a predictor with a p-value < 0.05 has a significant impact on the target variable.
• At a 95% confidence level, a predictor with a p-value ≥ 0.05 does not have a statistically significant impact on the target variable.
Introduction
• Objective:
– Generalized Linear Model (GLM) regression is a flexible generalization of ordinary linear regression that allows the response variable to have an error distribution other than the normal distribution.
• Benefit:
– It is a low-complexity model in which the response variable can follow any distribution from the exponential family, such as Gaussian or Poisson. It is also easy to interpret and shows how each predictor influences the outcome.
• Model:
– Here yi is assumed to follow a distribution from the Gaussian family, i.e., the measured values follow a normal distribution, with roughly equal numbers of measurements above and below the mean value. The distribution provides a parameterized mathematical function that can be used to calculate the probability of any individual observation from the sample.
– In that function, 𝛳 is the canonical parameter, which represents the estimate of location, and φ is the dispersion parameter, which represents the scale.
– GLM allows the distribution of y to take the shape of many different members of the exponential family.
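For reference, a common textbook way to write the exponential-family density that a GLM assumes is sketched below. The functions a, b and c are notation assumed here rather than taken from the slides; the second line shows how the Gaussian case fits this form with 𝛳 as the mean and φ as the variance.

```latex
f(y_i;\theta_i,\phi) = \exp\left\{ \frac{y_i\theta_i - b(\theta_i)}{a(\phi)} + c(y_i,\phi) \right\}

% Gaussian family: \theta = \mu, \quad b(\theta) = \theta^2/2, \quad a(\phi) = \phi = \sigma^2,
% c(y,\phi) = -\tfrac{1}{2}\left[ y^2/\phi + \log(2\pi\phi) \right], which gives
f(y;\mu,\sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left\{ -\frac{(y-\mu)^2}{2\sigma^2} \right\}
```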
Example: Generalized Linear Regression (Gaussian Distribution)
Let's conduct the Generalized Linear Regression (Gaussian Distribution) analysis on the Loan Eligibility data set, with independent variables Grade, Annual Income, Debt to Income Ratio and Verification Status, and target variable Loan Amount, as shown below:
Loan Amount | Debt to Income Ratio | Grade | Annual Income | Verification Status
725 | 30 | A | 9632 | Not Verified
1000 | 26 | B | 5022 | Not Verified
1000 | 25 | B | 5149 | Not Verified
1000 | 29 | A | 5225 | Verified
1000 | 27 | C | 5344 | Verified
Target variable (Y): Loan Amount
Independent variables (Xi): Debt to Income Ratio, Grade, Annual Income, Verification Status
R-Squared 0.867
Adjusted R-Squared 0.868
Model is an excellent fit when Adjusted R-Squared > 0.7
Adjusted R-Squared: A modified version of R-squared that has been adjusted for the number of predictors in the model. It shows whether adding predictors improves a regression model or not, and is used to judge the goodness of fit of the model.
R-Squared: The coefficient of determination, which represents the proportion of variance in the target variable explained by a regression model.
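The two definitions above can be sketched in a few lines of Python. The data values and the predictor count below are hypothetical, chosen only for illustration; the formulas are the usual textbook ones, under which the adjusted value is never larger than R-squared when the model has at least one predictor.

```python
def r_squared(actual, predicted):
    """Proportion of variance in the target explained by the model."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


def adjusted_r_squared(r2, n_rows, n_predictors):
    """R-squared penalized for the number of predictors in the model."""
    return 1 - (1 - r2) * (n_rows - 1) / (n_rows - n_predictors - 1)


# Hypothetical actual and predicted values, for illustration only.
actual = [10.0, 12.0, 15.0, 19.0, 24.0, 30.0]
predicted = [9.0, 13.0, 16.0, 18.0, 25.0, 29.0]

r2 = r_squared(actual, predicted)
# Assumes the hypothetical model used two predictors.
adj_r2 = adjusted_r_squared(r2, n_rows=len(actual), n_predictors=2)
print(round(r2, 3), round(adj_r2, 3))
```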
Standard Input/Tuning Parameters & Sample UI
Step 1: Select the Target Variable
o Debt to income ratio
o Loan amount
o Grade
o Verification status
o Annual income
Step 2: Select the Predictor variable(s)
o Debt to income ratio
o Loan amount
o Grade
o Verification status
o Annual income
More than one predictor can be selected.
Step 3: Set the tuning parameters
family = gaussian
(Each family has its own set of valid link functions; gaussian is the first family listed and is also the default.)
By default, these parameters should be set to the values mentioned.
Step 4: Display the output window containing the following:
o Model summary
o Line Fit plot
o Residual versus Fit plot
Note:
▪ The selection of predictors depends on business knowledge and on the correlation between the target variable and the predictors.
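As a rough illustration of what happens after these steps, the sketch below fits a Gaussian-family model with an identity link, for which the maximum likelihood fit coincides with ordinary least squares. It is not the tool's implementation: the function names and the data are hypothetical, only numeric predictors are used, and the encoding of categorical predictors such as Grade is left out.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b], so row operations update both sides.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_gaussian_glm(rows, target):
    """Least-squares fit (Gaussian family, identity link): intercept first."""
    design = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    k = len(design[0])
    # Normal equations: (X'X) beta = X'y
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * y for d, y in zip(design, target)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(coefficients, row):
    """Intercept plus the weighted sum of the predictor values."""
    return coefficients[0] + sum(c * v for c, v in zip(coefficients[1:], row))


# Hypothetical data generated exactly from y = 2 + 3*x1 - x2.
rows = [(1, 2), (2, 1), (3, 5), (4, 3), (5, 8), (6, 4)]
target = [2 + 3 * x1 - x2 for x1, x2 in rows]

coefficients = fit_gaussian_glm(rows, target)
print([round(c, 6) for c in coefficients])
```

Because the hypothetical target is an exact linear function of the predictors, the fitted intercept and coefficients should recover 2, 3 and -1 up to floating-point error.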
Sample Output: 1. Interpretation
Influencer's importance chart is used to show the impact of each predictor on the target variable.
[Chart: Influencer's Importance for target variable Loan amount; legend: Positive, Negative, Neutral]
Sample Output: 2. Model Summary
R-Squared 0.867
Adjusted R-Squared 0.868
● R-Squared: The coefficient of determination, which represents the proportion of variance in the target variable explained by a regression model.
● Adjusted R-Squared: A version of R-squared adjusted for the number of predictors in the model, so that it rewards only the predictors that genuinely help explain the target rather than every chosen predictor variable.
Root Mean Square Error (RMSE) 2847.658
Mean Absolute Error (MAE) 1979.2681
Mean Absolute Percentage Error (MAPE) 0.2484
Mean Percentage Error (MPE) -0.0735
Sample Output: 2. Model Summary (Continued…)
Variable Coefficient P-Value
Intercept 0.8272 0.0
Verification status (Not Verified) -0.6004 0.0
Grade(B) -0.0043 0.3950353
Grade(A) -0.0173 0.0016323
Grade(C) -0.0108 0.0556645
Grade(D) 0.0057 0.3778658
Grade(E) 0.027 0.0010318
Annual income 0 0.0177066
Debt to income ratio -0.0024 0.0
[Chart: Variable Significance by P-value; legend: Insignificant, Significant]
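The table can be read programmatically with a short sketch like the one below. The coefficient and p-value pairs are copied from the table; the 0.05 cutoff follows the 95% confidence threshold used in this deck, and the Positive/Negative/Neutral labels are an assumed mapping from the sign of each coefficient. The Annual income coefficient is shown as 0 in the table, presumably after rounding, so it is labeled Neutral here.

```python
ALPHA = 0.05  # significance level for a 95% confidence threshold

# (coefficient, p-value) pairs copied from the model summary table.
summary = {
    "Intercept": (0.8272, 0.0),
    "Verification status (Not Verified)": (-0.6004, 0.0),
    "Grade(B)": (-0.0043, 0.3950353),
    "Grade(A)": (-0.0173, 0.0016323),
    "Grade(C)": (-0.0108, 0.0556645),
    "Grade(D)": (0.0057, 0.3778658),
    "Grade(E)": (0.027, 0.0010318),
    "Annual income": (0.0, 0.0177066),
    "Debt to income ratio": (-0.0024, 0.0),
}


def direction(coefficient):
    """Label the sign of a coefficient (assumed mapping, for illustration)."""
    if coefficient > 0:
        return "Positive"
    if coefficient < 0:
        return "Negative"
    return "Neutral"


report = {
    name: (direction(coef), "Significant" if p < ALPHA else "Insignificant")
    for name, (coef, p) in summary.items()
}
insignificant = [name for name, (_, label) in report.items() if label == "Insignificant"]

for name, (sign, label) in report.items():
    print(f"{name}: {sign}, {label}")
```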
Sample Output: 2. Model Summary (Continued…)
● P-Value: At a 95% confidence level (significance level 0.05), a predictor with a p-value < 0.05 has a statistically significant impact on the target variable; a predictor with a p-value ≥ 0.05 does not.
● Root Mean Square Error (RMSE): Square root of the average of the squared differences between predictions and actual observations. It can be read as the standard deviation of the residual errors.
● Mean Absolute Error (MAE): Average of the absolute differences between predictions and actual observations.
● Mean Absolute Percentage Error (MAPE): Average of the absolute residuals expressed as a proportion of the actual observations.
● Mean Percentage Error (MPE): Average of the signed residuals expressed as a proportion of the actual observations. Its sign shows whether positive or negative errors dominate.
● RMSE, MAE, MAPE and MPE measure how far the predicted values deviate from the actual values.
● For RMSE, MAE and MAPE, lower values represent a better fit of the regression model; for MPE, values closer to zero indicate less bias.
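The four error measures can be computed as in the sketch below. It uses the five actual and predicted Loan amount values from the sample output rows in this deck, with errors taken as actual minus predicted; because the model summary was computed on the full data set, the numbers printed here will not match the summary values.

```python
import math

# Actual and predicted Loan amount for the five sample output rows.
actual = [9000, 4800, 20000, 6000, 3000]
predicted = [6393.003, 5664.367, 24760.365, 8364.874, 2509.877]

# Errors are taken as actual minus predicted.
errors = [a - p for a, p in zip(actual, predicted)]
n = len(errors)

rmse = math.sqrt(sum(e ** 2 for e in errors) / n)       # root mean square error
mae = sum(abs(e) for e in errors) / n                    # mean absolute error
mape = sum(abs(e) / a for e, a in zip(errors, actual)) / n  # as a proportion
mpe = sum(e / a for e, a in zip(errors, actual)) / n        # signed, as a proportion

print(f"RMSE={rmse:.3f} MAE={mae:.3f} MAPE={mape:.4f} MPE={mpe:.4f}")
```

With this sign convention, a negative MPE means the predictions are, on balance, higher than the actual values.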
Sample Output: 3. Predicted Class & Residuals
Loan amount | Debt to income ratio | Grade | Annual income | Verification status | Predicted Loan amount | Regression Residuals | Regression Standardized Residuals
9000 | 25 | B | 11585 | Verified | 6393.003 | 2606.997 | 0.92
4800 | 25 | E | 9082 | Not Verified | 5664.367 | -864.367 | -0.305
20000 | 1 | B | 17627 | Verified | 24760.365 | -4760.365 | -1.681
6000 | 23 | B | 14689 | Not Verified | 8364.874 | -2364.874 | -0.835
3000 | 29 | A | 9523 | Not Verified | 2509.877 | 490.123 | 0.173
The data output will contain the predicted Loan amount column along with the residuals and standardized residuals.
Sample Output: 3. Predicted Class & Residuals
• Residuals: The residual is the difference between the observed value of the dependent variable and the predicted value.
● Residual = Original Value - Predicted Value.
• Standardized Residuals: The ratio of the residual to the standard deviation of the residuals. It indicates how far, in standard deviation units, a prediction lies from the actual value of the target.
● Standardized Residual = Residual / Standard Deviation of Residuals.
• Interpretations based on residual values:
● A positive residual indicates that the prediction is lower than the actual value.
● A negative residual indicates that the prediction is higher than the actual value.
● A zero residual indicates that the prediction exactly matches the actual value.
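These definitions can be sketched as follows, using the five sample output rows from this deck. The standard deviation here is the sample standard deviation of just these five residuals, which is an assumption made for illustration; the tool's standardized residuals are based on the full data set, so they will differ from the values this sketch produces.

```python
import statistics

# Actual and predicted Loan amount for the five sample output rows.
actual = [9000, 4800, 20000, 6000, 3000]
predicted = [6393.003, 5664.367, 24760.365, 8364.874, 2509.877]

# Residual = original value - predicted value.
residuals = [a - p for a, p in zip(actual, predicted)]

# Assumption: standardize by the sample standard deviation of these residuals.
residual_sd = statistics.stdev(residuals)
standardized = [r / residual_sd for r in residuals]


def interpret(residual):
    """Describe a residual using the sign rules listed above."""
    if residual > 0:
        return "prediction lower than actual"
    if residual < 0:
        return "prediction higher than actual"
    return "prediction matches actual"


labels = [interpret(r) for r in residuals]
for r, z, label in zip(residuals, standardized, labels):
    print(f"{r:10.3f} {z:7.3f} {label}")
```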
Interpretation of Important Model Summary Statistics
R-Squared:
• The coefficient of determination, which represents the proportion of variance in the target variable explained by a regression model.
• It lies between 0 and 1; the closer the value is to 1, the better the model.
Adjusted R-Squared:
• A version of R-squared adjusted for the number of predictors in the model, so that it rewards only the predictors that genuinely help explain the target rather than every chosen predictor variable.
• Adjusted R-squared < 0.5: The model is not a good fit, and predictions are not accurate.
• 0.5 <= Adjusted R-squared < 0.7: The model is a good fit, and predictions are reasonably accurate.
• Adjusted R-squared >= 0.7: The model is a very good fit, and predictions are accurate.
P Value:
• At a 95% confidence level (significance level 0.05), a predictor with a p-value < 0.05 has a statistically significant impact on the target variable.
• At a 95% confidence level, a predictor with a p-value ≥ 0.05 does not have a statistically significant impact on the target variable.
Interpretation of Important Model Summary Statistics (Continued…)
RMSE:
• Square root of the average of the squared differences between predictions and actual observations. It can be read as the standard deviation of the residual errors.
• Lower values of RMSE indicate a better fit. The value ranges from 0 to ∞.
MAE:
• Average of the absolute differences between predictions and actual observations.
• Lower values of MAE indicate a better fit. The value ranges from 0 to ∞.
• Like RMSE, it is a negatively oriented score (lower is better).
MAPE:
• Average of the absolute residuals expressed as a proportion of the actual observations.
• The lower the MAPE, the better the performance of the model.
MPE:
• Average of the signed residuals expressed as a proportion of the actual observations. Its sign shows whether positive or negative errors dominate.
• With errors computed as actual minus predicted, a negative MPE means the model tends to overestimate, and a positive MPE means it tends to underestimate.
Sample Output: 4. Plots
Line fit plot: Loan amount (y-axis) is plotted against Annual income (x-axis). Line fit plots are used to check the assumption of linearity between each Xi & Y.
Residual versus fit plot: Standardized Residuals (y-axis) are plotted against Predicted Loan amount (x-axis). The residual versus fit plot is used to check the assumption of equal error variances and to detect outliers.
Interpretation of Plots: Line Fit Plot
[Figure 1: Line fit plot of Loan amount (y-axis) against Annual income (x-axis)]
• This plot shows the relationship between each Xi (predictor) and Y (target variable), with Y on the y-axis and each Xi on the x-axis.
• As shown in figure 1, as Annual Income (X) increases, so does the Loan Amount (Y); hence there is a linear relationship between X and Y, and generalized linear regression (Gaussian distribution) is applicable to this data.
• If the plot doesn't display linearity, as in figures 2 & 3, a transformation can be applied to that variable before proceeding with model building.
• If data transformation doesn't help, then either that variable (Xi) can be dropped from the analysis or a nonlinear model should be chosen, depending on the pattern of the scatter plot.
[Figures 2 and 3: line fit plots that do not display linearity]
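One rough numeric companion to the visual check is to compare the Pearson correlation between a predictor and the target before and after a candidate transformation. The sketch below assumes a log transformation and uses hypothetical data in which the target grows with the logarithm of the predictor; neither the data nor the choice of transformation comes from the slides.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient, a measure of linear association."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical predictor and target: y grows with log(x), not with x itself.
x = [1, 2, 4, 8, 16, 32, 64, 128]
y = [3.0 + 2.0 * math.log(v) for v in x]

raw_r = pearson(x, y)                          # predictor as recorded
log_r = pearson([math.log(v) for v in x], y)   # predictor after log transform

print(f"raw: {raw_r:.3f}  log-transformed: {log_r:.3f}")
```

A correlation that moves closer to 1 (or -1) after the transformation suggests the transformed variable is the better candidate for a linear fit, though the plot itself remains the primary check.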
Interpretation of Plots: Residual Versus Fit Plot
[Figure 1: Residual versus fit plot of Standardized Residuals (y-axis) against Predicted Loan amount (x-axis)]
• It is a scatter plot of standardized residuals on the y-axis and predicted (fitted) values on the x-axis.
• It is used to detect unequal residual variances and outliers in the data.
• Here are the characteristics of a well-behaved residual vs. fits plot:
● The residuals should "bounce randomly" around the 0 line and should roughly form a "horizontal band" around the 0 line, as shown in figure 1. This suggests that the variances of the error terms are equal.
● No single residual should "stand out" from the basic random pattern of residuals. This suggests that there are no outliers.
⮚ Plots shown in figures 2 & 3 depict unequal error variances, which is not desirable for regression analysis.
[Figures 2 and 3: residual versus fit plots with unequal error variances]
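The two visual checks can be approximated numerically, as in the sketch below. It compares the spread of standardized residuals in the lower and upper halves of the fitted values and flags residuals beyond a cutoff. The function names, the data, and the cutoff of 3 are all assumptions made for illustration, not part of the tool's output.

```python
import statistics


def spread_ratio(fitted, std_residuals):
    """Residual spread in the upper half of fitted values over the lower half."""
    pairs = sorted(zip(fitted, std_residuals))
    half = len(pairs) // 2
    low = [r for _, r in pairs[:half]]
    high = [r for _, r in pairs[half:]]
    return statistics.pstdev(high) / statistics.pstdev(low)


def find_outliers(std_residuals, cutoff=3.0):
    """Indexes of standardized residuals beyond the (assumed) cutoff."""
    return [i for i, r in enumerate(std_residuals) if abs(r) > cutoff]


# Hypothetical fitted values with two residual patterns.
fitted = [1, 2, 3, 4, 5, 6, 7, 8]
even_band = [0.5, -0.5, 0.4, -0.4, 0.5, -0.5, 0.4, -0.4]  # horizontal band
funnel = [0.1, -0.1, 0.2, -0.2, 1.0, -1.0, 2.0, -2.0]     # spread grows with fit

print(spread_ratio(fitted, even_band))   # a ratio near 1 suggests equal variances
print(spread_ratio(fitted, funnel))      # a ratio far from 1 suggests unequal variances
print(find_outliers([0.2, -3.4, 1.1]))   # positions of residuals that stand out
```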
Limitations
● Generalized Linear Regression (Gaussian distribution) is limited to predicting numeric output, i.e., the dependent variable must be numeric in nature.
● The minimum sample size should be at least 20 cases per independent variable.
● A categorical target variable, or time series data that is not normally distributed, is not supported by the Gaussian distribution in the generalized linear regression model.
[Figures: Time independent error (fairly constant over time and lying within a certain range); Time dependent error (decreasing with time)]
Limitations (Continued…)
● A normal distribution is an arrangement of a data set in which most values cluster in the middle of the range and the rest taper off symmetrically towards the extremes. It looks like a bell curve, as shown in figure 1.
● Outliers in the data (in the target as well as the independent variables) can affect the analysis; hence outliers need to be removed.
● Outliers are the observations lying outside the overall pattern of the distribution, as shown in figure 2.
[Figure 1: bell curve of a normal distribution]
[Figure 2: observations lying outside the overall pattern of the distribution]
Business Use Case 1
• Business Problem: Product’s Profit Prediction
• Identifying the profit made by each product based upon various factors like its total revenue, number of
units sold, region of sale etc.
• Input Data:
• Predictor/independent variables:
• Total Revenue
• Units Sold
• Region
• Total Cost
• Target/dependent variable:
• Total Profit
• Business Benefit:
• The predictive model will help us identify the profit on different products based on the sales, region and other cost factors.
Business Use Case 2
• Business Problem: Student’s Chance Of Admission Prediction
• To determine a student’s chance to get admission based on certain educational scores and factors.
• Input Data:
• Predictor/independent variables:
• CGPA
• GRE Score
• LOR
• Serial No
• TOEFL Score
• Target/dependent variable:
• Chance of admit
• Business Benefit:
• Using generalized linear regression, we can determine to what extent a person qualifies for admission based on various educational factors. This eases the entire admission process and allows the most eligible students to be selected.
Want to Learn More?
Get in touch with us at support@Smarten.com, and do check out the Learning section on Smarten.com.
September 2021

More Related Content

What's hot

Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values  Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values Smarten Augmented Analytics
 
What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...
What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...
What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...Smarten Augmented Analytics
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...Smarten Augmented Analytics
 
What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...Smarten Augmented Analytics
 
What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...
What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...
What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...Smarten Augmented Analytics
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?Smarten Augmented Analytics
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenSmarten Augmented Analytics
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...Smarten Augmented Analytics
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...Smarten Augmented Analytics
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning ProjectAbhishek Singh
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?Smarten Augmented Analytics
 
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...Smarten Augmented Analytics
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
 
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...PAPIs.io
 
Demand estimation by regression analysis
Demand estimation by regression analysisDemand estimation by regression analysis
Demand estimation by regression analysissohrab642
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using RANURAG SINGH
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROAnthony Kilili
 
Step by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectStep by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectRamkumar Ravichandran
 

What's hot (20)

Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values  Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
 
What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...
What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...
What is the KMeans Clustering Algorithm and How Does an Enterprise Use it to ...
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
 
What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...
 
What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...
What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...
What is the Holt-Winters Forecasting Algorithm and How Can it be Used for Ent...
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - Smarten
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning Project
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?
 
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
 
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
 
Demand Estimation
Demand EstimationDemand Estimation
Demand Estimation
 
Demand estimation by regression analysis
Demand estimation by regression analysisDemand estimation by regression analysis
Demand estimation by regression analysis
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using R
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
 
Step by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectStep by Step guide to executing an analytics project
Step by Step guide to executing an analytics project
 

Similar to What Is Generalized Linear Regression with Gaussian Distribution And How Can An Enterprise Use This Technique To Analyze Data?

Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisShailendra Tomar
 
Logistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationLogistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationAsadJaved304231
 
Intro to data science
Intro to data scienceIntro to data science
Intro to data scienceANURAG SINGH
 
German credit score shivaram prakash
German credit score shivaram prakashGerman credit score shivaram prakash
German credit score shivaram prakashShivaram Prakash
 
Stat_AMBA_600_Problem Set3
Stat_AMBA_600_Problem Set3Stat_AMBA_600_Problem Set3
Stat_AMBA_600_Problem Set3Tyler Anton
 
Regression Analysis.ppt
Regression Analysis.pptRegression Analysis.ppt
Regression Analysis.pptAbebe334138
 
Project Week 71. Both graphs shows a.docx
Project Week 71. Both graphs shows a.docxProject Week 71. Both graphs shows a.docx
Project Week 71. Both graphs shows a.docxwkyra78
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.pptTanyaWadhwani4
 
Exploring Data
Exploring DataExploring Data
Exploring Datajmalpass
 
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
Customer Satisfaction Data -  Multiple Linear Regression Model.pdfCustomer Satisfaction Data -  Multiple Linear Regression Model.pdf
Customer Satisfaction Data - Multiple Linear Regression Model.pdfruwanp2000
 
Satisfaction and loyalty
Satisfaction and loyaltySatisfaction and loyalty
Satisfaction and loyaltyTheDataNation
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
 
manecohuhuhuhubasicEstimation-1.pptx
manecohuhuhuhubasicEstimation-1.pptxmanecohuhuhuhubasicEstimation-1.pptx
manecohuhuhuhubasicEstimation-1.pptxasdfg hjkl
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionzcreichenbach
 
Demand estimation and forecasting
Demand estimation and forecastingDemand estimation and forecasting
Demand estimation and forecastingshivraj negi
 
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptMarket Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptEdu4Sure
 
Chapter 13 (1).pdf
Chapter 13 (1).pdfChapter 13 (1).pdf
Chapter 13 (1).pdfOyonFaiyaz
 
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docxInstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docxdirkrplav
 
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxDistribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxmadlynplamondon
 

Similar to What Is Generalized Linear Regression with Gaussian Distribution And How Can An Enterprise Use This Technique To Analyze Data? (20)

Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression Analysis
 
Logistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationLogistic regression and analysis using statistical information
Logistic regression and analysis using statistical information
 
Intro to data science
Intro to data scienceIntro to data science
What Is Generalized Linear Regression with Gaussian Distribution And How Can An Enterprise Use This Technique To Analyze Data?

  • 1. Master the Art of Analytics A Simplistic Explainer Series For Citizen Data Scientists Journey Towards Augmented Analytics
  • 3. Terminologies Introduction & Example Standard input/tuning parameters & Sample UI Sample output UI Interpretation of Output Limitations Business use cases What is Covered
  • 4. Terminologies
    ▪ Target variable: usually denoted by Y, this is the variable being predicted. It is also called the dependent variable, output variable, response variable or outcome variable (e.g., the column highlighted in the red box in the table below).
    ▪ Predictor: sometimes called an independent variable, this is a variable used to predict the target variable (e.g., the columns highlighted in the green box in the table below).
    Loan amount | Debt to income ratio | Grade | Annual income | Verification status
    9000 | 30 | A | 9632 | Not Verified
    4800 | 26 | B | 5022 | Not Verified
    20000 | 25 | B | 5149 | Not Verified
    6000 | 29 | A | 5225 | Verified
    3000 | 27 | C | 5344 | Verified
    The predictors highlighted in the green box are the attributes on which the target variable highlighted in the red box (i.e., Loan amount) depends.
  • 5. Terminologies (Continued…)
    • Coefficients:
      • They show the magnitude as well as the direction of the impact of each predictor on the target variable Y, i.e., whether the relationship between the dependent and independent variables is positive or negative.
      • A positive coefficient indicates that as the value of the independent variable increases, the mean of the dependent variable also tends to increase.
      • A negative coefficient suggests that as the independent variable increases, the dependent variable tends to decrease.
    • P-Value:
      • It is used to evaluate whether the corresponding predictor X has a significant impact on the target variable Y.
      • At a 95% confidence threshold, if the p-value for a predictor is < 0.05, then it has a significant impact on the target variable.
      • At a 95% confidence threshold, if the p-value for a predictor is > 0.05, then it doesn't have a significant impact on the target variable.
  • 6. Introduction
    • Objective: Generalized Linear Model (GLM) regression is a flexible generalization of ordinary linear regression that allows for response variables whose error distribution is something other than a normal distribution.
    • Benefit: It is a model with low complexity in which the response variable can follow any distribution from the exponential family, such as Gaussian or Poisson. It is also easy to interpret and lets us understand how each predictor influences the outcome.
    • Model: Here yi is assumed to come from the Gaussian family, i.e., the measured values follow a normal distribution that is symmetric about the mean, so values above and below the mean are equally likely. The distribution provides a parameterized mathematical function that can be used to calculate the probability of any individual observation from the sample. In that function, 𝛳 is the canonical parameter, which represents the estimate of location, and φ is the dispersion parameter, which represents the scale. GLM allows the distribution of y to come from many different members of the exponential family.
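The density formula on this slide did not survive in the transcript. As a hedged reconstruction, the standard textbook form of the exponential (dispersion) family, and its Gaussian special case, can be written as below. The functions a, b and c and the symbol φ are the usual textbook notation and are assumptions here, not taken from the slide.

```latex
% General exponential-family density used by GLMs
f(y_i;\theta_i,\phi) = \exp\!\left\{ \frac{y_i\,\theta_i - b(\theta_i)}{a(\phi)} + c(y_i,\phi) \right\}

% Gaussian case: \theta = \mu,\quad b(\theta) = \tfrac{\theta^2}{2},\quad a(\phi) = \phi = \sigma^2
c(y,\phi) = -\tfrac{1}{2}\left[ \frac{y^2}{\phi} + \log(2\pi\phi) \right]
\;\;\Longrightarrow\;\;
f(y;\mu,\sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left\{ -\frac{(y-\mu)^2}{2\sigma^2} \right\}
```

Substituting the Gaussian choices into the general form and completing the square gives the familiar normal density, which is why the location is carried by θ and the scale by φ.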
  • 7. Example: Generalized Linear Regression (Gaussian Distribution)
    Let’s conduct the Generalized Linear Regression (Gaussian Distribution) analysis on the Loan Eligibility data set, with independent variables (Xi) Grade, Annual Income, Debt to Income Ratio and Verification Status, and target variable (Y) Loan Amount, as shown below:
    Loan Amount | Debt to Income Ratio | Grade | Annual Income | Verification Status
    725 | 30 | A | 9632 | Not Verified
    1000 | 26 | B | 5022 | Not Verified
    1000 | 25 | B | 5149 | Not Verified
    1000 | 29 | A | 5225 | Verified
    1000 | 27 | C | 5344 | Verified
    R-Squared: 0.867
    Adjusted R-Squared: 0.868
    The model is an excellent fit when Adjusted R-Squared > 0.7.
    • R-Squared: The coefficient of determination, which represents the proportion of variance in the target variable explained by the regression model.
    • Adjusted R-Squared: A modified version of R-squared that has been adjusted for the number of predictors in the model. It shows whether adding further predictors improves the regression model or not, and it is used to judge the goodness of fit of the model.
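The adjustment described above can be sketched with the usual textbook formula, 1 - (1 - R²)(n - 1)/(n - p - 1). The function name, the row count and the predictor count below are hypothetical; the slides do not state the size of the data set.

```python
def adjusted_r_squared(r_squared: float, n_samples: int, n_predictors: int) -> float:
    """Adjust R-squared for the number of predictors in the model."""
    degrees_of_freedom = n_samples - n_predictors - 1
    if degrees_of_freedom <= 0:
        raise ValueError("need more samples than predictors + 1")
    # The penalty factor (n - 1) / (n - p - 1) grows with every added predictor.
    return 1.0 - (1.0 - r_squared) * (n_samples - 1) / degrees_of_freedom


# Hypothetical inputs: R-squared of 0.867 on 1000 rows with 8 predictors.
example = adjusted_r_squared(0.867, n_samples=1000, n_predictors=8)
print(round(example, 4))
```

With at least one predictor the penalty factor is at least 1, so under this formula the adjusted value does not exceed R-squared itself.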
  • 8. Standard Input/Tuning Parameters & Sample UI
    Step 1: Select the Target Variable from the available columns (Debt to income ratio, Loan amount, Grade, Verification status, Annual income).
    Step 2: Select the Predictor variable(s) from the same list of columns. More than one predictor can be selected.
    Step 3: Set family = gaussian. The family determines which link functions are valid; gaussian is the first option and also the default. By default, these parameters should be set with the values mentioned.
    Step 4: Display the output window containing the following:
      o Model summary
      o Line Fit plot
      o Residual versus Fit plot
    Note:
    ▪ The selection of predictors depends on business knowledge and on the correlation between the target variable and the predictors.
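As a rough illustration of what choosing family = gaussian amounts to: with the Gaussian family and the identity link, the maximum-likelihood fit of a GLM coincides with ordinary least squares. The sketch below fits a single predictor in plain Python. The function name is hypothetical, the five rows are the ones from the Terminologies table, and the slides do not show how the Smarten tool itself performs the fit.

```python
def fit_gaussian_identity(x, y):
    """Least-squares fit of y = intercept + slope * x (one predictor).

    For a Gaussian family with identity link this equals the GLM
    maximum-likelihood estimate.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("predictor has no variation")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Rows from the Terminologies table: Annual income (X) and Loan amount (Y).
annual_income = [9632, 5022, 5149, 5225, 5344]
loan_amount = [9000, 4800, 20000, 6000, 3000]

intercept, slope = fit_gaussian_identity(annual_income, loan_amount)
predicted = [intercept + slope * xi for xi in annual_income]
print(intercept, slope)
```

Five rows are far too few for a meaningful model; the point is only to show the mechanics behind the Step 3 setting.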
  • 9. Sample Output: 1. Interpretation
    The Influencer’s importance chart is used to show the impact of each predictor on the target variable.
    [Chart: Influencer’s Importance for target variable Loan amount; legend: Positive, Negative, Neutral]
  • 10. Sample Output: 2. Model Summary
    R-Squared: 0.867
    Adjusted R-Squared: 0.868
    Root Mean Square Error (RMSE): 2847.658
    Mean Absolute Error (MAE): 1979.2681
    Mean Absolute Percentage Error (MAPE): 0.2484
    Mean Percentage Error (MPE): -0.0735
    ● R-Squared: A coefficient of determination to represent the proportion of variance in the target variable explained by a regression model.
    ● Adjusted R-Squared: A statistical measure determining the variance in the target variable by considering only the predictors influencing the target, rather than considering all chosen predictor variables.
  • 11. Sample Output: 2. Model Summary (Continued…)
    Variable | Coefficient | P-Value
    Intercept | 0.8272 | 0.0
    Verification status (Not Verified) | -0.6004 | 0.0
    Grade(B) | -0.0043 | 0.3950353
    Grade(A) | -0.0173 | 0.0016323
    Grade(C) | -0.0108 | 0.0556645
    Grade(D) | 0.0057 | 0.3778658
    Grade(E) | 0.027 | 0.0010318
    Annual income | 0 | 0.0177066
    Debt to income ratio | -0.0024 | 0.0
    [Legend: Variable Significance by P-value: Insignificant, Significant]
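The table above can be read mechanically with the p-value rule from the Terminologies slides. The sketch below copies the rows from the table; the function name is hypothetical, and treating a p-value of exactly 0.05 as insignificant is an assumption, since the slides only cover < 0.05 and > 0.05.

```python
ALPHA = 0.05  # 95% confidence threshold used in the slides

# (variable, coefficient, p-value) rows copied from the model summary table.
summary = [
    ("Intercept", 0.8272, 0.0),
    ("Verification status (Not Verified)", -0.6004, 0.0),
    ("Grade(B)", -0.0043, 0.3950353),
    ("Grade(A)", -0.0173, 0.0016323),
    ("Grade(C)", -0.0108, 0.0556645),
    ("Grade(D)", 0.0057, 0.3778658),
    ("Grade(E)", 0.027, 0.0010318),
    ("Annual income", 0.0, 0.0177066),  # shown as 0 after rounding in the table
    ("Debt to income ratio", -0.0024, 0.0),
]


def describe(coefficient, p_value, alpha=ALPHA):
    """Return (significance, direction) for one row of the summary."""
    significance = "significant" if p_value < alpha else "insignificant"
    if coefficient > 0:
        direction = "positive"
    elif coefficient < 0:
        direction = "negative"
    else:
        direction = "neutral"
    return significance, direction


insignificant = [name for name, coef, p in summary
                 if describe(coef, p)[0] == "insignificant"]
print(insignificant)
```

Note that Grade(C), with a p-value of 0.0556645, falls just on the insignificant side of the threshold.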
  • 12. Sample Output: 2. Model Summary (Continued…)
    ● P-Value: At a 95% confidence threshold, if the p-value for a predictor is < 0.05, then it has a significant impact on the target variable. If the p-value is > 0.05, then it doesn't have a significant impact on the target variable.
    ● Root Mean Square Error (RMSE): Square root of the average of squared differences between prediction and actual observation. It is the standard deviation of the residual error.
    ● Mean Absolute Error (MAE): Average of the absolute differences between prediction and actual observation.
    ● Mean Absolute Percentage Error (MAPE): Mean of the absolute percentage ratio of the residual to the actual observation.
    ● Mean Percentage Error (MPE): Conveys, through its sign, whether there are more positive errors than negative errors or vice versa.
    ● RMSE, MAE, MAPE and MPE are used to quantify how far the predicted values deviate from the actual values.
    ● Lower values represent a better fit of the regression model.
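The four error measures defined above can be sketched in a few lines of plain Python. The function name is hypothetical, and the sign convention (residual = actual minus predicted, expressed as a fraction rather than multiplied by 100) is an assumption; the slides do not say which convention the tool uses for MPE. The five rows are taken from the sample output table of actual and predicted Loan amount.

```python
import math


def regression_errors(actual, predicted):
    """Return RMSE, MAE, MAPE and MPE for paired actual/predicted values.

    Assumes residual = actual - predicted and non-zero actual values.
    """
    residuals = [a - p for a, p in zip(actual, predicted)]
    n = len(residuals)
    rmse = math.sqrt(sum(r * r for r in residuals) / n)
    mae = sum(abs(r) for r in residuals) / n
    mape = sum(abs(r / a) for r, a in zip(residuals, actual)) / n
    mpe = sum(r / a for r, a in zip(residuals, actual)) / n
    return {"RMSE": rmse, "MAE": mae, "MAPE": mape, "MPE": mpe}


# Five rows from the sample output (actual vs. predicted Loan amount).
actual = [9000, 4800, 20000, 6000, 3000]
predicted = [6393.003, 5664.367, 24760.365, 8364.874, 2509.877]
print(regression_errors(actual, predicted))
```

These five rows will not reproduce the summary values on the slide, which were computed on the full data set.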
  • 13. Sample Output: 3. Predicted Class & Residuals
    Loan amount | Debt to income ratio | Grade | Annual income | Verification status | Predicted Loan amount | Regression Residuals | Regression Standardized Residuals
    9000 | 25 | B | 11585 | Verified | 6393.003 | 2606.997 | 0.92
    4800 | 25 | E | 9082 | Not Verified | 5664.367 | -864.367 | -0.305
    20000 | 1 | B | 17627 | Verified | 24760.365 | -4760.365 | -1.681
    6000 | 23 | B | 14689 | Not Verified | 8364.874 | -2364.874 | -0.835
    3000 | 29 | A | 9523 | Not Verified | 2509.877 | 490.123 | 0.173
    The data output contains the Predicted Loan amount column along with the residuals and standardized residuals.
  • 14. Sample Output: 3. Predicted Class & Residuals
    • Residuals: The difference between the observed value of the dependent variable and the predicted value is called the residual.
      ● Original Values - Predicted Values.
    • Standardized Residuals: The ratio of the residual (actual value minus predicted value) to the standard deviation of the residuals. It indicates how far a prediction deviates from the actual value relative to the typical error.
      ● Residuals / Standard Deviation of Residuals.
    • Interpretations based on residual values:
      ● A positive residual indicates that the prediction is lower than the actual value.
      ● A negative residual indicates that the prediction is higher than the actual value.
      ● A zero residual indicates that the prediction exactly matches the actual value.
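The two definitions above can be sketched as follows, using the five actual and predicted Loan amount values from the sample output table. The function names are hypothetical. The standard deviation here is taken over these five residuals only, which is an assumption made for illustration; the standardized values on the slide were presumably computed against the full data set, so they will differ.

```python
import statistics


def residuals(actual, predicted):
    """Original values minus predicted values."""
    return [a - p for a, p in zip(actual, predicted)]


def standardize(res):
    """Residuals divided by the (sample) standard deviation of the residuals."""
    sd = statistics.stdev(res)
    return [r / sd for r in res]


actual = [9000, 4800, 20000, 6000, 3000]
predicted = [6393.003, 5664.367, 24760.365, 8364.874, 2509.877]

res = residuals(actual, predicted)
std_res = standardize(res)

for a, r, s in zip(actual, res, std_res):
    # Positive residual: prediction below actual; negative: prediction above.
    print(a, round(r, 3), round(s, 3))
```

The residual column reproduces the Regression Residuals values in the table, since those depend only on each row's own actual and predicted values.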
  • 15. Interpretation of Important Model Summary Statistics
    R-Squared:
    • A coefficient of determination to represent the proportion of variance in the target variable explained by a regression model.
    • It lies between 0 and 1; the closer this value is to 1, the better the model.
    Adjusted R-Squared:
    • A statistical measure determining the variance in the target variable by considering only the predictors influencing the target, rather than considering all chosen predictor variables.
    • Adjusted R-squared < 0.5: The model is not a good fit, and predictions are not accurate.
    • 0.5 <= Adjusted R-squared < 0.7: The model is a good fit, and predictions are reasonably accurate.
    • Adjusted R-squared >= 0.7: The model is a very good fit, and predictions are accurate.
    P Value:
    • At a 95% confidence threshold, if the p-value for a predictor is < 0.05, then it has a significant impact on the target variable.
    • At a 95% confidence threshold, if the p-value for a predictor is > 0.05, then it doesn't have a significant impact on the target variable.
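The Adjusted R-squared bands above translate directly into a small lookup. This is only a sketch of the rule of thumb stated on the slide; the function name is hypothetical.

```python
def fit_quality(adjusted_r_squared: float) -> str:
    """Map an Adjusted R-squared value to the bands given on the slide."""
    if adjusted_r_squared < 0.5:
        return "not a good fit"
    if adjusted_r_squared < 0.7:
        return "good fit"
    return "very good fit"


# Adjusted R-squared reported in the sample model summary.
print(fit_quality(0.868))
```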
  • 16. Interpretation of Important Model Summary Statistics (Continued…)
    RMSE:
    • Square root of the average of squared differences between prediction and actual observation. It is the standard deviation of the residual error.
    • Lower values of RMSE indicate a better fit. The value ranges from 0 to ∞.
    MAE:
    • Average of the absolute differences between prediction and actual observation.
    • Lower values of MAE indicate a better fit. The value ranges from 0 to ∞.
    • Like RMSE, it is a negatively oriented score, i.e., lower is better.
    MAPE:
    • Mean of the absolute percentage ratio of the residual to the actual observation.
    • The lower the MAPE, the better the performance of the model.
    MPE:
    • Conveys, through its sign, whether there are more positive errors than negative errors or vice versa.
    • In case of more negative errors the system underestimates, and in case of more positive errors the system overestimates.
  • 17. Sample Output: 4. Plots
    • Line fit plot: plots Loan amount against Annual income. Line fit plots are used to check the assumption of linearity between each Xi and Y.
    • Residual versus Fit plot: plots Standardized Residuals against Predicted Loan amount. It is used to check the assumption of equal error variances and to detect outliers.
  • 18. Interpretation of Plots: Line Fit Plot
    • This plot shows the relationship between each Xi (predictor) and Y (target variable), with Y on the y-axis and each Xi on the x-axis.
    • As shown in figure 1, as Annual income (X) increases, so does Loan amount (Y); hence there is a linear relationship between X and Y, and generalized linear regression (Gaussian distribution) is applicable to this data.
    • If the plot doesn’t display linearity, as in figures 2 and 3, a transformation can be applied to that variable before proceeding with model building.
    • If data transformation doesn’t help, either that variable (Xi) can be dropped from the analysis or a nonlinear model should be chosen, depending on the distribution pattern of the scatter plot.
    [Figure 1: Line fit plot of Loan amount against Annual income. Figures 2 and 3: plots that do not display linearity.]
  • 19. Interpretation of Plots: Residual Versus Fit Plot
    • It is the scatter plot of standardized residuals on the Y axis against predicted (fitted) values on the X axis.
    • It is used to detect unequal residual variances and outliers in the data.
    • Characteristics of a well-behaved residual vs. fits plot:
      ● The residuals should "bounce randomly" around the 0 line and should roughly form a "horizontal band" around the 0 line, as shown in figure 1. This suggests that the variances of the error terms are equal.
      ● No single residual should "stand out" from the basic random pattern of residuals. This suggests that there are no outliers.
    ⮚ The plots shown in figures 2 and 3 depict unequal error variances, which is not desirable for regression analysis.
    [Figure 1: Residual versus Fit plot of Standardized Residuals against Predicted Loan amount. Figures 2 and 3: plots with unequal error variances.]
  • 20. Limitations
    ● Generalized Linear Regression is limited to predicting numeric output, i.e., the dependent variable must be numeric in nature.
    ● The sample size should be at least 20 cases per independent variable.
    ● The data may be categorical, or time series data may not be normally distributed; neither case is supported by the Gaussian distribution in a generalized linear regression model.
    [Figures: time-independent error (fairly constant over time and lying within a certain range); time-dependent error (decreasing with time)]
  • 21. Limitations (Continued…)
    ● A normal distribution is an arrangement of a data set in which most values cluster in the middle of the range and the rest taper off symmetrically toward the extremes. It looks like a bell curve, as shown in figure 1.
    ● Outliers in the data (in the target as well as in the independent variables) can affect the analysis, hence outliers need to be removed.
    ● Outliers are observations lying outside the overall pattern of the distribution, as shown in figure 2.
    [Figure 1: bell curve of a normal distribution. Figure 2: distribution with outliers.]
  • 22. Business Use Case 1
    • Business Problem: Product’s Profit Prediction
      • Identifying the profit made by each product based on various factors such as its total revenue, number of units sold, region of sale, etc.
    • Input Data:
      • Predictor/independent variables: Total Revenue, Units Sold, Region, Total Cost
      • Target/dependent variable: Total Profit
    • Business Benefit:
      • The predictive model will help us identify the profit on different products based on the sales, region and other cost factors.
  • 23. Business Use Case 2
    • Business Problem: Student’s Chance Of Admission Prediction
      • To determine a student’s chance of getting admission based on certain educational scores and factors.
    • Input Data:
      • Predictor/independent variables: CGPA, GRE Score, LOR, Serial No, TOEFL Score
      • Target/dependent variable: Chance of admit
    • Business Benefit:
      • Using generalized linear regression, we can determine to what extent a person qualifies for admission based on various educational factors. This eases the entire admission process and allows the most eligible students to be selected.
  • 24. Want to Learn More? Get in touch with us at support@Smarten.com, and do check out the Learning section on Smarten.com. September 2021