SlideShare a Scribd company logo
Zac Bodner
Final Lab Assignment

In class this semester, we have already explored regression for explanatory purposes. For
example, we previously built a regression model to explain what effects certain independent
variables (trustworthiness, intelligence, “like me-ness”) have on certain dependent variables (I
have a good opinion of) in political candidates. These models were helpful in determining how
different candidates could employ certain tactics to gain favor (and hopefully votes) from voters.
In this regard, regression models are effective tools in explaining the reasons behind certain
phenomena - like why certain candidates do well with certain populations, or what factors make
us love ice cream - but their real power comes in how capable these models are of predicting
certain outcomes. For example, if we have an explanatory regression model that identifies
certain characteristics (variables) shared by both customers and non-customers in a particular
data set - could we then turn around and use that model to identify additional, probable
customers (and non-customers) in another, independent data set (and make tons of cash for our
employers in the process)?
We can and we will!
The following paper presents a step by step explanation of how to do this, based on our final lab
assignment of the semester.
The first thing we must do is divide the current data set (Customer, we’ve been using it all
semester) in half. This will allow us to confirm our model’s findings in one half of the data set on
the other. In other words, this will help us prove that our model will not only work on the data set
that we are testing, but in other, unrelated (random) data sets, as well.
The key to dividing a data set into equal, random halves, is to confirm that both sides are
distributed evenly on a number of variables. We have already divided the data set using SPSS
in a previous assignment, and confirmed the randomness and equality of both halves by
examining the distributions of some of these variables in a CROSSTABS setting. If the
differences in the distributions of these variables are very small, (fractions of a percent) then we
are good to go. From the previous assignment, we can confirm that our data set is split into two
randomized and equal halves.
From here, we must calculate some of the interactions between variables that we found in
another previous assignment - the CHAID segmentation. CHAID stands for Chi Square
Automatic Interaction Detector. It produces a tree that shows which variables contain the largest
segments of customers, and continues on by further dividing each segment. For example - the
tree starts by segmenting customers from non-customers. Then, it segments the customers
further by say, the market value of their home. Then it continues, by separating this market
value segment into each gender. By doing this, we can examine the percentage of customers in
each segment, and compare their concentration to the rest of the data set for targeting
purposes.
By identifying these interactions, we can make new variables to add to our model. This is what
we will do here. We do this because, even though CHAID is a great tool for segmenting the data
set, we are more interested in seeing the total, combined interactions. For this purpose, a
regression will always be the better option. To turn these segmentation interactions into
variables, we simply multiply some of our segments that demonstrated high concentrations of
customers. Here is a screenshot of a few that I used:
These interactions are computed, and added to the list of our data set’s independent variables.
From here, we can add them and all of our other independent variables to a step-wise
regression. First, we must make sure to select which half of the data-set we want to test. We will
test Half 1 - by selecting it in SPSS.
A step-wise regression is valuable for differentiating significant predictor (independent) variables
from insignificant ones. All you do is throw the kitchen sink (all of your variables) in, and SPSS
will find the ones with the strongest beta coefficients (relationships to our outcome variable) and
order them from highest to lowest in the model. For purposes of orderliness and ease of use, we
always want our models to be parsimonious - meaning they have as few variables as possible
while still making good predictions.
This being the case, I chose the first eleven variables the step-wise regression returned. You will
know when to stop adding variables by how much the total R-Square value increases. The R-
Square (and adjusted R-square - for multiple variables) is a measure of how much of the
variance in the outcome variable the independent variables in the model explain. If you have ten
variables that have an adjusted R-square of .159, or fifteen variables with a .164 - it’s best to
just use the ten variables, because each additional variable isn’t explaining much variance at
this point.
We’re cooking with gas now!
We have eleven solid, significant variables that we can now throw into a regular regression,
signified by switching the “Stepwise” option to the “Enter” option in SPSS. It is important to note
that for this assignment, we need to check the option to “replace missing values with the mean”
in SPSS.
This means that if any of the single members of our data population have missing values, we
will replace those missing values with the mean for that data point, instead of tossing the
member altogether. This way, we do not detract or add anything from the model, but we don’t
have to waste data. Luckily for us, of the eleven variables that made it into our model - there
were no missing data.
Now, we will input the variables into the “enter” regression, and save the output as a variable.
We will call the variable PREDICT, because we will use it later to predict, based on our
observations of customers and non-customers in this data-set, the likelihood of finding
additional customers among separate data-sets.
We will now divide this output into deciles, since we are concerned primarily with our model’s
capability of finding prospects based on how much they resemble the observed customers of
this data-set. SPSS does this fairly easily by going to Transform and selecting Rank Cases. We
then save that output as DECILES, and using CROSSTABS, we can compare our observed
customers with our ranked predictions. Here is a screenshot of this:
This is great; our prediction works. We would hope that the highest deciles (1) have the highest
concentration of customers, and vice-versa with the non-customers coming from the lowest
deciles (10). As we can see, it does. In the first decile, we have an 8.9% higher concentration of
customers than in the rest of the data-set.
We can take those odds to Vegas!
Now, we have to test this model on the other half, Half 2. To do this, we have to calculate a
score for our output that we can apply to the second half, but before we do this - we must
confirm the score we calculate correctly interprets the regression output that we have.
This is pretty easy, we go into Transform > Compute and enter in an equation based on our
variables in the model:
Our final score then looks like this, and we save it as the another variable in the set, SCORE:
We then compare this variable SCORE to PREDICT, and luckily for us - they are almost
identical. This means our score calculation is a correct interpretation of our regression model,
and can now be applied to an independent sample (data-set).
To get to our simulated, independent data set - we now select Half 2. With Half 2 active, we run
the code we just made, and then transform the output again into deciles. We will save these
deciles as DECILES2, and run the same CROSSTABS function as we did before.
If we have done our job correctly, this Crosstabs will look almost identical, and hopefully a little
better than the first one. Drumroll!!!!
Hallelujah! This crosstabs has a slightly higher percentage of customers in the first decile, but is
practically the same. Check out the comparison on the next page. Our predictive model works!
This is great news. We built a regression model and ran i on one data-set. If the model worked,
we would expect similar results by running the same model on a different data-set. We got those
similar results.
With these techniques and a handy tool like SPSS, we can take a data set, build a regression
model to find characteristics shared by customers and non-customers - then validate this model
on an independent data set. By doing this, we can significantly increase our chances of finding
new customers anywhere, which is obviously an incredibly valuable skill to have.
But remember, anyone can input numbers into SPSS. The real difference between a good and
great market researcher is being able to interpret those numbers, by asking good questions and
employing impeccable language!

More Related Content

What's hot

Spss software
Spss softwareSpss software
Spss software
ManonmaniA3
 
SPM User's Guide: Introducing MARS
SPM User's Guide: Introducing MARSSPM User's Guide: Introducing MARS
SPM User's Guide: Introducing MARS
Salford Systems
 
Statistical analysis in SPSS_
Statistical analysis in SPSS_ Statistical analysis in SPSS_
Statistical analysis in SPSS_
Dr. Anugamini Priya
 
Spss beginners
Spss beginnersSpss beginners
Spss beginners
Mbabazi Theos
 
Data processing & Analysis: SPSS an overview
Data processing & Analysis: SPSS an overviewData processing & Analysis: SPSS an overview
Data processing & Analysis: SPSS an overview
ATHUL RAVI
 
Nurses Data Analysis by Applied SPSS
Nurses Data Analysis by Applied SPSSNurses Data Analysis by Applied SPSS
Nurses Data Analysis by Applied SPSS
ijtsrd
 
Moderation and Meditation conducting in SPSS
Moderation and Meditation conducting in SPSSModeration and Meditation conducting in SPSS
Moderation and Meditation conducting in SPSS
Osama Yousaf
 
SOC2002 Lecture 11
SOC2002 Lecture 11SOC2002 Lecture 11
SOC2002 Lecture 11Bonnie Green
 
Outlier managment
Outlier managmentOutlier managment
Outlier managment
Siddhartha Harit
 
Spss presentation
Spss presentationSpss presentation
Spss presentation
Kabir Khan
 
Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013
Kimmo Vehkalahti
 
Spss data analysis for univariate, bivariate and multivariate statistics by d...
Spss data analysis for univariate, bivariate and multivariate statistics by d...Spss data analysis for univariate, bivariate and multivariate statistics by d...
Spss data analysis for univariate, bivariate and multivariate statistics by d...
Dr. Sola Maitanmi
 
Lecture7a Applied Econometrics and Economic Modeling
Lecture7a Applied Econometrics and Economic ModelingLecture7a Applied Econometrics and Economic Modeling
Lecture7a Applied Econometrics and Economic Modeling
stone55
 
How to process data in SPSS ?
How to process data in SPSS ? How to process data in SPSS ?
How to process data in SPSS ?
Quix Kerala
 
Ibm spss statistics 19 brief guide
Ibm spss statistics 19 brief guideIbm spss statistics 19 brief guide
Ibm spss statistics 19 brief guide
Marketing Utopia
 
What is regression / Quantification of the impact ?
What is regression / Quantification of the impact ?What is regression / Quantification of the impact ?
What is regression / Quantification of the impact ?
Rupak Roy
 
Recep maz msb 701 quantitative analysis for managers
Recep maz msb 701 quantitative analysis for managersRecep maz msb 701 quantitative analysis for managers
Recep maz msb 701 quantitative analysis for managersrecepmaz
 
Statistical Approaches to Missing Data
Statistical Approaches to Missing DataStatistical Approaches to Missing Data
Statistical Approaches to Missing Data
DataCards
 

What's hot (20)

Spss software
Spss softwareSpss software
Spss software
 
Spss
SpssSpss
Spss
 
SPM User's Guide: Introducing MARS
SPM User's Guide: Introducing MARSSPM User's Guide: Introducing MARS
SPM User's Guide: Introducing MARS
 
Statistical analysis in SPSS_
Statistical analysis in SPSS_ Statistical analysis in SPSS_
Statistical analysis in SPSS_
 
Spss beginners
Spss beginnersSpss beginners
Spss beginners
 
Data processing & Analysis: SPSS an overview
Data processing & Analysis: SPSS an overviewData processing & Analysis: SPSS an overview
Data processing & Analysis: SPSS an overview
 
Nurses Data Analysis by Applied SPSS
Nurses Data Analysis by Applied SPSSNurses Data Analysis by Applied SPSS
Nurses Data Analysis by Applied SPSS
 
Moderation and Meditation conducting in SPSS
Moderation and Meditation conducting in SPSSModeration and Meditation conducting in SPSS
Moderation and Meditation conducting in SPSS
 
SOC2002 Lecture 11
SOC2002 Lecture 11SOC2002 Lecture 11
SOC2002 Lecture 11
 
Outlier managment
Outlier managmentOutlier managment
Outlier managment
 
Spss an introduction
Spss  an introductionSpss  an introduction
Spss an introduction
 
Spss presentation
Spss presentationSpss presentation
Spss presentation
 
Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013
 
Spss data analysis for univariate, bivariate and multivariate statistics by d...
Spss data analysis for univariate, bivariate and multivariate statistics by d...Spss data analysis for univariate, bivariate and multivariate statistics by d...
Spss data analysis for univariate, bivariate and multivariate statistics by d...
 
Lecture7a Applied Econometrics and Economic Modeling
Lecture7a Applied Econometrics and Economic ModelingLecture7a Applied Econometrics and Economic Modeling
Lecture7a Applied Econometrics and Economic Modeling
 
How to process data in SPSS ?
How to process data in SPSS ? How to process data in SPSS ?
How to process data in SPSS ?
 
Ibm spss statistics 19 brief guide
Ibm spss statistics 19 brief guideIbm spss statistics 19 brief guide
Ibm spss statistics 19 brief guide
 
What is regression / Quantification of the impact ?
What is regression / Quantification of the impact ?What is regression / Quantification of the impact ?
What is regression / Quantification of the impact ?
 
Recep maz msb 701 quantitative analysis for managers
Recep maz msb 701 quantitative analysis for managersRecep maz msb 701 quantitative analysis for managers
Recep maz msb 701 quantitative analysis for managers
 
Statistical Approaches to Missing Data
Statistical Approaches to Missing DataStatistical Approaches to Missing Data
Statistical Approaches to Missing Data
 

Similar to Building a Regression Model using SPSS

introduction to spss
introduction to spssintroduction to spss
introduction to spssOmid Minooee
 
Data Coding and Data Management using SPSS
Data Coding and Data Management using SPSSData Coding and Data Management using SPSS
Data Coding and Data Management using SPSS
Melba Shaya Sweety
 
Structural equation modeling in amos
Structural equation modeling in amosStructural equation modeling in amos
Structural equation modeling in amos
Balaji P
 
Data Science - Part XV - MARS, Logistic Regression, & Survival Analysis
Data Science -  Part XV - MARS, Logistic Regression, & Survival AnalysisData Science -  Part XV - MARS, Logistic Regression, & Survival Analysis
Data Science - Part XV - MARS, Logistic Regression, & Survival Analysis
Derek Kane
 
Estimating Models Using Dummy VariablesYou have had plenty of op.docx
Estimating Models Using Dummy VariablesYou have had plenty of op.docxEstimating Models Using Dummy VariablesYou have had plenty of op.docx
Estimating Models Using Dummy VariablesYou have had plenty of op.docx
SANSKAR20
 
one-way-rm-anova-DE300.pdf
one-way-rm-anova-DE300.pdfone-way-rm-anova-DE300.pdf
one-way-rm-anova-DE300.pdf
luizsilva460739
 
HRUG - Linear regression with R
HRUG - Linear regression with RHRUG - Linear regression with R
HRUG - Linear regression with R
egoodwintx
 
Chapter 18,19
Chapter 18,19Chapter 18,19
Chapter 18,19
heba_ahmad
 
M08 BiasVarianceTradeoff
M08 BiasVarianceTradeoffM08 BiasVarianceTradeoff
M08 BiasVarianceTradeoff
Raman Kannan
 
Spss by vijay ambast
Spss by vijay ambastSpss by vijay ambast
Spss by vijay ambast
Vijay Ambast
 
Guide for building GLMS
Guide for building GLMSGuide for building GLMS
Guide for building GLMSAli T. Lotia
 
Statistical-Process-Control-Analysis-Unraveled_updated210
Statistical-Process-Control-Analysis-Unraveled_updated210Statistical-Process-Control-Analysis-Unraveled_updated210
Statistical-Process-Control-Analysis-Unraveled_updated210pbaxter
 
Linear logisticregression
Linear logisticregressionLinear logisticregression
Linear logisticregression
kongara
 
Class,BelowistheFormatyoushouldfollowwhentypingup.docx
Class,BelowistheFormatyoushouldfollowwhentypingup.docxClass,BelowistheFormatyoushouldfollowwhentypingup.docx
Class,BelowistheFormatyoushouldfollowwhentypingup.docx
mccormicknadine86
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Boston Institute of Analytics
 
B409 W11 Sas Collaborative Stats Guide V4.2
B409 W11 Sas Collaborative Stats Guide V4.2B409 W11 Sas Collaborative Stats Guide V4.2
B409 W11 Sas Collaborative Stats Guide V4.2marshalkalra
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
Konpal Darakshan
 
Lobsters, Wine and Market Research
Lobsters, Wine and Market ResearchLobsters, Wine and Market Research
Lobsters, Wine and Market Research
Ted Clark
 
Data Preparation with the help of Analytics Methodology
Data Preparation with the help of Analytics MethodologyData Preparation with the help of Analytics Methodology
Data Preparation with the help of Analytics Methodology
Rupak Roy
 
Essay on-data-analysis
Essay on-data-analysisEssay on-data-analysis
Essay on-data-analysis
Raman Kannan
 

Similar to Building a Regression Model using SPSS (20)

introduction to spss
introduction to spssintroduction to spss
introduction to spss
 
Data Coding and Data Management using SPSS
Data Coding and Data Management using SPSSData Coding and Data Management using SPSS
Data Coding and Data Management using SPSS
 
Structural equation modeling in amos
Structural equation modeling in amosStructural equation modeling in amos
Structural equation modeling in amos
 
Data Science - Part XV - MARS, Logistic Regression, & Survival Analysis
Data Science -  Part XV - MARS, Logistic Regression, & Survival AnalysisData Science -  Part XV - MARS, Logistic Regression, & Survival Analysis
Data Science - Part XV - MARS, Logistic Regression, & Survival Analysis
 
Estimating Models Using Dummy VariablesYou have had plenty of op.docx
Estimating Models Using Dummy VariablesYou have had plenty of op.docxEstimating Models Using Dummy VariablesYou have had plenty of op.docx
Estimating Models Using Dummy VariablesYou have had plenty of op.docx
 
one-way-rm-anova-DE300.pdf
one-way-rm-anova-DE300.pdfone-way-rm-anova-DE300.pdf
one-way-rm-anova-DE300.pdf
 
HRUG - Linear regression with R
HRUG - Linear regression with RHRUG - Linear regression with R
HRUG - Linear regression with R
 
Chapter 18,19
Chapter 18,19Chapter 18,19
Chapter 18,19
 
M08 BiasVarianceTradeoff
M08 BiasVarianceTradeoffM08 BiasVarianceTradeoff
M08 BiasVarianceTradeoff
 
Spss by vijay ambast
Spss by vijay ambastSpss by vijay ambast
Spss by vijay ambast
 
Guide for building GLMS
Guide for building GLMSGuide for building GLMS
Guide for building GLMS
 
Statistical-Process-Control-Analysis-Unraveled_updated210
Statistical-Process-Control-Analysis-Unraveled_updated210Statistical-Process-Control-Analysis-Unraveled_updated210
Statistical-Process-Control-Analysis-Unraveled_updated210
 
Linear logisticregression
Linear logisticregressionLinear logisticregression
Linear logisticregression
 
Class,BelowistheFormatyoushouldfollowwhentypingup.docx
Class,BelowistheFormatyoushouldfollowwhentypingup.docxClass,BelowistheFormatyoushouldfollowwhentypingup.docx
Class,BelowistheFormatyoushouldfollowwhentypingup.docx
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
B409 W11 Sas Collaborative Stats Guide V4.2
B409 W11 Sas Collaborative Stats Guide V4.2B409 W11 Sas Collaborative Stats Guide V4.2
B409 W11 Sas Collaborative Stats Guide V4.2
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Lobsters, Wine and Market Research
Lobsters, Wine and Market ResearchLobsters, Wine and Market Research
Lobsters, Wine and Market Research
 
Data Preparation with the help of Analytics Methodology
Data Preparation with the help of Analytics MethodologyData Preparation with the help of Analytics Methodology
Data Preparation with the help of Analytics Methodology
 
Essay on-data-analysis
Essay on-data-analysisEssay on-data-analysis
Essay on-data-analysis
 

More from Zac Bodner

Segmenting Audiences and Establishing Content Pillars
Segmenting Audiences and Establishing Content PillarsSegmenting Audiences and Establishing Content Pillars
Segmenting Audiences and Establishing Content PillarsZac Bodner
 
SPSS
SPSS SPSS
SPSS
Zac Bodner
 
Data Science Time!
Data Science Time!Data Science Time!
Data Science Time!
Zac Bodner
 
Data Science Time!
Data Science Time!Data Science Time!
Data Science Time!
Zac Bodner
 
Case Studies 101
Case Studies 101Case Studies 101
Case Studies 101
Zac Bodner
 
Case Methodolgy in Marketing Strategy: Brand Development
Case Methodolgy in Marketing Strategy: Brand DevelopmentCase Methodolgy in Marketing Strategy: Brand Development
Case Methodolgy in Marketing Strategy: Brand DevelopmentZac Bodner
 
Case Studies Galore
Case Studies GaloreCase Studies Galore
Case Studies Galore
Zac Bodner
 
Case Study: New Product Launch
Case Study: New Product LaunchCase Study: New Product Launch
Case Study: New Product LaunchZac Bodner
 
Case Study Time!
Case Study Time!Case Study Time!
Case Study Time!
Zac Bodner
 
Developing a Social Media Content Strategy
Developing a Social Media Content StrategyDeveloping a Social Media Content Strategy
Developing a Social Media Content StrategyZac Bodner
 
The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...
The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...
The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...
Zac Bodner
 
RUH Collective
RUH CollectiveRUH Collective
RUH Collective
Zac Bodner
 
Stan Richards School of Advertising and Public Relations
Stan Richards School of Advertising and Public RelationsStan Richards School of Advertising and Public Relations
Stan Richards School of Advertising and Public Relations
Zac Bodner
 
YETI
YETIYETI

More from Zac Bodner (16)

Segmenting Audiences and Establishing Content Pillars
Segmenting Audiences and Establishing Content PillarsSegmenting Audiences and Establishing Content Pillars
Segmenting Audiences and Establishing Content Pillars
 
+
++
+
 
:)
:):)
:)
 
SPSS
SPSS SPSS
SPSS
 
Data Science Time!
Data Science Time!Data Science Time!
Data Science Time!
 
Data Science Time!
Data Science Time!Data Science Time!
Data Science Time!
 
Case Studies 101
Case Studies 101Case Studies 101
Case Studies 101
 
Case Methodolgy in Marketing Strategy: Brand Development
Case Methodolgy in Marketing Strategy: Brand DevelopmentCase Methodolgy in Marketing Strategy: Brand Development
Case Methodolgy in Marketing Strategy: Brand Development
 
Case Studies Galore
Case Studies GaloreCase Studies Galore
Case Studies Galore
 
Case Study: New Product Launch
Case Study: New Product LaunchCase Study: New Product Launch
Case Study: New Product Launch
 
Case Study Time!
Case Study Time!Case Study Time!
Case Study Time!
 
Developing a Social Media Content Strategy
Developing a Social Media Content StrategyDeveloping a Social Media Content Strategy
Developing a Social Media Content Strategy
 
The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...
The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...
The Role of Innovation and Evidence-Based Decision Making in the 2016 U.S. El...
 
RUH Collective
RUH CollectiveRUH Collective
RUH Collective
 
Stan Richards School of Advertising and Public Relations
Stan Richards School of Advertising and Public RelationsStan Richards School of Advertising and Public Relations
Stan Richards School of Advertising and Public Relations
 
YETI
YETIYETI
YETI
 

Recently uploaded

SMM Cheap - No. 1 SMM panel in the world
SMM Cheap - No. 1 SMM panel in the worldSMM Cheap - No. 1 SMM panel in the world
SMM Cheap - No. 1 SMM panel in the world
smmpanel567
 
34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER
34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER
34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER
DeepakTripathi733493
 
Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...
Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...
Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...
Auxis Consulting & Outsourcing
 
How to Run Landing Page Tests On and Off Paid Social Platforms
How to Run Landing Page Tests On and Off Paid Social PlatformsHow to Run Landing Page Tests On and Off Paid Social Platforms
How to Run Landing Page Tests On and Off Paid Social Platforms
VWO
 
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny LeibrandtThe New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf
20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf
20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf
levuag
 
Digital Money Maker Club – von Gunnar Kessler digital.
Digital Money Maker Club – von Gunnar Kessler digital.Digital Money Maker Club – von Gunnar Kessler digital.
Digital Money Maker Club – von Gunnar Kessler digital.
focsh890
 
Digital Marketing Training In Bangalore
Digital Marketing Training In BangaloreDigital Marketing Training In Bangalore
Digital Marketing Training In Bangalore
syedasifsyed46
 
Your Path to Profits - The Game-Changing Power of a Marketing - Daniel Bussius
Your Path to Profits - The Game-Changing Power of a Marketing - Daniel BussiusYour Path to Profits - The Game-Changing Power of a Marketing - Daniel Bussius
Your Path to Profits - The Game-Changing Power of a Marketing - Daniel Bussius
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
15 ideas and frameworks on the art of storytelling
15 ideas and frameworks on the art of storytelling15 ideas and frameworks on the art of storytelling
15 ideas and frameworks on the art of storytelling
Aatir Abdul Rauf
 
Digital Marketing Trends - Experts Insights on How
Digital Marketing Trends - Experts Insights on HowDigital Marketing Trends - Experts Insights on How
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny LeibrandtThe New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
BLOOM_May2024 (r). Balmer Lawrie Online Monthly Bulletin
BLOOM_May2024 (r). Balmer Lawrie Online Monthly BulletinBLOOM_May2024 (r). Balmer Lawrie Online Monthly Bulletin
BLOOM_May2024 (r). Balmer Lawrie Online Monthly Bulletin
BalmerLawrie
 
Search Engine Marketing - Competitor and Keyword research
Search Engine Marketing  - Competitor and Keyword researchSearch Engine Marketing  - Competitor and Keyword research
Search Engine Marketing - Competitor and Keyword research
ETMARK ACADEMY
 
The What, Why & How of 3D and AR in Digital Commerce
The What, Why & How of 3D and AR in Digital CommerceThe What, Why & How of 3D and AR in Digital Commerce
The What, Why & How of 3D and AR in Digital Commerce
PushON Ltd
 
Generative AI - Unleash Creative Opportunity - Peter Weltman
Generative AI - Unleash Creative Opportunity - Peter WeltmanGenerative AI - Unleash Creative Opportunity - Peter Weltman
Generative AI - Unleash Creative Opportunity - Peter Weltman
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
SEO Master Class - Steve Wiideman,  Wiideman Consulting GroupSEO Master Class - Steve Wiideman,  Wiideman Consulting Group
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User Journeys
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User JourneysMastering Multi-Touchpoint Content Strategy: Navigate Fragmented User Journeys
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User Journeys
Search Engine Journal
 
My Personal Brand Exploration by Mariano
My Personal Brand Exploration by MarianoMy Personal Brand Exploration by Mariano
My Personal Brand Exploration by Mariano
marianooscos
 
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...
Valters Lauzums
 

Recently uploaded (20)

SMM Cheap - No. 1 SMM panel in the world
SMM Cheap - No. 1 SMM panel in the worldSMM Cheap - No. 1 SMM panel in the world
SMM Cheap - No. 1 SMM panel in the world
 
34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER
34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER
34-Rahul-Mande.pdf PROJECT REPORT MBA 4TH SEMESTER
 
Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...
Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...
Traditional Store Audits are Outdated: A New Approach to Protecting Your Bran...
 
How to Run Landing Page Tests On and Off Paid Social Platforms
How to Run Landing Page Tests On and Off Paid Social PlatformsHow to Run Landing Page Tests On and Off Paid Social Platforms
How to Run Landing Page Tests On and Off Paid Social Platforms
 
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny LeibrandtThe New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
 
20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf
20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf
20221005110010_633d63baa84f6_learn___week_3_ch._5.pdf
 
Digital Money Maker Club – von Gunnar Kessler digital.
Digital Money Maker Club – von Gunnar Kessler digital.Digital Money Maker Club – von Gunnar Kessler digital.
Digital Money Maker Club – von Gunnar Kessler digital.
 
Digital Marketing Training In Bangalore
Digital Marketing Training In BangaloreDigital Marketing Training In Bangalore
Digital Marketing Training In Bangalore
 
Your Path to Profits - The Game-Changing Power of a Marketing - Daniel Bussius
Your Path to Profits - The Game-Changing Power of a Marketing - Daniel BussiusYour Path to Profits - The Game-Changing Power of a Marketing - Daniel Bussius
Your Path to Profits - The Game-Changing Power of a Marketing - Daniel Bussius
 
15 ideas and frameworks on the art of storytelling
15 ideas and frameworks on the art of storytelling15 ideas and frameworks on the art of storytelling
15 ideas and frameworks on the art of storytelling
 
Digital Marketing Trends - Experts Insights on How
Digital Marketing Trends - Experts Insights on HowDigital Marketing Trends - Experts Insights on How
Digital Marketing Trends - Experts Insights on How
 
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny LeibrandtThe New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
The New Era Of SEO - How AI Has Changed SEO Forever - Danny Leibrandt
 
BLOOM_May2024 (r). Balmer Lawrie Online Monthly Bulletin
BLOOM_May2024 (r). Balmer Lawrie Online Monthly BulletinBLOOM_May2024 (r). Balmer Lawrie Online Monthly Bulletin
BLOOM_May2024 (r). Balmer Lawrie Online Monthly Bulletin
 
Search Engine Marketing - Competitor and Keyword research
Search Engine Marketing  - Competitor and Keyword researchSearch Engine Marketing  - Competitor and Keyword research
Search Engine Marketing - Competitor and Keyword research
 
The What, Why & How of 3D and AR in Digital Commerce
The What, Why & How of 3D and AR in Digital CommerceThe What, Why & How of 3D and AR in Digital Commerce
The What, Why & How of 3D and AR in Digital Commerce
 
Generative AI - Unleash Creative Opportunity - Peter Weltman
Generative AI - Unleash Creative Opportunity - Peter WeltmanGenerative AI - Unleash Creative Opportunity - Peter Weltman
Generative AI - Unleash Creative Opportunity - Peter Weltman
 
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
SEO Master Class - Steve Wiideman,  Wiideman Consulting GroupSEO Master Class - Steve Wiideman,  Wiideman Consulting Group
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
 
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User Journeys
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User JourneysMastering Multi-Touchpoint Content Strategy: Navigate Fragmented User Journeys
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User Journeys
 
My Personal Brand Exploration by Mariano
My Personal Brand Exploration by MarianoMy Personal Brand Exploration by Mariano
My Personal Brand Exploration by Mariano
 
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...
 

Building a Regression Model using SPSS

  • 1. Zac Bodner Final Lab Assignment

  • 2. In class this semester, we have already explored regression for explanatory purposes. For example, we previously built a regression model to explain what effects certain independent variables (trustworthiness, intelligence, “like me-ness”) have on certain dependent variables (I have a good opinion of) in political candidates. These models were helpful in determining how different candidates could employ certain tactics to gain favor (and hopefully votes) from voters. In this regard, regression models are effective tools in explaining the reasons behind certain phenomena - like why certain candidates do well with certain populations, or what factors make us love ice cream - but their real power comes in how capable these models are of predicting certain outcomes. For example, if we have an explanatory regression model that identifies certain characteristics (variables) shared by both customers and non-customers in a particular data set - could we then turn around and use that model to identify additional, probable customers (and non-customers) in another, independent data set (and make tons of cash for our employers in the process)? We can and we will! The following paper presents a step by step explanation of how to do this, based on our final lab assignment of the semester. The first thing we must do is divide the current data set (Customer, we’ve been using it all semester) in half. This will allow us to confirm our model’s findings in one half of the data set on the other. In other words, this will help us prove that our model will not only work on the data set that we are testing, but in other, unrelated (random) data sets, as well. The key to dividing a data set into equal, random halves, is to confirm that both sides are distributed evenly on a number of variables. We have already divided the data set using SPSS in a previous assignment, and confirmed the randomness and equality of both halves by examining the distributions of some of these variables in a CROSSTABS setting. If the differences in the distributions of these variables are very small, (fractions of a percent) then we are good to go. From the previous assignment, we can confirm that our data set is split into two randomized and equal halves. From here, we must calculate some of the interactions between variables that we found in another previous assignment - the CHAID segmentation. CHAID stands for Chi Square Automatic Interaction Detector. It produces a tree that shows which variables contain the largest segments of customers, and continues on by further dividing each segment. For example - the tree starts by segmenting customers from non-customers. Then, it segments the customers further by say, the market value of their home. Then it continues, by separating this market value segment into each gender. By doing this, we can examine the percentage of customers in each segment, and compare their concentration to the rest of the data set for targeting purposes. By identifying these interactions, we can make new variables to add to our model. This is what we will do here. We do this because, even though CHAID is a great tool for segmenting the data set, we are more interested in seeing the total, combined interactions. For this purpose, a regression will always be the better option. To turn these segmentation interactions into variables, we simply multiply some of our segments that demonstrated high concentrations of customers. Here is a screenshot of a few that I used:
  • 3. These interactions are computed, and added to the list of our data set’s independent variables. From here, we can add them and all of our other independent variables to a step-wise regression. First, we must make sure to select which half of the data-set we want to test. We will test Half 1 - by selecting it in SPSS. A step-wise regression is valuable for differentiating significant predictor (independent) variables from insignificant ones. All you do is throw the kitchen sink (all of your variables) in, and SPSS will find the ones with the strongest beta coefficients (relationships to our outcome variable) and order them from highest to lowest in the model. For purposes of orderliness and ease of use, we always want our models to be parsimonious - meaning they have as few variables as possible while still making good predictions. This being the case, I chose the first eleven variables the step-wise regression returned. You will know when to stop adding variables by how much the total R-Square value increases. The R- Square (and adjusted R-square - for multiple variables) is a measure of how much of the variance in the outcome variable the independent variables in the model explain. If you have ten variables that have an adjusted R-square of .159, or fifteen variables with a .164 - it’s best to just use the ten variables, because each additional variable isn’t explaining much variance at this point. We’re cooking with gas now! We have eleven solid, significant variables that we can now throw into a regular regression, signified by switching the “Stepwise” option to the “Enter” option in SPSS. It is important to note that for this assignment, we need to check the option to “replace missing values with the mean” in SPSS. This means that if any of the single members of our data population have missing values, we will replace those missing values with the mean for that data point, instead of tossing the member altogether. This way, we do not detract or add anything from the model, but we don’t have to waste data. Luckily for us, of the eleven variables that made it into our model - there were no missing data. Now, we will input the variables into the “enter” regression, and save the output as a variable. We will call the variable PREDICT, because we will use it later to predict, based on our observations of customers and non-customers in this data-set, the likelihood of finding additional customers among separate data-sets.
  • 4. We will now divide this output into deciles, since we are concerned primarily with our model’s capability of finding prospects based on how much they resemble the observed customers of this data-set. SPSS does this fairly easily by going to Transform and selecting Rank Cases. We then save that output as DECILES, and using CROSSTABS, we can compare our observed customers with our ranked predictions. Here is a screenshot of this: This is great; our prediction works. We would hope that the highest deciles (1) have the highest concentration of customers, and vice-versa with the non-customers coming from the lowest deciles (10). As we can see, it does. In the first decile, we have an 8.9% higher concentration of customers than in the rest of the data-set. We can take those odds to Vegas! Now, we have to test this model on the other half, Half 2. To do this, we have to calculate a score for our output that we can apply to the second half, but before we do this - we must confirm the score we calculate correctly interprets the regression output that we have.
  • 5. This is pretty easy, we go into Transform > Compute and enter in an equation based on our variables in the model: Our final score then looks like this, and we save it as the another variable in the set, SCORE: We then compare this variable SCORE to PREDICT, and luckily for us - they are almost identical. This means our score calculation is a correct interpretation of our regression model, and can now be applied to an independent sample (data-set). To get to our simulated, independent data set - we now select Half 2. With Half 2 active, we run the code we just made, and then transform the output again into deciles. We will save these deciles as DECILES2, and run the same CROSSTABS function as we did before. If we have done our job correctly, this Crosstabs will look almost identical, and hopefully a little better than the first one. Drumroll!!!!
  • 6. Hallelujah! This crosstabs has a slightly higher percentage of customers in the first decile, but is practically the same. Check out the comparison on the next page. Our predictive model works!
  • 7. This is great news. We built a regression model and ran i on one data-set. If the model worked, we would expect similar results by running the same model on a different data-set. We got those similar results. With these techniques and a handy tool like SPSS, we can take a data set, build a regression model to find characteristics shared by customers and non-customers - then validate this model on an independent data set. By doing this, we can significantly increase our chances of finding new customers anywhere, which is obviously an incredibly valuable skill to have. But remember, anyone can input numbers into SPSS. The real difference between a good and great market researcher is being able to interpret those numbers, by asking good questions and employing impeccable language!