- The document describes building a regression model on one half of a dataset to predict customers and non-customers based on variables.
- It then tests the model on the other half of the dataset by scoring it and comparing the results, which show a similar distribution of predicted customers and non-customers, validating the model.
- This demonstrates that the regression model can accurately predict outcomes on new, independent datasets, allowing identification of potential new customers.
SPSS for beginners, a short course about how novices can use SPSS to analyze their research findings. With this tutorial anyone becomes able to use SPSS for basic statistical analysis. No need to be a professional to use SPSS.
SPSS is powerful to analyze nurses data. This paper intends to support hospital leaders the benefits of data analyzing with applied SPSS. This paper intends to support the hospital managers and its office managers to know whether hourly salary depends upon nurse experiences and nurse types such as hospital nurse and office nurse. Moreover it analyzes the interesting deviation condition of hospital nurses and office nurses salaries. As SPSS's background algorithms, it showed the means algorithm for tables and graph. And then Sample data hourly wage data.sav' was downloaded from Google and was analyzed and viewed. It used IBM SPSS statistics version 23 and PYTHON version 3.7. Aung Cho | Aung Si Thu "Nurses Data Analysis by Applied SPSS" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd25329.pdfPaper URL: https://www.ijtsrd.com/computer-science/data-miining/25329/nurses-data-analysis-by-applied-spss/aung-cho
Outlier Management, BASIC STATISTICS, Error, Accuracy, How to find Outliers, quartile, Data Management, Reporting and Evaluation, Communication & Corrective Action, Documentation,
Introduction to SEM (Structural Equation Models) - invited talk at the seminar "Analyzing and Interpreting Data" organized by the Finnish Doctoral Programme in Education and Learning (15 May 2013) in Vuosaari, Helsinki, Finland. Acknowledgements to Barbara Byrne for an excellent intro book of SEM.
SPSS for beginners, a short course about how novices can use SPSS to analyze their research findings. With this tutorial anyone becomes able to use SPSS for basic statistical analysis. No need to be a professional to use SPSS.
SPSS is powerful to analyze nurses data. This paper intends to support hospital leaders the benefits of data analyzing with applied SPSS. This paper intends to support the hospital managers and its office managers to know whether hourly salary depends upon nurse experiences and nurse types such as hospital nurse and office nurse. Moreover it analyzes the interesting deviation condition of hospital nurses and office nurses salaries. As SPSS's background algorithms, it showed the means algorithm for tables and graph. And then Sample data hourly wage data.sav' was downloaded from Google and was analyzed and viewed. It used IBM SPSS statistics version 23 and PYTHON version 3.7. Aung Cho | Aung Si Thu "Nurses Data Analysis by Applied SPSS" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd25329.pdfPaper URL: https://www.ijtsrd.com/computer-science/data-miining/25329/nurses-data-analysis-by-applied-spss/aung-cho
Outlier Management, BASIC STATISTICS, Error, Accuracy, How to find Outliers, quartile, Data Management, Reporting and Evaluation, Communication & Corrective Action, Documentation,
Introduction to SEM (Structural Equation Models) - invited talk at the seminar "Analyzing and Interpreting Data" organized by the Finnish Doctoral Programme in Education and Learning (15 May 2013) in Vuosaari, Helsinki, Finland. Acknowledgements to Barbara Byrne for an excellent intro book of SEM.
Data Science - Part XV - MARS, Logistic Regression, & Survival AnalysisDerek Kane
This lecture provides an overview on extending the regression concepts brought forth in previous lectures. We will start off by going through a broad overview of the Multivariate Adaptive Regression Splines Algorithm, Logistic Regression, and then explore the Survival Analysis. The presentation will culminate with a real world example on how these techniques can be used in the US criminal justice system.
Estimating Models Using Dummy VariablesYou have had plenty of op.docxSANSKAR20
Estimating Models Using Dummy Variables
You have had plenty of opportunity to interpret coefficients for metric variables in regression models. Using and interpreting categorical variables takes just a little bit of extra practice. In this Discussion, you will have the opportunity to practice how to recode categorical variables so they can be used in a regression model and how to properly interpret the coefficients. Additionally, you will gain some practice in running diagnostics and identifying any potential problems with the model.
To prepare for this Discussion:
Review Warner’s Chapter 12 and Chapter 2 of the Wagner course text and the media program found in this week’s Learning Resources and consider the use of dummy variables.
Create a research question using the General Social Survey dataset that can be answered by multiple regression. Using the SPSS software, choose a categorical variable to dummy code as one of your predictor variables.
Estimate a multiple regression model that answers your research question. Post your response to the following:
What is your research question?
Interpret the coefficients for the model, specifically commenting on the dummy variable.
Run diagnostics for the regression model. Does the model meet all of the assumptions? Be sure and comment on what assumptions were not met and the possible implications. Is there any possible remedy for one the assumption violations?
Be sure to support your Main Post and Response Post with reference to the week’s Learning Resources and other scholarly evidence in APA Style.
Regression Diagnostics and Model Evaluation
Regression Diagnostics and Model Evaluation
Program Transcript
[MUSIC PLAYING]
MATT JONES: We've gone over estimating bivariate and multiple regression
models, but one thing we haven't talked about up to this point are some of the
assumptions of multiple regression models. It's very important to adhere to these
assumptions to have proper interpretation of our models. These assumptions
include linearity, independence of error, homoscedasticity, multicollinearity,
undue influence, and normal distribution of errors. Let's go back to SPSS to see
how we can test these assumptions and evaluate our models.
Let's go ahead and estimate a multiple regression model using respondent's
socioeconomic status index is the dependent variable, respondent's highest
education as an independent variable, and occupational prestige score as an
independent variable. But this time, let's request some additional information to
perform some diagnostics around our model.
Go to analyze, regression, and linear, since we are still using an ordinary least
squares method. We'll scroll down and enter my dependent variable first,
respondent socioeconomic index. My independent variables of occupational
prestige and highest year of school completed. I want to go over to statistics and
request some additional information. I will request collinearity ...
SPSS is widely used program for statistical analysis in social sciences, particularly in education and research. However, because of its potential, it is also widely used by market researchers, health-care researchers, survey organizations, governments and, most notably, data miners and big data professionals.
Class,
Below is the Format you should follow when typing up your paper on the current production– Please
include: “Underlined Title Headings” for each of the areas below and the six Aristotle Elements covered-
In addition, hand written notes, taken at the production should be in your Composition book along with
two “Plot Diagrams” that documents the plot structure of each act of the play.
Production Paper Structure/Format
Student’s Name:
Student’s Roster #:
Performance Date:
Show Title:
What is the Genre/Form?
(Tragedy/Comedy/Melodrama/Tragicomedy)
Justify your answer.
What Styles and Theatrical Conventions are utilized in the Production?
Opening/Lead Statement: Your opening paragraph should make a clear statement or point about the
production. A good opening statement will foreshadow the main points you intend to make in the body
of the paper.
Thought/Themes/Ideas: Write a paragraph or more, outlining, any thoughts, themes, and ideas
presented in the play. What is the playwright trying to say with this production?
Plot/Action: As briefly as possible, pinpoint the dramatic structure. How does the playwright organize
the story? Is there a prologue? Soliloquies (solo monologues) that move the action along? How is the
exposition handled? Is there a chorus and how are they utilized?
Language/Diction: What type of Language is used? Does the playwright use simple, straightforward,
everyday language? Does the playwright use more complex language? Please include a direct “quote”
from the play to demonstrate your statements regarding the language used.
Characters: Select and write a paragraph about any pivotal character in the play. You can choose any
character you wish. How does the character impact the story?
Music: Write a paragraph about the music and sounds presented in the production. How do you think the
music and/or sounds create mood and/or impact the overall feeling of the production?
Spectacle: Write about the impact and overall cohesiveness of the costumes, lighting and set design.
Does the spectacle enhance the production?
Closing: End your paper with clear statement about the overall impact of the elements utilized within the
production.
Correlation and Bivariate Regression
Correlation and Bivariate Regression
Program Transcript
MATT JONES: This week, we're performing a Pearson Correlation Test. To do
this, we can go to SPSS to perform this rather simple procedure. Like many of
our tests, go ahead and activate the Analyze button to get the drop down menu.
Because we're performing a correlation, we can move down to Correlate and
across to Bivariate. The Pearson Correlation Test is a bivariate test.
If you click on that, you'll see a box come up, Bivariate Correlations. Let's go
ahead and perform a bivariate correlation for respondent's socioeconomic status
index and the respondent's highest level of ...
Explore how data science can be used to predict employee churn using this data science project presentation, allowing organizations to proactively address retention issues. This student presentation from Boston Institute of Analytics showcases the methodology, insights, and implications of predicting employee turnover. visit https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/ for more data science insights
Building solid marketing strategies in today’s competitive market is impossible without sound market research. The right market information can boost your sales, position your product more effectively, and help you speak more effectively to your audience.
Reinforce and focus your marketing research skills. This highly interactive program, facilitated by an experienced marketing research professional, can provide you with the knowledge and tools you need to develop and manage research projects to meet your specific goals. Furthermore, the workshop debunks the myth that you have to spend a lot of money to gain valuable information for decision making. No prior marketing research experience is required!
Data Preparation with the help of Analytics MethodologyRupak Roy
Get involved with the steps of data preparation and data assessment using widely used methodologies for machine learning data science modeling.
Let me know if anything is required, ping me at google #bobrupakroy
Introduction to Data Analytics starting with
OLS.
This is the first of a series of essays. I will share essays on unsupervised learning, dimensionality reduction and anomaly/outlier detection.
Similar to Building a Regression Model using SPSS (20)
How does a clothing brand start a movement? What is style anyway? How does our image of self get altered when we're constantly seeking validation from our online and offline networks? What positive role can a clothing brand play in all of this?
Stan Richards School of Advertising and Public RelationsZac Bodner
What is the difference between a successful organization and a successful brand?
The Stan Richards School of Advertising and Public Relations is one of the great Ad Schools in the world. But like any brand, there is always room for improvement. The assignment: perform a case study on the SRS. After which, develop a media strategy to strengthen the SRS brand and attract prospective students.
This presentation is a summary of our findings and recommendations.
Stan Richards School of Advertising and Public Relations, UT Austin - Fall 2015.
YETI is one of the great and up and coming brands of the world. They have experienced nearly unprecedented growth and success since their beginnings in 2005. Now, they face the challenge of maintaining this momentum.
The assignment is simple: Find a new, profitable customer segment for YETI to attract. Then, make strategic recommendations (within the framework of the customer journey) by which to reach them, all while remaining true to the brand.
The following is a summary of our recommendations. McCombs School of Business, UT Austin - Fall 2016.
SMM Cheap - No. 1 SMM panel in the worldsmmpanel567
Boost your social media marketing with our SMM Panel services offering SMM Cheap services! Get cost-effective services for your business and increase followers, likes, and engagement across all social media platforms. Get affordable services perfect for businesses and influencers looking to increase their social proof. See how cheap SMM strategies can help improve your social media presence and be a pro at the social media game.
How to Run Landing Page Tests On and Off Paid Social PlatformsVWO
Join us for an exclusive webinar featuring Mariate, Alexandra and Nima where we will unveil a comprehensive blueprint for crafting a successful paid media strategy focused on landing page testing.With escalating costs in paid advertising, understanding how to maximize each visitor’s experience is crucial for retention and conversion.
This session will dive into the methodologies for executing and analyzing landing page tests within paid social channels, offering a blend of theoretical knowledge and practical insights.
The Pearmill team will guide you through the nuances of setting up and managing landing page experiments on paid social platforms. You will learn about the critical rules to follow, the structure of effective tests, optimal conversion duration and budget allocation.
The session will also cover data analysis techniques and criteria for graduating landing pages.
In the second part of the webinar, Pearmill will explore the use of A/B testing platforms. Discover common pitfalls to avoid in A/B testing and gain insights into analyzing A/B tests results effectively.
In this presentation, Danny Leibrandt explains the impact of AI on SEO and what Google has been doing about it. Learn how to take your SEO game to the next level and win over Google with his new strategy anyone can use. Get actionable steps to rank your name, your business, and your clients on Google - the right way.
Key Takeaways:
1. Real content is king
2. Find ways to show EEAT
3. Repurpose across all platforms
Digital Money Maker Club – von Gunnar Kessler digital.focsh890
Title One is a comprehensive examination of the impact of digital technologies on
modern society. In a world where technology continues to advance rapidly, this article delves into the nuances and complexities of the digital age, exploring Its implications across various sectors and aspects of life.
Digital marketing is the art and science of promoting products or services using digital channels to reach and engage with potential customers. It encompasses a wide range of online tactics and strategies aimed at increasing brand visibility, driving website traffic, generating leads, and ultimately, converting those leads into customers.
https://nidmindia.com/
Most small businesses struggle to see marketing results. In this session, we will eliminate any confusion about what to do next, solving your marketing problems so your business can thrive. You’ll learn how to create a foundational marketing OS (operating system) based on neuroscience and backed by real-world results. You’ll be taught how to develop deep customer connections, and how to have your CRM dynamically segment and sell at any stage in the customer’s journey. By the end of the session, you’ll remove confusion and chaos and replace it with clarity and confidence for long-term marketing success.
Key Takeaways:
• Uncover the power of a foundational marketing system that dynamically communicates with prospects and customers on autopilot.
• Harness neuroscience and Tribal Alignment to transform your communication strategies, turning potential clients into fans and those fans into loyal customers.
• Discover the art of automated segmentation, pinpointing your most lucrative customers and identifying the optimal moments for successful conversions.
• Streamline your business with a content production plan that eliminates guesswork, wasted time, and money.
The digital marketing industry is changing faster than ever and those who don’t adapt with the times are losing market share. Where should marketers be focusing their efforts? What strategies are the experts seeing get the best results? Get up-to-speed with the latest industry insights, trends and predictions for the future in this panel discussion with some leading digital marketing experts.
In this presentation, Danny Leibrandt explains the impact of AI on SEO and what Google has been doing about it. Learn how to take your SEO game to the next level and win over Google with his new strategy anyone can use. Get actionable steps to rank your name, your business, and your clients on Google - the right way.
Key Takeaways:
1. Real content is king
2. Find ways to show EEAT
3. Repurpose across all platforms
Search Engine Marketing - Competitor and Keyword researchETMARK ACADEMY
Over 2 Trillion searches are made per day in Google search, which means there are more than 2 Trillion visits happening across the websites of the world wide web.
People search various questions, phrases or words. But some words and phrases are searched
more often than others.
For example, the words, ‘running shoes’ are searched more often than ‘best road running
shoes for men’
These words or phrases which people use to search on Google are called Keywords.
Some keywords are searched more often than others. Number of times a keyword is searched
for in a month is called keyword volume.
Some keywords have more relevant results than others. For the phrase “running shoes” we
get more than 80M relevant results, whereas for “best road running shoes for men” we get
only 8.
The former keyword ‘running shoes’ has way more competition from popular websites to
new and small blogs, whereas the latter keyword doesn’t have that much competition. This
search competition for a keyword is called search difficulty of a keyword or keyword
difficulty.
In other words, if the keyword difficulty is ‘low’ or ‘easy’, there won’t be any competition
and if you target such keywords on your site, you can easily rank on the front page of Google.
Some keywords are searched for, just to know or to learn some information about something,
that’s their search intention. For example, “What shoe size should I choose?” or “How to pick
the right shoe size?”
These keywords which are searched just to know about stuff are called informational
keywords. Typically people who are searching this type of keywords are top of a Conversion
funnel.
Conversion funnel is the journey that search visitors go through on their way to an email
subscription or a premium subscription to the services you offer or a purchase of products
you sell or recommend using your referral link.
For some buyers, research is the most important part when they have to buy a product.
Depending on that, their journey either widens or narrows down. These types of buyers are
Researchers and they spend more time with informational keywords.
Conversion is the action you want from your search visitors. Number of conversions that you
get for every 100 search visitors is called Conversion rate.
People who are at different stages of a conversion funnel use different types of keywords.
The What, Why & How of 3D and AR in Digital CommercePushON Ltd
Vladimir Mulhem has over 20 years of experience in commercialising cutting edge creative technology across construction, marketing and retail.
Previously the founder and Tech and Innovation Director of Creative Content Works working with the likes of Next, John Lewis and JD Sport, he now helps retailers, brands and agencies solve challenges of applying the emerging technologies 3D, AR, VR and Gen AI to real-world problems.
In this webinar, Vladimir will be covering the following topics:
Applications of 3D and AR in Digital Commerce,
Benefits of 3D and AR,
Tools to create, manage and publish 3D and AR in Digital Commerce.
Come learn how YOU can Animate and Illuminate the World with Generative AI's Explosive Power. Come sit in the driver's seat and learn to harness this great technology.
The session includes a brief history of the evolution of search before diving into the roles technology, content, and links play in developing a powerful SEO strategy in a world of Generative AI and social search. Discover how to optimize for TikTok searches, Google's Gemini, and Search Generative Experience while developing a powerful arsenal of tools and templates to help maximize the effectiveness of your SEO initiatives.
Key Takeaways:
Understand how search engines work
Be able to find out where your users search
Know what is required for each discipline of SEO
Feel confident creating an SEO Plan
Confidently measure SEO performance
Mastering Multi-Touchpoint Content Strategy: Navigate Fragmented User JourneysSearch Engine Journal
Digital platforms are constantly multiplying, and with that, user engagement is becoming more intricate and fragmented.
So how do you effectively navigate distributing and tailoring your content across these various touchpoints?
Watch this webinar as we dive into the evolving landscape of content strategy tailored for today's fragmented user journeys. Understanding how to deliver your content to your users is more crucial than ever, and we’ll provide actionable tips for navigating these intricate challenges.
You’ll learn:
- How today’s users engage with content across various channels and devices.
- The latest methodologies for identifying and addressing content gaps to keep your content strategy proactive and relevant.
- What digital shelf space is and how your content strategy needs to pivot.
With Wayne Cichanski, we’ll explore innovative strategies to map out and meet the diverse needs of your audience, ensuring every piece of content resonates and connects, regardless of where or how it is consumed.
Digital Commerce Lecture for Advanced Digital & Social Media Strategy at UCLA...Valters Lauzums
E-commerce in 2024 is characterized by a dynamic blend of opportunities and significant challenges. Supply chain disruptions and inventory shortages are critical issues, leading to increased shipping delays and rising costs, which impact timely delivery and squeeze profit margins. Efficient logistics management is essential, yet it is often hampered by these external factors. Payment processing, while needing to ensure security and user convenience, grapples with preventing fraud and integrating diverse payment methods, adding another layer of complexity. Furthermore, fulfillment operations require a streamlined approach to handle volume spikes and maintain accuracy in order picking, packing, and shipping, all while meeting customers' heightened expectations for faster delivery times.
Amid these operational challenges, customer data has emerged as an important strategy. By focusing on personalization and enhancing customer experience from historical behavior, businesses can deliver improved website and brand experienced, better product recommendations, optimal promotions, and content to meet individual preferences. Better data analytics can also help in effectively creating marketing campaigns, improving customer retention, and driving product development and inventory management.
Innovative formats such as social commerce and live shopping are beginning to impact the digital commerce landscape, offering new ways to engage with customers and drive sales, and may provide opportunity for brands that have been priced out or seen a downturn with post-pandemic shopping behavior. Social commerce integrates shopping experiences directly into social media platforms, tapping into the massive user bases of these networks to increase reach and engagement. Live shopping, on the other hand, combines entertainment and real-time interaction, providing a dynamic platform for showcasing products and encouraging immediate purchases. These innovations not only enhance customer engagement but also provide valuable data for businesses to refine their strategies and deliver superior shopping experiences.
The e-commerce sector is evolving rapidly, and businesses that effectively manage operational challenges and implement innovative strategies are best positioned for long-term success.
2. In class this semester, we have already explored regression for explanatory purposes. For
example, we previously built a regression model to explain what effects certain independent
variables (trustworthiness, intelligence, “like me-ness”) have on certain dependent variables (I
have a good opinion of) in political candidates. These models were helpful in determining how
different candidates could employ certain tactics to gain favor (and hopefully votes) from voters.
In this regard, regression models are effective tools in explaining the reasons behind certain
phenomena - like why certain candidates do well with certain populations, or what factors make
us love ice cream - but their real power comes in how capable these models are of predicting
certain outcomes. For example, if we have an explanatory regression model that identifies
certain characteristics (variables) shared by both customers and non-customers in a particular
data set - could we then turn around and use that model to identify additional, probable
customers (and non-customers) in another, independent data set (and make tons of cash for our
employers in the process)?
We can and we will!
The following paper presents a step by step explanation of how to do this, based on our final lab
assignment of the semester.
The first thing we must do is divide the current data set (Customer, we’ve been using it all
semester) in half. This will allow us to confirm our model’s findings in one half of the data set on
the other. In other words, this will help us prove that our model will not only work on the data set
that we are testing, but in other, unrelated (random) data sets, as well.
The key to dividing a data set into equal, random halves, is to confirm that both sides are
distributed evenly on a number of variables. We have already divided the data set using SPSS
in a previous assignment, and confirmed the randomness and equality of both halves by
examining the distributions of some of these variables in a CROSSTABS setting. If the
differences in the distributions of these variables are very small, (fractions of a percent) then we
are good to go. From the previous assignment, we can confirm that our data set is split into two
randomized and equal halves.
From here, we must calculate some of the interactions between variables that we found in
another previous assignment - the CHAID segmentation. CHAID stands for Chi Square
Automatic Interaction Detector. It produces a tree that shows which variables contain the largest
segments of customers, and continues on by further dividing each segment. For example - the
tree starts by segmenting customers from non-customers. Then, it segments the customers
further by say, the market value of their home. Then it continues, by separating this market
value segment into each gender. By doing this, we can examine the percentage of customers in
each segment, and compare their concentration to the rest of the data set for targeting
purposes.
By identifying these interactions, we can make new variables to add to our model. This is what
we will do here. We do this because, even though CHAID is a great tool for segmenting the data
set, we are more interested in seeing the total, combined interactions. For this purpose, a
regression will always be the better option. To turn these segmentation interactions into
variables, we simply multiply some of our segments that demonstrated high concentrations of
customers. Here is a screenshot of a few that I used:
3. These interactions are computed, and added to the list of our data set’s independent variables.
From here, we can add them and all of our other independent variables to a step-wise
regression. First, we must make sure to select which half of the data-set we want to test. We will
test Half 1 - by selecting it in SPSS.
A step-wise regression is valuable for differentiating significant predictor (independent) variables
from insignificant ones. All you do is throw the kitchen sink (all of your variables) in, and SPSS
will find the ones with the strongest beta coefficients (relationships to our outcome variable) and
order them from highest to lowest in the model. For purposes of orderliness and ease of use, we
always want our models to be parsimonious - meaning they have as few variables as possible
while still making good predictions.
This being the case, I chose the first eleven variables the step-wise regression returned. You will
know when to stop adding variables by how much the total R-Square value increases. The R-
Square (and adjusted R-square - for multiple variables) is a measure of how much of the
variance in the outcome variable the independent variables in the model explain. If you have ten
variables that have an adjusted R-square of .159, or fifteen variables with a .164 - it’s best to
just use the ten variables, because each additional variable isn’t explaining much variance at
this point.
We’re cooking with gas now!
We have eleven solid, significant variables that we can now throw into a regular regression,
signified by switching the “Stepwise” option to the “Enter” option in SPSS. It is important to note
that for this assignment, we need to check the option to “replace missing values with the mean”
in SPSS.
This means that if any of the single members of our data population have missing values, we
will replace those missing values with the mean for that data point, instead of tossing the
member altogether. This way, we do not detract or add anything from the model, but we don’t
have to waste data. Luckily for us, of the eleven variables that made it into our model - there
were no missing data.
Now, we will input the variables into the “enter” regression, and save the output as a variable.
We will call the variable PREDICT, because we will use it later to predict, based on our
observations of customers and non-customers in this data-set, the likelihood of finding
additional customers among separate data-sets.
4. We will now divide this output into deciles, since we are concerned primarily with our model’s
capability of finding prospects based on how much they resemble the observed customers of
this data-set. SPSS does this fairly easily by going to Transform and selecting Rank Cases. We
then save that output as DECILES, and using CROSSTABS, we can compare our observed
customers with our ranked predictions. Here is a screenshot of this:
This is great; our prediction works. We would hope that the highest deciles (1) have the highest
concentration of customers, and vice-versa with the non-customers coming from the lowest
deciles (10). As we can see, it does. In the first decile, we have an 8.9% higher concentration of
customers than in the rest of the data-set.
We can take those odds to Vegas!
Now, we have to test this model on the other half, Half 2. To do this, we have to calculate a
score for our output that we can apply to the second half, but before we do this - we must
confirm the score we calculate correctly interprets the regression output that we have.
5. This is pretty easy, we go into Transform > Compute and enter in an equation based on our
variables in the model:
Our final score then looks like this, and we save it as the another variable in the set, SCORE:
We then compare this variable SCORE to PREDICT, and luckily for us - they are almost
identical. This means our score calculation is a correct interpretation of our regression model,
and can now be applied to an independent sample (data-set).
To get to our simulated, independent data set - we now select Half 2. With Half 2 active, we run
the code we just made, and then transform the output again into deciles. We will save these
deciles as DECILES2, and run the same CROSSTABS function as we did before.
If we have done our job correctly, this Crosstabs will look almost identical, and hopefully a little
better than the first one. Drumroll!!!!
6. Hallelujah! This crosstabs has a slightly higher percentage of customers in the first decile, but is
practically the same. Check out the comparison on the next page. Our predictive model works!
7. This is great news. We built a regression model and ran i on one data-set. If the model worked,
we would expect similar results by running the same model on a different data-set. We got those
similar results.
With these techniques and a handy tool like SPSS, we can take a data set, build a regression
model to find characteristics shared by customers and non-customers - then validate this model
on an independent data set. By doing this, we can significantly increase our chances of finding
new customers anywhere, which is obviously an incredibly valuable skill to have.
But remember, anyone can input numbers into SPSS. The real difference between a good and
great market researcher is being able to interpret those numbers, by asking good questions and
employing impeccable language!