SlideShare a Scribd company logo
© Palmatier, Petersen, and Germann 1
Survey Design and Testing to
Derive Customer Insights
Marketing Analytics
Based on First Principles :
Agenda
 Learning Objectives
 Introduction
 Principles of Questionnaire Design
 Types of Questions
 The Art of Asking Questions
 Questionnaire Layout
 Principles of Sampling
 Probability versus Quota Sampling
 Sample Size for Estimating the Population Mean
 Sample Size for Estimating Proportions
 Sample Size Heuristics
 Scales and Factor analysis
 Scale Development Process
 What is Factor analysis?
 Model Underlying Factor Analysis
 Number of Factors to Retain
 Interpretation of Factors
 Summary
 Takeaways
2
 Be able to describe the most common methods used to administer questionnaires.
Explain the advantages and disadvantages of these methods.
 Understand the different types of scales that exist.
 Be able to successfully design a questionnaire.
 Understand the difference between probability and quota sampling, and know when to
use which type of sampling.
 Be able to calculate the sample size needed to estimate the population mean and
proportion.
 Understand what a latent construct is.
 Know what factor analysis is and how it differs from principal component analysis
(PCA) and cluster analysis.
 Understand how factors can be rotated and know the difference between orthogonal
and oblique factor rotation.
3
Learning Objectives
Agenda
 Learning Objectives
 Introduction
 Principles of Questionnaire Design
 Types of Questions
 The Art of Asking Questions
 Questionnaire Layout
 Principles of Sampling
 Probability versus Quota Sampling
 Sample Size for Estimating the Population Mean
 Sample Size for Estimating Proportions
 Sample Size Heuristics
 Scales and Factor analysis
 Scale Development Process
 What is Factor analysis?
 Model Underlying Factor Analysis
 Number of Factors to Retain
 Interpretation of Factors
 Summary
 Takeaways
4
© Palmatier, Petersen, and Germann
Principles of Questionnaire Design
 Conducting marketing analytics requires data. Sometimes, the
data needed is already available to the researcher and does
not need to be collected. In that case, researchers use what is
referred to as secondary data, that is, data that was initially
collected for a different purpose (e.g., data from the Census
Bureau).
 At other times, new data needs to be collected. This type of
data is referred to as primary data. There are various ways to
obtain primary data.
 However, an oft-used method entails collecting data by means
of questionnaires.
5
Principles of Questionnaire Design
 A questionnaire is a formalized set of questions for obtaining
information from respondents to generate the data
necessary to accomplish the objectives of the research
project.
 Based on this definition, we can see that questionnaires
should:
 Include a set of specific questions that the respondents can and will
answer
 Should facilitate data processing. Specifically, it should be easy to
record questionnaire answers, and code and analyze the responses
 Provide the information and data needed to answer the question(s) the
researcher seeks to answer with the questionnaire
6
Principles of Questionnaire Design
Questionnaires can be administered in
various ways. The four most common
methods are:
Personal (i.e., face-to-face) interviews
Telephone interviews
Mail questionnaires
Online questionnaires
Each one of these methods has advantages
and disadvantages.
7
Principles of Questionnaire Design
8
Advantages Disadvantages
Personal Interviews
Flexibility in obtaining data Bias of interviewer
Face-to-face encounter Response bias
Ability to clear doubt in person Time requirements
Can arouse and keep interest Cost per completed interview is high
Can build rapport
Assistance of visual aid
Can clarify misunderstandings
Can probe for more complete answers
Telephone Interviews
Can be performed in a short period of time Calls must be kept as short as possible
Low non-response rate Bias of interviewer
Lower cost than personal interview Difficult to use visual aids
Ability to clear doubt in person Viewed as disrespectful by some
Can clarify misunderstandings Difficulty obtaining accurate sample
Can probe for more complete answers
Principles of Questionnaire Design
9
Advantages Disadvantages
Mail Questionnaires
Can cover broad respondent base High non-response rate (many people do not mail the
questionnaires back)
Lower cost compared to personal and telephone
interviews
Cost-per survey could be high
Often more thought given to responses (compared to
online)
Questions can be misinterpreted
Can be more effective when dealing with sensitive
topics if anonymous
Cannot control for the identity of the respondent
Can be difficult to compile accurate mailing list
Online Questionnaires
Fast turnaround in administering questionnaire High non-response rate
Relatively inexpensive Questions can be misinterpreted
Immediate return upon completion Respondents do not give answers much thought
Flexible design options (e.g., different questions
appear based on previous answers)
Cannot control for the identity of the respondent
Inexpensive online questionnaire solutions exist
Types of Questions
 There are two types of questions that are typically included in questionnaires:
 Open-ended questions. An example of an open-ended question is: “What comes
to your mind when you think of brand XYZ? Please list all your thoughts.” Some
advantages of open-ended questions are that they (i) can provide very rich
information and (ii) are good for exploratory research (for example, when the
researchers are unsure about the questions to ask). Some of the disadvantages
of open-ended questions are that they (i) are sometimes difficult to analyze and
(ii) can be burdensome for the respondent.
 Closed-ended questions. Examples of commonly used closed-ended question
types in questionnaires are:
 Multiple choice questions (e.g., which of the following running shoe brands have you
owned in the last 10 years: Nike, Adidas, Under Armour, Asics, New Balance, None of
the above);
 Dichotomous (or binary) questions (e.g., have you heard of brand XYZ? Yes, No);
 Scaled response questions (e.g., On a scale from 1 – 7, how likely are you to purchase
Nike shoes next time you purchase running shoes, where 1 means not at all likely and
7 means extremely likely).
10
10
Types of Questions
 The scales of closed-ended questions depend of the question being asked.
Importantly, there are four types of scales:
 Nominal scales are used in questions where the answer choices are
named but they do not have an order or direction.
 Ordinal scales have an order, and the order is important. However,
the difference between the answer choices is not known. What age
range are you in?" it's an ordinal variable.
 Interval scales have an order, and the difference between each
answer choice is known. Marketing researchers commonly use
Likert scales. Likert scales are commonly constructed with five,
seven, or nine points (see scaled response question example above)
and are typically treated as an interval scale.
 Ratio scales are the same as interval scales but they also have a true
zero. Examples are questions such as “how much do you spend on
running shoes every year?”
11
11
The Art of Asking Questions
 When designing a questionnaire, it is critical to make sure (i) the “right”
kind of questions are included and (ii) the questions are worded
appropriately.
 The wording of the questions should be clear, the questions should not
bias the respondent, and the respondent should be able and willing to
answer the questions.
 For example:
 “Do you own any stock? Yes, No.”
Surprising to the researchers, a high degree of stock ownership turned up in
rural areas. However, after investigating the responses some more, the
researchers realized many respondents thought the question was about
livestock (whereas the survey question was about financial products, i.e.,
stocks).
12
The Art of Asking Questions
 When composing a questionnaire, it is important for the researcher to be
clear on why she is asking a question.
 In summary, some of the keys to writing useful questions on a
questionnaire are as follows:
 Avoid complexity – use simple, conversational language
 Avoid leading and loaded questions
 Avoid ambiguity – be as specific as possible
 Avoid double-barreled questions
 Avoid making assumptions
 Avoid burdensome questions
 Avoid badly designed response categories
13
Questionnaire Layout
© Palmatier, Petersen, and Germann 14
 Questionnaires should be designed to appear as short as possible. One
common practice is to use multiple-grid layout where similar questions
and response alternatives are arranged in a grid format.
Please rate each of the following with regard to this flight, if applicable.
Excellent Good Fair Poor Awful
5 4 3 2 1
Courtesy and Treatment from the:
Skycap at airport . . . . . . . . . . . . . .
Airport Ticket CounterAgent . . . . .
Boarding Point (Gate) Agent . . . . .
Flight Attendants . . . . . . . . . . . . . .
Your Meal or Snack. . . . . . . . . . . . .
Beverage Service . . . . . . . . . . . . . .
Seat Comfort. . . . . . . . . . . . . . . . . .
Carry-On Stowage Space. . . . . . . .
Cabin Cleanliness . . . . . . . . . . . . .
Video/Stereo Entertainment . . . . . .
On-Time Departure . . . . . . . . . . . .
Questionnaire Layout
15
 Questionnaires should start with easy to answer questions to build
rapport with the respondent and to indicate that the questionnaire is not
burdensome.
 Tough and important questions should be placed in the middle of the
questionnaire. At that stage, the respondent has likely committed to
completing the questionnaire and is expected to answer these questions
accurately.
 Sensitive and/or demographic questions, in turn, should be asked at the
end of the questionnaire to avoid making the respondent feel uneasy
earlier on.
Questionnaire Layout
© Palmatier, Petersen, and Germann 16
 An Example:
Location Type Purpose/function Example
Starting questions Broad, general questions
To break the ice and establish a
rapport with the respondent
Do you own a DVD player?
Next few
questions
Simple and direct
questions
To reassure the respondent that
the survey is simple and easy to
answer
What brands of DVD
players did you consider
when you bought it?
Questions up to a
third of the
questionnaire
Focused questions
Relate more to the research
objectives and convey to the
respondent the area of research
What attributes did you
consider when you
purchased your DVD
player?
Major portion of
the questionnaire
Focused questions;
some may be difficult or
complicated
To obtain most of the
information required for the
research
Rank following attributes of
a DVD players based on
importance to you
Last few
questions
Personal questions that
may be perceived as
sensitive
To get classification and
demographic information about
the respondent
What is the highest level of
education you have
attained?
Agenda
 Learning Objectives
 Introduction
 Principles of Questionnaire Design
 Types of Questions
 The Art of Asking Questions
 Questionnaire Layout
 Principles of Sampling
 Probability versus Quota Sampling
 Sample Size for Estimating the Population Mean
 Sample Size for Estimating Proportions
 Sample Size Heuristics
 Scales and Factor analysis
 Scale Development Process
 What is Factor analysis?
 Model Underlying Factor Analysis
 Number of Factors to Retain
 Interpretation of Factors
 Summary
 Takeaways
17
© Palmatier, Petersen, and Germann
Probability versus Quota Sampling
 When administering questionnaires, researchers usually want to obtain a
big enough sample to make valid inferences about the population they
are studying and interested in.
 The question, however, is how many respondents do researchers have to
get feedback from to make valid inferences about the population of
interest. And how should the respondents be selected?
18
Probability versus Quota Sampling
 There are two broad ways to select respondents
 Probability sampling entails random selection of respondents (i.e., the
sample). For example, a company might have a database that includes
contact details of all its 10,000 customers. The company could randomly
select 300 customers for the survey.
 Quota sampling, as the name suggests, is based on quotas. For example,
a company may have decided it wants to collect data from 300
respondents. Rather than randomly selecting the 300 respondents from
the population of interest, the company’s researcher contacts every
person in his contact list until he has responses from 300 respondents.
Because the selection of the respondents is left to the subjective
judgment of the researcher – or in the example here, his non-random
contact list.
19
Sample Size for Estimating the Population
Mean
 Depending on the variable of interest, statistical methods
exist that researchers can use to calculate the necessary
sample size.
 Then they can calculate the sample size 𝑛 needed to estimate
the mean attitude using the following formula:
𝑛 =
1
𝑑2
𝑧𝛼 2
2
∗ 𝜎2 +
1
𝑁
 where 𝑁 is the population size; 𝜎2 is the population
variance; 𝑑 is the spread around the population
estimate (i.e., margin of error) the researchers are
willing to accept with a 100(1 – 𝛼) confidence. 𝑧𝛼 is the
Z-score based on the standard normal distribution.
20
Sample Size for Estimating Proportions
 Marketing researchers are often interested in understanding proportions.
For example, they may want to know the proportion of the population of
interest that would purchase their brand (vs. not purchase their brand).
 We can substitute 𝜎2 with
𝑁
𝑁−1
∗ 𝑝 ∗ (1 − 𝑝) (where 𝑝 is the proportion of
the population of interest) to get the equation to calculate the sample size
needed to estimate the proportion of the population of interest. This
gives us the following equation:
𝑛 =
𝑁 ∗ 𝑝 ∗ (1 − 𝑝)
𝑁 − 1 ∗
𝑑2
𝑧𝛼 2
2 + 𝑝 ∗ (1 − 𝑝)
21
 Suppose we want to estimate the average height (mean) of a certain population,
and we take a random sample of 100 individuals. Let's assume the population
standard deviation (σ) is known to be 3 inches.
 Define Parameters:
• Sample size (n): 100
• Population standard deviation (σ): 3 inches
• Desired confidence level: 95%
 Determine the Z-Score: For a 95% confidence level, the critical z-value is
approximately 1.96. You can find this value using a standard normal
distribution table or a calculator.
© Palmatier 22
Sample Size Heuristics
Although it is generally a good idea to use the
above formulas to calculate the required sample
size, some researchers rely on simple sample size
heuristics.
A common heuristic is to have at least 5 times as
many responses as there are survey questions,
and a minimum of 50 responses. Thus, if the
survey includes 20 questions, the sample size
should be 100 (i.e., 20 x 5).
23
Validity and reliability of questionnaire
24
So, we have to reduce these errors to prove scientific findings
How well do our measured variables “capture” the conceptual
variables?
Reliability
Construct Validity
CVs
CVs *
*
*
*
*
*
*
*
*
*
*
*
* *
*
*
*
The extent to which the variables
are free from random error, usually
determined by measuring the
variables more than once.
The extent to which a measured
variable actually measures the
conceptual variables that is design
to assess the extent to which it is
known to reflect the conceptual
variable other measured variables
Reliability as Internal Consistency
The extent to which the scores on
the items correlate with each other
and thus are all measuring the true
score rather than reflecting random
error.
Questionnaire 9/20
___ I feel I do not have much proud of.
___ On the whole, I am satisfied with myself
___ I certainly feel useless at times
___ At times I think I am no good at all
___ I have a number of good qualities
___ I am able to do things as well as others
How Do You Measure
Internal Consistency?
Coefficient Alpha
Split-half Reliability
Scale Development Process
 Survey constructs of interest are often latent and hence cannot be
directly observed.
 Researchers typically use scales to measure latent constructs.
 Scales usually include multiple questions where each question
captures a different (yet related) aspect of the latent construct.
 Marketers have developed hundreds of scales over the years, and
researchers use these existing scales on a regular basis.
 Sometimes, however, there are no existing scales, or the ones that
exist do not fully meet the needs of the researchers. In those cases,
researchers have to develop their own scale(s).
27
Scale Development Process
The scale development process typically follows a four-
phase iterative procedure.
Generating survey questions (also referred to as items) that
capture the essence of the construct of interest.
Researchers usually engage other researchers to evaluate the
clarity and appropriateness of each of the survey questions they
developed and to perhaps expand the list of questions.
Researchers usually administer the resulting questions to a
small sample of target respondents to assess any ambiguity or
difficulty that they experienced when responding.
Researchers will administer the survey questions (i.e., the scale)
along with other survey questions to target respondents.
28
Scale Development Process
Researchers will typically use the responses
to this survey to assess the reliability and
validity of the newly developed scale.
There are a number of analyses researchers
can use to assess scale (or construct)
reliability and validity. In what follows, we
focus on one of the key techniques: Factor
analysis.
29
What is Factor analysis?
Factor analysis is a statistical technique used to analyze
interrelationships (i.e., correlations) among a large
number of variables and to explain these variables in
terms of their common underlying dimensions (referred
to as factors).
The main objective of factor analysis is condensing the
information contained in the original variables into a
small set of factors with a minimum loss of information.
Considering scale development, factor analysis is used to
determine if all scale items (i.e., questions) load onto the
same factor.
30
What is Factor analysis?
 Factor analysis is very similar to principal components analysis (PCA)
 Indeed, both analyses are used to simplify the structure of a set of variables.
 However, the two analyses also differ in important ways.
 In PCA, the components are calculated as linear combinations of the underlying
variables whereas in factor analysis, the underlying variables are defined as linear
combinations of the factors.
 Moreover, in PCA, the goal is to explain as much of the total variance in the
variables as possible.
 In contrast, the goal of factor analysis is to explain the correlations among the
variables.
 Furthermore, PCA is typically used to reduce the data into a smaller number of
components whereas factor analysis is used to understand the constructs that
underlie the variables.
31
What is Factor analysis?
Factor analysis is also very similar to cluster
analysis discussed in
Both are used to analyze the structure of
relationships.
However, whereas factor analysis is used to understand the
structure of the relationship among variables, cluster
analysis is used to identify meaningful subgroups (or
clusters) of individuals (e.g., customers) or objects (e.g.
companies).
That is, the objective of cluster analysis is to classify a
sample of individuals or objects into a small number of
mutually exclusive clusters.
32
Model Underlying Factor Analysis
 The key purpose of factor analysis is to capture – if possible – the relationships among
many variables in terms of a few underlying (latent) factors.
 Thus, it is conceivable that each group of variables that load onto a specific factor
represent a single latent construct.
 The orthogonal factor model takes the following form:
𝑋1 − 𝜇1 = 𝑙11𝐹1 + 𝑙12𝐹2 + ⋯ + 𝑙1𝑚𝐹
𝑚 + 𝜀1
𝑋2 − 𝜇2 = 𝑙21𝐹1 + 𝑙22𝐹2 + ⋯ + 𝑙2𝑚𝐹𝑚 + 𝜀2
:
:
:
:
𝑋𝑛 − 𝜇𝑛 = 𝑙𝑛1𝐹1 + 𝑙𝑛2𝐹2 + ⋯ + 𝑙𝑛𝑚𝐹
𝑚 + 𝜀𝑛
where 𝑋𝑛 are the n variables included in the factor analysis, 𝐹𝑚 are the m factors
provided by the factor analysis, 𝑙𝑛𝑚 are the loadings of the n’s variable on the m’s factor,
and 𝜀𝑛 are the additional sources of variation of 𝑋𝑛 − 𝜇𝑛 not captured by the factors. 𝜇𝑛
are deviations that are unobservable, which distinguish the factor model from
multivariate regression models.
33
Model Underlying Factor Analysis
 In unrotated factor analysis, factors are extracted in the order of their
importance.
 The first factor is usually a general factor onto which almost every
variables loads.
 Thus, the first factor usually captures the largest amount of variance of
the variables.
 The second and subsequent factors then capture the residual amount of
variance, with each factor accounting for smaller portions of the variance.
 That said, factors can also be rotated to redistribute the variance from
earlier factors to later factors, thus facilitating the interpretation of the
various factors.
34
Number of Factors to Retain
 There are a number of criteria researchers use when deciding on how
many factors to retain in a factor analysis.
 The most commonly used technique is the “Latent Root (or Eigenvalue)
Criterion.” The rationale behind this technique is that any factor should
account for the majority of variance of at least one variable. Only factors
with an eigenvalue of at least 1 are retained.
 Another criterion researchers use to decide on the number of factors to
retain is the “Percentage of Variance Criterion.” This criterion is based on
attaining at least a specified amount of variance of the variables with the
factors.
 Some researchers use the “A Priori Criterion.” As the name suggests, in
this case, researchers decide before running the factor analysis how
many factors they want to retain. Thus, they tell the software to keep a
certain number of factors (for example, 4).
35
Interpretation of Factors
 Once the number of factors to retain has been decided upon,
the factors need to be interpreted.
 Researchers interpret factors based on the variables that
load onto the factors.
 Importantly, variables are typically associated with (i.e., load
onto) most if not all factors. However, they do so with
varying degrees.
 For example, a variable may have a factor loading of 0.12
with Factor 1 and a factor loading of 0.75 with Factor 2. Thus,
this variable would be used to interpret Factor 2 (and not
Factor 1).
36
Interpretation of Factors
 An important feature of Factor analysis is that factors can be rotated.
Sometimes rotations help with interpreting factors.
 In orthogonal factor
rotation, the axes are
maintained at 90 degrees
as the factors are rotated
about the origin.
 Thus, the factors remain
uncorrelated with each
other.
37
0.00 +.50 +1.0
Unrotated
Factor 1
+.50
+1.0
-1.0
-.50
-1.0 -.50
Unrotated
Factor 2 Rotated
Factor 2
Rotated
Factor 1
Variable 1
Variable 2
Variable 3
Variable 4
Variable 5
Variable 6
Problem in hand
 CR 100 YEAR OLD RETAILER, NOW WANTS TO DO A BRAND SURVEY
 CR EARNED A GOOD NAME AS A‘ RETAILER WHO MFG AND SELLS HIGH
QUALITY APPAREL PRODUCTS ‘. IT EXPANDED TO HOME GOODS, SMALL
APPLIANCES.
 IN THE COUSE OF TIME CR OPENED SOME ONLINE OUTLETS ALSO
 NOW IT HAS 7 PRODUCT CATEGORIES
 CR WANTED TO EVALATE
© Palmatier 38
Statement of retail brand audit for 100 customers : Likert
Scale of 1 and 7: 1 strongly disagree and 7 strongly agree
Across 6 retail brands; CHESTNUT R and Retailer A to E
Name Statement
Quality Retailer X sells high quality products
Last I know that Retailer X’s clothes last
Fit Retailer X cloth always fits well
Latest Retailer X SELLS LATEST CLOTHS
Trends
Stylish
Value
Bargain I find a good bargain at Retailer X
Worth I find Retailer X cloth are worth buying I
Satisfied I AM SATISFIED WITH RETAILER X
Purchase I WILL PURCHASE FROM RETAILER X AGAIN
Recommen
d
I recommend Retailer X cloth are worth buying I
39
Retail brand audit
Qualit
y
Last Fit Latest Trend
s
Stylis
h
Value Bargai
n
WorthSatisfi
ed
Purch
ase
Reco
mme
nd
Retail
er
2 1 1 7 7 5 3 3 4 6 7 7CR
7 7 7 3 4 3 1 1 1 2 1 3CR
3 4 2 3 3 5 6 7 6 2 2 1CR
4 5 3 7 7 7 1 2 2 7 7 6CR
4 4 4 3 3 3 1 1 1 6 6 6CR
3 4 3 7 5 6 1 1 1 1 3 2CR
Let’s do Factor Analysis in R AND Ascertain the audit result
40

More Related Content

Similar to factor survey_new Data Analytics Marketing.pptx

Module 2 part 6 and writing plan
Module 2 part 6 and writing planModule 2 part 6 and writing plan
Module 2 part 6 and writing planPaula Nottingham
 
Week6 7a- developing a questionnaire
Week6 7a- developing a questionnaireWeek6 7a- developing a questionnaire
Week6 7a- developing a questionnaire
Hafizul Mukhlis
 
Questionnaires 6 steps for research method.
Questionnaires 6 steps for research method.Questionnaires 6 steps for research method.
Questionnaires 6 steps for research method.
Namo Kim
 
Questionnaires
QuestionnairesQuestionnaires
Questionnaires
Anamika Sonawane
 
Data Quality Tips.pdf
Data Quality Tips.pdfData Quality Tips.pdf
Data Quality Tips.pdf
AhmedMohammed611235
 
Survey Design: Introduction & Overview
Survey Design: Introduction & OverviewSurvey Design: Introduction & Overview
Survey Design: Introduction & Overview
James Neill
 
Research methodology tools design
Research methodology   tools designResearch methodology   tools design
Research methodology tools design
PrestonAssociates
 
Carma internet research module: Survey reduction
Carma internet research module: Survey reductionCarma internet research module: Survey reduction
Carma internet research module: Survey reduction
Syracuse University
 
Smart survey
Smart surveySmart survey
Smart surveytieutam
 
How to design questionnaire
How to design questionnaireHow to design questionnaire
How to design questionnaire
sukesh gowda
 
Designing a survey questionnaire
Designing a survey questionnaireDesigning a survey questionnaire
Designing a survey questionnaire
Argie Ray Butalid
 
Questionnaire Vinay
Questionnaire VinayQuestionnaire Vinay
Questionnaire Vinaynibraspk
 
Questionnaire and schedules design
Questionnaire and schedules designQuestionnaire and schedules design
Questionnaire and schedules design
gentlemoro
 
How to Conduct a Survey gf form to anylyz
How to Conduct a Survey gf form to anylyzHow to Conduct a Survey gf form to anylyz
How to Conduct a Survey gf form to anylyz
edenjrodrigo
 
SURVEY TUTORIAL
SURVEY TUTORIALSURVEY TUTORIAL
SURVEY TUTORIALHanen Mhz
 
9.11.11 final tools for learning data gathering
9.11.11 final tools for learning data gathering9.11.11 final tools for learning data gathering
9.11.11 final tools for learning data gatheringPaula Nottingham
 
Right now customer-feedback-survey-best-practices-brief
Right now customer-feedback-survey-best-practices-briefRight now customer-feedback-survey-best-practices-brief
Right now customer-feedback-survey-best-practices-briefRightNow Technologies
 
Tools campus workshop 17.3.11 bapp wbs3835 qual r
Tools campus workshop 17.3.11 bapp wbs3835 qual rTools campus workshop 17.3.11 bapp wbs3835 qual r
Tools campus workshop 17.3.11 bapp wbs3835 qual rPaula Nottingham
 

Similar to factor survey_new Data Analytics Marketing.pptx (20)

Module 2 part 6 and writing plan
Module 2 part 6 and writing planModule 2 part 6 and writing plan
Module 2 part 6 and writing plan
 
Week6 7a- developing a questionnaire
Week6 7a- developing a questionnaireWeek6 7a- developing a questionnaire
Week6 7a- developing a questionnaire
 
Questionnaires 6 steps for research method.
Questionnaires 6 steps for research method.Questionnaires 6 steps for research method.
Questionnaires 6 steps for research method.
 
Questionnaires
QuestionnairesQuestionnaires
Questionnaires
 
Data Quality Tips.pdf
Data Quality Tips.pdfData Quality Tips.pdf
Data Quality Tips.pdf
 
Survey Design: Introduction & Overview
Survey Design: Introduction & OverviewSurvey Design: Introduction & Overview
Survey Design: Introduction & Overview
 
Research methodology tools design
Research methodology   tools designResearch methodology   tools design
Research methodology tools design
 
Carma internet research module: Survey reduction
Carma internet research module: Survey reductionCarma internet research module: Survey reduction
Carma internet research module: Survey reduction
 
Smart survey
Smart surveySmart survey
Smart survey
 
How to design questionnaire
How to design questionnaireHow to design questionnaire
How to design questionnaire
 
SOC2002 Lecture 5
SOC2002 Lecture 5SOC2002 Lecture 5
SOC2002 Lecture 5
 
Designing a survey questionnaire
Designing a survey questionnaireDesigning a survey questionnaire
Designing a survey questionnaire
 
Questionnaire Vinay
Questionnaire VinayQuestionnaire Vinay
Questionnaire Vinay
 
Questionnaire and schedules design
Questionnaire and schedules designQuestionnaire and schedules design
Questionnaire and schedules design
 
How to Conduct a Survey gf form to anylyz
How to Conduct a Survey gf form to anylyzHow to Conduct a Survey gf form to anylyz
How to Conduct a Survey gf form to anylyz
 
SURVEY TUTORIAL
SURVEY TUTORIALSURVEY TUTORIAL
SURVEY TUTORIAL
 
9.11.11 final tools for learning data gathering
9.11.11 final tools for learning data gathering9.11.11 final tools for learning data gathering
9.11.11 final tools for learning data gathering
 
Right now customer-feedback-survey-best-practices-brief
Right now customer-feedback-survey-best-practices-briefRight now customer-feedback-survey-best-practices-brief
Right now customer-feedback-survey-best-practices-brief
 
Surveys
SurveysSurveys
Surveys
 
Tools campus workshop 17.3.11 bapp wbs3835 qual r
Tools campus workshop 17.3.11 bapp wbs3835 qual rTools campus workshop 17.3.11 bapp wbs3835 qual r
Tools campus workshop 17.3.11 bapp wbs3835 qual r
 

Recently uploaded

Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Domenico Conte
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
SatyamNeelmani2
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
elinavihriala
 

Recently uploaded (20)

Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 

factor survey_new Data Analytics Marketing.pptx

  • 1. © Palmatier, Petersen, and Germann 1 Survey Design and Testing to Derive Customer Insights Marketing Analytics Based on First Principles :
  • 2. Agenda  Learning Objectives  Introduction  Principles of Questionnaire Design  Types of Questions  The Art of Asking Questions  Questionnaire Layout  Principles of Sampling  Probability versus Quota Sampling  Sample Size for Estimating the Population Mean  Sample Size for Estimating Proportions  Sample Size Heuristics  Scales and Factor analysis  Scale Development Process  What is Factor analysis?  Model Underlying Factor Analysis  Number of Factors to Retain  Interpretation of Factors  Summary  Takeaways 2
  • 3.  Be able to describe the most common methods used to administer questionnaires. Explain the advantages and disadvantages of these methods.  Understand the different types of scales that exist.  Be able to successfully design a questionnaire.  Understand the difference between probability and quota sampling, and know when to use which type of sampling.  Be able to calculate the sample size needed to estimate the population mean and proportion.  Understand what a latent construct is.  Know what factor analysis is and how it differs from principal component analysis (PCA) and cluster analysis.  Understand how factors can be rotated and know the difference between orthogonal and oblique factor rotation. 3 Learning Objectives
  • 4. Agenda  Learning Objectives  Introduction  Principles of Questionnaire Design  Types of Questions  The Art of Asking Questions  Questionnaire Layout  Principles of Sampling  Probability versus Quota Sampling  Sample Size for Estimating the Population Mean  Sample Size for Estimating Proportions  Sample Size Heuristics  Scales and Factor analysis  Scale Development Process  What is Factor analysis?  Model Underlying Factor Analysis  Number of Factors to Retain  Interpretation of Factors  Summary  Takeaways 4 © Palmatier, Petersen, and Germann
  • 5. Principles of Questionnaire Design  Conducting marketing analytics requires data. Sometimes, the data needed is already available to the researcher and does not need to be collected. In that case, researchers use what is referred to as secondary data, that is, data that was initially collected for a different purpose (e.g., data from the Census Bureau).  At other times, new data needs to be collected. This type of data is referred to as primary data. There are various ways to obtain primary data.  However, an oft-used method entails collecting data by means of questionnaires. 5
  • 6. Principles of Questionnaire Design  A questionnaire is a formalized set of questions for obtaining information from respondents to generate the data necessary to accomplish the objectives of the research project.  Based on this definition, we can see that questionnaires should:  Include a set of specific questions that the respondents can and will answer  Should facilitate data processing. Specifically, it should be easy to record questionnaire answers, and code and analyze the responses  Provide the information and data needed to answer the question(s) the researcher seeks to answer with the questionnaire 6
  • 7. Principles of Questionnaire Design Questionnaires can be administered in various ways. The four most common methods are: Personal (i.e., face-to-face) interviews Telephone interviews Mail questionnaires Online questionnaires Each one of these methods has advantages and disadvantages. 7
  • 8. Principles of Questionnaire Design 8 Advantages Disadvantages Personal Interviews Flexibility in obtaining data Bias of interviewer Face-to-face encounter Response bias Ability to clear doubt in person Time requirements Can arouse and keep interest Cost per completed interview is high Can build rapport Assistance of visual aid Can clarify misunderstandings Can probe for more complete answers Telephone Interviews Can be performed in a short period of time Calls must be kept as short as possible Low non-response rate Bias of interviewer Lower cost than personal interview Difficult to use visual aids Ability to clear doubt in person Viewed as disrespectful by some Can clarify misunderstandings Difficulty obtaining accurate sample Can probe for more complete answers
  • 9. Principles of Questionnaire Design 9 Advantages Disadvantages Mail Questionnaires Can cover broad respondent base High non-response rate (many people do not mail the questionnaires back) Lower cost compared to personal and telephone interviews Cost-per survey could be high Often more thought given to responses (compared to online) Questions can be misinterpreted Can be more effective when dealing with sensitive topics if anonymous Cannot control for the identity of the respondent Can be difficult to compile accurate mailing list Online Questionnaires Fast turnaround in administering questionnaire High non-response rate Relatively inexpensive Questions can be misinterpreted Immediate return upon completion Respondents do not give answers much thought Flexible design options (e.g., different questions appear based on previous answers) Cannot control for the identity of the respondent Inexpensive online questionnaire solutions exist
  • 10. Types of Questions  There are two types of questions that are typically included in questionnaires:  Open-ended questions. An example of an open-ended question is: “What comes to your mind when you think of brand XYZ? Please list all your thoughts.” Some advantages of open-ended questions are that they (i) can provide very rich information and (ii) are good for exploratory research (for example, when the researchers are unsure about the questions to ask). Some of the disadvantages of open-ended questions are that they (i) are sometimes difficult to analyze and (ii) can be burdensome for the respondent.  Closed-ended questions. Examples of commonly used closed-ended question types in questionnaires are:  Multiple choice questions (e.g., which of the following running shoe brands have you owned in the last 10 years: Nike, Adidas, Under Armour, Asics, New Balance, None of the above);  Dichotomous (or binary) questions (e.g., have you heard of brand XYZ? Yes, No);  Scaled response questions (e.g., On a scale from 1 – 7, how likely are you to purchase Nike shoes next time you purchase running shoes, where 1 means not at all likely and 7 means extremely likely). 10 10
  • 11. Types of Questions  The scales of closed-ended questions depend of the question being asked. Importantly, there are four types of scales:  Nominal scales are used in questions where the answer choices are named but they do not have an order or direction.  Ordinal scales have an order, and the order is important. However, the difference between the answer choices is not known. What age range are you in?" it's an ordinal variable.  Interval scales have an order, and the difference between each answer choice is known. Marketing researchers commonly use Likert scales. Likert scales are commonly constructed with five, seven, or nine points (see scaled response question example above) and are typically treated as an interval scale.  Ratio scales are the same as interval scales but they also have a true zero. Examples are questions such as “how much do you spend on running shoes every year?” 11 11
  • 12. The Art of Asking Questions  When designing a questionnaire, it is critical to make sure (i) the “right” kind of questions are included and (ii) the questions are worded appropriately.  The wording of the questions should be clear, the questions should not bias the respondent, and the respondent should be able and willing to answer the questions.  For example:  “Do you own any stock? Yes, No.” Surprising to the researchers, a high degree of stock ownership turned up in rural areas. However, after investigating the responses some more, the researchers realized many respondents thought the question was about livestock (whereas the survey question was about financial products, i.e., stocks). 12
  • 13. The Art of Asking Questions  When composing a questionnaire, it is important for the researcher to be clear on why she is asking a question.  In summary, some of the keys to writing useful questions on a questionnaire are as follows:  Avoid complexity – use simple, conversational language  Avoid leading and loaded questions  Avoid ambiguity – be as specific as possible  Avoid double-barreled questions  Avoid making assumptions  Avoid burdensome questions  Avoid badly designed response categories 13
  • 14. Questionnaire Layout © Palmatier, Petersen, and Germann 14  Questionnaires should be designed to appear as short as possible. One common practice is to use multiple-grid layout where similar questions and response alternatives are arranged in a grid format. Please rate each of the following with regard to this flight, if applicable. Excellent Good Fair Poor Awful 5 4 3 2 1 Courtesy and Treatment from the: Skycap at airport . . . . . . . . . . . . . . Airport Ticket CounterAgent . . . . . Boarding Point (Gate) Agent . . . . . Flight Attendants . . . . . . . . . . . . . . Your Meal or Snack. . . . . . . . . . . . . Beverage Service . . . . . . . . . . . . . . Seat Comfort. . . . . . . . . . . . . . . . . . Carry-On Stowage Space. . . . . . . . Cabin Cleanliness . . . . . . . . . . . . . Video/Stereo Entertainment . . . . . . On-Time Departure . . . . . . . . . . . .
  • 15. Questionnaire Layout 15  Questionnaires should start with easy to answer questions to build rapport with the respondent and to indicate that the questionnaire is not burdensome.  Tough and important questions should be placed in the middle of the questionnaire. At that stage, the respondent has likely committed to completing the questionnaire and is expected to answer these questions accurately.  Sensitive and/or demographic questions, in turn, should be asked at the end of the questionnaire to avoid making the respondent feel uneasy earlier on.
  • 16. Questionnaire Layout © Palmatier, Petersen, and Germann 16  An Example: Location Type Purpose/function Example Starting questions Broad, general questions To break the ice and establish a rapport with the respondent Do you own a DVD player? Next few questions Simple and direct questions To reassure the respondent that the survey is simple and easy to answer What brands of DVD players did you consider when you bought it? Questions up to a third of the questionnaire Focused questions Relate more to the research objectives and convey to the respondent the area of research What attributes did you consider when you purchased your DVD player? Major portion of the questionnaire Focused questions; some may be difficult or complicated To obtain most of the information required for the research Rank following attributes of a DVD players based on importance to you Last few questions Personal questions that may be perceived as sensitive To get classification and demographic information about the respondent What is the highest level of education you have attained?
  • 17. Agenda  Learning Objectives  Introduction  Principles of Questionnaire Design  Types of Questions  The Art of Asking Questions  Questionnaire Layout  Principles of Sampling  Probability versus Quota Sampling  Sample Size for Estimating the Population Mean  Sample Size for Estimating Proportions  Sample Size Heuristics  Scales and Factor analysis  Scale Development Process  What is Factor analysis?  Model Underlying Factor Analysis  Number of Factors to Retain  Interpretation of Factors  Summary  Takeaways 17 © Palmatier, Petersen, and Germann
  • 18. Probability versus Quota Sampling  When administering questionnaires, researchers usually want to obtain a big enough sample to make valid inferences about the population they are studying and interested in.  The question, however, is how many respondents do researchers have to get feedback from to make valid inferences about the population of interest. And how should the respondents be selected? 18
  • 19. Probability versus Quota Sampling  There are two broad ways to select respondents  Probability sampling entails random selection of respondents (i.e., the sample). For example, a company might have a database that includes contact details of all its 10,000 customers. The company could randomly select 300 customers for the survey.  Quota sampling, as the name suggests, is based on quotas. For example, a company may have decided it wants to collect data from 300 respondents. Rather than randomly selecting the 300 respondents from the population of interest, the company’s researcher contacts every person in his contact list until he has responses from 300 respondents. Because the selection of the respondents is left to the subjective judgment of the researcher – or in the example here, his non-random contact list. 19
  • 20. Sample Size for Estimating the Population Mean  Depending on the variable of interest, statistical methods exist that researchers can use to calculate the necessary sample size.  Then they can calculate the sample size 𝑛 needed to estimate the mean attitude using the following formula: 𝑛 = 1 𝑑2 𝑧𝛼 2 2 ∗ 𝜎2 + 1 𝑁  where 𝑁 is the population size; 𝜎2 is the population variance; 𝑑 is the spread around the population estimate (i.e., margin of error) the researchers are willing to accept with a 100(1 – 𝛼) confidence. 𝑧𝛼 is the Z-score based on the standard normal distribution. 20
  • 21. Sample Size for Estimating Proportions  Marketing researchers are often interested in understanding proportions. For example, they may want to know the proportion of the population of interest that would purchase their brand (vs. not purchase their brand).  We can substitute 𝜎2 with 𝑁 𝑁−1 ∗ 𝑝 ∗ (1 − 𝑝) (where 𝑝 is the proportion of the population of interest) to get the equation to calculate the sample size needed to estimate the proportion of the population of interest. This gives us the following equation: 𝑛 = 𝑁 ∗ 𝑝 ∗ (1 − 𝑝) 𝑁 − 1 ∗ 𝑑2 𝑧𝛼 2 2 + 𝑝 ∗ (1 − 𝑝) 21
  • 22.  Suppose we want to estimate the average height (mean) of a certain population, and we take a random sample of 100 individuals. Let's assume the population standard deviation (σ) is known to be 3 inches.  Define Parameters: • Sample size (n): 100 • Population standard deviation (σ): 3 inches • Desired confidence level: 95%  Determine the Z-Score: For a 95% confidence level, the critical z-value is approximately 1.96. You can find this value using a standard normal distribution table or a calculator. © Palmatier 22
  • 23. Sample Size Heuristics Although it is generally a good idea to use the above formulas to calculate the required sample size, some researchers rely on simple sample size heuristics. A common heuristic is to have at least 5 times as many responses as there are survey questions, and a minimum of 50 responses. Thus, if the survey includes 20 questions, the sample size should be 100 (i.e., 20 x 5). 23
  • 24. Validity and reliability of questionnaire 24
  • 25. So, we have to reduce these errors to prove scientific findings How well do our measured variables “capture” the conceptual variables? Reliability Construct Validity CVs CVs * * * * * * * * * * * * * * * * * The extent to which the variables are free from random error, usually determined by measuring the variables more than once. The extent to which a measured variable actually measures the conceptual variables that is design to assess the extent to which it is known to reflect the conceptual variable other measured variables
  • 26. Reliability as Internal Consistency The extent to which the scores on the items correlate with each other and thus are all measuring the true score rather than reflecting random error. Questionnaire 9/20 ___ I feel I do not have much proud of. ___ On the whole, I am satisfied with myself ___ I certainly feel useless at times ___ At times I think I am no good at all ___ I have a number of good qualities ___ I am able to do things as well as others How Do You Measure Internal Consistency? Coefficient Alpha Split-half Reliability
  • 27. Scale Development Process  Survey constructs of interest are often latent and hence cannot be directly observed.  Researchers typically use scales to measure latent constructs.  Scales usually include multiple questions where each question captures a different (yet related) aspect of the latent construct.  Marketers have developed hundreds of scales over the years, and researchers use these existing scales on a regular basis.  Sometimes, however, there are no existing scales, or the ones that exist do not fully meet the needs of the researchers. In those cases, researchers have to develop their own scale(s). 27
  • 28. Scale Development Process The scale development process typically follows a four- phase iterative procedure. Generating survey questions (also referred to as items) that capture the essence of the construct of interest. Researchers usually engage other researchers to evaluate the clarity and appropriateness of each of the survey questions they developed and to perhaps expand the list of questions. Researchers usually administer the resulting questions to a small sample of target respondents to assess any ambiguity or difficulty that they experienced when responding. Researchers will administer the survey questions (i.e., the scale) along with other survey questions to target respondents. 28
  • 29. Scale Development Process Researchers will typically use the responses to this survey to assess the reliability and validity of the newly developed scale. There are a number of analyses researchers can use to assess scale (or construct) reliability and validity. In what follows, we focus on one of the key techniques: Factor analysis. 29
  • 30. What is Factor analysis? Factor analysis is a statistical technique used to analyze interrelationships (i.e., correlations) among a large number of variables and to explain these variables in terms of their common underlying dimensions (referred to as factors). The main objective of factor analysis is condensing the information contained in the original variables into a small set of factors with a minimum loss of information. Considering scale development, factor analysis is used to determine if all scale items (i.e., questions) load onto the same factor. 30
  • 31. What is Factor analysis?  Factor analysis is very similar to principal components analysis (PCA)  Indeed, both analyses are used to simplify the structure of a set of variables.  However, the two analyses also differ in important ways.  In PCA, the components are calculated as linear combinations of the underlying variables whereas in factor analysis, the underlying variables are defined as linear combinations of the factors.  Moreover, in PCA, the goal is to explain as much of the total variance in the variables as possible.  In contrast, the goal of factor analysis is to explain the correlations among the variables.  Furthermore, PCA is typically used to reduce the data into a smaller number of components whereas factor analysis is used to understand the constructs that underlie the variables. 31
  • 32. What is Factor analysis? Factor analysis is also very similar to cluster analysis discussed in Both are used to analyze the structure of relationships. However, whereas factor analysis is used to understand the structure of the relationship among variables, cluster analysis is used to identify meaningful subgroups (or clusters) of individuals (e.g., customers) or objects (e.g. companies). That is, the objective of cluster analysis is to classify a sample of individuals or objects into a small number of mutually exclusive clusters. 32
  • 33. Model Underlying Factor Analysis  The key purpose of factor analysis is to capture – if possible – the relationships among many variables in terms of a few underlying (latent) factors.  Thus, it is conceivable that each group of variables that load onto a specific factor represent a single latent construct.  The orthogonal factor model takes the following form: 𝑋1 − 𝜇1 = 𝑙11𝐹1 + 𝑙12𝐹2 + ⋯ + 𝑙1𝑚𝐹 𝑚 + 𝜀1 𝑋2 − 𝜇2 = 𝑙21𝐹1 + 𝑙22𝐹2 + ⋯ + 𝑙2𝑚𝐹𝑚 + 𝜀2 : : : : 𝑋𝑛 − 𝜇𝑛 = 𝑙𝑛1𝐹1 + 𝑙𝑛2𝐹2 + ⋯ + 𝑙𝑛𝑚𝐹 𝑚 + 𝜀𝑛 where 𝑋𝑛 are the n variables included in the factor analysis, 𝐹𝑚 are the m factors provided by the factor analysis, 𝑙𝑛𝑚 are the loadings of the n’s variable on the m’s factor, and 𝜀𝑛 are the additional sources of variation of 𝑋𝑛 − 𝜇𝑛 not captured by the factors. 𝜇𝑛 are deviations that are unobservable, which distinguish the factor model from multivariate regression models. 33
  • 34. Model Underlying Factor Analysis  In unrotated factor analysis, factors are extracted in the order of their importance.  The first factor is usually a general factor onto which almost every variables loads.  Thus, the first factor usually captures the largest amount of variance of the variables.  The second and subsequent factors then capture the residual amount of variance, with each factor accounting for smaller portions of the variance.  That said, factors can also be rotated to redistribute the variance from earlier factors to later factors, thus facilitating the interpretation of the various factors. 34
  • 35. Number of Factors to Retain  There are a number of criteria researchers use when deciding on how many factors to retain in a factor analysis.  The most commonly used technique is the “Latent Root (or Eigenvalue) Criterion.” The rationale behind this technique is that any factor should account for the majority of variance of at least one variable. Only factors with an eigenvalue of at least 1 are retained.  Another criterion researchers use to decide on the number of factors to retain is the “Percentage of Variance Criterion.” This criterion is based on attaining at least a specified amount of variance of the variables with the factors.  Some researchers use the “A Priori Criterion.” As the name suggests, in this case, researchers decide before running the factor analysis how many factors they want to retain. Thus, they tell the software to keep a certain number of factors (for example, 4). 35
  • 36. Interpretation of Factors  Once the number of factors to retain has been decided upon, the factors need to be interpreted.  Researchers interpret factors based on the variables that load onto the factors.  Importantly, variables are typically associated with (i.e., load onto) most if not all factors. However, they do so with varying degrees.  For example, a variable may have a factor loading of 0.12 with Factor 1 and a factor loading of 0.75 with Factor 2. Thus, this variable would be used to interpret Factor 2 (and not Factor 1). 36
  • 37. Interpretation of Factors  An important feature of Factor analysis is that factors can be rotated. Sometimes rotations help with interpreting factors.  In orthogonal factor rotation, the axes are maintained at 90 degrees as the factors are rotated about the origin.  Thus, the factors remain uncorrelated with each other. 37 0.00 +.50 +1.0 Unrotated Factor 1 +.50 +1.0 -1.0 -.50 -1.0 -.50 Unrotated Factor 2 Rotated Factor 2 Rotated Factor 1 Variable 1 Variable 2 Variable 3 Variable 4 Variable 5 Variable 6
  • 38. Problem in hand  CR 100 YEAR OLD RETAILER, NOW WANTS TO DO A BRAND SURVEY  CR EARNED A GOOD NAME AS A‘ RETAILER WHO MFG AND SELLS HIGH QUALITY APPAREL PRODUCTS ‘. IT EXPANDED TO HOME GOODS, SMALL APPLIANCES.  IN THE COUSE OF TIME CR OPENED SOME ONLINE OUTLETS ALSO  NOW IT HAS 7 PRODUCT CATEGORIES  CR WANTED TO EVALATE © Palmatier 38
  • 39. Statement of retail brand audit for 100 customers : Likert Scale of 1 and 7: 1 strongly disagree and 7 strongly agree Across 6 retail brands; CHESTNUT R and Retailer A to E Name Statement Quality Retailer X sells high quality products Last I know that Retailer X’s clothes last Fit Retailer X cloth always fits well Latest Retailer X SELLS LATEST CLOTHS Trends Stylish Value Bargain I find a good bargain at Retailer X Worth I find Retailer X cloth are worth buying I Satisfied I AM SATISFIED WITH RETAILER X Purchase I WILL PURCHASE FROM RETAILER X AGAIN Recommen d I recommend Retailer X cloth are worth buying I 39
  • 40. Retail brand audit Qualit y Last Fit Latest Trend s Stylis h Value Bargai n WorthSatisfi ed Purch ase Reco mme nd Retail er 2 1 1 7 7 5 3 3 4 6 7 7CR 7 7 7 3 4 3 1 1 1 2 1 3CR 3 4 2 3 3 5 6 7 6 2 2 1CR 4 5 3 7 7 7 1 2 2 7 7 6CR 4 4 4 3 3 3 1 1 1 6 6 6CR 3 4 3 7 5 6 1 1 1 1 3 2CR Let’s do Factor Analysis in R AND Ascertain the audit result 40