Weighting online data
Jeffrey Henning
Executive Director
Market Research Institute International
August 2019
Sponsors
Communication
Gold
Silver
Visit NewMR.org
www.marketresearchcourses.org
Learning Objective:
Describe the challenges in obtaining representative samples and how representative samples can be improved at the selection stage or through weighting.
Challenges:
Traditionally we have recommended weighting for probability samples.
Should we recommend weighting for non-probability samples? When?
• Post-stratification weighting is viewed as a common solution for removing sampling bias.
• But it is often misrepresented as a simple process of arithmetic…
Weighting
U.S.        Men                  Women
18 to 54    79,184,164           79,017,200
            169 responses        199 responses
            469K weight          397K weight
55+         36,301,576           43,154,705
            15 responses         17 responses
            2,420K weight        2,539K weight
Cell Weighting
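The weights in the table above come from dividing each cell's population by its number of responses. A minimal sketch of that arithmetic in Python, using the slide's figures; the dictionary layout and the function name `cell_weights` are illustrative assumptions, not part of the deck.

```python
# Population and response counts per cell, taken from the Cell Weighting slide.
# The data layout and the function name are illustrative assumptions.
CELLS = {
    ("Men", "18 to 54"): (79_184_164, 169),
    ("Women", "18 to 54"): (79_017_200, 199),
    ("Men", "55+"): (36_301_576, 15),
    ("Women", "55+"): (43_154_705, 17),
}


def cell_weights(cells):
    """Return population / responses per cell (people represented by each respondent)."""
    weights = {}
    for cell, (population, responses) in cells.items():
        if responses == 0:
            # An empty cell cannot be weighted; cells would need to be collapsed first.
            raise ValueError(f"No responses in cell {cell}")
        weights[cell] = population / responses
    return weights


weights = cell_weights(CELLS)
for (sex, age), w in weights.items():
    print(f"{sex:5s} {age:8s} {w / 1000:8,.0f}K")
```

Each 55+ respondent stands in for more than five times as many people as an 18-to-54 respondent, which is why the judgment calls on the next slide matter.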
• Do you want to set minimum and maximum weights? If so, to what? Why?
• How will you simplify the weighting scheme if sample balance is too low (e.g., <70%)?
• Which questions to weight on?
Editorial Judgments
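The first two questions can be made concrete with a short sketch. It assumes that "sample balance" corresponds to Kish's weighting efficiency, (Σw)² / (n·Σw²); the deck does not define the term, so treat that mapping as an assumption. The trimming bounds of 0.3 and 3.0 times the mean weight are hypothetical, and the function names are illustrative.

```python
def weighting_efficiency(weights):
    """Kish's approximation: (sum w)^2 / (n * sum w^2), a value between 0 and 1."""
    n = len(weights)
    total = sum(weights)
    return total * total / (n * sum(w * w for w in weights))


def trim_weights(weights, low, high):
    """Scale weights to mean 1, clip to [low, high], then rescale to mean 1.

    This is a single pass: after rescaling, some weights can end up
    slightly outside the bounds, so real pipelines often iterate.
    """
    mean = sum(weights) / len(weights)
    clipped = [min(max(w / mean, low), high) for w in weights]
    clipped_mean = sum(clipped) / len(clipped)
    return [w / clipped_mean for w in clipped]


# Respondent-level weights implied by the Cell Weighting slide:
# (number of responses, weight per respondent) for each of the four cells.
cells = [(169, 79_184_164 / 169), (199, 79_017_200 / 199),
         (15, 36_301_576 / 15), (17, 43_154_705 / 17)]
raw = [w for count, w in cells for _ in range(count)]

print(f"untrimmed efficiency: {weighting_efficiency(raw):.0%}")
trimmed = trim_weights(raw, low=0.3, high=3.0)  # hypothetical bounds
print(f"trimmed efficiency:   {weighting_efficiency(trimmed):.0%}")
```

Under these assumptions the untrimmed cell weights give an efficiency of roughly 53%, below the 70% example threshold, and trimming at 3.0 raises it to roughly 68% at the cost of under-representing the 55+ cells.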
Age
Sex
Region
Race/ethnicity
Education level
Household income
Proprietary measure
Rim Weighting / Raking
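Raking adjusts weights one variable at a time until the weighted sample matches every target margin. The sketch below is a minimal pure-Python version of iterative proportional fitting for two variables; the function name `rake`, the data layout, and the stopping rule are illustrative assumptions. The sample counts and the margin totals are derived from the Cell Weighting slide (the margins are sums of its cell populations).

```python
# Respondents per cell, from the Cell Weighting slide.
SAMPLE = {
    ("Men", "18 to 54"): 169,
    ("Women", "18 to 54"): 199,
    ("Men", "55+"): 15,
    ("Women", "55+"): 17,
}

# Target margins: sums of the slide's cell populations by sex and by age.
SEX_TARGETS = {"Men": 115_485_740, "Women": 122_171_905}
AGE_TARGETS = {"18 to 54": 158_201_364, "55+": 79_456_281}


def rake(sample_counts, targets, max_iter=100, tol=1e-9):
    """Iterative proportional fitting.

    sample_counts: {(sex, age): respondents}
    targets: list of (position in the cell key, {category: population total})
    Returns {(sex, age): weight per respondent}.
    """
    weights = {cell: 1.0 for cell in sample_counts}
    for _ in range(max_iter):
        max_change = 0.0
        for pos, margin in targets:
            for category, target_total in margin.items():
                members = [c for c in sample_counts if c[pos] == category]
                current = sum(sample_counts[c] * weights[c] for c in members)
                factor = target_total / current
                for c in members:
                    weights[c] *= factor
                max_change = max(max_change, abs(factor - 1.0))
        if max_change < tol:  # every margin already matched in this pass
            break
    return weights


raked = rake(SAMPLE, [(0, SEX_TARGETS), (1, AGE_TARGETS)])
for cell, w in raked.items():
    print(cell, f"{w / 1000:,.0f}K")
```

Because raking sees only the margins and not the interlocked sex-by-age populations, the resulting weights can differ from the cell weights computed directly from the full table.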
Age
Age by sex
Age by race/ethnicity
Age by education
Age by region
Sex
Sex by race/ethnicity
Sex by education
Sex by region
Race/ethnicity
Race/ethnicity by education
Race/ethnicity by region
Education
Education by region
Census division
Political party affiliation
Political ideology
Voter registration
Evangelical Christian identification
Raking with Interlocked Variables…
Age
Age by sex
Age by race/ethnicity
Age by education
Age by region
Sex
Sex by race/ethnicity
Sex by education
Sex by region
Race/ethnicity
Race/ethnicity by education
Race/ethnicity by region
Education
Education by region
Census division
Political party affiliation
Political ideology
Voter registration
Evangelical Christian identification
…Using these Interlocked Variables
• “No religious or political questions.”
• “You were all over the place. Starts by asking questions on products and services and ends on political preferences.”
• “Do not ask political questions. It seemed very strange at the end.”
Complaints about Weighting Questions
Political party affiliation
Political ideology
Voter registration
Evangelical Christian identification
Pew Weights Probability Panels Differently than it does Opt-in Panels
• Implicit assumption is that respondents in a demographic group are representative of the people in that group we did not survey
  ▪ Not true for seniors
    • Non-Internet seniors differ on many dimensions from Internet-using seniors
  ▪ Not true for immigrant communities
    • Acculturated vs. unacculturated
Key Assumption Behind Weighting
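A small worked example shows why this assumption matters. Every number below is hypothetical and chosen only for illustration. When everyone surveyed in a cell comes from the online part of that cell, changing the cell's weight changes how much the cell counts in the total, but it cannot move the cell's own estimate toward the people who were never surveyed.

```python
# All figures are hypothetical, chosen only to illustrate the assumption.
online_share = 0.65   # share of a demographic cell that is reachable online
rate_online = 0.80    # outcome rate among the online part of the cell
rate_offline = 0.30   # outcome rate among the offline part (never surveyed)

# What a survey of the whole cell would find.
true_rate = online_share * rate_online + (1 - online_share) * rate_offline


def weighted_cell_estimate(responses, weight):
    """Weighted mean of 0/1 responses when every respondent shares one weight."""
    return sum(weight * r for r in responses) / (weight * len(responses))


# An online-only sample of 20 respondents: 16 "yes" (80%) and 4 "no".
responses = [1] * 16 + [0] * 4
for weight in (1.0, 5.0, 2_420_000.0):
    print(f"weight {weight:>12,.0f} -> cell estimate {weighted_cell_estimate(responses, weight):.3f}")

print(f"rate for the whole cell: {true_rate:.3f}")
```

The cell estimate stays at the online respondents' rate for every weight, while the rate for the whole cell is lower because of the unsurveyed offline members.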
[Chart: % of U.S. adults with Internet access, 2000 to 2018. Series: U.S. adults; Less than $30,000; Less than high school graduate; 65+. Data labels visible on the chart: 61%, 74%, 84%, 89%, 81%, 65%.]
Internet Surveys Decimate the Population
Source: Pew Research
• Some researchers weight convenience samples...
  ▪ In the hope it does no harm
  ▪ In the belief it improves quality
  ▪ Because it redistributes demographics to match target populations
Assumptions Behind Weighting
• Demographic weighted surveys - Reflect the composition of the target audience, often using cells composed of age, gender and region. However, David Yeager and Jon Krosnick determined that demographically weighting non-probability Internet samples to known population values did not consistently produce more representative results.
• Demographic and attitudinal weighted surveys - Some have also used attitudinal questions in their weighting functions, but there has been no academic validation of this.
• Propensity weighted surveys - Propensity score weighting adjusts for the likelihood that respondents are online, based on their demographics.
• Non-parametric weighted surveys - Brian Fine of ORU demonstrated that CART analysis could be used to model representative results by modeling the dependent variable as panelist source. This has not yet been independently validated.
Weighting Recommendations (2014)
Pew Research into Weighting
Wait, Wait
Waiting until the weighting stage to adjust is too late. The combination of coverage error and nonresponse in online panels generally creates a sample that is beyond fixing post hoc. We need to do more at the selection stage.
Reg Baker (2013)
ESOMAR Ambassador
            Men                          Women
18 to 54    79,184,164                   79,017,200
            133 responses (was 169)      132 responses (was 199)
            595K weight                  599K weight
55+         36,301,576                   43,154,705
            61 responses (was 15)        72 responses (was 17)
            595K weight                  599K weight
Quota Sampling
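The quota counts above are what proportional allocation produces: each cell gets completes in proportion to its population, so the resulting weights are nearly equal. A minimal sketch, assuming a total of 398 completes (the sum of the four quota counts on the slide); the function name `proportional_quotas` is illustrative.

```python
# Cell populations from the slide.
POPULATION = {
    ("Men", "18 to 54"): 79_184_164,
    ("Women", "18 to 54"): 79_017_200,
    ("Men", "55+"): 36_301_576,
    ("Women", "55+"): 43_154_705,
}


def proportional_quotas(population, sample_size):
    """Allocate completes to cells in proportion to population.

    Simple rounding is used, so in general the quotas may not sum
    exactly to sample_size.
    """
    total = sum(population.values())
    return {cell: round(sample_size * pop / total)
            for cell, pop in population.items()}


quotas = proportional_quotas(POPULATION, sample_size=398)
for cell, quota in quotas.items():
    weight = POPULATION[cell] / quota  # people represented per respondent
    print(cell, quota, f"{weight / 1000:,.0f}K")
```

With quotas set this way, weighting only has to correct small rounding differences rather than a fivefold imbalance between cells.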
Vendor A
• Sex by age by region
Vendor B
• Sex by age
• Region
Vendor C
• Sex
• Age
• Education
• Census region
• Race/ethnicity
• Population density
Vendor D
• Sex by age
• Sex by education
• Age by education
• Census region
• Race/ethnicity
Common Quota Schemes
• For non-probability samples not using quota sampling:
  ▪ Don’t weight the results
• For non-probability samples using quota sampling:
  ▪ Weight to correct oversamples
  ▪ Weight interim studies using quota sampling to correct for slow-filling cells
  ▪ May not be able to weight crosstabs (technical limitation of many survey packages)
Recommendations
• Include disqualified respondents when screening general population
  ▪ Screener should contain weighting variables
• If you can’t include disqualified respondents, run omnibus questions to estimate population totals
• Note that it can be time-consuming to track down updated benchmarking information; design your questions based on the benchmarks you find
Easily Overlooked Items
Call for Further Research
• Analysis of different weighting schemes, especially political questions and religion for business questions
• Research into weight trimming
• Research into sample balance
• Research into N < 2000 studies
For Further Reading
Jeffrey Henning
Executive Director
Market Research Institute International
jhenning@mrii.org
@jhenning
https://www.linkedin.com/in/jhenning/
Q & A
Ray Poynter
NewMR
Jeffrey Henning
MRII (Market Research Institute International)

More Related Content

What's hot

Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlationdomsr
 
Multinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisMultinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisHARISH Kumar H R
 
Levels of Measurement
Levels of MeasurementLevels of Measurement
Levels of MeasurementSarfraz Ahmad
 
Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )
Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )
Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )Neeraj Bhandari
 
Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)HennaAnsari
 
Missing data and non response pdf
Missing data and non response pdfMissing data and non response pdf
Missing data and non response pdfAnuj Bhatia
 
Correlation VS Causation
Correlation VS CausationCorrelation VS Causation
Correlation VS CausationColleen Carmean
 
Multidimensional scaling1
Multidimensional scaling1Multidimensional scaling1
Multidimensional scaling1Carlo Magno
 
When to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxWhen to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxAsokan R
 
Correspondence analysis final
Correspondence analysis finalCorrespondence analysis final
Correspondence analysis finalsaba khan
 
Lesson 2 stationary_time_series
Lesson 2 stationary_time_seriesLesson 2 stationary_time_series
Lesson 2 stationary_time_seriesankit_ppt
 
Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)Mohammed Musah
 

What's hot (20)

lfstat3e_ppt_01_rev.ppt
lfstat3e_ppt_01_rev.pptlfstat3e_ppt_01_rev.ppt
lfstat3e_ppt_01_rev.ppt
 
Logistic regression sage
Logistic regression sageLogistic regression sage
Logistic regression sage
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
 
Scales of measurement
Scales of measurementScales of measurement
Scales of measurement
 
On Samples And Sampling
On Samples And SamplingOn Samples And Sampling
On Samples And Sampling
 
Multinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisMultinomial Logistic Regression Analysis
Multinomial Logistic Regression Analysis
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
 
Levels of Measurement
Levels of MeasurementLevels of Measurement
Levels of Measurement
 
Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )
Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )
Basic statistics by Neeraj Bhandari ( Surkhet.Nepal )
 
Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)
 
Missing data and non response pdf
Missing data and non response pdfMissing data and non response pdf
Missing data and non response pdf
 
Correlation VS Causation
Correlation VS CausationCorrelation VS Causation
Correlation VS Causation
 
Multidimensional scaling1
Multidimensional scaling1Multidimensional scaling1
Multidimensional scaling1
 
When to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxWhen to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptx
 
Chapter 1: Statistics
Chapter 1: StatisticsChapter 1: Statistics
Chapter 1: Statistics
 
Correspondence analysis final
Correspondence analysis finalCorrespondence analysis final
Correspondence analysis final
 
Multivariate Analysis
Multivariate AnalysisMultivariate Analysis
Multivariate Analysis
 
Lesson 2 stationary_time_series
Lesson 2 stationary_time_seriesLesson 2 stationary_time_series
Lesson 2 stationary_time_series
 
Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)
 
Sampling Distribution
Sampling DistributionSampling Distribution
Sampling Distribution
 

Similar to How to weight online data

The facts about the qcs test o ps and tertiary entrance processes in australi...
The facts about the qcs test o ps and tertiary entrance processes in australi...The facts about the qcs test o ps and tertiary entrance processes in australi...
The facts about the qcs test o ps and tertiary entrance processes in australi...deborahakers
 
Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...
Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...
Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...SharpBrains
 
Fit for Purpose Community Health Surveys: An Experiment in Three Communities
Fit for Purpose Community Health Surveys: An Experiment in Three CommunitiesFit for Purpose Community Health Surveys: An Experiment in Three Communities
Fit for Purpose Community Health Surveys: An Experiment in Three CommunitiesICF
 
Is it Cheating or Group Problem Solving
Is it Cheating or Group Problem SolvingIs it Cheating or Group Problem Solving
Is it Cheating or Group Problem SolvingGreg Friese
 
Collection of data
Collection of dataCollection of data
Collection of dataBaiju KT
 
Rss Oct 2011 Mixed Modes Pres5
Rss Oct 2011 Mixed Modes Pres5Rss Oct 2011 Mixed Modes Pres5
Rss Oct 2011 Mixed Modes Pres5GerryNicolaas
 
probability sampling
probability samplingprobability sampling
probability samplingRoshni Kapoor
 
Is it Cheating or Group Problem Solving?
Is it Cheating or Group Problem Solving?Is it Cheating or Group Problem Solving?
Is it Cheating or Group Problem Solving?Greg Friese
 
Data Inference
Data InferenceData Inference
Data InferenceL H
 
Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...
Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...
Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...Sustainable Brands
 
INET Results-Based Accountability Workshop: May 2, 2014
INET Results-Based Accountability Workshop: May 2, 2014INET Results-Based Accountability Workshop: May 2, 2014
INET Results-Based Accountability Workshop: May 2, 2014Navicate
 
Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...
Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...
Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...Greg Friese
 
Chp12 - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp12  - Research Methods for Business By Authors Uma Sekaran and Roger BougieChp12  - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp12 - Research Methods for Business By Authors Uma Sekaran and Roger BougieHassan Usman
 
Is it Cheating or Group Problem Solving
Is it Cheating or Group Problem Solving Is it Cheating or Group Problem Solving
Is it Cheating or Group Problem Solving Greg Friese
 

Similar to How to weight online data (20)

The facts about the qcs test o ps and tertiary entrance processes in australi...
The facts about the qcs test o ps and tertiary entrance processes in australi...The facts about the qcs test o ps and tertiary entrance processes in australi...
The facts about the qcs test o ps and tertiary entrance processes in australi...
 
Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...
Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...
Inno­v­a­tive part­ner­ships to improve life­long brain health and customer/ ...
 
MD poverty indexes
MD poverty indexesMD poverty indexes
MD poverty indexes
 
LDI Research Seminar 1_28_11- Brian Elbel, PhD, MPH
LDI Research Seminar 1_28_11- Brian Elbel, PhD, MPHLDI Research Seminar 1_28_11- Brian Elbel, PhD, MPH
LDI Research Seminar 1_28_11- Brian Elbel, PhD, MPH
 
Fit for Purpose Community Health Surveys: An Experiment in Three Communities
Fit for Purpose Community Health Surveys: An Experiment in Three CommunitiesFit for Purpose Community Health Surveys: An Experiment in Three Communities
Fit for Purpose Community Health Surveys: An Experiment in Three Communities
 
Is it Cheating or Group Problem Solving
Is it Cheating or Group Problem SolvingIs it Cheating or Group Problem Solving
Is it Cheating or Group Problem Solving
 
Collection of data
Collection of dataCollection of data
Collection of data
 
Rss Oct 2011 Mixed Modes Pres5
Rss Oct 2011 Mixed Modes Pres5Rss Oct 2011 Mixed Modes Pres5
Rss Oct 2011 Mixed Modes Pres5
 
September 2019 Division Meeting
September 2019 Division MeetingSeptember 2019 Division Meeting
September 2019 Division Meeting
 
probability sampling
probability samplingprobability sampling
probability sampling
 
Sampling method
Sampling methodSampling method
Sampling method
 
Is it Cheating or Group Problem Solving?
Is it Cheating or Group Problem Solving?Is it Cheating or Group Problem Solving?
Is it Cheating or Group Problem Solving?
 
What have we learnt from randomized control trials
What have we learnt from randomized control trialsWhat have we learnt from randomized control trials
What have we learnt from randomized control trials
 
Bettinger Keynote: The Difficulty of Knowing and The "E" Word
Bettinger Keynote: The Difficulty of Knowing and The "E" WordBettinger Keynote: The Difficulty of Knowing and The "E" Word
Bettinger Keynote: The Difficulty of Knowing and The "E" Word
 
Data Inference
Data InferenceData Inference
Data Inference
 
Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...
Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...
Greendex 2014 - Consumer Choice and the Environment - A Worldwide Tracking Su...
 
INET Results-Based Accountability Workshop: May 2, 2014
INET Results-Based Accountability Workshop: May 2, 2014INET Results-Based Accountability Workshop: May 2, 2014
INET Results-Based Accountability Workshop: May 2, 2014
 
Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...
Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...
Is It Cheating or Group Problem Solving presented at MN Teaching and Learning...
 
Chp12 - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp12  - Research Methods for Business By Authors Uma Sekaran and Roger BougieChp12  - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp12 - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
 
Is it Cheating or Group Problem Solving
Is it Cheating or Group Problem Solving Is it Cheating or Group Problem Solving
Is it Cheating or Group Problem Solving
 

More from Ray Poynter

The State of AI in Insights and Research 2024: Results and Findings
The State of AI in Insights and Research 2024: Results and FindingsThe State of AI in Insights and Research 2024: Results and Findings
The State of AI in Insights and Research 2024: Results and FindingsRay Poynter
 
ResearchWiseAI - an artificial intelligence driven research data analysis tool
ResearchWiseAI - an artificial intelligence driven research data analysis toolResearchWiseAI - an artificial intelligence driven research data analysis tool
ResearchWiseAI - an artificial intelligence driven research data analysis toolRay Poynter
 
AI-powered interviewing: Best practices from Yasna
AI-powered interviewing: Best practices from YasnaAI-powered interviewing: Best practices from Yasna
AI-powered interviewing: Best practices from YasnaRay Poynter
 
Artificial Intelligence and Qual: The Story So Far
Artificial Intelligence and Qual: The Story So FarArtificial Intelligence and Qual: The Story So Far
Artificial Intelligence and Qual: The Story So FarRay Poynter
 
State of Research Insights in Q1, 2024 from NewMR
State of Research Insights in Q1, 2024 from NewMRState of Research Insights in Q1, 2024 from NewMR
State of Research Insights in Q1, 2024 from NewMRRay Poynter
 
Sudden Death of Beliefs
Sudden Death of BeliefsSudden Death of Beliefs
Sudden Death of BeliefsRay Poynter
 
Uncovering Consumers’ Hidden Narratives
Uncovering Consumers’ Hidden NarrativesUncovering Consumers’ Hidden Narratives
Uncovering Consumers’ Hidden NarrativesRay Poynter
 
Narrative Exploration of New Categories at Mondelēz
Narrative Exploration of New Categories at MondelēzNarrative Exploration of New Categories at Mondelēz
Narrative Exploration of New Categories at MondelēzRay Poynter
 
The Future in Focus
The Future in FocusThe Future in Focus
The Future in FocusRay Poynter
 
The Future in Focus
The Future in FocusThe Future in Focus
The Future in FocusRay Poynter
 
The State of Insights – September 2023
The State of Insights – September 2023The State of Insights – September 2023
The State of Insights – September 2023Ray Poynter
 
Research Thinking in the age of AI
Research Thinking in the age of AIResearch Thinking in the age of AI
Research Thinking in the age of AIRay Poynter
 
How might AI impact Research and Insights over the next two years?
How might AI impact Research and Insights over the next two years?How might AI impact Research and Insights over the next two years?
How might AI impact Research and Insights over the next two years?Ray Poynter
 
From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...
From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...
From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...Ray Poynter
 
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...Ray Poynter
 
Using Generative AI to Assess the Quality of Open-Ended Responses in Surveys
Using Generative AI to Assess the Quality of Open-Ended Responses in SurveysUsing Generative AI to Assess the Quality of Open-Ended Responses in Surveys
Using Generative AI to Assess the Quality of Open-Ended Responses in SurveysRay Poynter
 
Exploring the future of verbatim coding with ChatGPT
Exploring the future of verbatim coding with ChatGPTExploring the future of verbatim coding with ChatGPT
Exploring the future of verbatim coding with ChatGPTRay Poynter
 
Using Generative AI to bring Qualitative Capabilities to Quantitative Surveys
Using Generative AI to bring Qualitative Capabilities to Quantitative SurveysUsing Generative AI to bring Qualitative Capabilities to Quantitative Surveys
Using Generative AI to bring Qualitative Capabilities to Quantitative SurveysRay Poynter
 
How AI / ChatGPT Drives Business Growth
How AI / ChatGPT Drives Business GrowthHow AI / ChatGPT Drives Business Growth
How AI / ChatGPT Drives Business GrowthRay Poynter
 
Tech for tech’s sake? Learnings from experiments with AI in consumer research
Tech for tech’s sake? Learnings from experiments with AI in consumer researchTech for tech’s sake? Learnings from experiments with AI in consumer research
Tech for tech’s sake? Learnings from experiments with AI in consumer researchRay Poynter
 

More from Ray Poynter (20)

The State of AI in Insights and Research 2024: Results and Findings
The State of AI in Insights and Research 2024: Results and FindingsThe State of AI in Insights and Research 2024: Results and Findings
The State of AI in Insights and Research 2024: Results and Findings
 
ResearchWiseAI - an artificial intelligence driven research data analysis tool
ResearchWiseAI - an artificial intelligence driven research data analysis toolResearchWiseAI - an artificial intelligence driven research data analysis tool
ResearchWiseAI - an artificial intelligence driven research data analysis tool
 
AI-powered interviewing: Best practices from Yasna
AI-powered interviewing: Best practices from YasnaAI-powered interviewing: Best practices from Yasna
AI-powered interviewing: Best practices from Yasna
 
Artificial Intelligence and Qual: The Story So Far
Artificial Intelligence and Qual: The Story So FarArtificial Intelligence and Qual: The Story So Far
Artificial Intelligence and Qual: The Story So Far
 
State of Research Insights in Q1, 2024 from NewMR
State of Research Insights in Q1, 2024 from NewMRState of Research Insights in Q1, 2024 from NewMR
State of Research Insights in Q1, 2024 from NewMR
 
Sudden Death of Beliefs
Sudden Death of BeliefsSudden Death of Beliefs
Sudden Death of Beliefs
 
Uncovering Consumers’ Hidden Narratives
Uncovering Consumers’ Hidden NarrativesUncovering Consumers’ Hidden Narratives
Uncovering Consumers’ Hidden Narratives
 
Narrative Exploration of New Categories at Mondelēz
Narrative Exploration of New Categories at MondelēzNarrative Exploration of New Categories at Mondelēz
Narrative Exploration of New Categories at Mondelēz
 
The Future in Focus
The Future in FocusThe Future in Focus
The Future in Focus
 
The Future in Focus
The Future in FocusThe Future in Focus
The Future in Focus
 
The State of Insights – September 2023
The State of Insights – September 2023The State of Insights – September 2023
The State of Insights – September 2023
 
Research Thinking in the age of AI
Research Thinking in the age of AIResearch Thinking in the age of AI
Research Thinking in the age of AI
 
How might AI impact Research and Insights over the next two years?
How might AI impact Research and Insights over the next two years?How might AI impact Research and Insights over the next two years?
How might AI impact Research and Insights over the next two years?
 
From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...
From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...
From Words to Wisdom: Unleashing the Potential of Language Models for Human-C...
 
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
 
Using Generative AI to Assess the Quality of Open-Ended Responses in Surveys
Using Generative AI to Assess the Quality of Open-Ended Responses in SurveysUsing Generative AI to Assess the Quality of Open-Ended Responses in Surveys
Using Generative AI to Assess the Quality of Open-Ended Responses in Surveys
 
Exploring the future of verbatim coding with ChatGPT
Exploring the future of verbatim coding with ChatGPTExploring the future of verbatim coding with ChatGPT
Exploring the future of verbatim coding with ChatGPT
 
Using Generative AI to bring Qualitative Capabilities to Quantitative Surveys
Using Generative AI to bring Qualitative Capabilities to Quantitative SurveysUsing Generative AI to bring Qualitative Capabilities to Quantitative Surveys
Using Generative AI to bring Qualitative Capabilities to Quantitative Surveys
 
How AI / ChatGPT Drives Business Growth
How AI / ChatGPT Drives Business GrowthHow AI / ChatGPT Drives Business Growth
How AI / ChatGPT Drives Business Growth
 
Tech for tech’s sake? Learnings from experiments with AI in consumer research
Tech for tech’s sake? Learnings from experiments with AI in consumer researchTech for tech’s sake? Learnings from experiments with AI in consumer research
Tech for tech’s sake? Learnings from experiments with AI in consumer research
 

Recently uploaded

Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxNikitaBankoti2
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 

How to weight online data

  • 1. Weighting online data Jeffrey Henning Executive Director Market Research Institute International August 2019
  • 4. Learning Objective: Describe the challenges in obtaining representative samples and how representative samples can be improved at the selection stage or through weighting.
      Challenges: Traditionally we have recommended weighting for probability samples. Should we recommend weighting for non-probability samples? When?
  • 5. Weighting
      • Post-stratification weighting is viewed as a common solution to removing sampling bias.
      • But it is often misrepresented as a simple process of arithmetic…
  • 6. Cell Weighting (U.S. population; responses; weight per response)
      Men, 18 to 54: 79,184,164; 169 responses; 469K weight
      Women, 18 to 54: 79,017,200; 199 responses; 397K weight
      Men, 55+: 36,301,576; 15 responses; 2,420K weight
      Women, 55+: 43,154,705; 17 responses; 2,539K weight
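The arithmetic behind the Cell Weighting slide can be sketched as follows. This is a minimal illustration, assuming each weight is simply the cell's population divided by its number of responses; the population and response counts come from the slide, while the names `cells` and `cell_weights` are hypothetical.

```python
# Cell weighting sketch: weight = population in cell / responses in cell.
# Population and response counts are copied from the Cell Weighting slide.
cells = {
    ("Men", "18 to 54"): (79_184_164, 169),
    ("Women", "18 to 54"): (79_017_200, 199),
    ("Men", "55+"): (36_301_576, 15),
    ("Women", "55+"): (43_154_705, 17),
}

def cell_weights(cells):
    """Return how many people each response stands for, per cell."""
    return {cell: population / responses
            for cell, (population, responses) in cells.items()}

weights = cell_weights(cells)
for cell, weight in weights.items():
    # Print in thousands, matching the slide's "K" notation.
    print(cell, f"{weight / 1000:,.0f}K")
```

Each 55+ response ends up standing for roughly five to six times as many people as an 18-to-54 response, which is why the arithmetic alone does not settle whether the weighting is sound.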
  • 7. Editorial Judgments
      • Do you want to set minimum and maximum weights? If so, to what? Why?
      • How will you simplify the weighting scheme if sample balance is too low (e.g., <70%)?
      • Which questions to weight on?
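The first two judgments can be made concrete with a small sketch. It assumes that "sample balance" refers to a weighting-efficiency measure such as Kish's, (sum of weights)² / (n × sum of squared weights); the slides do not define the term, so that reading is an assumption. The trimming bounds of 0.3 and 3.0 are hypothetical, and the per-respondent weights are the rounded cell weights from the Cell Weighting slide.

```python
def normalize(weights):
    """Rescale weights so they average 1.0."""
    mean = sum(weights) / len(weights)
    return [w / mean for w in weights]

def weighting_efficiency(weights):
    """Kish efficiency: (sum w)^2 / (n * sum w^2); 1.0 means all weights are equal."""
    n = len(weights)
    return sum(weights) ** 2 / (n * sum(w * w for w in weights))

def trim(weights, low, high):
    """Clamp normalized weights to [low, high], then rescale to mean 1.0.

    Rescaling can push some weights slightly past the bounds again;
    production procedures often repeat the two steps until stable.
    """
    clamped = [min(max(w, low), high) for w in normalize(weights)]
    return normalize(clamped)

# One weight per respondent, using the cell weights and response counts from the slide.
raw = [468_545] * 169 + [397_071] * 199 + [2_420_105] * 15 + [2_538_512] * 17

before = weighting_efficiency(raw)
after = weighting_efficiency(trim(raw, low=0.3, high=3.0))  # hypothetical bounds
print(f"efficiency before trimming: {before:.0%}, after: {after:.0%}")
```

Under these assumptions the untrimmed efficiency comes out at about 53%, below the 70% example threshold on the slide. Trimming raises it, but the trimmed sample no longer matches the population targets exactly, which is the trade-off the slide asks about.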
  • 9. Raking with Interlocked Variables…
      Age; Age by sex; Age by race/ethnicity; Age by education; Age by region; Sex; Sex by race/ethnicity; Sex by education; Sex by region; Race/ethnicity; Race/ethnicity by education; Race/ethnicity by region; Education; Education by region; Census division; Political party affiliation; Political ideology; Voter registration; Evangelical Christian identification
  • 10. …Using these Interlocked Variables
      Age; Age by sex; Age by race/ethnicity; Age by education; Age by region; Sex; Sex by race/ethnicity; Sex by education; Sex by region; Race/ethnicity; Race/ethnicity by education; Race/ethnicity by region; Education; Education by region; Census division; Political party affiliation; Political ideology; Voter registration; Evangelical Christian identification
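For readers unfamiliar with raking, here is a minimal sketch of iterative proportional fitting on two marginal variables. It is not the scheme shown on the slides: it uses only sex and age margins, with respondent counts and population figures taken from the Cell Weighting slide, and the function name `rake` and the record layout are hypothetical. An interlocked variable such as "age by sex" would be handled the same way, as one variable whose categories are the age-sex combinations.

```python
def rake(respondents, targets, iterations=50):
    """Iterative proportional fitting: adjust weights one variable at a time
    until the weighted shares match each variable's target shares."""
    weights = [1.0] * len(respondents)
    for _ in range(iterations):
        for variable, shares in targets.items():
            total = sum(weights)
            # Current weighted total for each category of this variable.
            current = {category: 0.0 for category in shares}
            for person, w in zip(respondents, weights):
                current[person[variable]] += w
            # Scale each respondent so their category hits its target share.
            weights = [
                w * shares[person[variable]] * total / current[person[variable]]
                for person, w in zip(respondents, weights)
            ]
    return weights

# Hypothetical respondent records built from the slide's response counts.
counts = {("Men", "18 to 54"): 169, ("Women", "18 to 54"): 199,
          ("Men", "55+"): 15, ("Women", "55+"): 17}
respondents = [{"sex": sex, "age": age}
               for (sex, age), n in counts.items() for _ in range(n)]

# Marginal target shares derived from the slide's population figures.
population = 237_657_645
targets = {
    "sex": {"Men": 115_485_740 / population, "Women": 122_171_905 / population},
    "age": {"18 to 54": 158_201_364 / population, "55+": 79_456_281 / population},
}

weights = rake(respondents, targets)
```

Unlike cell weighting, raking only needs the marginal totals for each variable, which is what makes long variable lists like the one above feasible; the cost is that joint distributions are matched only for variables that are explicitly interlocked.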
  • 11. Complaints about Weighting Questions
      • “No religious or political questions.”
      • “You were all over the place. Starts by asking questions on products and services and ends on political preferences.”
      • “Do not ask political questions. It seemed very strange at the end.”
      (Political party affiliation; Political ideology; Voter registration; Evangelical Christian identification)
  • 12. Pew Weights Probability Panels Differently than it does Opt-in Panels
  • 13. Key Assumption Behind Weighting
      • The implicit assumption is that respondents in a demographic group are representative of the people in that group we did not survey
          § Not true for seniors: non-Internet seniors differ on many dimensions from Internet-using seniors
          § Not true for immigrant communities: acculturated vs. unacculturated
  • 14. Internet Surveys Decimate the Population
      [Chart: % of U.S. adults with Internet access, 2000 to 2018. Series: U.S. adults; Less than $30,000; Less than high school graduate; 65+. Data labels shown: 61%, 74%, 84%, 89%, 81%, 65%.]
      Source: Pew Research
  • 15. Assumptions Behind Weighting
      • Some researchers weight convenience samples...
          § In the hope it does no harm
          § In the belief it improves quality
          § For the fact it redistributes demographics to match target populations
  • 16. Weighting Recommendations (2014)
      • Demographic weighted surveys - Reflect the composition of the target audience, often using cells composed of age, gender and region. However, David Yeager and Jon Krosnick determined that demographically weighting non-probability Internet samples to known population values did not consistently produce more representative results.
      • Demographic and attitudinal weighted surveys - Some have also used attitudinal questions in their weighting functions, but there has been no academic validation of this.
      • Propensity weighted surveys - Propensity score weighting adjusts for the likelihood of respondents to be online based on their demographics.
      • Non-parametric weighted surveys - Brian Fine of ORU demonstrated that CART analysis could be used to model representative results by modeling the dependent variable as panelist source. This has not yet been independently validated.
  • 17. Pew Research into Weighting
  • 19. Wait, Wait
      “Waiting until the weighting stage to adjust is too late. The combination of coverage error and nonresponse in online panels generally creates a sample that is beyond fixing post hoc. We need to do more at the selection stage.”
      Reg Baker (2013), ESOMAR Ambassador
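  • 20. Quota Sampling (U.S. population; responses, with the earlier unquota'd count in parentheses; weight per response)
      Men, 18 to 54: 79,184,164; 133 responses (was 169); 595K weight
      Women, 18 to 54: 79,017,200; 132 responses (was 199); 599K weight
      Men, 55+: 36,301,576; 61 responses (was 15); 595K weight
      Women, 55+: 43,154,705; 72 responses (was 17); 599K weight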
  • 21. Common Quota Schemes
      • Vendor A: Sex by age by region
      • Vendor B: Sex by age; Region
      • Vendor C: Sex; Age; Education; Census region; Race/ethnicity; Population density
      • Vendor D: Sex by age; Sex by education; Age by education; Census region; Race/ethnicity
  • 22. Recommendations
      • For non-probability samples not using quota sampling:
          § Don’t weight the results
      • For non-probability samples using quota sampling:
          § Weight to correct oversamples
          § Weight interim studies using quota sampling to correct for slow-filling cells
          § May not be able to weight crosstabs (technical limitation of many survey packages)
  • 23. Easily Overlooked Items
      • Include disqualified respondents when screening the general population
          § The screener should contain the weighting variables
      • If you can’t include disqualified respondents, run omnibus questions to estimate population totals
      • Note that it can be time-consuming to track down updated benchmarking information; design your questions based on the benchmarks you find
  • 24. Call for Further Research
      • Analysis of different weighting schemes, especially political questions and religion for business questions
      • Research into weight trimming
      • Research into sample balance
      • Research into N < 2000 studies
  • 26. Jeffrey Henning Executive Director Market Research Institute International jhenning@mrii.org @jhenning https://www.linkedin.com/in/jhenning/
  • 28. August 2019 Q & A Ray Poynter NewMR Jeffrey Henning MRII (Market Research Institute International)