SlideShare a Scribd company logo
1 of 19
Propensity-Score Matching
(PSM)
Misgina Asmelash, Ph.D.
in Agricultural Economics and Management
YOM Institute of Economic Development
Nov 02, 2018
Propensity-Score Matching
(PSM)
The problem with matching is to credibly identify groups that look alike. Identification is a problem
because even if households are matched along a vector, X, of different characteristics, one would
rarely find two households that are exactly similar to each other in terms of many characteristics.
Because many possible characteristics exist, a common way of matching households is
propensity score matching.
Propensity-Score Matching (PSM): Overview
 Propensity score matching: constructs a statistical comparison group that
is based on a model of the probability of participating in the treatment, using
observed characteristics.
 Participants are then matched on the basis of this probability, or propensity
score, to nonparticipants.
 Match treated and untreated observations on the estimated probability of
being treated (propensity score).
 Different approaches are used to match participants and nonparticipants on
the basis of the propensity score.
Propensity-Score Matching (PSM): Overview
 Include nearest-neighbour (NN), matching, caliper and radius matching,
stratification and interval matching, kernel matching and local linear
matching (LLM).
 In PSM, each participant is matched to a nonparticipant on the basis of
a single propensity score, reflecting the probability of participating
conditional on their different observed characteristics X.
 Most commonly used:
 Match on the basis of the propensity score P(X) = Pr (Y=1|X)
 Y indicates participation in project
 Instead of attempting to create a match for each participant with exactly the same
value of X, we can instead match on the probability of participation.
PSM: Key Assumptions
 The validity of PSM depends on two conditions:
 (a) conditional independence (namely, that unobserved factors do not affect
participation); Means participation is independent of outcomes conditional on Xi.
 This is false if there are unobserved outcomes affecting participation.
 (b) sizable common support or overlap in propensity scores across the
participant and nonparticipant samples.
 Enables matching not just at the mean but balances the distribution of
observed characteristics across treatment and control.
Density
0 1
Propensity score
Region of
common
support
Density of scores for
participants
High probability of
participating given X
Density of scores
for non-
participants
Common support
Common Support
Common Support
Range of common support
Nearest Neighbor Matching
Which Neighbor?
Which Neighbor?
Caliper Matching
Kernel Matching
Kernel Matching
Steps in Score Matching
1. Need representative and comparable data for both treatment and
comparison groups.
2. Use a logit (or other discrete choice model) to estimate program
participations as a function of observable characteristics.
3. Use predicted values from logit to generate propensity score p(xi) for all
treatment and comparison group members
4. Match Pairs:
 Restrict sample to common support (as in Figure).
 Need to determine a tolerance limit: using matching methods, Such as:
 Nearest neighbors (NN), caliper and radius matching, kernel matching and
weighting by the propensity score.
5. Once matches are made, we can calculate impact by comparing the
means of outcomes across participants and their matched pairs.
PSM vs. Randomization
 Randomization does not require the untestable assumption of
independence conditional on observables.
 PSM requires large samples and good data:
1. Ideally, the same data source is used for participants and non-
participants
2. Participants and non-participants have access to similar institutions
and markets, and
3. The data include X variables capable of identifying program
participation and outcomes.
Lessons on Matching Methods
 Typically used when neither randomization or other quasi experimental
options are not possible.
 Case 1: no baseline. Can do ex-post matching
 Dangers of ex-post matching:
 Matching on variables that change due to participation (i.e., endogenous)
 Matching helps control only for OBSERVABLE differences, not
unobservable differences.
 Matching becomes much better in combination with other techniques, such
as:
 Exploiting baseline data for matching and using difference-in-difference
strategy
 If an assignment rule exists for project, can match on this rule
 Need good quality data: common support can be a problem if two groups
are very different.
Design When to use Advantages Disadvantages
Randomization Whenever feasible
When there is variation at the
individual or community level
Gold standard
Most powerful
Not always feasible
Not always ethical
Randomized
Encouragement
Design
When an intervention is
universally implemented
 Provides exogenous
variation for a subset of
beneficiaries
Only looks at sub-group of
sample
Power of encouragement
design only known ex post
Regression
Discontinuity
If an intervention has a clear,
sharp assignment rule
 Project beneficiaries often
must qualify through
established criteria
Only look at sub-group of
sample
Assignment rule in practice
often not implemented
strictly
Difference-in-
Differences
If two groups are growing at
similar rates
 Baseline and follow-up data
are available
Eliminates fixed differences
not related to treatment
Can be biased if trends
change
Ideally have 2 pre-
intervention periods of data
Matching  When other methods are not
possible
Overcomes observed
differences between
treatment and comparison
Assumes no unobserved
differences (often
implausible)
Summary…

More Related Content

Similar to Matching_Methods.ppt

Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
IJECEIAES
 
25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt
PriyankaSharma89719
 
Face Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity EssayFace Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity Essay
Deb Birch
 
JSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerJSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzer
Dennis Sweitzer
 
Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06
Joe Gricar, MS
 
Multivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfMultivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdf
bkbk37
 
Data analysis powerpoint
Data analysis powerpointData analysis powerpoint
Data analysis powerpoint
jamiebrandon
 
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
priestmanmable
 

Similar to Matching_Methods.ppt (20)

SENIOR COMP FINAL
SENIOR COMP FINALSENIOR COMP FINAL
SENIOR COMP FINAL
 
Propensity Scores in Medical Device Trials
Propensity Scores in Medical Device TrialsPropensity Scores in Medical Device Trials
Propensity Scores in Medical Device Trials
 
Brm unit iv - cheet sheet
Brm   unit iv - cheet sheetBrm   unit iv - cheet sheet
Brm unit iv - cheet sheet
 
Selection of appropriate data analysis technique
Selection of appropriate data analysis techniqueSelection of appropriate data analysis technique
Selection of appropriate data analysis technique
 
statistical analysis.pptx
statistical analysis.pptxstatistical analysis.pptx
statistical analysis.pptx
 
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
 
25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt
 
Advanced statistics
Advanced statisticsAdvanced statistics
Advanced statistics
 
Patton1990
Patton1990Patton1990
Patton1990
 
EJ1241940.pdf
EJ1241940.pdfEJ1241940.pdf
EJ1241940.pdf
 
Face Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity EssayFace Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity Essay
 
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual EventKey Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
 
Discriminate analysis
Discriminate analysisDiscriminate analysis
Discriminate analysis
 
Nbe rcausalpredictionv111 lecture2
Nbe rcausalpredictionv111 lecture2Nbe rcausalpredictionv111 lecture2
Nbe rcausalpredictionv111 lecture2
 
JSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerJSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzer
 
Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06
 
Multivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfMultivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdf
 
Data analysis powerpoint
Data analysis powerpointData analysis powerpoint
Data analysis powerpoint
 
AAPOR 2016 - Dutwin and Buskirk - Apples to Oranges
AAPOR 2016 - Dutwin and Buskirk - Apples to OrangesAAPOR 2016 - Dutwin and Buskirk - Apples to Oranges
AAPOR 2016 - Dutwin and Buskirk - Apples to Oranges
 
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
 

More from Abebaw31 (11)

Project management course for MSc STUDENTS CHAPTER 1
Project management course for MSc STUDENTS CHAPTER 1Project management course for MSc STUDENTS CHAPTER 1
Project management course for MSc STUDENTS CHAPTER 1
 
Project management course for MSC students Chapter 4 and 5
Project management course for MSC students Chapter 4 and 5Project management course for MSC students Chapter 4 and 5
Project management course for MSC students Chapter 4 and 5
 
Risk management Chapter 2 for project monument course for MSC students
Risk management Chapter 2 for project monument course for MSC studentsRisk management Chapter 2 for project monument course for MSC students
Risk management Chapter 2 for project monument course for MSC students
 
Yom Institute Presentations lecture notes of project management
Yom Institute Presentations lecture notes of project managementYom Institute Presentations lecture notes of project management
Yom Institute Presentations lecture notes of project management
 
Yom Institute Presentation project management course
Yom Institute Presentation project management courseYom Institute Presentation project management course
Yom Institute Presentation project management course
 
Project management course lecture Notes from Yom college
Project management course lecture Notes from Yom collegeProject management course lecture Notes from Yom college
Project management course lecture Notes from Yom college
 
Chapter-3.pdf
Chapter-3.pdfChapter-3.pdf
Chapter-3.pdf
 
Chapter-4.pdf
Chapter-4.pdfChapter-4.pdf
Chapter-4.pdf
 
Essental of Project Management Lecture Slide.pdf
Essental of Project Management Lecture Slide.pdfEssental of Project Management Lecture Slide.pdf
Essental of Project Management Lecture Slide.pdf
 
2. PMBOK 5TH EDITION.pdf
2. PMBOK 5TH EDITION.pdf2. PMBOK 5TH EDITION.pdf
2. PMBOK 5TH EDITION.pdf
 
የማሽከርከር ህግ.pdf
የማሽከርከር ህግ.pdfየማሽከርከር ህግ.pdf
የማሽከርከር ህግ.pdf
 

Recently uploaded

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
HenryBriggs2
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 

Recently uploaded (20)

Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfRums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdf
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 

Matching_Methods.ppt

  • 1. Propensity-Score Matching (PSM) Misgina Asmelash, Ph.D. in Agricultural Economics and Management YOM Institute of Economic Development Nov 02, 2018
  • 2. Propensity-Score Matching (PSM) The problem with matching is to credibly identify groups that look alike. Identification is a problem because even if households are matched along a vector, X, of different characteristics, one would rarely find two households that are exactly similar to each other in terms of many characteristics. Because many possible characteristics exist, a common way of matching households is propensity score matching.
  • 3. Propensity-Score Matching (PSM): Overview  Propensity score matching: constructs a statistical comparison group that is based on a model of the probability of participating in the treatment, using observed characteristics.  Participants are then matched on the basis of this probability, or propensity score, to nonparticipants.  Match treated and untreated observations on the estimated probability of being treated (propensity score).  Different approaches are used to match participants and nonparticipants on the basis of the propensity score.
  • 4. Propensity-Score Matching (PSM): Overview  Include nearest-neighbour (NN), matching, caliper and radius matching, stratification and interval matching, kernel matching and local linear matching (LLM).  In PSM, each participant is matched to a nonparticipant on the basis of a single propensity score, reflecting the probability of participating conditional on their different observed characteristics X.  Most commonly used:  Match on the basis of the propensity score P(X) = Pr (Y=1|X)  Y indicates participation in project  Instead of attempting to create a match for each participant with exactly the same value of X, we can instead match on the probability of participation.
  • 5. PSM: Key Assumptions  The validity of PSM depends on two conditions:  (a) conditional independence (namely, that unobserved factors do not affect participation); Means participation is independent of outcomes conditional on Xi.  This is false if there are unobserved outcomes affecting participation.  (b) sizable common support or overlap in propensity scores across the participant and nonparticipant samples.  Enables matching not just at the mean but balances the distribution of observed characteristics across treatment and control.
  • 6. Density 0 1 Propensity score Region of common support Density of scores for participants High probability of participating given X Density of scores for non- participants Common support
  • 9. Range of common support
  • 16. Steps in Score Matching 1. Need representative and comparable data for both treatment and comparison groups. 2. Use a logit (or other discrete choice model) to estimate program participations as a function of observable characteristics. 3. Use predicted values from logit to generate propensity score p(xi) for all treatment and comparison group members 4. Match Pairs:  Restrict sample to common support (as in Figure).  Need to determine a tolerance limit: using matching methods, Such as:  Nearest neighbors (NN), caliper and radius matching, kernel matching and weighting by the propensity score. 5. Once matches are made, we can calculate impact by comparing the means of outcomes across participants and their matched pairs.
  • 17. PSM vs. Randomization  Randomization does not require the untestable assumption of independence conditional on observables.  PSM requires large samples and good data: 1. Ideally, the same data source is used for participants and non- participants 2. Participants and non-participants have access to similar institutions and markets, and 3. The data include X variables capable of identifying program participation and outcomes.
  • 18. Lessons on Matching Methods  Typically used when neither randomization or other quasi experimental options are not possible.  Case 1: no baseline. Can do ex-post matching  Dangers of ex-post matching:  Matching on variables that change due to participation (i.e., endogenous)  Matching helps control only for OBSERVABLE differences, not unobservable differences.  Matching becomes much better in combination with other techniques, such as:  Exploiting baseline data for matching and using difference-in-difference strategy  If an assignment rule exists for project, can match on this rule  Need good quality data: common support can be a problem if two groups are very different.
  • 19. Design When to use Advantages Disadvantages Randomization Whenever feasible When there is variation at the individual or community level Gold standard Most powerful Not always feasible Not always ethical Randomized Encouragement Design When an intervention is universally implemented  Provides exogenous variation for a subset of beneficiaries Only looks at sub-group of sample Power of encouragement design only known ex post Regression Discontinuity If an intervention has a clear, sharp assignment rule  Project beneficiaries often must qualify through established criteria Only look at sub-group of sample Assignment rule in practice often not implemented strictly Difference-in- Differences If two groups are growing at similar rates  Baseline and follow-up data are available Eliminates fixed differences not related to treatment Can be biased if trends change Ideally have 2 pre- intervention periods of data Matching  When other methods are not possible Overcomes observed differences between treatment and comparison Assumes no unobserved differences (often implausible) Summary…