SlideShare a Scribd company logo
INDIA HUMAN
DEVELOPMENT SURVEY
(IHDS)
TRAINING PROGRAM
MARCH 16, 2016
How to merge two rounds?
Merging Household Files
Relationship between IHDS-I
and IHDS-II households
IHDS-I sample
(N=41,554)
Replacement
households in
IHDS-II (N=2,134)
Split households
from round 1
(N=5,397)
Reinterview
Households
(N=34,621)
Attrition (N=6,911)
 Most important
concept in merging
two data files
1. Some households in
round 1 with no
match in round 2
and vice versa
2. Households in
round 1 match with
more than 1
household in round
2
Any questions?
 Who were chosen for reinterview?
 Recontact rate of 83%? What does it mean?
 How were replacement households chosen?
 What is a split household?
What is needed to merge
household files?
1. Round 1 household file – N=41,554
2. Round 2 household file – N=42,152
 (Why are there more cases in round 2?)
3. Linking file – N=42,152 – gives Round 1
identification codes for all Round 2
households that were reinterviewed, missing
linking codes for 2,134 households that are
new
Step 1 – Link round 2 data to
linking file to get round 1 ID
 use linkhh, clear
 sort STATEID DISTID PSUID HHID
HHSPLITID
 merge 1:1 STATEID DISTID PSUID HHID
HHSPLITID using round2HH
 sort STATEID DISTID PSUID HHID2005
HHSPLITID2005, gen(_mergeR2link)
 save round2HH_plus, replace
Step 2-Merge this Round 2+ file
with Round 1 file
 use round1HH
 rename HHID HHID2005
 rename HHSPLITID HHSPLITID2005
 sort STATEID DISTID PSUID HHID2005
HHSPLITID2005
 merge 1:m STATEID DISTID PSUID HHID2005
HHSPLITID2005 using round2HH_plus,
gen(_mergeR1R2)
 sort STATEID DISTID PSUID HHID HHSPLITID
 save mergedHHR1R2, replace
Cases in Merged file is superset
 Households surveyed in both rounds N=40,018
 Households surveyed in round 1 only (attrition)
N=6,911
 Households surveyd in round 2 only
(replacement) N=2,134
 Total N=49,063
 Keep only _mergeR1R2==3 for panel analysis
(N=40,018)
Merging Individual Files
Relationship between IHDS-I
and IHDS-II individuals
IHDS-I sample
(N=215,754)
New
individulas, new
HH (N=9,760)
New Ind in R1
HH (N=43,822)
Reinterview Ind
(N=150,995)
HH attrition
(N=29,299)
Ind. attrition in
interview hh
(N=35,464)
 Most important
concept in merging
two data files
1. Even reinterview
households have
new members
(births, marriages)
2. Even reinterview
households have
some members who
are no longer there
(deaths, marriages,
migration)
What is needed to merge
individual files?
1. Round 1 household file – N=215,754
2. Round 2 household file – N=204,568
 (Why are there more cases in round 2?)
3. Linking file – N=204,568 – gives Round 1
identification codes for all Round 2
households that were reinterviewed, missing
linking codes for 2,134 households that are
new
Step 1 – Link round 2 data to
linking file to get round 1 ID
 use linkind, clear
 sort STATEID DISTID PSUID HHID
HHSPLITID PERSONID
 merge 1:1 STATEID DISTID PSUID HHID
HHSPLITID PERONID using round2IND
 sort STATEID DISTID PSUID HHID2005
HHSPLITID2005, gen(_mergeR2link)
 save round2IND_plus, replace
Step 2-Merge this Round 2+ file
with Round 1 file
 use round1IND
 rename HHID HHID2005
 rename HHSPLITID HHSPLITID2005
 rename PERSONID PERSONID2005
 sort STATEID DISTID PSUID HHID2005
HHSPLITID2005 PERSONID2005
 merge 1:m STATEID DISTID PSUID HHID2005
HHSPLITID2005 PERSONID2005 using
round2IND_plus, gen(_mergeR1R2)
 sort STATEID DISTID PSUID HHID HHSPLITID
 save mergedINDR1R2, replace
Cases in Merged file is superset
 Individuals surveyed in both rounds N=150,988
 Individuals surveyed in round 1 only
(attrition/death/migration) N=64,766
 Individuals surveyd in round 2 only
(replacement/new) N=53,580
 Total N=269,334
 Keep only _mergeR1R2==3 for panel analysis
(N=150,988)
Evermarried woman file
linkage
Same process as individual file
linkage
 But only one thing to note, there was no ever
married woman file for 2004-5 so you will be
merging with the household file from 2004-5
Merging Caution
Merging overwrites variables
 So if you want to keep variables from round 1
and round 2 separate, before merging you may
want to rename all round 1 variables
 Typically we use the command
 Rename * x*
 Rename xSTATEID STATEID et. For merging
 So xr05 will be age in 20045 and r05 will be
age in 2011-12

More Related Content

What's hot

Theories of Economic Development
Theories of Economic DevelopmentTheories of Economic Development
Theories of Economic Development
Vitor Vieira Vasconcelos
 
Development of knowledge test and some other tests used in social science res...
Development of knowledge test and some other tests used in social science res...Development of knowledge test and some other tests used in social science res...
Development of knowledge test and some other tests used in social science res...
Sukanya Barua
 
Input – output model of economic development
Input – output model of economic developmentInput – output model of economic development
Input – output model of economic development
Ravi Varma reddy
 
Econometrics ch3
Econometrics ch3Econometrics ch3
Econometrics ch3
Baterdene Batchuluun
 
Harris-Todaro Migration Model and it's Applicability in Bangladesh
Harris-Todaro Migration Model and it's Applicability in BangladeshHarris-Todaro Migration Model and it's Applicability in Bangladesh
Harris-Todaro Migration Model and it's Applicability in Bangladesh
Mohaiminul Islam
 
Input output overview-for-mba-ii-sem
Input output overview-for-mba-ii-semInput output overview-for-mba-ii-sem
Input output overview-for-mba-ii-sem
Rahul Singh
 
Characteristics of underdeveloped economies
Characteristics of underdeveloped economiesCharacteristics of underdeveloped economies
Characteristics of underdeveloped economies
Georgi Mathew
 
Production function analysis
Production function analysisProduction function analysis
Production function analysis
Vaibhav verma
 
Chap14 multiple regression model building
Chap14 multiple regression model buildingChap14 multiple regression model building
Chap14 multiple regression model building
Uni Azza Aunillah
 
Estimation of Elasticities of Substitution for CES and VES Production Functions
Estimation of Elasticities of Substitution for CES and VES Production FunctionsEstimation of Elasticities of Substitution for CES and VES Production Functions
Estimation of Elasticities of Substitution for CES and VES Production Functions
idspak
 
The chi – square test
The chi – square testThe chi – square test
The chi – square test
Majesty Ortiz
 
Lewis model & rastow stages
Lewis model & rastow stagesLewis model & rastow stages
Lewis model & rastow stages
Naseem Ch
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
Madurai Kamaraj University
 
Schultz’s transformation of traditional agriculture
Schultz’s transformation of traditional agricultureSchultz’s transformation of traditional agriculture
Schultz’s transformation of traditional agriculture
Vaibhav verma
 
LEAST COST METHOD
LEAST COST METHOD LEAST COST METHOD
LEAST COST METHOD
VishalHotchandani2
 
Solow Growth Model
Solow Growth ModelSolow Growth Model
Solow Growth Model
Жигжид
 
Input output analysis by roni bhowmik
Input output analysis by roni bhowmikInput output analysis by roni bhowmik
Input output analysis by roni bhowmik
Roni Bhowmik
 
Rural – urban migration
Rural – urban migrationRural – urban migration
Rural – urban migration
Vipin Valiyattoor
 
F test
F testF test
Analysis Of Variance - ANOVA
Analysis Of Variance - ANOVAAnalysis Of Variance - ANOVA
Analysis Of Variance - ANOVA
Saumya Bhatnagar
 

What's hot (20)

Theories of Economic Development
Theories of Economic DevelopmentTheories of Economic Development
Theories of Economic Development
 
Development of knowledge test and some other tests used in social science res...
Development of knowledge test and some other tests used in social science res...Development of knowledge test and some other tests used in social science res...
Development of knowledge test and some other tests used in social science res...
 
Input – output model of economic development
Input – output model of economic developmentInput – output model of economic development
Input – output model of economic development
 
Econometrics ch3
Econometrics ch3Econometrics ch3
Econometrics ch3
 
Harris-Todaro Migration Model and it's Applicability in Bangladesh
Harris-Todaro Migration Model and it's Applicability in BangladeshHarris-Todaro Migration Model and it's Applicability in Bangladesh
Harris-Todaro Migration Model and it's Applicability in Bangladesh
 
Input output overview-for-mba-ii-sem
Input output overview-for-mba-ii-semInput output overview-for-mba-ii-sem
Input output overview-for-mba-ii-sem
 
Characteristics of underdeveloped economies
Characteristics of underdeveloped economiesCharacteristics of underdeveloped economies
Characteristics of underdeveloped economies
 
Production function analysis
Production function analysisProduction function analysis
Production function analysis
 
Chap14 multiple regression model building
Chap14 multiple regression model buildingChap14 multiple regression model building
Chap14 multiple regression model building
 
Estimation of Elasticities of Substitution for CES and VES Production Functions
Estimation of Elasticities of Substitution for CES and VES Production FunctionsEstimation of Elasticities of Substitution for CES and VES Production Functions
Estimation of Elasticities of Substitution for CES and VES Production Functions
 
The chi – square test
The chi – square testThe chi – square test
The chi – square test
 
Lewis model & rastow stages
Lewis model & rastow stagesLewis model & rastow stages
Lewis model & rastow stages
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Schultz’s transformation of traditional agriculture
Schultz’s transformation of traditional agricultureSchultz’s transformation of traditional agriculture
Schultz’s transformation of traditional agriculture
 
LEAST COST METHOD
LEAST COST METHOD LEAST COST METHOD
LEAST COST METHOD
 
Solow Growth Model
Solow Growth ModelSolow Growth Model
Solow Growth Model
 
Input output analysis by roni bhowmik
Input output analysis by roni bhowmikInput output analysis by roni bhowmik
Input output analysis by roni bhowmik
 
Rural – urban migration
Rural – urban migrationRural – urban migration
Rural – urban migration
 
F test
F testF test
F test
 
Analysis Of Variance - ANOVA
Analysis Of Variance - ANOVAAnalysis Of Variance - ANOVA
Analysis Of Variance - ANOVA
 

Viewers also liked

Merging for ihds.info
Merging for ihds.infoMerging for ihds.info
Merging for ihds.info
Shantanu Mishra
 
Merging files (Data Structure)
Merging files (Data Structure)Merging files (Data Structure)
Merging files (Data Structure)
Tech_MX
 
Hashing PPT
Hashing PPTHashing PPT
Hashing PPT
Saurabh Kumar
 
Merging
Merging Merging
Merging
Shantanu Mishra
 
Algorithms for External Memory Sorting
Algorithms for External Memory SortingAlgorithms for External Memory Sorting
Algorithms for External Memory Sorting
Milind Gokhale
 
3.9 external sorting
3.9 external sorting3.9 external sorting
3.9 external sorting
Krish_ver2
 
Hashing
HashingHashing
Hashing
grahamwell
 
Ch17 Hashing
Ch17 HashingCh17 Hashing
Ch17 Hashing
leminhvuong
 
Sorting algorithms
Sorting algorithmsSorting algorithms
Sorting algorithms
Maher Alshammari
 
Hashing Techniques in Data Structures Part2
Hashing Techniques in Data Structures Part2Hashing Techniques in Data Structures Part2
Hashing Techniques in Data Structures Part2
SHAKOOR AB
 
Hashing
HashingHashing
Hashing
Ghaffar Khan
 
Hashing Technique In Data Structures
Hashing Technique In Data StructuresHashing Technique In Data Structures
Hashing Technique In Data Structures
SHAKOOR AB
 

Viewers also liked (12)

Merging for ihds.info
Merging for ihds.infoMerging for ihds.info
Merging for ihds.info
 
Merging files (Data Structure)
Merging files (Data Structure)Merging files (Data Structure)
Merging files (Data Structure)
 
Hashing PPT
Hashing PPTHashing PPT
Hashing PPT
 
Merging
Merging Merging
Merging
 
Algorithms for External Memory Sorting
Algorithms for External Memory SortingAlgorithms for External Memory Sorting
Algorithms for External Memory Sorting
 
3.9 external sorting
3.9 external sorting3.9 external sorting
3.9 external sorting
 
Hashing
HashingHashing
Hashing
 
Ch17 Hashing
Ch17 HashingCh17 Hashing
Ch17 Hashing
 
Sorting algorithms
Sorting algorithmsSorting algorithms
Sorting algorithms
 
Hashing Techniques in Data Structures Part2
Hashing Techniques in Data Structures Part2Hashing Techniques in Data Structures Part2
Hashing Techniques in Data Structures Part2
 
Hashing
HashingHashing
Hashing
 
Hashing Technique In Data Structures
Hashing Technique In Data StructuresHashing Technique In Data Structures
Hashing Technique In Data Structures
 

Recently uploaded

Texas Water Development Board Updates June 2024
Texas Water Development Board Updates June 2024Texas Water Development Board Updates June 2024
Texas Water Development Board Updates June 2024
Texas Alliance of Groundwater Districts
 
PAS PSDF Mop Up Workshop Presentation 2024 .pptx
PAS PSDF Mop Up Workshop Presentation 2024 .pptxPAS PSDF Mop Up Workshop Presentation 2024 .pptx
PAS PSDF Mop Up Workshop Presentation 2024 .pptx
PAS_Team
 
A Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC Charlotte
A Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC CharlotteA Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC Charlotte
A Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC Charlotte
Cori Faklaris
 
加急办理华威大学毕业证硕士文凭证书原版一模一样
加急办理华威大学毕业证硕士文凭证书原版一模一样加急办理华威大学毕业证硕士文凭证书原版一模一样
加急办理华威大学毕业证硕士文凭证书原版一模一样
uu1psyf6
 
2024: The FAR - Federal Acquisition Regulations, Part 38
2024: The FAR - Federal Acquisition Regulations, Part 382024: The FAR - Federal Acquisition Regulations, Part 38
2024: The FAR - Federal Acquisition Regulations, Part 38
JSchaus & Associates
 
Transit-Oriented Development Study Working Group Meeting
Transit-Oriented Development Study Working Group MeetingTransit-Oriented Development Study Working Group Meeting
Transit-Oriented Development Study Working Group Meeting
Cuyahoga County Planning Commission
 
Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...
Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...
Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...
Jamesadhikaram land matter consultancy 9447464502
 
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHO
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHOMonitoring Health for the SDGs - Global Health Statistics 2024 - WHO
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHO
Christina Parmionova
 
Milton Keynes Hospital Charity - A guide to leaving a gift in your Will
Milton Keynes Hospital Charity - A guide to leaving a gift in your WillMilton Keynes Hospital Charity - A guide to leaving a gift in your Will
Milton Keynes Hospital Charity - A guide to leaving a gift in your Will
fundraising4
 
2024: The FAR - Federal Acquisition Regulations, Part 39
2024: The FAR - Federal Acquisition Regulations, Part 392024: The FAR - Federal Acquisition Regulations, Part 39
2024: The FAR - Federal Acquisition Regulations, Part 39
JSchaus & Associates
 
Donate to charity during this holiday season
Donate to charity during this holiday seasonDonate to charity during this holiday season
Donate to charity during this holiday season
SERUDS INDIA
 
About Potato, The scientific name of the plant is Solanum tuberosum (L).
About Potato, The scientific name of the plant is Solanum tuberosum (L).About Potato, The scientific name of the plant is Solanum tuberosum (L).
About Potato, The scientific name of the plant is Solanum tuberosum (L).
Christina Parmionova
 
快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样
快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样
快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样
yemqpj
 
IEA World Energy Investment June 2024- Statistics
IEA World Energy Investment June 2024- StatisticsIEA World Energy Investment June 2024- Statistics
IEA World Energy Investment June 2024- Statistics
Energy for One World
 
United Nations World Oceans Day 2024; June 8th " Awaken new dephts".
United Nations World Oceans Day 2024; June 8th " Awaken new dephts".United Nations World Oceans Day 2024; June 8th " Awaken new dephts".
United Nations World Oceans Day 2024; June 8th " Awaken new dephts".
Christina Parmionova
 
原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样
原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样
原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样
3woawyyl
 
RFP for Reno's Community Assistance Center
RFP for Reno's Community Assistance CenterRFP for Reno's Community Assistance Center
RFP for Reno's Community Assistance Center
This Is Reno
 
快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样
快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样
快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样
3woawyyl
 
2024: The FAR - Federal Acquisition Regulations, Part 40
2024: The FAR - Federal Acquisition Regulations, Part 402024: The FAR - Federal Acquisition Regulations, Part 40
2024: The FAR - Federal Acquisition Regulations, Part 40
JSchaus & Associates
 
Contributi dei parlamentari del PD - Contributi L. 3/2019
Contributi dei parlamentari del PD - Contributi L. 3/2019Contributi dei parlamentari del PD - Contributi L. 3/2019
Contributi dei parlamentari del PD - Contributi L. 3/2019
Partito democratico
 

Recently uploaded (20)

Texas Water Development Board Updates June 2024
Texas Water Development Board Updates June 2024Texas Water Development Board Updates June 2024
Texas Water Development Board Updates June 2024
 
PAS PSDF Mop Up Workshop Presentation 2024 .pptx
PAS PSDF Mop Up Workshop Presentation 2024 .pptxPAS PSDF Mop Up Workshop Presentation 2024 .pptx
PAS PSDF Mop Up Workshop Presentation 2024 .pptx
 
A Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC Charlotte
A Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC CharlotteA Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC Charlotte
A Guide to AI for Smarter Nonprofits - Dr. Cori Faklaris, UNC Charlotte
 
加急办理华威大学毕业证硕士文凭证书原版一模一样
加急办理华威大学毕业证硕士文凭证书原版一模一样加急办理华威大学毕业证硕士文凭证书原版一模一样
加急办理华威大学毕业证硕士文凭证书原版一模一样
 
2024: The FAR - Federal Acquisition Regulations, Part 38
2024: The FAR - Federal Acquisition Regulations, Part 382024: The FAR - Federal Acquisition Regulations, Part 38
2024: The FAR - Federal Acquisition Regulations, Part 38
 
Transit-Oriented Development Study Working Group Meeting
Transit-Oriented Development Study Working Group MeetingTransit-Oriented Development Study Working Group Meeting
Transit-Oriented Development Study Working Group Meeting
 
Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...
Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...
Indira P.S Vs sub Collector Kochi - The settlement register is not a holy cow...
 
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHO
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHOMonitoring Health for the SDGs - Global Health Statistics 2024 - WHO
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHO
 
Milton Keynes Hospital Charity - A guide to leaving a gift in your Will
Milton Keynes Hospital Charity - A guide to leaving a gift in your WillMilton Keynes Hospital Charity - A guide to leaving a gift in your Will
Milton Keynes Hospital Charity - A guide to leaving a gift in your Will
 
2024: The FAR - Federal Acquisition Regulations, Part 39
2024: The FAR - Federal Acquisition Regulations, Part 392024: The FAR - Federal Acquisition Regulations, Part 39
2024: The FAR - Federal Acquisition Regulations, Part 39
 
Donate to charity during this holiday season
Donate to charity during this holiday seasonDonate to charity during this holiday season
Donate to charity during this holiday season
 
About Potato, The scientific name of the plant is Solanum tuberosum (L).
About Potato, The scientific name of the plant is Solanum tuberosum (L).About Potato, The scientific name of the plant is Solanum tuberosum (L).
About Potato, The scientific name of the plant is Solanum tuberosum (L).
 
快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样
快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样
快速办理(UVM毕业证书)佛蒙特大学毕业证学位证一模一样
 
IEA World Energy Investment June 2024- Statistics
IEA World Energy Investment June 2024- StatisticsIEA World Energy Investment June 2024- Statistics
IEA World Energy Investment June 2024- Statistics
 
United Nations World Oceans Day 2024; June 8th " Awaken new dephts".
United Nations World Oceans Day 2024; June 8th " Awaken new dephts".United Nations World Oceans Day 2024; June 8th " Awaken new dephts".
United Nations World Oceans Day 2024; June 8th " Awaken new dephts".
 
原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样
原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样
原版制作(英国Southampton毕业证书)南安普顿大学毕业证录取通知书一模一样
 
RFP for Reno's Community Assistance Center
RFP for Reno's Community Assistance CenterRFP for Reno's Community Assistance Center
RFP for Reno's Community Assistance Center
 
快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样
快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样
快速办理(Bristol毕业证书)布里斯托大学毕业证Offer一模一样
 
2024: The FAR - Federal Acquisition Regulations, Part 40
2024: The FAR - Federal Acquisition Regulations, Part 402024: The FAR - Federal Acquisition Regulations, Part 40
2024: The FAR - Federal Acquisition Regulations, Part 40
 
Contributi dei parlamentari del PD - Contributi L. 3/2019
Contributi dei parlamentari del PD - Contributi L. 3/2019Contributi dei parlamentari del PD - Contributi L. 3/2019
Contributi dei parlamentari del PD - Contributi L. 3/2019
 

Merging

  • 1. INDIA HUMAN DEVELOPMENT SURVEY (IHDS) TRAINING PROGRAM MARCH 16, 2016 How to merge two rounds?
  • 3. Relationship between IHDS-I and IHDS-II households IHDS-I sample (N=41,554) Replacement households in IHDS-II (N=2,134) Split households from round 1 (N=5,397) Reinterview Households (N=34,621) Attrition (N=6,911)  Most important concept in merging two data files 1. Some households in round 1 with no match in round 2 and vice versa 2. Households in round 1 match with more than 1 household in round 2
  • 4. Any questions?  Who were chosen for reinterview?  Recontact rate of 83%? What does it mean?  How were replacement households chosen?  What is a split household?
  • 5. What is needed to merge household files? 1. Round 1 household file – N=41,554 2. Round 2 household file – N=42,152  (Why are there more cases in round 2?) 3. Linking file – N=42,152 – gives Round 1 identification codes for all Round 2 households that were reinterviewed, missing linking codes for 2,134 households that are new
  • 6. Step 1 – Link round 2 data to linking file to get round 1 ID  use linkhh, clear  sort STATEID DISTID PSUID HHID HHSPLITID  merge 1:1 STATEID DISTID PSUID HHID HHSPLITID using round2HH  sort STATEID DISTID PSUID HHID2005 HHSPLITID2005, gen(_mergeR2link)  save round2HH_plus, replace
  • 7. Step 2-Merge this Round 2+ file with Round 1 file  use round1HH  rename HHID HHID2005  rename HHSPLITID HHSPLITID2005  sort STATEID DISTID PSUID HHID2005 HHSPLITID2005  merge 1:m STATEID DISTID PSUID HHID2005 HHSPLITID2005 using round2HH_plus, gen(_mergeR1R2)  sort STATEID DISTID PSUID HHID HHSPLITID  save mergedHHR1R2, replace
  • 8. Cases in Merged file is superset  Households surveyed in both rounds N=40,018  Households surveyed in round 1 only (attrition) N=6,911  Households surveyd in round 2 only (replacement) N=2,134  Total N=49,063  Keep only _mergeR1R2==3 for panel analysis (N=40,018)
  • 10. Relationship between IHDS-I and IHDS-II individuals IHDS-I sample (N=215,754) New individulas, new HH (N=9,760) New Ind in R1 HH (N=43,822) Reinterview Ind (N=150,995) HH attrition (N=29,299) Ind. attrition in interview hh (N=35,464)  Most important concept in merging two data files 1. Even reinterview households have new members (births, marriages) 2. Even reinterview households have some members who are no longer there (deaths, marriages, migration)
  • 11. What is needed to merge individual files? 1. Round 1 household file – N=215,754 2. Round 2 household file – N=204,568  (Why are there more cases in round 2?) 3. Linking file – N=204,568 – gives Round 1 identification codes for all Round 2 households that were reinterviewed, missing linking codes for 2,134 households that are new
  • 12. Step 1 – Link round 2 data to linking file to get round 1 ID  use linkind, clear  sort STATEID DISTID PSUID HHID HHSPLITID PERSONID  merge 1:1 STATEID DISTID PSUID HHID HHSPLITID PERONID using round2IND  sort STATEID DISTID PSUID HHID2005 HHSPLITID2005, gen(_mergeR2link)  save round2IND_plus, replace
  • 13. Step 2-Merge this Round 2+ file with Round 1 file  use round1IND  rename HHID HHID2005  rename HHSPLITID HHSPLITID2005  rename PERSONID PERSONID2005  sort STATEID DISTID PSUID HHID2005 HHSPLITID2005 PERSONID2005  merge 1:m STATEID DISTID PSUID HHID2005 HHSPLITID2005 PERSONID2005 using round2IND_plus, gen(_mergeR1R2)  sort STATEID DISTID PSUID HHID HHSPLITID  save mergedINDR1R2, replace
  • 14. Cases in Merged file is superset  Individuals surveyed in both rounds N=150,988  Individuals surveyed in round 1 only (attrition/death/migration) N=64,766  Individuals surveyd in round 2 only (replacement/new) N=53,580  Total N=269,334  Keep only _mergeR1R2==3 for panel analysis (N=150,988)
  • 16. Same process as individual file linkage  But only one thing to note, there was no ever married woman file for 2004-5 so you will be merging with the household file from 2004-5
  • 18. Merging overwrites variables  So if you want to keep variables from round 1 and round 2 separate, before merging you may want to rename all round 1 variables  Typically we use the command  Rename * x*  Rename xSTATEID STATEID et. For merging  So xr05 will be age in 20045 and r05 will be age in 2011-12