SlideShare a Scribd company logo
PFL data collection –
hands on session
Ruth Geraghty
Data Curator
Children’s Research Network
Finding and re-using research data workshop
UCD Health Sciences Library, 15th June 2017
Session 14:00 – 15:30
Overview of this session
• Presentation and Q&A with Ailbhe Booth, member of the PFL
evaluation team at UCD Geary (about 30 mins)
• Guided hands on session with a subset of data from archived PFL
collection using SPSS – chance to see inside an archived collection
(about an hour)
1. Recap of the Preparing for Life collection
Recap of this collection
Preparing for Life collection: Evaluation of the Preparing for Life
early childhood intervention (PFL), 2008 – 2015
Study number (SN): 0055-00
Where can I find out more about this collection?
https://www.ucd.ie/issda/data/pfl/
Where can I find out more about the PEI Research Initiative?
http://www.childrensresearchnetwork.org/knowledge/collection/prevention-and-
early-intervention
Data should always be cited in any new work. Use the following:
Northside Partnership; Orla Doyle; UCD Geary Institute PFL Evaluation Team
(2017). Preparing for Life collection: Evaluation of the Preparing for Life early
childhood intervention, 2008 - 2015. [collection]. Version 1. Dublin: Irish Social
Science Data Archive [distributor ] SN: 0055-00. ucd.ie/issda/pfl
Preparing for Life collection: Evaluation of the Preparing for Life
early childhood intervention (PFL), 2008 – 2015 Study number
(SN): 0055-00
How do I get the full PFL collection?
To access the data, please complete a request form for research
purposes, sign it, and send it to ISSDA by email.
Data will be disseminated on receipt of a fully completed, signed
form. Incomplete or unsigned forms will be returned to the data
requester for completion.
Recap on PFL intervention and evaluation
• Operated by Northside Partnership in Dublin
• Objective: improve levels of school readiness of young children from several disadvantaged
areas in North Dublin
• Intervention: pregnancy to age 4 (school entry)
• UCD Geary Institute (Dr. Orla Doyle)
• 2008 – 2015
• 7 waves of data collection:
• prenatal baseline, 6 months, 12 months, 18 months, 24 months, 36 months, 48 months
• Mixed methods approach
• Impact evaluation, implementation analysis, process evaluation, direct observation of children using
British Ability Scale at school entry
• Randomisation into low support and high support, plus a comparison community (LFP)
What we’ll look at today
Domains examined by PFL
EIGHT DOMAINS
1. Child development
2. Child health
3. Parenting
4. Home environment
5. Maternal health and wellbeing
6. Social support
7. Childcare and service use
8. Household factors and socioeconomic status
What we’ll look at today
2. Starting out
A run through of the basic features of the archived file
Contents of the archived file
Data: The archived dataset (in the case of PFL 1 data file per wave)
Survey: The survey instrument used to gather data (in the case of PFL 1 survey per wave). Copyrighted
scale material may be redacted from the archived version, but where this occurs a citation for the scale
is provided.
Codebooks: The codebook lists all variables in the archived dataset with some basic frequencies (in the
case of PFL a codebook for each wave). This codebook was created during the archiving process. Look
at the index at the end for a quick reference list of all variables in a data file.
Report: The published evaluation report by the researchers that created this dataset (in the case of
PFL 1 report per wave)
Please check that you have the following four
folders
Files for workshop 15 June
BL
12m
24m
48m
Inside each folder check you have these files
(includes contextual materials)
BL folder
PFL_BL_Data-sample
PFL-BL-Codebook
PFL-BL-Report
PFL-BL-Survey
12m folder
PFL_12m_Data-sample
PFL-12m-Codebook
PFL-12m-Report
PFL-12m-Survey
24m folder
PFL_24m_Data-sample
PFL-24m-Codebook
PFL-24m-Report
PFL-24m-Survey
48m folder
PFL_48m_Data-sample
PFL-48m-Codebook
PFL-48m-Report
PFL-48m-Survey
Check you can open each of these in SPSS
1. PFL_BL_Data-sample = pre-natal interview with expectant mother
2. PFL_12m_Data-sample = interview with mother when child was 1 year
old
3. PFL_24m_Data-sample = interview with mother when child was 2 years
old
4. PFL_48m_Data-sample = interview with mother when child was 4 years
old (final interview)
The archival version was processed, including anonymisation of potentially
identifying information
3. Using SPSS
Getting started in SPSS
Opening SPSS
1. IBM SPSS Statistics 24
2. Select ‘Open another
file’ option
Opening SPSS
3. Navigate to the PFL files on your machine
• PFL_BL_Data-sample.sav
• PFL_12m_Data-sample.sav
• PFL_24m_Data-sample.sav
• PFL_48m_Data-sample.sav
• Double click each to open in SPSS
• You can keep all 4 PFL files open simultaneously
Variable
view
Data
view
4. Working with the data files
A little bit of light analysis using SPSS
Please note!
• You are working with a sample of the data file
• Use the codebook to get an indication of the full set of variables in each wave
• But you are only able to see a sub-set today – codebook lists full archival
dataset
• Data should not be copied or saved onto another device, or taken out of this
room
• Codebooks, User Guide, Surveys and Reports are openly available on ISSDA
website
• If you would like to access this data please apply through ISSDA
Variable naming conventions of PEI-RI project
Survey variables: Variables that were generated by the survey correspond to the question number in the
survey, and are labelled to correspond as closely as possible to the original wording of the survey
question. Labels are sometimes composed from truncated survey questions due to character restrictions
in the software (see below).
In 24m data file: scroll down to variables d4, d5, d7, d8, d10 – compare these to the 24m survey
Derived variables: Variables that were created during data entry and analysis
In 24m data file: scroll down to poor_eating (v46) – this was derived from the previous questions ‘How
often does baby eat (v39-45)
Anonymisation variables: Variables that were created during anonymisation
In 24m data file: scroll down to d4 What is your current relationship status (v48) – this was recoded
into broader categories
Variable naming conventions of PEI-RI project
Scale variables: Individual scale items, domain scores and total scores are named with the scale
acronym which is capitalised for ease of reference . These acronyms are consistent across all waves
to facilitate the user to track specific measures across waves. A full list of scale acronym and their
full title and citation are available in Appendix 3. Where permission has been granted to reproduce
the scale contents in the archive, items are labelled so that they correspond as closely as possible to
the wording of the survey question. Where this permission has not been granted, individual item are
labelled with the scale title and sequence number.
In 24m data file: scroll down to CDI_baabaa McArthur-Bates CDI Toddler short – form A (v73 onwards)
to see scale items, and total scores
Conventions of PEI-RI project: Missing cases
are included
Why are there missing cases? All 332 cases are included in the data file for
each wave, so that individual wave files can be merged together if
required. However, data was not collected for every case at each data
collection point and consequently there are a small number of missing
cases per wave. Missing cases in the data file are indicated by the system
missing count and by the variable PFL_control. A count of missing cases
per wave is also provided at the start of the codebook for ease of
reference. Using the 12m file, run a frequency using the variable Trial Group:
PFL intervention or control [PFL_Control]
Method: Analyze / Descriptive Statistics / Frequencies
Result: 82 in LFP, 165 in PFL and 85 not included in wave
Conventions of PEI-RI project: Missing data
Why are there missing data? While participants were encouraged to answer all questions
during the interview, there were some instances where a participant either could not provide
a response to a question or did not wish to provide a response. Missing data are included in
the archived dataset (marked as missing or left blank) so that new users can manage missing
data in a manner that best suits their research design.
Non-response codes for categorical variables:
996 = missing
997 = not applicable
998 = refuse
Missing data for scales: Calculation rules for missing scale items are provided in Appendix of
user guide. Users should refer to the codebook for specific information on the calculation of
cut off points.
Merging wave files
• Same cases included in all waves (even where data not collected)
• Two files can be match using ID variable – make sure they are both
in ascending order (or at last the same order)
• Data / Merge files/ Add variables
• Excluded variables = variables that are common to both files
• New Active Dataset = variables that will be included in new file
• Tick Match cases on key variables and add PFL_ID to the Key Variables box
Try merging the BL and 12m file together
5. Demographic and health variables
A sample of variables from PFL
Household demographic variables
• Age
• Ethnicity
• No of children (first time mother)
• Highest level of education
• Literacy
• Current work status / change to work status
• Social housing
• Medical card, GP card
• Health insurance
• Social welfare payments
• Financial difficulty (self-assess)
• Household income (equivalised – anon)
• Household size (equiv size – anon)
• EU-SILC material deprivation
Maternal health behaviour variables
Baby’s health:
• Baby age, gender, weight,
height
• Health over last 6 months
• Immunisations
• Breastfeeding / child’s eating
• Child’s sleep behaviours
• TV and screen time
Mother’s health:
• Physical activity
• Substance use
• WHO5
• Subjective health assessment
Standardised measures – a small sample for
review
24m file
McArthur Bates – just 10 items
as sample
CBCL – just 10 items as sample
48m file
Child’s sleeping habits (CSH) –
full scale

More Related Content

What's hot

Using Data for Continuous Improvement Faculty Development Model - Competency-...
Using Data for Continuous Improvement Faculty Development Model - Competency-...Using Data for Continuous Improvement Faculty Development Model - Competency-...
Using Data for Continuous Improvement Faculty Development Model - Competency-...
Becky Lopanec
 
Improving Knowledge Discovery Through Development of Big Data to Knowledge S...
Improving Knowledge Discovery Through Development of  Big Data to Knowledge S...Improving Knowledge Discovery Through Development of  Big Data to Knowledge S...
Improving Knowledge Discovery Through Development of Big Data to Knowledge S...
Nicole Vasilevsky
 
In metrics we trust?
In metrics we trust?In metrics we trust?
In metrics we trust?
ORCID, Inc
 
Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015
Paul Courtney
 
Summary of SA Trip
 Summary of SA Trip Summary of SA Trip
Summary of SA Trip
dolamore
 
Research Data Management and Reproducibility
Research Data Management and ReproducibilityResearch Data Management and Reproducibility
Research Data Management and Reproducibility
University of Liverpool Library
 
Public Access to Federally Funded Research
Public Access to Federally Funded ResearchPublic Access to Federally Funded Research
Public Access to Federally Funded Research
Kelly Broughton
 
Learning Analytics in Medical Education
Learning Analytics in Medical EducationLearning Analytics in Medical Education
Learning Analytics in Medical Education
Abelardo Pardo
 
SSC in Evidence Based Medicine - Internet resources
SSC in Evidence Based Medicine - Internet resources SSC in Evidence Based Medicine - Internet resources
SSC in Evidence Based Medicine - Internet resources
PaulaFunnell
 
Zhengfan Wang_Cv_final
Zhengfan Wang_Cv_finalZhengfan Wang_Cv_final
Zhengfan Wang_Cv_final
Zhengfan Wang
 
NIDM-Results: a model to share brain mapping statistical results
NIDM-Results: a model to share brain mapping statistical resultsNIDM-Results: a model to share brain mapping statistical results
NIDM-Results: a model to share brain mapping statistical results
Camille Maumet
 
Open approaches to OER impact research
Open approaches to OER impact research Open approaches to OER impact research
Open approaches to OER impact research
Robert Farrow
 
Education unified planning approach
Education unified planning approachEducation unified planning approach
Education unified planning approach
Results Leadership Group, LLC
 
Viney2011
Viney2011Viney2011
Implementation frameworks applied
Implementation frameworks appliedImplementation frameworks applied
Implementation frameworks applied
impscinetau
 
Understanding impact through alternative metrics: developing library-based as...
Understanding impact through alternative metrics: developing library-based as...Understanding impact through alternative metrics: developing library-based as...
Understanding impact through alternative metrics: developing library-based as...
Kristi Holmes
 
Evaluation-based advice with uncertain evidence
Evaluation-based advice with uncertain evidenceEvaluation-based advice with uncertain evidence
Evaluation-based advice with uncertain evidence
ACIL Allen Consulting
 
Towards Open Research: practices, experiences, barriers and opportunities
Towards Open Research: practices, experiences, barriers and opportunitiesTowards Open Research: practices, experiences, barriers and opportunities
Towards Open Research: practices, experiences, barriers and opportunities
London School of Hygiene and Tropical Medicine
 
Introduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsIntroduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation Concerns
Tore Hoel
 
Doing a systematic review: top tips for progressing your review
Doing a systematic review: top tips for progressing your reviewDoing a systematic review: top tips for progressing your review
Doing a systematic review: top tips for progressing your review
University of Liverpool Library
 

What's hot (20)

Using Data for Continuous Improvement Faculty Development Model - Competency-...
Using Data for Continuous Improvement Faculty Development Model - Competency-...Using Data for Continuous Improvement Faculty Development Model - Competency-...
Using Data for Continuous Improvement Faculty Development Model - Competency-...
 
Improving Knowledge Discovery Through Development of Big Data to Knowledge S...
Improving Knowledge Discovery Through Development of  Big Data to Knowledge S...Improving Knowledge Discovery Through Development of  Big Data to Knowledge S...
Improving Knowledge Discovery Through Development of Big Data to Knowledge S...
 
In metrics we trust?
In metrics we trust?In metrics we trust?
In metrics we trust?
 
Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015
 
Summary of SA Trip
 Summary of SA Trip Summary of SA Trip
Summary of SA Trip
 
Research Data Management and Reproducibility
Research Data Management and ReproducibilityResearch Data Management and Reproducibility
Research Data Management and Reproducibility
 
Public Access to Federally Funded Research
Public Access to Federally Funded ResearchPublic Access to Federally Funded Research
Public Access to Federally Funded Research
 
Learning Analytics in Medical Education
Learning Analytics in Medical EducationLearning Analytics in Medical Education
Learning Analytics in Medical Education
 
SSC in Evidence Based Medicine - Internet resources
SSC in Evidence Based Medicine - Internet resources SSC in Evidence Based Medicine - Internet resources
SSC in Evidence Based Medicine - Internet resources
 
Zhengfan Wang_Cv_final
Zhengfan Wang_Cv_finalZhengfan Wang_Cv_final
Zhengfan Wang_Cv_final
 
NIDM-Results: a model to share brain mapping statistical results
NIDM-Results: a model to share brain mapping statistical resultsNIDM-Results: a model to share brain mapping statistical results
NIDM-Results: a model to share brain mapping statistical results
 
Open approaches to OER impact research
Open approaches to OER impact research Open approaches to OER impact research
Open approaches to OER impact research
 
Education unified planning approach
Education unified planning approachEducation unified planning approach
Education unified planning approach
 
Viney2011
Viney2011Viney2011
Viney2011
 
Implementation frameworks applied
Implementation frameworks appliedImplementation frameworks applied
Implementation frameworks applied
 
Understanding impact through alternative metrics: developing library-based as...
Understanding impact through alternative metrics: developing library-based as...Understanding impact through alternative metrics: developing library-based as...
Understanding impact through alternative metrics: developing library-based as...
 
Evaluation-based advice with uncertain evidence
Evaluation-based advice with uncertain evidenceEvaluation-based advice with uncertain evidence
Evaluation-based advice with uncertain evidence
 
Towards Open Research: practices, experiences, barriers and opportunities
Towards Open Research: practices, experiences, barriers and opportunitiesTowards Open Research: practices, experiences, barriers and opportunities
Towards Open Research: practices, experiences, barriers and opportunities
 
Introduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsIntroduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation Concerns
 
Doing a systematic review: top tips for progressing your review
Doing a systematic review: top tips for progressing your reviewDoing a systematic review: top tips for progressing your review
Doing a systematic review: top tips for progressing your review
 

Similar to PFL data collection – hands on session

Pathway to Practice: Incorporating Evidence into Military Family Services
Pathway to Practice: Incorporating Evidence into Military Family ServicesPathway to Practice: Incorporating Evidence into Military Family Services
Pathway to Practice: Incorporating Evidence into Military Family Services
MFLNFamilyDevelopmnt
 
PhRMA Some Early Thoughts
PhRMA Some Early ThoughtsPhRMA Some Early Thoughts
PhRMA Some Early Thoughts
Philip Bourne
 
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Barry Smith
 
Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...
Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...
Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...
Twitter Inc.
 
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSSWheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
CIARD Movement
 
Week 5 Lab 3· If you choose to download the software from http.docx
Week 5 Lab 3· If you choose to download the software from http.docxWeek 5 Lab 3· If you choose to download the software from http.docx
Week 5 Lab 3· If you choose to download the software from http.docx
cockekeshia
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
CongChen35
 
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
Lisette Giepmans
 
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
National Information Standards Organization (NISO)
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early Thoughts
Philip Bourne
 
Some Early Thoughts
Some Early ThoughtsSome Early Thoughts
Some Early Thoughts
Philip Bourne
 
Evaluating Impact of OVC Programs: Standardizing our methods
Evaluating Impact of OVC Programs: Standardizing our methodsEvaluating Impact of OVC Programs: Standardizing our methods
Evaluating Impact of OVC Programs: Standardizing our methods
MEASURE Evaluation
 
Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and Humanities
Rebekah Cummings
 
Ahrq Hai Collab Meeting 7.13 14.2009
Ahrq Hai Collab Meeting 7.13 14.2009Ahrq Hai Collab Meeting 7.13 14.2009
Ahrq Hai Collab Meeting 7.13 14.2009
Brad Doebbeling
 
statistics.pdf
statistics.pdfstatistics.pdf
statistics.pdf
Noname274365
 
Development of an Electronic Decision Support tool to optimize performance du...
Development of an Electronic Decision Support tool to optimize performance du...Development of an Electronic Decision Support tool to optimize performance du...
Development of an Electronic Decision Support tool to optimize performance du...
INSPIRE_Network
 
Basics of research in research methodology
Basics of research in research methodologyBasics of research in research methodology
Basics of research in research methodology
Shajun Nisha
 
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...
DRIVE research
 
Translating Evidence into Practice Data Collection Assignment.pdf
Translating Evidence into Practice Data Collection Assignment.pdfTranslating Evidence into Practice Data Collection Assignment.pdf
Translating Evidence into Practice Data Collection Assignment.pdf
sdfghj21
 
Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...
Carolyn Ten Holter
 

Similar to PFL data collection – hands on session (20)

Pathway to Practice: Incorporating Evidence into Military Family Services
Pathway to Practice: Incorporating Evidence into Military Family ServicesPathway to Practice: Incorporating Evidence into Military Family Services
Pathway to Practice: Incorporating Evidence into Military Family Services
 
PhRMA Some Early Thoughts
PhRMA Some Early ThoughtsPhRMA Some Early Thoughts
PhRMA Some Early Thoughts
 
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
 
Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...
Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...
Towards Privacy-Preserving Evaluation for Information Retrieval Models over I...
 
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSSWheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
 
Week 5 Lab 3· If you choose to download the software from http.docx
Week 5 Lab 3· If you choose to download the software from http.docxWeek 5 Lab 3· If you choose to download the software from http.docx
Week 5 Lab 3· If you choose to download the software from http.docx
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
 
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
 
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early Thoughts
 
Some Early Thoughts
Some Early ThoughtsSome Early Thoughts
Some Early Thoughts
 
Evaluating Impact of OVC Programs: Standardizing our methods
Evaluating Impact of OVC Programs: Standardizing our methodsEvaluating Impact of OVC Programs: Standardizing our methods
Evaluating Impact of OVC Programs: Standardizing our methods
 
Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and Humanities
 
Ahrq Hai Collab Meeting 7.13 14.2009
Ahrq Hai Collab Meeting 7.13 14.2009Ahrq Hai Collab Meeting 7.13 14.2009
Ahrq Hai Collab Meeting 7.13 14.2009
 
statistics.pdf
statistics.pdfstatistics.pdf
statistics.pdf
 
Development of an Electronic Decision Support tool to optimize performance du...
Development of an Electronic Decision Support tool to optimize performance du...Development of an Electronic Decision Support tool to optimize performance du...
Development of an Electronic Decision Support tool to optimize performance du...
 
Basics of research in research methodology
Basics of research in research methodologyBasics of research in research methodology
Basics of research in research methodology
 
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi...
 
Translating Evidence into Practice Data Collection Assignment.pdf
Translating Evidence into Practice Data Collection Assignment.pdfTranslating Evidence into Practice Data Collection Assignment.pdf
Translating Evidence into Practice Data Collection Assignment.pdf
 
Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...
 

More from ISSDA

Key issues in reusing data
Key issues in reusing dataKey issues in reusing data
Key issues in reusing data
ISSDA
 
Finding & accessing data
Finding & accessing dataFinding & accessing data
Finding & accessing data
ISSDA
 
Accessing and using TILDA data, available through ISSDA
Accessing and using TILDA data, available through ISSDAAccessing and using TILDA data, available through ISSDA
Accessing and using TILDA data, available through ISSDA
ISSDA
 
Irish Social Science Data Archive Services for Depositors & Researchers
Irish Social Science Data Archive Services for Depositors & ResearchersIrish Social Science Data Archive Services for Depositors & Researchers
Irish Social Science Data Archive Services for Depositors & Researchers
ISSDA
 
Qualitative data – anonymisation for sharing
Qualitative data – anonymisation for sharingQualitative data – anonymisation for sharing
Qualitative data – anonymisation for sharing
ISSDA
 
Anonymisation and Social Research
Anonymisation and Social ResearchAnonymisation and Social Research
Anonymisation and Social Research
ISSDA
 
Anonymising quantative data
Anonymising quantative dataAnonymising quantative data
Anonymising quantative data
ISSDA
 
Sports datasets available in ISSDA
Sports datasets available in ISSDASports datasets available in ISSDA
Sports datasets available in ISSDA
ISSDA
 

More from ISSDA (8)

Key issues in reusing data
Key issues in reusing dataKey issues in reusing data
Key issues in reusing data
 
Finding & accessing data
Finding & accessing dataFinding & accessing data
Finding & accessing data
 
Accessing and using TILDA data, available through ISSDA
Accessing and using TILDA data, available through ISSDAAccessing and using TILDA data, available through ISSDA
Accessing and using TILDA data, available through ISSDA
 
Irish Social Science Data Archive Services for Depositors & Researchers
Irish Social Science Data Archive Services for Depositors & ResearchersIrish Social Science Data Archive Services for Depositors & Researchers
Irish Social Science Data Archive Services for Depositors & Researchers
 
Qualitative data – anonymisation for sharing
Qualitative data – anonymisation for sharingQualitative data – anonymisation for sharing
Qualitative data – anonymisation for sharing
 
Anonymisation and Social Research
Anonymisation and Social ResearchAnonymisation and Social Research
Anonymisation and Social Research
 
Anonymising quantative data
Anonymising quantative dataAnonymising quantative data
Anonymising quantative data
 
Sports datasets available in ISSDA
Sports datasets available in ISSDASports datasets available in ISSDA
Sports datasets available in ISSDA
 

Recently uploaded

Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 

Recently uploaded (20)

Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 

PFL data collection – hands on session

  • 1. PFL data collection – hands on session Ruth Geraghty Data Curator Children’s Research Network Finding and re-using research data workshop UCD Health Sciences Library, 15th June 2017 Session 14:00 – 15:30
  • 2. Overview of this session • Presentation and Q&A with Ailbhe Booth, member of the PFL evaluation team at UCD Geary (about 30 mins) • Guided hands on session with a subset of data from archived PFL collection using SPSS – chance to see inside an archived collection (about an hour)
  • 3. 1. Recap of the Preparing for Life collection Recap of this collection
  • 4. Preparing for Life collection: Evaluation of the Preparing for Life early childhood intervention (PFL), 2008 – 2015 Study number (SN): 0055-00 Where can I find out more about this collection? https://www.ucd.ie/issda/data/pfl/ Where can I find out more about the PEI Research Initiative? http://www.childrensresearchnetwork.org/knowledge/collection/prevention-and- early-intervention Data should always be cited in any new work. Use the following: Northside Partnership; Orla Doyle; UCD Geary Institute PFL Evaluation Team (2017). Preparing for Life collection: Evaluation of the Preparing for Life early childhood intervention, 2008 - 2015. [collection]. Version 1. Dublin: Irish Social Science Data Archive [distributor ] SN: 0055-00. ucd.ie/issda/pfl
  • 5. Preparing for Life collection: Evaluation of the Preparing for Life early childhood intervention (PFL), 2008 – 2015 Study number (SN): 0055-00 How do I get the full PFL collection? To access the data, please complete a request form for research purposes, sign it, and send it to ISSDA by email. Data will be disseminated on receipt of a fully completed, signed form. Incomplete or unsigned forms will be returned to the data requester for completion.
  • 6. Recap on PFL intervention and evaluation • Operated by Northside Partnership in Dublin • Objective: improve levels of school readiness of young children from several disadvantaged areas in North Dublin • Intervention: pregnancy to age 4 (school entry) • UCD Geary Institute (Dr. Orla Doyle) • 2008 – 2015 • 7 waves of data collection: • prenatal baseline, 6 months, 12 months, 18 months, 24 months, 36 months, 48 months • Mixed methods approach • Impact evaluation, implementation analysis, process evaluation, direct observation of children using British Ability Scale at school entry • Randomisation into low support and high support, plus a comparison community (LFP) What we’ll look at today
  • 7. Domains examined by PFL EIGHT DOMAINS 1. Child development 2. Child health 3. Parenting 4. Home environment 5. Maternal health and wellbeing 6. Social support 7. Childcare and service use 8. Household factors and socioeconomic status What we’ll look at today
  • 8. 2. Starting out A run through of the basic features of the archived file
  • 9. Contents of the archived file Data: The archived dataset (in the case of PFL 1 data file per wave) Survey: The survey instrument used to gather data (in the case of PFL 1 survey per wave). Copyrighted scale material may be redacted from the archived version, but where this occurs a citation for the scale is provided. Codebooks: The codebook lists all variables in the archived dataset with some basic frequencies (in the case of PFL a codebook for each wave). This codebook was created during the archiving process. Look at the index at the end for a quick reference list of all variables in a data file. Report: The published evaluation report by the researchers that created this dataset (in the case of PFL 1 report per wave)
  • 10. Please check that you have the following four folders Files for workshop 15 June BL 12m 24m 48m
  • 11. Inside each folder check you have these files (includes contextual materials) BL folder PFL_BL_Data-sample PFL-BL-Codebook PFL-BL-Report PFL-BL-Survey 12m folder PFL_12m_Data-sample PFL-12m-Codebook PFL-12m-Report PFL-12m-Survey 24m folder PFL_24m_Data-sample PFL-24m-Codebook PFL-24m-Report PFL-24m-Survey 48m folder PFL_48m_Data-sample PFL-48m-Codebook PFL-48m-Report PFL-48m-Survey
  • 12. Check you can open each of these in SPSS 1. PFL_BL_Data-sample = pre-natal interview with expectant mother 2. PFL_12m_Data-sample = interview with mother when child was 1 year old 3. PFL_24m_Data-sample = interview with mother when child was 2 years old 4. PFL_48m_Data-sample = interview with mother when child was 4 years old (final interview) The archival version was processed, including anonymisation of potentially identifying information
  • 13. 3. Using SPSS Getting started in SPSS
  • 14. Opening SPSS 1. IBM SPSS Statistics 24 2. Select ‘Open another file’ option
  • 15. Opening SPSS 3. Navigate to the PFL files on your machine • PFL_BL_Data-sample.sav • PFL_12m_Data-sample.sav • PFL_24m_Data-sample.sav • PFL_48m_Data-sample.sav • Double click each to open in SPSS • You can keep all 4 PFL files open simultaneously
  • 18. 4. Working with the data files A little bit of light analysis using SPSS
  • 19. Please note! • You are working with a sample of the data file • Use the codebook to get an indication of the full set of variables in each wave • But you are only able to see a sub-set today – codebook lists full archival dataset • Data should not be copied or saved onto another device, or taken out of this room • Codebooks, User Guide, Surveys and Reports are openly available on ISSDA website • If you would like to access this data please apply through ISSDA
  • 20. Variable naming conventions of PEI-RI project Survey variables: Variables that were generated by the survey correspond to the question number in the survey, and are labelled to correspond as closely as possible to the original wording of the survey question. Labels are sometimes composed from truncated survey questions due to character restrictions in the software (see below). In 24m data file: scroll down to variables d4, d5, d7, d8, d10 – compare these to the 24m survey Derived variables: Variables that were created during data entry and analysis In 24m data file: scroll down to poor_eating (v46) – this was derived from the previous questions ‘How often does baby eat (v39-45) Anonymisation variables: Variables that were created during anonymisation In 24m data file: scroll down to d4 What is your current relationship status (v48) – this was recoded into broader categories
  • 21. Variable naming conventions of PEI-RI project Scale variables: Individual scale items, domain scores and total scores are named with the scale acronym which is capitalised for ease of reference . These acronyms are consistent across all waves to facilitate the user to track specific measures across waves. A full list of scale acronym and their full title and citation are available in Appendix 3. Where permission has been granted to reproduce the scale contents in the archive, items are labelled so that they correspond as closely as possible to the wording of the survey question. Where this permission has not been granted, individual item are labelled with the scale title and sequence number. In 24m data file: scroll down to CDI_baabaa McArthur-Bates CDI Toddler short – form A (v73 onwards) to see scale items, and total scores
  • 22. Conventions of PEI-RI project: Missing cases are included Why are there missing cases? All 332 cases are included in the data file for each wave, so that individual wave files can be merged together if required. However, data was not collected for every case at each data collection point and consequently there are a small number of missing cases per wave. Missing cases in the data file are indicated by the system missing count and by the variable PFL_control. A count of missing cases per wave is also provided at the start of the codebook for ease of reference. Using the 12m file, run a frequency using the variable Trial Group: PFL intervention or control [PFL_Control] Method: Analyze / Descriptive Statistics / Frequencies Result: 82 in LFP, 165 in PFL and 85 not included in wave
  • 23. Conventions of PEI-RI project: Missing data Why are there missing data? While participants were encouraged to answer all questions during the interview, there were some instances where a participant either could not provide a response to a question or did not wish to provide a response. Missing data are included in the archived dataset (marked as missing or left blank) so that new users can manage missing data in a manner that best suits their research design. Non-response codes for categorical variables: 996 = missing 997 = not applicable 998 = refuse Missing data for scales: Calculation rules for missing scale items are provided in Appendix of user guide. Users should refer to the codebook for specific information on the calculation of cut off points.
  • 24. Merging wave files • Same cases included in all waves (even where data not collected) • Two files can be match using ID variable – make sure they are both in ascending order (or at last the same order) • Data / Merge files/ Add variables • Excluded variables = variables that are common to both files • New Active Dataset = variables that will be included in new file • Tick Match cases on key variables and add PFL_ID to the Key Variables box Try merging the BL and 12m file together
  • 25. 5. Demographic and health variables A sample of variables from PFL
  • 26. Household demographic variables • Age • Ethnicity • No of children (first time mother) • Highest level of education • Literacy • Current work status / change to work status • Social housing • Medical card, GP card • Health insurance • Social welfare payments • Financial difficulty (self-assess) • Household income (equivalised – anon) • Household size (equiv size – anon) • EU-SILC material deprivation
  • 27. Maternal health behaviour variables Baby’s health: • Baby age, gender, weight, height • Health over last 6 months • Immunisations • Breastfeeding / child’s eating • Child’s sleep behaviours • TV and screen time Mother’s health: • Physical activity • Substance use • WHO5 • Subjective health assessment
  • 28. Standardised measures – a small sample for review 24m file McArthur Bates – just 10 items as sample CBCL – just 10 items as sample 48m file Child’s sleeping habits (CSH) – full scale