SlideShare a Scribd company logo
1 of 17
Research 101: Data Preparation
Harold Gamero
Data preparation
Data coding
Data entry
Missing values
Data transformation
Patterns in outlier data
Normality tests
Dimensionality of the scales
Reliability of the scales
1
2
3
4
5
6
7
8
Data Coding
• Coding is the process of converting data to numerical values.
• A codebook is a document that details the scales of each variable, the responses to each
item and what numerical values correspond to each response category.
• In some cases, it is possible to directly code the respondent's answer (age, income).
• Sometimes it is necessary to assign values to represent each variable (sex, profession).
• Qualitative results (such as interviews) cannot be “coded” and analysed statistically.
Data Coding
Data Entry
• Data can be entered into spreadsheets, databases or specialized statistical programs
(SPSS, Mplus, Stata, R, etc.).
• In the case of SPSS, rows represent individuals and columns represent variables, items
or response categories.
• The data entered should be constantly monitored for errors or invalid questionnaires (e.g.
meaningless patterns: all 1 or 5).
• Surveys with these errors should be discarded from further statistical analysis.
Missing Values
• Missing values may be unavoidable.
• Identify whether they appear randomly or show a pattern.
• If there is a pattern, the problem lies in the instrument or in the method applied (pilot
test).
• Examine the extent of the missing data.
• Select the way in which these values will be (not) used.
• By default, programs delete questionnaires with missing data (listwise deletion).
• Some allow the estimation and replacement of them (imputation).
• 2 types of unbiased imputation: maximum likelihood and multiple imputation methods.
Data Transformation
In some cases, data must be presented in a different way than collected.
For example:
➢ Scales that have items posed inversely
➢ Items that must be summed to obtain scores per dimension or variable
➢ Variables to be aggregated to obtain indexes
➢ Data that should be grouped into categories or ranges (age groups)
Patterns in Outlier Data
• Atypical data may appear due to:
➢ Errors in the data collection process
➢ Accumulated effect of external factors
➢ Extraordinary events
➢ Extraordinary remarks
• Outliers should be excluded from the analysis when they are an error (e.g., illogical or
erroneously entered responses).
• Outliers can be identified using steam & leaf plots.
Patterns in Outlier Data
Outliers
Less
dispersion
More
dispersion
Normality Test
• To use the normal statistical indicators (parametric statistics), we must verify that the
statistical assumptions are met.
• For this we can use:
Histograms Q-Q normality plots
Normality Test
• To use the normal statistical indicators (parametric statistics), we must verify that the
statistical assumptions are met.
• For this we can use:
Kolmogorov–Smirnov
test
Shapiro–Wilk test
Dimensionality of the Scales
• The next step is to verify that the items of our scales have been correctly distributed
across the dimensions of the construct of interest.
• For example, Empowerment is a multidimensional construct with 5 factors or
dimensions (Spreitzer, 1995):
➢ Meaning
➢ Competition
➢ Self-determination
➢ Impact
➢ Security
Dimensionality of the Scales
Confirmatory Factor
Analysis (CFA)
shows the presence of 5
factors or dimensions.
Dimensionality of the Scales
Subsequently, it should be
corroborated that the
items of each factor are
distributed as proposed in
the model.
Reliability of the scales
• We must confirm the reliability of the scales in our sample.
• Depending on the type of scale used, the method for calculating this indicator will be
different.
• For scales with additive Likert-type items, the recommended method is Cronbach’s
Alpha coefficient or the Composite Reliability Test
• In the case of multidimensional constructs, reliability coefficients are calculated per
dimension.
• Reliability coefficients can range from 0 to 1. Being 1 = perfect reliability, and 0 = null
reliability (Commonly, values above 0.7 are acceptable).
Reliability of the scales
Thank you.
Harold Gamero

More Related Content

Similar to Research 101: Quantitative Data Preparation

Statistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxStatistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxrajalakshmi5921
 
Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...
Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...
Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...Maninda Edirisooriya
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelHiram Ting
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluationeShikshak
 
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningBrief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningJennifer Morrow
 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excelParag Shah
 
Introduction to Data Management in Human Ecology
Introduction to Data Management in Human EcologyIntroduction to Data Management in Human Ecology
Introduction to Data Management in Human EcologyKern Rocke
 
Chapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdfChapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdfAschalewAyele2
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsHiba Armouche
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningSanghamitra Deb
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statisticsAnkit Katiyar
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statisticsothanatoso
 

Similar to Research 101: Quantitative Data Preparation (20)

Statistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxStatistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptx
 
Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...
Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...
Lecture 10 - Model Testing and Evaluation, a lecture in subject module Statis...
 
Environmental statistics
Environmental statisticsEnvironmental statistics
Environmental statistics
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate Level
 
Statistics 1
Statistics 1Statistics 1
Statistics 1
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
 
Intro statistics
Intro statisticsIntro statistics
Intro statistics
 
RM UNIT 6.pptx
RM UNIT 6.pptxRM UNIT 6.pptx
RM UNIT 6.pptx
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluation
 
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningBrief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
 
STATISTICS-E.pdf
STATISTICS-E.pdfSTATISTICS-E.pdf
STATISTICS-E.pdf
 
ANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptxANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptx
 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excel
 
Introduction to Data Management in Human Ecology
Introduction to Data Management in Human EcologyIntroduction to Data Management in Human Ecology
Introduction to Data Management in Human Ecology
 
Chapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdfChapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdf
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statistics
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statistics
 

More from Harold Gamero

Research 101: Inferential Quantitative Analysis
Research 101: Inferential Quantitative AnalysisResearch 101: Inferential Quantitative Analysis
Research 101: Inferential Quantitative AnalysisHarold Gamero
 
Research 101: Descriptive Quantitative Analysis
Research 101: Descriptive Quantitative AnalysisResearch 101: Descriptive Quantitative Analysis
Research 101: Descriptive Quantitative AnalysisHarold Gamero
 
Research 101: Research with Questionnaires
Research 101: Research with QuestionnairesResearch 101: Research with Questionnaires
Research 101: Research with QuestionnairesHarold Gamero
 
Research 101: Sampling Techniques in Research
Research 101: Sampling Techniques in ResearchResearch 101: Sampling Techniques in Research
Research 101: Sampling Techniques in ResearchHarold Gamero
 
Research 101: Scale Validity & Reliability
Research 101: Scale Validity & ReliabilityResearch 101: Scale Validity & Reliability
Research 101: Scale Validity & ReliabilityHarold Gamero
 
Research 101: Measurements of Constructs
Research 101: Measurements of ConstructsResearch 101: Measurements of Constructs
Research 101: Measurements of ConstructsHarold Gamero
 
Research 101: Scientific Research Designs
Research 101: Scientific Research DesignsResearch 101: Scientific Research Designs
Research 101: Scientific Research DesignsHarold Gamero
 
Research 101: Theories in Social Science
Research 101: Theories in Social ScienceResearch 101: Theories in Social Science
Research 101: Theories in Social ScienceHarold Gamero
 
Research 101: Qualitative Data Analysis.
Research 101: Qualitative Data Analysis.Research 101: Qualitative Data Analysis.
Research 101: Qualitative Data Analysis.Harold Gamero
 
Research 101: Transcription of Interviews
Research 101: Transcription of InterviewsResearch 101: Transcription of Interviews
Research 101: Transcription of InterviewsHarold Gamero
 
Research 101: Qualitative Research Designs
Research 101: Qualitative Research DesignsResearch 101: Qualitative Research Designs
Research 101: Qualitative Research DesignsHarold Gamero
 
Research 101: How to Read a Scientific Paper
Research 101: How to Read a Scientific PaperResearch 101: How to Read a Scientific Paper
Research 101: How to Read a Scientific PaperHarold Gamero
 
Research 101: Rigor in Qualitative Research
Research 101: Rigor in Qualitative ResearchResearch 101: Rigor in Qualitative Research
Research 101: Rigor in Qualitative ResearchHarold Gamero
 
Research 101: Qualitative vs. Quantitative Research
Research 101: Qualitative vs. Quantitative ResearchResearch 101: Qualitative vs. Quantitative Research
Research 101: Qualitative vs. Quantitative ResearchHarold Gamero
 
Research 101: Finding a Research Question
Research 101: Finding a Research QuestionResearch 101: Finding a Research Question
Research 101: Finding a Research QuestionHarold Gamero
 
Research 101: Types of Scientific Research
Research 101: Types of Scientific ResearchResearch 101: Types of Scientific Research
Research 101: Types of Scientific ResearchHarold Gamero
 
Research 101: What is (Scientific) Research
Research 101: What is (Scientific) ResearchResearch 101: What is (Scientific) Research
Research 101: What is (Scientific) ResearchHarold Gamero
 
Research 101: Key aspects of a Thesis .
Research 101: Key aspects of a Thesis  .Research 101: Key aspects of a Thesis  .
Research 101: Key aspects of a Thesis .Harold Gamero
 
Research 101: Literature Review .
Research 101: Literature Review        .Research 101: Literature Review        .
Research 101: Literature Review .Harold Gamero
 
Research 101: Academic Writing Style .
Research 101: Academic Writing Style   .Research 101: Academic Writing Style   .
Research 101: Academic Writing Style .Harold Gamero
 

More from Harold Gamero (20)

Research 101: Inferential Quantitative Analysis
Research 101: Inferential Quantitative AnalysisResearch 101: Inferential Quantitative Analysis
Research 101: Inferential Quantitative Analysis
 
Research 101: Descriptive Quantitative Analysis
Research 101: Descriptive Quantitative AnalysisResearch 101: Descriptive Quantitative Analysis
Research 101: Descriptive Quantitative Analysis
 
Research 101: Research with Questionnaires
Research 101: Research with QuestionnairesResearch 101: Research with Questionnaires
Research 101: Research with Questionnaires
 
Research 101: Sampling Techniques in Research
Research 101: Sampling Techniques in ResearchResearch 101: Sampling Techniques in Research
Research 101: Sampling Techniques in Research
 
Research 101: Scale Validity & Reliability
Research 101: Scale Validity & ReliabilityResearch 101: Scale Validity & Reliability
Research 101: Scale Validity & Reliability
 
Research 101: Measurements of Constructs
Research 101: Measurements of ConstructsResearch 101: Measurements of Constructs
Research 101: Measurements of Constructs
 
Research 101: Scientific Research Designs
Research 101: Scientific Research DesignsResearch 101: Scientific Research Designs
Research 101: Scientific Research Designs
 
Research 101: Theories in Social Science
Research 101: Theories in Social ScienceResearch 101: Theories in Social Science
Research 101: Theories in Social Science
 
Research 101: Qualitative Data Analysis.
Research 101: Qualitative Data Analysis.Research 101: Qualitative Data Analysis.
Research 101: Qualitative Data Analysis.
 
Research 101: Transcription of Interviews
Research 101: Transcription of InterviewsResearch 101: Transcription of Interviews
Research 101: Transcription of Interviews
 
Research 101: Qualitative Research Designs
Research 101: Qualitative Research DesignsResearch 101: Qualitative Research Designs
Research 101: Qualitative Research Designs
 
Research 101: How to Read a Scientific Paper
Research 101: How to Read a Scientific PaperResearch 101: How to Read a Scientific Paper
Research 101: How to Read a Scientific Paper
 
Research 101: Rigor in Qualitative Research
Research 101: Rigor in Qualitative ResearchResearch 101: Rigor in Qualitative Research
Research 101: Rigor in Qualitative Research
 
Research 101: Qualitative vs. Quantitative Research
Research 101: Qualitative vs. Quantitative ResearchResearch 101: Qualitative vs. Quantitative Research
Research 101: Qualitative vs. Quantitative Research
 
Research 101: Finding a Research Question
Research 101: Finding a Research QuestionResearch 101: Finding a Research Question
Research 101: Finding a Research Question
 
Research 101: Types of Scientific Research
Research 101: Types of Scientific ResearchResearch 101: Types of Scientific Research
Research 101: Types of Scientific Research
 
Research 101: What is (Scientific) Research
Research 101: What is (Scientific) ResearchResearch 101: What is (Scientific) Research
Research 101: What is (Scientific) Research
 
Research 101: Key aspects of a Thesis .
Research 101: Key aspects of a Thesis  .Research 101: Key aspects of a Thesis  .
Research 101: Key aspects of a Thesis .
 
Research 101: Literature Review .
Research 101: Literature Review        .Research 101: Literature Review        .
Research 101: Literature Review .
 
Research 101: Academic Writing Style .
Research 101: Academic Writing Style   .Research 101: Academic Writing Style   .
Research 101: Academic Writing Style .
 

Recently uploaded

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 

Recently uploaded (20)

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 

Research 101: Quantitative Data Preparation

  • 1. Research 101: Data Preparation Harold Gamero
  • 2. Data preparation Data coding Data entry Missing values Data transformation Patterns in outlier data Normality tests Dimensionality of the scales Reliability of the scales 1 2 3 4 5 6 7 8
  • 3. Data Coding • Coding is the process of converting data to numerical values. • A codebook is a document that details the scales of each variable, the responses to each item and what numerical values correspond to each response category. • In some cases, it is possible to directly code the respondent's answer (age, income). • Sometimes it is necessary to assign values to represent each variable (sex, profession). • Qualitative results (such as interviews) cannot be “coded” and analysed statistically.
  • 5. Data Entry • Data can be entered into spreadsheets, databases or specialized statistical programs (SPSS, Mplus, Stata, R, etc.). • In the case of SPSS, rows represent individuals and columns represent variables, items or response categories. • The data entered should be constantly monitored for errors or invalid questionnaires (e.g. meaningless patterns: all 1 or 5). • Surveys with these errors should be discarded from further statistical analysis.
  • 6. Missing Values • Missing values may be unavoidable. • Identify whether they appear randomly or show a pattern. • If there is a pattern, the problem lies in the instrument or in the method applied (pilot test). • Examine the extent of the missing data. • Select the way in which these values will be (not) used. • By default, programs delete questionnaires with missing data (listwise deletion). • Some allow the estimation and replacement of them (imputation). • 2 types of unbiased imputation: maximum likelihood and multiple imputation methods.
  • 7. Data Transformation In some cases, data must be presented in a different way than collected. For example: ➢ Scales that have items posed inversely ➢ Items that must be summed to obtain scores per dimension or variable ➢ Variables to be aggregated to obtain indexes ➢ Data that should be grouped into categories or ranges (age groups)
  • 8. Patterns in Outlier Data • Atypical data may appear due to: ➢ Errors in the data collection process ➢ Accumulated effect of external factors ➢ Extraordinary events ➢ Extraordinary remarks • Outliers should be excluded from the analysis when they are an error (e.g., illogical or erroneously entered responses). • Outliers can be identified using steam & leaf plots.
  • 9. Patterns in Outlier Data Outliers Less dispersion More dispersion
  • 10. Normality Test • To use the normal statistical indicators (parametric statistics), we must verify that the statistical assumptions are met. • For this we can use: Histograms Q-Q normality plots
  • 11. Normality Test • To use the normal statistical indicators (parametric statistics), we must verify that the statistical assumptions are met. • For this we can use: Kolmogorov–Smirnov test Shapiro–Wilk test
  • 12. Dimensionality of the Scales • The next step is to verify that the items of our scales have been correctly distributed across the dimensions of the construct of interest. • For example, Empowerment is a multidimensional construct with 5 factors or dimensions (Spreitzer, 1995): ➢ Meaning ➢ Competition ➢ Self-determination ➢ Impact ➢ Security
  • 13. Dimensionality of the Scales Confirmatory Factor Analysis (CFA) shows the presence of 5 factors or dimensions.
  • 14. Dimensionality of the Scales Subsequently, it should be corroborated that the items of each factor are distributed as proposed in the model.
  • 15. Reliability of the scales • We must confirm the reliability of the scales in our sample. • Depending on the type of scale used, the method for calculating this indicator will be different. • For scales with additive Likert-type items, the recommended method is Cronbach’s Alpha coefficient or the Composite Reliability Test • In the case of multidimensional constructs, reliability coefficients are calculated per dimension. • Reliability coefficients can range from 0 to 1. Being 1 = perfect reliability, and 0 = null reliability (Commonly, values above 0.7 are acceptable).