Reliability and validity
Why do we need Reliability & Validity?
           (Measurement Error)
A participant’s score on a particular measure consists of 2
components:
  Observed score = True score + Measurement Error
True Score = score that the participant would have
obtained if measurement was perfect—i.e., we were able
to measure without error
Measurement Error = the component of the observed
score that is the result of factors that distort the score from
its true value
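The decomposition above can be illustrated with a small simulation. This is only a sketch: the function name `observe`, the true score of 100, and the error standard deviation of 5 are illustrative assumptions, not values from the slides. It assumes the error is random with mean zero, so that averaging many observations approaches the true score.

```python
import random

def observe(true_score, error_sd, rng):
    """Return one observed score: the true score plus random measurement error."""
    # Assumption: error is normally distributed with mean 0 (purely random error).
    return true_score + rng.gauss(0.0, error_sd)

rng = random.Random(0)          # fixed seed so the run is repeatable
true_score = 100.0              # hypothetical participant's true score

# Measure the same participant many times; each observation differs by its error.
observations = [observe(true_score, error_sd=5.0, rng=rng) for _ in range(1000)]
mean_observed = sum(observations) / len(observations)

print(f"first observation: {observations[0]:.2f}")
print(f"mean of 1000 observations: {mean_observed:.2f}")
```

Any single observation may miss the true score by several points, while the mean of many observations lands close to it, because random errors tend to cancel out.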
Factors that Influence Measurement Error

• Transient states of the participants:
  (transient mood, health, fatigue-level, etc.)
• Stable attributes of the participants:
  (individual differences in intelligence,
  personality, motivation, etc.)
• Situational factors of the research setting:
  (room temperature, lighting, crowding, etc.)
Characteristics of Measures and Manipulations
Precision and clarity of operational
definitions
Training of observers
Number of independent observations
on which a score is based (more is
better?)
Measures that induce fatigue or fear
Actual Mistakes

  Equipment malfunction
  Errors in recording behaviors by observers
  Confusing response formats for self-reports
  Data entry errors



Measurement error undermines the reliability
(repeatability) of the measures we use
Reliability

• The reliability of a measure is an
  inverse function of measurement error:
• The more error, the less reliable the
  measure
• Reliable measures provide consistent
  measurement from occasion to
  occasion
Estimating Reliability

Total variance in a set of scores = Variance due to true scores + Variance due to error

Reliability = True-score variance / Total variance

Reliability can range from 0 to 1.0
When a reliability coefficient equals 0, the scores reflect
nothing but measurement error
Rule of Thumb: measures with reliability coefficients of
.70 or greater are considered to have acceptable reliability
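The variance ratio can be checked on simulated data, where the true scores are known (with real data they are not, which is why reliability has to be estimated indirectly). A minimal sketch, assuming normally distributed true scores and independent errors; the means, standard deviations, and sample size are made up for illustration.

```python
import random
import statistics

rng = random.Random(42)         # fixed seed for a repeatable run
n = 5000                        # hypothetical number of participants

# Assumed population values: true-score SD 10 (variance 100), error SD 5 (variance 25).
true_scores = [rng.gauss(50.0, 10.0) for _ in range(n)]
errors = [rng.gauss(0.0, 5.0) for _ in range(n)]
observed = [t + e for t, e in zip(true_scores, errors)]

true_var = statistics.pvariance(true_scores)
total_var = statistics.pvariance(observed)

# Reliability = true-score variance / total variance
reliability = true_var / total_var
print(f"reliability is roughly {reliability:.2f}")
```

With these assumed variances the expected ratio is 100 / (100 + 25) = .80; the simulated value differs slightly because of sampling variation.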
Different Methods for Assessing Reliability


Test-Retest Reliability
Inter-rater Reliability
Internal Consistency Reliability
Test-Retest Reliability

Test-retest reliability refers to the
consistency of participants’ responses
over time (usually a few weeks, why?)
Assumes the characteristic being
measured is stable over time—not
expected to change between test and
retest
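Test-retest reliability is commonly summarized as the correlation between scores from the two administrations. The sketch below uses a hand-written Pearson correlation; the function name `pearson_r` and the six pairs of scores are hypothetical illustration data, not results from any study.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

# Hypothetical scores for the same six participants, a few weeks apart.
time1 = [12, 15, 9, 20, 17, 11]
time2 = [13, 14, 10, 19, 18, 10]

test_retest_r = pearson_r(time1, time2)
print(f"test-retest r = {test_retest_r:.2f}")
```

A high correlation means participants kept roughly the same rank order across occasions, which is what a reliable measure of a stable characteristic should show.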
Inter-rater Reliability

If a measurement involves behavioral
ratings by an observer/rater, we would
expect consistency among raters for a
reliable measure
Best to use at least 2 independent
raters, ‘blind’ to the ratings of other
observers
Precise operational definitions and well-
trained observers improve inter-rater
reliability
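One common index of agreement between two raters who assign categories is Cohen's kappa, which corrects raw agreement for the agreement expected by chance. The slides do not name a specific statistic, so this choice is an assumption; the function name, category labels, and ratings are hypothetical.

```python
from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters coding the same observations."""
    n = len(ratings_a)
    # Proportion of observations on which the two raters agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance, from each rater's category proportions.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    # Note: undefined (division by zero) if both raters use a single category.
    return (observed - expected) / (1 - expected)

# Hypothetical codes from two independent, blind raters for eight observations.
rater_a = ["agg", "agg", "calm", "calm", "agg", "calm", "calm", "agg"]
rater_b = ["agg", "calm", "calm", "calm", "agg", "calm", "agg", "agg"]

kappa = cohens_kappa(rater_a, rater_b)
print(f"kappa = {kappa:.2f}")
```

Here the raters agree on 6 of 8 observations (75%), but half of that agreement would be expected by chance alone, so kappa is lower than the raw agreement rate.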
Internal Consistency Reliability
• Relevant for measures that consist of more
  than 1 item (e.g., total scores on scales, or
  when several behavioral observations are
  used to obtain a single score)
• Internal consistency refers to inter-item
  reliability, and assesses the degree of
  consistency among the items in a scale, or
  the different observations used to derive a
  score
• Want to be sure that all the items (or
  observations) are measuring the same
  construct
Estimates of Internal Consistency

• Item-total score consistency
• Split-half reliability: randomly divide items
  into 2 subsets and examine the consistency
  in total scores across the 2 subsets (any
  drawbacks?)
• Cronbach’s Alpha: conceptually, it is the
  average consistency across all possible split-
  half reliabilities
• Cronbach’s Alpha can be directly computed
  from data
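As a sketch of the direct computation, the standard formula is alpha = (k / (k - 1)) × (1 - sum of item variances / variance of total scores), where k is the number of items. The function name `cronbach_alpha` and the three-item, five-participant data set are hypothetical.

```python
import statistics

def cronbach_alpha(items):
    """Cronbach's alpha.

    items: list of item columns, each a list with one score per participant.
    """
    k = len(items)
    # Variance of each item across participants.
    item_vars = [statistics.variance(col) for col in items]
    # Total score per participant (sum across items), then its variance.
    totals = [sum(scores) for scores in zip(*items)]
    total_var = statistics.variance(totals)
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical responses: 3 items (rows) answered by 5 participants (columns).
items = [
    [4, 3, 5, 2, 4],
    [5, 3, 4, 2, 4],
    [4, 2, 5, 3, 5],
]

alpha = cronbach_alpha(items)
print(f"Cronbach's alpha = {alpha:.2f}")
```

When items move together across participants, the variance of the total score is large relative to the summed item variances, and alpha approaches 1.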
Estimating the Validity of a Measure

• A good measure must not only be reliable,
  but also valid
• A valid measure measures what it is intended
  to measure
• Validity is not a property of a measure, but an
  indication of the extent to which an
  assessment measures a particular construct
  in a particular context—thus a measure may
  be valid for one purpose but not another
• A measure cannot be valid unless it is
  reliable, but a reliable measure may not be
  valid
Estimating Validity
Like reliability, validity is not absolute
Validity is the degree to which variability
(individual differences) in participants’
scores on a particular measure reflects
individual differences in the
characteristic or construct we want to
measure
Three types of measurement validity:
          Face Validity
          Construct Validity
          Criterion-Related Validity
Face Validity

• Face validity refers to the extent to which a
  measure ‘appears’ to measure what it is
  supposed to measure
• Not statistical—involves the judgment of the
  researcher (and the participants)
• A measure has face validity ‘if people think
  it does’
• Face validity alone does not ensure that a
  measure is valid (and measures lacking face
  validity can be valid)
Construct Validity
Most scientific investigations involve
hypothetical constructs—entities that
cannot be directly observed but are
inferred from empirical evidence (e.g.,
intelligence)
Construct validity is assessed by
studying the relationships between the
measure of a construct and scores on
measures of other constructs
We assess construct validity by seeing
whether a particular measure relates as
it should to other measures
Self-Esteem Example

• Scores on a measure of self-esteem
 should be positively related to
 measures of confidence and optimism

• But, negatively related to measures of
 insecurity and anxiety
Convergent and Discriminant Validity

• To have construct validity, a measure
  should both:
• Correlate with other measures that it
  should be related to (convergent
  validity)
• And, not correlate with measures that it
  should not correlate with (discriminant
  validity)
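Using the self-esteem example, convergent and discriminant validity can be examined as a pattern of correlations. This is a sketch with made-up scores for six participants; shoe size stands in as a hypothetical measure that self-esteem should not correlate with, and `pearson_r` is a hand-written helper rather than a library function.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

# Hypothetical scores for the same six participants on four measures.
self_esteem = [10, 14, 8, 18, 12, 16]
confidence = [11, 15, 9, 17, 12, 17]   # should relate positively
anxiety = [16, 11, 18, 7, 13, 9]       # should relate negatively
shoe_size = [40, 42, 41, 41, 42, 40]   # should be unrelated

r_confidence = pearson_r(self_esteem, confidence)
r_anxiety = pearson_r(self_esteem, anxiety)
r_shoe = pearson_r(self_esteem, shoe_size)

print(f"self-esteem vs confidence: {r_confidence:.2f}")   # convergent evidence
print(f"self-esteem vs anxiety:    {r_anxiety:.2f}")      # convergent evidence (negative)
print(f"self-esteem vs shoe size:  {r_shoe:.2f}")         # discriminant evidence
```

The evidence for construct validity is the whole pattern: strong correlations in the predicted directions with related measures, and a near-zero correlation with the unrelated one.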
Criterion-Related Validity
• Refers to the extent to which a measure
  distinguishes participants on the basis of a
  particular behavioral criterion
• The Scholastic Aptitude Test (SAT) is valid to the
  extent that it distinguishes between students who
  do well in college and those who do not
• A valid measure of marital conflict should
  correlate with behavioral observations (e.g.,
  number of fights)
• A valid measure of depressive symptoms should
  distinguish between subjects in treatment for
  depression and those who are not in treatment
Two Types of Criterion-Related Validity
  Concurrent validity
      measure and criterion are assessed at the
      same time
  Predictive validity
      elapsed time between the administration
      of the measure to be validated and the
      criterion is a relatively long period
      (e.g., months or years)
Predictive validity refers to a measure’s ability
  to distinguish participants on a relevant
  behavioral criterion at some point in the future
SAT Example

• High school seniors who score high on
  the SAT are better prepared for
  college than low scorers (concurrent
  validity)
• Probably of greater interest to college
  admissions administrators, SAT scores
  predict academic performance four
  years later (predictive validity)
Thank you.

Supporting (UKRI) OA monographs at Salford.pptx
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
 

Reliability & validity

  • 2. Why do we need Reliability & Validity? (Measurement Error) A participant’s score on a particular measure consists of 2 components: Observed score = True score + Measurement Error. True Score = the score that the participant would have obtained if measurement were perfect, i.e., if we were able to measure without error. Measurement Error = the component of the observed score that results from factors that distort the score from its true value.
  • 3. Factors that Influence Measurement Error: transient states of the participants (mood, health, fatigue level, etc.); stable attributes of the participants (individual differences in intelligence, personality, motivation, etc.); situational factors of the research setting (room temperature, lighting, crowding, etc.).
  • 4. Characteristics of Measures and Manipulations: precision and clarity of operational definitions; training of observers; number of independent observations on which a score is based (more is better?); measures that induce fatigue or fear.
  • 5. Actual Mistakes: equipment malfunction; errors in recording behaviors by observers; confusing response formats for self-reports; data entry errors. Measurement error undermines the reliability (repeatability) of the measures we use.
  • 6. Reliability: The reliability of a measure is an inverse function of measurement error: the more error, the less reliable the measure. Reliable measures provide consistent measurement from occasion to occasion.
  • 7. Estimating Reliability: Total variance in a set of scores = variance due to true scores + variance due to error. Reliability = true-score variance / total variance. Reliability can range from 0 to 1.0. When a reliability coefficient equals 0, the scores reflect nothing but measurement error. Rule of thumb: measures with reliability coefficients of 0.70 or greater have acceptable reliability.
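The variance decomposition on this slide can be illustrated with a small sketch. The numbers are hypothetical, and the true scores are only available here because the data are made up; with real data they are unobservable, which is why reliability has to be estimated indirectly. The errors are chosen to be uncorrelated with the true scores so that the two variances add up exactly.

```python
from statistics import pvariance

# Hypothetical data for illustration only (true scores are never observed in practice).
true_scores = [10.0, 10.0, 20.0, 20.0]
errors = [-1.0, 1.0, -1.0, 1.0]  # uncorrelated with the true scores by construction

# Observed score = true score + measurement error
observed = [t + e for t, e in zip(true_scores, errors)]

true_var = pvariance(true_scores)   # variance due to true scores
error_var = pvariance(errors)       # variance due to error
total_var = pvariance(observed)     # total variance in the observed scores

# Reliability = true-score variance / total variance
reliability = true_var / total_var

print(f"total = {total_var}, true = {true_var}, error = {error_var}")
print(f"reliability = {reliability:.3f}")
```

With these made-up values nearly all of the observed variance is true-score variance, so the coefficient is well above the 0.70 rule of thumb.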
  • 8. Different Methods for Assessing Reliability: test-retest reliability; inter-rater reliability; internal consistency reliability.
  • 9. Test-Retest Reliability: Test-retest reliability refers to the consistency of participants’ responses over time (usually a few weeks, why?). It assumes the characteristic being measured is stable over time, i.e., not expected to change between test and retest.
  • 10. Inter-rater Reliability: If a measurement involves behavioral ratings by an observer/rater, we would expect consistency among raters for a reliable measure. It is best to use at least 2 independent raters who are ‘blind’ to the ratings of other observers. Precise operational definitions and well-trained observers improve inter-rater reliability.
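One way to quantify consistency between two independent raters is sketched below. The slides do not name a statistic; percent agreement and Cohen's kappa (agreement corrected for chance) are used here as common choices, and the category labels and ratings are hypothetical.

```python
from collections import Counter

def percent_agreement(a, b):
    """Proportion of observations on which the two raters gave the same code."""
    return sum(x == y for x, y in zip(a, b)) / len(a)

def cohens_kappa(a, b):
    """Agreement between two raters, corrected for agreement expected by chance."""
    n = len(a)
    p_observed = percent_agreement(a, b)
    counts_a, counts_b = Counter(a), Counter(b)
    # Chance agreement: product of the raters' marginal proportions per category
    p_expected = sum(
        (counts_a[c] / n) * (counts_b[c] / n) for c in set(a) | set(b)
    )
    return (p_observed - p_expected) / (1 - p_expected)

# Hypothetical codes assigned by two 'blind' raters to the same 8 observations
rater_a = ["agg", "agg", "calm", "calm", "agg", "calm", "calm", "agg"]
rater_b = ["agg", "calm", "calm", "calm", "agg", "calm", "agg", "agg"]

agreement = percent_agreement(rater_a, rater_b)
kappa = cohens_kappa(rater_a, rater_b)
print(f"agreement = {agreement:.2f}, kappa = {kappa:.2f}")
```

Kappa is lower than raw agreement here because two raters who each use both categories half the time would agree on half the observations by chance alone.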
  • 11. Internal Consistency Reliability: Relevant for measures that consist of more than 1 item (e.g., total scores on scales, or when several behavioral observations are used to obtain a single score). Internal consistency refers to inter-item reliability and assesses the degree of consistency among the items in a scale, or among the different observations used to derive a score. We want to be sure that all the items (or observations) are measuring the same construct.
  • 12. Estimates of Internal Consistency: Item-total score consistency. Split-half reliability: randomly divide items into 2 subsets and examine the consistency in total scores across the 2 subsets (any drawbacks?). Cronbach’s Alpha: conceptually, it is the average consistency across all possible split-half reliabilities. Cronbach’s Alpha can be directly computed from data.
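As a rough sketch of how Cronbach's alpha can be computed directly from data, the function below uses the standard variance-based formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), where k is the number of items. The function name and the 5-participant, 4-item data set are hypothetical.

```python
from statistics import variance

def cronbach_alpha(rows):
    """rows: one list of item scores per participant, all of the same length."""
    k = len(rows[0])                       # number of items in the scale
    items = list(zip(*rows))               # regroup scores by item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(r) for r in rows])  # variance of total scores
    return (k / (k - 1)) * (1 - item_var_sum / total_var)

# Hypothetical responses: 5 participants x 4 items
responses = [
    [2, 3, 3, 2],
    [4, 4, 5, 4],
    [1, 2, 1, 2],
    [5, 4, 5, 5],
    [3, 3, 3, 3],
]

alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha = {alpha:.3f}")
```

In this made-up data set the items rise and fall together across participants, so alpha comes out high; items measuring unrelated things would pull it toward 0.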
  • 13. Estimating the Validity of a Measure: A good measure must be not only reliable but also valid. A valid measure measures what it is intended to measure. Validity is not a property of a measure, but an indication of the extent to which an assessment measures a particular construct in a particular context; thus a measure may be valid for one purpose but not another. A measure cannot be valid unless it is reliable, but a reliable measure may not be valid.
  • 14. Estimating Validity: Like reliability, validity is not absolute. Validity is the degree to which variability (individual differences) in participants’ scores on a particular measure reflects individual differences in the characteristic or construct we want to measure. Three types of measurement validity: face validity, construct validity, and criterion-related validity.
  • 15. Face Validity: Face validity refers to the extent to which a measure ‘appears’ to measure what it is supposed to measure. It is not statistical: it involves the judgment of the researcher (and the participants). A measure has face validity ‘if people think it does’. Having face validity does not ensure that a measure is valid (and measures lacking face validity can be valid).
  • 16. Construct Validity: Most scientific investigations involve hypothetical constructs, i.e., entities that cannot be directly observed but are inferred from empirical evidence (e.g., intelligence). Construct validity is assessed by studying the relationships between the measure of a construct and scores on measures of other constructs. We assess construct validity by seeing whether a particular measure relates as it should to other measures.
  • 17. Self-Esteem Example: Scores on a measure of self-esteem should be positively related to measures of confidence and optimism, but negatively related to measures of insecurity and anxiety.
  • 18. Convergent and Discriminant Validity: To have construct validity, a measure should both correlate with other measures that it should be related to (convergent validity) and not correlate with measures that it should not correlate with (discriminant validity).
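The convergent and discriminant pattern can be checked with ordinary Pearson correlations. The sketch below uses the self-esteem example with hypothetical scores for five participants; the shoe-size variable is an assumed stand-in for a measure that self-esteem should not relate to.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    dev_x = [v - mean_x for v in x]
    dev_y = [v - mean_y for v in y]
    cross = sum(a * b for a, b in zip(dev_x, dev_y))
    return cross / sqrt(sum(a * a for a in dev_x) * sum(b * b for b in dev_y))

# Hypothetical scores for 5 participants
self_esteem = [1, 2, 3, 4, 5]
confidence = [2, 1, 4, 3, 5]   # should relate positively (convergent)
anxiety = [5, 4, 4, 2, 1]      # should relate negatively
shoe_size = [8, 6, 7, 6, 8]    # should not relate at all (discriminant)

r_confidence = pearson_r(self_esteem, confidence)
r_anxiety = pearson_r(self_esteem, anxiety)
r_shoe = pearson_r(self_esteem, shoe_size)

print(f"confidence: {r_confidence:+.2f}")
print(f"anxiety:    {r_anxiety:+.2f}")
print(f"shoe size:  {r_shoe:+.2f}")
```

A measure that showed a strong correlation with the unrelated variable, or a weak one with confidence, would raise doubts about its construct validity.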
  • 19. Criterion-Related Validity: Refers to the extent to which a measure distinguishes participants on the basis of a particular behavioral criterion. The Scholastic Aptitude Test (SAT) is valid to the extent that it distinguishes between students who do well in college and those who do not. A valid measure of marital conflict should correlate with behavioral observations (e.g., number of fights). A valid measure of depressive symptoms should distinguish between subjects in treatment for depression and those who are not in treatment.
  • 20. Two Types of Criterion-Related Validity: Concurrent validity: the measure and the criterion are assessed at the same time. Predictive validity: the elapsed time between the administration of the measure to be validated and the criterion is relatively long (e.g., months or years). Predictive validity refers to a measure’s ability to distinguish participants on a relevant behavioral criterion at some point in the future.
  • 21. SAT Example: High school seniors who score high on the SAT are better prepared for college than low scorers (concurrent validity). Probably of greater interest to college admissions administrators, SAT scores predict academic performance four years later (predictive validity).