SlideShare a Scribd company logo
1 of 12
Download to read offline
23-11-2015
1
Characteristics of a
Good Test
VALIDITY
The term validity refers to whether or not a test
measures what it intends to measure.
On a test with high validity the items will be closely linked
to the test’s intended focus. For many certification and
licensure tests this means that the items will be highly
related to a specific job or occupation. If a test has poor
validity then it does not measure the job-related content
and competencies it ought to.
There are several ways to estimate the validity of a test,
including content validity, construct validity, criterion-
related validity (concurrent & predictive) and face validity.
Compiled By:Dr.V.Singh
23-11-2015
2
VALIDITY
 Content”: related to objectives and their sampling.
 “Construct”: referring to the theory underlying the
target.
 “Criterion”: related to concrete criteria in the real
world. It can be concurrent or predictive.
 “Concurrent”: correlating high with another measure
already validated.
 “Predictive”: Capable of anticipating some later
measure.
 “Face”: related to the test overall appearance.
Compiled By:Dr.V.Singh
1. CONTENT VALIDITY
Content validity refers to the connections
between the test items and the subject-related
tasks. The test should evaluate only the content
related to the field of study in a manner
sufficiently representative, relevant, and
comprehensible.
Compiled By:Dr.V.Singh
23-11-2015
3
2. CONSTRUCT VALIDITY
It implies using the construct correctly
(concepts, ideas, notions). Construct validity
seeks agreement between a theoretical concept
and a specific measuring device or procedure.
For example, a test of intelligence nowadays
must include measures of multiple intelligences,
rather than just logical-mathematical and
linguistic ability measures.
Compiled By:Dr.V.Singh
3. CRITERION-RELATED
VALIDITY
Also referred to as instrumental validity, it
states that the criteria should be clearly
defined by the teacher in advance. It has to
take into account other teachers´ criteria to
be standardized and it also needs to
demonstrate the accuracy of a measure or
procedure compared to another measure or
procedure which has already been
demonstrated to be valid.
Compiled By:Dr.V.Singh
23-11-2015
4
4. CONCURRENT VALIDITY
Concurrent validity is a statistical method using
correlation, rather than a logical method.
Examinees who are known to be either masters or non-
masters on the content measured by the test are
identified before the test is administered. Once the
tests have been scored, the relationship between the
examinees’ status as either masters or non-masters and
their performance (i.e., pass or fail) is estimated based
on the test. This type of validity provides evidence that
the test is classifying examinees correctly. The stronger
the correlation is, the greater the concurrent validity of
the test is.
Compiled By:Dr.V.Singh
5. PREDICTIVE VALIDITY
This is another statistical approach to validity that
estimates the relationship of test scores to an
examinee's future performance as a master or non-
master. Predictive validity considers the question,
"How well does the test predict examinees' future
status as masters or non-masters?" For this type of
validity, the correlation that is computed is based on
the test results and the examinee’s later
performance. This type of validity is especially useful
for test purposes such as selection or admissions.
Compiled By:Dr.V.Singh
23-11-2015
5
6. FACE VALIDITY
Like content validity, face validity is determined by a
review of the items and not through the use of
statistical analyses. Unlike content validity, face
validity is not investigated through formal procedures.
Instead, anyone who looks over the test, including
examinees, may develop an informal opinion as to
whether or not the test is measuring what it is
supposed to measure. While it is clearly of some value
to have the test appear to be valid, face validity alone
is insufficient for establishing that the test is
measuring what it claims to measure.
Compiled By:Dr.V.Singh
RELIABILITY
Reliability is the extent to which an experiment,
test, or any measuring procedure shows the same
result on repeated trials. Without the agreement
of independent observers able to replicate
research procedures, or the ability to use research
tools and procedures that produce consistent
measurements, researchers would be unable to
satisfactorily draw conclusions, formulate
theories, or make claims about the generalizability
of their research. For researchers, four key types
of reliability are:
Compiled By:Dr.V.Singh
23-11-2015
6
RELIABILITY
 “Equivalency”: related to the co-occurrence of
two items
 “Stability”: related to time consistency
 “Internal”: related to the instruments
 “Inter-rater”: related to the examiners’
criterion
 “Intra-rater”: related to the examiners’
criterion
Compiled By:Dr.V.Singh
1. EQUIVALENCY RELIABILITY
Equivalency reliability is the extent to which two items measure
identical concepts at an identical level of difficulty. Equivalency
reliability is determined by relating two sets of test scores to
one another to highlight the degree of relationship or association.
For example, a researcher studying university English students
happened to notice that when some students were studying for
finals, they got sick. Intrigued by this, the researcher attempted
to observe how often, or to what degree, these two behaviors co-
occurred throughout the academic year. The researcher used the
results of the observations to assess the correlation between
“studying throughout the academic year” and “getting sick”. The
researcher concluded there was poor equivalency reliability
between the two actions. In other words, studying was not a
reliable predictor of getting sick.
Compiled By:Dr.V.Singh
23-11-2015
7
2. STABILITY RELIABILITY
Stability reliability (sometimes called test, re-
test reliability) is the agreement of measuring
instruments over time. To determine stability, a
measure or test is repeated on the same subjects
at a future date. Results are compared and
correlated with the initial test to give a measure
of stability. This method of evaluating reliability is
appropriate only if the phenomenon that the test
measures is known to be stable over the interval
between assessments. The possibility of practice
effects should also be taken into account.
Compiled By:Dr.V.Singh
3. INTERNAL CONSISTENCY
Internal consistency is the extent to which tests or
procedures assess the same characteristic, skill or
quality. It is a measure of the precision between the
measuring instruments used in a study. This type of
reliability often helps researchers interpret data
and predict the value of scores and the limits of the
relationship among variables. For example, analyzing
the internal reliability of the items on a vocabulary
quiz will reveal the extent to which the quiz focuses
on the examinee’s knowledge of words.
Compiled By:Dr.V.Singh
23-11-2015
8
4. INTER-RATER RELIABILITY
Inter-rater reliability is the extent to which two or more
individuals (coders or raters) agree. Inter-rater reliability
assesses the consistency of how a measuring system is
implemented. For example, when two or more teachers use a
rating scale with which they are rating the students’ oral
responses in an interview (1 being most negative, 5 being
most positive). If one researcher gives a "1" to a student
response, while another researcher gives a "5," obviously
the inter-rater reliability would be inconsistent. Inter-
rater reliability is dependent upon the ability of two or
more individuals to be consistent. Training, education and
monitoring skills can enhance inter-rater reliability.
Compiled By:Dr.V.Singh
4. INTRA-RATER RELIABILITY
Intra-rater reliability is a type of reliability
assessment in which the same assessment is completed
by the same rater on two or more occasions. These
different ratings are then compared, generally by
means of correlation. Since the same individual is
completing both assessments, the rater's subsequent
ratings are contaminated by knowledge of earlier
ratings.
Compiled By:Dr.V.Singh
23-11-2015
9
SOURCES OF ERROR
 Examinee (is a human being)
 Examiner (is a human being)
 Examination (is designed by and for
human beings)
Compiled By:Dr.V.Singh
RELATIONSHIP BETWEEN
VALIDITY & RELIABILITY
Validity and reliability are closely
related.
A test cannot be considered valid unless
the measurements resulting from it
are reliable.
Likewise, results from a test can be
reliable and not necessarily valid.
Compiled By:Dr.V.Singh
23-11-2015
10
PRACTICALITY
It refers to the economy of time, effort and
money in testing. In other words, a test should
be…
 Easy to design
 Easy to administer
 Easy to mark
 Easy to interpret (the results)
Compiled By:Dr.V.Singh
BACKWASH EFFECT
Backwash effect (also known as washback) is
the influence of testing on teaching and
learning. It is also the potential impact that the
form and content of a test may have on
learners’ conception of what is being assessed
(language proficiency) and what it involves.
Therefore, test designers, delivers and raters
have a particular responsibility, considering that
the testing process may have a substantial
impact, either positive or negative.
Compiled By:Dr.V.Singh
23-11-2015
11
LEVELS OF BACKWASH
It is believed that backwash is a subset of a test’s
impact on society, educational systems and
individuals. Thus, test impact operates at two levels:
 The micro level (the effect of the test on individual
students and teachers)
 The macro level (the impact of the test on society
and the educational system)
Bachman and Palmer (1996)
Compiled By:Dr.V.Singh
 Usability(easy to administer, scoring,
interpretation and application, availability
etc.)
 Objectivity
Compiled By:Dr.V.Singh
23-11-2015
12
Item analysis
 The difficulty level of the item
 The discrimination power of the item
 The effectiveness of alternative
Compiled By:Dr.V.Singh
THANKS
Compiled By:Dr.V.Singh

More Related Content

What's hot

Guidance TYPES OF TEST
Guidance TYPES OF TESTGuidance TYPES OF TEST
Guidance TYPES OF TESTJen_castle
 
Types of grading system
Types of grading systemTypes of grading system
Types of grading systemRedPaspas
 
Norm Referenced and Criterion Referenced
Norm Referenced and Criterion ReferencedNorm Referenced and Criterion Referenced
Norm Referenced and Criterion ReferencedDr. Amjad Ali Arain
 
Validity and objectivity of tests
Validity and objectivity of testsValidity and objectivity of tests
Validity and objectivity of testsbushra mushtaq
 
Characteristics of a good test
Characteristics of a good testCharacteristics of a good test
Characteristics of a good testALMA HERMOGINO
 
Validity of Assessment Tools
Validity of Assessment ToolsValidity of Assessment Tools
Validity of Assessment ToolsUmairaNasim
 
Difference between assessment, measurement and evaluation
Difference between assessment, measurement and evaluationDifference between assessment, measurement and evaluation
Difference between assessment, measurement and evaluationKiranMalik37
 
Achievement test - Teacher Made Test and Standardized Test - Characteristics,...
Achievement test - Teacher Made Test and Standardized Test - Characteristics,...Achievement test - Teacher Made Test and Standardized Test - Characteristics,...
Achievement test - Teacher Made Test and Standardized Test - Characteristics,...Suresh Babu
 
Criterion referenced test
Criterion referenced test Criterion referenced test
Criterion referenced test Ulfa
 
learning indicator n learning outcomes
learning indicator n learning outcomeslearning indicator n learning outcomes
learning indicator n learning outcomesJagadish Kumar Gupta
 
Educational research
Educational researchEducational research
Educational researchMukut Deori
 
Educational measurement, assessment and evaluation
Educational measurement, assessment and evaluationEducational measurement, assessment and evaluation
Educational measurement, assessment and evaluationBoyet Aluan
 
Distinction among measurement, assessment and evaluation
Distinction among measurement, assessment and evaluationDistinction among measurement, assessment and evaluation
Distinction among measurement, assessment and evaluationUSMAN GANI AL HAQUE
 
Qualities of a good test (1)
Qualities of a good test (1)Qualities of a good test (1)
Qualities of a good test (1)kimoya
 
Meaning, need and characteristics of evaluation
Meaning, need and characteristics of evaluationMeaning, need and characteristics of evaluation
Meaning, need and characteristics of evaluationDr. Priyamvada Saarsar
 

What's hot (20)

Construction of Test
Construction of TestConstruction of Test
Construction of Test
 
Reflective Teaching
Reflective TeachingReflective Teaching
Reflective Teaching
 
Guidance TYPES OF TEST
Guidance TYPES OF TESTGuidance TYPES OF TEST
Guidance TYPES OF TEST
 
Types of grading system
Types of grading systemTypes of grading system
Types of grading system
 
Rubrics ppt
Rubrics pptRubrics ppt
Rubrics ppt
 
Norm Referenced and Criterion Referenced
Norm Referenced and Criterion ReferencedNorm Referenced and Criterion Referenced
Norm Referenced and Criterion Referenced
 
Validity and objectivity of tests
Validity and objectivity of testsValidity and objectivity of tests
Validity and objectivity of tests
 
Characteristics of a good test
Characteristics of a good testCharacteristics of a good test
Characteristics of a good test
 
Rubric Development for Teachers
Rubric Development for TeachersRubric Development for Teachers
Rubric Development for Teachers
 
Teacher-made-test.pptx
Teacher-made-test.pptxTeacher-made-test.pptx
Teacher-made-test.pptx
 
Validity of Assessment Tools
Validity of Assessment ToolsValidity of Assessment Tools
Validity of Assessment Tools
 
Difference between assessment, measurement and evaluation
Difference between assessment, measurement and evaluationDifference between assessment, measurement and evaluation
Difference between assessment, measurement and evaluation
 
Achievement test - Teacher Made Test and Standardized Test - Characteristics,...
Achievement test - Teacher Made Test and Standardized Test - Characteristics,...Achievement test - Teacher Made Test and Standardized Test - Characteristics,...
Achievement test - Teacher Made Test and Standardized Test - Characteristics,...
 
Criterion referenced test
Criterion referenced test Criterion referenced test
Criterion referenced test
 
learning indicator n learning outcomes
learning indicator n learning outcomeslearning indicator n learning outcomes
learning indicator n learning outcomes
 
Educational research
Educational researchEducational research
Educational research
 
Educational measurement, assessment and evaluation
Educational measurement, assessment and evaluationEducational measurement, assessment and evaluation
Educational measurement, assessment and evaluation
 
Distinction among measurement, assessment and evaluation
Distinction among measurement, assessment and evaluationDistinction among measurement, assessment and evaluation
Distinction among measurement, assessment and evaluation
 
Qualities of a good test (1)
Qualities of a good test (1)Qualities of a good test (1)
Qualities of a good test (1)
 
Meaning, need and characteristics of evaluation
Meaning, need and characteristics of evaluationMeaning, need and characteristics of evaluation
Meaning, need and characteristics of evaluation
 

Similar to Module-14-1-Characterstics of a good test-Reliability,Validity....pdf

Test characteristics
Test characteristicsTest characteristics
Test characteristicsSamcruz5
 
Presentation Validity & Reliability
Presentation Validity & ReliabilityPresentation Validity & Reliability
Presentation Validity & Reliabilitysongoten77
 
Validity, reliability & practicality
Validity, reliability & practicalityValidity, reliability & practicality
Validity, reliability & practicalitySamcruz5
 
Educ 243 final report pepito
Educ 243 final report pepitoEduc 243 final report pepito
Educ 243 final report pepitodeped
 
VALIDITY
VALIDITYVALIDITY
VALIDITYANCYBS
 
Validity & reliability seminar
Validity & reliability seminarValidity & reliability seminar
Validity & reliability seminarmrikara185
 
Characteristics of Good test
Characteristics of Good testCharacteristics of Good test
Characteristics of Good testVikramjit Singh
 
Presentation validity
Presentation validityPresentation validity
Presentation validityAshMusavi
 
RELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYRELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYJoydeep Singh
 
Validity of a Research Tool
Validity of a Research ToolValidity of a Research Tool
Validity of a Research TooljobyVarghese22
 
Validity, Reliability ,Objective & Their Types
Validity, Reliability ,Objective & Their TypesValidity, Reliability ,Objective & Their Types
Validity, Reliability ,Objective & Their TypesMohammadRabbani18
 
Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Vadher Ankita
 
Principles of assessment
Principles of assessmentPrinciples of assessment
Principles of assessmentmunsif123
 
Qualities of a Good Test
Qualities of a Good TestQualities of a Good Test
Qualities of a Good TestDrSindhuAlmas
 

Similar to Module-14-1-Characterstics of a good test-Reliability,Validity....pdf (20)

Test characteristics
Test characteristicsTest characteristics
Test characteristics
 
Presentation Validity & Reliability
Presentation Validity & ReliabilityPresentation Validity & Reliability
Presentation Validity & Reliability
 
Validity, reliability & practicality
Validity, reliability & practicalityValidity, reliability & practicality
Validity, reliability & practicality
 
Rep
RepRep
Rep
 
Educ 243 final report pepito
Educ 243 final report pepitoEduc 243 final report pepito
Educ 243 final report pepito
 
VALIDITY
VALIDITYVALIDITY
VALIDITY
 
Validity & reliability seminar
Validity & reliability seminarValidity & reliability seminar
Validity & reliability seminar
 
Characteristics of Good test
Characteristics of Good testCharacteristics of Good test
Characteristics of Good test
 
Presentation validity
Presentation validityPresentation validity
Presentation validity
 
RELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYRELIABILITY AND VALIDITY
RELIABILITY AND VALIDITY
 
Validity of a Research Tool
Validity of a Research ToolValidity of a Research Tool
Validity of a Research Tool
 
Validity, Reliability ,Objective & Their Types
Validity, Reliability ,Objective & Their TypesValidity, Reliability ,Objective & Their Types
Validity, Reliability ,Objective & Their Types
 
Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.
 
Test quality validity
Test quality validityTest quality validity
Test quality validity
 
Validity Evidence
Validity EvidenceValidity Evidence
Validity Evidence
 
Qualities of good evaluation tool (1)
Qualities of good evaluation  tool (1)Qualities of good evaluation  tool (1)
Qualities of good evaluation tool (1)
 
Principles of assessment
Principles of assessmentPrinciples of assessment
Principles of assessment
 
Validity
ValidityValidity
Validity
 
EM&E.pptx
EM&E.pptxEM&E.pptx
EM&E.pptx
 
Qualities of a Good Test
Qualities of a Good TestQualities of a Good Test
Qualities of a Good Test
 

More from Vikramjit Singh

Measures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit Singh
Measures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit SinghMeasures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit Singh
Measures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit SinghVikramjit Singh
 
Non Parametric Test by Vikramjit Singh
Non Parametric Test by  Vikramjit SinghNon Parametric Test by  Vikramjit Singh
Non Parametric Test by Vikramjit SinghVikramjit Singh
 
Parametric Test by Vikramjit Singh
Parametric Test  by  Vikramjit SinghParametric Test  by  Vikramjit Singh
Parametric Test by Vikramjit SinghVikramjit Singh
 
Concept of Variables in Research by Vikramjit Singh
Concept of Variables in Research by  Vikramjit SinghConcept of Variables in Research by  Vikramjit Singh
Concept of Variables in Research by Vikramjit SinghVikramjit Singh
 
Research Tool and its Characterstics
Research Tool and its CharactersticsResearch Tool and its Characterstics
Research Tool and its CharactersticsVikramjit Singh
 
Research Tool - Types and Examples
Research Tool - Types and ExamplesResearch Tool - Types and Examples
Research Tool - Types and ExamplesVikramjit Singh
 
Causal Comparative Research- Vikramjit Singh.pdf
Causal Comparative Research- Vikramjit Singh.pdfCausal Comparative Research- Vikramjit Singh.pdf
Causal Comparative Research- Vikramjit Singh.pdfVikramjit Singh
 
Sample and Sampling Techniques.pdf
Sample and Sampling  Techniques.pdfSample and Sampling  Techniques.pdf
Sample and Sampling Techniques.pdfVikramjit Singh
 
Correlational Research in Detail with all Steps- Dr. Vikramjit Singh.pdf
Correlational Research in Detail with all Steps- Dr. Vikramjit  Singh.pdfCorrelational Research in Detail with all Steps- Dr. Vikramjit  Singh.pdf
Correlational Research in Detail with all Steps- Dr. Vikramjit Singh.pdfVikramjit Singh
 
Vikramjit Singh-Descriptive Research-Survey research.pdf
Vikramjit Singh-Descriptive Research-Survey research.pdfVikramjit Singh-Descriptive Research-Survey research.pdf
Vikramjit Singh-Descriptive Research-Survey research.pdfVikramjit Singh
 
Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...
Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...
Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...Vikramjit Singh
 
Basics of Hypothesis_ Vikramjit Singh.pdf
Basics of Hypothesis_ Vikramjit Singh.pdfBasics of Hypothesis_ Vikramjit Singh.pdf
Basics of Hypothesis_ Vikramjit Singh.pdfVikramjit Singh
 
5E model lesson plan.pdf
5E model lesson plan.pdf5E model lesson plan.pdf
5E model lesson plan.pdfVikramjit Singh
 
Experiments and Prospects of Globalisation Towards Higher Education in India
Experiments and Prospects of Globalisation Towards Higher Education in IndiaExperiments and Prospects of Globalisation Towards Higher Education in India
Experiments and Prospects of Globalisation Towards Higher Education in IndiaVikramjit Singh
 
5E model lesson plan.pdf
5E model lesson plan.pdf5E model lesson plan.pdf
5E model lesson plan.pdfVikramjit Singh
 
E-Content-MCC-08- ICON Model.pdf
E-Content-MCC-08- ICON Model.pdfE-Content-MCC-08- ICON Model.pdf
E-Content-MCC-08- ICON Model.pdfVikramjit Singh
 
E-Content-MCC-07-The System Analysis Approach to Curriculum Development.pdf
E-Content-MCC-07-The System Analysis Approach  to Curriculum Development.pdfE-Content-MCC-07-The System Analysis Approach  to Curriculum Development.pdf
E-Content-MCC-07-The System Analysis Approach to Curriculum Development.pdfVikramjit Singh
 
E-Content-MCC-08-Portfolio Assessment.pdf
E-Content-MCC-08-Portfolio Assessment.pdfE-Content-MCC-08-Portfolio Assessment.pdf
E-Content-MCC-08-Portfolio Assessment.pdfVikramjit Singh
 
E-Content-MCC-08-5 E Model-Hindi.pdf
E-Content-MCC-08-5 E Model-Hindi.pdfE-Content-MCC-08-5 E Model-Hindi.pdf
E-Content-MCC-08-5 E Model-Hindi.pdfVikramjit Singh
 

More from Vikramjit Singh (20)

Measures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit Singh
Measures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit SinghMeasures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit Singh
Measures of Central Tendency-Mean, Median , Mode- Dr. Vikramjit Singh
 
Non Parametric Test by Vikramjit Singh
Non Parametric Test by  Vikramjit SinghNon Parametric Test by  Vikramjit Singh
Non Parametric Test by Vikramjit Singh
 
Parametric Test by Vikramjit Singh
Parametric Test  by  Vikramjit SinghParametric Test  by  Vikramjit Singh
Parametric Test by Vikramjit Singh
 
Concept of Variables in Research by Vikramjit Singh
Concept of Variables in Research by  Vikramjit SinghConcept of Variables in Research by  Vikramjit Singh
Concept of Variables in Research by Vikramjit Singh
 
Research Tool and its Characterstics
Research Tool and its CharactersticsResearch Tool and its Characterstics
Research Tool and its Characterstics
 
Research Tool - Types and Examples
Research Tool - Types and ExamplesResearch Tool - Types and Examples
Research Tool - Types and Examples
 
Causal Comparative Research- Vikramjit Singh.pdf
Causal Comparative Research- Vikramjit Singh.pdfCausal Comparative Research- Vikramjit Singh.pdf
Causal Comparative Research- Vikramjit Singh.pdf
 
Sample and Sampling Techniques.pdf
Sample and Sampling  Techniques.pdfSample and Sampling  Techniques.pdf
Sample and Sampling Techniques.pdf
 
Correlational Research in Detail with all Steps- Dr. Vikramjit Singh.pdf
Correlational Research in Detail with all Steps- Dr. Vikramjit  Singh.pdfCorrelational Research in Detail with all Steps- Dr. Vikramjit  Singh.pdf
Correlational Research in Detail with all Steps- Dr. Vikramjit Singh.pdf
 
Vikramjit Singh-Descriptive Research-Survey research.pdf
Vikramjit Singh-Descriptive Research-Survey research.pdfVikramjit Singh-Descriptive Research-Survey research.pdf
Vikramjit Singh-Descriptive Research-Survey research.pdf
 
Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...
Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...
Vikramjit Singh-Hypothesis Testing Basics_ Errors,Df,Power of Test,Level of S...
 
Basics of Hypothesis_ Vikramjit Singh.pdf
Basics of Hypothesis_ Vikramjit Singh.pdfBasics of Hypothesis_ Vikramjit Singh.pdf
Basics of Hypothesis_ Vikramjit Singh.pdf
 
5E model lesson plan.pdf
5E model lesson plan.pdf5E model lesson plan.pdf
5E model lesson plan.pdf
 
Micro Lesson Plan
Micro Lesson PlanMicro Lesson Plan
Micro Lesson Plan
 
Experiments and Prospects of Globalisation Towards Higher Education in India
Experiments and Prospects of Globalisation Towards Higher Education in IndiaExperiments and Prospects of Globalisation Towards Higher Education in India
Experiments and Prospects of Globalisation Towards Higher Education in India
 
5E model lesson plan.pdf
5E model lesson plan.pdf5E model lesson plan.pdf
5E model lesson plan.pdf
 
E-Content-MCC-08- ICON Model.pdf
E-Content-MCC-08- ICON Model.pdfE-Content-MCC-08- ICON Model.pdf
E-Content-MCC-08- ICON Model.pdf
 
E-Content-MCC-07-The System Analysis Approach to Curriculum Development.pdf
E-Content-MCC-07-The System Analysis Approach  to Curriculum Development.pdfE-Content-MCC-07-The System Analysis Approach  to Curriculum Development.pdf
E-Content-MCC-07-The System Analysis Approach to Curriculum Development.pdf
 
E-Content-MCC-08-Portfolio Assessment.pdf
E-Content-MCC-08-Portfolio Assessment.pdfE-Content-MCC-08-Portfolio Assessment.pdf
E-Content-MCC-08-Portfolio Assessment.pdf
 
E-Content-MCC-08-5 E Model-Hindi.pdf
E-Content-MCC-08-5 E Model-Hindi.pdfE-Content-MCC-08-5 E Model-Hindi.pdf
E-Content-MCC-08-5 E Model-Hindi.pdf
 

Recently uploaded

What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxCeline George
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111GangaMaiya1
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptNishitharanjan Rout
 
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonhttgc7rh9c
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesSHIVANANDaRV
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Simple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfSimple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfstareducators107
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 
PANDITA RAMABAI- Indian political thought GENDER.pptx
PANDITA RAMABAI- Indian political thought GENDER.pptxPANDITA RAMABAI- Indian political thought GENDER.pptx
PANDITA RAMABAI- Indian political thought GENDER.pptxakanksha16arora
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfPondicherry University
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfNirmal Dwivedi
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsNbelano25
 

Recently uploaded (20)

What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptx
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food Additives
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Simple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfSimple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
PANDITA RAMABAI- Indian political thought GENDER.pptx
PANDITA RAMABAI- Indian political thought GENDER.pptxPANDITA RAMABAI- Indian political thought GENDER.pptx
PANDITA RAMABAI- Indian political thought GENDER.pptx
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
 

Module-14-1-Characterstics of a good test-Reliability,Validity....pdf

  • 1. 23-11-2015 1 Characteristics of a Good Test VALIDITY The term validity refers to whether or not a test measures what it intends to measure. On a test with high validity the items will be closely linked to the test’s intended focus. For many certification and licensure tests this means that the items will be highly related to a specific job or occupation. If a test has poor validity then it does not measure the job-related content and competencies it ought to. There are several ways to estimate the validity of a test, including content validity, construct validity, criterion- related validity (concurrent & predictive) and face validity. Compiled By:Dr.V.Singh
  • 2. 23-11-2015 2 VALIDITY  Content”: related to objectives and their sampling.  “Construct”: referring to the theory underlying the target.  “Criterion”: related to concrete criteria in the real world. It can be concurrent or predictive.  “Concurrent”: correlating high with another measure already validated.  “Predictive”: Capable of anticipating some later measure.  “Face”: related to the test overall appearance. Compiled By:Dr.V.Singh 1. CONTENT VALIDITY Content validity refers to the connections between the test items and the subject-related tasks. The test should evaluate only the content related to the field of study in a manner sufficiently representative, relevant, and comprehensible. Compiled By:Dr.V.Singh
  • 3. 23-11-2015 3 2. CONSTRUCT VALIDITY It implies using the construct correctly (concepts, ideas, notions). Construct validity seeks agreement between a theoretical concept and a specific measuring device or procedure. For example, a test of intelligence nowadays must include measures of multiple intelligences, rather than just logical-mathematical and linguistic ability measures. Compiled By:Dr.V.Singh 3. CRITERION-RELATED VALIDITY Also referred to as instrumental validity, it states that the criteria should be clearly defined by the teacher in advance. It has to take into account other teachers´ criteria to be standardized and it also needs to demonstrate the accuracy of a measure or procedure compared to another measure or procedure which has already been demonstrated to be valid. Compiled By:Dr.V.Singh
  • 4. 23-11-2015 4 4. CONCURRENT VALIDITY Concurrent validity is a statistical method using correlation, rather than a logical method. Examinees who are known to be either masters or non- masters on the content measured by the test are identified before the test is administered. Once the tests have been scored, the relationship between the examinees’ status as either masters or non-masters and their performance (i.e., pass or fail) is estimated based on the test. This type of validity provides evidence that the test is classifying examinees correctly. The stronger the correlation is, the greater the concurrent validity of the test is. Compiled By:Dr.V.Singh 5. PREDICTIVE VALIDITY This is another statistical approach to validity that estimates the relationship of test scores to an examinee's future performance as a master or non- master. Predictive validity considers the question, "How well does the test predict examinees' future status as masters or non-masters?" For this type of validity, the correlation that is computed is based on the test results and the examinee’s later performance. This type of validity is especially useful for test purposes such as selection or admissions. Compiled By:Dr.V.Singh
  • 5. 23-11-2015 5 6. FACE VALIDITY Like content validity, face validity is determined by a review of the items and not through the use of statistical analyses. Unlike content validity, face validity is not investigated through formal procedures. Instead, anyone who looks over the test, including examinees, may develop an informal opinion as to whether or not the test is measuring what it is supposed to measure. While it is clearly of some value to have the test appear to be valid, face validity alone is insufficient for establishing that the test is measuring what it claims to measure. Compiled By:Dr.V.Singh RELIABILITY Reliability is the extent to which an experiment, test, or any measuring procedure shows the same result on repeated trials. Without the agreement of independent observers able to replicate research procedures, or the ability to use research tools and procedures that produce consistent measurements, researchers would be unable to satisfactorily draw conclusions, formulate theories, or make claims about the generalizability of their research. For researchers, four key types of reliability are: Compiled By:Dr.V.Singh
  • 6. 23-11-2015 6 RELIABILITY  “Equivalency”: related to the co-occurrence of two items  “Stability”: related to time consistency  “Internal”: related to the instruments  “Inter-rater”: related to the examiners’ criterion  “Intra-rater”: related to the examiners’ criterion Compiled By:Dr.V.Singh 1. EQUIVALENCY RELIABILITY Equivalency reliability is the extent to which two items measure identical concepts at an identical level of difficulty. Equivalency reliability is determined by relating two sets of test scores to one another to highlight the degree of relationship or association. For example, a researcher studying university English students happened to notice that when some students were studying for finals, they got sick. Intrigued by this, the researcher attempted to observe how often, or to what degree, these two behaviors co- occurred throughout the academic year. The researcher used the results of the observations to assess the correlation between “studying throughout the academic year” and “getting sick”. The researcher concluded there was poor equivalency reliability between the two actions. In other words, studying was not a reliable predictor of getting sick. Compiled By:Dr.V.Singh
  • 7. 23-11-2015 7 2. STABILITY RELIABILITY Stability reliability (sometimes called test, re- test reliability) is the agreement of measuring instruments over time. To determine stability, a measure or test is repeated on the same subjects at a future date. Results are compared and correlated with the initial test to give a measure of stability. This method of evaluating reliability is appropriate only if the phenomenon that the test measures is known to be stable over the interval between assessments. The possibility of practice effects should also be taken into account. Compiled By:Dr.V.Singh 3. INTERNAL CONSISTENCY Internal consistency is the extent to which tests or procedures assess the same characteristic, skill or quality. It is a measure of the precision between the measuring instruments used in a study. This type of reliability often helps researchers interpret data and predict the value of scores and the limits of the relationship among variables. For example, analyzing the internal reliability of the items on a vocabulary quiz will reveal the extent to which the quiz focuses on the examinee’s knowledge of words. Compiled By:Dr.V.Singh
  • 8. 23-11-2015 8 4. INTER-RATER RELIABILITY Inter-rater reliability is the extent to which two or more individuals (coders or raters) agree. Inter-rater reliability assesses the consistency of how a measuring system is implemented. For example, when two or more teachers use a rating scale with which they are rating the students’ oral responses in an interview (1 being most negative, 5 being most positive). If one researcher gives a "1" to a student response, while another researcher gives a "5," obviously the inter-rater reliability would be inconsistent. Inter- rater reliability is dependent upon the ability of two or more individuals to be consistent. Training, education and monitoring skills can enhance inter-rater reliability. Compiled By:Dr.V.Singh 4. INTRA-RATER RELIABILITY Intra-rater reliability is a type of reliability assessment in which the same assessment is completed by the same rater on two or more occasions. These different ratings are then compared, generally by means of correlation. Since the same individual is completing both assessments, the rater's subsequent ratings are contaminated by knowledge of earlier ratings. Compiled By:Dr.V.Singh
  • 9. 23-11-2015 9 SOURCES OF ERROR  Examinee (is a human being)  Examiner (is a human being)  Examination (is designed by and for human beings) Compiled By:Dr.V.Singh RELATIONSHIP BETWEEN VALIDITY & RELIABILITY Validity and reliability are closely related. A test cannot be considered valid unless the measurements resulting from it are reliable. Likewise, results from a test can be reliable and not necessarily valid. Compiled By:Dr.V.Singh
  • 10. 23-11-2015 10 PRACTICALITY It refers to the economy of time, effort and money in testing. In other words, a test should be…  Easy to design  Easy to administer  Easy to mark  Easy to interpret (the results) Compiled By:Dr.V.Singh BACKWASH EFFECT Backwash effect (also known as washback) is the influence of testing on teaching and learning. It is also the potential impact that the form and content of a test may have on learners’ conception of what is being assessed (language proficiency) and what it involves. Therefore, test designers, delivers and raters have a particular responsibility, considering that the testing process may have a substantial impact, either positive or negative. Compiled By:Dr.V.Singh
  • 11. 23-11-2015 11 LEVELS OF BACKWASH It is believed that backwash is a subset of a test’s impact on society, educational systems and individuals. Thus, test impact operates at two levels:  The micro level (the effect of the test on individual students and teachers)  The macro level (the impact of the test on society and the educational system) Bachman and Palmer (1996) Compiled By:Dr.V.Singh  Usability(easy to administer, scoring, interpretation and application, availability etc.)  Objectivity Compiled By:Dr.V.Singh
  • 12. 23-11-2015 12 Item analysis  The difficulty level of the item  The discrimination power of the item  The effectiveness of alternative Compiled By:Dr.V.Singh THANKS Compiled By:Dr.V.Singh