This was a volunteer work in order to summerize those relevant points for me, my partner and classmates.
Based on Prof. Glenn Fulcher's videos extracted from his own website:
http://languagetesting.info/video/main.html
Basically, we concentrated our work on the following videos:
Pretesting, J. D. Brown.
Vocabulary, John Read.
Reading, Caroline Clapham.
Speaking, Glenn Fulcher.
Listening, Gary Buck.
Item writing, Charles Stansfield.
Writing, Liz Hamp-Lyons.
1) The document describes a workshop used to help teachers assess students' speaking skills by making them more aware of the different criteria they could use and how the criteria should depend on the testing context.
2) The workshop involves teachers viewing student speaking samples, discussing what criteria affect their evaluations, being introduced to a list of common criteria, and examining how the criteria selection depends on factors like the test purpose and administration process.
3) The goal is to improve the validity and reliability of speaking assessments by making the criteria choices and weightings more explicit and tailored to the testing context.
This document discusses various methods for assessing speaking ability, including imitative, intensive, responsive, interactive, and extensive speaking tasks. It provides examples and descriptions of assessment tasks like word repetition, picture description, role plays, interviews, presentations and story retelling. Scoring methods are outlined for tasks like repetition, read-alouds, interviews and presentations. The document emphasizes specifying assessment criteria and eliciting optimal speaking outputs for reliable evaluation.
The document discusses assessing listening skills. It defines listening as an active process involving both linguistic and non-linguistic knowledge. Listening assessment is important because the act of listening cannot be observed. The document outlines different types of listening like intensive, extensive, selective, and responsive. It provides examples of assessment tasks that can measure various dimensions of listening including phonological recognition, paraphrasing, answering questions, note-taking, and retelling stories.
This document discusses different methods for testing vocabulary. It begins by defining vocabulary as the body of words used in a particular language and explores why vocabulary should be tested, such as for feedback and motivation. The document then examines different formats for testing vocabulary recognition, such as multiple choice, matching, and gap filling questions. It also looks at formats for testing vocabulary production, like transformation and table completion items. The document concludes that vocabulary is complex and individuals may know words in different ways based on their various associations and connections to the word.
This document discusses various methods for testing English grammar. It begins by asking what aspects of grammar should be measured and whether the focus should be on structures, functions, usage, or descriptive vs. prescriptive rules. Common test formats are then outlined, including recognition through multiple choice, true/false, and gap-filling questions, as well as production through sentence building and editing exercises. The document recommends providing context and measuring a test-taker's ability to communicate grammatically for different audiences and tasks. Sample grammar testing activities are described, like split sentences, describing pictures, and story-building to evaluate understanding of various tenses and sentence construction.
This document discusses testing grammar and vocabulary in language assessments. It begins by outlining reasons for testing grammar, including that grammatical ability limits other skills and it helps diagnose gaps for teachers and learners. It then provides examples of different item types for testing grammar, including gap filling, paraphrase, completion, and multiple choice. Scoring considerations for production-based grammar tests are mentioned. The document also discusses testing vocabulary and provides examples of item types, such as recognizing synonyms, definitions, and filling gaps. Pictures and definitions are provided as examples for production-based vocabulary testing.
The document provides an analysis of Section 2 of the TOEFL test, which assesses grammar. It discusses the test's purpose, date of application, practical considerations like administration and scoring, test construction, sample questions, analysis of grammar points and question types covered, reliabilities, validity, strengths, and concludes that the TOEFL is the most reliable standardized test for measuring non-native English speakers' university-level abilities.
Language testing (Testing Vocabulary)-Adapted from MadsenMelva Simangunsong
The document discusses different types of vocabulary tests:
1. Limited response tests involve students giving simple physical or verbal responses to demonstrate understanding of vocabulary words. They are good for beginners and avoid skills like reading/writing.
2. Multiple choice completion tests present sentences with a missing word and students select the correct word from options. They provide context but can be difficult to construct and allow cheating.
3. Multiple choice paraphrase tests underline a word in a sentence and students choose its closest meaning from options. They are easier to score but hard to find good synonyms.
4. Simple completion tests require students to write missing words that appear in example sentences. They reflect teaching approaches but fewer words can be tested
1) The document describes a workshop used to help teachers assess students' speaking skills by making them more aware of the different criteria they could use and how the criteria should depend on the testing context.
2) The workshop involves teachers viewing student speaking samples, discussing what criteria affect their evaluations, being introduced to a list of common criteria, and examining how the criteria selection depends on factors like the test purpose and administration process.
3) The goal is to improve the validity and reliability of speaking assessments by making the criteria choices and weightings more explicit and tailored to the testing context.
This document discusses various methods for assessing speaking ability, including imitative, intensive, responsive, interactive, and extensive speaking tasks. It provides examples and descriptions of assessment tasks like word repetition, picture description, role plays, interviews, presentations and story retelling. Scoring methods are outlined for tasks like repetition, read-alouds, interviews and presentations. The document emphasizes specifying assessment criteria and eliciting optimal speaking outputs for reliable evaluation.
The document discusses assessing listening skills. It defines listening as an active process involving both linguistic and non-linguistic knowledge. Listening assessment is important because the act of listening cannot be observed. The document outlines different types of listening like intensive, extensive, selective, and responsive. It provides examples of assessment tasks that can measure various dimensions of listening including phonological recognition, paraphrasing, answering questions, note-taking, and retelling stories.
This document discusses different methods for testing vocabulary. It begins by defining vocabulary as the body of words used in a particular language and explores why vocabulary should be tested, such as for feedback and motivation. The document then examines different formats for testing vocabulary recognition, such as multiple choice, matching, and gap filling questions. It also looks at formats for testing vocabulary production, like transformation and table completion items. The document concludes that vocabulary is complex and individuals may know words in different ways based on their various associations and connections to the word.
This document discusses various methods for testing English grammar. It begins by asking what aspects of grammar should be measured and whether the focus should be on structures, functions, usage, or descriptive vs. prescriptive rules. Common test formats are then outlined, including recognition through multiple choice, true/false, and gap-filling questions, as well as production through sentence building and editing exercises. The document recommends providing context and measuring a test-taker's ability to communicate grammatically for different audiences and tasks. Sample grammar testing activities are described, like split sentences, describing pictures, and story-building to evaluate understanding of various tenses and sentence construction.
This document discusses testing grammar and vocabulary in language assessments. It begins by outlining reasons for testing grammar, including that grammatical ability limits other skills and it helps diagnose gaps for teachers and learners. It then provides examples of different item types for testing grammar, including gap filling, paraphrase, completion, and multiple choice. Scoring considerations for production-based grammar tests are mentioned. The document also discusses testing vocabulary and provides examples of item types, such as recognizing synonyms, definitions, and filling gaps. Pictures and definitions are provided as examples for production-based vocabulary testing.
The document provides an analysis of Section 2 of the TOEFL test, which assesses grammar. It discusses the test's purpose, date of application, practical considerations like administration and scoring, test construction, sample questions, analysis of grammar points and question types covered, reliabilities, validity, strengths, and concludes that the TOEFL is the most reliable standardized test for measuring non-native English speakers' university-level abilities.
Language testing (Testing Vocabulary)-Adapted from MadsenMelva Simangunsong
The document discusses different types of vocabulary tests:
1. Limited response tests involve students giving simple physical or verbal responses to demonstrate understanding of vocabulary words. They are good for beginners and avoid skills like reading/writing.
2. Multiple choice completion tests present sentences with a missing word and students select the correct word from options. They provide context but can be difficult to construct and allow cheating.
3. Multiple choice paraphrase tests underline a word in a sentence and students choose its closest meaning from options. They are easier to score but hard to find good synonyms.
4. Simple completion tests require students to write missing words that appear in example sentences. They reflect teaching approaches but fewer words can be tested
This document discusses methods for assessing speaking ability. It outlines structured and observational approaches, with the structured approach involving tasks to evaluate specific oral skills and the observational approach unobtrusively observing natural speech. Rating systems can be holistic, providing an overall impression, or analytic, evaluating various communication aspects. Factors like linguistic competence, functional competence, and sociolinguistic competence are also discussed. The document provides details on testing speaking such as format, content, performance conditions, stimulus, timing, scoring, and considers valid assessment formats. Principles of assessment speaking are outlined related to reliability, validity, practicality, authenticity, approach, and wash back.
1. The document discusses aspects of making and testing vocabulary, including considerations for vocabulary tests and different techniques.
2. Key aspects that should be tested include a student's breadth and depth of vocabulary knowledge, as well as their understanding of word meanings, pronunciations, grammatical functions, collocations and contexts for use.
3. Different techniques for testing vocabulary are described, such as multiple choice questions, fill-in-the-blank exercises using target words in context, and Cloze tests where students supply missing words.
The document outlines the design of a speaking test for lower-intermediate English students in Chile. It includes 3 sections: [1] individual questions about pictures, [2] a discussion between two students, and [3] a role-play asking for and giving directions. The test assesses students' fluency, grammar, pronunciation, and vocabulary when giving directions in English. Sample materials include pictures, tasks, a rubric, and a student performance sample. Minor revisions are proposed to clarify the tasks and make them more realistic.
The document discusses various methods for assessing speaking ability, divided into 5 basic types - imitative, intensive, responsive, interactive, and extensive. It then provides details on designing assessment tasks for each type. For imitative speaking, tasks include repetition and read-alouds. Intensive tasks include directed responses, fill-in-the-blank dialogs, and picture prompts. Responsive tasks involve questions and answers or giving instructions. Interactive tasks include interviews, role-plays, and discussions. Extensive tasks include oral presentations, storytelling, and translations. The document emphasizes matching tasks to objectives and establishing reliable scoring criteria.
By watching this Power Point presentation, you'll acquire the necessary tools as well as basic information that is needed whenever you want to evaluate Vocabulary.
The document discusses various formats for testing grammar, including recognition (e.g. multiple choice, true/false) and production (e.g. sentence building, transformation). It notes advantages and disadvantages of different formats, and suggests they should provide context, require communicative responses to demonstrate meaning, and test production abilities, not just recognition. Picture-based and story-based activities are proposed to integrate testing grammar with communicative skills.
This document summarizes two articles about vocabulary assessment: Testing and Teaching Vocabulary by Paul Nation, and Second language vocabulary assessment: Current practices and new directions by John Read.
The Nation article discusses statistical methods for counting words in texts and determining vocabulary frequency. It also describes developing a vocabulary levels test with matching and definition items. The Read article reviews purposes of vocabulary size measurement and sampling high-frequency words from lists. It discusses the yes/no test format and efforts to measure depth of vocabulary knowledge through association tasks and scales. The article concludes by examining ways to assess vocabulary use in context.
The document discusses designing assessment tasks for intensive and responsive listening. For intensive listening, it provides examples of tasks that test recognizing phonological elements, morphological elements, stress patterns, single words, and paraphrasing sentences and dialogues. For responsive listening, it gives examples that assess responding appropriately to questions and open-ended responses to questions, noting the benefits and challenges of each.
The document discusses various formats for testing grammar, including recognition (e.g. multiple choice, true/false) and production (e.g. sentence building, transformation). It notes advantages and disadvantages of different formats, and suggests they should provide context, require communicative responses to demonstrate meaning, and test production abilities, not just recognition. Picture-based and story-based activities are proposed to integrate testing of grammar knowledge and skills.
The document discusses testing vocabulary and different formats for doing so. It describes what vocabulary is and how words are learned and organized in the mind. Various test formats are described such as multiple choice, matching, gap filling, and cloze tests. The most common recognition and production formats are outlined. Considerations for when to use each type are provided.
From the CALPER/LARC Testing and Assessment Webinar Series
Download the handouts: https://larc.sdsu.edu/archived-events/
View the recording: http://vimeo.com/63784749
This document discusses testing vocabulary. It begins by asking what vocabulary is and how words are learned. It then examines different formats for testing vocabulary, including multiple choice, matching, gap filling, and cloze tests. The document notes advantages and disadvantages of different formats. For example, multiple choice is easy to score but limits knowledge to options, while gap filling tests production ability but answers may have multiple possibilities. It emphasizes the importance of testing vocabulary within meaningful contexts and communicatively. In the end, it concludes that knowing a word involves a complex network of associations across different aspects of language.
This document discusses testing listening skills in language learning. It explains that listening tests assess how well students have mastered what has been taught and that listening is a key receptive skill for language proficiency. There are two main types of listening tests - discrete tests that evaluate specific listening elements or aspects of proficiency one at a time, and task-based tests that evaluate broader communicative listening skills like comprehending facts or interpreting intentions. Important considerations for developing listening tests include selecting appropriate texts and tasks, determining the number of speakers, choosing a live or recorded format, minimizing writing, and using multiple choice or other item types. Sample test formats described include dictation, following map directions, comprehending radio reports or talks, and answering comprehension questions
Topic 6 Assessing Language Skills and ContentYee Bee Choo
The document discusses different types of language skills assessment including listening, speaking, reading, and writing. It describes several categories within each skill:
- Listening is categorized into intensive, responsive, selective, and extensive tests to assess different listening abilities.
- Speaking assessments include imitative, intensive, responsive, interactive (transactional and interpersonal), and extensive tests.
- Reading tests focus on perceptive, selective, interactive, and extensive levels to measure bottom-up and top-down comprehension.
- Writing is initially assessed through imitative and intensive controlled tasks before moving to more open-ended responses.
Using Technology in the Spanish and French classroomMmeNero
This document provides an overview of how Shauna Néro, a French Immersion and Spanish teacher, uses technology in her language classrooms. Some key points:
- She uses technology to increase student engagement, empower digital native students, and allow for differentiated learning styles.
- Suggestions from other teachers on Twitter include using technology for oral communication practice, accommodating students with disabilities, and improving pronunciation.
- Néro describes various online tools and activities she uses, such as Edmodo, Voicethread, Wordle, Jeopardy, videos, and websites for exposing students to the target language and culture.
- She provides examples of assessing student oral skills development through recorded podcast
The document discusses various methods for assessing speaking skills in tests. It describes a continuum of speaking tasks from imitative to extensive. Imitative tasks involve simple repetition, while intensive tasks assess narrow grammatical skills through short responses. Responsive tasks involve simple interactions, and interactive tasks have longer, more complex exchanges. Extensive speaking includes speeches and presentations with limited feedback. The document also discusses techniques like role plays, interviews and games to assess speaking at different levels.
This document discusses assessing grammar and vocabulary in language assessment. It covers three main points:
1. Assessing grammar involves evaluating correct grammatical form, meaning, and use in communication. Grammatical competence has four components: grammatical, sociolinguistic, discourse, and strategic.
2. Various assessment tasks for grammar are described, including selected response, limited production, and extended production tasks.
3. Assessing vocabulary involves evaluating knowledge of words, phrases, and their appropriate use. Receptive and productive vocabulary tasks are outlined.
The document discusses assessing speaking skills in students. It begins by defining speaking and listing reasons to assess it, such as helping students and teachers understand strengths and weaknesses. It then covers what speaking involves, including micro and macro factors. The document provides guidance on planning a speaking assessment, such as choosing learning outcomes, designing a test or activity, developing a rubric, and considering the testing environment. Sample rubrics are included that assess pronunciation, vocabulary, accuracy, fluency, non-verbal communication skills, and ability to communicate. The document concludes by discussing next steps after assessment, such as remedial planning, feedback, and analyzing results data.
This document discusses strategies for assessing speaking skills in a second language. It describes the characteristics of spoken language, such as variation in speed, loudness, gestures, intonation, stress, rhythm, and pausing. It presents several methods for assessing speaking, including role plays, interviews, oral presentations, picture-cued story telling, and translations. Role plays can be structured or unstructured. Interviews involve warming up, level-checking, probing, and winding down. It's important to determine the appropriate criteria and tasks for assessing fluency versus accuracy in speaking.
Jeffrey Fulcher enjoys several hobbies including sports, movies, music, and books. His interests span multiple entertainment genres from athletic activities to films and music as well as written works of fiction. Overall, Jeffrey seems to have a diverse set of leisure pursuits that allow him to be actively engaged with different forms of media and pastimes.
The document discusses the history and development of artificial intelligence over the past several decades. It describes early work in the field starting in the 1950s and progress in areas such as problem solving, game playing, and machine learning. Recent advances in deep learning now allow AI systems to perform complex tasks like visual perception and language translation.
This document discusses methods for assessing speaking ability. It outlines structured and observational approaches, with the structured approach involving tasks to evaluate specific oral skills and the observational approach unobtrusively observing natural speech. Rating systems can be holistic, providing an overall impression, or analytic, evaluating various communication aspects. Factors like linguistic competence, functional competence, and sociolinguistic competence are also discussed. The document provides details on testing speaking such as format, content, performance conditions, stimulus, timing, scoring, and considers valid assessment formats. Principles of assessment speaking are outlined related to reliability, validity, practicality, authenticity, approach, and wash back.
1. The document discusses aspects of making and testing vocabulary, including considerations for vocabulary tests and different techniques.
2. Key aspects that should be tested include a student's breadth and depth of vocabulary knowledge, as well as their understanding of word meanings, pronunciations, grammatical functions, collocations and contexts for use.
3. Different techniques for testing vocabulary are described, such as multiple choice questions, fill-in-the-blank exercises using target words in context, and Cloze tests where students supply missing words.
The document outlines the design of a speaking test for lower-intermediate English students in Chile. It includes 3 sections: [1] individual questions about pictures, [2] a discussion between two students, and [3] a role-play asking for and giving directions. The test assesses students' fluency, grammar, pronunciation, and vocabulary when giving directions in English. Sample materials include pictures, tasks, a rubric, and a student performance sample. Minor revisions are proposed to clarify the tasks and make them more realistic.
The document discusses various methods for assessing speaking ability, divided into 5 basic types - imitative, intensive, responsive, interactive, and extensive. It then provides details on designing assessment tasks for each type. For imitative speaking, tasks include repetition and read-alouds. Intensive tasks include directed responses, fill-in-the-blank dialogs, and picture prompts. Responsive tasks involve questions and answers or giving instructions. Interactive tasks include interviews, role-plays, and discussions. Extensive tasks include oral presentations, storytelling, and translations. The document emphasizes matching tasks to objectives and establishing reliable scoring criteria.
By watching this Power Point presentation, you'll acquire the necessary tools as well as basic information that is needed whenever you want to evaluate Vocabulary.
The document discusses various formats for testing grammar, including recognition (e.g. multiple choice, true/false) and production (e.g. sentence building, transformation). It notes advantages and disadvantages of different formats, and suggests they should provide context, require communicative responses to demonstrate meaning, and test production abilities, not just recognition. Picture-based and story-based activities are proposed to integrate testing grammar with communicative skills.
This document summarizes two articles about vocabulary assessment: Testing and Teaching Vocabulary by Paul Nation, and Second language vocabulary assessment: Current practices and new directions by John Read.
The Nation article discusses statistical methods for counting words in texts and determining vocabulary frequency. It also describes developing a vocabulary levels test with matching and definition items. The Read article reviews purposes of vocabulary size measurement and sampling high-frequency words from lists. It discusses the yes/no test format and efforts to measure depth of vocabulary knowledge through association tasks and scales. The article concludes by examining ways to assess vocabulary use in context.
The document discusses designing assessment tasks for intensive and responsive listening. For intensive listening, it provides examples of tasks that test recognizing phonological elements, morphological elements, stress patterns, single words, and paraphrasing sentences and dialogues. For responsive listening, it gives examples that assess responding appropriately to questions and open-ended responses to questions, noting the benefits and challenges of each.
The document discusses various formats for testing grammar, including recognition (e.g. multiple choice, true/false) and production (e.g. sentence building, transformation). It notes advantages and disadvantages of different formats, and suggests they should provide context, require communicative responses to demonstrate meaning, and test production abilities, not just recognition. Picture-based and story-based activities are proposed to integrate testing of grammar knowledge and skills.
The document discusses testing vocabulary and different formats for doing so. It describes what vocabulary is and how words are learned and organized in the mind. Various test formats are described such as multiple choice, matching, gap filling, and cloze tests. The most common recognition and production formats are outlined. Considerations for when to use each type are provided.
From the CALPER/LARC Testing and Assessment Webinar Series
Download the handouts: https://larc.sdsu.edu/archived-events/
View the recording: http://vimeo.com/63784749
This document discusses testing vocabulary. It begins by asking what vocabulary is and how words are learned. It then examines different formats for testing vocabulary, including multiple choice, matching, gap filling, and cloze tests. The document notes advantages and disadvantages of different formats. For example, multiple choice is easy to score but limits knowledge to options, while gap filling tests production ability but answers may have multiple possibilities. It emphasizes the importance of testing vocabulary within meaningful contexts and communicatively. In the end, it concludes that knowing a word involves a complex network of associations across different aspects of language.
This document discusses testing listening skills in language learning. It explains that listening tests assess how well students have mastered what has been taught and that listening is a key receptive skill for language proficiency. There are two main types of listening tests - discrete tests that evaluate specific listening elements or aspects of proficiency one at a time, and task-based tests that evaluate broader communicative listening skills like comprehending facts or interpreting intentions. Important considerations for developing listening tests include selecting appropriate texts and tasks, determining the number of speakers, choosing a live or recorded format, minimizing writing, and using multiple choice or other item types. Sample test formats described include dictation, following map directions, comprehending radio reports or talks, and answering comprehension questions
Topic 6 Assessing Language Skills and ContentYee Bee Choo
The document discusses different types of language skills assessment including listening, speaking, reading, and writing. It describes several categories within each skill:
- Listening is categorized into intensive, responsive, selective, and extensive tests to assess different listening abilities.
- Speaking assessments include imitative, intensive, responsive, interactive (transactional and interpersonal), and extensive tests.
- Reading tests focus on perceptive, selective, interactive, and extensive levels to measure bottom-up and top-down comprehension.
- Writing is initially assessed through imitative and intensive controlled tasks before moving to more open-ended responses.
Using Technology in the Spanish and French classroomMmeNero
This document provides an overview of how Shauna Néro, a French Immersion and Spanish teacher, uses technology in her language classrooms. Some key points:
- She uses technology to increase student engagement, empower digital native students, and allow for differentiated learning styles.
- Suggestions from other teachers on Twitter include using technology for oral communication practice, accommodating students with disabilities, and improving pronunciation.
- Néro describes various online tools and activities she uses, such as Edmodo, Voicethread, Wordle, Jeopardy, videos, and websites for exposing students to the target language and culture.
- She provides examples of assessing student oral skills development through recorded podcast
The document discusses various methods for assessing speaking skills in tests. It describes a continuum of speaking tasks from imitative to extensive. Imitative tasks involve simple repetition, while intensive tasks assess narrow grammatical skills through short responses. Responsive tasks involve simple interactions, and interactive tasks have longer, more complex exchanges. Extensive speaking includes speeches and presentations with limited feedback. The document also discusses techniques like role plays, interviews and games to assess speaking at different levels.
This document discusses assessing grammar and vocabulary in language assessment. It covers three main points:
1. Assessing grammar involves evaluating correct grammatical form, meaning, and use in communication. Grammatical competence has four components: grammatical, sociolinguistic, discourse, and strategic.
2. Various assessment tasks for grammar are described, including selected response, limited production, and extended production tasks.
3. Assessing vocabulary involves evaluating knowledge of words, phrases, and their appropriate use. Receptive and productive vocabulary tasks are outlined.
The document discusses assessing speaking skills in students. It begins by defining speaking and listing reasons to assess it, such as helping students and teachers understand strengths and weaknesses. It then covers what speaking involves, including micro and macro factors. The document provides guidance on planning a speaking assessment, such as choosing learning outcomes, designing a test or activity, developing a rubric, and considering the testing environment. Sample rubrics are included that assess pronunciation, vocabulary, accuracy, fluency, non-verbal communication skills, and ability to communicate. The document concludes by discussing next steps after assessment, such as remedial planning, feedback, and analyzing results data.
This document discusses strategies for assessing speaking skills in a second language. It describes the characteristics of spoken language, such as variation in speed, loudness, gestures, intonation, stress, rhythm, and pausing. It presents several methods for assessing speaking, including role plays, interviews, oral presentations, picture-cued story telling, and translations. Role plays can be structured or unstructured. Interviews involve warming up, level-checking, probing, and winding down. It's important to determine the appropriate criteria and tasks for assessing fluency versus accuracy in speaking.
Jeffrey Fulcher enjoys several hobbies including sports, movies, music, and books. His interests span multiple entertainment genres from athletic activities to films and music as well as written works of fiction. Overall, Jeffrey seems to have a diverse set of leisure pursuits that allow him to be actively engaged with different forms of media and pastimes.
The document discusses the history and development of artificial intelligence over the past several decades. It describes early work in the field starting in the 1950s and progress in areas such as problem solving, game playing, and machine learning. Recent advances in deep learning now allow AI systems to perform complex tasks like visual perception and language translation.
This document lists publications by E.P. Fulcher spanning from 2001-2016 that focus on understanding emotional and affective responses to various stimuli like advertising, customer service, and road safety using implicit measures and neuromarketing techniques. Key areas of research include evaluative conditioning without contingency awareness, attentional bias to emotional information, implicit measures of brand engagement, and neuroscience methods for understanding consumer packaging perceptions. The publications appear in journals related to marketing, psychology, and transportation research.
Standardized testing can take two forms: norm-referenced which compares test takers to each other, and criterion-referenced which determines if an individual has achieved a specified standard. Norm-referenced testing aims to discriminate between test takers in order to distribute scarce resources like university places. It became popular during WWI when psychological testing was used to contribute to the war effort. Proponents viewed testing as a scientific process of quantifying and measuring abilities. However, others argue that defining and measuring constructs like traits is problematic. Test scores are distributed along a normal curve and take on meaning based on their position within that distribution compared to other test takers. Reliability ensures test scores are consistent over time without instruction.
The document discusses various approaches to classroom assessment including assessment for learning, dynamic assessment, self-assessment, and peer assessment. It provides details on each approach and compares them with standardized testing. Assessment for learning focuses on formative assessment and using assessment to improve teaching and learning. Dynamic assessment is based on Vygotsky's sociocultural theory and involves teachers mediating learning through interaction. Self-assessment and peer assessment help students evaluate their own and others' work to identify gaps and improve learning.
This document discusses key concepts in language testing and standardized tests. It covers two testing paradigms: norm-referenced testing which compares test takers to a group, and criterion-referenced testing which evaluates if a criterion is met. The document also defines standardized tests, explores why testing is viewed as a science, and examines concepts like central tendency, dispersion, reliability, and ways to calculate reliability such as using correlations and item analysis.
Practical Language Testing Glenn Fulchertranslatoran
Practical LanguageTesting
Glenn Fulcher
Specifications for testing and teaching.
A sample detailed specification for a
reading test.
In this section we present an example of an architecture for a reading test. This includes
the test framework that presents the test purpose, the target test takers, the criterion
domain and the rationale for the test content. The architecture is annotated with explanations
in text boxes. This is a detailed test specification. The complexities of coding
in test specifications of this kind are usually necessary in the design and assembly of
high-stakes tests where it is essential to achieve parallel forms. There are problems with
this type of specification for use in classroom assessment, which we deal with in Section
4 below.
This document discusses different approaches to classroom assessment, including assessment for learning, self-assessment, peer assessment, dynamic assessment, and the link between assessment and theories of second language acquisition. It provides details on key aspects of each approach, such as the benefits of formative feedback and longer wait times in assessment for learning. Dynamic assessment is discussed as being interactionist and focused on mediated learning experiences to help students progress to the next stage. Issues around subjectivity in self-assessment and the need for training students are also summarized.
The document discusses several models of communicative competence:
1. Canale and Swain's model includes grammatical, sociolinguistic, and strategic competence. It emphasizes the demonstration of knowledge in actual performance.
2. Bachman's model distinguishes knowledge from skills. It includes language competence, strategic competence, and psychophysiological mechanisms.
3. Celce-Murcia et al.'s model has five components: discourse competence, actional competence, sociocultural competence, linguistic competence, and strategic competence. It aims to generate detailed content for curriculum design.
The document provides an overview of key elements of APA style papers including sections, reference lists, internal documentation, and formatting guidelines. It discusses topics such as outlining papers, selecting verb tense, pronoun usage, and punctuation conventions. Guidelines are provided for reference list entries for different publication types as well as constructing in-text citations and handling personal communications.
The document provides an overview of test specifications and how to write test items and tasks. It discusses:
1. Test specifications (specs) guide the creation of test content and help ensure equivalence, reliability, and validity. Specs describe how to structure tests and make difficult authoring choices.
2. Effective test development is iterative and spec-driven. Specs evolve as tests are refined through discussion. Items and tasks should be written to fit evolving specs rather than independently.
3. Evidence-centered design (ECD) treats knowledge as scientific and provides a systematic framework for relating test performance to constructs. ECD models guide test design from defining constructs to assembling and delivering the full test.
Psycholinguistics is the study of the cognitive processes involved in language acquisition and use. Researchers have developed models to describe and predict specific language behaviors in bilinguals. One influential model is Levelt's speaking model, which describes the process of language production from intention to articulation of sounds. Paradis proposed the subset hypothesis based on research with bilingual aphasia patients. Language choice in speaking involves matching chunks from a pre-verbal message to the meaning and language of the appropriate lemma. Experimental studies examine how processing reflects a word, sentence, or text's structure and meaning in comprehension and production.
The document discusses the linguistic determinism theory proposed by Edward Sapir and Benjamin Lee Whorf, known as the Sapir-Whorf hypothesis. It states that the structure of a language affects its speakers' worldview or cognition. Specifically, it introduces the idea that language determines or influences thought, that language categories and structures condition the ways in which people think, and that speakers of different languages think and perceive reality in different ways. The document uses the example of Eskimo languages having different words for types of snow to illustrate how languages can categorize the world differently.
This document discusses the relationship between language and thought. It presents several perspectives on this relationship:
- Classical theorists like Plato argued that thought determines language, while others like J.B. Watson believed thought is language.
- Edward Sapir and Benjamin Lee Whorf proposed the Sapir-Whorf hypothesis, which suggests language influences or determines thought. For example, the structure of a language influences how speakers of that language view the world.
- There is evidence that different languages categorize and describe concepts like time, colors, and snow differently, potentially influencing thought. However, critics argue linguistic determinism is too strong and thought is not entirely constrained by language.
- The relationship is complex
1. The document discusses the Sapir-Whorf hypothesis, which proposes that the structure of a language influences its speakers' worldview and cognition. It describes studies showing how the categories used in a language to describe colors can influence color perception.
2. The text also introduces the concepts of "etic" and "emic" categories used in anthropology. Etic categories describe objective reality while emic categories reflect a culture's subjective perceptions based on its language and beliefs.
3. Finally, it discusses how semiotics studies how signs and their relationships construct meaning. Differences in how languages use signs can lead to divergent interpretations between cultures.
Theories of PSYCHOLINGUISTICS, Language acquisition, Noam Chomsky, Jean Piaget, F. B. Skinner, Innateness theory, Behaviorist theory, Cognitive theory.
This document discusses psycholinguistics and language acquisition. It defines psycholinguistics as the study of the relationship between the human mind and language. Key aspects covered include language production, which involves conceptualization, formulation, articulation and self-monitoring. Language acquisition examines first language acquisition in children and theories like nature vs nurture. Second language acquisition looks at factors like age and motivation and compares it to first language acquisition.
This document provides an overview of psycholinguistics and related topics. It defines psycholinguistics as the study of how people acquire, use, and process language in the brain. It discusses how psycholinguistics relates to other fields like psychology, linguistics, and neuroscience. It also covers key concepts like linguistic competence vs performance, the relationship between speech sounds and meaning, and the origins of modern psycholinguistics as a field of study.
Psycholinguistics studies the mental aspects of language and speech. It examines how humans acquire, use, and understand language through psychological and neurobiological factors. The document outlines key topics in psycholinguistics, including the definition of the field, characteristics of human language, how language varies worldwide, cognitive processes involved in language, and how children learn language in stages from birth.
This document provides an overview of the course Psycholinguistics. It will examine the psychological processes involved in language, including how language is acquired and how it interacts with other cognitive systems. Some key topics that will be discussed include models of language processing, the modularity of the language system, the role of innate linguistic knowledge versus environmental learning, and controversies in the field regarding rules, neural localization of language, and applied implications. Experimental methods like priming and brain imaging techniques will be important for exploring these open questions in psycholinguistics.
Task-based learning is a language teaching method where students complete communicative tasks using the target language. It focuses on meaningful language use rather than rote learning of grammar rules. A task has three stages: pre-task introduction, task completion in groups or pairs, and post-task reporting. This allows students to experiment, focus on communication over accuracy, and then improve their language for reporting. The method is effective at intermediate levels as it motivates students and provides opportunities for negotiated interaction, language recycling, and a focus on form.
The document discusses various methods to assess pronunciation learning at different stages:
1. Pre-program assessments diagnose learners' pronunciation skills through minimal pair discrimination exercises, word stress recognition tests, and intonation pattern identification tests.
2. During-program assessments obtain speech samples through reading passages and free speech to evaluate progress and focus instruction. Ongoing evaluations include self-monitoring, peer feedback, and teacher feedback.
3. Post-program assessments include formal tests like the Test of Spoken English and the Interagency Language Roundtable Oral Proficiency Interview, which measure pronunciation through tasks like reading aloud and describing pictures.
The document proposes using the Listen-Read-Discuss (L-R-D) strategy to teach report text reading comprehension to 11th grade science students in Indonesia. It notes current difficulties with vocabulary, memorization, and distinguishing text types. The study will give pre- and post-tests to experimental and control classes to determine if L-R-D increases comprehension scores compared to traditional instruction. L-R-D involves the teacher summarizing a text, students reading it, and a discussion. The study hypothesizes L-R-D will significantly improve reading scores.
This document discusses rubrics for assessing student work. It defines a rubric as a scoring tool that lists criteria and performance levels. Five reasons for using rubrics are provided: they clarify expectations, set standards, help students take responsibility, have value for stakeholders, and tell students to do careful work. The main components of rubrics are identified as dimensions, criteria, descriptors, and a scale. A rubric template is shown. Steps for constructing rubrics are outlined, including identifying the learning goal, task, and criteria. Examples of holistic and analytic rubrics for speaking and writing are also provided. Web resources for finding and sharing rubrics are listed at the end.
The document discusses various elicitation procedures used to obtain language samples from informants, including interviews, questionnaires, production tasks, and combining methods. Common elicitation techniques involve structured, semi-structured, and unstructured interviews, as well as discourse completion tasks, role plays, and picture description tasks. Elicitation aims to explore informants' backgrounds, behaviors, opinions, and language development in a naturalistic manner.
Variable and strategy of language testing by Beny Indra Natan Nadeak, S.PdBeny Nadeak
This document discusses variables and strategies for language testing. It identifies the key elements of language - sounds, grammar, vocabulary, meanings - and skills - speaking, listening, reading, writing - that can be tested as variables. When testing, one must decide whether to focus on separate elements, integrated skills, or a combination. Group versus individual testing and objective versus subjective scoring methods also influence testing strategy and how results are evaluated. The goal is to directly test language abilities through valid assessment of its core linguistic components.
Language Assessment Strategies(lesson 2)jessaboteros
The document provides information on assessing language skills. It discusses various assessment methods including selected responses, essays, performance assessments, and personal communication. These methods can be used to evaluate skills like writing, speaking, reading comprehension, and vocabulary. The document also explains that a language assessment scheme involves designing a structured approach using multiple methods to comprehensively measure an individual's language proficiency.
This document discusses standards-based instructional practices. It notes that scores on South Dakota reading assessments have been low in certain areas. It presents hypotheses for this, including that classroom rigor may not meet assessment rigor. It introduces the Standards In Practice model for ensuring classroom assignments demonstrate proficiency in content standards. The 6-step model includes teachers completing assignments, analyzing task demands, identifying standards, creating rubrics, scoring student work, and revising instruction as needed based on results. Bloom's Taxonomy and rigor are discussed in the context of this process.
This document discusses standards-based instructional practices. It notes that scores on South Dakota reading assessments have been low in certain areas. It presents hypotheses for this, including that classroom rigor may not match assessment rigor. It introduces the Standards In Practice model, a 6-step process for ensuring classroom assignments demonstrate proficiency in content standards. Steps include completing assignments, analyzing task demands, identifying standards, creating rubrics, scoring student work, and revising instruction based on results. Bloom's Taxonomy and rigor are discussed in the context of aligning standards, instruction, and assessment.
This document discusses standards-based instructional practices. It notes that scores on South Dakota reading assessments have been low in certain areas. It presents hypotheses for this, including that classroom rigor may not match assessment rigor. It introduces the Standards In Practice model, a 6-step process for ensuring classroom assignments demonstrate proficiency in content standards. Steps include completing assignments, analyzing task demands, identifying standards, creating rubrics, scoring student work, and revising instruction based on results. Bloom's Taxonomy and rigor are discussed in the context of aligning standards, instruction, and assessment.
Assessment is becoming more prominent in both ESL and EFL programs as a teacher it is something that you will be called to do on a continual basis. The purpose of this chapter was to explore language assessment it relates to young learners.
Task-based learning involves completing meaningful tasks using various language skills. It has three main stages: 1) a pre-task introduction and preparation, 2) a task cycle of completing the task, planning a report, and reporting back, and 3) post-task language analysis and practice. This approach aims to make language learning more authentic, reflective, and motivating compared to traditional Presentation-Practice-Production lessons by basing it around real-world tasks rather than predetermined language points.
This document discusses different elements of grammar instruction. It covers explicit vs implicit grammar teaching, the benefits of raising metalinguistic awareness, and focusing on auto-input and output processing. It also discusses inductive approaches to teaching grammar on a continuum from deduction to induction. Examples are provided of guided induction lessons, including analyzing language data in context and looking at drawings to induce patterns. Students are asked to discuss guided induction lesson plans and consider how effective the approach seems.
This document discusses testing vocabulary knowledge. It defines vocabulary and the importance of vocabulary for communication and language acquisition. It also defines testing and explains that testing is used to evaluate a student's level of knowledge and skills. The document discusses why teachers test vocabulary, including to understand a student's vocabulary development and comprehension. It also outlines different forms of testing vocabulary, such as multiple choice, matching, jumbled letters, and completion items.
This document discusses types of informal assessments, including cloze procedures. It provides details on how to create and administer cloze passages by deleting words in a pattern. Cloze procedures can be used to assess specific reading comprehension skills like vocabulary. The document also discusses the development and effectiveness of the cloze test procedure for determining the readability level of texts for students.
Teachers have several options for testing students' vocabulary knowledge. Recognition items like multiple choice questions allow students to select the correct meaning of a word from options. Production items require students to produce the word, definition, or synonym. Teachers should consider using pictures, definitions, gap filling, and oral tests. Well-designed vocabulary tests use context, include enough items to be reliable, and avoid issues like guessing. Teachers have resources available to help design valid and informative vocabulary assessments.
This document discusses bees and ecosystems. It begins by asking the reader to make predictions about bees, such as whether they are important pollinators or if their decline affects food systems. It then confirms that bees are crucial pollinators, their population has declined since WWII, and their loss impacts nutrition and landscapes. The document defines an ecosystem as all living and non-living things in an area that interact, giving the example of a puddle ecosystem. It notes that if one part of the system is disrupted, it impacts other parts, and hints that human agriculture problems could cause human extinction by disrupting ecosystems.
This document describes several classic children's party games such as hide-and-seek, tag, musical chairs, and pass the parcel. In hide-and-seek, one person counts while hiding and then tries to find the others. Tag involves one person chasing others to tag them and make them "it." Musical chairs has one fewer chair than players and those who don't find a seat when the music stops are out. Pass the parcel passes a parcel around a circle while music plays and whoever holds it when the music stops is out. The document encourages revisiting these old-fashioned games for fun.
El documento describe la taxonomía de Robert Marzano sobre los procesos mentales del sistema cognitivo. La taxonomía incluye el sistema de conciencia del ser, el sistema de metacognición, el sistema de cognición y el dominio del conocimiento. El sistema cognitivo se divide en cuatro procesos: conocimiento/recuerdo, comprensión, análisis y utilización del conocimiento.
Unit II Technology and Inventions Inglés IAPB 2014Diego ElCretino
PPT for educational pruposes.
It'll be used for intriducing Unit II Technology and Inventions from Mineduc text Teens Club - 1st Grade.
I recommend you to download the presentation, instead of show it directly from this page; it contains animations that may motivate students a little bit more.
Any comment, just leave it below.
This document discusses the advantages and disadvantages of television in Chile. It begins with an activity asking students to make a list of words related to television and share with the class. It then discusses debating the pros and cons of TV in Chile. It provides examples of some issues that were brought up in a television program about whether TV is dangerous, such as the impact of violent programming on children and the presentation of sexual content.
Antes de comenzar con esta actividad, se recomienda revisar el siguiente ppt: http://es.slideshare.net/igotamnesia/wth-is-facebook-ceia-mass-2013
TODOS LOS DERECHOS RESERVADOS!
Luego de presentar algunas características y funciones de esta red social, los estudiantes debieron realizar esta actividad: http://es.slideshare.net/igotamnesia/task-facebook
DERECHOS RESERVADOS.
Este documento presenta una lección sobre cognados y falsos cognados en inglés y español con el objetivo de identificarlos en un contexto determinado. Explica que los cognados son palabras similares en ambos idiomas que comparten el mismo significado, mientras que los falsos cognados parecen similares pero tienen significados diferentes. A continuación, proporciona ejemplos de cognados y falsos cognados, así como ejercicios para que los estudiantes practiquen identificarlos y clasificarlos como sustantivos, adjetivos o verbos.
This ppt was shown at UCSC as a material for those students who wanted to participate in alternative tutorials during the second-half strike period in 2011.
Mentors: Valentina Gallegos & Diego Ulloa
It's a project for education improvement at an specific school in Concepción, Chile.
It's important to mention that this project was just planned and not carried out due to some limitations such as time and the first-half 2011 strike period.
Participants: Joanne Abos-Padilla & Diego Ulloa
It's just a presentation of two pieces of paper together; one of them is the English version of the document and the other corresponds to the translation I made of it.
(This document was uploaded just for educational purposes)
This unit plan involves 4 lessons to teach secondary school students how to write a biography. Lesson 1 has students read and comprehend a biography of a famous singer. Lesson 2 focuses on listening to and understanding specific details about another singer's interview. Lesson 3 has students brainstorm words and verbs commonly found in biographies. Finally, in Lesson 4 students write and present their own short biography of a singer they like using a PowerPoint presentation. The overall aim is for students to be able to write their own biography by the end of the unit.
The 90-minute lesson plan aims to teach students about different types of music and their themes. Students will listen to short clips of famous songs and try to recognize the genre. They will discuss what kind of music they like and when they listen to it. Then students will listen to a full song and identify its genre and theme by discussing keywords. They will sequence lines from the song lyrics. Finally, students will sing parts of the song, write about the theme and feelings conveyed, and discuss if songs can help learn English.
Beyoncé sings about what life would be like if she were a boy for a day. She reflects that as a boy, she could drink beer with friends and chase after girls without repercussions. As a boy, she believes she could better understand how it feels to love a girl and be in a relationship. She would listen to her partner and not take her for granted like many boys do.
The speaker imagines turning off their phone to pretend they are sleeping alone so their partner thinks they are faithful while the speaker secretly spends time with someone else before returning home.
Listening Test 2 Module 2 Unit III EntertainmentDiego ElCretino
This document contains an English test with 3 parts:
1. A listening comprehension exercise to complete blanks in a conversation about movies.
2. An ordering exercise to put statements from a song in the correct sequence.
3. A multiple choice exercise to circle the feeling of a speaker about different movie genres.
The lesson plan aims to help 2nd grade students comprehend the trailer for "Toy Story 3" in 45 minutes. It begins with activating students' prior knowledge about friendship through pictures and questions. Students then predict what the video will be about and check their predictions while watching. Comprehension activities include identifying the speech type, answering true/false and multiple choice questions. Finally, students connect to the topic by choosing a toy and discussing toys' influence on their childhood.
The document provides an aim and context for a unit on entertainment choices. The aim is to comprehend general and specific ideas from a movie trailer. It then lists some key details about the plot of the movie "Toys" - Andy goes to college, the toys are hysterical, and Buzz Lightyear speaks French.
1) The lesson plan aims to teach students how to express their film preferences in writing.
2) Students will work in pairs to create a poster about their favorite film genre, including a picture, sentence expressing their preference, and reasons.
3) The teacher will monitor students as they work, noting errors to address later. Common mistakes like incorrect verb forms will be discussed to improve students' language use.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
हिंदी वर्णमाला पीपीटी, hindi alphabet PPT presentation, hindi varnamala PPT, Hindi Varnamala pdf, हिंदी स्वर, हिंदी व्यंजन, sikhiye hindi varnmala, dr. mulla adam ali, hindi language and literature, hindi alphabet with drawing, hindi alphabet pdf, hindi varnamala for childrens, hindi language, hindi varnamala practice for kids, https://www.drmullaadamali.com
This presentation includes basic of PCOS their pathology and treatment and also Ayurveda correlation of PCOS and Ayurvedic line of treatment mentioned in classics.
This presentation was provided by Steph Pollock of The American Psychological Association’s Journals Program, and Damita Snow, of The American Society of Civil Engineers (ASCE), for the initial session of NISO's 2024 Training Series "DEIA in the Scholarly Landscape." Session One: 'Setting Expectations: a DEIA Primer,' was held June 6, 2024.
Leveraging Generative AI to Drive Nonprofit InnovationTechSoup
In this webinar, participants learned how to utilize Generative AI to streamline operations and elevate member engagement. Amazon Web Service experts provided a customer specific use cases and dived into low/no-code tools that are quick and easy to deploy through Amazon Web Service (AWS.)
How to Add Chatter in the odoo 17 ERP ModuleCeline George
In Odoo, the chatter is like a chat tool that helps you work together on records. You can leave notes and track things, making it easier to talk with your team and partners. Inside chatter, all communication history, activity, and changes will be displayed.
Main Java[All of the Base Concepts}.docxadhitya5119
This is part 1 of my Java Learning Journey. This Contains Custom methods, classes, constructors, packages, multithreading , try- catch block, finally block and more.
How to Build a Module in Odoo 17 Using the Scaffold MethodCeline George
Odoo provides an option for creating a module by using a single line command. By using this command the user can make a whole structure of a module. It is very easy for a beginner to make a module. There is no need to make each file manually. This slide will show how to create a module using the scaffold method.
Strategies for Effective Upskilling is a presentation by Chinwendu Peace in a Your Skill Boost Masterclass organisation by the Excellence Foundation for South Sudan on 08th and 09th June 2024 from 1 PM to 3 PM on each day.
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPRAHUL
This Dissertation explores the particular circumstances of Mirzapur, a region located in the
core of India. Mirzapur, with its varied terrains and abundant biodiversity, offers an optimal
environment for investigating the changes in vegetation cover dynamics. Our study utilizes
advanced technologies such as GIS (Geographic Information Systems) and Remote sensing to
analyze the transformations that have taken place over the course of a decade.
The complex relationship between human activities and the environment has been the focus
of extensive research and worry. As the global community grapples with swift urbanization,
population expansion, and economic progress, the effects on natural ecosystems are becoming
more evident. A crucial element of this impact is the alteration of vegetation cover, which plays a
significant role in maintaining the ecological equilibrium of our planet.Land serves as the foundation for all human activities and provides the necessary materials for
these activities. As the most crucial natural resource, its utilization by humans results in different
'Land uses,' which are determined by both human activities and the physical characteristics of the
land.
The utilization of land is impacted by human needs and environmental factors. In countries
like India, rapid population growth and the emphasis on extensive resource exploitation can lead
to significant land degradation, adversely affecting the region's land cover.
Therefore, human intervention has significantly influenced land use patterns over many
centuries, evolving its structure over time and space. In the present era, these changes have
accelerated due to factors such as agriculture and urbanization. Information regarding land use and
cover is essential for various planning and management tasks related to the Earth's surface,
providing crucial environmental data for scientific, resource management, policy purposes, and
diverse human activities.
Accurate understanding of land use and cover is imperative for the development planning
of any area. Consequently, a wide range of professionals, including earth system scientists, land
and water managers, and urban planners, are interested in obtaining data on land use and cover
changes, conversion trends, and other related patterns. The spatial dimensions of land use and
cover support policymakers and scientists in making well-informed decisions, as alterations in
these patterns indicate shifts in economic and social conditions. Monitoring such changes with the
help of Advanced technologies like Remote Sensing and Geographic Information Systems is
crucial for coordinated efforts across different administrative levels. Advanced technologies like
Remote Sensing and Geographic Information Systems
9
Changes in vegetation cover refer to variations in the distribution, composition, and overall
structure of plant communities across different temporal and spatial scales. These changes can
occur natural.
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
1. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Pre – testing
J. D. Brown
What is Pre – testing?
For this author, Pre testing is also called Piloting
It is when you take a group of language items and try them out with a group
of students.
It allows us to know which items will be better for all our students. It is done
when bottom and top students discriminate the items.
Why is it important to do Pre – Testing?
Because it helps us to know which items are useful and which aren’t.
Most teacher tend to think that all items are good but by pretesting they are able to
prove which ones really are and which one are not.
How do you select the items?
It helps to understand how well your items are working
1) It is important to develop more items that the ones that are going to be
needed (if you need 40 items for the final test, would be better if you
develop about 60 items)
2) Then administrate it to the students that are going to take the test. (Ideally
different students that the ones who are going to take the test)
3) Then write the items on a sheet of paper in columns (see example on the
video)
4) Then we are able to calculate the item difficulty which represents the
student’s percentage that did wrong on a specific item.
5) Then we can calculate the item discrimination what involves diving the
student into three groups (according to their scores)
Doing this you can calculate the item difficulty to each group.
Then you are able to choose the items you are going to test. You do it according to
the item discrimination and the item difficulty.
2. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Vocabulary
John Read
At first, Vocabulary was not consider an important thing in assessing
Imbibed vocabulary: when you look at the vocabulary within the context of a
larger construct
Vocabulary is useful for:
- Placing students in Language
- Measuring progress
- Diagnoses
Vocabulary knowledge is important because it helps in fluent reading
comprehension
Measuring vocabulary size:
1) Take samples of a word frequency list
2) Then divide the text to know if those words are known or not
3) Then we make a list of the total vocabulary size of the native speaker or
the 2nd lg learner
Vocabulary tasks:
1) Descript vocabulary
2) Yes / no
3) Multiple choice
4) Matching task
5) Gap filling
Depth of vocabulary knowledge
It goes beyond that only testing student’s vocabulary meaning to assess other
aspects of word knowledge
Word associated format: the test taker is presented with the target word and then a
series of other words which are related (students have to select the words that are
related to the target word)
Vocabulary knowledge scale: students must write how well they know the word’s
meaning. They are also asked to give synonyms or translation and also use the
word in sentence.
Vocabulary is good for lexical units like idioms, phrasal verbs, collocations,
formulating sequences and lexical phrases
3. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Reading
Caroline Clapham
- By taking just a passage it would test only identification of facts such as
addresses dates and names. So it would be better to think of abilities that go into
reading skills:
items that test inferencing,
a whole string of skills and
strategies used in reading
- You should have a list where one specifies the skills needed in reading and
then try samples of that list:
skimming,
scanning,
looking for small pieces of information, and
looking for the general meaning of a paragraph
How should you choose the reading passages?
- Considering the purpose of the students:
Use those sorts of texts that people will read when they are using English
(contextualization). These texts do not need to be totally authentic but look
like that at least.
Use several texts: it makes the reading text far more reliable.
o You can vary things like the genre the topic some students may be more
familiar with.
o short texts for intensive reading (when looking for specific facts)
o long texts for skimming and scanning for extensive reading (when looking
for general meaning)
How should you test these passages?
All depends on both students and circumstances. It’s better if you have more than
one sort of method:
Multiple Choice Questions (MCQ): very good for testing reading
comprehension but very difficult to write, so you can use this alternative only
if you are a trained and experienced writer in MCQ. It is compulsory to pre-
test them to see they are working well.
4. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Short Answer Questions: very revealing by limiting the answer to say--for
instance, three words. It is also a fairly easy to mark.
Selection of Headings: very good alternative if you are trying to get
understanding of overall meaning of a passage only if they are provided in a
“matching” way.
Gapped Summary: for advanced readers, in which the test writer
summarizes a text with gaps and students should fill in those gaps.
Information Transfer: students get the text, understand it and then transfer
what they have read into a chart.
True or False Questions: they can work well if treated carefully because of
the 50%-getting-right-item issue.
5. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Speaking
Glenn Fulcher
A speaking test is essentially composed of three parts:
1. Task: something that we give to the learners; it elicits language from them.
2. Rating Scale: it is used to grade the sample of language (the task).
3. Rater: It may be the interlocutor or different people that grade that sample
of language.
Where does the rating scale come from? Why do they seem to be pretty
important?
They have been designed in a number of different ways that suit teacher’s own
purposes:
Intuitional: The different levels of the rating scale are created on the base of
the teachers/interlocutors’ experiences. Fulcher calls this “armchair method”
because it is based more on one’s own intuition)
Empirical:
I. By collecting samples of language from the students to show them to
teachers and come to an agreement on which one is the best.
II. By collecting samples of language but analyzing what the students
are saying perhaps using discourse analysis or conversation
analysis and then use those descriptors to generate the rating
scales actually to write the different bands.
What about the raters?
Sometimes the rater is a separate persona or the interlocutor itself, but in the first
case it allows them to concentrate on the rating process and reliability.
Reliability is divided into two:
Intra-rater reliability: if an individual rater can agree with him or herself in
rating the same sample of language over a period of time.
Inter-rater reliability: if different raters can agree with each other when
they are marking or grading the same sample of language.
Is speaking the most difficult ability to test?
Testing speaking brings its own special problems:
6. Testing summary (videos)
Diego Ulloa
Lorena Salazar
In terms of validity:
o Rating Scales: they have to be well-written in terms of the definition
and specification of what we want to measure.
o Task: we need to be able to make sure that the kind of language
that we are listing can actually be rated using the rating scale
previously created, so the rating scale and the task design has to go
hand in hand.
o Generalisability: concerned with the use of the understanding of the
score that the student is given, i.e., the interpretation of the score out
of the context itself and making predictions about what the student is
or is not capable of doing in the future or in a non-test context.
In terms of practicality:
o Speaking tests are difficult to organize, so they need to have enough
trained raters.
7. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Listening
Gary Buck
o Listening comprehension is a process of constructing meaning where two
different sort of knowledge play a great role on this: linguistic and
nonlinguistic knowledge.
o Listening comprehension is quite difficult because people do not speak in
sentences as in written language form: we find many pauses and hesitation;
we do not finish our “oral sentences”.
o Listening takes time in real time. This means that the processes must be
automatic and then the information needs to be memorized.
We find two problems when testing listening comprehension:
1. We cannot examine a piece of listening comprehension.
2. There is interference. It is possible that people interpret the listening in
different ways.
There are two types of listening
The interactive listening (when there is a speaker and a listener who change roles)
The non interactive listening (when there is only a listener. Ex: listen to radio)
There are two purposes in listening interaction:
1. Convey information
2. Establish relationship (social interaction – transactional use to provide
information)
What type of item task should we use for listening?
Dictation: It is the easiest way but it tests a very narrow range of language
skills. (It is easy to make and to mark)
Statement evaluation activities: in which we give people a statement and
they evaluate if it is true or not. (it also tests a very narrow range of skills)
Listening for longer discourses or a deeper level: we use comprehension
questions and information transfer items.
How can we do that communicative?
8. Testing summary (videos)
Diego Ulloa
Lorena Salazar
There are two ways to do it communicative:
1. Comprehending explicit linguistic information.
2. Comprehension in terms of interpreting the meaning in a broad context.
Listening comprehension can be defined as understanding samples of realistic
language and the process is automatic and in real time. This means to also
understand any inferences and implications of the content or context.
9. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Item Writing
Charles Stansfield
What makes a good item writer?
They have to be an excellent writer in the language in which they are
writing.
They have to have teaching experience.
Are there any principles of good item writing?
Those principles vary according to the paper that one is developing.
In general, good writing items starts with looking at:
o test specifications
o the audience itself
o the purpose for the test
o the test content
o item writing guidelines
o sample items
What do you look for when reviewing an item?
This process is an integrated one that typically involves four issues:
Content review: the item is compared with the specifications we are looking
to see whether the reviewer agrees with the content classification of the
item.
Key check: It tends to be ignored. It is actually specified by the item writer
and submitted with the content classification, and the reviewer will consider
whether there are any other possible keys that should be listed in the key list
of acceptable responses.
Bias review: there are two things we look for here.
o Content Bias: it is the nature of the item going to favor a particular
group of examinees in terms of familiarity with that context or the
content. It has to be neutral.
o Sensitivity Bias: it has to do with wording of the item that is likely to
be offensive or insensitive to any particular group of examinees.
Editorial review: It includes re-write and re-word items, but the issues are
mainly mechanics, punctuation, spelling, grammar, clarity and style of
writing.
How many times should an item writer be reviewed?
10. Testing summary (videos)
Diego Ulloa
Lorena Salazar
As many as needed; every item writer needs multiple reviews.
The item review process consists of two horizontal stages:
o When the items go out to more than one reviewer and a set of
comments get back from them.
o When once those items are already compared, they have to be sent
to a different set of reviewers and then a set of comments get back
from those new reviewers.
11. Testing summary (videos)
Diego Ulloa
Lorena Salazar
Testing Writing
Liz Hamp-Lyons
What is writing assessment?
The most important characteristic is that in this form of assessment the test
taker actually writes as opposed to completing multiple choice items or
some other non-productive form.
We also expect to have:
o a focused topic
o specification of audience
o specification of purpose for writing
o the method by which the writing is going to be judged
o people or any computer system to make those judgments
o a scale for reporting the performance of the writers
o means of validity for the assessment instrument
How reliable are writing tests?
It depends on how carefully one works on ensuring reliability:
Having more than one item though it is difficult because it takes so long to
produce a text.
Having more than one judge, and a third when there is a disagreement
between those two judges.
Scoring different dimensions of the text (grammar, organization, content,
etc)
Having more than one writing occasion though it does not happen very often
because it is so time consuming.
What do we have to remember in order to end up with a writing test that is as
reliable as it can be and still keep validity that this kind of assessment has?
In terms of validity: it is a need to have people thinking about educational
goals, curriculum and syllabus to assess what has been taught.
In terms of reliability:
o The teacher has the opportunity to have more than one item.
o Teachers can work with other teachers to exchange pieces of writing
and discuss them.
If I’m looking at writing test from the point of view of the teacher or future test taker
what should I look for about the test? what information should be provided to me?
12. Testing summary (videos)
Diego Ulloa
Lorena Salazar
From the teacher’s viewpoint, the more information provided and positive the
assessment is the better, because teachers tend to pick out what students cannot
do rather than what they actually can do.