Using corpora to enhance language learning
Michael Barlow
Enhancing Language Learning Using Corpora

Published in: Education

  1. Using corpora to enhance language learning – Michael Barlow
  2. Overview
     wordlists
     collocation lists
     online concordancers
     text analysis software
     concordancers
     ParaConc and Collocate
     web-based exercises
     data-driven learning materials
  3. Wordlists – general and specialised
     Wordlists have been around since before the invention of computers.
     General wordlists are used for curriculum development, textbook writing, etc.
     It is also possible to produce a wordlist for a single reading (or possibly a textbook).
  4. Wordlists – general
     Use existing wordlists such as West's General Service List (and recent updates), Coxhead's Academic Word List, and Kilgarriff's wordlists based on the BNC.
  5. Kilgarriff Page
  6. Academic Word List
  7. Academic Word List
  8. Academic Word List
     • receptive list (based on morphological derivations)
     • the list excludes words found in non-academic texts (even if they occur in academic texts)
     • do we need subject- or genre-specific wordlists? (Hyland)
  9. Specialised Word List
     • Create a wordlist from a corpus (using a concordancer or other utilities)
     • May need to create your own corpus – BootCaT? (Silvia Bernardini)
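
The first bullet on slide 9 (creating a wordlist from a corpus) can be sketched in a few lines of Python. This is a minimal illustration, not the method used by any particular concordancer: the tokenizer is a simple regular expression, and `make_wordlist`, the two sample sentences, and the `general_words` set are all invented for the example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return word tokens (internal apostrophes kept)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def make_wordlist(texts, exclude=frozenset()):
    """Count word frequencies across texts, skipping words in `exclude`."""
    counts = Counter()
    for text in texts:
        counts.update(w for w in tokenize(text) if w not in exclude)
    # Descending frequency, then alphabetical, so ties have a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical mini-corpus and general-word filter, for illustration only.
corpus = [
    "The merger will increase shareholder value.",
    "Shareholder approval is required before the merger.",
]
general_words = {"the", "will", "is", "before"}

wordlist = make_wordlist(corpus, exclude=general_words)
for word, freq in wordlist:
    print(f"{freq:>3}  {word}")
```

Filtering against a general wordlist is one simple way to surface the specialised vocabulary of a small corpus; in practice the `exclude` set would come from an existing general list rather than being typed by hand.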
  10. BootCaT
  11. Vocab Profile
      • Tom Cobb's Vocab Profile
      • http://www.lextutor.ca/vp/eng/
  12. Collocation lists
      • More difficult to find – use a collocation dictionary?
      • Biber's work on lexical bundles
      • Use a concordancer or utility to create n-gram lists or locate collocations
      • Collocate – shown below
  13. Concordancers
      • Online concordancer
  14. Concordancers
  15. Concordancers – americancorpus.org
  16. Concordancers
      • Using a concordancer in the classroom
      • Corpus as a reference tool – query the corpus
        – can you say “the government are”?
        – what is the difference between “for instance” and “for example”?
        – Tim Johns – Data-driven Learning
      • (...caused economic development...)
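
Using the corpus as a reference tool, as on slide 16, comes down to searching for a phrase and reading the hits in context. The sketch below shows a bare-bones key-word-in-context (KWIC) search. The `kwic` function, the context width, and the three sample sentences are assumptions made for illustration; a real concordancer would search a full corpus and offer sorting.

```python
import re


def kwic(text, query, width=30):
    """Return one line per hit: left context, [matched phrase], right context."""
    text = " ".join(text.split())  # collapse line breaks and repeated spaces
    pattern = re.compile(r"\b" + re.escape(query) + r"\b", re.IGNORECASE)
    lines = []
    for m in pattern.finditer(text):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        # Right-align the left context so the key phrases line up in a column.
        lines.append(f"{left:>{width}}[{m.group(0)}]{right}")
    return lines


# Hypothetical sample text, for illustration only.
sample = (
    "Critics say the government are divided on the issue. "
    "Officials insist the government is listening. "
    "Unions claim the government are ignoring them."
)

plural_hits = kwic(sample, "the government are")
singular_hits = kwic(sample, "the government is")
print("\n".join(plural_hits))
print(len(plural_hits), "plural vs", len(singular_hits), "singular")
```

Comparing the hit counts and contexts for the two queries is the kind of evidence a learner could use to answer "can you say …?" questions for themselves.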
  17. Concordancers – text reconstruction exercises
  18. Data-driven learning (deductive)
  19. Data-driven learning (inductive)
  20. Concordance data
      • DDL – highlighting/noticing/discovery learning
      • Highlight unexpected (for the learner) distinctions, uses, etc.
      • Sequence data to build up knowledge
  21. Parallel concordance data
      • Parallel concordancing works on a translation corpus
      • Students need to have the same L1
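
The idea behind slide 21 can be illustrated with a toy sentence-aligned corpus: search one language and display each hit next to its translation. This is only a sketch and does not reflect how ParaConc works internally. It assumes the alignment has already been done, and the English–Spanish pairs and the `parallel_concordance` function are invented for the example.

```python
import re


def parallel_concordance(aligned_pairs, query):
    """Return (source, translation) pairs whose source sentence contains `query`."""
    pattern = re.compile(r"\b" + re.escape(query) + r"\b", re.IGNORECASE)
    return [(src, tgt) for src, tgt in aligned_pairs if pattern.search(src)]


# Hypothetical, hand-aligned sentence pairs (English, Spanish).
pairs = [
    ("He made a decision.", "Tomó una decisión."),
    ("She made a mistake.", "Cometió un error."),
    ("They had lunch.", "Almorzaron."),
]

hits = parallel_concordance(pairs, "made")
for src, tgt in hits:
    print(f"{src:<25} | {tgt}")
```

Seeing that the same English verb corresponds to different verbs in the translations is the sort of contrast a parallel concordance makes visible, which is why the learners need to share an L1.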
  22. Concordance data issues
      • KWIC format
      • Google effect
      • Data overload
      • Reauthenticating data
        – Sabine Braun – includes discourse perspective (Why did the speaker use that form?)
  23. Parallel Corpora – DDL (Chujo, Kiyomi)
  24. Parallel Corpora – DDL (Chujo, Kiyomi)
  25. Collocate
      Software to extract collocations/terms
      Word search + Span (2 words, 3 words, etc.)
      n-gram (bigram, trigram) list
      Full extract -- collocations in a corpus
  26. Search for “analysis” (Span = 2)
  27. “analysis” – frequency
  28. “analysis” – t-score
  29. “analysis” – MI
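
Slides 26 to 29 rank the collocates of “analysis” by frequency, t-score, and MI. The sketch below uses one common textbook formulation of the two scores, with O the observed co-occurrence count and E the count expected by chance: MI = log2(O / E) and t = (O − E) / √O. These formulas, the definition of the span as a number of tokens on either side of the node, and the sample sentence are assumptions made for illustration; Collocate's own definitions may differ.

```python
import math
from collections import Counter


def collocates(tokens, node, span=2):
    """Score words found within `span` tokens either side of `node`."""
    n = len(tokens)
    freq = Counter(tokens)
    observed = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            window = tokens[max(0, i - span):i] + tokens[i + 1:i + 1 + span]
            observed.update(window)
    rows = []
    for word, o in observed.items():
        # Expected co-occurrences if the two words were independent.
        expected = freq[node] * freq[word] * (2 * span) / n
        mi = math.log2(o / expected)
        t = (o - expected) / math.sqrt(o)
        rows.append((word, o, round(mi, 2), round(t, 2)))
    return rows


# Hypothetical 16-token sample, for illustration only.
sample = ("statistical analysis of the data shows that a careful "
          "statistical analysis can help the new team").split()

rows = collocates(sample, "analysis", span=1)
for word, o, mi, t in sorted(rows, key=lambda r: -r[3]):  # sort by t-score
    print(f"{word:<12} freq={o}  MI={mi}  t={t}")
```

In this toy sample “statistical” (seen twice next to the node) and “of” (seen once) receive the same MI, while the t-score ranks “statistical” higher. This reflects the usual observation that MI tends to favour low-frequency pairings, whereas the t-score rewards pairings backed by more evidence.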
  30. Trigram search
  31. Trigram -- by freq
  32. Trigram -- alphabetical
  33. Trigram -- by MI
  34. Using batch mode –
  35. Corpuslab.com
      Familiar exercise authoring
      Currently offline
      Aims
        avoid duplication of tasks -- identifying common collocations in Business English
        Provide corpus/analysis resources
        Bring corpus resources together with familiar exercise authoring
  36. Student View
  37. Student View
  38. Student View
  39. Student View
  40. Exercise types
      Matching
      Fill-the-gap
      Multiple Choice
      Reorder
      Categorise
  41. Exercise types
      Matching*
      Fill-the-gap
      Multiple Choice
      Reorder
      Categorise*
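
As an illustration of how corpus data might feed one of the exercise types listed above, the sketch below builds fill-the-gap items from sentences containing a target word. It is not a description of how Corpuslab generated exercises: `make_gap_fill`, the item format, and the sample sentences are hypothetical.

```python
import random
import re


def make_gap_fill(sentences, target, n_items=3, seed=0):
    """Blank out `target` in up to n_items sentences that contain it."""
    pattern = re.compile(r"\b" + re.escape(target) + r"\b", re.IGNORECASE)
    hits = [s for s in sentences if pattern.search(s)]
    rng = random.Random(seed)  # fixed seed so the same items are produced each run
    rng.shuffle(hits)
    return [{"prompt": pattern.sub("_____", s), "answer": target}
            for s in hits[:n_items]]


# Hypothetical corpus sentences, for illustration only.
sentences = [
    "We need to make a decision today.",
    "Did you make an appointment?",
    "They did their homework.",
]

items = make_gap_fill(sentences, "make")
for number, item in enumerate(items, start=1):
    print(f"{number}. {item['prompt']}")
```

Because the pattern matches whole words only, inflected forms such as "makes" or "made" are not blanked; a fuller version would need lemma information from an annotated corpus.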
  42. Teacher view
  43. Teacher view
  44. Teacher view
  45. Teacher view - Resources
  46. Resources
      Teacher-generated resources
      uploaded frequency lists
      worksheets
  47. Tracking
      Teachers can track their exercises
      “Class teachers” track students in their class
  48. Tracking
  49. Report for exercise Cat1
  50. Tracking of student
  51. School view
      Register as a school
      Create class names
      Assign teachers to classes
      Track students in classes
  52. School view
  53. School view
  54. Resources
      Site resources
      corpora and simple concordancer
      text analysis utilities
  55. Text analysis utilities
      Create frequency lists
      Text analysis in terms of frequency bands
      Collocational analysis of texts
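
"Text analysis in terms of frequency bands" can be pictured as assigning each token in a text to the first wordlist that contains it and reporting the share of tokens per band. The sketch below is a minimal version of that idea. The band names and their tiny contents are toy stand-ins invented for the example, not the actual General Service List or Academic Word List, and `band_profile` is a hypothetical helper.

```python
import re
from collections import Counter


def band_profile(text, bands):
    """Return the proportion of tokens falling in each band, plus 'off-list'.

    `bands` is an ordered list of (name, set_of_words); a token is counted
    in the first band that contains it.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for tok in tokens:
        for name, words in bands:
            if tok in words:
                counts[name] += 1
                break
        else:
            counts["off-list"] += 1
    total = len(tokens) or 1  # avoid dividing by zero on empty input
    names = [name for name, _ in bands] + ["off-list"]
    return {name: counts[name] / total for name in names}


# Toy bands, for illustration only.
bands = [
    ("general (toy)", {"this", "of", "the", "is", "a"}),
    ("academic (toy)", {"analysis", "data"}),
]

profile = band_profile("This analysis of the data is a bootstrap", bands)
for name, share in profile.items():
    print(f"{name:<15} {share:.1%}")
```

A high off-list share suggests that a text contains many words outside the chosen lists, which a teacher could use as a rough indicator of lexical difficulty for a given class.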
  56. Corpora
      Teacher/Author resource
      Sample corpus -- CSPAE
      Add other corpora such as MICASE
      Create various options for searching that make use of corpus annotation
  57. Simple searching
  58. Aims
      Create a language learning site
      Encourage and facilitate use of corpus data
      Matching exercise (up to 5 columns)
      Provide access to word lists, etc.
      Provide text analysis tools
  59. Aims
      Use traditional exercise types that teachers are familiar with
      Give examples of creative uses of these standard exercises
  60. Thank you