



   Writing with Open Tools (Part One)




09/11/2011      http://www.flickr.com/photos/mikekline/265954619/   Alannah Fitzgerald
Overview (part one)
    •   Introducing Corpus Linguistics
    •   Lexical knowledge: collocations, derivatives, register
    •   The Flexible Language Acquisition Project (FLAX)
    •   The British National Corpus (BNC)
    •   The Lextutor
    •   The Academic Wordlist (AWL)
    •   EAP practice resources
Intro to corpus linguistics
Let's start with three questions about English:

1.    What is the meaning of goalless?
2.    How is the word shall used in present-day British English? Think of some examples.
3.    Which is more commonly expressed in everyday English?
     a.   “I was a little disappointed…”
     b.   “I was very disappointed…”

     Adapted from Hoffmann et al., 2008
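
Question 3 is a frequency question, and a corpus answers it by counting. The sketch below is a minimal illustration only: the sample text is invented, and `count_phrase` is a hypothetical helper, not part of BNCweb or FLAX.

```python
import re


def count_phrase(text: str, phrase: str) -> int:
    """Count case-insensitive, whole-word occurrences of a phrase."""
    # Allow any run of whitespace between the words of the phrase.
    pattern = r"\b" + r"\s+".join(re.escape(w) for w in phrase.split()) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))


# Hypothetical mini-corpus, invented for illustration only.
SAMPLE = (
    "I was very disappointed with the result. "
    "To be honest, I was a little disappointed by the ending. "
    "She said, 'I was very disappointed in him.'"
)

for phrase in ("I was a little disappointed", "I was very disappointed"):
    print(phrase, "->", count_phrase(SAMPLE, phrase))
```

With a real corpus the counts would normally be normalised (for example, per million words) before comparing sources of different sizes.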
British National Corpus

http://www.natcorp.ox.ac.uk/
Focus on representation
The British National Corpus (BNC)
100-million-word static corpus, 1978-1992
  Spoken (10%); Written (90%); Domain representation
BNCweb concordancer – free download

        http://bncweb.info/
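
A concordancer lists every hit of a search word with its surrounding context (keyword in context, or KWIC). The following is a rough sketch of that idea, assuming plain text input; the function name `kwic` and the sample sentence are hypothetical, and this is not how BNCweb itself is implemented.

```python
import re


def kwic(text: str, keyword: str, width: int = 30) -> list[str]:
    """Return one aligned concordance line per whole-word hit of keyword."""
    lines = []
    pattern = r"\b" + re.escape(keyword) + r"\b"
    for m in re.finditer(pattern, text, flags=re.IGNORECASE):
        # Take up to `width` characters of context on each side of the hit.
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        lines.append(f"{left:>{width}} [{m.group(0)}] {right:<{width}}")
    return lines


# Invented sample text, for illustration only.
sample = "Shall we go now? I shall return tomorrow. We shall see what happens."
for line in kwic(sample, "shall", width=20):
    print(line)
```

Aligning the keyword in a central column makes recurring patterns to its left and right (such as "I shall" versus "shall we") easier to notice.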
BNC header information
http://flax.nzdl.org/greenstone3/flax?a=fp&sa=home
Focus on automation
The Flexible Language Acquisition Project (FLAX)
Web n-gram corpora generated from a 2006 web dump and supplied by Google
  500,000 words and 380 million five-grams
  GALL - Google Assisted Language Learning
    (Chinnery, 2008; Shei, 2008)
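
A five-gram is simply a run of five consecutive words. As a minimal sketch of how n-gram counts can be built from running text (the tokenizer here is a simplifying assumption and is not the procedure used for the Google data):

```python
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and keep alphabetic words (apostrophes allowed)."""
    return re.findall(r"[a-z']+", text.lower())


def ngrams(tokens: list[str], n: int) -> list[tuple[str, ...]]:
    """Return every run of n consecutive tokens."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def count_ngrams(text: str, n: int = 5) -> Counter:
    """Count n-grams in a text; sentence boundaries are ignored in this sketch."""
    return Counter(ngrams(tokenize(text), n))


# Invented sample text, for illustration only.
sample = "I was a little disappointed. I was a little disappointed again."
counts = count_ngrams(sample, n=5)
for gram, freq in counts.most_common(3):
    print(" ".join(gram), freq)
```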
‘Goalless’ keyword search in FLAX
     http://flax2.nzdl.org/greenstone3/flax?
Distribution of shall I/we in the spoken component of the BNC
Distribution of I/we shall in the spoken component of the BNC
FLAX – Samples retrieved for I was a little disappointed
BNC – Samples retrieved for I was a little disappointed
BNC – Samples retrieved for I was very disappointed
 FLAX Web Collocations Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
FLAX vs BNC?

•   Limitations with representativeness
     Identifying register on the Web is difficult
     Successful corpora are based on domains, genres, collections of document types
     The web is a “dirty corpus” (Kilgarriff & Grefenstette, 2003, p. 342)

•   FLAX cleaned by 30% using BNC wordlist
     Linked externally to BNC, Yahoo
     Complementary sources, both with limitations
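
The slides do not describe how the wordlist cleaning was done. One plausible approach, shown here purely as an assumption, is to keep only those n-grams whose every word appears in a trusted wordlist. The function name and the data are hypothetical.

```python
def clean_ngrams(
    ngram_counts: dict[tuple[str, ...], int],
    wordlist: set[str],
) -> dict[tuple[str, ...], int]:
    """Keep only n-grams in which every word is found in the wordlist."""
    return {
        gram: count
        for gram, count in ngram_counts.items()
        if all(word in wordlist for word in gram)
    }


# Invented data, for illustration only (not taken from FLAX or the BNC).
web_counts = {
    ("i", "was", "very"): 10,
    ("i", "wuz", "very"): 2,      # misspelling, absent from the wordlist
    ("click", "here", "now"): 5,
}
trusted_words = {"i", "was", "very", "click", "here", "now"}

cleaned = clean_ngrams(web_counts, trusted_words)
removed = len(web_counts) - len(cleaned)
print(f"removed {removed} of {len(web_counts)} n-grams")
```

A filter like this removes misspellings and non-words but cannot identify register, which is why the two sources remain complementary.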
Google's Terms of Service
“You agree not to access (or attempt to access) any of the Services by any means other than through the interface that is provided by Google, unless you have been specifically allowed to do so in a separate agreement with Google.”

http://www.google.com/accounts/TOS (Clause 5.3)
Typical lexical errors

     a. He's very humorous. He's always doing jokes.
        Correction: telling (collocation)

     b. We conversated for almost one hour.
        Correction: conversed (word families / derivatives)

     c. …and compromise, the issue was resolved in a jiffy.
        Correction: without delay (register)
http://flax.nzdl.org/greenstone3/flax?a=fp&sa=home
OSS Mozilla




          http://www.flickr.com/photos/hindrik/2586245939/
Noticing Text Types – Issues of Register and Genre

     FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
Web Pronoun Phrases OER




     http://www.youtube.com/watch?v=Ns4nXsZ
Kibbitzers (Tim Johns's EAP pages)




http://www.lexically.net/TimJohns/Kibbitzer/timeap3.htm
Web Collocations (fact vs idea)

http://flax2.nzdl.org/greenstone3/flax?a=g&rt=r&sa=CollocationSearch&s=CollocationTypes&s1.wordClass=n&c=collodb&s1.query=&s1.multiple=on
Compleat Lexical Tutor (Tom Cobb)

       http://www.lextutor.ca/
       http://www.lextutor.ca/vp/

Web Collocations OER

     http://www.youtube.com/watch?v=iyZgZhHM
AWL Exercises (Nottingham)




http://www.nottingham.ac.uk/~alzsh3/acvocab/index.htm
UEFAP (Andy Gillett)




        http://www.uefap.com/index.htm
Specific EAP vocab (UEFAP)




                     http://www.uefap.com/vocab/vocfram.htm
FLAX User guides & demos




  FLAX Web Collocations & Phrases Exercises (by Shaoqun Wu http://www.cs.waikato.ac.nz/~shaoqun/tmp/instruction.html)
Speaking & Listening OER for EAP




       http://openspires.oucs.ox.ac.uk/crunch/
Web Phrases OER




         http://www.youtube.com/watch?v=n67FBqBFm6I
FLAX Web Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
Preparation (part two)
     •   Samples of your own writing – soft copy
     •   Build your own corpus – collect ten
         academic articles in your discipline
     •   Writing analysis tools
     •   Specific academic word lists
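
Once the ten articles are saved as plain text, a word-frequency list and a rough measure of word-list coverage can be produced in a few lines. This is a sketch under stated assumptions: the folder name, the `.txt` format and the tiny word list are all hypothetical, and the word list here is an invented stand-in, not the actual AWL.

```python
import re
from collections import Counter
from pathlib import Path


def load_corpus(folder: str) -> list[str]:
    """Read every .txt file in a folder (assumed UTF-8) into a list of texts."""
    return [p.read_text(encoding="utf-8") for p in sorted(Path(folder).glob("*.txt"))]


def word_frequencies(texts: list[str]) -> Counter:
    """Count lowercase word forms across all texts."""
    counts: Counter = Counter()
    for text in texts:
        counts.update(re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower()))
    return counts


def coverage(freqs: Counter, wordlist: set[str]) -> float:
    """Share of running words in the corpus that appear in the word list."""
    total = sum(freqs.values())
    if total == 0:
        return 0.0
    return sum(count for word, count in freqs.items() if word in wordlist) / total


# Invented texts and word list, for illustration only.
texts = ["We analyse the data.", "The data indicate a trend."]
sample_list = {"analyse", "data", "indicate", "trend"}

freqs = word_frequencies(texts)
print(freqs.most_common(3))
print(f"coverage: {coverage(freqs, sample_list):.0%}")
```

For real use, replace `texts` with `load_corpus("my_articles")` (a hypothetical folder name). Exact matching ignores word families, so derivatives such as "analysis" would need to be added to the list or handled by lemmatisation.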

Intro to corpus linguistics tools for EAP

  • 1. Writing with Open Tools (Part One) 09/11/2011 http://www.flickr.com/photos/mikekline/265954619/ Alannah Fitzgerald
  • 2. Overview (part one): Introducing Corpus Linguistics; Lexical knowledge: collocations, derivatives, register; The Flexible Language Acquisition Project (FLAX); The British National Corpus (BNC); The Lextutor; The Academic Wordlist (AWL); EAP practice resources
  • 3. Intro to corpus linguistics. Let’s start with three questions about English: 1. What is the meaning of goalless? 2. How is the word shall used in present-day British English? Think of some examples. 3. Which is more commonly expressed in everyday English? a. “I was a little disappointed…” b. “I was very disappointed…” Adapted from Hoffmann et al., 2008
  • 5. Focus on representation The British National Corpus (BNC) 100 million-word static corpus 1978-1992 Spoken (10%); Written (90%); Domain representation
  • 6. BNCweb concordancer – free download http://bncweb.info/
  • 9. Focus on automation The Flexible Language Acquisition Project (FLAX) Web n-gram corpora generated and supplied by 2006 Google web dump 500,000 words and 380 million five-grams GALL - Google Assisted Language Learning (Chinnery, 2008; Shei, 2008)
  • 10. ‘Goalless’ keyword search in FLAX http://flax2.nzdl.org/greenstone3/flax?
  • 11. Distribution of shall I/we in the spoken component of the BNC
  • 12. Distribution of I/we shall in the spoken component of the BNC
  • 13. FLAX - Samples retrieved for I was a little disappointed
  • 14. BNC - Samples retrieved for I was a little disappointed
  • 15. BNC – Samples retrieved for I was very disappointed FLAX Web Collocations Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 16. FLAX vs BNC? Limitations with representativeness. Identifying register on the Web is difficult. Successful corpora are based on domains, genres, collections of document types. The web is a “dirty corpus” (Kilgarriff & Grefenstette, 2003, p. 342). FLAX cleaned by 30% using BNC wordlist. Linked externally to BNC, Yahoo. Complementary sources, both with limitations.
  • 17. Google’s terms of service: “You agree not to access (or attempt to access) any of the Services by any means other than through the interface that is provided by Google, unless you have been specifically allowed to do so in a separate agreement with Google.” http://www.google.com/accounts/TOS Clause 5.3
  • 18. Typical lexical errors. a. He’s very humorous. He’s always doing jokes. (collocation: telling) b. We conversated for almost one hour. (word families / derivatives: conversed) c. …and compromise, the issue was resolved in a jiffy. (register: without delay)
  • 20. OSS Mozilla http://www.flickr.com/photos/hindrik/2586245939/
  • 21. FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 22. Noticing Text Types – Issues of Register and Genre. FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 23. FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 24. Web Pronouns Phrases OER http://www.youtube.com/watch?v=Ns4nXsZ
  • 25. Kibbitzers (Tim Johns’s EAP pages) http://www.lexically.net/TimJohns/Kibbitzer/timeap3.htm
  • 26. Web Collocations (fact vs idea) http://flax2.nzdl.org/greenstone3/flax?a=g&rt=r&sa=CollocationSearch&s=CollocationTypes&s1.wordClass=n&c=collodb&s1.query=&s1.multiple=on
  • 27. Web Collocations (fact vs idea)
  • 28. Compleat Lexical Tutor (Tom Cobb) http://www.lextutor.ca/
  • 29. Web Collocations OER http://www.lextutor.ca/vp/ http://www.youtube.com/watch?v=iyZgZhHM
  • 31. UEFAP (Andy Gillett) http://www.uefap.com/index.htm
  • 32. Specific EAP vocab (UEFAP) http://www.uefap.com/vocab/vocfram.htm
  • 33. FLAX user guides & demos: FLAX Web Collocations & Phrases Exercises (by Shaoqun Wu, http://www.cs.waikato.ac.nz/~shaoqun/tmp/instruction.html)
  • 34. Speaking & Listening OER for EAP http://openspires.oucs.ox.ac.uk/crunch/
  • 35. Web Phrases OER http://www.youtube.com/watch?v=n67FBqBFm6I
  • 36. FLAX Web Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 37. Preparation (part two): Samples of your own writing (soft copy); Build your own corpus (collect ten academic articles in your discipline); Writing analysis tools; Specific academic word lists
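
The frequency comparison in question 3 and the "build your own corpus" task for part two can be tried on a small scale without BNCweb or FLAX. The following is a minimal sketch in Python, not a description of how either tool works internally: the function names (tokenize, ngram_counts, kwic, count_phrase) and the three-sentence sample text are hypothetical, chosen only for illustration. It counts n-grams and prints simple keyword-in-context (concordance) lines for a phrase.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (keeps forms like he's)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def ngram_counts(tokens, n):
    """Count every run of n consecutive tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def kwic(tokens, phrase, width=4):
    """Return keyword-in-context lines: the phrase with `width` tokens either side."""
    target = tokenize(phrase)
    n = len(target)
    lines = []
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + n:i + n + width])
            lines.append(f"{left} [{' '.join(target)}] {right}".strip())
    return lines


def count_phrase(tokens, phrase):
    """Number of times the phrase occurs in the token list."""
    return len(kwic(tokens, phrase))


# Hypothetical sample text standing in for a learner-built corpus.
sample = (
    "I was a little disappointed with the result. "
    "She said she was very disappointed. "
    "Frankly, I was a little disappointed too."
)
tokens = tokenize(sample)

for phrase in ("a little disappointed", "very disappointed"):
    print(phrase, count_phrase(tokens, phrase))
    for line in kwic(tokens, phrase):
        print("   ", line)
```

To use it on your own writing, replace the sample string with the text of your collected articles read from file. Counts from ten articles will be far too small to generalise from, which is one reason the slides pair this activity with large reference corpora.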