SlideShare a Scribd company logo
1 of 31
Sharing an Open Methodology for 
Building Domain-specific Corpora for EAP 
Martin Barge, William Tweddle, 
Saima Sherazi, Alannah Fitzgerald 
http://creativecommons.org/weblog/entry/35165/
Outline 
• FLAX Language Project at Waikato University 
• Developing an EAP Resource Interface between 
Traditional EAP and Massive Open Online Courses 
• Developing ESAP Collections in FLAX (Academic 
English for Law at QMUL) 
– What’s in the Demo Collection and What’s to Come! 
– Formatting Open Access Articles for FLAX Corpora 
• Fully Open Texts 
– Beyond Parsing with Text Augmentation & Linked Data 
– Lexical Bundles, Collocations, Wordlists, Cherry Picking 
Functions 
– Building in Interactivity 
• Design-based Research with FLAX, Queen Mary and 
the OER Research Hub 
– Research & Development Cycles with Design-based 
Research for Iterating Collections Development 
– Rapid Prototyping of Online Demo Collections to Evaluate 
the Design Process and to Share with Stakeholders
FLAX Language at Waikato University 
http://flax.nzdl.org FLAX image by permission of non-commercial reuse by Jane Galloway
FLAX Language Project at the 
Greenstone Digital Library Lab, 
Waikato University NZ 
Professor Ian Witten 
FLAX Project Lead 
Dr Shaoqun Wu 
FLAX Project Lead Researcher & Developer
QM’s Critical Thinking & Writing in Law 
• Queen Mary’s Critical Thinking and Writing in Law 
(CTWL) Programme has been running successfully for 
over 7 years. 
• It is delivered by QM Language Centre’s EAP/ESAP 
team as part of the Insessional provision. 
• Over 600-800 LLM students enroll on it every year. 
• A team of 6-7 EAP tutors teach on it, and are under 
constant pressure to develop better and new 
materials for their high calibre students.
The FLAX System for Subject- 
Specific Corpus Development 
Corpus Linguistics – pioneered by Sinclair 1991. 
DDL – Data-Driven-Learning – term coined by Johns 1991. 
An empirical method of linguistic enquiry 
•Used to discover the lexico-grammatical properties of genre or text-type 
•Used to discover the key terminology given field or discipline – English 
for Specific Academic Purposes (ESAP) 
•Used for exploring collocations: 
“You shall know a word by the company it keeps.” (Frith, 1957:11)
Collaboration with Subject Specialists 
“In the emerging academic literacies approach 
involving cooperation between subject specialists and 
writing teachers, the aim is to help the students 
develop metacognitive awareness of the roles and 
functions of writing in that discipline, to enable them to 
stand back from it and observe how it functions, and 
then to help them gradually participate in the genres, 
where genre is understood as a constellation of actions 
rather than a list of formal features.” (Breeze, 2012)
Benefits 
• Inductive – promotes critical thinking 
• Promotes learner autonomy 
• Based on evidence, not instinct 
• Especially relevant for ESP and ESAP 
Limitations 
• Need for Ts and Sts to have technical skills to use corpora and 
concordancers 
• Need for access to corpora and software programmes 
• Large amount of data can be overwhelming 
“Every student is Sherlock Holmes.” (Johns, 2002:108)
Interfacing Traditional EAP & MOOCs
ESAP Law Collections in FLAX 
Type of media in the FLAX 
Law Collections 
Number and source of items in the FLAX 
Law Collections 
Podcast audio files & transcripts 
(OpenSpires) 
10-15 Lectures (Oxford Law Faculty & the Centre 
for Socio-Legal Studies) 
MOOC lecture transcripts & 
videos (streamed via YouTube & 
Vimeo) 
4 MOOC Collections: Copyright Law (Harvard/edX), 
English Common Law (Uni. of London/Coursera), 
Age of Globalization (Texas at Austin/edX), 
Environmental Law & Politics (OpenYale) 
Student PhD thesis writing and 
Pre-sessional for Law ESAP essay 
writing 
70 QMUL EThoS Theses at the British Library (Open 
Access but not licensed with Creative Commons – 
will need permission to develop for Non- 
Commercial Educational & Research purposes); 20+ 
Essays from QMUL Law Pre-sessional 
Open Access research articles 
(relevant to QMUL Law and EAP 
for Law and Globalisation) 
40 Articles (DOAJ - Directory of Open Access 
Journals)
Formatting OA Articles for FLAX
Working with Full Texts
Text Augmentation + Text Parsing
Law Corpus Wikify Function in FLAX
Wordlist from OA Articles
Collocations from Law Lectures
Linking Collocations in Law-Specific Corpus to 
Reference Collections in FLAX 
(BNC, BAWE, Wikipedia)
Lexical Bundles from Law Lectures
Building Interactivity into FLAX
FLAX Activities Continued
FLAX Do-It-Yourself Podcast Corpora 
with Oxford OER 
http://www.youtube.com/watch?v=Si24d3Z-8nQ
FLAX Do-It-Yourself Podcast Corpora 2: 
Building interactivity into your collections 
http://www.youtube.com/watch?v=fysDzYjbhh0
Developing Podcast Activities in FLAX
Close Exercises in FLAX
Scrambled Sentences in FLAX
Drag ‘n’ Drop exercises in FLAX
Learning Collocations in FLAX
Automated Collocations Guessing in 
FLAX (drawing on the British National Corpus)
Design-Based Research Cycles with FLAX, 
the OER Research Hub & Queen Mary 
• Practitioners/Researchers involved in iterative 
development of ESAP language collections 
– Interfacing with open Law resources 
Open Access articles, Open Government research reports with 
contributions from QMUL Law professors, Case Law, Open lectures, 
Openly-licensed student writing 
– Developing expertise with open tools and resources 
– Developing interaction within the corpus and derivatives 
from the corpus 
– Documenting the collections development process for 
sharing across the EAP and Open Education sectors
Free to Do Whatever You Want 
• Open Resources for EAP 
Soup Dragons: 
– Building ESAP Corpora 
– Developing Interactivity into 
ESAP Corpora 
– Developing ESAP Course Book 
and Lesson Plan Derivatives 
– Researching and Developing 
ESAP Corpora & Derivatives 
– Researching and Developing 
Corpus Tools e.g. Interfaces, 
Text Augmentation and 
Linked Data Approaches 
http://en.wikipedia.org/wiki/The_Soup_Dragons
Thank You 
FLAX Language Project flax.nzdl.org 
Shaoqun Wu: shaoqun@waikato.ac.nz / Ian Witten: ihw@cs.waikato.ac.nz 
OER Research Hub http://oerresearchhub.org/ 
Alannah Fitzgerald: a_fitzg@education.concordia.ca; @AlannahFitz; 
www.alannahfitzgerald.org TOETOE Blog; Slideshare: 
http://www.slideshare.net/AlannahOpenEd/ 
The Language Centre – Queen Mary University of London http://language-centre. 
sllf.qmul.ac.uk/ 
Martin Barge m.i.barge@qmul.ac.uk 
William Tweddle w.tweddle@qmul.ac.uk 
Saima Sherazi s.n.sherazi@qmul.ac.uk

More Related Content

What's hot

Web Archiving Profile - WADL 2013
Web Archiving Profile - WADL 2013Web Archiving Profile - WADL 2013
Web Archiving Profile - WADL 2013
Ahmed AlSum
 
Who and What Links to the Internet Archive
Who and What Links to the Internet ArchiveWho and What Links to the Internet Archive
Who and What Links to the Internet Archive
Yasmin AlNoamany, PhD
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...
The European Library
 
Carolyn Palaima: SALALM Pecha Kucha 2012
Carolyn Palaima: SALALM Pecha Kucha 2012Carolyn Palaima: SALALM Pecha Kucha 2012
Carolyn Palaima: SALALM Pecha Kucha 2012
alisonhicks0
 
Manchester Seminar Liberate Your Library October 2009
Manchester Seminar   Liberate Your Library   October 2009Manchester Seminar   Liberate Your Library   October 2009
Manchester Seminar Liberate Your Library October 2009
Jonathan Field
 

What's hot (20)

Web Archiving Profile - WADL 2013
Web Archiving Profile - WADL 2013Web Archiving Profile - WADL 2013
Web Archiving Profile - WADL 2013
 
Visualizing the Transcribe Bentham Corpus
Visualizing the Transcribe Bentham CorpusVisualizing the Transcribe Bentham Corpus
Visualizing the Transcribe Bentham Corpus
 
NJVR: The NanJing Vocabulary Repository
NJVR: The NanJing Vocabulary RepositoryNJVR: The NanJing Vocabulary Repository
NJVR: The NanJing Vocabulary Repository
 
Who and What Links to the Internet Archive
Who and What Links to the Internet ArchiveWho and What Links to the Internet Archive
Who and What Links to the Internet Archive
 
Eaa2014 open access_session_4_g.eberhardt+n.riedl_topoi_final_13092014
Eaa2014 open access_session_4_g.eberhardt+n.riedl_topoi_final_13092014Eaa2014 open access_session_4_g.eberhardt+n.riedl_topoi_final_13092014
Eaa2014 open access_session_4_g.eberhardt+n.riedl_topoi_final_13092014
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...
 
PESC-Kirchhoff-ALA Annual 2015 NISO Update
PESC-Kirchhoff-ALA Annual 2015 NISO UpdatePESC-Kirchhoff-ALA Annual 2015 NISO Update
PESC-Kirchhoff-ALA Annual 2015 NISO Update
 
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
 
RLUK Warwick Meeting | Iron Mountain, Jeremy Suratt
RLUK Warwick Meeting | Iron Mountain, Jeremy SurattRLUK Warwick Meeting | Iron Mountain, Jeremy Suratt
RLUK Warwick Meeting | Iron Mountain, Jeremy Suratt
 
Presentation DFG Bonn 16 september 2015
Presentation DFG Bonn 16 september 2015Presentation DFG Bonn 16 september 2015
Presentation DFG Bonn 16 september 2015
 
sw owl
 sw owl sw owl
sw owl
 
Carolyn Palaima: SALALM Pecha Kucha 2012
Carolyn Palaima: SALALM Pecha Kucha 2012Carolyn Palaima: SALALM Pecha Kucha 2012
Carolyn Palaima: SALALM Pecha Kucha 2012
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New Technologies
 
Oceanic Exchanges presentation
Oceanic Exchanges presentationOceanic Exchanges presentation
Oceanic Exchanges presentation
 
Open sonar martinreynaert
Open sonar martinreynaertOpen sonar martinreynaert
Open sonar martinreynaert
 
Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...
 
20181106 survey on challenges of question answering in the semantic web saltlux
20181106 survey on challenges of question answering in the semantic web saltlux20181106 survey on challenges of question answering in the semantic web saltlux
20181106 survey on challenges of question answering in the semantic web saltlux
 
Manchester Seminar Liberate Your Library October 2009
Manchester Seminar   Liberate Your Library   October 2009Manchester Seminar   Liberate Your Library   October 2009
Manchester Seminar Liberate Your Library October 2009
 
Tentative steps in mining UK theses
Tentative steps in mining UK thesesTentative steps in mining UK theses
Tentative steps in mining UK theses
 
NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...
NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...
NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...
 

Similar to Sharing an Open Methodology for Building Domain-specific Corpora for EAP

Resources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishResources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic English
The Open Education Consortium
 
Flexible Open Language Education for a Multicultural World
Flexible Open Language Education for a Multicultural WorldFlexible Open Language Education for a Multicultural World
Flexible Open Language Education for a Multicultural World
The Open Education Consortium
 
Open access in developng countries african
Open access in developng countries africanOpen access in developng countries african
Open access in developng countries african
Dr Lendy Spires
 

Similar to Sharing an Open Methodology for Building Domain-specific Corpora for EAP (20)

Resources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishResources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic English
 
Bridging Formal and Informal Learning for Second Language Writing in FLAX
Bridging Formal and Informal Learning for Second Language Writing in FLAX Bridging Formal and Informal Learning for Second Language Writing in FLAX
Bridging Formal and Informal Learning for Second Language Writing in FLAX
 
Resources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishResources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic English
 
Resources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishResources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic English
 
Flexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldFlexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual World
 
FLAX Weaving with Oxford Open Educational Resources: Open Practices for Engli...
FLAX Weaving with Oxford Open Educational Resources: Open Practices for Engli...FLAX Weaving with Oxford Open Educational Resources: Open Practices for Engli...
FLAX Weaving with Oxford Open Educational Resources: Open Practices for Engli...
 
Delhi University OER for ELT Collections Building
Delhi University OER for ELT Collections BuildingDelhi University OER for ELT Collections Building
Delhi University OER for ELT Collections Building
 
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
 
Flexible Open Language Education for a Multicultural World
Flexible Open Language Education for a Multicultural WorldFlexible Open Language Education for a Multicultural World
Flexible Open Language Education for a Multicultural World
 
Radio Ga Ga: corpus-based resources, you’ve yet to have your finest hour
Radio Ga Ga: corpus-based resources, you’ve yet to have your finest hourRadio Ga Ga: corpus-based resources, you’ve yet to have your finest hour
Radio Ga Ga: corpus-based resources, you’ve yet to have your finest hour
 
Dr. Elizabeth M. Williams: Taking the plunge: Crossing continents library to ...
Dr. Elizabeth M. Williams: Taking the plunge: Crossing continents library to ...Dr. Elizabeth M. Williams: Taking the plunge: Crossing continents library to ...
Dr. Elizabeth M. Williams: Taking the plunge: Crossing continents library to ...
 
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
 
Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"
 
Beyond Content: Open Educational Practices for English Language Education
Beyond Content: Open Educational Practices for English Language EducationBeyond Content: Open Educational Practices for English Language Education
Beyond Content: Open Educational Practices for English Language Education
 
A Pledge for Open English
A Pledge for Open EnglishA Pledge for Open English
A Pledge for Open English
 
Building Open Educational Resources for EAP at Hanoi Open University
Building Open Educational Resources for EAP at Hanoi Open UniversityBuilding Open Educational Resources for EAP at Hanoi Open University
Building Open Educational Resources for EAP at Hanoi Open University
 
Oh, what a BAWE! The British Academic Written English corpus
Oh, what a BAWE! The British Academic Written English corpusOh, what a BAWE! The British Academic Written English corpus
Oh, what a BAWE! The British Academic Written English corpus
 
Love is a stranger in an open car to tempt you in and drive you far away... t...
Love is a stranger in an open car to tempt you in and drive you far away... t...Love is a stranger in an open car to tempt you in and drive you far away... t...
Love is a stranger in an open car to tempt you in and drive you far away... t...
 
Presentation - First International Library Staff Exchange Week, Zagreb
Presentation - First International Library Staff Exchange Week, ZagrebPresentation - First International Library Staff Exchange Week, Zagreb
Presentation - First International Library Staff Exchange Week, Zagreb
 
Open access in developng countries african
Open access in developng countries africanOpen access in developng countries african
Open access in developng countries african
 

More from Alannah Fitzgerald

When a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creatorsWhen a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creators
Alannah Fitzgerald
 

More from Alannah Fitzgerald (17)

F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
F-Lingo: Integrating lexical feature identification into MOOC platforms for l...F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
 
F-Lingo & FLAX: Automated open data-driven language learning in MOOCs
F-Lingo & FLAX: Automated open data-driven language learning in MOOCsF-Lingo & FLAX: Automated open data-driven language learning in MOOCs
F-Lingo & FLAX: Automated open data-driven language learning in MOOCs
 
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
 
EThOS for Academic English
EThOS for Academic EnglishEThOS for Academic English
EThOS for Academic English
 
From clarion calls to auto-complete errors: a nascent discourse on openness ...
From clarion calls to auto-complete errors: a nascent discourse on openness ...From clarion calls to auto-complete errors: a nascent discourse on openness ...
From clarion calls to auto-complete errors: a nascent discourse on openness ...
 
When a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creatorsWhen a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creators
 
Serendipitous Innovation with Academic English Resources
Serendipitous Innovation with Academic English ResourcesSerendipitous Innovation with Academic English Resources
Serendipitous Innovation with Academic English Resources
 
Setting a Precedent with Open Resources Development in English for Specific A...
Setting a Precedent with Open Resources Development in English for Specific A...Setting a Precedent with Open Resources Development in English for Specific A...
Setting a Precedent with Open Resources Development in English for Specific A...
 
Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...
 
Designing Open Linguistic Support
Designing Open Linguistic SupportDesigning Open Linguistic Support
Designing Open Linguistic Support
 
A story of reuse: Open Oxford resources for ELT
A story of reuse: Open Oxford resources for ELTA story of reuse: Open Oxford resources for ELT
A story of reuse: Open Oxford resources for ELT
 
Crowdsourcing Open Corpus-based Resources for EAP
Crowdsourcing Open Corpus-based Resources for EAPCrowdsourcing Open Corpus-based Resources for EAP
Crowdsourcing Open Corpus-based Resources for EAP
 
Making digital open educational resources for EAP
Making digital open educational resources for EAPMaking digital open educational resources for EAP
Making digital open educational resources for EAP
 
Braving OER battles in Brazil
Braving OER battles in BrazilBraving OER battles in Brazil
Braving OER battles in Brazil
 
Emancipatory English in India
Emancipatory English in IndiaEmancipatory English in India
Emancipatory English in India
 
Vietnam’s Open University rising dragon
Vietnam’s Open University rising dragonVietnam’s Open University rising dragon
Vietnam’s Open University rising dragon
 
Confucian dynamism in the Chinese ELT context
Confucian dynamism in the Chinese ELT contextConfucian dynamism in the Chinese ELT context
Confucian dynamism in the Chinese ELT context
 

Recently uploaded

Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
EADTU
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
 
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
 
Play hard learn harder: The Serious Business of Play
Play hard learn harder:  The Serious Business of PlayPlay hard learn harder:  The Serious Business of Play
Play hard learn harder: The Serious Business of Play
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17
 
Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
Our Environment Class 10 Science Notes pdf
Our Environment Class 10 Science Notes pdfOur Environment Class 10 Science Notes pdf
Our Environment Class 10 Science Notes pdf
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food Additives
 

Sharing an Open Methodology for Building Domain-specific Corpora for EAP

  • 1. Sharing an Open Methodology for Building Domain-specific Corpora for EAP Martin Barge, William Tweddle, Saima Sherazi, Alannah Fitzgerald http://creativecommons.org/weblog/entry/35165/
  • 2. Outline • FLAX Language Project at Waikato University • Developing an EAP Resource Interface between Traditional EAP and Massive Open Online Courses • Developing ESAP Collections in FLAX (Academic English for Law at QMUL) – What’s in the Demo Collection and What’s to Come! – Formatting Open Access Articles for FLAX Corpora • Fully Open Texts – Beyond Parsing with Text Augmentation & Linked Data – Lexical Bundles, Collocations, Wordlists, Cherry Picking Functions – Building in Interactivity • Design-based Research with FLAX, Queen Mary and the OER Research Hub – Research & Development Cycles with Design-based Research for Iterating Collections Development – Rapid Prototyping of Online Demo Collections to Evaluate the Design Process and to Share with Stakeholders
  • 3. FLAX Language at Waikato University http://flax.nzdl.org FLAX image by permission of non-commercial reuse by Jane Galloway
  • 4. FLAX Language Project at the Greenstone Digital Library Lab, Waikato University NZ Professor Ian Witten FLAX Project Lead Dr Shaoqun Wu FLAX Project Lead Researcher & Developer
  • 5. QM’s Critical Thinking & Writing in Law • Queen Mary’s Critical Thinking and Writing in Law (CTWL) Programme has been running successfully for over 7 years. • It is delivered by QM Language Centre’s EAP/ESAP team as part of the Insessional provision. • Over 600-800 LLM students enroll on it every year. • A team of 6-7 EAP tutors teach on it, and are under constant pressure to develop better and new materials for their high calibre students.
  • 6. The FLAX System for Subject- Specific Corpus Development Corpus Linguistics – pioneered by Sinclair 1991. DDL – Data-Driven-Learning – term coined by Johns 1991. An empirical method of linguistic enquiry •Used to discover the lexico-grammatical properties of genre or text-type •Used to discover the key terminology given field or discipline – English for Specific Academic Purposes (ESAP) •Used for exploring collocations: “You shall know a word by the company it keeps.” (Frith, 1957:11)
  • 7. Collaboration with Subject Specialists “In the emerging academic literacies approach involving cooperation between subject specialists and writing teachers, the aim is to help the students develop metacognitive awareness of the roles and functions of writing in that discipline, to enable them to stand back from it and observe how it functions, and then to help them gradually participate in the genres, where genre is understood as a constellation of actions rather than a list of formal features.” (Breeze, 2012)
  • 8. Benefits • Inductive – promotes critical thinking • Promotes learner autonomy • Based on evidence, not instinct • Especially relevant for ESP and ESAP Limitations • Need for Ts and Sts to have technical skills to use corpora and concordancers • Need for access to corpora and software programmes • Large amount of data can be overwhelming “Every student is Sherlock Holmes.” (Johns, 2002:108)
  • 10. ESAP Law Collections in FLAX Type of media in the FLAX Law Collections Number and source of items in the FLAX Law Collections Podcast audio files & transcripts (OpenSpires) 10-15 Lectures (Oxford Law Faculty & the Centre for Socio-Legal Studies) MOOC lecture transcripts & videos (streamed via YouTube & Vimeo) 4 MOOC Collections: Copyright Law (Harvard/edX), English Common Law (Uni. of London/Coursera), Age of Globalization (Texas at Austin/edX), Environmental Law & Politics (OpenYale) Student PhD thesis writing and Pre-sessional for Law ESAP essay writing 70 QMUL EThoS Theses at the British Library (Open Access but not licensed with Creative Commons – will need permission to develop for Non- Commercial Educational & Research purposes); 20+ Essays from QMUL Law Pre-sessional Open Access research articles (relevant to QMUL Law and EAP for Law and Globalisation) 40 Articles (DOAJ - Directory of Open Access Journals)
  • 13. Text Augmentation + Text Parsing
  • 14. Law Corpus Wikify Function in FLAX
  • 15. Wordlist from OA Articles
  • 17. Linking Collocations in Law-Specific Corpus to Reference Collections in FLAX (BNC, BAWE, Wikipedia)
  • 18. Lexical Bundles from Law Lectures
  • 21. FLAX Do-It-Yourself Podcast Corpora with Oxford OER http://www.youtube.com/watch?v=Si24d3Z-8nQ
  • 22. FLAX Do-It-Yourself Podcast Corpora 2: Building interactivity into your collections http://www.youtube.com/watch?v=fysDzYjbhh0
  • 26. Drag ‘n’ Drop exercises in FLAX
  • 28. Automated Collocations Guessing in FLAX (drawing on the British National Corpus)
  • 29. Design-Based Research Cycles with FLAX, the OER Research Hub & Queen Mary • Practitioners/Researchers involved in iterative development of ESAP language collections – Interfacing with open Law resources Open Access articles, Open Government research reports with contributions from QMUL Law professors, Case Law, Open lectures, Openly-licensed student writing – Developing expertise with open tools and resources – Developing interaction within the corpus and derivatives from the corpus – Documenting the collections development process for sharing across the EAP and Open Education sectors
  • 30. Free to Do Whatever You Want • Open Resources for EAP Soup Dragons: – Building ESAP Corpora – Developing Interactivity into ESAP Corpora – Developing ESAP Course Book and Lesson Plan Derivatives – Researching and Developing ESAP Corpora & Derivatives – Researching and Developing Corpus Tools e.g. Interfaces, Text Augmentation and Linked Data Approaches http://en.wikipedia.org/wiki/The_Soup_Dragons
  • 31. Thank You FLAX Language Project flax.nzdl.org Shaoqun Wu: shaoqun@waikato.ac.nz / Ian Witten: ihw@cs.waikato.ac.nz OER Research Hub http://oerresearchhub.org/ Alannah Fitzgerald: a_fitzg@education.concordia.ca; @AlannahFitz; www.alannahfitzgerald.org TOETOE Blog; Slideshare: http://www.slideshare.net/AlannahOpenEd/ The Language Centre – Queen Mary University of London http://language-centre. sllf.qmul.ac.uk/ Martin Barge m.i.barge@qmul.ac.uk William Tweddle w.tweddle@qmul.ac.uk Saima Sherazi s.n.sherazi@qmul.ac.uk

Editor's Notes

  1. TIRF is The Int. Research Foundation of English language education that is partly funding Alannah to be in the UK to work with QMUL along with the OER Research Hub
  2. CTWL students are target users at QMUL for the Law collections in FLAX in addition to the Pre-Sessional Law-strand students on the summer programme at QMUL. An additional target user group are the MOOC learners registered on the courses where we have reused their MOOC lectures for this corpus.
  3. The Access limitation with corpus-based approaches is dealt with at 3 levels by FLAX: Free and Open Access of the software (and the code) and most of the corpus resources used. In the case of this Law corpus all resources are open and free. Accessible interfaces in FLAX that have been designed for the non-expert corpus user, namely language teachers and learners and anybody wanting linguistic support with specific academic resources, here in the case of MOOC learners who are not registered language learners. FLAX avoids the complex querying language which most corpus-based tools rely on users understanding. Accessible Open Educational Resources and Open Access resources that can be further used in the development of corpus-based derivatives for classroom use as exemplified with this Law ESAP corpus.
  4. Aiming for flexible ESAP (English for Specific Academic Purposes) resources for uptake in traditional classroom-based EAP and in online and open education, including MOOCs.
  5. Less than half of all Open Access journals are published using Creative Commons licenses so this is where Open Educational Resources and Open Source Software have more in common than they do with OA. But there are OA journals we can use and most of which are published under the most flexible Creative Commons licenses e.g. CC-BY with only a few being the most restrictive e.g. CC-ND. Depending on the field there will be less or more OA journals. There are not many OA journals for Law but there are many Openly-licensed government papers in the field of Law. We will look at adding samples of these also in future. Being able to show demo corpora like the one we are building in FLAX online, enables us to explain to e.g. the British Library, what our intended uses are for theses writing for NC Educational and Research Development purposes for ESAP.
  6. OA articles have been pre-formatted by journals and to remove this formatting is somewhat of a challenge. Martin Barge has developed the first iteration of a formatting OA tool which can export text sections with relevant code into html format for use in FLAX. More iterations of this tool will be developed as we continue to rebuild the corpus.
  7. Text augmentation – linking in other data resources, here Wikipedia, to enhance the efficacy of the corpus
  8. A further example of text augmentation, whereby the smaller subject-specific corpus is linked to larger corpora (The British National Corpus, The British Academic Written English Corpus and Wikipedia as a Corpus) and further resources (Roget’s Thesaurus, Wikipedia,Wiktionary) for comparison across relevant corpora and for further linguistic support for key terms and phrases.
  9. Open resources that you can do whatever you want with: corpus building and online activities based on the corpus with open source software as in the FLAX project; developing course book derivatives from the open resources; researching the effectiveness of the corpus for future iterations of collections building and interface designs.
  10. Please add your contact detes here, QMUL peeps!