The British National Corpus (BNC) is one of the most important corpora in the field of linguistics. It contains British English data from the late twentieth century and covers a variety of genres.
CREATION OF THE BRITISH NATIONAL CORPUS (BNC)
The project was developed by a consortium called the BNC Consortium, with the support of the British Library and the British Academy. The academic research centres involved include the University Centre for Computer Corpus Research on Language and the Oxford University Computing Services.
Construction of the corpus began in 1991 and was finished in 1994. Although no more texts were added after that, the corpus was revised in 2001 with the publication of BNC World, and again in 2007 with a new edition called the BNC XML Edition.
Two smaller sub-corpora have been drawn from the BNC:
● The BNC Sampler is a collection of two million words, one million written and one million spoken.
● The BNC Baby collects four one-million-word samples, each belonging to a different genre.
The British National Corpus follows the Guidelines of the Text Encoding Initiative. Two parts make up the corpus:
● Written part (90%): It covers data from several sources such as books, periodicals, brochures and leaflets. It also includes regional and national newspapers, journals for all ages and interests, academic books, popular fiction, university essays, etc.
● Spoken part (10%): It is drawn from orthographic transcriptions of informal conversations and of spoken language collected in different contexts.
WHY USE THE BRITISH NATIONAL CORPUS (BNC)?
The BNC can be used to discover aspects of a word we did not know about and to check our assumptions about its meaning. In other words, the corpus helps us find out what a word actually means in use, not just what we think it means. The BNC offers many options, for instance checking whether a word collocates with another set of words or whether it is grammatically correct in specific contexts.
If we look for the word “bent” plus the preposition “on”, the BNC indicates that this combination of words appears in a specific context. From a grammatical point of view, the British National Corpus shows that “bent on” is followed by a noun or noun phrase, or by a verb with the suffix -ing. Let’s look at it in the next image:
HOW TO USE THE BRITISH NATIONAL CORPUS
There are two ways of using the British National Corpus, which differ in complexity:
● Xaira: It can be used to check the spelling of a word, to compare different variants and measure their frequency of use, and to check whether a certain word is part of the BNC.
● The BNC Simple Search: It is a quick way of searching for a word or phrase. This type of search can be used to check the spelling of a word and also to compare the frequency and variants of a word.
If we use the BNC Simple Search, we type the word or phrase we want to find in the search box. Once the word or phrase has been searched, a list of up to 50 selected instances appears on the screen, headed by a note of the total frequency of the word or phrase.
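The behaviour described above (a total frequency plus at most 50 example instances) can be sketched in a few lines of Python. This is only an illustration on a toy list of sentences: the function name simple_search, the random choice of instances and the toy data are assumptions, not the BNC's own software.

```python
import random

def simple_search(sentences, phrase, limit=50, seed=0):
    """Return (total_hits, sample) for a word or phrase (case-insensitive).

    Hypothetical helper: it only imitates the idea of reporting a total
    frequency and showing at most `limit` instances.
    """
    needle = phrase.lower().split()
    hits = []
    for sentence in sentences:
        words = sentence.lower().split()
        # Slide a window of the phrase length over the sentence.
        for i in range(len(words) - len(needle) + 1):
            if words[i:i + len(needle)] == needle:
                hits.append(sentence)
    total = len(hits)
    # Show every hit if there are few, otherwise a sample of `limit` hits.
    rng = random.Random(seed)
    sample = hits if total <= limit else rng.sample(hits, limit)
    return total, sample

toy_corpus = [
    "he was bent on winning the race",
    "she seemed bent on revenge",
    "the nail was bent",
]
total, sample = simple_search(toy_corpus, "bent on")
print(total, "hits;", len(sample), "shown")
```

With a phrase that occurs more than 50 times, the total still reports every occurrence while the list of instances stays at 50.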
For more complex queries, special characters can be added to the search terms. The _ character matches any single word, the = character restricts a word to a given part of speech, and braces {} enclose a pattern (a regular expression) that the word must match.
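To make the three special characters more concrete, here is a toy matcher in Python. It is a sketch under stated assumptions: the functions term_matches and find, the word=TAG form and the small tagged sentence are invented for illustration, and the real Simple Search has its own rules that this does not reproduce.

```python
import re

def term_matches(term, word, tag):
    """Check one query term against one (word, part-of-speech tag) pair."""
    if term == "_":                      # underscore: any single word
        return True
    if term.startswith("{") and term.endswith("}"):
        # braces: the word must match the enclosed regular expression
        return re.fullmatch(term[1:-1], word) is not None
    if "=" in term:                      # word=TAG: restrict part of speech
        wanted_word, wanted_tag = term.split("=", 1)
        return word == wanted_word and tag == wanted_tag
    return word == term                  # plain word: exact match

def find(query, tagged):
    """Return every stretch of `tagged` that matches the query terms in order."""
    terms = query.split()
    found = []
    for i in range(len(tagged) - len(terms) + 1):
        window = tagged[i:i + len(terms)]
        if all(term_matches(t, w, g) for t, (w, g) in zip(terms, window)):
            found.append(" ".join(w for w, _ in window))
    return found

# Toy sentence with illustrative part-of-speech tags.
tagged = [("they", "PNP"), ("sing", "VVB"), ("and", "CJC"), ("she", "PNP"),
          ("sang", "VVD"), ("a", "AT0"), ("song", "NN1")]

print(find("she _ a", tagged))
print(find("{s[iau]ng}", tagged))
print(find("sing=VVB", tagged))
```

The first query uses _ for any word between "she" and "a", the second uses braces for a pattern, and the third restricts "sing" to one part of speech.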
In addition, when we look for a word, the “display” section of the screen offers four options: LIST, CHART, KWIC and COMPARE. There are three more options under the label “search string”: word, collocation and POS list.
There is also a section called “sorting and limits”. Results can be sorted by frequency, by relevance or in alphabetical order.
The corpus includes several categories or labels for texts of different kinds: “spoken”, “fiction”, “magazine”, “newspaper” or “non-academic texts”.
For instance, if we look for the word “couch”, the corpus shows us that this word collocates with different words: lying, lay, room, potato, etc. After clicking on one of these words, several examples appear on the screen. So the corpus allows us to look for a word or phrase and, at the same time, to find its collocations. Looking for a collocation is as easy as typing the word we want to search for; an asterisk automatically appears in the collocation box. Once the search has been run, the corpus displays a list of words which collocate with the word. Let’s see:
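The idea of a collocation search can be sketched as counting the words that appear near the search word. The Python below is a minimal illustration on three invented sentences; the function name collocates and the window of four words on each side are assumptions, and the online interface may rank collocates differently.

```python
from collections import Counter

def collocates(sentences, node, window=4):
    """Count the words found within `window` words of `node` (hypothetical helper)."""
    counts = Counter()
    for sentence in sentences:
        words = sentence.lower().split()
        for i, word in enumerate(words):
            if word == node:
                start = max(0, i - window)
                # Neighbours to the left and to the right, excluding the node itself.
                neighbours = words[start:i] + words[i + 1:i + 1 + window]
                counts.update(neighbours)
    return counts

toy_corpus = [
    "he was lying on the couch",
    "she lay on the couch all day",
    "a real couch potato",
]
counts = collocates(toy_corpus, "couch")
for word, n in counts.most_common(5):
    print(word, n)
```

In raw counts like these, very common words such as "the" and "on" come first, which is one reason corpus tools usually offer ways to sort or limit the list of collocates.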
The KWIC (Key Word in Context) search enriches the corpus because it helps the person who is looking for a word to see in which grammatical structures and contexts the word appears. For example, if we look for the word “shoes”, the corpus shows in colours the different words which are used with it: “A new pair of”, “the soles of our”, “the second hand”, “new polished”, “thousands of”, etc.
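A KWIC display can be imitated with a short Python function that puts the search word in the middle and a few words of context on each side. This is a sketch only: the function name kwic, the context width of three words and the two example sentences are assumptions made for illustration.

```python
def kwic(sentences, node, width=3):
    """Return (left context, keyword, right context) for each hit of `node`."""
    lines = []
    for sentence in sentences:
        words = sentence.split()
        for i, word in enumerate(words):
            if word.lower() == node:
                left = " ".join(words[max(0, i - width):i])
                right = " ".join(words[i + 1:i + 1 + width])
                lines.append((left, word, right))
    return lines

toy_corpus = [
    "She bought a new pair of shoes yesterday",
    "The soles of our shoes were worn",
]
for left, keyword, right in kwic(toy_corpus, "shoes"):
    # Right-align the left context so the keywords line up in a column.
    print(f"{left:>20} [{keyword}] {right}")
```

Lining the keyword up in one column is what makes recurring patterns to its left and right easy to spot.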
COMPARISON BETWEEN THE BRITISH NATIONAL CORPUS AND THE COCA
In terms of size there is a huge difference between the two corpora, as the COCA (Corpus of Contemporary American English) is about four times bigger than the BNC. The COCA is made up of more than 410 million words, whereas the BNC covers 100 million words. As for composition, the COCA is divided into spoken language, fiction, popular magazines, newspapers and academic texts, and each of those genres makes up 20% of the total.
The BNC, however, is strictly divided into 90% written and 10% spoken English. The COCA also contains more recent material, since it has continued to be updated, while the BNC focuses more on everyday language.
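The figures in this comparison can be turned into approximate word counts with simple arithmetic. The short Python calculation below uses only the rounded numbers given in these slides (100 million words, 410 million words, 90%/10% and 20% per genre), so the results are estimates rather than official totals.

```python
BNC_WORDS = 100_000_000     # size of the BNC as given in the slides
COCA_WORDS = 410_000_000    # "410+ million" words, taken as a lower bound

bnc_written = BNC_WORDS * 90 // 100      # 90% written
bnc_spoken = BNC_WORDS * 10 // 100       # 10% spoken
coca_per_genre = COCA_WORDS * 20 // 100  # each genre is 20% of the total
size_ratio = COCA_WORDS / BNC_WORDS      # how many times bigger the COCA is

print(f"BNC written: {bnc_written:,} words")
print(f"BNC spoken: {bnc_spoken:,} words")
print(f"COCA per genre: {coca_per_genre:,} words")
print(f"COCA / BNC: {size_ratio:.1f}")
```

On these numbers, a single COCA genre is several times larger than the whole spoken part of the BNC.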
