This document summarizes a research study that analyzed differences in frequencies of linking verbs and relative pronouns in written language. The study aimed to analyze two types of linguistic associations: lexical-grammatical associations and grammatical-grammatical associations. For lexical-grammatical analysis, four linking verbs (be, seem, look, feel) were analyzed to identify differences in occurrences. For grammatical-grammatical analysis, two relative pronouns (who, that) were analyzed when used as subjects in restrictive adjective clauses to identify differences in occurrences. The findings showed no significant differences between linking verb occurrences, though "be" occurred most frequently and "look" least frequently. Significant differences were found between relative pronoun occurrences, with "who
American Journal of Multidisciplinary Research and Development is indexed, refereed and peer-reviewed journal, which is designed to publish research articles.
This document discusses how to design, acquire, and process a collection of linguistic data to form the raw material for a dictionary. It explains that a reliable dictionary requires evidence from language used in real communicative acts. While introspection and informant testing provide some evidence, they are limited due to subjectivity. Therefore, observation of language in use through large text corpora is indispensable. Key considerations in corpus design include size, inclusion of different text types and styles to avoid bias, and ensuring representativeness.
This research describes an attempt to establish a pedagogically useful list of the most frequent semantically non-compositional multi-word combinations for English for Journalism learners in an EFL context, who need to read English news in their field of study. The list was compiled from the NOW (News on the Web) Corpus, the largest English news database by far. In consideration of opaque multi-word combinations in widespread use and pedagogical value, the researcher applied a set of selection criteria when using the corpus. Based on frequency, meaningfulness, and semantic non-compositionality, a total of 318 non-compositional multi-word combinations of 2 to 5 words with the exclusion of phrasal verbs were selected and they accounted for approximately 2% of the total words in the corpus. The list, not highly technical in nature, contains the most commonly-used multi-word units traversing various topic areas and news readers may encounter these phrasal expressions very often. As with other individual word lists, it is hoped that this opaque expressions list may serve as a reference for English for Journalism teaching.
This document summarizes a study that examines the relationship between oral and written discourse in second language English speakers. The study hypothesizes that there is a positive working relationship between the two. Data was collected from a French doctoral student, including a writing sample and recorded speech sample. While the results showed a weak relationship, analysis of the speech without fillers revealed native-like patterns. The document reviews theories on language acquisition and the influence of first language on second language learning.
This document provides an overview of various lexical concepts that can be investigated using a corpus including: phrase, phraseology, collocation, colligation, word formation processes (compounding, blending, derivation, acronyms, clipping), lexical sets (synonyms, antonyms), polysemy, and conversion between word classes. It defines each concept and provides examples found in a corpus, typically including the word and the number of concordance hits. The document discusses how corpora can provide objective evidence to study the frequency and patterns of lexical items in a language.
Corpus Approaches to the Language of Literature 2008Martin Wynne
An overview of current approaches to the study of literature which make use of the techniques, tools and resources of corpus linguistics. Written and presented in 2008.
American Journal of Multidisciplinary Research and Development is indexed, refereed and peer-reviewed journal, which is designed to publish research articles.
This document discusses how to design, acquire, and process a collection of linguistic data to form the raw material for a dictionary. It explains that a reliable dictionary requires evidence from language used in real communicative acts. While introspection and informant testing provide some evidence, they are limited due to subjectivity. Therefore, observation of language in use through large text corpora is indispensable. Key considerations in corpus design include size, inclusion of different text types and styles to avoid bias, and ensuring representativeness.
This research describes an attempt to establish a pedagogically useful list of the most frequent semantically non-compositional multi-word combinations for English for Journalism learners in an EFL context, who need to read English news in their field of study. The list was compiled from the NOW (News on the Web) Corpus, the largest English news database by far. In consideration of opaque multi-word combinations in widespread use and pedagogical value, the researcher applied a set of selection criteria when using the corpus. Based on frequency, meaningfulness, and semantic non-compositionality, a total of 318 non-compositional multi-word combinations of 2 to 5 words with the exclusion of phrasal verbs were selected and they accounted for approximately 2% of the total words in the corpus. The list, not highly technical in nature, contains the most commonly-used multi-word units traversing various topic areas and news readers may encounter these phrasal expressions very often. As with other individual word lists, it is hoped that this opaque expressions list may serve as a reference for English for Journalism teaching.
This document summarizes a study that examines the relationship between oral and written discourse in second language English speakers. The study hypothesizes that there is a positive working relationship between the two. Data was collected from a French doctoral student, including a writing sample and recorded speech sample. While the results showed a weak relationship, analysis of the speech without fillers revealed native-like patterns. The document reviews theories on language acquisition and the influence of first language on second language learning.
This document provides an overview of various lexical concepts that can be investigated using a corpus including: phrase, phraseology, collocation, colligation, word formation processes (compounding, blending, derivation, acronyms, clipping), lexical sets (synonyms, antonyms), polysemy, and conversion between word classes. It defines each concept and provides examples found in a corpus, typically including the word and the number of concordance hits. The document discusses how corpora can provide objective evidence to study the frequency and patterns of lexical items in a language.
Corpus Approaches to the Language of Literature 2008Martin Wynne
An overview of current approaches to the study of literature which make use of the techniques, tools and resources of corpus linguistics. Written and presented in 2008.
This document discusses the use of corpus linguistics in language teaching and learning. It provides examples of how corpora can be used for vocabulary acquisition, grammar instruction, and developing classroom activities. Specific corpora mentioned include the British National Corpus and the International Corpus of Learner English. Benefits of corpus linguistics include helping learners understand typical language use and aiding in lifelong language learning.
David hirsh current perspectives in second language vocabulary research 2012Sokreth Dos
This chapter discusses current themes and possible new directions in second language vocabulary research. Key current themes explored include measuring vocabulary knowledge through vocabulary size and depth tests, investigating the nature of word knowledge along receptive-productive and partial-precise dimensions, examining the effects of incidental and explicit learning, and identifying suitable words for language learning through word lists. Potential new areas of research include better understanding the transfer of word knowledge from receptive to productive use, the concept of partial versus precise word knowledge, measuring differences in depth of vocabulary knowledge, and exploring vocabulary as interconnected semantic networks rather than isolated word forms.
The document discusses lexical bundles, which are frequently occurring multi-word expressions in a given register or genre. Lexical bundles are not complete grammatical structures or idioms, but rather bridge phrases. They serve important functions in constructing discourse and indicating success within discourse communities. The document provides examples of lexical bundles from conversation and academic prose and categorizes them based on structural patterns and functional classification. It also discusses the importance of teaching lexical bundles to language learners.
The document provides background information on derivational processes in the Matbat language. It begins by discussing how language is used for communication between humans and as an identifier of one's cultural background. It then notes that morphology, the study of how words are formed, is divided into inflection and derivation. The author examines the derivational process in the Matbat language by looking at changes in word class, such as from adjective to verb or noun. Some examples of derivational suffixes changing nouns in Matbat are provided. The purpose of the research is to investigate this derivational process in the language. It is expected that the results will provide useful information for language learners and teachers. The scope is limited to looking at changes between adjective
The Ekegusii Determiner Phrase Analysis in the Minimalist ProgramBasweti Nobert
Among some of the recent syntactic developments, the noun phrase has been reanalyzed
as a determiner phrase (DP). This study analyses the Ekegusii determiner
phrase (DP) with an inquiry into the relationship between agreement of the INFL
(sentence) and concord in the noun phrase (determiner phrase). It hypothesizes that
the Ekegusii sentential Agreement has a symmetrical relationship with the Ekegusii
Determiner Phrase internal concord and feature checking theory and full
interpretation (FI) in the Minimalist Program is adequate in the analysis of the
internal structure of the Ekegusii DP. In employing the Minimalist Program (MP),
the study shall first seek to establish the domain of the NP in the Ekegusii DP and
go ahead to do an investigation into the adequacy of the Minimalist Program in
analyzing the Ekegusii DP. This study is also geared towards establishing the order
of determiners in the DP between the D-head and the NP complement. The study
concludes that the principles of feature checking and full interpretation in the
minimalist program are mutually crucial in ensuring that Ekegusii constructions (DP
and even the sentence) are grammatical (converge). This emphasizes the fact that
the MP is adequate in Ekegusii DP analysis.
Ideational Grammatical Metaphor in Scientific Texts: A Hallidayan PerspectiveBahram Kazemian
This document summarizes a research paper that analyzes the use of ideational grammatical metaphor in scientific texts using a Hallidayan systemic functional linguistics perspective. The paper examines nominalization frequency and process types through an analysis of a corpus of 10 scientific texts. The analysis found that scientific texts highly utilize nominalization as a form of ideational grammatical metaphor. This allows scientific writing to achieve technicality and rationality through more abstract and condensed information presentation.
Borrowings in texts a case of tanzanian newspapersAlexander Decker
This document summarizes a study on the use of borrowed English words in Tanzanian newspapers from 2007 to 2009. The study analyzed over 400 borrowed words collected from newspapers and found that most were nouns. It also examined how newspaper writers integrated these borrowed words into Kiswahili through spelling and phonological adaptations. The study found that writers mostly used Kiswahili orthography when writing borrowed words and made phonological changes to follow Kiswahili syllable structure and consonant rules. Interviews with writers found that borrowed words were used for concepts that Kiswahili lacked words for or when Kiswahili words were lengthy.
Investigating the Difficulties Faced by Iraqi EFL Learners in Using Light Verbs
Zahraa Adnan Fadhil Al-Murib,
Department of English, College of Education, Islamic University, Babylon, Iraq
This study deals with light verb constructions as being syntactically functional and semantically bleached. It aims at investigating the difficulties faced by Iraqi EFL Learners (fourth-year university students) in recognizing and producing this kind of constructions. Thus, to achieve this aim, it is hypothesized that Iraqi EFL Learners are unable to collocate the light verb with its proper noun phrase both at the recognition and the production levels. And in order to achieve the aim of the study and examine its hypothesis, the following procedures are adopted: (1) Presenting a theoretical background about light verb constructions including their definition, structure, and their relationship with collocations, (2) Designing a diagnostic test to investigate the difficulties that might face the learners while being exposed to these constructions, (3) Analyzing the findings of the test to reveal the learner’s recognition and performance in dealing with light verb constructions.
Keywords: Light Verb Constructions, Collocations, Difficulties
The Sixth International Conference on Languages, Linguistics, Translation and Literature
9-10 October 2021 , Ahwaz
For more information, please visit the conference website:
WWW.LLLD.IR
1. The document discusses differing perspectives on the role of Universal Grammar (UG) in second language acquisition (SLA). Specifically, it examines whether interlanguage (IL) representation is constrained by UG.
2. There has been debate around terms like "direct access" and "full access" to UG in SLA research. Some argue L2 learners can reset UG parameters to new settings, while others argue they are restricted to their L1 settings.
3. The document argues that research should focus more on describing the nature of IL representations in their own right, rather than constantly comparing them to native speaker grammars of the L1 or L2. Comparing ILGs to other grammars
Corpus linguistics refers to large collections of natural language texts that can be analyzed using computational techniques. Creating corpora involves collecting and transcribing written or spoken texts, which is time-consuming. Well known corpora include the British National Corpus and the Lancaster-Oslo-Bergen Corpus. There are different types of corpora depending on the research questions or topics being investigated. Analyzing corpora can provide insights into patterns of language use across different text types and contexts. Teachers and material developers can use corpus analyses to inform instruction and claims about linguistic features.
I discuss the basics of corpus linguistics, the application of corpus linguistics on linguistic studies and second language learning, as well as some freely available corpus linguistics resources for beginner corpus linguists.
Citation: Zubaidi, N. (2021). Corpus linguistics: An introduction. UM de Universe 2021. doi: 10.13140/RG.2.2.25479.11683
Code-Mixing as a Marker of Gender Identity in SMS language in Pakistaniosrjce
This research deals with the gender based code-mixing in mobile phone SMS texting in Pakistani
society. It is generally observable fact that code-mixing is used by both males and females while sending
messages through their mobile phones. This research has explored the happening of code mixing i.e. mixing of
the two varieties or two different languages, which can mark the gender differences in using text messages. In
this research paper quantitative method is used to collect and analyze the data. One thousand non-forwarded
conversational SMS were collected through random sampling of 25 males and 25 females with an equal
quantity of sample from both counter parts. The findings of the study have shown that there are gender based
differences in SMS language used in Pakistan on the basis of Code-Mixing in term of frequency, nature and
topic & purpose of conversation. The findings are generalized for all males and females citizens of Pakistan.
This paper analyses the structure patterns of code-switching quantitatively and qualitatively based on EFL classroom discourse. Through the detailed analysis, the paper finds that there are different structure patterns in which teachers often switch their codes in English classroom. These structure patterns are reflected in different language levels: words and phrases level, clausal and sentence level. The functions of code-switching are determined by those structure patterns that teachers will choose for different purposes in the process of teaching.
کتیب الملخصات - المؤتمر الدولي السادس حول القضايا الراهنة للغات، علم اللغة، الترجمة و الأدب
9-10 أكتوبر 2021 ، الأهواز
لمزید من المعلومات، ﯾرﺟﯽ زﯾﺎرة ﻣوﻗﻌﻧﺎ اﻹﻟﮐﺗروﻧﻲ : WWW.LLLD.IR
لا تتردد فی مراسلتنا للاجابة عن ای استفسارات.
اللجنة المنظمة للمؤتمر،
الأهواز / الصندوق البريدی 61335-4619:
الهاتف :32931199-61 (98+)
الفاکس:32931198-61(98+)
النقال و رقم للتواصل عبر الواتس اب : 9165088772(98+)
WWW.LLLD.IR، البريد اﻹﻟﮑﺘﺮوﻧﻲ: info@pahi.ir
Corpus linguistics is the study of language through large collections of real-world texts called corpora. It is empirical, analyzing actual language usage patterns using computers. Corpora come in many types but aim to represent language broadly through balanced samples of different genres. Building a corpus involves collecting, storing, and organizing electronic text samples. Basic analyses of corpora can reveal word frequencies and linguistic patterns through tools like concordances.
The document discusses corpus linguistics and different types of corpora. It defines corpus linguistics as the study of language based on large collections of electronic texts, known as corpora. It describes general corpora, specialized corpora, historical/diachronic corpora, regional corpora, learner corpora, multilingual corpora, comparable corpora, and parallel corpora. It also discusses corpus annotation, concordancing, frequency and keyword lists, collocation, and software used for corpus analysis.
This document outlines the corpus research method for analyzing the use of the present perfect versus the simple past in American English. It discusses using the Corpus of Contemporary American English (COCA) to study how adverbs like "already" and "yet" influence whether the present perfect or simple past is used. The document also provides information on how to properly use COCA for linguistic research, noting the large amount of data it contains and things for researchers to be aware of like the written source of most of the language data.
The document discusses the usefulness of corpora for language teaching and learning. It explains that corpora allow researchers to make generalizations about language as a whole by analyzing large collections of authentic texts. In contrast to relying only on intuitions, corpora provide evidence of real language usage through numerous examples from different contexts. This helps address misconceptions and test assumptions. The document also provides guidance on developing classroom corpora and considerations for corpus design and effective use in materials and investigations.
Corpus linguistics is the study of language based on large collections of real-world language samples stored electronically. It allows for reliable, accurate, and replicable analysis of language at a large scale and in new ways not previously possible. A corpus is a large collection of written or spoken language samples that is stored electronically and can be analyzed using specialized software. Corpus linguistics provides insights into language usage that were previously difficult to obtain at a large scale through computer-assisted analysis of large text collections.
Introductory lecture on Corpus Linguistics. Contents: Corpus linguistics: past and present, What is a corpus?, Why use computers to study language? Corpus-based vs. Intuition-based approach, Theory vs. Methodology.
This lecture was based on McEnery et al. 2006. Corpus-based Language Studies. An Advanced resource book. Routlege.
Applying Corpus-Based Findings To Form-Focused Instruction The Case Of Repor...Justin Knight
The document summarizes a study that compared how reported speech is presented in English language textbooks versus how it is used in real language based on corpus data. The study found that textbooks focus mainly on indirect reported speech and backshifting verbs to past tense, but neglect many structures used in natural language. Corpus data from two studies of reported speech was then used to identify 10 principles for how it should be taught. The document proposes applying these corpus-based findings to design classroom activities and materials for teaching reported speech that are grounded in theories of second language acquisition.
UNIVERSALITY IN TRANSLATION: AN ANALYSIS OF TRANSLATION INTERFERENCE IN MULTI...John1Lorcan
Universality in translation has been a research focus in translation studies since the publication of Mona
Baker’s seminal paper in 1990s. The relevant studies mainly explore the various universality rules
governing translation in written texts, however, little attention has been devoted to translation universality
in translating multi-modal texts. This study, with Systemic Functional Grammar and Visual Grammar as
theoretical framework, verifies the validity of Law of Interference, one of the universal translation rules, in
multimodal texts. By statistically comparing transitivity, theme choice, thematic progression and
information value of two magazine articles and their translations, the paper finds that most make-up of the
source text is transferred into the target text or rendered into existing patterns, and that more negative
transfers are observed in transitivity, split theme TP and information value when translating from a major
language. The paper concludes that the Law of Interference is stronger in visual mode while weaker in
verbal mode. The present study shows that the law of interference can be observed in translating
multimodal texts. It also implies that the translation of multimodal texts makes more visible the power of
English, and it helps to perpetuate the hegemony of English in the global world.
This document discusses the use of corpus linguistics in language teaching and learning. It provides examples of how corpora can be used for vocabulary acquisition, grammar instruction, and developing classroom activities. Specific corpora mentioned include the British National Corpus and the International Corpus of Learner English. Benefits of corpus linguistics include helping learners understand typical language use and aiding in lifelong language learning.
David hirsh current perspectives in second language vocabulary research 2012Sokreth Dos
This chapter discusses current themes and possible new directions in second language vocabulary research. Key current themes explored include measuring vocabulary knowledge through vocabulary size and depth tests, investigating the nature of word knowledge along receptive-productive and partial-precise dimensions, examining the effects of incidental and explicit learning, and identifying suitable words for language learning through word lists. Potential new areas of research include better understanding the transfer of word knowledge from receptive to productive use, the concept of partial versus precise word knowledge, measuring differences in depth of vocabulary knowledge, and exploring vocabulary as interconnected semantic networks rather than isolated word forms.
The document discusses lexical bundles, which are frequently occurring multi-word expressions in a given register or genre. Lexical bundles are not complete grammatical structures or idioms, but rather bridge phrases. They serve important functions in constructing discourse and indicating success within discourse communities. The document provides examples of lexical bundles from conversation and academic prose and categorizes them based on structural patterns and functional classification. It also discusses the importance of teaching lexical bundles to language learners.
The document provides background information on derivational processes in the Matbat language. It begins by discussing how language is used for communication between humans and as an identifier of one's cultural background. It then notes that morphology, the study of how words are formed, is divided into inflection and derivation. The author examines the derivational process in the Matbat language by looking at changes in word class, such as from adjective to verb or noun. Some examples of derivational suffixes changing nouns in Matbat are provided. The purpose of the research is to investigate this derivational process in the language. It is expected that the results will provide useful information for language learners and teachers. The scope is limited to looking at changes between adjective
The Ekegusii Determiner Phrase Analysis in the Minimalist ProgramBasweti Nobert
Among some of the recent syntactic developments, the noun phrase has been reanalyzed
as a determiner phrase (DP). This study analyses the Ekegusii determiner
phrase (DP) with an inquiry into the relationship between agreement of the INFL
(sentence) and concord in the noun phrase (determiner phrase). It hypothesizes that
the Ekegusii sentential Agreement has a symmetrical relationship with the Ekegusii
Determiner Phrase internal concord and feature checking theory and full
interpretation (FI) in the Minimalist Program is adequate in the analysis of the
internal structure of the Ekegusii DP. In employing the Minimalist Program (MP),
the study shall first seek to establish the domain of the NP in the Ekegusii DP and
go ahead to do an investigation into the adequacy of the Minimalist Program in
analyzing the Ekegusii DP. This study is also geared towards establishing the order
of determiners in the DP between the D-head and the NP complement. The study
concludes that the principles of feature checking and full interpretation in the
minimalist program are mutually crucial in ensuring that Ekegusii constructions (DP
and even the sentence) are grammatical (converge). This emphasizes the fact that
the MP is adequate in Ekegusii DP analysis.
Ideational Grammatical Metaphor in Scientific Texts: A Hallidayan PerspectiveBahram Kazemian
This document summarizes a research paper that analyzes the use of ideational grammatical metaphor in scientific texts using a Hallidayan systemic functional linguistics perspective. The paper examines nominalization frequency and process types through an analysis of a corpus of 10 scientific texts. The analysis found that scientific texts highly utilize nominalization as a form of ideational grammatical metaphor. This allows scientific writing to achieve technicality and rationality through more abstract and condensed information presentation.
Borrowings in texts a case of tanzanian newspapersAlexander Decker
This document summarizes a study on the use of borrowed English words in Tanzanian newspapers from 2007 to 2009. The study analyzed over 400 borrowed words collected from newspapers and found that most were nouns. It also examined how newspaper writers integrated these borrowed words into Kiswahili through spelling and phonological adaptations. The study found that writers mostly used Kiswahili orthography when writing borrowed words and made phonological changes to follow Kiswahili syllable structure and consonant rules. Interviews with writers found that borrowed words were used for concepts that Kiswahili lacked words for or when Kiswahili words were lengthy.
Investigating the Difficulties Faced by Iraqi EFL Learners in Using Light Verbs
Zahraa Adnan Fadhil Al-Murib,
Department of English, College of Education, Islamic University, Babylon, Iraq
This study deals with light verb constructions as being syntactically functional and semantically bleached. It aims at investigating the difficulties faced by Iraqi EFL Learners (fourth-year university students) in recognizing and producing this kind of constructions. Thus, to achieve this aim, it is hypothesized that Iraqi EFL Learners are unable to collocate the light verb with its proper noun phrase both at the recognition and the production levels. And in order to achieve the aim of the study and examine its hypothesis, the following procedures are adopted: (1) Presenting a theoretical background about light verb constructions including their definition, structure, and their relationship with collocations, (2) Designing a diagnostic test to investigate the difficulties that might face the learners while being exposed to these constructions, (3) Analyzing the findings of the test to reveal the learner’s recognition and performance in dealing with light verb constructions.
Keywords: Light Verb Constructions, Collocations, Difficulties
The Sixth International Conference on Languages, Linguistics, Translation and Literature
9-10 October 2021 , Ahwaz
For more information, please visit the conference website:
WWW.LLLD.IR
1. The document discusses differing perspectives on the role of Universal Grammar (UG) in second language acquisition (SLA). Specifically, it examines whether interlanguage (IL) representation is constrained by UG.
2. There has been debate around terms like "direct access" and "full access" to UG in SLA research. Some argue L2 learners can reset UG parameters to new settings, while others argue they are restricted to their L1 settings.
3. The document argues that research should focus more on describing the nature of IL representations in their own right, rather than constantly comparing them to native speaker grammars of the L1 or L2. Comparing ILGs to other grammars
Corpus linguistics refers to large collections of natural language texts that can be analyzed using computational techniques. Creating corpora involves collecting and transcribing written or spoken texts, which is time-consuming. Well known corpora include the British National Corpus and the Lancaster-Oslo-Bergen Corpus. There are different types of corpora depending on the research questions or topics being investigated. Analyzing corpora can provide insights into patterns of language use across different text types and contexts. Teachers and material developers can use corpus analyses to inform instruction and claims about linguistic features.
I discuss the basics of corpus linguistics, the application of corpus linguistics on linguistic studies and second language learning, as well as some freely available corpus linguistics resources for beginner corpus linguists.
Citation: Zubaidi, N. (2021). Corpus linguistics: An introduction. UM de Universe 2021. doi: 10.13140/RG.2.2.25479.11683
Code-Mixing as a Marker of Gender Identity in SMS language in Pakistaniosrjce
This research deals with the gender based code-mixing in mobile phone SMS texting in Pakistani
society. It is generally observable fact that code-mixing is used by both males and females while sending
messages through their mobile phones. This research has explored the happening of code mixing i.e. mixing of
the two varieties or two different languages, which can mark the gender differences in using text messages. In
this research paper quantitative method is used to collect and analyze the data. One thousand non-forwarded
conversational SMS were collected through random sampling of 25 males and 25 females with an equal
quantity of sample from both counter parts. The findings of the study have shown that there are gender based
differences in SMS language used in Pakistan on the basis of Code-Mixing in term of frequency, nature and
topic & purpose of conversation. The findings are generalized for all males and females citizens of Pakistan.
This paper analyses the structure patterns of code-switching quantitatively and qualitatively based on EFL classroom discourse. Through the detailed analysis, the paper finds that there are different structure patterns in which teachers often switch their codes in English classroom. These structure patterns are reflected in different language levels: words and phrases level, clausal and sentence level. The functions of code-switching are determined by those structure patterns that teachers will choose for different purposes in the process of teaching.
کتیب الملخصات - المؤتمر الدولي السادس حول القضايا الراهنة للغات، علم اللغة، الترجمة و الأدب
9-10 أكتوبر 2021 ، الأهواز
لمزید من المعلومات، ﯾرﺟﯽ زﯾﺎرة ﻣوﻗﻌﻧﺎ اﻹﻟﮐﺗروﻧﻲ : WWW.LLLD.IR
لا تتردد فی مراسلتنا للاجابة عن ای استفسارات.
اللجنة المنظمة للمؤتمر،
الأهواز / الصندوق البريدی 61335-4619:
الهاتف :32931199-61 (98+)
الفاکس:32931198-61(98+)
النقال و رقم للتواصل عبر الواتس اب : 9165088772(98+)
WWW.LLLD.IR، البريد اﻹﻟﮑﺘﺮوﻧﻲ: info@pahi.ir
Corpus linguistics is the study of language through large collections of real-world texts called corpora. It is empirical, analyzing actual language usage patterns using computers. Corpora come in many types but aim to represent language broadly through balanced samples of different genres. Building a corpus involves collecting, storing, and organizing electronic text samples. Basic analyses of corpora can reveal word frequencies and linguistic patterns through tools like concordances.
The document discusses corpus linguistics and different types of corpora. It defines corpus linguistics as the study of language based on large collections of electronic texts, known as corpora. It describes general corpora, specialized corpora, historical/diachronic corpora, regional corpora, learner corpora, multilingual corpora, comparable corpora, and parallel corpora. It also discusses corpus annotation, concordancing, frequency and keyword lists, collocation, and software used for corpus analysis.
This document outlines the corpus research method for analyzing the use of the present perfect versus the simple past in American English. It discusses using the Corpus of Contemporary American English (COCA) to study how adverbs like "already" and "yet" influence whether the present perfect or simple past is used. The document also provides information on how to properly use COCA for linguistic research, noting the large amount of data it contains and things for researchers to be aware of like the written source of most of the language data.
The document discusses the usefulness of corpora for language teaching and learning. It explains that corpora allow researchers to make generalizations about language as a whole by analyzing large collections of authentic texts. In contrast to relying only on intuitions, corpora provide evidence of real language usage through numerous examples from different contexts. This helps address misconceptions and test assumptions. The document also provides guidance on developing classroom corpora and considerations for corpus design and effective use in materials and investigations.
Corpus linguistics is the study of language based on large collections of real-world language samples stored electronically. It allows for reliable, accurate, and replicable analysis of language at a large scale and in new ways not previously possible. A corpus is a large collection of written or spoken language samples that is stored electronically and can be analyzed using specialized software. Corpus linguistics provides insights into language usage that were previously difficult to obtain at a large scale through computer-assisted analysis of large text collections.
Introductory lecture on Corpus Linguistics. Contents: Corpus linguistics: past and present, What is a corpus?, Why use computers to study language? Corpus-based vs. Intuition-based approach, Theory vs. Methodology.
This lecture was based on McEnery et al. 2006. Corpus-based Language Studies. An Advanced resource book. Routlege.
Applying Corpus-Based Findings To Form-Focused Instruction The Case Of Repor...Justin Knight
The document summarizes a study that compared how reported speech is presented in English language textbooks versus how it is used in real language based on corpus data. The study found that textbooks focus mainly on indirect reported speech and backshifting verbs to past tense, but neglect many structures used in natural language. Corpus data from two studies of reported speech was then used to identify 10 principles for how it should be taught. The document proposes applying these corpus-based findings to design classroom activities and materials for teaching reported speech that are grounded in theories of second language acquisition.
UNIVERSALITY IN TRANSLATION: AN ANALYSIS OF TRANSLATION INTERFERENCE IN MULTI...John1Lorcan
Universality in translation has been a research focus in translation studies since the publication of Mona
Baker’s seminal paper in 1990s. The relevant studies mainly explore the various universality rules
governing translation in written texts, however, little attention has been devoted to translation universality
in translating multi-modal texts. This study, with Systemic Functional Grammar and Visual Grammar as
theoretical framework, verifies the validity of Law of Interference, one of the universal translation rules, in
multimodal texts. By statistically comparing transitivity, theme choice, thematic progression and
information value of two magazine articles and their translations, the paper finds that most make-up of the
source text is transferred into the target text or rendered into existing patterns, and that more negative
transfers are observed in transitivity, split theme TP and information value when translating from a major
language. The paper concludes that the Law of Interference is stronger in visual mode while weaker in
verbal mode. The present study shows that the law of interference can be observed in translating
multimodal texts. It also implies that the translation of multimodal texts makes more visible the power of
English, and it helps to perpetuate the hegemony of English in the global world.
This document discusses contrastive analysis as a tool for comparing two languages to identify similarities and differences. It can be used to predict difficulties for language learners by examining differences between their first language (L1) and the target second language (L2). The document outlines the basic steps of contrastive analysis, including describing the phonemic inventories and comparing sounds, syntax, and other linguistic features between L1 and L2. Contrastive analysis was an early and influential theory for predicting language learning difficulties but has limitations and has been supplemented by other approaches.
An Ethnography Of Communication Viva Voce In A Ghanaian UniversityHolly Fisher
This document summarizes a study that examined viva voce examinations at a Ghanaian university using an ethnographic approach. The study analyzed three recordings of viva voce defenses focusing on setting, participants, and act sequence as outlined in Hymes' framework for ethnography of communication. The analysis revealed that the formal setting of the viva voce highlights the professional nature of academics. It also showed that the asymmetric relationship between candidates and assessors is demonstrated through forms of address and language choices. Finally, the study found that the viva voce examinations follow four schematic structures.
کتیب الملخصات - المؤتمر الدولي السابع حول القضايا الراهنة للغات، علم اللغة، الترجمة و الأدب
12-11 يونيو 2022 ، الأهواز
لمزید من المعلومات، ﯾرﺟﯽ زﯾﺎرة ﻣوﻗﻌﻧﺎ اﻹﻟﮐﺗروﻧﻲ : WWW.LLLD.IR
لا تتردد فی مراسلتنا للاجابة عن ای استفسارات.
اللجنة المنظمة للمؤتمر،
الأهواز / الصندوق البريدی 61335-4619:
الهاتف :32931199-61 (98+)
الفاکس:32931198-61(98+)
النقال و رقم للتواصل عبر الواتس اب : 9165088772(98+)
WWW.LLLD.IR، البريد اﻹﻟﮑﺘﺮوﻧﻲ: info@pahi.ir
Book of Abstracts - The Seventh International Conference on Languages, Linguistics, Translation and Literature
11-12 June 2022 , Ahwaz
For more information, please visit the conference website:
WWW.LLLD.IR
Please feel free to write if there is any query.
The Conference Secretariat,
Ahwaz 61335-4619 Iran
(+98) 61-32931199
(+98) 61-32931198
(+98) 916-5088772 (WhatsApp Number)
WWW.LLLD.IR, Email: info@pahi.ir
1. Contrastive analysis aimed to compare languages to predict learners' errors and difficulties, and help design language teaching.
2. While initially considered the ultimate solution, controversies emerged regarding its predictions and claims.
3. Three versions developed - the strong version predicted all errors based on differences, the weak version used differences to explain errors, and the moderate version rejected strong and weak claims.
1. Studies of multi-word units using corpus data have shown that language is highly phraseological in nature, with the majority of texts composed of common words following common patterns.
2. Research into n-grams and other multi-word units has enhanced our understanding of language, showing that most everyday words do not have independent meanings but rather contribute to the meaning of larger lexical units.
3. The study of multi-word units has implications for applied fields like lexicography, language teaching, and English for academic/specific purposes by providing insights into typical phraseologies used in different contexts and genres.
The Input Learner Learners Forward Throughout...Tiffany Sandoval
This document provides an analysis of Robert Frost's poem "Stopping by Woods on a Snowy Evening" through a linguistic and stylistic lens. It introduces stylistics as the study of appropriate language use and style in writing. The analysis will examine Frost's style and how it shapes the interpretation of the poem. It describes Frost as an American poet known for his philosophical poetry dealing with existential questions about life, death, and humanity's place in the universe. The analysis will observe Frost's style in this particular poem.
Analysis Of Argument Structures Inquiries Into Effective WritingCarrie Romero
The document summarizes a study that analyzed the argument structures in essays written by Japanese university students. The study found that most students' arguments predominantly used justificatory structures, presenting data and evidence to support their central claim. However, few students employed more complex argument structures like warrants or rebuttals. This suggests L2 linguistic knowledge and subject knowledge play an important role in shaping arguments. The study implies that explicitly teaching both macro and micro argument structures, as well as activities involving diverse perspectives, could help improve students' argumentative writing skills.
This document discusses corpus linguistics and its applications in semantic and pragmatic studies. It provides definitions and examples of corpus linguistics, prominent corpora that are used in research, and how corpus linguistics can be applied to study semantic prosody. The document also discusses how corpus linguistics can inform the study of semantics versus pragmatics and provides examples of studies analyzing nominal compounds, genre analysis, and other linguistic features using corpus-driven approaches.
A New Look To Research Article Abstracts (RAAs) Of Novice Academic Writers ...Dustin Pytko
This document summarizes a research article that examines the rhetorical structures and use of metadiscourse in research article abstracts written by novice EFL graduate students. It begins with an introduction that describes the purpose and importance of research article abstracts. It then reviews relevant literature on rhetorical structure models, disciplinary differences, and comparisons of native and non-native novice writers. Key findings from previous research are discussed.
This document discusses 7 approaches to discourse analysis:
1. Conversation analysis examines the structure and organization of natural conversation.
2. Ethnography analyzes language use within social and cultural contexts through observation and interviews.
3. Corpus-based analysis uses large text databases to study language patterns and variations in genres.
4. Multimodal analysis views communication as involving multiple modes beyond just language.
5. Genre analysis describes conventional language patterns associated with academic and professional settings.
6. Critical discourse analysis critically examines how language relates to power and social inequality.
7. Mediated discourse analysis focuses on how social actions are carried out through discourse within cultural and historical contexts.
This document discusses corpus linguistics and quantitative research design. It defines a corpus as a large collection of texts used for linguistic analysis. Corpus linguistics allows researchers to empirically test hypotheses about language patterns and features based on large amounts of real-world data. Quantitative analysis of corpus data shows how frequently certain words, constructions, and patterns are used. Specialized corpora can focus on particular text types, languages, or learner language. Various software tools are used to analyze corpora through frequency lists, keyword lists, collocation analysis, and other methods.
The document discusses the linguistic approach to teaching, which uses a child's native language as an associative tool to help learn new words and spelling patterns. It then provides a high-level overview of the field of linguistics, including the scientific study of language form, meaning, and context. Some key areas covered include phonetics, semantics, pragmatics, grammar, and sociolinguistics.
Applied linguistics began in the 1950s with the founding of the University of Edinburgh School of Applied Linguistics in 1956 and the Center of Applied Linguistics in Washington D.C. in 1957. The British Association of Applied Linguistics was formally established in 1967 to promote the study of language use, acquisition, and teaching. Applied linguistics is concerned with the role of language in people's lives and problems associated with language use, drawing from linguistics as well as other fields like education, sociology, and anthropology. It differs from linguistics in that linguistics focuses only on describing language itself, while applied linguistics seeks to address real-world language issues.
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...Sarah Morrow
This document presents the results of a corpus-based analysis of terminology in the social sciences and humanities fields of legal science and administrative science in Indonesia. Keywords and word clusters were identified from specialized corpora in these fields to extract term candidates based on their linguistic components. Collocation analysis was then used to further examine two term candidates, "LINGKUNGAN HIDUP" and "BUDAYA PERUSAHAAN", to recognize their specific meanings based on habitually co-occurring words, fulfilling the cognitive component for terminology. This analysis demonstrated how corpus linguistics can be integrated with the communicative theory of terminology to study terminology from its linguistic and cognitive perspectives.
Corpus-Based Studies of Legal Language for Translation Purposes:Lucja Biel
This document discusses the potential of corpus-based studies as a methodology for researching legal translation and as a tool in translator training. It begins by providing background on the history of corpus linguistics and defining key terms. It then describes the main types of corpora relevant to translation studies: monolingual corpora, comparable corpora (monolingual and bilingual), and parallel corpora. The document outlines four major trajectories of corpus-based research on legal language: comparing legal language to general language; comparing legal genres; analyzing temporal variations; and analyzing cross-linguistic variations. Finally, it notes limitations of legal corpora, including availability, size, and over-representation of legislation.
A corpus driven comparative analysis of modal verbs in pakistani and british ...Alexander Decker
This document discusses a corpus-driven comparative analysis of modal verbs in Pakistani and British English fiction. It begins by introducing the study, which compiled corpora of 1 million words each from Pakistani English fiction (PEF) and British English fiction (BEF). Part-of-speech tagging was performed on both corpora using CLAWS tagging, and concordance lines of modal verbs were manually explored using Antconc software. The study aims to identify differences in modal verb usage between PEF and BEF and provide stylistic interpretations. A literature review covers previous research on modal verb classification and analyses of modal verbs in different varieties of English. The methodology explains that both qualitative and quantitative methods are used, including corpus compilation, POS tagging, and concord
Similar to Differences in Frequencies between Linking Verbs and Relative Pronouns in Written Language (20)
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
20 Comprehensive Checklist of Designing and Developing a WebsitePixlogix Infotech
Dive into the world of Website Designing and Developing with Pixlogix! Looking to create a stunning online presence? Look no further! Our comprehensive checklist covers everything you need to know to craft a website that stands out. From user-friendly design to seamless functionality, we've got you covered. Don't miss out on this invaluable resource! Check out our checklist now at Pixlogix and start your journey towards a captivating online presence today.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Building RAG with self-deployed Milvus vector database and Snowpark Container...Zilliz
This talk will give hands-on advice on building RAG applications with an open-source Milvus database deployed as a docker container. We will also introduce the integration of Milvus with Snowpark Container Services.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Securing your Kubernetes cluster_ a step-by-step guide to success !
Differences in Frequencies between Linking Verbs and Relative Pronouns in Written Language
1. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 34
Differences in Frequencies between Linking Verbs and Relative
Pronouns in Written Language
Thanaa Alhabuobi thamar_5@hotmail.com
Arts & Humanities/ Language & Translation
Taibah University
Almadinah Almunawarrah/ 344/ Saudi Arabia
Abstract
This study was conducted primarily to investigate two areas of linguistic analysis. The study
aimed to analyze two types of associations: lexical-grammatical associations and grammatical-
grammatical associations. The method used in this study was descriptive and analytic. With
regard to lexical-grammatical associations, four linking verbs were selected to explore differences
among them with respect to their occurrences in the selected texts. Concerning grammatical-
grammatical associations, two relative pronouns were selected to identify differences among
them with respect to their occurrences in the selected texts. The pronouns were specifically
selected in terms of their use as a subject in essential (restrictive) adjective clauses. For
purposes of the current research, seventeen articles were selected from Internet databases. The
topics of all articles involved health-life problems. The Simple Concordancer Program was
applied to find collocations and frequencies of the target words. The study was delimited to four
linking verbs: be, seem, look, and feel. It was also delimited to two relative pronouns used as a
subject in restrictive adjective clauses. The findings of the study revealed no significant
differences between occurrences of the linking verbs. However, the verb (be) occurred relatively
more frequently than (seem, look, and feel). On the other hand, the verb (look) was the least
frequent verb among all verbs. The findings also showed significant differences between
occurrences of the subject relative pronouns. More specifically, the pronoun (who) occurred more
frequently than the pronoun (that).
Keywords: Linguistics, Computer, Concordances, Linking Verbs, Pronouns.
1. INTRODUCTION
There have been an enormous number of language studies that explore the nature of linguistic
data. Perhaps, this work was to explore why certain patterns are common while others are not.
However, most studies emphasized the structure of language. In addition, these studies handled
a relatively limited number of corpora. Later, the idea of investigating linguistic data and what
relates to its associations became different. The impact of technology and the use of the
extended corpus-linguistics led to a tangible change in areas of linguistic research. Given these
factors, many new aspects appeared and became plausible and more substantial than the old
ones. The relationship between genre and the use of language patterns became worthy of being
studied as a means of exploring which patterns are more common in fiction work than in non-
fiction work. Another example involves identifying differences between written forms and spoken
forms in terms of grammar, lexicon, pragmatics, and semantics. The invention of the computer
also contributed to this matter; previously, this labor usually took a lot of time and did not produce
reasonably accurate results. Nowadays, it has become faster, easier, and more accurate with the
use of computational linguistics.
1.1 Statement of the Problem
Investigating various associations that occur in language structures became a modern approach
to meet the new trend in linguistic research. These associations are of different types and present
a rich field upon which linguists can explore the characteristics of both language use and
2. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 35
language structure (Biber, et al., 1998). The impact of computer use opens the door wide for
corpus linguistics to be an applicable approach and an ample device for identifying and analyzing
various associations across language structures. Consequently, the analyses of those
associations are needed to identify different language uses, generalize rules, and explain the
nature of language in terms of applied and theoretical linguistics. The integrative approach that
involves corpus linguistics and computational linguistics is necessary for that purpose. Thus,
there is a growing need to study and analyze lexical-grammatical associations to identify which
lexis, in particular, occurs with which pattern more frequently; this is to serve both grammar and
lexicon. In addition, it is important to analyze and study grammatical-grammatical associations to
identify the relationships that arise when certain patterns are combined in terms of
grammaticality, ungrammaticality, function, and use.
1.2 Purpose of the Study
The main aim of this study was to explore and analyze language structures in terms of two
associations: lexical-grammatical associations and grammatical-grammatical associations.
Concerning lexical-grammatical associations, the study aimed to explore the differences in
frequencies among four English linking verbs (be, seem, look, feel). In other words, it aimed to
identify which verb occurred more frequently. Concerning grammatical-grammatical associations,
the study aimed to discover differences in frequencies between two relative pronouns used as a
subject in essential (restrictive) adjective clauses. In other words, the study attempted to find
which pronoun occurred more frequently.
1.3 Questions of the Study
In light of the study’s purpose, the following study questions were generated:
1- To what extent do the linking verbs differ in their occurrences in the selected textual
structures?
a. What is the most frequent linking verb occurring in the selected textual
structures?
b. What is the least frequent linking verb occurring in the selected textual
structures?
2- To what extent do the subject relative pronouns differ in their occurrences in the selected
textual structures?
a. What is the most frequent subject relative pronoun occurring in the selected
textual structures?
b. What is the least frequent subject relative pronoun occurring in the selected
textual structures?
1.4 Delimitations of the Study
This study was delimited to two types of linguistic analysis: lexical-grammatical analysis and
grammatical-grammatical analysis. The lexical-grammatical analysis involved four English linking
verbs: be, seem, feel, and look. The linking verbs are supposed to be followed immediately by
adjectives. The verbs were investigated in all their possible forms. The second type was
grammatical-grammatical associations. This analysis involved two relative pronouns: who and
that. These two relative pronouns were limited to their use for people and as a subject of the
dependent adjective clauses in their restrictive (essential) type. The subject of the texts was
delimited to health issues and all materials were electronically achieved. The study was also
delimited to written form patterns in one restricted topic concerning health-life problems.
2. REVIEW OF LITERATURE
2.1 Introduction
This section deals with the review of literature related to the topic of the current research paper.
The review sought to shed light on the concept of corpus linguistics and to provide a brief
presentation of its historical background. Then the relationship between the corpus-based
approach and the use of computers was discussed in light of the importance of that relationship in
3. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 36
the field of linguistic research. Afterward, the role of concordances and the issues attached to it,
such as collocations and KWIC, were investigated. Two main issues were also discussed in this
section because they contributed to purposes of the current study; they were lexical-grammatical
analysis and grammatical-grammatical analysis. Concerning lexical-grammatical analysis, linking
verbs were discussed and elaborated. At the level of grammatical-grammatical analysis, adjective
clauses were explained and identified.
2.2 Corpus Linguistics between Concept and History
There cannot be one ideal paradigm through which linguistics to different languages is
undertaken as an axiom. Over centuries, endeavors never stopped, nor did they integrate to
achieve consensus. Thus, decades of "studies and investigations" always result in newer trends
in linguistics, particularly at two main levels: structure and use. Previously, pioneers in the field of
language were concerned mainly with studying the surface level of language—in other words,
how small units combine to form larger units with regard to constituent class. Then these
associations are judged in terms of grammaticality and ungrammaticality.
Recently, there has been a strong shift toward a new trend in language studies, (Biber, et al.,
1998). The new approach to linguistics is to use corpora to analyze associations at various levels:
lexical-lexical, lexical-grammatical, grammatical-grammatical, and linguistic-non linguistic.
Particularly worthy of note, concepts such as function and use were focused on rather than
structures and rules. More importantly, brand-new areas have been investigated as factors
affecting language use, such as register, genre, and dialects. For example, the language of fiction
is compared to the language of non-fiction to explore the distinctive characteristics of both uses.
Traditionally, a large number of investigations were dedicated to language analyses. The
analyses were usually done manually and were devoted to issues of the classical approach to
language study, as mentioned above. Of course, the compilation of corpora was neither
qualitative nor quantitative enough, as there were no computerized devices. Moreover, the labor
of analysis itself was greatly time-consuming and the results were relatively inaccurate. In other
words, the use of corpora is not a new method for language studies, but was limited and
incomprehensive with regard to accuracy, variety, language function, qualitative vs. qualitative
nature, and generalizations.
This leads us to consider what is meant by corpus linguistics. McEnery, et al. (2006) and Samson
(1995:4) define corpus as "a body of naturally occurring language”, So it seems, as asserted by
Wisniewski (2007:1), that corpora refers to a language in its two forms—written and spoken—that
involve millions of words. Chen (2004) supports the same idea and adds that corpora are
authentic and electronic language databases that can be downloaded from the Internet or set on
desktops as software. Gries (2009) postulates that corpus linguistics is a controversial notion and
can be thought of as a method, model, and/or theory. Moreover, a corpus is thought of as a
compilation of more than one text used for empirical linguistics (Mcenery & Wilson, 1996). On the
other hand, Meyer (2002) claims that corpora are texts or part of texts; consequently, they are
used in relation to the functional approach to language analysis.
2.3 Computer and Corpus-Based Approach in Linguistic Research
The era of technology evolution yielded a pivotal revolution in the field of computer sciences. It
was to be expected that this development would affect all fields, especially linguistic research.
Therefore, specialists described a very important onset years ago through which the use of
computers became essential in analytical investigations. This resulted in what we today call the
corpus-based approach to linguistic research. In the present time, as maintained by McEnery,
et.al, (2006) and Samson (1995), the corpus-based approach implicitly involves the use of
computers widely and effectively because of the ample characteristics that computer programs
have. Thus, as a result, an underlying integration occurred between the corpus-based approach
and computational linguistics to provide more respective results of analyses with regard to many
factors, such as representativeness, cost, accuracy, and generalizations.
4. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 37
Thus, the corpus-based approach and computational linguistics NLP are broadly dedicated to
studying the characteristics of grammatical rules, language function, pragmatics, language
teaching (Chang & Chang, 2004), language learning and acquisition, and discourse analyses,
and are also devoted to lexicography development (Mcenery & Wilson, 1996). Greenbaum (1996)
investigated the impact of computer use on corpus linguistics with regard to students’ learning,
specifically, the use of phrasal lexical. He maintained that software is very effective in enabling
the analyst to determine the associations in a learner's writing. He also asserted that the use of
computers in relation to corpus linguistics provided rich statistical measures regarding
quantitative characteristics.
Another framework, which was presented by Moon (1998), ensured the change in lexicology and
lexicography by using a large number of corpora and corpus linguistics. The most important piece
of information that can be obtained from the corpus linguistics approach is word frequency with
regard to lexical associations. This leads us to the importance of concordances that are used to
derive collocations. On the other hand, collocations—word associations in context—are an
essential method of counting the commonsense occurrences of a word across texts. In this
regard, Moon (1998) claimed that the collocation-based approach of corpora is a promising
device used to detect lexical behaviors.
In sum, the integrative approach that involves computational linguistics and corpus linguistics is
regarded as an ideal axiom of the analytical model to language investigations. In this regard, this
approach suits the two main branches of linguistics, i.e., theoretical linguistics and applied
linguistics. Consequently, we can say that the use of computers is greatly sufficient in the field of
lexical-grammatical associations and grammatical-grammatical associations, which is the main
aim of our current research.
2.4 Concordances
The previous discussion stated a claim that corpus linguistics plays a pivotal role in language
investigations with the help of NLP. Also, we stated that corpus or corpora comprise a body of
language and collections of a huge number of spoken and written patterns that are compiled and
classified according to certain regulations and not haphazardly. In other words, today, we can find
types of corpora such as learner corpora, diachronic corpora, general corpora, written corpora,
etc. (Greenbaum, 1996).
Now, corpus or corpora provide us with unlimited numbers of concordances. Concordances are
defined as lists of words in a context that means that a word appears in combination with other
words in lines. This combination is called collocation and is defined by Wouden (1997:5) as
follows: "COLLOCATION is a term that refers to the mechanism, or fact, that certain words are
regularly found in the company of other words”, For example, certain verbs are used with certain
nouns; we say "make a decision" not "take a decision". Consequently, KWIC—a key word in a
context—is an important device used to benefit from collocations to analyze the characteristics of
different language associations (Renouf & Kehoe, 2006).
2.5 Areas of the Use of Concordances
2.5.1 Analysis of Lexical-Grammatical Associations
The relationship between lexis and grammar is worth being studied. In this respect, Schonefeld
(2001) stated that although lexicon and grammar are two aspects of a language, and some
emphasize lexicon while others emphasize grammar, the interface between them remains a
plausible notion.
However, as mentioned above, the corpus-based approach in relation to computational linguistics
provides a paradigm through which to investigate many aspects of language associations. More
specifically, specialists in this field may make use of certain software to analyze lexical-
grammatical associations by the means of collocations and KWICs. This refers to the exploration
of the relationship between grammar and lexis. In other words, this type of analysis helps in
identifying the occurrences of certain grammar with certain vocabulary. For example, Biber, et al.
5. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 38
(1998) argue that two words (little, small) may appear differently with respect to collocations even
though they have almost the same meaning. This means that the two words have distinctive
grammatical features. The former is used with mass nouns, whereas the latter is used with count
nouns. Consequently, the use of a key word within collocations is a device that helps the analyst
identify grammatical properties and the relationship between lexis and grammar.
2.5.1.1 Linking Verbs
Talking about verbs means to deal with word classes such as nouns, verbs, adjectives, etc. Verbs
are also called lexical words or contentives. They refer to "doing" because they stand for the
process in meaning (Freeborn, 1994). Verbs are classified into various categories: tensed (finite),
non-tensed (non-finite), regular, irregular, transitive, and intransitive. Often, verbs—if transitive—
are followed immediately by NP object, while if they are intransitive, they are followed by
adjuncts, i.e., PP or AP. The English language has what is called a linking verb, or copulative
verb, which, as stated by Leech (2006), is a predicate that is immediately followed by an adjective
and that might appear in another fashion when followed by any complement.
The English language has 12 linking verbs: be, feel, look, smell, sound, taste, appear, seem,
become, get, turn, and grow. For instance, one may say "The weather became warmer", "Do you
still feel hungry?" and "The future is looking good". It is worth noting that the verbs "look" and
"appear" mean "seem" in this case and that the verbs "get”, "turn”, and "grow" mean "become"
(Azar, 2002).
2.5.2 Analysis of Grammatical-Grammatical Associations
Grammatical-grammatical association is one of the richest areas through which corpus linguistics
can present generalized patterns in relation to different language uses. In addition, this type of
analysis sheds light on the principles that govern structures in combination. The contribution of
corpus linguistics is clear regarding grammaticality and ungrammaticality counted for native
speakers' production of language, whether in its written or spoken form. In this regard, McEnery,
et al. (2006) stated that corpora have a strong influence on recently published references of
grammar. He also maintained that corpus-based grammar can explore the distinctive features
that exist between spoken patterns and written patterns using registers.
The combination of corpus linguistics and computational linguistics as stated by Sampson (1995)
deals with a rich analysis to identify properties of surface grammar, i.e., word classes and
patterns, and the features of logical grammar in relation to the semantic aspect. An important
issue related to this aspect was discussed by Renouf (2006), who established a correlation
between the use of corpus linguistics and grammatical issues in terms of the characteristics of a
reference grammar and corpus linguistics methodology relevant to writing grammar and corpus
linguistics research. Several features can be achieved through an analysis of corpora, whether
written forms or spoken forms. For example, this type of analysis may help in identifying patterns,
such as dependent clauses, within structures—in other words, what phrases precede and what
phrases follow the dependent clause in a construction.
2.5.2.1 Adjective Clauses
As defined by Todd (1987:63), a clause "is a group of words which contains a finite verb”,
Clauses in general are of several types; they are classified mainly into independent clauses,
which are sequences of words that can stand by themselves as separate sentences, and
dependent clauses, which are considered a part of a sentence in the sense that they can't stand
in isolation. Dependent clauses are of three types: adjective clauses, noun clauses, and adverb
clauses (Azar, 2002).
Adjective clauses usually modify and describe a noun that is mentioned in the higher position in a
sentence by providing further information; they are also called relative clauses (Traugott & Pratt,
1980). The adjective clause is combined with the main clause, independent, by relative pronouns:
who, that, which, whom, whose, where, and when.
6. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 39
However, relative pronouns have various functions; for instance, who, that, and which can be
used as subjects of the adjective clause: who is used for people, which is used for things, and
that is used for both people and things. Furthermore, the adjective clause is classified into two
additional sub-categories with respect to necessity.
If the adjective clause is unnecessary in the sense that it adds additional information only about
the noun it describes, commas are used and the clause can be omitted. It is called non-essential
or non-restrictive; an equivalent term is "parenthetical". Non-essential clauses permit all pronouns
except that.
In contrast, if the adjective clause is necessary to the noun it identifies, no commas are used, and
it cannot be omitted. This type of clause is called essential or restrictive; an equivalent term is
"contrastive". When an essential clause exits, any possible pronoun may be used.
3. METHODOLOGY AND PROCEDURE
Introduction
This research paper aims to explore the lexical-grammatical associations and grammatical-
grammatical associations using the corpus-based approach. The objective of this research is to
find the frequency of four linking verbs in the English language: be, seem, look, and feel. It also
aims to identify which of those verbs is more common. The second aim of the current research is
to discover the frequency of two relative pronouns used in dependent adjective clauses—who
and that—and to determine which pronoun is more frequent.
3.1 Research Design
This study is descriptive and analytic. Throughout this research, lexical-grammatical associations
and grammatical-grammatical associations were investigated using the corpus-based approach.
Concerning lexical-grammatical associations, three linking verbs were investigated with respect to
frequent occurrences in the texts. On the other hand, grammatical-grammatical associations were
investigated with regard to two relative pronouns used as a subject in dependent adjective
clauses. Frequencies of all target words were determined to explore common occurrences.
3.2 Data Sources
The study data were collected from various sources. Data concerning the review of literature was
collected from electronic sources and publications of books and references. The researcher also
made use of the course-pack and lectures presented by Professor Mahmoud Saleh. Electronic
materials were a helpful source concerning text compilation. Data concerning textual analysis
were collected from the Internet using the downloading technique; then all texts were saved and
documented by name in folders.
3.3 Study Samples
The samples of the current study involved 17 articles, papers, and investigations. All materials
were electronic in the sense that they were obtained from Internet websites and then compiled as
software. All samples were of one type, i.e., health problems and lifestyle. They were collected
from newspapers and various health institutions’ websites. The following table illustrates the
samples.
N Article N W W N V P N T P
1 BBC News 476 209 454
2 Canada Free Press 738 346 741
3 Daily Mirror 260 139 240
4 Health MSN 936 442 927
5 Lancaster Guardian 1138 421 1188
6 Mental Health Anxiety 1755 653 1736
7 Msn Lifestyle 672 293 653
8 National Health Service 419 199 417
9 National Inst. Child 889 371 869
7. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 40
10 New York Times 1175 448 11211342111
11 Problems of Life -Fear 635 266 615
12 Problems of Life-Guilt 1239 410 1228
13 The Associated Press 830 422 809
14 The Guardian 1002 429 994
15 The Independent 1021 378 907
16 The Telegraph 568 223 486
17 Times on Line 6011 1042 2602
Total 19764 6691 16000
TABLE 1: Samples of the study.
N W W = Number of words in Word Processing Program
NVP = Number of Vocabulary in Plain text
NTP = Number of Tokens in Plain Text
3.4 Instrumentation and Procedures
Purposes of the current research paper is to find the frequency of two types of associations: 1)
lexical-grammatical associations, by finding the higher frequency among four linking verbs: be,
look, feel, and seem, and 2) grammatical-grammatical associations. This purpose involved two
relative pronouns (who, that) used primarily as a subject in dependent adjective clauses.
The researcher used a software program called SCP (Simple Concordancer Program) to find the
frequencies of the research targets.
First Phase
The first phase was the compilation of the data. For purposes of this study, 17 electronic sources
were searched to obtain written texts in the field of health. The materials averaged between 150
and 1000 words. The overall number of words of the entirety of the materials was 19,764.
Second Phase
After compilation of the texts, the data were manipulated and converted to Plain Text. The
conversion process was carried out to enable the SCP to deal with the texts.
Third Phase
In this phase, the researcher dealt with the materials one by one to count the frequencies of:
1-The linking verb (be) in its seven forms: is, was, are, were, be, been, being.
2-The linking verb (seem) in its four forms: seem, seems, seemed, seeming.
3-The linking verb (look) in its four forms: look, looks, looked, looking.
4-The linking verb (feel) in its four forms: feel, feels, felt, feeling.
5-The relative pronoun (who) as a subject of the adjective clause.
6-The relative pronoun (that) as a subject of the adjective clause.
To achieve the above objectives, the researcher used the option (word) and KWIC available in
the program. Then the project statistics of each material were found. This research adopted raw
count type to extract the word frequency.
3.5 Data Analysis and Statistics
The current study utilized the following statistical methods to analyze the data:
The Simple Concordancer Program
Collocations
KWICS
Descriptive Statistics:
1. Frequencies
2. Averages
8. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 41
4. RESULTS AND DISCUSSION
4.1 Introduction
The current research aimed to identify the two types of associations. The first type was lexical-
grammatical associations and the second type was grammatical-grammatical associations.
Concerning the first type, the study aimed to explore the frequencies of the four linking verbs (be,
seem, look, feel) by counting their total occurrences in the entirety of the texts. The second type
was concerned with exploring the frequencies of the two subject pronouns (who and that) used as
a subject in restrictive adjective clauses. Thus, this section was divided into two sub-sections.
The first sub-section dealt with lexical-grammatical associations, and the second sub-section
dealt with grammatical-grammatical associations.
4.2 Section (A) Frequency of Linking verbs
Article N W W N V P N T P N (be)
BBC News 476 209 454 9
Canada Free Press 738 346 741 9
Daily Mirror 260 139 240 2
Health MSN 936 442 927 10
Lancaster Guardian 1138 421 1188 18
Mental Health Anxiety 1755 653 1736 29
Msn Lifestyle 672 293 653 6
National Health Service 419 199 417 10
National Inst. Child 889 371 869 11
New York Times 1175 448 11211342111 15
Problems of Life -Fear 635 266 615 12
Problems of Life-Guilt 1239 410 1228 28
The Associated Press 830 422 809 5
The Guardian 1002 429 994 13
The Independent 1021 378 907 12
The Telegraph 568 223 486 2
Times on Line 6011 1042 2602 28
Total 19764 6691 16000 219
TABLE 2: Frequency of Verb (Be).
The results in table (2) indicate the distribution of verb (be) among the whole of articles. The table
shows that the overall number of verb (be) was 219. More specifically, verb (be) represents 3.27
% of the total number of NVP.
Article N W W N V P N T P N (seem)
BBC News 476 209 454 0
Canada Free Press 738 346 741 0
Daily Mirror 260 139 240 0
Health MSN 936 442 927 1
Lancaster Guardian 1138 421 1188 1
Mental Health Anxiety 1755 653 1736 1
Msn Lifestyle 672 293 653 0
National Health Service 419 199 417 0
National Inst. Child 889 371 869 0
New York Times 1175 448 1134 0
Problems of Life -Fear 635 266 615 3
Problems of Life-Guilt 1239 410 1228 0
The Associated Press 830 422 809 0
The Guardian 1002 429 994 0
The Independent 1021 378 907 0
The Telegraph 568 223 486 0
Times on Line 6011 1042 2602 3
Total 19764 6691 16000 9
TABLE 3: Frequency of Verb (Seem).
9. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 42
As indicated in table (3), the linking verb (seem) represents 0.13 % of the overall number of the
NVP. The distribution of the verb among the texts achieved (9) as a total number of the frequency
of the verb.
Article N W W N V P N T P N (Look)
BBC News 476 209 454 0
Canada Free Press 738 346 741 0
Daily Mirror 260 139 240 0
Health MSN 936 442 927 0
Lancaster Guardian 1138 421 1188 0
Mental Health Anxiety 1755 653 1736 0
Msn Lifestyle 672 293 653 0
National Health Service 419 199 417 0
National Inst. Child 889 371 869 0
New York Times 1175 448 1134 0
Problems of Life -Fear 635 266 615 0
Problems of Life-Guilt 1239 410 1228 0
The Associated Press 830 422 809 0
The Guardian 1002 429 994 1
The Independent 1021 378 907 0
The Telegraph 568 223 486 0
Times on Line 6011 1042 2602 1
Total 19764 6691 16000 2
TABLE 4: Frequency of Verb (Look).
According to table (4), the frequency of the verb (look) in all texts was only (2). This indicates that
the distribution of the verb among the whole texts is 0.02% with regard to NVP.
Article N W W N V P N T P N (Feel)
BBC News 476 209 454 0
Canada Free Press 738 346 741 0
Daily Mirror 260 139 240 0
Health MSN 936 442 927 0
Lancaster Guardian 1138 421 1188 0
Mental Health Anxiety 1755 653 1736 5
Msn Lifestyle 672 293 653 0
National Health Service 419 199 417 0
National Inst. Child 889 371 869 0
New York Times 1175 448 1134 0
Problems of Life -Fear 635 266 615 2
Problems of Life-Guilt 1239 410 1228 6
The Associated Press 830 422 809 0
The Guardian 1002 429 994 2
The Independent 1021 378 907 0
The Telegraph 568 223 486 0
Times on Line 6011 1042 2602 1
Total 19764 6691 16000 16
TABLE 5: Frequency of Verb (Feel).
The results in table (5) indicate the distribution of the verb (feel) among the whole of articles. The
table shows that the overall number of verb (feel) was 16. More specifically, the verb (feel)
represents 0.23 % of the total number of NVP.
10. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 43
4.3 Section (B) Frequency of Relative Pronouns
Article N W W N V P N T P N (who)
BBC News 476 209 454 8
Canada Free Press 738 346 741 3
Daily Mirror 260 139 240 0
Health MSN 936 442 927 3
Lancaster Guardian 1138 421 1188 3
Mental Health Anxiety 1755 653 1736 2
Msn Lifestyle 672 293 653 0
National Health Service 419 199 417 1
National Inst. Child 889 371 869 14
New York Times 1175 448 11211342111 6
Problems of Life -Fear 635 266 615 1
Problems of Life-Guilt 1239 410 1228 5
The Associated Press 830 422 809 3
The Guardian 1002 429 994 4
The Independent 1021 378 907 5
The Telegraph 568 223 486 5
Times on Line 6011 1042 2602 7
Total 19764 6691 16000 70
TABLE 6: Frequency of Pronoun (who).
Data in table (6) illustrate that the total number of the relative pronoun (who) is 70.
Article N W W N V P N T P N (that)
BBC News 476 209 454 0
Canada Free Press 738 346 741 0
Daily Mirror 260 139 240 0
Health MSN 936 442 927 2
Lancaster Guardian 1138 421 1188 1
Mental Health Anxiety 1755 653 1736 11
Msn Lifestyle 672 293 653 0
National Health Service 419 199 417 1
National Inst. Child 889 371 869 1
New York Times 1175 448 11211342111 5
Problems of Life -Fear 635 266 615 3
Problems of Life-Guilt 1239 410 1228 1
The Associated Press 830 422 809 0
The Guardian 1002 429 994 1
The Independent 1021 378 907 0
The Telegraph 568 223 486 0
Times on Line 6011 1042 2602 0
Total 19764 6691 16000 26
TABLE 7: Frequency of Pronoun (that).
The results in the above table indicate that the pronoun (that) is used 26 times in total.
5. FINDINGS OF THE STUDY
The results will be discussed according to the two types of associations investigated throughout
this research paper.
5.2 Findings Concerning Lexical-Grammatical Associations
The tables displayed in this section reveal that the whole number of the selected texts was 6691
via types in the plain text. The frequency of the verb (be) as a linking verb indicated that it
represented 3.27% of the total average. The results for the other linking verbs were as follows:
11. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 44
seem represented 0.13%, feel represented 0.23%, and look represented 0.02%. These results
relatively indicated that the use of (be) as a linking verb across lexical-grammatical associations
was more frequent as compared to the other linking verbs. More specifically, the difference
among all verbs is not of a high value; in other words, the average of all verbs, especially (look,
seem, and feel), is convergent. However, the verb (look) was the least frequent among all verbs.
5.3 Findings Concerning Grammatical-Grammatical Associations
The results showed that the frequency of the subject pronoun (who) was 70, whereas the
frequency of the subject pronoun (that) was 26. The results indicated a significant difference
between the uses of the two subject pronouns in relati ve clauses. In other words, the findings
showed that the pronoun (who) occurred more frequently than the pronoun (that) in essential
(restrictive) adjective clauses.
5.4 Summary and Conclusions
The current research paper investigated two areas of linguistic analysis. There were two main
purposes for developing this paper. The first purpose was related to the analysis of lexical-
grammatical associations. In this respect, the frequencies of four linking verbs were identified to
explore differences among these verbs with respect to their occurrences in the texts. The verbs
selected for this research were be, seem, look, and feel. All forms of the verbs were investigated
for an exploration of their frequency.
The second purpose of this study was to analyze grammatical-grammatical associations. Thus,
two relative pronouns used to introduce adjective clauses were investigated. In this respect, the
frequencies of the selected pronouns were identified to explore differences among them. The
purpose was to find which pronoun occurred more frequently than the other one. The adjective
clauses were delimited to the essential (restrictive type), and the pronouns were delimited to their
use as a subject of the relative clause.
Two approaches were used for purposes of this research: the corpus-based approach and the
computational linguistics approach. Thereby, the Simple Concordancer Program was used to
identify the differences in frequencies using collocations and KWICS. Two essential research
questions were developed for purposes of this study. 1- To what extent do the linking verbs differ
in their occurrences in the selected textual structures? 2- To what extent do the subject relative
pronouns differ in their occurrences in the selected textual structures?
The findings concerning lexical-grammatical associations indicated that there were no significant
differences between the occurrenc es of the linking verbs. However, the verb (be) occurred
relatively more frequently than (seem, look, and feel). On the other hand, the verb (look) was the
least frequent verb among all verbs. The findings concerning grammatical-grammatical
associations indicated significant differences between the occurrences of the subject relative
pronouns. More specifically, the pronoun (who) occurred more frequently than did the pronoun
(that).
The current research could shed light on the linguistic structures as a writing style used by
different newspapers with respect to two linguistic aspects: lexical-grammatical and grammatical-
grammatical associations. Nevertheless, it would be very beneficial to approach some critical
comparison between the results of this research and other research findings in the same field.
However, there was no research dedicated to exploring the frequencies of the use of the linking
verbs and the relative pronouns, at least to my best knowledge. In this matter, the researcher
tried a lot of journals to find articles and/or research investigated corpus-based analysis for the
purpose of lexical-grammatical associations and grammatical-grammatical associations, of any
sort. Thus, it is strongly recommended that this research is duplicated to seek other issues. For
example, a research may be conducted to explore the reasons behind the more frequent use of
(who) than the use of (that). This research used newspapers as a source of data analysis. It
might be useful to duplicate this research taking data from spoken discourse to get insight if there
are differences between written and spoken forms.
12. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 45
6. REFERENCES
[1] Azar, B. Understanding and Using English Grammar. New York: Longman, (2002).
[2] Biber, D., Conrad, S. & Reppen, R. Corpus Linguistics: Investigating Language Structure
and Use. Cambridge: CUP, (1998).
[3] Chang, J & Chang, Y. (2010, January 21) Computer Assisted Language Learning Based on
Corpora and Natural Language Processing: The Experience of Project CANDLE. IWLeL
2004: An Interactive Workshop on Language e-Learning 15 – 23.
[4] Available: http://dspace.wul.waseda.ac.jp/dspace/bitstream/2065/1391/1/02.pdf. [2004].
[5] Chen, Y. (2004). The Use of Corpora in the Vocabulary Classroom. The Internet TESL
Journal: Vol. X, No.9. Available: http://iteslj.org/Techniques/Chen-Corpora.html [January 20,
2010].
[6] Freeborn, D. A Course Book in English Grammar. London: The Macmillan Press, (1987).
[7] Greenbaum, S. The international Corpus of English. Oxford: Clarendon Press, (1996).
[8] Gries, S. (2009). What is Corpus Linguistics. Language and Linguistics Compass: Vol.3, pp.
1-17.
Available:http://www.linguistics.ucsb.edu/faculty/stgries/research/CorpLing_Compass.pdf
[January 20, 2010]
[9] Leech, G. A Glossary of English Grammar. Edinburgh: Edinburgh University Press, (2006).
[10] Mcenery, T. & Wilson, A. Corpus Linguistics. Edinburgh: Edinburgh University Press, (1996).
[11] McEnery, T., Xiao, R., & Tono, Y. Corpus-Based Language Studies. New York: Routledge,
(2006).
[12] Meyer, CH. English Corpus Linguistics: An Introduction. England: Cambridge University
Press, (2002).
[13] Moon, R. Fixed Expressions and Idioms in English: A Corpus-Based Approach. Oxford:
Clarendon Press, (1998).
[14] Renouf, A. & Kehoe, A. The Changing Face of Corpus Linguistics. Amsterdam: Rodopi,
(2006).
[15] Samson, G. English for the Computer. Oxford: Clarendon Press, (1995).
[16] Schonefeld, D. Where Lexicon and Syntax Meet. New York. Mouton de Gruyter, (2001).
[17] Todd, L. An Introduction to Linguistics. England: Longman York Press, (1987).
[18] Traugott, E. & Pratt, M. Linguistics for Students of Literature. Florida: Harcourt Brace
Jovanovich, Inc., (1980).
[19] Wisniewski, K. (2007). Corpus Linguistics. Encyclopedia of Language and Linguistics.
Oxford: Elsevier. Available: From the World Wide Web: http://www.tlumaczenia-
angielski.info/linguistics/corpus-linguistics.htm [January 20, 2010].
[20] Wouden, T. Negative Contexts: Collocation, Polarity and Multiple Negation. London:
Routledge, (1997).
13. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 46
7. APPENDICES
Some Axioms Referring to the Results of the Concordancer Processes
Figure 1
that
10 life, previous work has shown that 95% of victims are not
17 bruises, or burns/•/Injuries that are in the shape of an
55 ./Researchers have found that boys and girls living
56 do so again. It is important that preventive measures are
68 the abuse, reassure him/her that you believe him/her, that
69 her that you believe him/her, that he/she is right to tell
70 she is right to tell you, and that he/she is not bad./•/Tell
77 on Hospital Care recommends that children who are
77 as adults. Research indicates that children without
78 suspected of abuse indicates that training the interviewer
78 correct the interviewer and that it is acceptable for the
Figure 2
who
1 As Bones Age, Who’s at Risk for Fracture?/By
4 age, this question arises: Who should be treated with
5 especially older white women, who are at highest risk of one
6 bill. Among elderly people who fracture a hip, 10 percent
12 as they were at age 30 but who have not yet become
12 lead to osteoporosis. Who among them would most
24 bone mineral density range who have the highest risk of
26 for a 60-year-old white woman who is 5 feet and 110 pounds,
27 110-pound woman had a parent who suffered a hip fracture,
32 of vitamin D daily. For those who take omega-3 fatty acids,
Figure 3
that
14 future - all the bad things that may happen to me tomorrow
16 evil spirits./* being alone - that no one cares or
24 someone to marry./* death - that old enemy./How to handle
26 the bottom of our minds, so that perhaps we don't even
29 , "Today is the tomorrow that you spent your time
33 actually need is a new mind, that thinks differently! But
38 . Particularly, do things that will help other people in
Figure 4
who
19 it is not deserved./* People who survive a bad accident or
22 . Someone said, (do you know who?), 'The truth will set you
23 - that we forgive the person who hurt us./This may be a
35 the past. But we know a man who can! What we are
39 people./It is like the man who bought a car from a garage
46 There is the story of the man who wrote to the tax office./"
14. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 47
Figure 5
that
3 healthy eating. The premise that hunger makes food look
4 studies have suggested that the so-called hunger
5 States, scientists suggest that ghrelin might also work
6 full./"What we show is that there may be situations
7 or alcohol. Zigman said that his team speculated that
7 said that his team speculated that ghrelin might also
8 could be defined, as things that make us feel better. "
8 us reorganise our memory so that we remember how to get
9 eats -- and wants to eat -- that high-calorie dessert."/
10 , they evaluated whether mice that were fully sated
10 found high-fat food over one that had only offered regular
10 bland chow. They found that when mice in this
10 strongly preferred the room that had been paired with the
11 cells in the stomach that time the release of a
11 time the release of a hormone that makes animals anticipate
12 , are the first to show that these cells, which
12 by a circadian clock that is set by mealtime
13 patterns./They also show that the hormone's release
14 hungry./The scientists show that the stomach cells release
15 ./Previous studies have shown that people given ghrelin
15 . The new research suggests that the stomach tells the
16 ."/The research also suggests that ghrelin, the only known
16 drug developers. Unlike drugs that focus on satiety, those
16 that focus on satiety, those that target ghrelin could help
16 appetite before dieters take that first bite./The third
20 ./The researchers report that around half (50.9 per
22 "./The researchers also found that both men and women in the
23 " group./The authors conclude that a combination of eating
Figure 6
is
2 life/The key to happiness is not money or education. But
2 students for 72 years, it is something that all of us,
5 on a scale of 1 to 9, where 1 is perfectly miserable and 9
5 is perfectly miserable and 9 is perfectly happy, you circle
9 of the good life. The project is one of the longest-running
10 and fluorescent lights that is littered with the detritus
15 factors which in toto is commonly interpreted as
23 . “The atmosphere of the home is one of happiness and
25 stewards,” you wrote. “It is something that has been
26 in my own psychic picture is a fuller realisation of my
35 . “Social anxiety disorder” is distinguished from shyness.
35 from shyness. Depression is defined as errors in
35 a common space. “Much of what is labelled mental illness,”
36 us immoral.”/This perspective is shaped by a long-term view.
36 at any given time, Vaillant is more like a biographer,
36 . The good news, he argues, is that diseases – and people,
15. Thanaa Alhabuobi
International Journal of Computational Linguistics (IJCL), Volume (11) : Issue (2) : 2020 48
Figure 7
was
11 late eighties – I thought it was a good time to examine
11 to examine both. To do so, I was granted unprecedented
12 told me recently, “but it was contemporary, real-time
12 many decades,” he said, “it was like looking through the
18 presidential Cabinet, and one was President. There was a
18 and one was President. There was a bestselling novelist (
23 But you recovered fully, that was your way. “I could
25 Air Force – “The whole thing was like a game,” you said –
26 on not having any. This was probably because they were
30 started the project when he was 33. He would spend the
Figure 8
seem
33 the person employing them, seem crazy to anyone else. One
Figure 9
are
10 . The files holding the data are as thick as unabridged
10 1993. All that’s missing are the IBM punch cards used
11 the roughly half still living are in their late eighties – I
25 profession. “Our lives are like the talents in the
31 (or “defence mechanisms”) are unconscious thoughts and
33 ./At the bottom of the pile are the unhealthiest, or “
33 to anyone else. One level up are the “immature” adaptations
34 intimacy./“Neurotic” defences are common in “normal” people.
35 . If we use defences well, we are deemed mentally healthy,
36 the “psychotic” adaptations are common in toddlers, and
36 the “immature” adaptations are essential in later
37 with reality that his peers are both moving towards and
Figure 10
were
18 amid the shimmering successes were darker hues. As early as
18 Bock didn’t get it. “They were normal when I picked them
23 , you played all the sports, were good to your sisters, and
24 and ideals.” Basically, they were in a swoon. They
26 and drinking. In 1951 – you were 31 – you wrote, “I think
26 was probably because they were too deeply buried and I
36 , the Grant Study men were twice as likely to use
36 ones, but in middle life they were four times as likely to
36 into old age. When they were between 50 and 75,