SlideShare a Scribd company logo
1 of 7
Download to read offline
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
WORD SENSE DISAMBIGUATION USING WSD 
SPECIFIC WORDNET OF POLYSEMY WORDS 
Udaya Raj Dhungana1, Subarna Shakya2, Kabita Baral3 and Bharat Sharma4 
1, 2, 4Department of Electronics and Computer Engineering, Central Campus, IOE, 
Tribhuvan University, Lalitpur, Nepal 
3Department of Computer Science, GBS, Lamachaur, Kaski, Nepal 
ABSTRACT 
This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy 
word based on the clue words. The related words for each sense of a polysemy word as well as single sense 
word are referred to as the clue words. The conventional WordNet organizes nouns, verbs, adjectives and 
adverbs together into sets of synonyms called synsets each expressing a different concept. In contrast to the 
structure of WordNet, we developed a new model of WordNet that organizes the different senses of 
polysemy words as well as the single sense words based on the clue words. These clue words for each sense 
of a polysemy word as well as for single sense word are used to disambiguate the correct meaning of the 
polysemy word in the given context using knowledge based Word Sense Disambiguation (WSD) algorithms. 
The clue word can be a noun, verb, adjective or adverb. 
KEYWORDS 
Word Sense Disambiguation, WordNet, Polysemy Words, Synset, Hypernymy, Context word, Clue Words 
1. INTRODUCTION 
Words that express two or more different meanings when used in different contexts are referred to 
as polysemy or multi-sense words. Every natural language contains such polysemy words in its 
vocabulary. Such polysemy words create a big problem during the translation of one natural 
language to another. To translate the correct meaning of the polysemy word, the machine must 
first know the context in which the polysemy word has been used. Only after this, the machine 
can find out the correct meaning of the word in the particular context and can translate the 
meaning of that word into the correct word in another language. The process of finding the 
correct meaning of the polysemy word using machine by analyzing the context in which the 
polysemy word has been used is referred as Word Sense Disambiguation (WSD). 
Although different methods have been tested to find the correct sense of the polysemy word, 
accuracy at satisfactory level has not been obtained yet. Among the different methods used for 
WSD, this research focused its study on the knowledge-based approach. The knowledge-based 
approaches use the resources such as dictionaries thesauri, ontology, collocation etc to 
disambiguate a word in a given context [1]. 
These days, the WordNet is becoming popular as a resource to be used in knowledge-based 
approach to disambiguate the meanings of polysemy words. WordNet is a lexical database 
developed at Princeton University for English language [2]. The WordNet organizes nouns, verbs, 
adjectives and adverbs into the groups of synonyms and describes the relationships between these 
DOI : 10.5121/ijnlc.2014.3405 73
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
synonym groups forming a semantic network among the words. After the development of English 
WordNet, many other WordNet on other languages such as Spanish WordNet, Italian WordNet, 
Hindi WordNet etc were built. These WordNet are used as one important resource to 
disambiguate the different meanings of a polysemy words in respective languages. To 
disambiguate the meaning of a polysemy word using the WordNet, the related words from synset, 
gloss and different levels of hypernym are collected from the WordNet database and these related 
words are compared to find the overlaps using different WSD algorithm. However, the collection 
of related words from synset, gloss and different level of hypernym that are taken from the 
WordNet contains only very few words that can be used to disambiguate the correct sense of a 
polysemy word in the given context [3]. This increases only the computational overhead for the 
WSD algorithm and for the system. Moreover, according to [3] as we go increasing the levels of 
hypernym to collect the related words, the sense of the words becomes more general and the two 
different senses of a polysemy word are found to have the same hypernym. This therefore creates 
another ambiguity in ambiguity. To overcome this problem, we have developed a new model to 
organize both the single sense and multi-sense words. 
74 
2. RELATED TASKS 
In 1986, Lesk Michael [4] developed an algorithm called Lesk algorithm to identify senses of 
polysemy words. He used the overlap of word definition from the Oxford Advanced Learner’s 
Dictionary of Current English (OALD) to disambiguate the word senses. 
Banerjee and Pedersen [5] adapted the original Lesk algorithm to use the lexical database 
WordNet. They used Senseval-2 word sense disambiguation exercise to evaluate their system and 
the overall accuracy was found to be 32%. To compare two glosses, they used the longest 
sequence of one or more consecutive words that occurs in both glosses. For each overlap, a square 
of the number of words in the overlap is calculated and the final score is the sum of all overlaps. 
Sinha M. et. al. [6] developed automatic WSD for Hindi language using Hindi WordNet at IIT 
Bombay. They used statistical method for determining the senses. The system could disambiguate 
the nouns only. They compared the context of the polysemy word in a sentence with the contexts 
constructed from the WordNet. They evaluated the system using the Hindi corpora provided by 
the Central Institute of Indian Languages (CIIL). The accuracy of their algorithm was found in the 
range from 40% to 70%. They used simple overlap method to determine the winner sense. 
In [7], Shrestha N. et. al. used the Lesk algorithm to disambiguate the Nepali ambiguous words. 
They modified the Lesk algorithm in such a way that only the words in the sentence without their 
synset, gloss, examples and hypernym are taken as context words. Each word in the context 
words is compared with each word in the collection of words formed by the synset, gloss, 
examples and hypernym of each sense of the target word. They did not include the synset, gloss, 
example and hypernym of the context words in the collection of context words. Moreover, the 
number of example for each sense of the target word was only one. 
In [3], Dhungana and Shakya used the adapted Lesk algorithm to disambiguate the polysemy 
word in Nepali language. The experiments performed on 348 words (including the different 
senses of 59 polysemy words and context words) with the test data containing 201 Nepali 
sentences shows the accuracy of their system to be 88.05%. This accuracy is found to be 
increased by 16.41%, in compared to the accuracy of the system developed by [7]. 
Context word refers to the word that is used in context with the target word. The target word is 
the word which has multiple senses at different contexts and its meaning needs to be 
disambiguated in the given context. Hypernymy is a semantic relation between two synsets to
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
show super-set hood (is a kind of) relation. For example,  (Transliteration: Belpatra) is a 
kind of  (Transliteration: Paat, Transliteration in English: Leaf). Therefore,  is a 
hypernymy and  is hyponymy [8]. 
75 
3. STATEMENT OF RESEARCH PROBLEM 
3.1. Problem Statement 
From analyzing many research works on WSD using WordNet such as [4], [5], [6], [7], [9] and 
[3], it was noticed that the WordNet is very useful resource to use for word sense disambiguation. 
However, it is not exactly suitable for knowledge-based, overlap selection WSD approaches. The 
reason is that the WordNet is built for general purpose in NLP tasks but is not focused for WSD. 
WordNet contains huge amount of information for words that are arranged with semantic relation. 
In all WSD methods, the WordNet is used to take a large number of words to disambiguate the 
meaning of multi-sense word. However, only very few words taken from the WordNet are used to 
disambiguate the different senses of a multi-sense word. In this sense, all the efforts such as 
processing time for CPU and memory to store large number of unused words are wasted. 
Furthermore, it is noticed that the words taken from the WordNet to disambiguate the multiple 
meaning of the multi-sense word itself creates the ambiguity resulting in decrease in accuracy. 
Another important point noticed from [3] is that when deeper levels of the hypernymy from the 
WordNet are used, the correctly disambiguated polysemy words are also incorrectly 
disambiguated. The reason is that the deeper (probably from second and/or third) levels of 
Hypernymy of all the different sense of a polysemy word are same as shown in Fig.1. That’s why, 
there is no any basis to disambiguate the different sense of the polysemy word. 
Example: WSD and Hypernymy in English WordNet: Let us consider a polysemy word Pen in 
English language. In WordNet, the word has five different meanings as a noun. The five 
meanings of the word Pen with the different levels of hypernymy are shown in Fig. 1. First look 
at the Fig.1b and Fig.1c. Here, we can see that the only the meaning of two different words are 
different. The hypernymy of these two senses are the same. In such case, there is no meaning of 
using the hypernymy of the two different senses of the polysemy word to disambiguate their 
meanings. Inclusion of these hypernymy is just waste of computational effort and waste of 
memory for the system. 
3.2. Solution Approach 
What we need is that the organization of the polysemy words must be organized in such a way 
that it should contain only those words that are sufficient to disambiguate the different senses of a 
polysemy word and it should not again introduce ambiguity as it is in the case of using WordNet. 
Although WordNet is very useful lexical resource that can be used to disambiguate the different 
meanings of a polysemy word, it has still some limitations to use in WSD algorithms as discussed 
in previous section. Therefore, to improve the accuracy of the knowledge-based overlap selection 
WSD algorithms dramatically, we need to develop a new logical model that organizes the words 
in such a way that it disambiguates the meanings of a polysemy word correctly and efficiently 
resulting in higher accuracy than that can be obtained from the use of WordNet.
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
76 
3.2.1. Organization of Clue Words 
Unlike in WordNet, we organized the different senses of polysemy word as well as single sense 
words. We grouped each sense of a polysemy word based on the verb, noun, adverb and adjective 
with which the sense of the polysemy word can be used in a sentence. The organization of the 
single sense is also done in the same way. This organization of words results in a new model of 
WordNet which is focused on WSD. This new model of WordNet contains all the necessary and 
sufficient words/information that can be used to disambiguate the senses of a polysemy word. It 
restricts the unnecessary words/information and therefore does not create ambiguity in ambiguity 
like WordNet. For instance, a sample organization of different meanings of a polysemy word Pen 
in English language as a noun is shown in Fig. 2. 
Figure 2. Organization of different senses of noun Pen in new model of WordNet based on clue words 
3.2.1. Algorithm 
The algorithm is very simple. First, we find collection of clue words for each sense of a polysemy 
word. The number of collection of words for a polysemy word is equal to the number of different 
senses of the polysemy. Let say these collection of words be T1, T2, T3,...,Tn where n is the 
number of different senses that the polysemy word has. After this, we find the collection of words 
(say C) from the context window. 
4. INTENDS 
The main objectives of this research work include: 
1. To organize the senses of polysemy words based on clue words. 
2. To develop a new model of WordNet based on clue words for both polysemy and 
single sense words. 
3. To test whether using the conventional or using the new model WordNet, the WSD 
algorithm gives the higher accuracy.
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
77 
5. HYPOTHESIS 
The conventional WordNet is very useful resource for language learning and language processing 
tasks. However, it is not suitable for knowledge-based WSD algorithms to get higher accuracy. 
We belief that organizing the different senses of polysemy words and context words based on 
clue words best suits for knowledge-based WSD algorithms resulting in higher accuracy. 
Moreover this reduces the computational effort for the system and saves the system memory 
while processing. 
6. RESEARCH METHODOLOGY 
Depending upon the nature of our research, we used experiment type of research strategy. We 
used the algorithm and test data already developed by [3] to test our new model of WordNet for 
WSD. Based on the research question, we set up two experiments with different settings. In first 
experiment, we used the system developed by [3] using the conventional sample Nepali WordNet. 
In second experiment, we only replaced the conventional sample Nepali WordNet by our new 
model of WordNet organized with clue words keeping all the other settings constant as in first 
experiment. 
6.1. Experiment 1 
Lexical Resource: In first experiment, we used the conventional sample Nepali WordNet 
developed by [3]. This conventional sample Nepali WordNet contains 348 words including 59 
polysemy words. The words in this WordNet are organized as in English WordNet. 
Experiment Setting: In this setting, we included the synset, gloss, example and hypernym of 
each word of in the collection of context words to form the final collection of context words and 
compared these words with the collection words of each sense of target word to determine 
overlaps. The number of examples for each word provided in this case is more than four in 
average. Moreover, we did not include the target word in the collection of context word. 
6.2. Experiment 2 
Lexical Resource: In this Second experiment, we used our new model of WordNet that is 
focused for word sense disambiguation. 
Experiment Setting: The experiment setting is same as the setting in Experiment 1 except that 
we replaced the conventional sample Nepali WordNet by our new model of WordNet organized 
with clue words. 
7. RESULT AND DISCUSSIONS 
After setting the two experiments, we executed the experiments using the test data developed by 
[3]. The test data consists of 201 different Nepali sentences each containing polysemy word. In 
experiment 1, we found that the polysemy words in 117 test sentences out of 201 are correctly 
disambiguated. This shows the accuracy of the system with conventional sample Nepali WordNet 
found to be 88.059%. In experiment 2, we found that the polysemy words in 184 test sentences 
out of 201 are correctly disambiguated. At this time, the accuracy of the system is found to be 
91.543%. Therefore, the accuracy of the system using our new model of WordNet that is specific
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
to WSD is found to be increased by 3.484%. The accuracies obtained in the two experiments are 
shown in Table 1. 
78 
Table 1. Heading and text fonts. 
Experiment 
No 
No. Of polysemy words 
that are correctly 
disambiguated 
No. Of polysemy words 
that are correctly 
disambiguated 
Accuracy 
in 
percentage 
1 177 24 88.059 
2 184 17 91.543 
8. CONCLUSIONS 
The WordNet organizes the words in the lexical database based on their meanings of the words 
instead of their forms as in dictionaries. It groups the nouns, verbs, adjectives and adverbs 
together into synonym sets, each expressing a distinct concept [2]. The words in a synonym set 
can be interchangeably used in many contexts. The main relationship among the words in 
WordNet is the synonym. From analyzing many research works on WSD using WordNet such as 
[4], [5], [6], [7], [9] and [3], we have noticed that the WordNet is not exactly suitable to use with 
knowledge-based, overlap selection WSD approaches. The reason is that the WordNet is built for 
general purpose in NLP tasks but not focused for WSD. In all WSD methods that used WordNet, 
the WordNet is used to take a large number of words to disambiguate the meaning of multi-sense 
word. But it is noticed that only very few words taken from the WordNet are used to 
disambiguate the different senses of a multi-sense word. In this sense, all the efforts such as 
processing time for CPU and memory to store large number of unused words are wasted. To 
defeat with these problems of WordNet, we developed a new model of WordNet that organizes 
the different senses of polysemy words as well as single sense word using the clue words. The 
experiment shows that the accuracy of the system using our new model of WordNet is found to be 
increased by 3.484% in compare with the accuracy of the system in [3]. This evidence leads us to 
conclude that our new model of WordNet gives the higher accuracy for the knowledge-based 
WSD algorithms rather than using the conventional WordNet. 
ACKNOWLEDGEMENTS 
We would like to kindly express our sincere thanks to Madan Puraskar Pustakalaya (MPP), Patan 
Dhoka, Lalitpur for providing Nepali corpus and Nepali online dictionary which formed the main 
resources of this research. We are equally thankful to Mr. Balram Prasain, Lecturer, Tribhuwan 
University for his valuable help in this research. 
REFERENCES 
[1] N. Ide and J. Véronis, “Word sense disambiguation: The state of the art,” Computational 
Linguistics, pp. 1–40, 1998. 
[2] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller, “Introduction to wordnet: An 
on-line lexical database,” International Journal of Lexicography, 1998. 
[3] U. R. Dhungana and S. Shakya, “Word sense disambiguation in nepali language,” in The Fourth 
International Conference on Digital Information and Communication Technology and Its 
Application (DICTAP2014), Bangkok, Thailand, 2014, pp. 46–50. 
[4] M. E. Lesk, “Automatic sense disambiguation using machine readable dictionaries: How to tell a 
pine cone from a ice cream cone,” in SIGDOC Conference, Toronto, Ontario, Canada, 1986.
International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 
[5] S. Banerjee and T. Pedersen, “An adapted lesk algorithm for word sense disambiguation using 
wordnet,” in Third International Conference on Intelligent Text Processing and Computational 
Linguistics, Gelbukh, 2002. 
[6] M. Sinha, M. K. Reddy, P. Bhattacharyya, P. Pandey, and L. Kashyap, “Hindi word sense 
79 
disambiguation,” Master’s thesis, Indian Institute of Technology Bombay, Mumbai, India, 2004. 
[7] N. Shrestha, A. V. H. Patrick, and S. K. Bista, “Resources for nepali word sense disambiguation,” 
in IEEE International conference on Natural Language Processing and Knowledge Engineering 
(IEEE NLP-KE’08), Beijing, China, 2008. 
[8] P. Bhattacharyya, P. Pande, and L. Lupu, “Hindi wordnet,” Indian Institute of Technology 
Bombay, Mumbai, India, Tech. Rep., 2008. 
[9] N. Shrestha, A. V. H. Patrick, and S. K. Bista, “Nepali word sense disambiguation using lesk 
algorithm,” Master’s thesis, Kathmandu University, Dhulikhel, Kavre, Nepal, 2004.

More Related Content

What's hot

ANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTKANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTKijnlc
 
Information retrieval based on word sens 1
Information retrieval based on word sens 1Information retrieval based on word sens 1
Information retrieval based on word sens 1ATHMAN HAJ-HAMOU
 
Ijartes v1-i1-002
Ijartes v1-i1-002Ijartes v1-i1-002
Ijartes v1-i1-002IJARTES
 
Automatic classification of bengali sentences based on sense definitions pres...
Automatic classification of bengali sentences based on sense definitions pres...Automatic classification of bengali sentences based on sense definitions pres...
Automatic classification of bengali sentences based on sense definitions pres...ijctcm
 
Cross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristicsCross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristicsijaia
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
 
Exploiting rules for resolving ambiguity in marathi language text
Exploiting rules for resolving ambiguity in marathi language textExploiting rules for resolving ambiguity in marathi language text
Exploiting rules for resolving ambiguity in marathi language texteSAT Journals
 
Pronominal anaphora resolution in
Pronominal anaphora resolution inPronominal anaphora resolution in
Pronominal anaphora resolution inijfcstjournal
 
Statistically-Enhanced New Word Identification
Statistically-Enhanced New Word IdentificationStatistically-Enhanced New Word Identification
Statistically-Enhanced New Word IdentificationAndi Wu
 
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONAN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONijnlc
 
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATIONA ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATIONkevig
 
Ijarcet vol-3-issue-1-9-11
Ijarcet vol-3-issue-1-9-11Ijarcet vol-3-issue-1-9-11
Ijarcet vol-3-issue-1-9-11Dhabal Sethi
 
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...kevig
 
Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...ijnlc
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
A Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to PunjabiA Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to PunjabiIJERA Editor
 
Amharic WSD using WordNet
Amharic WSD using WordNetAmharic WSD using WordNet
Amharic WSD using WordNetSeid Hassen
 
Chat bot using text similarity approach
Chat bot using text similarity approachChat bot using text similarity approach
Chat bot using text similarity approachdinesh_joshy
 

What's hot (19)

Detecting Paraphrases in Marathi Language
Detecting Paraphrases in Marathi LanguageDetecting Paraphrases in Marathi Language
Detecting Paraphrases in Marathi Language
 
ANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTKANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTK
 
Information retrieval based on word sens 1
Information retrieval based on word sens 1Information retrieval based on word sens 1
Information retrieval based on word sens 1
 
Ijartes v1-i1-002
Ijartes v1-i1-002Ijartes v1-i1-002
Ijartes v1-i1-002
 
Automatic classification of bengali sentences based on sense definitions pres...
Automatic classification of bengali sentences based on sense definitions pres...Automatic classification of bengali sentences based on sense definitions pres...
Automatic classification of bengali sentences based on sense definitions pres...
 
Cross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristicsCross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristics
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
 
Exploiting rules for resolving ambiguity in marathi language text
Exploiting rules for resolving ambiguity in marathi language textExploiting rules for resolving ambiguity in marathi language text
Exploiting rules for resolving ambiguity in marathi language text
 
Pronominal anaphora resolution in
Pronominal anaphora resolution inPronominal anaphora resolution in
Pronominal anaphora resolution in
 
Statistically-Enhanced New Word Identification
Statistically-Enhanced New Word IdentificationStatistically-Enhanced New Word Identification
Statistically-Enhanced New Word Identification
 
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONAN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
 
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATIONA ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
 
Ijarcet vol-3-issue-1-9-11
Ijarcet vol-3-issue-1-9-11Ijarcet vol-3-issue-1-9-11
Ijarcet vol-3-issue-1-9-11
 
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
 
Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
A Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to PunjabiA Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
 
Amharic WSD using WordNet
Amharic WSD using WordNetAmharic WSD using WordNet
Amharic WSD using WordNet
 
Chat bot using text similarity approach
Chat bot using text similarity approachChat bot using text similarity approach
Chat bot using text similarity approach
 

Viewers also liked

Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...ijnlc
 
IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...
IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...
IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...ijnlc
 
Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...ijnlc
 
Processing vietnamese news titles to answer relative questions in vnewsqa ict...
Processing vietnamese news titles to answer relative questions in vnewsqa ict...Processing vietnamese news titles to answer relative questions in vnewsqa ict...
Processing vietnamese news titles to answer relative questions in vnewsqa ict...ijnlc
 
Heval yet another human evaluation
Heval yet another human evaluationHeval yet another human evaluation
Heval yet another human evaluationijnlc
 
UNL-ization of Numbers and Ordinals in Punjabi with IAN
UNL-ization of Numbers and Ordinals in Punjabi with IANUNL-ization of Numbers and Ordinals in Punjabi with IAN
UNL-ization of Numbers and Ordinals in Punjabi with IANijnlc
 
SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...
SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...
SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...ijnlc
 
Understanding and estimation of
Understanding and estimation ofUnderstanding and estimation of
Understanding and estimation ofijnlc
 
SEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMAR
SEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMARSEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMAR
SEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMARijnlc
 
Kridanta analysis for sanskrit
Kridanta analysis for sanskritKridanta analysis for sanskrit
Kridanta analysis for sanskritijnlc
 
OPINION MINING AND ANALYSIS: A SURVEY
OPINION MINING AND ANALYSIS: A SURVEYOPINION MINING AND ANALYSIS: A SURVEY
OPINION MINING AND ANALYSIS: A SURVEYijnlc
 
G2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian languageG2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian languageijnlc
 
ALGORITHM FOR TEXT TO GRAPH CONVERSION
ALGORITHM FOR TEXT TO GRAPH CONVERSION ALGORITHM FOR TEXT TO GRAPH CONVERSION
ALGORITHM FOR TEXT TO GRAPH CONVERSION ijnlc
 
KANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATION
KANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATIONKANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATION
KANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATIONijnlc
 
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...ijnlc
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...ijnlc
 
Contextual Analysis for Middle Eastern Languages with Hidden Markov Models
Contextual Analysis for Middle Eastern Languages with Hidden Markov ModelsContextual Analysis for Middle Eastern Languages with Hidden Markov Models
Contextual Analysis for Middle Eastern Languages with Hidden Markov Modelsijnlc
 
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATIONA SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATIONijnlc
 
Document excel
Document excelDocument excel
Document excelguest4ba72dc
 
Cs Calcio Balilla 29.05.09
Cs Calcio Balilla 29.05.09Cs Calcio Balilla 29.05.09
Cs Calcio Balilla 29.05.09Claudia Porchietto
 

Viewers also liked (20)

Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...
 
IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...
IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...
IMPLEMENTING A SUBCATEGORIZED PROBABILISTIC DEFINITE CLAUSE GRAMMAR FOR VIETN...
 
Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...
 
Processing vietnamese news titles to answer relative questions in vnewsqa ict...
Processing vietnamese news titles to answer relative questions in vnewsqa ict...Processing vietnamese news titles to answer relative questions in vnewsqa ict...
Processing vietnamese news titles to answer relative questions in vnewsqa ict...
 
Heval yet another human evaluation
Heval yet another human evaluationHeval yet another human evaluation
Heval yet another human evaluation
 
UNL-ization of Numbers and Ordinals in Punjabi with IAN
UNL-ization of Numbers and Ordinals in Punjabi with IANUNL-ization of Numbers and Ordinals in Punjabi with IAN
UNL-ization of Numbers and Ordinals in Punjabi with IAN
 
SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...
SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...
SEMANTIC PROCESSING MECHANISM FOR LISTENING AND COMPREHENSION IN VNSCALENDAR ...
 
Understanding and estimation of
Understanding and estimation ofUnderstanding and estimation of
Understanding and estimation of
 
SEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMAR
SEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMARSEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMAR
SEMANTIC PARSING OF SIMPLE SENTENCES IN UNIFICATION-BASED VIETNAMESE GRAMMAR
 
Kridanta analysis for sanskrit
Kridanta analysis for sanskritKridanta analysis for sanskrit
Kridanta analysis for sanskrit
 
OPINION MINING AND ANALYSIS: A SURVEY
OPINION MINING AND ANALYSIS: A SURVEYOPINION MINING AND ANALYSIS: A SURVEY
OPINION MINING AND ANALYSIS: A SURVEY
 
G2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian languageG2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian language
 
ALGORITHM FOR TEXT TO GRAPH CONVERSION
ALGORITHM FOR TEXT TO GRAPH CONVERSION ALGORITHM FOR TEXT TO GRAPH CONVERSION
ALGORITHM FOR TEXT TO GRAPH CONVERSION
 
KANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATION
KANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATIONKANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATION
KANNADA NAMED ENTITY RECOGNITION AND CLASSIFICATION
 
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
 
Contextual Analysis for Middle Eastern Languages with Hidden Markov Models
Contextual Analysis for Middle Eastern Languages with Hidden Markov ModelsContextual Analysis for Middle Eastern Languages with Hidden Markov Models
Contextual Analysis for Middle Eastern Languages with Hidden Markov Models
 
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATIONA SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
 
Document excel
Document excelDocument excel
Document excel
 
Cs Calcio Balilla 29.05.09
Cs Calcio Balilla 29.05.09Cs Calcio Balilla 29.05.09
Cs Calcio Balilla 29.05.09
 

Similar to WSD model organizes polysemy words

AMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYAMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYijnlc
 
Ny3424442448
Ny3424442448Ny3424442448
Ny3424442448IJERA Editor
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSijwscjournal
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSijwscjournal
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSijwscjournal
 
Improvement wsd dictionary using annotated corpus and testing it with simplif...
Improvement wsd dictionary using annotated corpus and testing it with simplif...Improvement wsd dictionary using annotated corpus and testing it with simplif...
Improvement wsd dictionary using annotated corpus and testing it with simplif...csandit
 
Paper id 25201466
Paper id 25201466Paper id 25201466
Paper id 25201466IJRAT
 
A supervised word sense disambiguation method using ontology and context know...
A supervised word sense disambiguation method using ontology and context know...A supervised word sense disambiguation method using ontology and context know...
A supervised word sense disambiguation method using ontology and context know...Alexander Decker
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityijaia
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) inventionjournals
 
A performance of svm with modified lesk approach for word sense disambiguatio...
A performance of svm with modified lesk approach for word sense disambiguatio...A performance of svm with modified lesk approach for word sense disambiguatio...
A performance of svm with modified lesk approach for word sense disambiguatio...eSAT Journals
 
SETSWANA PART OF SPEECH TAGGING
SETSWANA PART OF SPEECH TAGGINGSETSWANA PART OF SPEECH TAGGING
SETSWANA PART OF SPEECH TAGGINGkevig
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYUSING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYijaia
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
Word sense disambiguation a survey
Word sense disambiguation  a surveyWord sense disambiguation  a survey
Word sense disambiguation a surveyijctcm
 

Similar to WSD model organizes polysemy words (20)

AMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYAMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITY
 
Ny3424442448
Ny3424442448Ny3424442448
Ny3424442448
 
ijcai11
ijcai11ijcai11
ijcai11
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
 
Improvement wsd dictionary using annotated corpus and testing it with simplif...
Improvement wsd dictionary using annotated corpus and testing it with simplif...Improvement wsd dictionary using annotated corpus and testing it with simplif...
Improvement wsd dictionary using annotated corpus and testing it with simplif...
 
Paper id 25201466
Paper id 25201466Paper id 25201466
Paper id 25201466
 
A supervised word sense disambiguation method using ontology and context know...
A supervised word sense disambiguation method using ontology and context know...A supervised word sense disambiguation method using ontology and context know...
A supervised word sense disambiguation method using ontology and context know...
 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivity
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
A performance of svm with modified lesk approach for word sense disambiguatio...
A performance of svm with modified lesk approach for word sense disambiguatio...A performance of svm with modified lesk approach for word sense disambiguatio...
A performance of svm with modified lesk approach for word sense disambiguatio...
 
SETSWANA PART OF SPEECH TAGGING
SETSWANA PART OF SPEECH TAGGINGSETSWANA PART OF SPEECH TAGGING
SETSWANA PART OF SPEECH TAGGING
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYUSING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Word sense disambiguation a survey
Word sense disambiguation  a surveyWord sense disambiguation  a survey
Word sense disambiguation a survey
 

Recently uploaded

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 

Recently uploaded (20)

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 

WSD model organizes polysemy words

  • 1. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 WORD SENSE DISAMBIGUATION USING WSD SPECIFIC WORDNET OF POLYSEMY WORDS Udaya Raj Dhungana1, Subarna Shakya2, Kabita Baral3 and Bharat Sharma4 1, 2, 4Department of Electronics and Computer Engineering, Central Campus, IOE, Tribhuvan University, Lalitpur, Nepal 3Department of Computer Science, GBS, Lamachaur, Kaski, Nepal ABSTRACT This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy word based on the clue words. The related words for each sense of a polysemy word as well as single sense word are referred to as the clue words. The conventional WordNet organizes nouns, verbs, adjectives and adverbs together into sets of synonyms called synsets each expressing a different concept. In contrast to the structure of WordNet, we developed a new model of WordNet that organizes the different senses of polysemy words as well as the single sense words based on the clue words. These clue words for each sense of a polysemy word as well as for single sense word are used to disambiguate the correct meaning of the polysemy word in the given context using knowledge based Word Sense Disambiguation (WSD) algorithms. The clue word can be a noun, verb, adjective or adverb. KEYWORDS Word Sense Disambiguation, WordNet, Polysemy Words, Synset, Hypernymy, Context word, Clue Words 1. INTRODUCTION Words that express two or more different meanings when used in different contexts are referred to as polysemy or multi-sense words. Every natural language contains such polysemy words in its vocabulary. Such polysemy words create a big problem during the translation of one natural language to another. To translate the correct meaning of the polysemy word, the machine must first know the context in which the polysemy word has been used. Only after this, the machine can find out the correct meaning of the word in the particular context and can translate the meaning of that word into the correct word in another language. The process of finding the correct meaning of the polysemy word using machine by analyzing the context in which the polysemy word has been used is referred as Word Sense Disambiguation (WSD). Although different methods have been tested to find the correct sense of the polysemy word, accuracy at satisfactory level has not been obtained yet. Among the different methods used for WSD, this research focused its study on the knowledge-based approach. The knowledge-based approaches use the resources such as dictionaries thesauri, ontology, collocation etc to disambiguate a word in a given context [1]. These days, the WordNet is becoming popular as a resource to be used in knowledge-based approach to disambiguate the meanings of polysemy words. WordNet is a lexical database developed at Princeton University for English language [2]. The WordNet organizes nouns, verbs, adjectives and adverbs into the groups of synonyms and describes the relationships between these DOI : 10.5121/ijnlc.2014.3405 73
  • 2. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 synonym groups forming a semantic network among the words. After the development of English WordNet, many other WordNet on other languages such as Spanish WordNet, Italian WordNet, Hindi WordNet etc were built. These WordNet are used as one important resource to disambiguate the different meanings of a polysemy words in respective languages. To disambiguate the meaning of a polysemy word using the WordNet, the related words from synset, gloss and different levels of hypernym are collected from the WordNet database and these related words are compared to find the overlaps using different WSD algorithm. However, the collection of related words from synset, gloss and different level of hypernym that are taken from the WordNet contains only very few words that can be used to disambiguate the correct sense of a polysemy word in the given context [3]. This increases only the computational overhead for the WSD algorithm and for the system. Moreover, according to [3] as we go increasing the levels of hypernym to collect the related words, the sense of the words becomes more general and the two different senses of a polysemy word are found to have the same hypernym. This therefore creates another ambiguity in ambiguity. To overcome this problem, we have developed a new model to organize both the single sense and multi-sense words. 74 2. RELATED TASKS In 1986, Lesk Michael [4] developed an algorithm called Lesk algorithm to identify senses of polysemy words. He used the overlap of word definition from the Oxford Advanced Learner’s Dictionary of Current English (OALD) to disambiguate the word senses. Banerjee and Pedersen [5] adapted the original Lesk algorithm to use the lexical database WordNet. They used Senseval-2 word sense disambiguation exercise to evaluate their system and the overall accuracy was found to be 32%. To compare two glosses, they used the longest sequence of one or more consecutive words that occurs in both glosses. For each overlap, a square of the number of words in the overlap is calculated and the final score is the sum of all overlaps. Sinha M. et. al. [6] developed automatic WSD for Hindi language using Hindi WordNet at IIT Bombay. They used statistical method for determining the senses. The system could disambiguate the nouns only. They compared the context of the polysemy word in a sentence with the contexts constructed from the WordNet. They evaluated the system using the Hindi corpora provided by the Central Institute of Indian Languages (CIIL). The accuracy of their algorithm was found in the range from 40% to 70%. They used simple overlap method to determine the winner sense. In [7], Shrestha N. et. al. used the Lesk algorithm to disambiguate the Nepali ambiguous words. They modified the Lesk algorithm in such a way that only the words in the sentence without their synset, gloss, examples and hypernym are taken as context words. Each word in the context words is compared with each word in the collection of words formed by the synset, gloss, examples and hypernym of each sense of the target word. They did not include the synset, gloss, example and hypernym of the context words in the collection of context words. Moreover, the number of example for each sense of the target word was only one. In [3], Dhungana and Shakya used the adapted Lesk algorithm to disambiguate the polysemy word in Nepali language. The experiments performed on 348 words (including the different senses of 59 polysemy words and context words) with the test data containing 201 Nepali sentences shows the accuracy of their system to be 88.05%. This accuracy is found to be increased by 16.41%, in compared to the accuracy of the system developed by [7]. Context word refers to the word that is used in context with the target word. The target word is the word which has multiple senses at different contexts and its meaning needs to be disambiguated in the given context. Hypernymy is a semantic relation between two synsets to
  • 3. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 show super-set hood (is a kind of) relation. For example, (Transliteration: Belpatra) is a kind of (Transliteration: Paat, Transliteration in English: Leaf). Therefore, is a hypernymy and is hyponymy [8]. 75 3. STATEMENT OF RESEARCH PROBLEM 3.1. Problem Statement From analyzing many research works on WSD using WordNet such as [4], [5], [6], [7], [9] and [3], it was noticed that the WordNet is very useful resource to use for word sense disambiguation. However, it is not exactly suitable for knowledge-based, overlap selection WSD approaches. The reason is that the WordNet is built for general purpose in NLP tasks but is not focused for WSD. WordNet contains huge amount of information for words that are arranged with semantic relation. In all WSD methods, the WordNet is used to take a large number of words to disambiguate the meaning of multi-sense word. However, only very few words taken from the WordNet are used to disambiguate the different senses of a multi-sense word. In this sense, all the efforts such as processing time for CPU and memory to store large number of unused words are wasted. Furthermore, it is noticed that the words taken from the WordNet to disambiguate the multiple meaning of the multi-sense word itself creates the ambiguity resulting in decrease in accuracy. Another important point noticed from [3] is that when deeper levels of the hypernymy from the WordNet are used, the correctly disambiguated polysemy words are also incorrectly disambiguated. The reason is that the deeper (probably from second and/or third) levels of Hypernymy of all the different sense of a polysemy word are same as shown in Fig.1. That’s why, there is no any basis to disambiguate the different sense of the polysemy word. Example: WSD and Hypernymy in English WordNet: Let us consider a polysemy word Pen in English language. In WordNet, the word has five different meanings as a noun. The five meanings of the word Pen with the different levels of hypernymy are shown in Fig. 1. First look at the Fig.1b and Fig.1c. Here, we can see that the only the meaning of two different words are different. The hypernymy of these two senses are the same. In such case, there is no meaning of using the hypernymy of the two different senses of the polysemy word to disambiguate their meanings. Inclusion of these hypernymy is just waste of computational effort and waste of memory for the system. 3.2. Solution Approach What we need is that the organization of the polysemy words must be organized in such a way that it should contain only those words that are sufficient to disambiguate the different senses of a polysemy word and it should not again introduce ambiguity as it is in the case of using WordNet. Although WordNet is very useful lexical resource that can be used to disambiguate the different meanings of a polysemy word, it has still some limitations to use in WSD algorithms as discussed in previous section. Therefore, to improve the accuracy of the knowledge-based overlap selection WSD algorithms dramatically, we need to develop a new logical model that organizes the words in such a way that it disambiguates the meanings of a polysemy word correctly and efficiently resulting in higher accuracy than that can be obtained from the use of WordNet.
  • 4. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 76 3.2.1. Organization of Clue Words Unlike in WordNet, we organized the different senses of polysemy word as well as single sense words. We grouped each sense of a polysemy word based on the verb, noun, adverb and adjective with which the sense of the polysemy word can be used in a sentence. The organization of the single sense is also done in the same way. This organization of words results in a new model of WordNet which is focused on WSD. This new model of WordNet contains all the necessary and sufficient words/information that can be used to disambiguate the senses of a polysemy word. It restricts the unnecessary words/information and therefore does not create ambiguity in ambiguity like WordNet. For instance, a sample organization of different meanings of a polysemy word Pen in English language as a noun is shown in Fig. 2. Figure 2. Organization of different senses of noun Pen in new model of WordNet based on clue words 3.2.1. Algorithm The algorithm is very simple. First, we find collection of clue words for each sense of a polysemy word. The number of collection of words for a polysemy word is equal to the number of different senses of the polysemy. Let say these collection of words be T1, T2, T3,...,Tn where n is the number of different senses that the polysemy word has. After this, we find the collection of words (say C) from the context window. 4. INTENDS The main objectives of this research work include: 1. To organize the senses of polysemy words based on clue words. 2. To develop a new model of WordNet based on clue words for both polysemy and single sense words. 3. To test whether using the conventional or using the new model WordNet, the WSD algorithm gives the higher accuracy.
  • 5. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 77 5. HYPOTHESIS The conventional WordNet is very useful resource for language learning and language processing tasks. However, it is not suitable for knowledge-based WSD algorithms to get higher accuracy. We belief that organizing the different senses of polysemy words and context words based on clue words best suits for knowledge-based WSD algorithms resulting in higher accuracy. Moreover this reduces the computational effort for the system and saves the system memory while processing. 6. RESEARCH METHODOLOGY Depending upon the nature of our research, we used experiment type of research strategy. We used the algorithm and test data already developed by [3] to test our new model of WordNet for WSD. Based on the research question, we set up two experiments with different settings. In first experiment, we used the system developed by [3] using the conventional sample Nepali WordNet. In second experiment, we only replaced the conventional sample Nepali WordNet by our new model of WordNet organized with clue words keeping all the other settings constant as in first experiment. 6.1. Experiment 1 Lexical Resource: In first experiment, we used the conventional sample Nepali WordNet developed by [3]. This conventional sample Nepali WordNet contains 348 words including 59 polysemy words. The words in this WordNet are organized as in English WordNet. Experiment Setting: In this setting, we included the synset, gloss, example and hypernym of each word of in the collection of context words to form the final collection of context words and compared these words with the collection words of each sense of target word to determine overlaps. The number of examples for each word provided in this case is more than four in average. Moreover, we did not include the target word in the collection of context word. 6.2. Experiment 2 Lexical Resource: In this Second experiment, we used our new model of WordNet that is focused for word sense disambiguation. Experiment Setting: The experiment setting is same as the setting in Experiment 1 except that we replaced the conventional sample Nepali WordNet by our new model of WordNet organized with clue words. 7. RESULT AND DISCUSSIONS After setting the two experiments, we executed the experiments using the test data developed by [3]. The test data consists of 201 different Nepali sentences each containing polysemy word. In experiment 1, we found that the polysemy words in 117 test sentences out of 201 are correctly disambiguated. This shows the accuracy of the system with conventional sample Nepali WordNet found to be 88.059%. In experiment 2, we found that the polysemy words in 184 test sentences out of 201 are correctly disambiguated. At this time, the accuracy of the system is found to be 91.543%. Therefore, the accuracy of the system using our new model of WordNet that is specific
  • 6. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 to WSD is found to be increased by 3.484%. The accuracies obtained in the two experiments are shown in Table 1. 78 Table 1. Heading and text fonts. Experiment No No. Of polysemy words that are correctly disambiguated No. Of polysemy words that are correctly disambiguated Accuracy in percentage 1 177 24 88.059 2 184 17 91.543 8. CONCLUSIONS The WordNet organizes the words in the lexical database based on their meanings of the words instead of their forms as in dictionaries. It groups the nouns, verbs, adjectives and adverbs together into synonym sets, each expressing a distinct concept [2]. The words in a synonym set can be interchangeably used in many contexts. The main relationship among the words in WordNet is the synonym. From analyzing many research works on WSD using WordNet such as [4], [5], [6], [7], [9] and [3], we have noticed that the WordNet is not exactly suitable to use with knowledge-based, overlap selection WSD approaches. The reason is that the WordNet is built for general purpose in NLP tasks but not focused for WSD. In all WSD methods that used WordNet, the WordNet is used to take a large number of words to disambiguate the meaning of multi-sense word. But it is noticed that only very few words taken from the WordNet are used to disambiguate the different senses of a multi-sense word. In this sense, all the efforts such as processing time for CPU and memory to store large number of unused words are wasted. To defeat with these problems of WordNet, we developed a new model of WordNet that organizes the different senses of polysemy words as well as single sense word using the clue words. The experiment shows that the accuracy of the system using our new model of WordNet is found to be increased by 3.484% in compare with the accuracy of the system in [3]. This evidence leads us to conclude that our new model of WordNet gives the higher accuracy for the knowledge-based WSD algorithms rather than using the conventional WordNet. ACKNOWLEDGEMENTS We would like to kindly express our sincere thanks to Madan Puraskar Pustakalaya (MPP), Patan Dhoka, Lalitpur for providing Nepali corpus and Nepali online dictionary which formed the main resources of this research. We are equally thankful to Mr. Balram Prasain, Lecturer, Tribhuwan University for his valuable help in this research. REFERENCES [1] N. Ide and J. VĂ©ronis, “Word sense disambiguation: The state of the art,” Computational Linguistics, pp. 1–40, 1998. [2] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller, “Introduction to wordnet: An on-line lexical database,” International Journal of Lexicography, 1998. [3] U. R. Dhungana and S. Shakya, “Word sense disambiguation in nepali language,” in The Fourth International Conference on Digital Information and Communication Technology and Its Application (DICTAP2014), Bangkok, Thailand, 2014, pp. 46–50. [4] M. E. Lesk, “Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from a ice cream cone,” in SIGDOC Conference, Toronto, Ontario, Canada, 1986.
  • 7. International Journal on Natural Language Computing (IJNLC) Vol. 3, No.4, August 2014 [5] S. Banerjee and T. Pedersen, “An adapted lesk algorithm for word sense disambiguation using wordnet,” in Third International Conference on Intelligent Text Processing and Computational Linguistics, Gelbukh, 2002. [6] M. Sinha, M. K. Reddy, P. Bhattacharyya, P. Pandey, and L. Kashyap, “Hindi word sense 79 disambiguation,” Master’s thesis, Indian Institute of Technology Bombay, Mumbai, India, 2004. [7] N. Shrestha, A. V. H. Patrick, and S. K. Bista, “Resources for nepali word sense disambiguation,” in IEEE International conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE’08), Beijing, China, 2008. [8] P. Bhattacharyya, P. Pande, and L. Lupu, “Hindi wordnet,” Indian Institute of Technology Bombay, Mumbai, India, Tech. Rep., 2008. [9] N. Shrestha, A. V. H. Patrick, and S. K. Bista, “Nepali word sense disambiguation using lesk algorithm,” Master’s thesis, Kathmandu University, Dhulikhel, Kavre, Nepal, 2004.