Based on the sense definition of words available in the Bengali WordNet, an attempt is made to classify the Bengali sentences automatically into different groups in accordance with their underlying senses. The input sentences are collected from 50 different categories of the Bengali text corpus developed in the TDIL project of the Govt. of India, while information about the different senses of particular ambiguous lexical item is collected from Bengali WordNet. In an experimental basis we have used Naive Bayes probabilistic model as a useful classifier of sentences. We have applied the algorithm over 1747 sentences that contain a particular Bengali lexical item which, because of its ambiguous nature, is able to trigger different senses that render sentences in different meanings. In our experiment we have achieved around 84% accurate result on the sense classification over the total input sentences. We have analyzed those residual sentences that did not comply with our experiment and did affect the results to note that in many cases, wrong syntactic structures and less semantic information are the main hurdles in semantic classification of sentences. The applicational relevance of this study is attested in automatic text classification, machine learning, information extraction, and word sense disambiguation
11.development of a feature extraction technique for online character recogni...Alexander Decker
The document describes a study that developed a hybrid feature extraction technique for online character recognition. The technique combines geometrical and statistical features. Geometrical features included stroke information (number, pressure, junctions, horizontal projection count) and contour pixels. Statistical features included zoning, which divides the character image into zones and calculates the percentage of black pixels in each zone. A hybrid algorithm was created that integrated geometrical and statistical features to take advantage of their complementarity and gain new insights into character properties. The goal was to improve recognition performance over existing single-feature techniques.
Development of a feature extraction technique for online character recognitio...Alexander Decker
The document describes a study that developed a hybrid feature extraction technique for online character recognition. The technique combines geometrical and statistical features. Geometrical features included stroke information (number, pressure, junctions, horizontal projection count) and contour pixels. Statistical features included zoning, which divides the character image into zones and calculates the percentage of black pixels in each zone. A hybrid algorithm was created that integrated geometrical and statistical features to take advantage of their complementarity and gain new insights into character properties. The goal was to improve recognition performance over existing single-feature techniques.
Trends of machine learning in 2020 - International Journal of Artificial Inte...gerogepatton
This document discusses a hybrid machine learning algorithm to automatically classify measurement types from NASA's airborne measurement data archive. The goal is to develop an efficient and accurate algorithm that meets performance metrics. The proposed algorithm uses decision trees to select and weight features, then applies a weighted Naive Bayes classifier due to correlated inputs. It was deployed successfully at an industrial scale, balancing performance and resources required.
Here are a few thoughts in response to the questions posed:
1. If we took seriously the opportunities for learning in the digital era, our research and technology design would focus more on interactive, collaborative, and experiential modes of learning. We would leverage the connectivity of digital networks to foster open-ended exploration and knowledge-building across traditional boundaries. Assessment would emphasize real-world problem-solving and creation over rote memorization.
2. Moving beyond traditional academic structures could encourage more cross-disciplinary, project-based learning. Students would have more flexibility to follow their interests and passions, pursuing learning experiences both inside and outside the classroom. Communities may form around shared challenges rather than departments. Evaluation would focus on competencies developed
Top 10 Natural Language Processing Trends in 2020 - International Journal on ...kevig
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
11.development of a writer independent online handwritten character recogniti...Alexander Decker
This document describes the development of an online handwritten character recognition system using a modified hybrid neural network model. It developed a hybrid feature extraction technique that combines stroke information, contour pixels, and zoning of characters to create feature vectors. A hybrid neural network model combining modified counterpropagation and optical backpropagation networks was also developed. Experiments using 6,200 character samples from 50 subjects achieved a 99% recognition rate with an average recognition time of 2 milliseconds when testing samples from new subjects.
P. Sathiya is seeking a position that allows her to utilize her skills and knowledge to help an organization grow. She has a B.Tech in IT from Adhiyamaan College of Engineering with an 8.29 CGPA. Her skills include MySQL, Visual Basic, and interests in DBMS and networking. She has participated in paper presentations, workshops, seminars and conferences on topics such as human brain intelligence, web design, mobile application development, and statistical source anonymity in sensor networks.
11.development of a feature extraction technique for online character recogni...Alexander Decker
The document describes a study that developed a hybrid feature extraction technique for online character recognition. The technique combines geometrical and statistical features. Geometrical features included stroke information (number, pressure, junctions, horizontal projection count) and contour pixels. Statistical features included zoning, which divides the character image into zones and calculates the percentage of black pixels in each zone. A hybrid algorithm was created that integrated geometrical and statistical features to take advantage of their complementarity and gain new insights into character properties. The goal was to improve recognition performance over existing single-feature techniques.
Development of a feature extraction technique for online character recognitio...Alexander Decker
The document describes a study that developed a hybrid feature extraction technique for online character recognition. The technique combines geometrical and statistical features. Geometrical features included stroke information (number, pressure, junctions, horizontal projection count) and contour pixels. Statistical features included zoning, which divides the character image into zones and calculates the percentage of black pixels in each zone. A hybrid algorithm was created that integrated geometrical and statistical features to take advantage of their complementarity and gain new insights into character properties. The goal was to improve recognition performance over existing single-feature techniques.
Trends of machine learning in 2020 - International Journal of Artificial Inte...gerogepatton
This document discusses a hybrid machine learning algorithm to automatically classify measurement types from NASA's airborne measurement data archive. The goal is to develop an efficient and accurate algorithm that meets performance metrics. The proposed algorithm uses decision trees to select and weight features, then applies a weighted Naive Bayes classifier due to correlated inputs. It was deployed successfully at an industrial scale, balancing performance and resources required.
Here are a few thoughts in response to the questions posed:
1. If we took seriously the opportunities for learning in the digital era, our research and technology design would focus more on interactive, collaborative, and experiential modes of learning. We would leverage the connectivity of digital networks to foster open-ended exploration and knowledge-building across traditional boundaries. Assessment would emphasize real-world problem-solving and creation over rote memorization.
2. Moving beyond traditional academic structures could encourage more cross-disciplinary, project-based learning. Students would have more flexibility to follow their interests and passions, pursuing learning experiences both inside and outside the classroom. Communities may form around shared challenges rather than departments. Evaluation would focus on competencies developed
Top 10 Natural Language Processing Trends in 2020 - International Journal on ...kevig
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
11.development of a writer independent online handwritten character recogniti...Alexander Decker
This document describes the development of an online handwritten character recognition system using a modified hybrid neural network model. It developed a hybrid feature extraction technique that combines stroke information, contour pixels, and zoning of characters to create feature vectors. A hybrid neural network model combining modified counterpropagation and optical backpropagation networks was also developed. Experiments using 6,200 character samples from 50 subjects achieved a 99% recognition rate with an average recognition time of 2 milliseconds when testing samples from new subjects.
P. Sathiya is seeking a position that allows her to utilize her skills and knowledge to help an organization grow. She has a B.Tech in IT from Adhiyamaan College of Engineering with an 8.29 CGPA. Her skills include MySQL, Visual Basic, and interests in DBMS and networking. She has participated in paper presentations, workshops, seminars and conferences on topics such as human brain intelligence, web design, mobile application development, and statistical source anonymity in sensor networks.
Munhyong Kim is a PhD candidate in Linguistics at Seoul National University, specializing in Natural Language Processing and Computational Linguistics under advisor Shin Hyopil. He has authored and co-authored several publications in journals and conferences. His research experiences include projects on sentiment analysis, temporal expression analysis, and machine translation between structurally different languages. He has also held teaching assistant positions at SNU.
Thair Khdour is an Associate Professor of Computer Science from Jordan. He received his Ph.D from the University of Essex in 2008. His research interests include artificial intelligence, information retrieval, machine learning, information security, and other topics. He has over 10 years of experience teaching at universities in Jordan and has published several journal articles.
Harish Tureha is seeking a position that allows him to apply his engineering skills to real-world problems. He has a M.Tech in Information Technology from College of Technology, GB Pant University with 7.87/10 CGPA. His technical skills include programming languages C and Java and simulation software NS2. He has work experience teaching at College of Technology, GB Pant University for six months and completed professional training at Kortek Electronics and HCL Infosystems. His personal strengths are a positive attitude and adaptability. He was captain of his university basketball team and represented his university in several tournaments.
This document discusses offline handwritten Devanagari script recognition using a probabilistic neural network. It begins with an abstract that outlines the goal of recognizing offline handwritten Devanagari numerals using structural and local features classified with a probabilistic neural network classifier. The introduction provides background on handwritten numeral recognition challenges. The document then reviews related work on character recognition from the early 1900s to modern advancements, describes the Devanagari script, discusses theoretical neural network and proposed recognition methods, and concludes that accurate recognition depends on the input quality and more efficient, accurate systems are needed to recognize varied writing styles.
The International Journal of Network Security & Its Applications (IJNSA) -- ...IJNSA Journal
The International Journal of Network Security & Its Applications (IJNSA) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the computer Network Security & its applications. The journal focuses on all technical and practical aspects of security and its applications for wired and wireless networks. The goal of this journal is to bring together researchers and practitioners from academia and industry to focus on understanding Modern security threats and countermeasures, and establishing new collaborations in these areas.
This document provides a summary of Satanjeev Banerjee's education, work experience, areas of research interest, publications, software projects, and references. It details his PhD studies in language technologies at Carnegie Mellon University, as well as his master's degrees from CMU and the University of Minnesota Duluth. His work experience includes research assistantships at CMU working on topics like speech summarization and meeting understanding. He has numerous publications in these areas and has developed software like the SmartNotes meeting recording system and the METEOR machine translation evaluation metric.
September 2022: Top 10 Read Articles in Natural Language Computingkevig
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
International Journal of Engineering and Science Invention (IJESI) inventionjournals
International Journal of Engineering and Science Invention (IJESI) is an international journal intended for professionals and researchers in all fields of computer science and electronics. IJESI publishes research articles and reviews within the whole field Engineering Science and Technology, new teaching methods, assessment, validation and the impact of new technologies and it will continue to provide information on the latest trends and developments in this ever-expanding subject. The publications of papers are selected through double peer reviewed to ensure originality, relevance, and readability. The articles published in our journal can be accessed online
A comparative analysis of particle swarm optimization and k means algorithm f...ijnlc
The volume of digitized text documents on the web have been increasing rapidly. As there is huge collection
of data on the web there is a need for grouping(clustering) the documents into clusters for speedy
information retrieval. Clustering of documents is collection of documents into groups such that the
documents within each group are similar to each other and not to documents of other groups. Quality of
clustering result depends greatly on the representation of text and the clustering algorithm. This paper
presents a comparative analysis of three algorithms namely K-means, Particle swarm Optimization (PSO)
and hybrid PSO+K-means algorithm for clustering of text documents using WordNet. The common way of
representing a text document is bag of terms. The bag of terms representation is often unsatisfactory as it
does not exploit the semantics. In this paper, texts are represented in terms of synsets corresponding to a
word. Bag of terms data representation of text is thus enriched with synonyms from WordNet. K-means,
Particle Swarm Optimization (PSO) and hybrid PSO+K-means algorithms are applied for clustering of
text in Nepali language. Experimental evaluation is performed by using intra cluster similarity and inter
cluster similarity.
New research articles 2020 october issue international journal of multimedi...ijma
The International Journal of Multimedia & Its Applications (IJMA) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the Multimedia & its applications. The journal focuses on all technical and practical aspects of Multimedia and its applications. The goal of this journal is to bring together researchers and practitioners from academia and industry to focus on understanding recent developments this arena, and establishing new collaborations in these areas.
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
New Research Articles 2020 May Issue International Journal of Software Engin...ijseajournal
This document proposes an agent-based approach to systematically specify auditability requirements during goal-oriented requirements engineering. It presents a case study applying this approach to the design of a system called LawDisTrA that distributes lawsuits among judges in a transparent manner. The approach uses an interdependency graph to capture different facets of transparency and their operationalization. An evaluation of a implemented LawDisTrA system that distributed over 300,000 lawsuits demonstrated the ability of the presented approach to address the cross-organizational nature of transparency through adequate auditability techniques.
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...kevig
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
Possibility of interdisciplinary research software engineering andnatural lan...Nakul Sharma
This document discusses the possibility of interdisciplinary research between software engineering and natural language processing. It provides a literature review of research papers from 2003 to 2014 related to applying tools and techniques from one field to the other. Some key areas discussed include generating UML diagrams from natural language text, developing ontologies to clarify meanings, and potential issues with joint research like determining complexity of sentences. The document proposes a flowchart for how artifacts could be analyzed using tasks from either field to enable interdisciplinary research.
A Questionnaire Developed For Conducting Fieldwork On Endangered And Indigeno...Martha Brown
This document presents a questionnaire for conducting fieldwork on endangered and indigenous languages in India. The questionnaire was developed through discussions with linguists and is designed to create dictionaries and basic grammars for documented languages. It includes sections on details of language experts, language vitality, diversity and attitudes, word and sentence lists, anthropological questions, and demographic profiling. The goal is to document languages in a standardized yet flexible way while balancing academic and community needs. Picture books and videos are used to elicit unique linguistic aspects for each language.
A Comprehensive Study On Natural Language Processing And Natural Language Int...Scott Bou
The document provides a comprehensive overview of natural language processing (NLP) and natural language interfaces to databases (NLIDBs). It discusses the different levels of NLP including morphological, lexical, syntactic, semantic and pragmatic analysis. It also describes various approaches used to develop NLIDBs, including symbolic, empirical, connectionist and maximum entropy approaches. Additionally, it outlines the history of NLP and NLIDBs, covering early work in machine translation and historically developed systems like LUNAR.
May 2024: Top 10 Read Articles in Software Engineering & Applications Interna...sebastianku31
Welcome To IJSEA ...!!!
Call for papers___!
International Journal of Software Engineering & Applications(IJSEA)
ISSN:0975-3834 [Online]; 0975-4679 [Print]
ERA Indexed, H Index 31
Web Page URL : https://airccse.org/journal/ijsea/ijsea.html
Submission URL :https://airccse.com/submissioncs/home.html
Contact Us : ijseajournal@airccse.org or ijsea@aircconline.com
May 2024: Top 10 Read Articles Posted Url:https://www.academia.edu/119977684/April_2024_Top_10_Read_Articles_in_Software_Engineering_and_Applications_International_Journal_of_Software_Engineering_and_Applications_IJSEA_ERA_Indexed
May 2022: Top 10 Read Articles in Data Mining & Knowledge Management ProcessIJDKP
Data mining and knowledge discovery in databases have been attracting a significant amount of research, industry, and media attention of late. There is an urgent need for a new generation of computational theories and tools to assist researchers in extracting useful information from the rapidly growing volumes of digital data.
Continuous bag of words cbow word2vec word embedding work .pdfdevangmittal4
Continuous bag of words (cbow) word2vec word embedding work is that it tends to predict the
probability of a word given a context. A context may be a single word or a group of words. But for
simplicity, I will take a single context word and try to predict a single target word.
The purpose of this question is to be able to create a word embedding for the given data set.
data set text:
In linguistics word embeddings were discussed in the research area of distributional semantics. It
aims to quantify and categorize semantic similarities between linguistic items based on their
distributional properties in large samples of language data. The underlying idea that "a word is
characterized by the company it keeps" was popularized by Firth.
The technique of representing words as vectors has roots in the 1960s with the development of
the vector space model for information retrieval. Reducing the number of dimensions using
singular value decomposition then led to the introduction of latent semantic analysis in the late
1980s.In 2000 Bengio et al. provided in a series of papers the "Neural probabilistic language
models" to reduce the high dimensionality of words representations in contexts by "learning a
distributed representation for words". (Bengio et al, 2003). Word embeddings come in two different
styles, one in which words are expressed as vectors of co-occurring words, and another in which
words are expressed as vectors of linguistic contexts in which the words occur; these different
styles are studied in (Lavelli et al, 2004). Roweis and Saul published in Science how to use
"locally linear embedding" (LLE) to discover representations of high dimensional data structures.
The area developed gradually and really took off after 2010, partly because important advances
had been made since then on the quality of vectors and the training speed of the model.
There are many branches and many research groups working on word embeddings. In 2013, a
team at Google led by Tomas Mikolov created word2vec, a word embedding toolkit which can train
vector space models faster than the previous approaches. Most new word embedding techniques
rely on a neural network architecture instead of more traditional n-gram models and unsupervised
learning.
Limitations
One of the main limitations of word embeddings (word vector space models in general) is that
possible meanings of a word are conflated into a single representation (a single vector in the
semantic space). Sense embeddings are a solution to this problem: individual meanings of words
are represented as distinct vectors in the space.
For biological sequences: BioVectors
Word embeddings for n-grams in biological sequences (e.g. DNA, RNA, and Proteins) for
bioinformatics applications have been proposed by Asgari and Mofrad. Named bio-vectors
(BioVec) to refer to biological sequences in general with protein-vectors (ProtVec) for proteins
(amino-acid sequences) and gene-vectors (GeneVec) for gene sequences, this representa.
Top 2 cited papers in 2017 - International Journal of Artificial Intelligenc...gerogepatton
The International Journal of Artificial Intelligence & Applications (IJAIA) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the Artificial Intelligence & Applications (IJAIA). It is an international journal intended for professionals and researchers in all fields of AI for researchers, programmers, and software and hardware manufacturers. The journal also aims to publish new attempts in the form of special issues on emerging areas in Artificial Intelligence and applications.
The AIRCC's International Journal of Computer Science and Information Technology (IJCSIT) is devoted to fields of Computer Science and Information Systems. The IJCSIT is a peer-reviewed scientific journal published in electronic form as well as print form. The mission of this journal is to publish original contributions in its field in order to propagate knowledge amongst its readers and to be a reference publication.
Munhyong Kim is a PhD candidate in Linguistics at Seoul National University, specializing in Natural Language Processing and Computational Linguistics under advisor Shin Hyopil. He has authored and co-authored several publications in journals and conferences. His research experiences include projects on sentiment analysis, temporal expression analysis, and machine translation between structurally different languages. He has also held teaching assistant positions at SNU.
Thair Khdour is an Associate Professor of Computer Science from Jordan. He received his Ph.D from the University of Essex in 2008. His research interests include artificial intelligence, information retrieval, machine learning, information security, and other topics. He has over 10 years of experience teaching at universities in Jordan and has published several journal articles.
Harish Tureha is seeking a position that allows him to apply his engineering skills to real-world problems. He has a M.Tech in Information Technology from College of Technology, GB Pant University with 7.87/10 CGPA. His technical skills include programming languages C and Java and simulation software NS2. He has work experience teaching at College of Technology, GB Pant University for six months and completed professional training at Kortek Electronics and HCL Infosystems. His personal strengths are a positive attitude and adaptability. He was captain of his university basketball team and represented his university in several tournaments.
This document discusses offline handwritten Devanagari script recognition using a probabilistic neural network. It begins with an abstract that outlines the goal of recognizing offline handwritten Devanagari numerals using structural and local features classified with a probabilistic neural network classifier. The introduction provides background on handwritten numeral recognition challenges. The document then reviews related work on character recognition from the early 1900s to modern advancements, describes the Devanagari script, discusses theoretical neural network and proposed recognition methods, and concludes that accurate recognition depends on the input quality and more efficient, accurate systems are needed to recognize varied writing styles.
The International Journal of Network Security & Its Applications (IJNSA) -- ...IJNSA Journal
The International Journal of Network Security & Its Applications (IJNSA) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the computer Network Security & its applications. The journal focuses on all technical and practical aspects of security and its applications for wired and wireless networks. The goal of this journal is to bring together researchers and practitioners from academia and industry to focus on understanding Modern security threats and countermeasures, and establishing new collaborations in these areas.
This document provides a summary of Satanjeev Banerjee's education, work experience, areas of research interest, publications, software projects, and references. It details his PhD studies in language technologies at Carnegie Mellon University, as well as his master's degrees from CMU and the University of Minnesota Duluth. His work experience includes research assistantships at CMU working on topics like speech summarization and meeting understanding. He has numerous publications in these areas and has developed software like the SmartNotes meeting recording system and the METEOR machine translation evaluation metric.
September 2022: Top 10 Read Articles in Natural Language Computingkevig
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
International Journal of Engineering and Science Invention (IJESI) inventionjournals
International Journal of Engineering and Science Invention (IJESI) is an international journal intended for professionals and researchers in all fields of computer science and electronics. IJESI publishes research articles and reviews within the whole field Engineering Science and Technology, new teaching methods, assessment, validation and the impact of new technologies and it will continue to provide information on the latest trends and developments in this ever-expanding subject. The publications of papers are selected through double peer reviewed to ensure originality, relevance, and readability. The articles published in our journal can be accessed online
A comparative analysis of particle swarm optimization and k means algorithm f...ijnlc
The volume of digitized text documents on the web have been increasing rapidly. As there is huge collection
of data on the web there is a need for grouping(clustering) the documents into clusters for speedy
information retrieval. Clustering of documents is collection of documents into groups such that the
documents within each group are similar to each other and not to documents of other groups. Quality of
clustering result depends greatly on the representation of text and the clustering algorithm. This paper
presents a comparative analysis of three algorithms namely K-means, Particle swarm Optimization (PSO)
and hybrid PSO+K-means algorithm for clustering of text documents using WordNet. The common way of
representing a text document is bag of terms. The bag of terms representation is often unsatisfactory as it
does not exploit the semantics. In this paper, texts are represented in terms of synsets corresponding to a
word. Bag of terms data representation of text is thus enriched with synonyms from WordNet. K-means,
Particle Swarm Optimization (PSO) and hybrid PSO+K-means algorithms are applied for clustering of
text in Nepali language. Experimental evaluation is performed by using intra cluster similarity and inter
cluster similarity.
New research articles 2020 october issue international journal of multimedi...ijma
The International Journal of Multimedia & Its Applications (IJMA) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the Multimedia & its applications. The journal focuses on all technical and practical aspects of Multimedia and its applications. The goal of this journal is to bring together researchers and practitioners from academia and industry to focus on understanding recent developments this arena, and establishing new collaborations in these areas.
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
New Research Articles 2020 May Issue International Journal of Software Engin...ijseajournal
This document proposes an agent-based approach to systematically specify auditability requirements during goal-oriented requirements engineering. It presents a case study applying this approach to the design of a system called LawDisTrA that distributes lawsuits among judges in a transparent manner. The approach uses an interdependency graph to capture different facets of transparency and their operationalization. An evaluation of a implemented LawDisTrA system that distributed over 300,000 lawsuits demonstrated the ability of the presented approach to address the cross-organizational nature of transparency through adequate auditability techniques.
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...kevig
Natural Language Processing is a programmed approach to analyze text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and build software that will analyze, understand, and generate languages that humans use naturally to address computers.
Possibility of interdisciplinary research software engineering andnatural lan...Nakul Sharma
This document discusses the possibility of interdisciplinary research between software engineering and natural language processing. It provides a literature review of research papers from 2003 to 2014 related to applying tools and techniques from one field to the other. Some key areas discussed include generating UML diagrams from natural language text, developing ontologies to clarify meanings, and potential issues with joint research like determining complexity of sentences. The document proposes a flowchart for how artifacts could be analyzed using tasks from either field to enable interdisciplinary research.
A Questionnaire Developed For Conducting Fieldwork On Endangered And Indigeno...Martha Brown
This document presents a questionnaire for conducting fieldwork on endangered and indigenous languages in India. The questionnaire was developed through discussions with linguists and is designed to create dictionaries and basic grammars for documented languages. It includes sections on details of language experts, language vitality, diversity and attitudes, word and sentence lists, anthropological questions, and demographic profiling. The goal is to document languages in a standardized yet flexible way while balancing academic and community needs. Picture books and videos are used to elicit unique linguistic aspects for each language.
A Comprehensive Study On Natural Language Processing And Natural Language Int...Scott Bou
The document provides a comprehensive overview of natural language processing (NLP) and natural language interfaces to databases (NLIDBs). It discusses the different levels of NLP including morphological, lexical, syntactic, semantic and pragmatic analysis. It also describes various approaches used to develop NLIDBs, including symbolic, empirical, connectionist and maximum entropy approaches. Additionally, it outlines the history of NLP and NLIDBs, covering early work in machine translation and historically developed systems like LUNAR.
May 2024: Top 10 Read Articles in Software Engineering & Applications Interna...sebastianku31
Welcome To IJSEA ...!!!
Call for papers___!
International Journal of Software Engineering & Applications(IJSEA)
ISSN:0975-3834 [Online]; 0975-4679 [Print]
ERA Indexed, H Index 31
Web Page URL : https://airccse.org/journal/ijsea/ijsea.html
Submission URL :https://airccse.com/submissioncs/home.html
Contact Us : ijseajournal@airccse.org or ijsea@aircconline.com
May 2024: Top 10 Read Articles Posted Url:https://www.academia.edu/119977684/April_2024_Top_10_Read_Articles_in_Software_Engineering_and_Applications_International_Journal_of_Software_Engineering_and_Applications_IJSEA_ERA_Indexed
May 2022: Top 10 Read Articles in Data Mining & Knowledge Management ProcessIJDKP
Data mining and knowledge discovery in databases have been attracting a significant amount of research, industry, and media attention of late. There is an urgent need for a new generation of computational theories and tools to assist researchers in extracting useful information from the rapidly growing volumes of digital data.
Continuous bag of words cbow word2vec word embedding work .pdfdevangmittal4
Continuous bag of words (cbow) word2vec word embedding work is that it tends to predict the
probability of a word given a context. A context may be a single word or a group of words. But for
simplicity, I will take a single context word and try to predict a single target word.
The purpose of this question is to be able to create a word embedding for the given data set.
data set text:
In linguistics word embeddings were discussed in the research area of distributional semantics. It
aims to quantify and categorize semantic similarities between linguistic items based on their
distributional properties in large samples of language data. The underlying idea that "a word is
characterized by the company it keeps" was popularized by Firth.
The technique of representing words as vectors has roots in the 1960s with the development of
the vector space model for information retrieval. Reducing the number of dimensions using
singular value decomposition then led to the introduction of latent semantic analysis in the late
1980s.In 2000 Bengio et al. provided in a series of papers the "Neural probabilistic language
models" to reduce the high dimensionality of words representations in contexts by "learning a
distributed representation for words". (Bengio et al, 2003). Word embeddings come in two different
styles, one in which words are expressed as vectors of co-occurring words, and another in which
words are expressed as vectors of linguistic contexts in which the words occur; these different
styles are studied in (Lavelli et al, 2004). Roweis and Saul published in Science how to use
"locally linear embedding" (LLE) to discover representations of high dimensional data structures.
The area developed gradually and really took off after 2010, partly because important advances
had been made since then on the quality of vectors and the training speed of the model.
There are many branches and many research groups working on word embeddings. In 2013, a
team at Google led by Tomas Mikolov created word2vec, a word embedding toolkit which can train
vector space models faster than the previous approaches. Most new word embedding techniques
rely on a neural network architecture instead of more traditional n-gram models and unsupervised
learning.
Limitations
One of the main limitations of word embeddings (word vector space models in general) is that
possible meanings of a word are conflated into a single representation (a single vector in the
semantic space). Sense embeddings are a solution to this problem: individual meanings of words
are represented as distinct vectors in the space.
For biological sequences: BioVectors
Word embeddings for n-grams in biological sequences (e.g. DNA, RNA, and Proteins) for
bioinformatics applications have been proposed by Asgari and Mofrad. Named bio-vectors
(BioVec) to refer to biological sequences in general with protein-vectors (ProtVec) for proteins
(amino-acid sequences) and gene-vectors (GeneVec) for gene sequences, this representa.
Top 2 cited papers in 2017 - International Journal of Artificial Intelligenc...gerogepatton
The International Journal of Artificial Intelligence & Applications (IJAIA) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the Artificial Intelligence & Applications (IJAIA). It is an international journal intended for professionals and researchers in all fields of AI for researchers, programmers, and software and hardware manufacturers. The journal also aims to publish new attempts in the form of special issues on emerging areas in Artificial Intelligence and applications.
The AIRCC's International Journal of Computer Science and Information Technology (IJCSIT) is devoted to fields of Computer Science and Information Systems. The IJCSIT is a peer-reviewed scientific journal published in electronic form as well as print form. The mission of this journal is to publish original contributions in its field in order to propagate knowledge amongst its readers and to be a reference publication.
Automatic classification of bengali sentences based on sense definitions pres...ijctcm
Based on the sense definition of words available in the Bengali WordNet, an attempt is made to classify the
Bengali sentences automatically into different groups in accordance with their underlying senses. The input
sentences are collected from 50 different categories of the Bengali text corpus developed in the TDIL
project of the Govt. of India, while information about the different senses of particular ambiguous lexical
item is collected from Bengali WordNet. In an experimental basis we have used Naive Bayes probabilistic
model as a useful classifier of sentences. We have applied the algorithm over 1747 sentences that contain a
particular Bengali lexical item which, because of its ambiguous nature, is able to trigger different senses
that render sentences in different meanings. In our experiment we have achieved around 84% accurate
result on the sense classification over the total input sentences. We have analyzed those residual sentences
that did not comply with our experiment and did affect the results to note that in many cases, wrong
syntactic structures and less semantic information are the main hurdles in semantic classification of
sentences. The applicational relevance of this study is attested in automatic text classification, machine
learning, information extraction, and word sense disambiguation
Hints of the outbreak are detected through the modified circumstances favoring the outbreaks, like the warm weather contributing to epidermal outbreaks or the loss of sanitation leading to cholera outbreaks typically relying on the routine reports from the healthcare facilities, secondary data like attendance monitoring at workplaces and schools, the web, and the media play a significant informational source with more than 60% of the initial outbreak reporting to the informal sources. Through the application of natural language processing methods and machine learning technologies, a pipeline is developed which extracts the critical entities like country, confirmed case counts, disease, and case dates, which are mandatory entities from the epidemiological article and are saved in the database thereby facilitating the data entry easier. The advantages are the facilitation of relevant score articles shown first, thereby providing the web service results termed EventEpi integrated into the Event Based Surveillance (EBS) workflows.
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUEJournal For Research
Natural Language Processing (NLP) techniques are one of the most used techniques in the field of computer applications. It has become one of the vast and advanced techniques. Language is the means of communication or interaction among humans and in present scenario when everything is dependent on machine or everything is computerized, communication between computer and human has become a necessity. To fulfill this necessity NLP has been emerged as the means of interaction which narrows the gap between machines (computers) and humans. It was evolved from the study of linguistics which was passed through the Turing test to check the similarity between data but it was limited to small set of data. Later on various algorithms were developed along with the concept of AI (Artificial Intelligence) for the successful execution of NLP. In this paper, the main emphasis is on the different techniques of NLP which have been developed till now, their applications and the comparison of all those techniques on different parameters.
Effect of word embedding vector dimensionality on sentiment analysis through ...IAESIJAI
Word embedding has become the most popular method of lexical description
in a given context in the natural language processing domain, especially
through the word to vector (Word2Vec) and global vectors (GloVe)
implementations. Since GloVe is a pre-trained model that provides access to
word mapping vectors on many dimensionalities, a large number of
applications rely on its prowess, especially in the field of sentiment analysis.
However, in the literature, we found that in many cases, GloVe is
implemented with arbitrary dimensionalities (often 300d) regardless of the
length of the text to be analyzed. In this work, we conducted a study that
identifies the effect of the dimensionality of word embedding mapping
vectors on short and long texts in a sentiment analysis context. The results
suggest that as the dimensionality of the vectors increases, the performance
metrics of the model also increase for long texts. In contrast, for short texts,
we recorded a threshold at which dimensionality does not matter.
Similar to April 2020 most read artilce in contro theory & computer controlling (20)
Executive Directors Chat Leveraging AI for Diversity, Equity, and InclusionTechSoup
Let’s explore the intersection of technology and equity in the final session of our DEI series. Discover how AI tools, like ChatGPT, can be used to support and enhance your nonprofit's DEI initiatives. Participants will gain insights into practical AI applications and get tips for leveraging technology to advance their DEI goals.
This presentation was provided by Steph Pollock of The American Psychological Association’s Journals Program, and Damita Snow, of The American Society of Civil Engineers (ASCE), for the initial session of NISO's 2024 Training Series "DEIA in the Scholarly Landscape." Session One: 'Setting Expectations: a DEIA Primer,' was held June 6, 2024.
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...PECB
Denis is a dynamic and results-driven Chief Information Officer (CIO) with a distinguished career spanning information systems analysis and technical project management. With a proven track record of spearheading the design and delivery of cutting-edge Information Management solutions, he has consistently elevated business operations, streamlined reporting functions, and maximized process efficiency.
Certified as an ISO/IEC 27001: Information Security Management Systems (ISMS) Lead Implementer, Data Protection Officer, and Cyber Risks Analyst, Denis brings a heightened focus on data security, privacy, and cyber resilience to every endeavor.
His expertise extends across a diverse spectrum of reporting, database, and web development applications, underpinned by an exceptional grasp of data storage and virtualization technologies. His proficiency in application testing, database administration, and data cleansing ensures seamless execution of complex projects.
What sets Denis apart is his comprehensive understanding of Business and Systems Analysis technologies, honed through involvement in all phases of the Software Development Lifecycle (SDLC). From meticulous requirements gathering to precise analysis, innovative design, rigorous development, thorough testing, and successful implementation, he has consistently delivered exceptional results.
Throughout his career, he has taken on multifaceted roles, from leading technical project management teams to owning solutions that drive operational excellence. His conscientious and proactive approach is unwavering, whether he is working independently or collaboratively within a team. His ability to connect with colleagues on a personal level underscores his commitment to fostering a harmonious and productive workplace environment.
Date: May 29, 2024
Tags: Information Security, ISO/IEC 27001, ISO/IEC 42001, Artificial Intelligence, GDPR
-------------------------------------------------------------------------------
Find out more about ISO training and certification services
Training: ISO/IEC 27001 Information Security Management System - EN | PECB
ISO/IEC 42001 Artificial Intelligence Management System - EN | PECB
General Data Protection Regulation (GDPR) - Training Courses - EN | PECB
Webinars: https://pecb.com/webinars
Article: https://pecb.com/article
-------------------------------------------------------------------------------
For more information about PECB:
Website: https://pecb.com/
LinkedIn: https://www.linkedin.com/company/pecb/
Facebook: https://www.facebook.com/PECBInternational/
Slideshare: http://www.slideshare.net/PECBCERTIFICATION
The simplified electron and muon model, Oscillating Spacetime: The Foundation...RitikBhardwaj56
Discover the Simplified Electron and Muon Model: A New Wave-Based Approach to Understanding Particles delves into a groundbreaking theory that presents electrons and muons as rotating soliton waves within oscillating spacetime. Geared towards students, researchers, and science buffs, this book breaks down complex ideas into simple explanations. It covers topics such as electron waves, temporal dynamics, and the implications of this model on particle physics. With clear illustrations and easy-to-follow explanations, readers will gain a new outlook on the universe's fundamental nature.
Strategies for Effective Upskilling is a presentation by Chinwendu Peace in a Your Skill Boost Masterclass organisation by the Excellence Foundation for South Sudan on 08th and 09th June 2024 from 1 PM to 3 PM on each day.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত ..
আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ...
তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...।
বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...
How to Add Chatter in the odoo 17 ERP ModuleCeline George
In Odoo, the chatter is like a chat tool that helps you work together on records. You can leave notes and track things, making it easier to talk with your team and partners. Inside chatter, all communication history, activity, and changes will be displayed.
How to Build a Module in Odoo 17 Using the Scaffold MethodCeline George
Odoo provides an option for creating a module by using a single line command. By using this command the user can make a whole structure of a module. It is very easy for a beginner to make a module. There is no need to make each file manually. This slide will show how to create a module using the scaffold method.
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
April 2020 most read artilce in contro theory & computer controlling
1. “April 2020: Top Read
Articles in Control Theory
and Computer Modelling”
International Journal of Control Theory and
Computer Modelling (IJCTCM)
ISSN : 2249-1155 [Online]; 2319 - 4138 [print].
http://airccse.org/journal/ijctcm/ijctcm.html
2. AUTOMATIC CLASSIFICATION OF BENGALI
SENTENCES BASED ON SENSE DEFINITIONS
PRESENT IN BENGALI WORDNET
Alok Ranjan Pal1
, Diganta Saha2
and Niladri Sekhar Dash3
1
Dept. of Computer Science and Eng., College of Engineering and Management,
Kolaghat
2
Dept. of Computer Science and Eng., Jadavpur University, Kolkata
3
Linguistic Research Unit, Indian Statistical Institute, Kolkata
ABSTRACT
Based on the sense definition of words available in the Bengali WordNet, an attempt is made to
classify the Bengali sentences automatically into different groups in accordance with their
underlying senses. The input sentences are collected from 50 different categories of the Bengali text
corpus developed in the TDIL project of the Govt. of India, while information about the different
senses of particular ambiguous lexical item is collected from Bengali WordNet. In an experimental
basis we have used Naive Bayes probabilistic model as a useful classifier of sentences. We have
applied the algorithm over 1747 sentences that contain a particular Bengali lexical item which,
because of its ambiguous nature, is able to trigger different senses that render sentences in different
meanings. In our experiment we have achieved around 84% accurate result on the sense
classification over the total input sentences. We have analyzed those residual sentences that did not
comply with our experiment and did affect the results to note that in many cases, wrong syntactic
structures and less semantic information are the main hurdles in semantic classification of sentences.
The applicational relevance of this study is attested in automatic text classification, machine
learning, information extraction, and word sense disambiguation
KEYWORDS
Natural Language Processing, Bengali Word Sense Disambiguation, Bengali WordNet, Naïve Bayes
Classification.
For More Details: http://airccse.org/journal/ijctcm/papers/5115ijctcm01.pdf
Volume Link: http://airccse.org/journal/ijctcm/vol5.html
3. REFERENCES
[1] Ide, N., Véronis, J., (1998) “Word Sense Disambiguation: The State of the Art”, Computational
Linguistics, Vol. 24, No. 1, Pp. 1-40.
[2] Cucerzan, R.S., C. Schafer, and D. Yarowsky, (2002) “Combining classifiers for word sense
disambiguation”, Natural Language Engineering, Vol. 8, No. 4, Cambridge University Press,
Pp. 327- 341.
[3] Nameh, M., S., Fakhrahmad, M., Jahromi, M.Z., (2011) “A New Approach to Word Sense
Disambiguation Based on Context Similarity”, Proceedings of the World Congress on
Engineering, Vol. I.
[4] Xiaojie, W., Matsumoto, Y., (2003) “Chinese word sense disambiguation by combining pseudo
training data”, Proceedings of The International Conference on Natural Language Processing
and Knowledge Engineering, Pp. 138-143.
[5] Navigli, R. (2009) “Word Sense Disambiguation: a Survey”, ACM Computing Surveys, Vol. 41,
No.2, ACM Press, Pp. 1-69.
[6] Gaizauskas, R., (1997) “Gold Standard Datasets for Evaluating Word Sense Disambiguation
Programs”, Computer Speech and Language, Vol. 12, No. 3, Special Issue on Evaluation of
Speech and Language Technology, Pp. 453-472.
[7] Seo, H., Chung, H., Rim, H., Myaeng, S. H., Kim, S., (2004) “Unsupervised word sense
disambiguation using WordNet relatives”, Computer Speech and Language, Vol. 18, No. 3, Pp.
253- 273.
[8] G. Miller, (1991) “WordNet: An on-line lexical database”, International Journal of Lexicography,
Vol.3, No. 4.
[9] Kolte, S.G., Bhirud, S.G., (2008) “Word Sense Disambiguation Using WordNet Domains”, First
International Conference on Digital Object Identifier, Pp. 1187-1191.
[10] Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007) “Using WordNet to Disambiguate Word
Senses for Text Classification”, Proceedings of the 7th International Conference on
Computational Science, Springer-Verlag, Pp. 781 - 789.
[11] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J., (1990) “WordNet An on-line
Lexical Database”, International Journal of Lexicography, 3(4): 235-244.
[12] Miller, G.A., (1993) “WordNet: A Lexical Database”, Comm. ACM, Vol. 38, No. 11, Pp. 39-
41.
[13] Cañas, A.J., A. Valerio, J. Lalinde-Pulido, M. Carvalho, and M. Arguedas, (2003) “Using
WordNet for Word Sense Disambiguation to Support Concept Map Construction”, String
Processing and Information Retrieval, Pp. 350-359.
[14] http://en.wikipedia.org/wiki/Naive_bayes
4. [15] Dash, N.S., (2007) Indian scenario in language corpus generation. In, Dash, Ni.S., P. Dasgupta
and P. Sarkar (Eds.) Rainbow of Linguistics: Vol. I. Kolkata: T. Media Publication. Pp. 129-
162.
[16] Dash, N.S., (1999) “Corpus oriented Bangla language processing”, Jadavpur Journal of
Philosophy. 11(1): 1-28.
[17] Dash, N.S., (2000) “Bangla pronouns-a corpus based study”, Literary and Linguistic
Computing. 15(4): 433-444.
[18] Dash, N.S., (2004) Language Corpora: Present Indian Need, Indian Statistical Institute, Kolkata,
http://www. elda. org/en/proj/scalla/SCALLA2004/dash. pdf.
[19] Dash, N.S. (2005) Methods in Madness of Bengali Spelling: A Corpus-based Investigation,
South Asian Language Rewiew, Vol. XV, No. 2.
[20] Dash, N.S., (2012), From KCIE to LDC-IL: Some Milestones in NLP Journey in Indian
Multilingual Panorama. Indian Linguistics. 73(1-4): 129-146.
[21] Dash, N.S. and B.B. Chaudhuri, (2001) “A corpus based study of the Bangla language”, Indian
Journal of Linguistics. 20: 19-40.
[22] Dash, N.S. and B.B. Chaudhuri, (2001) “Corpus-based empirical analysis of form, function and
frequency of characters used in Bangla”, Published in Rayson, P. , Wilson, A. , McEnery, T. ,
Hardie, A. , and Khoja, S. , (eds.) Special issue of the Proceedings of the Corpus Linguistics
2001 Conference, Lancaster: Lancaster University Press. UK. 13: 144-157. 2001.
[23] Dash, N.S. and B.B. Chaudhuri., (2002) Corpus generation and text processing, International
Journal of Dravidian Linguistics. 31(1): 25-44.
[24] Dash, N.S. and B.B. Chaudhuri, (2002) “Using Text Corpora for Understanding Polysemy in
Bangla”, Proceedings of the Language Engineering Conference (LEC'02) IEEE.
[25] Dolamic, L. and J. Savoy, (2010) “Comparative Study of Indexing and Search Strategies for the
Hindi, Marathi and Bengali Languages”, ACM Transactions on Asian Language Information
Processing, 9(3): 1-24.
[26] Jurafsky, D. and J.H. Martin, (2000) Speech and Language Processing, ISBN 81-7808-594-1,
Pearson Education Asia, page no: 604.
[27] http://www.isical.ac.in/~lru/wordnetnew/
5. WORD SENSE DISAMBIGUATION: A SURVEY
Alok Ranjan Pal1
and Diganta Saha2
1
Dept. of Computer Science and Engg., College of Engg. and Mgmt, Kolaghat
2
Dept. of Computer Science and Engg., Jadavpur University, Kolkata
ABSTRACT
In this paper, we made a survey on Word Sense Disambiguation (WSD). Near about in all major
languages around the world, research in WSD has been conducted upto different extents. In this
paper, we have gone through a survey regarding the different approaches adopted in different
research works, the State of the Art in the performance in this domain, recent works in different
Indian languages and finally a survey in Bengali language. We have made a survey on different
competitions in this field and the bench mark results, obtained from those competitions.
Keywords:
Natural Languages Processing, Word Sense Disambiguation
For More Details: http://airccse.org/journal/ijctcm/papers/5315ijctcm01.pdf
Volume Link: http://airccse.org/journal/ijctcm/vol5.html
6. REFERENCES
[1] Ide, N., Véronis, J., (1998) “Word Sense Disambiguation: The State of the
Art”,Computational Linguistics, Vol. 24, No. 1, Pp. 1-40.
[2] Cucerzan, R.S., C. Schafer, and D. Yarowsky, (2002) “Combining classifiers for word sense
disambiguation”, Natural Language Engineering, Vol. 8, No. 4, Cambridge University
Press,Pp. 327- 341.
[3] Nameh, M. S., Fakhrahmad, M., Jahromi, M.Z., (2011) “A New Approach to Word Sense
Disambiguation Based on Context Similarity”, Proceedings of the World Congress on
Engineering,Vol. I.
[4] Xiaojie, W., Matsumoto, Y., (2003) “Chinese word sense disambiguation by combining
pseudo training data”,Proceedings of The International Conference on Natural Language
Processing and Knowledge Engineering, Pp. 138-143.
[5] Navigli, R. (2009) “Word Sense Disambiguation: a Survey”, ACM Computing Surveys, Vol.
41,No.2, ACM Press, Pp. 1-69.
[6] Word Sense Disambiguation; Algorithms and Applications, Edited by Eneko Agirre and
Philip Edmonds, Springer, VOLUME 33.
[7] Seo, H., Chung, H., Rim, H., Myaeng, S. H., Kim, S., (2004) “Unsupervised word sense
disambiguation using WordNet relatives”, Computer Speech and Language, Vol. 18, No. 3,
Pp. 253-273.
[8] Miller, G., (1991) “WordNet: An on-line lexical database”, International Journal of
Lexicography,Vol.3,No. 4.
[9] Kolte, S.G., Bhirud, S.G., (2008) “Word Sense Disambiguation Using WordNet Domains”,
First International Conference on Digital Object Identifier, Pp. 1187-1191.
[10] Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007) “Using WordNet to Disambiguate Word
Senses for Text Classification”, Proceedings of the 7th International Conference on
Computational Science, Springer-Verlag, Pp. 781 - 789.
[11] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J., (1990) “WordNet An on-
line Lexical Database”, International Journal of Lexicography, 3(4): 235-244.
[12] Miller, G.A., (1993) “WordNet: A Lexical Database”, Comm. ACM, Vol. 38, No. 11, Pp. 39-
41.
[13] Cañas, A.J., A. Valerio, J. Lalinde-Pulido, M. Carvalho, and M. Arguedas, (2003) “Using
WordNet for Word Sense Disambiguation to Support Concept Map Construction”, String
Processing and Information Retrieval, Pp. 350-359.
7. [14] Marine, C., Dekai, W.U.,(2005) “Word Sense Disambiguation vs. Statistical Machine
Translation”, Proceedings of the 43rd Annual Meeting of the ACL , Ann Arbor, June 2005,
pages 387–394.
[15] http://www.ling.gu.se/~sl/Undervisning/StatMet11/wsd-mt.pdf date: 14/05/2015
[16] http://nlp.cs.nyu.edu/sk-symposium/note/P-28.pdf date: 14/05/2015
[17] Yee, S. C., Hwee, T. N., David, C., (2007) “Word Sense Disambiguation Improves Statistical
Machine Translation”, Proceedings of the 45th Annual Meeting of the Association of
Computational Linguistics, pages 33–40, Prague, Czech Republic, June 2007.
[18] Sanderson, M.,(1994) “Word Sense Disambiguation and Information Retrieval”, Proceedings
of the 17th Annual International ACM SIGIR conference on Research and Development in
Information Retrieval, SIGIR’94, July 03-06, Dublin, Ireland, Springer, New York, pp 142-
151.
[19] Christopher, S., Michael, P. O., John, T.,(2003) “Word Sense Disambiguation in Information
Retrieval Revisited”, SIGIR’03, July 28–August 1, 2003, Toronto, Canada.
[20] http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.65.6828&rep=rep1&type=pdf
date:4/05/2015
[21] http://www.aclweb.org/anthology/P12-1029date: 14/05/2015
[22] https://www.comp.nus.edu.sg/~nght/pubs/esair11.pdf date: 14/05/2015
[23] http://cui.unige.ch/isi/reports/2008/CLEF2008-LNCS.pdf date: 14/05/2015
[24] Banerjee, S., Pedersen, T.,(2002) "An adapted Lesk algorithm for word sense disambiguation
using WordNet", In Proceedings of the Third International Conference on Intelligent Text
Processing and Computational Linguistics, Mexico City, February.
[25] Lesk, M.,(1986) "Automatic Sense Disambiguation Using Machine Readable Dictionaries:
How to Tell a Pine Cone from an Ice Cream Cone", Proceedings of SIGDOC.
[26] http://www.dlsi.ua.es/projectes/srim/publicaciones/CICling-2002.pdf date: 14/05/2015
[27] Mittal, K. and Jain, A.,(2015)“WORD SENSE DISAMBIGUATION METHOD USING
SEMANTIC SIMILARITY MEASURES AND OWA OPERATOR”, ICTACT JOURNAL
ON SOFT COMPUTING: SPECIAL ISSUE ON SOFT –COMPUTING THEORY,
APPLICATION AND IMPLICATIONS IN ENGINEERING AND TECHNOLOGY,
JANUARY, 2015, VOLUME: 05, ISSUE: 02.
[28] http://www.d.umn.edu/~tpederse/Pubs/cicling2003-3.pdf%date: 14/05/2015
[29] http://www.aclweb.org/anthology/U04-1021date: 14/05/2015
[30] http://www.aclweb.org/anthology/C10-2142date: 14/05/2015
8. [31] Diana, M.C., Carroll, J., “Disambiguating Nouns, Verbs, and Adjectives Using Automatically
Acquired Selectional Preferences”, Computational Linguistics, Volume 29, Number 4, pp.
639-654.
[32] Patrick, Y. and Timothy, B.,(2006) “Verb Sense Disambiguation Using Selectional
Preferences Extracted with a State-of-the-art Semantic Role Labeler”, Proceedings of the
2006 Australasian Language Technology Workshop (ALTW2006), pages 139–148.
[33] http://link.springer.com/article/10.1023/A%3A1002674829964#page-1 date: 14/05/2015