This document summarizes a research paper that proposes a lexicon-based method called NBLex to predict emotions in Kannada-English code-switch text. The paper describes creating a Kannada-English code-switch corpus by collecting comments from YouTube and annotating them. A Naive Bayes classifier is used to build the NBLex lexicon and predict emotions. Evaluation shows the NBLex method achieves 87.9% accuracy for emotion prediction, outperforming Naive Bayes and BiLSTM baselines. The paper concludes that a simple additive lexicon approach can be an alternative to machine learning for emotion prediction in code-switch text.
Analysis of Opinionated Text for Opinion Miningmlaij
In sentiment analysis, the polarities of the opinions expressed on an object/feature are determined to assess the sentiment of a sentence or document whether it is positive/negative/neutral. Naturally, the object/feature is a noun representation which refers to a product or a component of a product, let’s say, the "lens" in a camera and opinions emanating on it are captured in adjectives, verbs, adverbs and noun words themselves. Apart from such words, other meta-information and diverse effective features are also going to play an important role in influencing the sentiment polarity and contribute significantly to the performance of the system. In this paper, some of the associated information/meta-data are explored and investigated in the sentiment text. Based on the analysis results presented here, there is scope for further assessment and utilization of the meta-information as features in text categorization, ranking text document, identification of spam documents and polarity classification problems.
Sentence level sentiment polarity calculation for customer reviews by conside...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Sentimental analysis is a context based mining of text, which extracts and identify subjective information from a text or sentence provided. Here the main concept is extracting the sentiment of the text using machine learning techniques such as LSTM Long short term memory . This text classification method analyses the incoming text and determines whether the underlined emotion is positive or negative along with probability associated with that positive or negative statements. Probability depicts the strength of a positive or negative statement, if the probability is close to zero, it implies that the sentiment is strongly negative and if probability is close to1, it means that the statement is strongly positive. Here a web application is created to deploy this model using a Python based micro framework called flask. Many other methods, such as RNN and CNN, are inefficient when compared to LSTM. Dirash A R | Dr. S K Manju Bargavi "LSTM Based Sentiment Analysis" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd42345.pdf Paper URL: https://www.ijtsrd.comcomputer-science/data-processing/42345/lstm-based-sentiment-analysis/dirash-a-r
Evaluating sentiment analysis and word embedding techniques on BrexitIAESIJAI
In this study, we investigate the effectiveness of pre-trained word embeddings for sentiment analysis on a real-world topic, namely Brexit. We compare the performance of several popular word embedding models such global vectors for word representation (GloVe), FastText, word to vec (word2vec), and embeddings from language models (ELMo) on a dataset of tweets related to Brexit and evaluate their ability to classify the sentiment of the tweets as positive, negative, or neutral. We find that pre-trained word embeddings provide useful features for sentiment analysis and can significantly improve the performance of machine learning models. We also discuss the challenges and limitations of applying these models to complex, real-world texts such as those related to Brexit.
Analysis of Opinionated Text for Opinion Miningmlaij
In sentiment analysis, the polarities of the opinions expressed on an object/feature are determined to assess the sentiment of a sentence or document whether it is positive/negative/neutral. Naturally, the object/feature is a noun representation which refers to a product or a component of a product, let’s say, the "lens" in a camera and opinions emanating on it are captured in adjectives, verbs, adverbs and noun words themselves. Apart from such words, other meta-information and diverse effective features are also going to play an important role in influencing the sentiment polarity and contribute significantly to the performance of the system. In this paper, some of the associated information/meta-data are explored and investigated in the sentiment text. Based on the analysis results presented here, there is scope for further assessment and utilization of the meta-information as features in text categorization, ranking text document, identification of spam documents and polarity classification problems.
Sentence level sentiment polarity calculation for customer reviews by conside...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Sentimental analysis is a context based mining of text, which extracts and identify subjective information from a text or sentence provided. Here the main concept is extracting the sentiment of the text using machine learning techniques such as LSTM Long short term memory . This text classification method analyses the incoming text and determines whether the underlined emotion is positive or negative along with probability associated with that positive or negative statements. Probability depicts the strength of a positive or negative statement, if the probability is close to zero, it implies that the sentiment is strongly negative and if probability is close to1, it means that the statement is strongly positive. Here a web application is created to deploy this model using a Python based micro framework called flask. Many other methods, such as RNN and CNN, are inefficient when compared to LSTM. Dirash A R | Dr. S K Manju Bargavi "LSTM Based Sentiment Analysis" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd42345.pdf Paper URL: https://www.ijtsrd.comcomputer-science/data-processing/42345/lstm-based-sentiment-analysis/dirash-a-r
Evaluating sentiment analysis and word embedding techniques on BrexitIAESIJAI
In this study, we investigate the effectiveness of pre-trained word embeddings for sentiment analysis on a real-world topic, namely Brexit. We compare the performance of several popular word embedding models such global vectors for word representation (GloVe), FastText, word to vec (word2vec), and embeddings from language models (ELMo) on a dataset of tweets related to Brexit and evaluate their ability to classify the sentiment of the tweets as positive, negative, or neutral. We find that pre-trained word embeddings provide useful features for sentiment analysis and can significantly improve the performance of machine learning models. We also discuss the challenges and limitations of applying these models to complex, real-world texts such as those related to Brexit.
Automatic classification of bengali sentences based on sense definitions pres...ijctcm
Based on the sense definition of words available in the Bengali WordNet, an attempt is made to classify the
Bengali sentences automatically into different groups in accordance with their underlying senses. The input
sentences are collected from 50 different categories of the Bengali text corpus developed in the TDIL
project of the Govt. of India, while information about the different senses of particular ambiguous lexical
item is collected from Bengali WordNet. In an experimental basis we have used Naive Bayes probabilistic
model as a useful classifier of sentences. We have applied the algorithm over 1747 sentences that contain a
particular Bengali lexical item which, because of its ambiguous nature, is able to trigger different senses
that render sentences in different meanings. In our experiment we have achieved around 84% accurate
result on the sense classification over the total input sentences. We have analyzed those residual sentences
that did not comply with our experiment and did affect the results to note that in many cases, wrong
syntactic structures and less semantic information are the main hurdles in semantic classification of
sentences. The applicational relevance of this study is attested in automatic text classification, machine
learning, information extraction, and word sense disambiguation
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKijnlc
In recent years, there has been an increasing use of social media among people in Myanmar and writing review on social media pages about the product, movie, and trip are also popular among people. Moreover, most of the people are going to find the review pages about the product they want to buy before deciding whether they should buy it or not. Extracting and receiving useful reviews over interesting products is very important and time consuming for people. Sentiment analysis is one of the important processes for extracting useful reviews of the products. In this paper, the Convolutional LSTM neural network architecture is proposed to analyse the sentiment classification of cosmetic reviews written in Myanmar Language. The paper also intends to build the cosmetic reviews dataset for deep learning and sentiment lexicon in Myanmar Language.
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Networkkevig
In recent years, there has been an increasing use of social media among people in Myanmar and writing
review on social media pages about the product, movie, and trip are also popular among people. Moreover,
most of the people are going to find the review pages about the product they want to buy before deciding
whether they should buy it or not. Extracting and receiving useful reviews over interesting products is very
important and time consuming for people. Sentiment analysis is one of the important processes for extracting
useful reviews of the products. In this paper, the Convolutional LSTM neural network architecture is
proposed to analyse the sentiment classification of cosmetic reviews written in Myanmar Language. The
paper also intends to build the cosmetic reviews dataset for deep learning and sentiment lexicon in Myanmar
Language.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Abstract This paper represents a Semantic Analyzer for checking the semantic correctness of the given input text. We describe our system as the one which analyzes the text by comparing it with the meaning of the words given in the WordNet. The Semantic Analyzer thus developed not only detects and displays semantic errors in the text but it also corrects them. Keywords: Part of Speech (POS) Tagger, Morphological Analyzer, Syntactic Analyzer, Semantic Analyzer, Natural Language (NL)
Enhanced sentiment analysis based on improved word embeddings and XGboost IJECEIAES
Sentiment analysis is a well-known and rapidly expanding study topic in natural language processing (NLP) and text classification. This approach has evolved into a critical component of many applications, including politics, business, advertising, and marketing. Most current research focuses on obtaining sentiment features through lexical and syntactic analysis. Word embeddings explicitly express these characteristics. This article proposes a novel method, improved words vector for sentiments analysis (IWVS), using XGboost to improve the F1-score of sentiment classification. The proposed method constructed sentiment vectors by averaging the word embeddings (Sentiment2Vec). We also investigated the Polarized lexicon for classifying positive and negative sentiments. The sentiment vectors formed a feature space to which the examined sentiment text was mapped to. Those features were input into the chosen classifier (XGboost). We compared the F1-score of sentiment classification using our method via different machine learning models and sentiment datasets. We compare the quality of our proposition to that of baseline models, term frequency-inverse document frequency (TF-IDF) and Doc2vec, and the results show that IWVS performs better on the F1-measure for sentiment classification. At the same time, XGBoost with IWVS features was the best model in our evaluation.
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...ijctcm
Text Summarization is a way to produce a text, which contains the significant portion of information of the
original text(s). Different methodologies are developed till now depending upon several parameters to find
the summary as the position, format and type of the sentences in an input text, formats of different words,
frequency of a particular word in a text etc. But according to different languages and input sources, these
parameters are varied. As result the performance of the algorithm is greatly affected. The proposed
approach summarizes a text without depending upon those parameters. Here, the relevance of the
sentences within the text is derived by Simplified Lesk algorithm and WordNet, an online dictionary. This
approach is not only independent of the format of the text and position of a sentence in a text, as the
sentences are arranged at first according to their relevance before the summarization process, the
percentage of summarization can be varied according to needs. The proposed approach gives around 80%
accurate results on 50% summarization of the original text with respect to the manually summarized result,
performed on 50 different types and lengths of texts. We have achieved satisfactory results even upto 25%
summarization of the original text.
Sentiment classification aims to detect information such as opinions, explicit , implicit feelings expressed
in text. The most existing approaches are able to detect either explicit expressions or implicit expressions of
sentiments in the text separately. In this proposed framework it will detect both Implicit and Explicit
expressions available in the meeting transcripts. It will classify the Positive, Negative, Neutral words and
also identify the topic of the particular meeting transcripts by using fuzzy logic. This paper aims to add
some additional features for improving the classification method. The quality of the sentiment classification
is improved using proposed fuzzy logic framework .In this fuzzy logic it includes the features like Fuzzy
rules and Fuzzy C-means algorithm.The quality of the output is evaluated using the parameters such as
precision, recall, f-measure. Here Fuzzy C-means Clustering technique measured in terms of Purity and
Entropy. The data set was validated using 10-fold cross validation method and observed 95% confidence
interval between the accuracy values .Finally, the proposed fuzzy logic method produced more than 85 %
accurate results and error rate is very less compared to existing sentiment classification techniques.
A novel meta-embedding technique for drug reviews sentiment analysisIAESIJAI
Traditional word embedding models have been used in the feature extraction process of deep learning models for sentiment analysis. However, these models ignore the sentiment properties of words while maintaining the contextual relationships and have inadequate representation for domain-specific words. This paper proposes a method to develop a meta embedding model by exploiting domain sentiment polarity and adverse drug reaction (ADR) features to render word embedding models more suitable for medical sentiment analysis. The proposed lexicon is developed from the medical blogs corpus. The polarity scores of the existing lexicons are adjusted to assign new polarity score to each word. The neural network model utilizes sentiment lexicons and ADR in learning refined word embedding. The refined embedding obtained from the proposed approach is concatenated with original word vectors, lexicon vectors, and ADR feature to form a meta-embedding model which maintains both contextual and sentimental properties. The final meta-embedding acts as a feature extractor to assess the effectiveness of the model in drug reviews sentiment analysis. The experiments are conducted on global vectors (GloVE) and skip-gram word2vector (Word2Vec) models. The empirical results demonstrate the proposed meta-embedding model outperforms traditional word embedding in different performance measures.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
A prior case study of natural language processing on different domain IJECEIAES
In the present state of digital world, computer machine do not understand the human’s ordinary language. This is the great barrier between humans and digital systems. Hence, researchers found an advanced technology that provides information to the users from the digital machine. However, natural language processing (i.e. NLP) is a branch of AI that has significant implication on the ways that computer machine and humans can interact. NLP has become an essential technology in bridging the communication gap between humans and digital data. Thus, this study provides the necessity of the NLP in the current computing world along with different approaches and their applications. It also, highlights the key challenges in the development of new NLP model.
Generate fuzzy string-matching to build self attention on Indonesian medical-...IJECEIAES
Chatbot is a form of interactive conversation that requires quick and precise answers. The process of identifying answers to users’ questions involves string matching and handling incorrect spelling. Therefore, a system that can independently predict and correct letters is highly necessary. The approach used to address this issue is to enhance the fuzzy string-matching method by incorporating several features for self-attention. The combination of fuzzy string-matching methods employed includes Jaro Winkler distance + Levenshtein Damerau distance and Damerau Levenshtein + Rabin Carp. The reason for using this combination is their ability not only to match strings but also to correct word typing errors. This research contributes by developing a self-attention mechanism through a modified fuzzy string- matching model with enhanced word feature structures. The goal is to utilize this self-attention mechanism in constructing the Indonesian medical bidirectional encoder representations from transformers (IM-BERT). This will serve as a foundation for additional features to provide accurate answers in the Indonesian medical question and answer system, achieving an exact match of 85.7% and an F1-score of 87.6%.
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
Distributed language representation has become the most widely used technique for language representation in various natural language processing tasks. Most of the natural language processing models that are based on deep learning techniques use already pre-trained distributed word representations, commonly called word embeddings. Determining the most qualitative word embeddings is of crucial importance for such models. However, selecting the appropriate word embeddings is a perplexing task since the projected embedding space is not intuitive to humans. In this paper, we explore different approaches for creating distributed word representations. We perform an intrinsic evaluation of several state-of-the-art word embedding methods. Their performance on capturing word similarities is analysed with existing benchmark datasets for word pairs similarities. The research in this paper conducts a correlation analysis between ground truth word similarities and similarities obtained by different word embedding methods.
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
Distributed language representation has become the most widely used technique for language representation in various natural language processing tasks. Most of the natural language processing models that are based on deep learning techniques use already pre-trained distributed word representations, commonly called word embeddings. Determining the most qualitative word embeddings is of crucial importance for such models. However, selecting the appropriate word embeddings is a perplexing task since the projected embedding space is not intuitive to humans.In this paper, we explore different approaches for creating distributed word representations. We perform an intrinsic evaluation of several state-of-the-art word embedding methods. Their performance on capturing word similarities is analysed with existing benchmark datasets for word pairs similarities. The research in this paper conducts a correlation analysis between ground truth word similarities and similarities obtained by different word embedding methods.
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUEJournal For Research
Natural Language Processing (NLP) techniques are one of the most used techniques in the field of computer applications. It has become one of the vast and advanced techniques. Language is the means of communication or interaction among humans and in present scenario when everything is dependent on machine or everything is computerized, communication between computer and human has become a necessity. To fulfill this necessity NLP has been emerged as the means of interaction which narrows the gap between machines (computers) and humans. It was evolved from the study of linguistics which was passed through the Turing test to check the similarity between data but it was limited to small set of data. Later on various algorithms were developed along with the concept of AI (Artificial Intelligence) for the successful execution of NLP. In this paper, the main emphasis is on the different techniques of NLP which have been developed till now, their applications and the comparison of all those techniques on different parameters.
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
More Related Content
Similar to NBLex: emotion prediction in Kannada-English code-switchtext using naïve bayes lexicon approach
Automatic classification of bengali sentences based on sense definitions pres...ijctcm
Based on the sense definition of words available in the Bengali WordNet, an attempt is made to classify the
Bengali sentences automatically into different groups in accordance with their underlying senses. The input
sentences are collected from 50 different categories of the Bengali text corpus developed in the TDIL
project of the Govt. of India, while information about the different senses of particular ambiguous lexical
item is collected from Bengali WordNet. In an experimental basis we have used Naive Bayes probabilistic
model as a useful classifier of sentences. We have applied the algorithm over 1747 sentences that contain a
particular Bengali lexical item which, because of its ambiguous nature, is able to trigger different senses
that render sentences in different meanings. In our experiment we have achieved around 84% accurate
result on the sense classification over the total input sentences. We have analyzed those residual sentences
that did not comply with our experiment and did affect the results to note that in many cases, wrong
syntactic structures and less semantic information are the main hurdles in semantic classification of
sentences. The applicational relevance of this study is attested in automatic text classification, machine
learning, information extraction, and word sense disambiguation
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKijnlc
In recent years, there has been an increasing use of social media among people in Myanmar and writing review on social media pages about the product, movie, and trip are also popular among people. Moreover, most of the people are going to find the review pages about the product they want to buy before deciding whether they should buy it or not. Extracting and receiving useful reviews over interesting products is very important and time consuming for people. Sentiment analysis is one of the important processes for extracting useful reviews of the products. In this paper, the Convolutional LSTM neural network architecture is proposed to analyse the sentiment classification of cosmetic reviews written in Myanmar Language. The paper also intends to build the cosmetic reviews dataset for deep learning and sentiment lexicon in Myanmar Language.
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Networkkevig
In recent years, there has been an increasing use of social media among people in Myanmar and writing
review on social media pages about the product, movie, and trip are also popular among people. Moreover,
most of the people are going to find the review pages about the product they want to buy before deciding
whether they should buy it or not. Extracting and receiving useful reviews over interesting products is very
important and time consuming for people. Sentiment analysis is one of the important processes for extracting
useful reviews of the products. In this paper, the Convolutional LSTM neural network architecture is
proposed to analyse the sentiment classification of cosmetic reviews written in Myanmar Language. The
paper also intends to build the cosmetic reviews dataset for deep learning and sentiment lexicon in Myanmar
Language.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Abstract This paper represents a Semantic Analyzer for checking the semantic correctness of the given input text. We describe our system as the one which analyzes the text by comparing it with the meaning of the words given in the WordNet. The Semantic Analyzer thus developed not only detects and displays semantic errors in the text but it also corrects them. Keywords: Part of Speech (POS) Tagger, Morphological Analyzer, Syntactic Analyzer, Semantic Analyzer, Natural Language (NL)
Enhanced sentiment analysis based on improved word embeddings and XGboost IJECEIAES
Sentiment analysis is a well-known and rapidly expanding study topic in natural language processing (NLP) and text classification. This approach has evolved into a critical component of many applications, including politics, business, advertising, and marketing. Most current research focuses on obtaining sentiment features through lexical and syntactic analysis. Word embeddings explicitly express these characteristics. This article proposes a novel method, improved words vector for sentiments analysis (IWVS), using XGboost to improve the F1-score of sentiment classification. The proposed method constructed sentiment vectors by averaging the word embeddings (Sentiment2Vec). We also investigated the Polarized lexicon for classifying positive and negative sentiments. The sentiment vectors formed a feature space to which the examined sentiment text was mapped to. Those features were input into the chosen classifier (XGboost). We compared the F1-score of sentiment classification using our method via different machine learning models and sentiment datasets. We compare the quality of our proposition to that of baseline models, term frequency-inverse document frequency (TF-IDF) and Doc2vec, and the results show that IWVS performs better on the F1-measure for sentiment classification. At the same time, XGBoost with IWVS features was the best model in our evaluation.
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...ijctcm
Text Summarization is a way to produce a text, which contains the significant portion of information of the
original text(s). Different methodologies are developed till now depending upon several parameters to find
the summary as the position, format and type of the sentences in an input text, formats of different words,
frequency of a particular word in a text etc. But according to different languages and input sources, these
parameters are varied. As result the performance of the algorithm is greatly affected. The proposed
approach summarizes a text without depending upon those parameters. Here, the relevance of the
sentences within the text is derived by Simplified Lesk algorithm and WordNet, an online dictionary. This
approach is not only independent of the format of the text and position of a sentence in a text, as the
sentences are arranged at first according to their relevance before the summarization process, the
percentage of summarization can be varied according to needs. The proposed approach gives around 80%
accurate results on 50% summarization of the original text with respect to the manually summarized result,
performed on 50 different types and lengths of texts. We have achieved satisfactory results even upto 25%
summarization of the original text.
Sentiment classification aims to detect information such as opinions, explicit , implicit feelings expressed
in text. The most existing approaches are able to detect either explicit expressions or implicit expressions of
sentiments in the text separately. In this proposed framework it will detect both Implicit and Explicit
expressions available in the meeting transcripts. It will classify the Positive, Negative, Neutral words and
also identify the topic of the particular meeting transcripts by using fuzzy logic. This paper aims to add
some additional features for improving the classification method. The quality of the sentiment classification
is improved using proposed fuzzy logic framework .In this fuzzy logic it includes the features like Fuzzy
rules and Fuzzy C-means algorithm.The quality of the output is evaluated using the parameters such as
precision, recall, f-measure. Here Fuzzy C-means Clustering technique measured in terms of Purity and
Entropy. The data set was validated using 10-fold cross validation method and observed 95% confidence
interval between the accuracy values .Finally, the proposed fuzzy logic method produced more than 85 %
accurate results and error rate is very less compared to existing sentiment classification techniques.
A novel meta-embedding technique for drug reviews sentiment analysisIAESIJAI
Traditional word embedding models have been used in the feature extraction process of deep learning models for sentiment analysis. However, these models ignore the sentiment properties of words while maintaining the contextual relationships and have inadequate representation for domain-specific words. This paper proposes a method to develop a meta embedding model by exploiting domain sentiment polarity and adverse drug reaction (ADR) features to render word embedding models more suitable for medical sentiment analysis. The proposed lexicon is developed from the medical blogs corpus. The polarity scores of the existing lexicons are adjusted to assign new polarity score to each word. The neural network model utilizes sentiment lexicons and ADR in learning refined word embedding. The refined embedding obtained from the proposed approach is concatenated with original word vectors, lexicon vectors, and ADR feature to form a meta-embedding model which maintains both contextual and sentimental properties. The final meta-embedding acts as a feature extractor to assess the effectiveness of the model in drug reviews sentiment analysis. The experiments are conducted on global vectors (GloVE) and skip-gram word2vector (Word2Vec) models. The empirical results demonstrate the proposed meta-embedding model outperforms traditional word embedding in different performance measures.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
A prior case study of natural language processing on different domain IJECEIAES
In the present state of digital world, computer machine do not understand the human’s ordinary language. This is the great barrier between humans and digital systems. Hence, researchers found an advanced technology that provides information to the users from the digital machine. However, natural language processing (i.e. NLP) is a branch of AI that has significant implication on the ways that computer machine and humans can interact. NLP has become an essential technology in bridging the communication gap between humans and digital data. Thus, this study provides the necessity of the NLP in the current computing world along with different approaches and their applications. It also, highlights the key challenges in the development of new NLP model.
Generate fuzzy string-matching to build self attention on Indonesian medical-...IJECEIAES
Chatbot is a form of interactive conversation that requires quick and precise answers. The process of identifying answers to users’ questions involves string matching and handling incorrect spelling. Therefore, a system that can independently predict and correct letters is highly necessary. The approach used to address this issue is to enhance the fuzzy string-matching method by incorporating several features for self-attention. The combination of fuzzy string-matching methods employed includes Jaro Winkler distance + Levenshtein Damerau distance and Damerau Levenshtein + Rabin Carp. The reason for using this combination is their ability not only to match strings but also to correct word typing errors. This research contributes by developing a self-attention mechanism through a modified fuzzy string- matching model with enhanced word feature structures. The goal is to utilize this self-attention mechanism in constructing the Indonesian medical bidirectional encoder representations from transformers (IM-BERT). This will serve as a foundation for additional features to provide accurate answers in the Indonesian medical question and answer system, achieving an exact match of 85.7% and an F1-score of 87.6%.
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
Distributed language representation has become the most widely used technique for language representation in various natural language processing tasks. Most of the natural language processing models that are based on deep learning techniques use already pre-trained distributed word representations, commonly called word embeddings. Determining the most qualitative word embeddings is of crucial importance for such models. However, selecting the appropriate word embeddings is a perplexing task since the projected embedding space is not intuitive to humans. In this paper, we explore different approaches for creating distributed word representations. We perform an intrinsic evaluation of several state-of-the-art word embedding methods. Their performance on capturing word similarities is analysed with existing benchmark datasets for word pairs similarities. The research in this paper conducts a correlation analysis between ground truth word similarities and similarities obtained by different word embedding methods.
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
Distributed language representation has become the most widely used technique for language representation in various natural language processing tasks. Most of the natural language processing models that are based on deep learning techniques use already pre-trained distributed word representations, commonly called word embeddings. Determining the most qualitative word embeddings is of crucial importance for such models. However, selecting the appropriate word embeddings is a perplexing task since the projected embedding space is not intuitive to humans.In this paper, we explore different approaches for creating distributed word representations. We perform an intrinsic evaluation of several state-of-the-art word embedding methods. Their performance on capturing word similarities is analysed with existing benchmark datasets for word pairs similarities. The research in this paper conducts a correlation analysis between ground truth word similarities and similarities obtained by different word embedding methods.
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUEJournal For Research
Natural Language Processing (NLP) techniques are one of the most used techniques in the field of computer applications. It has become one of the vast and advanced techniques. Language is the means of communication or interaction among humans and in present scenario when everything is dependent on machine or everything is computerized, communication between computer and human has become a necessity. To fulfill this necessity NLP has been emerged as the means of interaction which narrows the gap between machines (computers) and humans. It was evolved from the study of linguistics which was passed through the Turing test to check the similarity between data but it was limited to small set of data. Later on various algorithms were developed along with the concept of AI (Artificial Intelligence) for the successful execution of NLP. In this paper, the main emphasis is on the different techniques of NLP which have been developed till now, their applications and the comparison of all those techniques on different parameters.
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
Enhancing battery system identification: nonlinear autoregressive modeling fo...IJECEIAES
Precisely characterizing Li-ion batteries is essential for optimizing their
performance, enhancing safety, and prolonging their lifespan across various
applications, such as electric vehicles and renewable energy systems. This
article introduces an innovative nonlinear methodology for system
identification of a Li-ion battery, employing a nonlinear autoregressive with
exogenous inputs (NARX) model. The proposed approach integrates the
benefits of nonlinear modeling with the adaptability of the NARX structure,
facilitating a more comprehensive representation of the intricate
electrochemical processes within the battery. Experimental data collected
from a Li-ion battery operating under diverse scenarios are employed to
validate the effectiveness of the proposed methodology. The identified
NARX model exhibits superior accuracy in predicting the battery's behavior
compared to traditional linear models. This study underscores the
importance of accounting for nonlinearities in battery modeling, providing
insights into the intricate relationships between state-of-charge, voltage, and
current under dynamic conditions.
Smart grid deployment: from a bibliometric analysis to a surveyIJECEIAES
Smart grids are one of the last decades' innovations in electrical energy.
They bring relevant advantages compared to the traditional grid and
significant interest from the research community. Assessing the field's
evolution is essential to propose guidelines for facing new and future smart
grid challenges. In addition, knowing the main technologies involved in the
deployment of smart grids (SGs) is important to highlight possible
shortcomings that can be mitigated by developing new tools. This paper
contributes to the research trends mentioned above by focusing on two
objectives. First, a bibliometric analysis is presented to give an overview of
the current research level about smart grid deployment. Second, a survey of
the main technological approaches used for smart grid implementation and
their contributions are highlighted. To that effect, we searched the Web of
Science (WoS), and the Scopus databases. We obtained 5,663 documents
from WoS and 7,215 from Scopus on smart grid implementation or
deployment. With the extraction limitation in the Scopus database, 5,872 of
the 7,215 documents were extracted using a multi-step process. These two
datasets have been analyzed using a bibliometric tool called bibliometrix.
The main outputs are presented with some recommendations for future
research.
Use of analytical hierarchy process for selecting and prioritizing islanding ...IJECEIAES
One of the problems that are associated to power systems is islanding
condition, which must be rapidly and properly detected to prevent any
negative consequences on the system's protection, stability, and security.
This paper offers a thorough overview of several islanding detection
strategies, which are divided into two categories: classic approaches,
including local and remote approaches, and modern techniques, including
techniques based on signal processing and computational intelligence.
Additionally, each approach is compared and assessed based on several
factors, including implementation costs, non-detected zones, declining
power quality, and response times using the analytical hierarchy process
(AHP). The multi-criteria decision-making analysis shows that the overall
weight of passive methods (24.7%), active methods (7.8%), hybrid methods
(5.6%), remote methods (14.5%), signal processing-based methods (26.6%),
and computational intelligent-based methods (20.8%) based on the
comparison of all criteria together. Thus, it can be seen from the total weight
that hybrid approaches are the least suitable to be chosen, while signal
processing-based methods are the most appropriate islanding detection
method to be selected and implemented in power system with respect to the
aforementioned factors. Using Expert Choice software, the proposed
hierarchy model is studied and examined.
Enhancing of single-stage grid-connected photovoltaic system using fuzzy logi...IJECEIAES
The power generated by photovoltaic (PV) systems is influenced by
environmental factors. This variability hampers the control and utilization of
solar cells' peak output. In this study, a single-stage grid-connected PV
system is designed to enhance power quality. Our approach employs fuzzy
logic in the direct power control (DPC) of a three-phase voltage source
inverter (VSI), enabling seamless integration of the PV connected to the
grid. Additionally, a fuzzy logic-based maximum power point tracking
(MPPT) controller is adopted, which outperforms traditional methods like
incremental conductance (INC) in enhancing solar cell efficiency and
minimizing the response time. Moreover, the inverter's real-time active and
reactive power is directly managed to achieve a unity power factor (UPF).
The system's performance is assessed through MATLAB/Simulink
implementation, showing marked improvement over conventional methods,
particularly in steady-state and varying weather conditions. For solar
irradiances of 500 and 1,000 W/m2
, the results show that the proposed
method reduces the total harmonic distortion (THD) of the injected current
to the grid by approximately 46% and 38% compared to conventional
methods, respectively. Furthermore, we compare the simulation results with
IEEE standards to evaluate the system's grid compatibility.
Enhancing photovoltaic system maximum power point tracking with fuzzy logic-b...IJECEIAES
Photovoltaic systems have emerged as a promising energy resource that
caters to the future needs of society, owing to their renewable, inexhaustible,
and cost-free nature. The power output of these systems relies on solar cell
radiation and temperature. In order to mitigate the dependence on
atmospheric conditions and enhance power tracking, a conventional
approach has been improved by integrating various methods. To optimize
the generation of electricity from solar systems, the maximum power point
tracking (MPPT) technique is employed. To overcome limitations such as
steady-state voltage oscillations and improve transient response, two
traditional MPPT methods, namely fuzzy logic controller (FLC) and perturb
and observe (P&O), have been modified. This research paper aims to
simulate and validate the step size of the proposed modified P&O and FLC
techniques within the MPPT algorithm using MATLAB/Simulink for
efficient power tracking in photovoltaic systems.
Adaptive synchronous sliding control for a robot manipulator based on neural ...IJECEIAES
Robot manipulators have become important equipment in production lines, medical fields, and transportation. Improving the quality of trajectory tracking for
robot hands is always an attractive topic in the research community. This is a
challenging problem because robot manipulators are complex nonlinear systems
and are often subject to fluctuations in loads and external disturbances. This
article proposes an adaptive synchronous sliding control scheme to improve trajectory tracking performance for a robot manipulator. The proposed controller
ensures that the positions of the joints track the desired trajectory, synchronize
the errors, and significantly reduces chattering. First, the synchronous tracking
errors and synchronous sliding surfaces are presented. Second, the synchronous
tracking error dynamics are determined. Third, a robust adaptive control law is
designed,the unknown components of the model are estimated online by the neural network, and the parameters of the switching elements are selected by fuzzy
logic. The built algorithm ensures that the tracking and approximation errors
are ultimately uniformly bounded (UUB). Finally, the effectiveness of the constructed algorithm is demonstrated through simulation and experimental results.
Simulation and experimental results show that the proposed controller is effective with small synchronous tracking errors, and the chattering phenomenon is
significantly reduced.
Remote field-programmable gate array laboratory for signal acquisition and de...IJECEIAES
A remote laboratory utilizing field-programmable gate array (FPGA) technologies enhances students’ learning experience anywhere and anytime in embedded system design. Existing remote laboratories prioritize hardware access and visual feedback for observing board behavior after programming, neglecting comprehensive debugging tools to resolve errors that require internal signal acquisition. This paper proposes a novel remote embeddedsystem design approach targeting FPGA technologies that are fully interactive via a web-based platform. Our solution provides FPGA board access and debugging capabilities beyond the visual feedback provided by existing remote laboratories. We implemented a lab module that allows users to seamlessly incorporate into their FPGA design. The module minimizes hardware resource utilization while enabling the acquisition of a large number of data samples from the signal during the experiments by adaptively compressing the signal prior to data transmission. The results demonstrate an average compression ratio of 2.90 across three benchmark signals, indicating efficient signal acquisition and effective debugging and analysis. This method allows users to acquire more data samples than conventional methods. The proposed lab allows students to remotely test and debug their designs, bridging the gap between theory and practice in embedded system design.
Detecting and resolving feature envy through automated machine learning and m...IJECEIAES
Efficiently identifying and resolving code smells enhances software project quality. This paper presents a novel solution, utilizing automated machine learning (AutoML) techniques, to detect code smells and apply move method refactoring. By evaluating code metrics before and after refactoring, we assessed its impact on coupling, complexity, and cohesion. Key contributions of this research include a unique dataset for code smell classification and the development of models using AutoGluon for optimal performance. Furthermore, the study identifies the top 20 influential features in classifying feature envy, a well-known code smell, stemming from excessive reliance on external classes. We also explored how move method refactoring addresses feature envy, revealing reduced coupling and complexity, and improved cohesion, ultimately enhancing code quality. In summary, this research offers an empirical, data-driven approach, integrating AutoML and move method refactoring to optimize software project quality. Insights gained shed light on the benefits of refactoring on code quality and the significance of specific features in detecting feature envy. Future research can expand to explore additional refactoring techniques and a broader range of code metrics, advancing software engineering practices and standards.
Smart monitoring technique for solar cell systems using internet of things ba...IJECEIAES
Rapidly and remotely monitoring and receiving the solar cell systems status parameters, solar irradiance, temperature, and humidity, are critical issues in enhancement their efficiency. Hence, in the present article an improved smart prototype of internet of things (IoT) technique based on embedded system through NodeMCU ESP8266 (ESP-12E) was carried out experimentally. Three different regions at Egypt; Luxor, Cairo, and El-Beheira cities were chosen to study their solar irradiance profile, temperature, and humidity by the proposed IoT system. The monitoring data of solar irradiance, temperature, and humidity were live visualized directly by Ubidots through hypertext transfer protocol (HTTP) protocol. The measured solar power radiation in Luxor, Cairo, and El-Beheira ranged between 216-1000, 245-958, and 187-692 W/m 2 respectively during the solar day. The accuracy and rapidity of obtaining monitoring results using the proposed IoT system made it a strong candidate for application in monitoring solar cell systems. On the other hand, the obtained solar power radiation results of the three considered regions strongly candidate Luxor and Cairo as suitable places to build up a solar cells system station rather than El-Beheira.
An efficient security framework for intrusion detection and prevention in int...IJECEIAES
Over the past few years, the internet of things (IoT) has advanced to connect billions of smart devices to improve quality of life. However, anomalies or malicious intrusions pose several security loopholes, leading to performance degradation and threat to data security in IoT operations. Thereby, IoT security systems must keep an eye on and restrict unwanted events from occurring in the IoT network. Recently, various technical solutions based on machine learning (ML) models have been derived towards identifying and restricting unwanted events in IoT. However, most ML-based approaches are prone to miss-classification due to inappropriate feature selection. Additionally, most ML approaches applied to intrusion detection and prevention consider supervised learning, which requires a large amount of labeled data to be trained. Consequently, such complex datasets are impossible to source in a large network like IoT. To address this problem, this proposed study introduces an efficient learning mechanism to strengthen the IoT security aspects. The proposed algorithm incorporates supervised and unsupervised approaches to improve the learning models for intrusion detection and mitigation. Compared with the related works, the experimental outcome shows that the model performs well in a benchmark dataset. It accomplishes an improved detection accuracy of approximately 99.21%.
Developing a smart system for infant incubators using the internet of things ...IJECEIAES
This research is developing an incubator system that integrates the internet of things and artificial intelligence to improve care for premature babies. The system workflow starts with sensors that collect data from the incubator. Then, the data is sent in real-time to the internet of things (IoT) broker eclipse mosquito using the message queue telemetry transport (MQTT) protocol version 5.0. After that, the data is stored in a database for analysis using the long short-term memory network (LSTM) method and displayed in a web application using an application programming interface (API) service. Furthermore, the experimental results produce as many as 2,880 rows of data stored in the database. The correlation coefficient between the target attribute and other attributes ranges from 0.23 to 0.48. Next, several experiments were conducted to evaluate the model-predicted value on the test data. The best results are obtained using a two-layer LSTM configuration model, each with 60 neurons and a lookback setting 6. This model produces an R 2 value of 0.934, with a root mean square error (RMSE) value of 0.015 and a mean absolute error (MAE) of 0.008. In addition, the R 2 value was also evaluated for each attribute used as input, with a result of values between 0.590 and 0.845.
A review on internet of things-based stingless bee's honey production with im...IJECEIAES
Honey is produced exclusively by honeybees and stingless bees which both are well adapted to tropical and subtropical regions such as Malaysia. Stingless bees are known for producing small amounts of honey and are known for having a unique flavor profile. Problem identified that many stingless bees collapsed due to weather, temperature and environment. It is critical to understand the relationship between the production of stingless bee honey and environmental conditions to improve honey production. Thus, this paper presents a review on stingless bee's honey production and prediction modeling. About 54 previous research has been analyzed and compared in identifying the research gaps. A framework on modeling the prediction of stingless bee honey is derived. The result presents the comparison and analysis on the internet of things (IoT) monitoring systems, honey production estimation, convolution neural networks (CNNs), and automatic identification methods on bee species. It is identified based on image detection method the top best three efficiency presents CNN is at 98.67%, densely connected convolutional networks with YOLO v3 is 97.7%, and DenseNet201 convolutional networks 99.81%. This study is significant to assist the researcher in developing a model for predicting stingless honey produced by bee's output, which is important for a stable economy and food security.
A trust based secure access control using authentication mechanism for intero...IJECEIAES
The internet of things (IoT) is a revolutionary innovation in many aspects of our society including interactions, financial activity, and global security such as the military and battlefield internet. Due to the limited energy and processing capacity of network devices, security, energy consumption, compatibility, and device heterogeneity are the long-term IoT problems. As a result, energy and security are critical for data transmission across edge and IoT networks. Existing IoT interoperability techniques need more computation time, have unreliable authentication mechanisms that break easily, lose data easily, and have low confidentiality. In this paper, a key agreement protocol-based authentication mechanism for IoT devices is offered as a solution to this issue. This system makes use of information exchange, which must be secured to prevent access by unauthorized users. Using a compact contiki/cooja simulator, the performance and design of the suggested framework are validated. The simulation findings are evaluated based on detection of malicious nodes after 60 minutes of simulation. The suggested trust method, which is based on privacy access control, reduced packet loss ratio to 0.32%, consumed 0.39% power, and had the greatest average residual energy of 0.99 mJoules at 10 nodes.
Fuzzy linear programming with the intuitionistic polygonal fuzzy numbersIJECEIAES
In real world applications, data are subject to ambiguity due to several factors; fuzzy sets and fuzzy numbers propose a great tool to model such ambiguity. In case of hesitation, the complement of a membership value in fuzzy numbers can be different from the non-membership value, in which case we can model using intuitionistic fuzzy numbers as they provide flexibility by defining both a membership and a non-membership functions. In this article, we consider the intuitionistic fuzzy linear programming problem with intuitionistic polygonal fuzzy numbers, which is a generalization of the previous polygonal fuzzy numbers found in the literature. We present a modification of the simplex method that can be used to solve any general intuitionistic fuzzy linear programming problem after approximating the problem by an intuitionistic polygonal fuzzy number with n edges. This method is given in a simple tableau formulation, and then applied on numerical examples for clarity.
The performance of artificial intelligence in prostate magnetic resonance im...IJECEIAES
Prostate cancer is the predominant form of cancer observed in men worldwide. The application of magnetic resonance imaging (MRI) as a guidance tool for conducting biopsies has been established as a reliable and well-established approach in the diagnosis of prostate cancer. The diagnostic performance of MRI-guided prostate cancer diagnosis exhibits significant heterogeneity due to the intricate and multi-step nature of the diagnostic pathway. The development of artificial intelligence (AI) models, specifically through the utilization of machine learning techniques such as deep learning, is assuming an increasingly significant role in the field of radiology. In the realm of prostate MRI, a considerable body of literature has been dedicated to the development of various AI algorithms. These algorithms have been specifically designed for tasks such as prostate segmentation, lesion identification, and classification. The overarching objective of these endeavors is to enhance diagnostic performance and foster greater agreement among different observers within MRI scans for the prostate. This review article aims to provide a concise overview of the application of AI in the field of radiology, with a specific focus on its utilization in prostate MRI.
Seizure stage detection of epileptic seizure using convolutional neural networksIJECEIAES
According to the World Health Organization (WHO), seventy million individuals worldwide suffer from epilepsy, a neurological disorder. While electroencephalography (EEG) is crucial for diagnosing epilepsy and monitoring the brain activity of epilepsy patients, it requires a specialist to examine all EEG recordings to find epileptic behavior. This procedure needs an experienced doctor, and a precise epilepsy diagnosis is crucial for appropriate treatment. To identify epileptic seizures, this study employed a convolutional neural network (CNN) based on raw scalp EEG signals to discriminate between preictal, ictal, postictal, and interictal segments. The possibility of these characteristics is explored by examining how well timedomain signals work in the detection of epileptic signals using intracranial Freiburg Hospital (FH), scalp Children's Hospital Boston-Massachusetts Institute of Technology (CHB-MIT) databases, and Temple University Hospital (TUH) EEG. To test the viability of this approach, two types of experiments were carried out. Firstly, binary class classification (preictal, ictal, postictal each versus interictal) and four-class classification (interictal versus preictal versus ictal versus postictal). The average accuracy for stage detection using CHB-MIT database was 84.4%, while the Freiburg database's time-domain signals had an accuracy of 79.7% and the highest accuracy of 94.02% for classification in the TUH EEG database when comparing interictal stage to preictal stage.
Analysis of driving style using self-organizing maps to analyze driver behaviorIJECEIAES
Modern life is strongly associated with the use of cars, but the increase in acceleration speeds and their maneuverability leads to a dangerous driving style for some drivers. In these conditions, the development of a method that allows you to track the behavior of the driver is relevant. The article provides an overview of existing methods and models for assessing the functioning of motor vehicles and driver behavior. Based on this, a combined algorithm for recognizing driving style is proposed. To do this, a set of input data was formed, including 20 descriptive features: About the environment, the driver's behavior and the characteristics of the functioning of the car, collected using OBD II. The generated data set is sent to the Kohonen network, where clustering is performed according to driving style and degree of danger. Getting the driving characteristics into a particular cluster allows you to switch to the private indicators of an individual driver and considering individual driving characteristics. The application of the method allows you to identify potentially dangerous driving styles that can prevent accidents.
Hyperspectral object classification using hybrid spectral-spatial fusion and ...IJECEIAES
Because of its spectral-spatial and temporal resolution of greater areas, hyperspectral imaging (HSI) has found widespread application in the field of object classification. The HSI is typically used to accurately determine an object's physical characteristics as well as to locate related objects with appropriate spectral fingerprints. As a result, the HSI has been extensively applied to object identification in several fields, including surveillance, agricultural monitoring, environmental research, and precision agriculture. However, because of their enormous size, objects require a lot of time to classify; for this reason, both spectral and spatial feature fusion have been completed. The existing classification strategy leads to increased misclassification, and the feature fusion method is unable to preserve semantic object inherent features; This study addresses the research difficulties by introducing a hybrid spectral-spatial fusion (HSSF) technique to minimize feature size while maintaining object intrinsic qualities; Lastly, a soft-margins kernel is proposed for multi-layer deep support vector machine (MLDSVM) to reduce misclassification. The standard Indian pines dataset is used for the experiment, and the outcome demonstrates that the HSSF-MLDSVM model performs substantially better in terms of accuracy and Kappa coefficient.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Cosmetic shop management system project report.pdfKamal Acharya
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...Amil Baba Dawood bangali
Contact with Dawood Bhai Just call on +92322-6382012 and we'll help you. We'll solve all your problems within 12 to 24 hours and with 101% guarantee and with astrology systematic. If you want to take any personal or professional advice then also you can call us on +92322-6382012 , ONLINE LOVE PROBLEM & Other all types of Daily Life Problem's.Then CALL or WHATSAPP us on +92322-6382012 and Get all these problems solutions here by Amil Baba DAWOOD BANGALI
#vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore#blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #blackmagicforlove #blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #Amilbabainuk #amilbabainspain #amilbabaindubai #Amilbabainnorway #amilbabainkrachi #amilbabainlahore #amilbabaingujranwalan #amilbabainislamabad
Welcome to WIPAC Monthly the magazine brought to you by the LinkedIn Group Water Industry Process Automation & Control.
In this month's edition, along with this month's industry news to celebrate the 13 years since the group was created we have articles including
A case study of the used of Advanced Process Control at the Wastewater Treatment works at Lleida in Spain
A look back on an article on smart wastewater networks in order to see how the industry has measured up in the interim around the adoption of Digital Transformation in the Water Industry.
NBLex: emotion prediction in Kannada-English code-switchtext using naïve bayes lexicon approach
1. International Journal of Electrical and Computer Engineering (IJECE)
Vol. 13, No. 2, April 2023, pp. 2068~2077
ISSN: 2088-8708, DOI: 10.11591/ijece.v13i2.pp2068-2077 2068
Journal homepage: http://ijece.iaescore.com
NBLex: emotion prediction in Kannada-English code-switch
text using naïve bayes lexicon approach
Ramesh Chundi1
, Vishwanath R. Hulipalled2
, Jay Bharthish Simha3
1
School of Computer Science and Applications, REVA University, Bangalore, India
2
School of Computing and Information Technology, REVA University, Bangalore, India
3
Abiba Systems, CTO, and RACE Labs, REVA University, Bangalore, India
Article Info ABSTRACT
Article history:
Received Apr 13, 2022
Revised Sep 21, 2022
Accepted Oct 14, 2022
Emotion analysis is a process of identifying the human emotions derived
from the various data sources. Emotions can be expressed either in
monolingual text or code-switch text. Emotion prediction can be performed
through machine learning (ML), or deep learning (DL), or lexicon-based
approach. ML and DL approaches are computationally expensive and
require training data. Whereas, the lexicon-based approach does not require
any training data and it takes very less time to predict the emotions in
comparison with ML and DL. In this paper, we proposed a lexicon-based
method called non-binding lower extremity exoskeleton (NBLex) to predict
the emotions associated with Kannada-English code-switch text that no one
has addressed till now. We applied the One-vs-Rest approach to generate the
scores for lexicon and also to predict the emotions from the code-switch text.
The accuracy of the proposed model NBLex (87.9%) is better than naïve
bayes (NB) (85.8%) and bidirectional long short-term memory neural
network (BiLSTM) (84.7%) and for true positive rate (TPR), the NBLex
(50.6%) is better than NB (37.0%) and BiLSTM (42.2%). From our
approach, it is observed that a simple additive model (lexicon approach) can
also be an alternative model to predict the emotions in code-switch text.
Keywords:
Code-switch text
Emotion analysis
Emotion prediction
Lexicon based approach
One-vs-rest
Text analytics
This is an open access article under the CC BY-SA license.
Corresponding Author:
Ramesh Chundi
School of Computer Science and Applications, REVA University
Rukmini Knowledge Park, Kattigenahalli, Srinivasa Nagar, Yelahanka, Bangalore-560064, India
Email: chundiramesh@gmail.com
1. INTRODUCTION
Many social networking websites like YouTube, Facebook, and Twitter, are available for users to
express emotional text content in online every day. Prediction of emotions from these social media text will
help us to understand the human emotional behavior towards trends and public events. Emotion prediction is
to identifying the emotions like anger, disgust, and joy, from text content. According to Plutchik’s wheel of
emotions, there are eight basic emotions (anger, anticipation, disgust, fear, joy, sadness, surprise, and trust)
and two sentiment classes (positive and negative) [1].
In the recent past lot of work has been carried out on the monolingual text to predict the emotions by
several researchers [2], [3] However, most of the internet users in India are multilingual (can read and write
more than two languages) or bilingual (at least they can read and write two languages). According to the
leading newspaper The times of India report (Nov 7th
, 2018), 52% of Indians are bilingual and 18% are
multilingual [4]. This multilingualism makes the internet users to use different languages while writing the
text comments. This type of substituting the languages is known as code-switching or code-mixing [5]. Code-
switch is a very common occurrence [6]–[8] and users can write the comments either in monolingual text or
2. Int J Elec & Comp Eng ISSN: 2088-8708
NBLex: emotion prediction in Kannada-English code-switch text using … (Ramesh Chundi)
2069
code-switch text in social networking websites. To illustrate, Ex1 refers to monolingual text, where both
source and scripting languages are same (English) and Ex2 refers to code-switch text, where the source and
scripting languages are different (source is Kannada language and scripting is in the English language).
- Ex1: “Apj Abdul kalam is the ever best & talented President of India”
- Ex2: “Avrge enu artha haguthe, bidu guru”
- Translation: “What they can understand, leave it, sir”
Emotions can be expressed either in monolingual text or code-switch text. In Ex1 (monolingual
text), user expressed joy emotion and in Ex2 (code-switch text), user expressed sad emotion. Hence, many
people nowadays use code-switch text in various situations which is convenient for them. The main
advantage of using code-switch text in social networking websites is that, people no need to follow any
linguistic and grammatical rules.
Figure 1 shows the different approaches for emotion prediction. There are three approaches to
predict the emotions, namely machine learning (ML) approach, the deep learning (DL) approach and lexicon-
based approach. Both ML and DL approaches are to perform the prediction task by learning the patterns from
the training dataset and test the performance on the test dataset. Support vector machine (SVM) and naïve
bayes (NB) are the examples of ML approach and similarly, long short-term memory (LSTM) and
bidirectional long short-term memory neural network (BiLSTM) are examples for the DL approaches.
Whereas, the lexicon approach performs the prediction task by checking the words in the lexicon, if the
words appear in the lexicon, then take the values of the words and add them to the score.
Figure 1. Different approaches for emotion prediction
Predicting the emotions expressed in code-switch text is a challenging task in comparison with
predicting the emotions in the monolingual text since both source and scripting languages are not the same.
Also, the out-of-vocabulary (OOV) words are more common on social networking websites [9]. Some of the
researchers tried to predict the emotions in code-switch text using ML and DL techniques [10]–[13].
However, there are some limitations with ML and DL techniques as they need training data, they are
computationally expensive, and also incapable to handle OOV words. One of the alternative approaches is
machine translation, which translates the code-switch text into monolingual text using machine translation
techniques. To translate this type of text requires more additional efforts to improve the performance of the
translation process and sometimes the translation is incorrect. For example, considering Ex3, Google
translator has given an incorrect translation.
- Ex3: “avlgee kannada baralla avl enu kannada kalstale”
- Translation: “She doesn’t know Kannada how she can teach Kannada”
- Google Translation: “What is Kannada?”
To address the above limitations, few researchers are trying to use the lexicon-based approach for
emotion prediction [14]. Here, the lexicon-based approach does not require any training data and these are
computationally not expensive in comparison with ML and DL. The lexicon-based approach uses a list of
pre-labeled words that are classified into various emotions [15]. The lexicon-based approach classified into
two types, the first one is corpus-based approach and the second one is dictionary-based approach. In the
corpus-based approach, lexicon will be built based on the co-occurrence of a word (statistical) or semantic of
a word. In dictionary-based approach, first will collect a few words (seed words) with emotional labels. Next
3. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 2, April 2023: 2068-2077
2070
step, we will use the Bootstrap method to find all synonyms of seed words from the dictionary and add these
synonyms to the seed list. Some of the well-known emotion lexicons are affective norms for English words
(ANEW) [16], linguistic norms and word count (LIWC) [17], WordNet-affect dictionary emotion lexicon
[18], National Research Council Canada (NRC) word-emotion association lexicon [19], and DepecheMood
emotional lexicon [20]. The dictionary-based approach is not a better approach for code-switch text since it
has limited vocabulary at the same time social media text is OOV. In the recent past, the relevant literature
on this topic has been carried out by many researchers who worked on monolingual text to predict the
emotions due to the availability of large-scale monolingual data in social media networks and also it is easy
to implement in comparison with code-switch text.
Wang et al. [10] proposed a method called BLSTM-multiple classifiers (BLSTM-MC) to study the
association between various emotions in each code-switch post and predicts all the emotions conveyed by
each post. Tang et al. [11] examined how to fine-tune the bidirectional encoder representations from
transformers (BERT) model for multi-label sentiment (emotions) analysis in code-switch text. This approach
contains a collection of pre-trained models and the fine-tuning techniques of BERT. Data augmentation,
under sampling, and ensemble learning are used to get a balanced sample from an unbalanced sample.
Mohammad and Turney [19] discussed emotion prediction in the paper titled “Crowdsourcing a word–
emotion association lexicon”. In this, they created a huge and high-quality word–emotion, and word–polarity
association lexicon quickly and inexpensively. They also used 10,000 Word-Sense pairs, and each pair of
word-sense is associated with 8 basic emotions. However, this work can be enhanced to create lexicons in
other languages.
Staiano and Guerini [20] build an emotional lexicon called DepecheMood with approximately
37,000 words marked with emotion values from a social news network. This lexicon shows substantial
progress over the state-of-art unsupervised approach with high coverage and high precision. In this work,
there is a scope to enhance for testing the unusual value decomposition on the word-by-document matrices.
Purver and Battersby [21] proposed the cross-convention method for multi-class emotion detection in Twitter
messages with no manual interference using automatically labeled data. This approach has given a reasonable
performance at individual emotion prediction. However, this work is better for few of the emotions such as
happy, sad, and anger in comparison with other emotions such as fear, surprise, and disgust. Wen and Wan
[22] proposed an approach called class sequential rules to predict the emotions on a Chinese benchmark
dataset. This approach has shown greater performance in comparison with other approaches. The
performance of this approach can be improved by using additional social network information and discourse
information.
Hasan et al. [23] used the Circumplex model to classify the individual emotional positions. This
model has outperformed with 90% Accuracy in comparison with k-nearest neighbors (KNN), decision tree
(DT), SVM, and NB. Further, this approach can be extended to examine the sequential nature of the emotions
and how they change over the period. Mohammad and Kiritchenko [24] proposed that emotion-word
Hashtags are good manual labels of emotions in tweets and also proposed a method to generate the large
lexicon of word emotion. This is the first lexicon with real-valued word emotion scores. This work can be
extend to analyzing the difference in emotions associated with different morphological forms of the emotion
words. Al-Aziz et al. [25] proposed the model that combines the lexicon approach and multi-criteria
decision-making approach (MCDM). This model can be used to classify the text into different emotions and
also classify Arabic text with mixed emotions. The dataset which is used in this work is very small
(200 tweets) and which needs to be tested on the large dataset to know the performance.
Gupta et al. [26] proposed a method to identify the emotions such as happy, sad, and angry using the
deep learning approach LSTM. In this method, they combined the advantages of semantic and sentiment-
based embedding. This method can be extend to train the context-aware models. Schuff et al. [27] explores
the relationship between emotion annotation and the other annotation such as sentiment as well as stance
layers. Here, BiLSTM method performed better than all other approaches. Mohammad and Bravo-Marquez [28]
used a method called best-worst scaling (BWS) to improve annotation consistency and find the related fine-
grained emotion scores. They identified that Emotion-Word Hashtags will carry more intense emotions.
Rabeya et al. [29] proposed an emotion prediction model to find the emotions from Bengali text. A
lexicon-based Backtracking approach is used to find the sentiments of sentences to prove how often users
express emotion at the end of the sentence. The corpus used in this work is very small (351 lexicons only)
and which is not enough to get decent accuracy. Batbaatar et al. [30] proposed a method called semantic-
emotion neural network (SENN) by adding both emotional information and semantic/syntactic information.
This model is mainly divided into two sub-networks. First, BiLSTM which finds the contextual information
and focuses on a semantic relationship. Secondly, convolutional neural networks (CNN) are to extract
emotional information and focuses on emotional relationships. Further, SENN performance can be improved
by using the higher emotion word embeddings.
4. Int J Elec & Comp Eng ISSN: 2088-8708
NBLex: emotion prediction in Kannada-English code-switch text using … (Ramesh Chundi)
2071
Currently, most of the emotion prediction research work has been done on monolingual text but very
few researchers have performed emotion prediction on code-switch text due to the complexity of the text.
Lee and Wang [31] proposed a multiple-classifier based automatic approach to predict the emotions in the
Chinese-English code-switch corpus. This approach shows that both English text and Chinese text are active
to predict the emotions. Wang et al. [32] used a term-document Bipartite Graph to integrate both bilingual
and sentimental information and then used a propagation-based method to acquire and predict the emotions.
This work further can be extended to discover the association between emotions and caused languages for
identifying the emotions. Lee and Wang [33] proposed a multi-view learning structure to study and predict
the emotions through both monolingual and bilingual views. The investigational studies validate that the
proposed approach significantly outperforms various strong baselines. Wang et al. [34] proposed a joint
factor graph model to predict the emotions in code-switching text. Here, the Factor function is used to
discover the association between different emotions, and a belief propagation algorithm is used to study and
predict the model.
When it comes to emotion prediction in code-switching text of Indian languages, very few
researchers are tried to predict the emotions. Vijay et al. [12] proposed a supervised classification approach
that uses different machine learning methods SVM to find the emotion associated with Hindi-English code-
mixed data using character level, word level, and lexicon based features. To achieve better results, the corpus
can be annotated with part-of-speech (POS) tags at the word level. Appidi et al. [13] performed an
experiment on Kannada-English code-mixed data to predict the emotions. They used SVM and deep learning
method LSTM. The accuracy of SVM is 30% and the accuracy of LSTM is 32%. However, the corpus needs
to be annotated with POS tags at the word level.
The corpus-based approach will handle the above limitation since there is no limit to vocabulary. In
this paper, we proposed a corpus-based lexicon approach called NBLex to predict the emotions in Kannada-
English code-switch text. We used NB classifier to build the proposed method.
The structure of the remaining paper is as follows. Section 2 we proposed the lexicon generation and
emotion prediction. Comparative result analysis is discussed in section 3. Finally, the conclusion and future
works are discussed in section 4.
2. METHOD
2.1. Lexicon generation
In this section, we discuss Kannada-English code-switch (KECS) corpus creation and annotation.
Further, we will discuss the process for creating the NBLex lexicon. In order to develop KECS corpus, we
assume only those comments which are having primarily code-switch nature. We consider the comments as
code-switch, even if one word differs from the monolingual condition (i.e., source and scripting languages
must be the same). Here, all the words in our corpus are English script only.
To create the corpus KECS, initially, we gathered approximately 7526 Kannada-English code-
switch comments from YouTube.com based on various topics such as politics, movies, celebrities, and social
events. We performed the pre-processing task to eliminate noisy data and the pre-processing task includes the
removal of special characters, symbols, and digits as they will not have any importance in the process of
lexicon creation since these words will not carry any emotional meaning. The next step is to carry out the
process of annotation. In this process manual labeling of emotions (anger, anticipation, disgust, fear, joy,
sadness, surprise, and trust) is made, since we do not have any automatic labeling methods for Kannada-
English code-switch text.
In lexicon creation, NB classifier is used to generate scores for each word along with the one-vs-rest
approach. In this approach, every time we consider one category of emotion as positive (true or 1) and all
other categories of emotions as Not_Positive (False or 0) e.g., to generate the score for the anger emotion
category, we consider all emotion under anger emotion category as 1 and rest of the emotions as 0. lexicon
creation starts with building the frequency dictionary where, Keys tuples (word, label) and values are
frequencies. The label value is either 1 or 0 and frequency is an integer value (the number of times that word
and label tuple present in the data).
The next step in lexicon creation is to find the positive and Not_Positive frequency of the word from
the frequency dictionary. Once frequencies are computed, then we calculate the total number of positive
words, the total number of Not_Positive words and also, we calculate the total number of words from the
data. After doing all these, we need to find positive and Not_Positive probability for each of the word. We
used 10-fold cross-validation while generating the scores for each word in the dictionary to make stronger
values. To compute the positive probability of the word we are using (1).
P(pw) =
pf + 1
Np + T
(1)
5. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 2, April 2023: 2068-2077
2072
where, P(pw) is the positive probability of the word, pf is a positive frequency of the word, Np is the total
number of positive words in corpus, and T is the total number of words in the corpus. To compute the
Not_Positive probability of the word we are using (2).
P(nw) =
nf + 1
Nn + T
(2)
where, P(nw) is the Not_Positive probability of the word, nf is the Not_Positive frequency of the word, Nn is
the total number of Not_Positive words in the corpus. Finally, we compute the score for every word in the
dictionary by using (3).
Score = log
P(pw)
P(nw)
(3)
where score is the total value of the sentence.
After generating the scores for each of the words in the dictionary, manually we removed one -letter
words and English stop words since they will not have any emotional meaning. Also, we removed nouns
(names) and other non-emotional words from the lexicon since they will not have emotional significance in
the process of emotion prediction. This entire process is used to generate the scores for one emotion. The
same process is followed for the rest of the emotions to generate the scores.
Once the final lexicon is ready, to find the dependency among the emotions, we need to check the
relationship between different emotions. To perform this, we applied the correlation technique on lexicon and
found that fear emotion has a positive correlation with surprise and anticipation emotions. One of the reasons
is that the number of comments under these categories (anticipation, fear, and surprise) is very less in
comparison with other categories of emotions.
Table 1 shows the correlation between the various emotions. It is observed that the positive
correlation between fear and surprise as well as fear and anticipation both have the 0.7. And also, it is
observed that the rest of the emotions are either independent or negatively correlated with other emotions.
Table 1. Correlation between emotions
Anger Anticipation Disgust Fear Joy Sad Surprise Trust
Anger 1 0.1 0.1 0.2 -0.3 0.0 0.1 -0.2
Anticipation 0.1 1 0.2 0.7 0.1 0.3 0.5 0.2
Disgust 0.1 0.2 1 0.3 -0.2 0.0 0.2 -0.1
Fear 0.2 0.7 0.3 1 0.2 0.4 0.7 0.3
Joy -0.3 0.1 -0.2 0.2 1 -0.1 0.2 0.3
Sad 0.0 0.3 0.0 0.4 -0.1 1 0.3 0.0
Surprise 0.1 0.5 0.2 0.7 0.2 0.3 1 0.2
Trust -0.2 0.2 -0.1 0.3 0.3 0.0 0.2 1
2.2. Emotion prediction
Figure 2 shows the complete steps for emotion prediction process. To perform emotion prediction
using a lexicon-based approach, we gathered total of 1882 Kannada-English code-switch text comments from
YouTube.com based on different topics. The pre-processing task is carried out to improve the accuracy of the
emotion prediction task by eliminating noisy data such as symbols, digits, and other special characters. As
such there is no automated labeling system for Kannada-English code-switch text, the annotation has been
done for emotions by a linguistic expert manually. Further, the tokenization task is carried out to split the
entire text into tokens to calculate the sentence score. To predict the emotions, first we need to calculate the
score for each comment using (4).
SS = ∑ Score(w)
n
i=0
(4)
where, SS refers sentence score.
The detailed steps of emotion prediction are shown in algorithm 1. Here, prediction is performed
based upon one emotion at a time. e.g., first, predict the Anger emotion by assuming all other emotions are
non-anger. Next, the anticipation emotion is predicted by assuming all other emotions are non-anticipation.
Like this, the same approach is followed for the rest of the 6 emotions. Once score is calculated, now we
need to perform emotion prediction using (5).
6. Int J Elec & Comp Eng ISSN: 2088-8708
NBLex: emotion prediction in Kannada-English code-switch text using … (Ramesh Chundi)
2073
Predict = {
1 (Positive), if SS > 0
0 (Not_Positive), otherwise
(5)
Algorithm 1. Emotion prediction
Input: t, Score /* t – Input text
Score – Set of words (Lexicon) with values */
Output: Emotion /* Positive or Not_Positive */
/*Caluclating score of t (Text comment) */
Cal_Score (t, Score)
{
1. SS = 0 /* SS is the temp variable to store the score of t */
2. for each w in t /* For each word w from t compute the score */
3. if w in Score then
4. SS += Score.get(w)
5. end if
6. end for
7. return SS
8. }
/* Predicting the Emotion of each text comment in test_data */
Emotion_Prediction (test_data)
{
9. Predict = [] /* is a empty list */
10. for each t in test_data /* for each comment (t) in the test_data compute the score */
11. if Cal_Score(t, Score) > 0 then
12. Predict_i = 1 (Positive)
13. else
14. Predict_i = 0 (Not-Positive)
15. end if
16. end for
17. Predict.append(Predict_i)
18. }
Figure 2. Emotion prediction process
3. RESULTS AND DISCUSSION
The results of the proposed NBLex lexicon method is compared with the ML (NB) and DLg
(BiLSTM) methods. There are 7526 Kannada-English code-switch text comments used to train both the
models separately and applied 10-flod cross-validation while training both the models. All these code-switch
comments are manually labeled with eight basic emotions (anger, anticipation, disgust, fear, joy, sad,
surprise, and trust). Totally, 1882 code-switch comments are used as test dataset.
7. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 2, April 2023: 2068-2077
2074
3.1. Naïve Bayes
Here, two types of vectorizations such as bag-of-words (BOW) and term frequency inverse
document frequency (TF-IDF) are performed and then predict the emotions on the test dataset. Considering
only accuracy, the BOW method is showing the better results for some of the emotions (anger and joy) and
TF-IDF showing the good results for rest of the emotions (anticipation, disgust, fear, sad, surprise and trust).
Considering both accuracy and true positives, the BOW method performance is much better than TF-IDF as
observed in the Table 2.
The BOW method produces better results for anger emotion (78.4%). The BOW and TF-IDF are
almost the same for joy (84.7% and 84.6%) and for trust (78.9% and 79%) emotions. For the rest of the
emotions (anticipation, disgust, fear, sad, and surprise) TF-IDF is much better. However, when considering
the true positives, BOW is provides very good results in comparison with TF-IDF. Surprising that the true
positives are 0 for emotions like anticipation, fear, and sad in TF-IDF, it is due to sample bias and
Distribution bias which are the main reasons for getting 0 true positives in TF-IDF. Also, the score of TF-IDF
is always between 0 and 1 whereas, the score of BOW is a natural number. The BOW approach is compared
with BiLSTM and lexicon. TF-IDF is not a suitable method since the performance of any method will not be
depend only on accuracy but also it depends upon both the accuracy and true positives.
Table 2. Accuracy and true positives of BOW and TF-IDF
Emotions Accuracy True Positives
BOW TF-IDF BOW TF-IDF
Anger 78.4 74.1 289 68
Anticipation 91.3 95.9 18 0
Disgust 78.9 81.8 155 12
Fear 97.8 99.4 1 0
Joy 84.7 84.6 489 321
Sad 83.9 87.2 70 0
Surprise 92.5 97 4 1
Trust 78.9 79 231 69
3.2. Bidirectional long-short term memory
To predict the emotions, the BiLSTM approach is applied to the test dataset. The BiLSTM approach
outperforms for fear (98.8%) emotion, but the total number of comments is very less (12). Generally, the DL
models work well for large datasets. However, in this scenario, the BiLSTM model performs well on a small
dataset. This can be observed in Table 3 which shows the accuracy and true positives.
3.3. NBLex lexicon
NBLex lexicon is applied on test dataset to predict emotions. Table 4 shows the accuracy and true
positives results obtained for various emotions. The fear emotion achieved better accuracy (94%) in
comparison with all other emotions for total of 12 comments. The reason for this condition is based on the
emotion dependency.
Table 3. Accuracy and true positives of BiLSTM
Emotions Accuracy True Positives
Anger 75.6 294
Anticipation 88.2 28
Disgust 81.1 145
Fear 98.8 2
Joy 83.3 440
Sad 81.9 85
Surprise 91.7 14
Trust 77.2 261
Table 4. Accuracy and true positives of NBLex lexicon
Emotions Accuracy True Positives
Anger 83.2 332
Anticipation 92.3 32
Disgust 85.8 207
Fear 94 3
Joy 87.5 510
Sad 84.5 102
Surprise 92.4 17
Trust 83.8 284
From Table 5, it is observed that the NBLex is outperformed in terms of accuracy for most of the
emotions (anger, anticipation, disgust, joy, sad, and trust) in comparison with other approaches like NB and
BiLSTM. For fear emotion, BiLSTM produces remarkable results in comparison with other approaches like
NB and NBLex. For the surprise emotion, both NB and NBLex are almost showing the same results (92.5%
and 92.4%). From Table 6, it is observed that the results obtained from proposed model NBLex is better than
NB and BiLSTM for all emotions in terms of true positives why because it is due to there is no limit for
vocabulary and it can handle OOV words.
8. Int J Elec & Comp Eng ISSN: 2088-8708
NBLex: emotion prediction in Kannada-English code-switch text using … (Ramesh Chundi)
2075
Figure 3 shows the accuracy of the various emotions for proposed model NBLex and other models
NB, BiLSTM. From Figure 3, it is observed that the NBLex performs well for most of the emotions
(6 emotions out of 8 emotions) in terms of accuracy in comparison with the NB and BiLSTM.
Figure 4 shows the true positive rate (TPR) of the various emotions for proposed model NBLex and other
models NB, BiLSTM. From Figure 4, it is observed that the NBLex performs well for all the emotions in
terms of TPR in comparison with the NB and BiLSTM approaches.
Table 5. Accuracy comparison of NB, BiLSTM,
and NBLex
Emotions NB BiLSTM NBLex
Anger 78.4 75.6 83.2
Anticipation 91.3 88.2 92.3
Disgust 78.9 81.1 85.8
Fear 97.8 98.8 94
Joy 84.7 83.3 87.5
Sad 83.9 81.9 84.5
Surprise 92.5 91.7 92.4
Trust 78.9 77.2 83.8
Average Accuracy 85.8 84.7 87.9
Table 6. True positives rate of NB, BiLSTM,
and NBLex
Emotions NB BiLSTM NBLex
Anger 52.2 53.1 60.0
Anticipation 23.0 35.8 41.0
Disgust 43.7 40.9 58.4
Fear 8.3 16.6 25.0
Joy 82.8 74.5 86.4
Sad 29.0 35.2 42.3
Surprise 7.0 24.5 29.8
Trust 50.3 56.8 61.8
Average TPR 37.0 42.2 50.6
Figure 3. Accuracy of various emotions
Figure 4. True positive rate of various emoticons
0
20
40
60
80
100
120
Anger Anticipation Disgust Fear Joy Sad Surprise Trust
Accuracy
Emotions
Accuracy of NB, BiLSTM, and NBLex
NB BiLSTM NBLex
0
20
40
60
80
100
Anger Anticipation Disgust Fear Joy Sad Surprise Trust
True
Positive
Rate
Emotions
True Positive Rate of NB, BiLSTM and NBLex
NB BiLSTM NBLex
9. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 2, April 2023: 2068-2077
2076
4. CONCLUSION
In this paper, we contributed NBLex for emotion prediction in Kannada-English code-switch text.
The proposed approach NBLex provides optimal results (87.9%) in terms of accuracy in comparison with
other traditional NB (85.8%) and BiLSTM (84.7%) techniques. Also, in terms of TPR, the NBLex produces
50.6% which is better than NB (37.0%) and BiLSTM (42.2%). From our contribution, it shows that a simple
additive model (i.e., lexicon) is also be an alternative approach to predict the emotions in Kannada-English
code-switch text. This is the first work that aims to predict the emotions in Kannada-English code-switch text
by using lexicon approach.
Further, in future we want to enhance our work by addressing following three points: i) adding
additional emotional data and increasing the accuracy as well as TPR, ii) contextual detection of emotions
from the corpus, and iii) AB testing for engagement activities like pacification of anger customers or increasing net
profit score (NPR) of happy customers, as the proposed model detects these two prominent emotions effectively.
ACKNOWLEDGEMENTS
The authors want to thank the REVA University management for their support of this research
activity.
REFERENCES
[1] R. Plutchik, “A psychoevolutionary theory of emotions,” Social Science Information, vol. 21, no. 4–5, pp. 529–553, Jul. 1982,
doi: 10.1177/053901882021004003.
[2] C. O. Alm, D. Roth, and R. Sproat, “Emotions from text: machine learning for text-based emotion prediction,” in Proceedings of
human language technology conference and conference on empirical methods in natural language processing, 2005,
pp. 579–586.
[3] Y. Chen, S. Y. M. Lee, S. Li, and C.-R. Huang, “Emotion cause detection with linguistic constructions,” in Proceedings of the
23rd International Conference on Computational Linguistics (Coling 2010), 2010, pp. 179–187.
[4] P. R. Nagarajan, “52% of India’s urban youth are now bilingual, 18% speak three languages’,” The Times of India, 2018.
https://timesofindia.indiatimes.com/india/52-of-indias-urban-youth-are-now-bilingual-18-speak-three-
languages/articleshow/66530958.cms (accessed Jun 16, 2022).
[5] J. Lipski, “Code-switching and the problem of bilingual competence,” Aspects of bilingualism. Columbia: Hornbeam Press Inc,
pp. 250–264, 1978.
[6] M. Gysels, “French in urban Lubumbashi Swahili: Codeswitching, borrowing, or both?,” Journal of Multilingual and
Multicultural Development, vol. 13, no. 1–2, pp. 41–55, Jan. 1992, doi: 10.1080/01434632.1992.9994482.
[7] C. Myers-Scotton, Duelling languages: Grammatical structure in codeswitching. Oxford University Press, 1997.
[8] P. Muysken and Others, Bilingual speech: A typology of code-mixing. Cambridge University Press, 2000.
[9] T. Baldwin, P. Cook, M. Lui, A. MacKinlay, and L. Wang, “How noisy social media text, How diffrnt social media sources?,” in
Proceedings of the Sixth International Joint Conference on Natural Language Processing, Oct. 2013, pp. 356–364.
[10] T. Wang, X. Yang, C. Ouyang, A. Guo, Y. Liu, and Z. Li, “A multi-emotion classification method based on BLSTM-MC in code-
switching Text,” in CCF International Conference on Natural Language Processing and Chinese Computing, 2018, pp. 190–199,
doi: 10.1007/978-3-319-99501-4_16.
[11] T. Tang, X. Tang, and T. Yuan, “Fine-tuning BERT for multi-label sentiment analysis in unbalanced code-switching text,” IEEE
Access, vol. 8, pp. 193248–193256, 2020, doi: 10.1109/ACCESS.2020.3030468.
[12] D. Vijay, A. Bohra, V. Singh, S. S. Akhtar, and M. Shrivastava, “Corpus creation and emotion prediction for Hindi-English code-
mixed social media text,” in Proceedings of the 2018 Conference of the North American Chapter of the Association for
Computational Linguistics: Student Research Workshop, 2018, pp. 128–135, doi: 10.18653/v1/N18-4018.
[13] A. R. Appidi, V. K. Srirangam, D. Suhas, and M. Shrivastava, “Creation of corpus and analysis in code-mixed Kannada-English
twitter data for emotion prediction,” in Proceedings of the 28th International Conference on Computational Linguistics, 2020,
pp. 6703–6709, doi: 10.18653/v1/2020.coling-main.587.
[14] R. Kamal, M. A. Shah, C. Maple, M. Masood, A. Wahid, and A. Mehmood, “Emotion classification and crowd source sensing; A
lexicon based approach,” IEEE Access, vol. 7, pp. 27124–27134, 2019, doi: 10.1109/ACCESS.2019.2892624.
[15] S. Mohammad, “From once upon a time to happily ever after: Tracking emotions in mail and books,” Decision Support Systems,
vol. 53, no. 4, pp. 730–741, 2012.
[16] M. M. Bradley and P. J. Lang, “Affective norms for English words (ANEW): Instruction manual and affective ratings,” Technical
Report C-1, The Center for Research in Psychophysiology, University of Florida, 1999.
[17] J. Pennebaker, M. E. Francis, and R. J. Booth, “Linguistic Inquiry and Word Count (LIWC): LIWC2001,” Mahway: Lawrence
Erlbaum Associates, 2001.
[18] C. Strapparava and A. Valitutti, “WordNet-affect: an affective extension of WordNet,” Proceedings of the Fourth International
Conference on Language Resources and Evaluation (LREC’04), May 2004.
[19] S. M. Mohammad and P. D. Turney, “Crowdsourcing a word-emotion association lexicon,” Computational Intelligence, vol. 29,
no. 3, pp. 436–465, Aug. 2013, doi: 10.1111/j.1467-8640.2012.00460.x.
[20] J. Staiano and M. Guerini, “Depeche mood: a Lexicon for emotion analysis from crowd annotated news,” in Proceedings of the
52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Jun. 2014, pp. 427–433, doi:
10.3115/v1/P14-2070.
[21] M. Purver and S. Battersby, “Experimenting with distant supervision for emotion classification,” in Proceedings of the 13th
Conference of the European Chapter of the Association for Computational Linguistics, Apr. 2012, pp. 482–491.
[22] S. Wen and X. Wan, “Emotion classification in microblog texts using class sequential rules,” in Proceedings of the Twenty-Eighth
AAAI Conference on Artificial Intelligence, 2014, pp. 187–193.
[23] M. Hasan, E. A. Rundensteiner, and E. O. Agu, “EMOTEX: Detecting emotions in twitter messages,” in 2014 ASE
10. Int J Elec & Comp Eng ISSN: 2088-8708
NBLex: emotion prediction in Kannada-English code-switch text using … (Ramesh Chundi)
2077
Bigdata/Socialcom/Cybersecurity Conference, 2014, pp. 1–10.
[24] S. M. Mohammad and S. Kiritchenko, “Using hashtags to capture fine emotion categories from tweets,” Computational
Intelligence, vol. 31, no. 2, pp. 301–326, 2015.
[25] A. M. Abd Al-Aziz, M. Gheith, and A. S. Eldin, “Lexicon based and multi-criteria decision making (MCDM) approach for
detecting emotions from Arabic microblog text,” in 2015 First International Conference on Arabic Computational Linguistics
(ACLing), Apr. 2015, pp. 100–105, doi: 10.1109/ACLing.2015.21.
[26] U. Gupta, A. Chatterjee, R. Srikanth, and P. Agrawal, “A sentiment-and-semantics-based àpproach for emotion detection in
textual conversations,” arXiv:1707.06996. arXiv, Jul. 2017.
[27] H. Schuff, J. Barnes, J. Mohme, S. Padó, and R. Klinger, “Annotation, modelling and analysis of fine-grained emotions on a
stance and sentiment detection corpus,” in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity,
Sentiment and Social Media Analysis, 2017, pp. 13–23, doi: 10.18653/v1/W17-5203.
[28] S. Mohammad and F. Bravo-Marquez, “Emotion intensities in tweets,” in Proceedings of the 6th Joint Conference on Lexical and
Computational Semantics (SEM 2017), Aug. 2017, pp. 65–77, doi: 10.18653/v1/S17-1007.
[29] T. Rabeya, S. Ferdous, H. S. Ali, and N. R. Chakraborty, “A survey on emotion detection: A lexicon based backtracking approach
for detecting emotion from Bengali text,” 2017 20th International Conference of Computer and Information Technology (ICCIT),
pp. 1–7, Dec. 2017, doi: 10.1109/ICCITECHN.2017.8281855.
[30] E. Batbaatar, M. Li, and K. H. Ryu, “Semantic-emotion neural network for emotion recognition from text,” IEEE Access, vol. 7,
pp. 111866–111878, 2019, doi: 10.1109/ACCESS.2019.2934529.
[31] S. Lee and Z. Wang, “Emotion in code-switching texts: Corpus construction and analysis,” in Proceedings of the Eighth SIGHAN
Workshop on Chinese Language Processing, Jul. 2015, pp. 91–99, doi: 10.18653/v1/W15-3116.
[32] Z. Wang, S. Lee, S. Li, and G. Zhou, “Emotion detection in code-switching texts via bilingual and sentimental information,” in
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint
Conference on Natural Language Processing (Volume 2: Short Papers), Jul. 2015, pp. 763–768, doi: 10.3115/v1/P15-2125.
[33] S. Y. M. Lee and Z. Wang, “Multi-view learning for emotion detection in code-switching texts,” in 2015 International
Conference on Asian Language Processing (IALP), Oct. 2015, pp. 90–93, doi: 10.1109/IALP.2015.7451539.
[34] Z. Wang, S. Y. M. Lee, S. Li, and G. Zhou, “Emotion analysis in code-switching text with joint factor graph model,” IEEE/ACM
Transactions on Audio, Speech, and Language Processing, vol. 25, no. 3, pp. 469–480, Mar. 2017, doi:
10.1109/TASLP.2016.2637280.
BIOGRAPHIES OF AUTHORS
Ramesh Chundi received the B.Sc degree in computer science and MCA degree
from Sri Venkateswara University, India, in 2004 and 2007, respectively. Currently pursuing
Ph.D degree in Computer Science and Applications from REVA University, India. His
research interests include natural language processing (NLP), artificial intelligence, ML, DL,
data analytics, and data mining. He can be contacted at email: chundiramesh@gmail.com.
Vishwanath R. Hulipalled is a Professor in the School of Computing and IT,
REVA University, Bangalore, Karnataka, India. He completed BE, ME and Ph.D. in Computer
Science and Engineering. His area of Interests includes Machine Learning, Natural Language
Processing, Data Analytics and Time Series Mining. He has more than 24 years of academic
experience and research. He authored more than 50 research articles in reputed journals and
conference proceedings. He can be contacted at email: vishwanth.rh@reva.edu.in.
Jay Bharthish Simha is the CTO of ABIBA Systems and Chief Mentor at
RACE Labs, REVA University. He completed his BE (Mech), M.Tech (Mech), M.Phil.(CS)
and Ph.D. (AI). His area of interest includes fuzzy logic, soft computing, machine learning,
deep learning, and applications. He has more than 20 years of industrial experience and
4 years of academic experience. He has authored/co-authored more than 50 journal and
conference publications. He can be contacted at email: jay.b.simha@reva.edu.in and
jay.b.simha@abibasystems.com.