This document evaluates the performance of Support Vector Machine (SVM) and Decision Tree (DT) classifiers for emotion recognition from Malay folklores. 100 documents from children's short stories were collected and annotated with four emotion labels: happy, angry, fearful, and sad. Term Frequency-Inverse Document Frequency (TF-IDF) features were extracted from the documents. Both SVM and DT were tested on a 80/20 training/test split of the data. Results showed that the DT classifier outperformed SVM with an accuracy over 22.2%. However, the overall emotion recognition accuracy was only moderate, suggesting room for improvement.
Sentiment analysis is an important current research area. The demand for sentiment analysis and classification is growing day by day; this paper presents a novel method to classify Urdu documents as previously no work recorded on sentiment classification for Urdu text. We consider the problem by determining whether the review or sentence is positive, negative or neutral. For the purpose we use two machine learning methods Naïve Bayes and Support Vector Machines (SVM) . Firstly the documents are preprocessed and the sentiments features are extracted, then the polarity has been calculated, judged and classify through Machine learning methods.
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...ijcsa
Sentiment Analysis is a natural language processing
task that extracts sentiment from various text for
ms
and classifies them according to positive, negative
or neutral polarity. It analyzes emotions, feeling
s, and
the attitude of a speaker or a writer towards a con
text. This paper gives comparative study of various
sentiment classification techniques and also discus
ses in detail two main categories of sentiment
classification techniques these are machine based a
nd lexicon based. The paper also presents challenge
s
associated with sentiment analysis along with lexic
al resources available.
Text to Emotion Extraction Using Supervised Machine Learning TechniquesTELKOMNIKA JOURNAL
Proliferation of internet and social media has greatly increased the popularity of text
communication. People convey their sentiment and emotion through text which promotes lively
communication. Consequently, a tremendous amount of emotional text is generated on different social
media and blogs in every moment. This has raised the necessity of automated tool for emotion mining from
text. There are various rule based approaches of emotion extraction form text based on emotion intensity
lexicon. However, creating emotion intensity lexicon is a time consuming and tedious process. Moreover,
there is no hard and fast rule for assigning emotion intensity to words. To solve these difficulties, we
propose a machine learning based approach of emotion extraction from text which relies on annotated
example rather emotion intensity lexicon. We investigated Multinomial Naïve Bayesian (MNB) Classifier,
Artificial Neural Network (ANN) and Support Vector Machine (SVM) for mining emotion from text. In our
setup, SVM outperformed other classifiers with promising accuracy.
A survey on sentiment analysis and opinion miningeSAT Journals
Abstract Sentiment analysis is a machine learning approach in which machines analyze and classify the human’s sentiments, emotions, opinions etc about some topic which are expressed in the form of either text or speech. The textual data available in the web is increasing day by day. In order to enhance the sales of a product and to improve the customer satisfaction, most of the on-line shopping sites provide the opportunity to customers to write reviews about products. These reviews are large in number and to mine the overall sentiment or opinion polarity from all of them, sentiment analysis can be used. Manual analysis of such large number of reviews is practically impossible. Therefore automated approach of a machine has significant role in solving this hard problem. The major challenge of the area of Sentiment analysis and Opinion mining lies in identifying the emotions expressed in these texts. This literature survey is done to study the sentiment analysis problem in-depth and to familiarize with other works done on the subject. Index Terms: Sentiment Analysis, Opinion Mining, Cross Domain Sentiment Analysis
Sentiment analysis is an important current research area. The demand for sentiment analysis and classification is growing day by day; this paper presents a novel method to classify Urdu documents as previously no work recorded on sentiment classification for Urdu text. We consider the problem by determining whether the review or sentence is positive, negative or neutral. For the purpose we use two machine learning methods Naïve Bayes and Support Vector Machines (SVM) . Firstly the documents are preprocessed and the sentiments features are extracted, then the polarity has been calculated, judged and classify through Machine learning methods.
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...ijcsa
Sentiment Analysis is a natural language processing
task that extracts sentiment from various text for
ms
and classifies them according to positive, negative
or neutral polarity. It analyzes emotions, feeling
s, and
the attitude of a speaker or a writer towards a con
text. This paper gives comparative study of various
sentiment classification techniques and also discus
ses in detail two main categories of sentiment
classification techniques these are machine based a
nd lexicon based. The paper also presents challenge
s
associated with sentiment analysis along with lexic
al resources available.
Text to Emotion Extraction Using Supervised Machine Learning TechniquesTELKOMNIKA JOURNAL
Proliferation of internet and social media has greatly increased the popularity of text
communication. People convey their sentiment and emotion through text which promotes lively
communication. Consequently, a tremendous amount of emotional text is generated on different social
media and blogs in every moment. This has raised the necessity of automated tool for emotion mining from
text. There are various rule based approaches of emotion extraction form text based on emotion intensity
lexicon. However, creating emotion intensity lexicon is a time consuming and tedious process. Moreover,
there is no hard and fast rule for assigning emotion intensity to words. To solve these difficulties, we
propose a machine learning based approach of emotion extraction from text which relies on annotated
example rather emotion intensity lexicon. We investigated Multinomial Naïve Bayesian (MNB) Classifier,
Artificial Neural Network (ANN) and Support Vector Machine (SVM) for mining emotion from text. In our
setup, SVM outperformed other classifiers with promising accuracy.
A survey on sentiment analysis and opinion miningeSAT Journals
Abstract Sentiment analysis is a machine learning approach in which machines analyze and classify the human’s sentiments, emotions, opinions etc about some topic which are expressed in the form of either text or speech. The textual data available in the web is increasing day by day. In order to enhance the sales of a product and to improve the customer satisfaction, most of the on-line shopping sites provide the opportunity to customers to write reviews about products. These reviews are large in number and to mine the overall sentiment or opinion polarity from all of them, sentiment analysis can be used. Manual analysis of such large number of reviews is practically impossible. Therefore automated approach of a machine has significant role in solving this hard problem. The major challenge of the area of Sentiment analysis and Opinion mining lies in identifying the emotions expressed in these texts. This literature survey is done to study the sentiment analysis problem in-depth and to familiarize with other works done on the subject. Index Terms: Sentiment Analysis, Opinion Mining, Cross Domain Sentiment Analysis
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields collecting review from people about products and social and political events through the web. Currently, Sentiment Analysis concentrates for subjective statements or on subjectivity and overlook objective statements which carry sentiment(s). During the sentiment classification more challenging problem are faced due to the ambiguous sense of words, negation words and intensifier. Due to its importance the correct sense of target word is extracted and determined for which the similarity arise in WordNet Glosses. This paper presents a survey covering the techniques and methods in sentiment analysis and challenges appear in the field.
Creation of speech corpus for emotion analysis in Gujarati language and its e...IJECEIAES
In the last couple of years emotion recognition has proven its significance in the area of artificial intelligence and man machine communication. Emotion recognition can be done using speech and image (facial expression), this paper deals with SER (speech emotion recognition) only. For emotion recognition emotional speech database is essential. In this paper we have proposed emotional database which is developed in Gujarati language, one of the official’s language of India. The proposed speech corpus bifurcate six emotional states as: sadness, surprise, anger, disgust, fear, happiness. To observe effect of different emotions, analysis of proposed Gujarati speech database is carried out using efficient speech parameters like pitch, energy and MFCC using MATLAB Software.
Natural Language Processing Theory, Applications and Difficultiesijtsrd
The promise of a powerful computing device to help people in productivity as well as in recreation can only be realized with proper human machine communication. Automatic recognition and understanding of spoken language is the first step toward natural human machine interaction. Research in this field has produced remarkable results, leading to many exciting expectations and new challenges. This field is known as Natural language Processing. In this paper the natural language generation and Natural language understanding is discussed. Difficulties in NLU, applications and comparison with structured programming language are also discussed here. Mrs. Anjali Gharat | Mrs. Helina Tandel | Mr. Ketan Bagade "Natural Language Processing Theory, Applications and Difficulties" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-6 , October 2019, URL: https://www.ijtsrd.com/papers/ijtsrd28092.pdf Paper URL: https://www.ijtsrd.com/engineering/computer-engineering/28092/natural-language-processing-theory-applications-and-difficulties/mrs-anjali-gharat
Several attempts had been made to analyze emotion words in the fields of linguistics, psychology and sociology; with the advent of computers, the analyses of these words have taken a different dimension. Unfortunately, limited attempts have so far been made to using interval type-2 fuzzy logic (IT2FL) to analyze these words in native languages. This study used IT2FL to analyze Igbo emotion words. IT2F sets are computed using the interval approach method which is divided into two parts: the data part and the fuzzy set part. The data part preprocessed data and its statistics computed for the interval that survived the preprocessing stages while the fuzzy set part determined the nature of the footprint of uncertainty; the IT2F set mathematical models for each emotion characteristics of each emotion word is also computed. The data used in this work was collected from fifteen subjects who were asked to enter an interval for each of the emotion characteristics: Valence, Activation and Dominance on an interval survey of the thirty Igbo emotion words. With this, the words are being analyzed and can be used for the purposes of translation between vocabularies in consideration to context.
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...TELKOMNIKA JOURNAL
Sentiment analysis in short informal texts like product reviews is more challenging. Short texts are
sparse, noisy, and lack of context information. Traditional text classification methods may not be suitable
for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these
problems is to enrich the original texts with additional semantics to make it appear like a large document of
text. Then, traditional classification methods can be applied to it. In this study, we developed an automatic
sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based
Feature Expansion. The system consists of three main stages, preprocessing and normalization, features
expansion and classification. After preprocessing and normalization, we utilize Kateglo to find some
synonyms of every words in original texts and append them. Finally, the text is classified using Naïve
Bayes. The experiment shows that the proposed method can improve the performance of sentiment
analysis of short informal Indonesian product reviews. The best sentiment classification performance using
proposed feature expansion is obtained by accuracy of 98%.The experiment also show that feature
expansion will give higher improvement in small number of training data than in the large number of them.
Sentiment classification aims to detect information such as opinions, explicit , implicit feelings expressed
in text. The most existing approaches are able to detect either explicit expressions or implicit expressions of
sentiments in the text separately. In this proposed framework it will detect both Implicit and Explicit
expressions available in the meeting transcripts. It will classify the Positive, Negative, Neutral words and
also identify the topic of the particular meeting transcripts by using fuzzy logic. This paper aims to add
some additional features for improving the classification method. The quality of the sentiment classification
is improved using proposed fuzzy logic framework .In this fuzzy logic it includes the features like Fuzzy
rules and Fuzzy C-means algorithm.The quality of the output is evaluated using the parameters such as
precision, recall, f-measure. Here Fuzzy C-means Clustering technique measured in terms of Purity and
Entropy. The data set was validated using 10-fold cross validation method and observed 95% confidence
interval between the accuracy values .Finally, the proposed fuzzy logic method produced more than 85 %
accurate results and error rate is very less compared to existing sentiment classification techniques.
Business intelligence analytics using sentiment analysis-a surveyIJECEIAES
Sentiment analysis (SA) is the study and analysis of sentiments, appraisals and impressions by people about entities, person, happening, topics and services. SA uses text analysis techniques and natural language processing methods to locate and extract information from big data. As most of the people are networked themselves through social websites, they use to express their sentiments through these websites.These sentiments are proved fruitful to an individual, business, government for making decisions. The impressions posted on different available sources are being used by organization to know the market mood about the services they are providing. Analyzing huge moods expressed with different features, style have raised challenge for users. This paper focuses on understanding the fundamentals of sentiment analysis, the techniques used for sentiment extraction and analysis. These techniques are then compared for accuracy, advantages and limitations. Based on the accuracy for expexted approach, we may use the suitable technique.
An Improved sentiment classification for objective word.IJSRD
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields. Customer sentiments play a very important role in daily life. Currently, Sentiment classification focused on subjective statements and ignores objective statements which also carry sentiment. During the sentiment classification, problem is faced due to the ambiguous sense (meaning) of words and negation words. In word sense disambiguation method semantic scores calculated from SentiWordNet of WordNet glosses terms. The correct sense of the word is extracted and determined similarity in WordNet glosses terms. SentiWordNet extract first sense of word which used in general sense. This work aims at improving the sentiment classification by modifying the sentiment values returned by SentiWordNet and compare classification accuracy of support vector machine and naïve bays.
Mining Opinion Features in Customer ReviewsIJCERT JOURNAL
Now days, E-commerce systems have become extremely important. Large numbers of customers are choosing online shopping because of its convenience, reliability, and cost. Client generated information and especially item reviews are significant sources of data for consumers to make informed buy choices and for makers to keep track of customer’s opinions. It is difficult for customers to make purchasing decisions based on only pictures and short product descriptions. On the other hand, mining product reviews has become a hot research topic and prior researches are mostly based on pre-specified product features to analyse the opinions. Natural Language Processing (NLP) techniques such as NLTK for Python can be applied to raw customer reviews and keywords can be extracted. This paper presents a survey on the techniques used for designing software to mine opinion features in reviews. Elven IEEE papers are selected and a comparison is made between them. These papers are representative of the significant improvements in opinion mining in the past decade.
A SURVEY OF MACHINE LEARNING TECHNIQUES FOR SENTIMENT CLASSIFICATIONijcsa
Opinion Mining also called as Sentiment Analysis is a process that provides with the subjective informationfor the text provided. In other words we can say that it analyzes person’s opinion, evaluations, emotions,appraisals, etc. towards a particular product, event, issue, service, topic, etc. This paper focuses on the machine learning techniques used for sentiment analysis and opinion mining. These methods are furthercompared on the basis of their accuracy, advantages and limitations.
An Approach of Human Emotional States Classification and Modeling from EEGCSCJournals
In this paper, a new approach is proposed to model the emotional states from EEG signals with mathematical expressions based on wavelet analysis and trust region algorithm. EEG signals are collected in different emotional states and some salient features are extracted through temporal and spectral analysis to indicate the dispersion which will unify different states. The maximum classification accuracy of emotion is obtained for DWT analysis rather than FFT and statistical analysis. So DWT analysis is considered as the best suited for mathematical modeling of human emotions. The emotional states are modeled with different mathematical expressions using the obtained coefficients from trust region algorithm that can be compared with the sub-band wavelet coefficients of different states. The proposed approach is verified with the adjusted R-square percentage and the sum of square errors. The adjusted R- square percentage of the mathematical modeled states are 78.4% for relax, 77.18% for motor action; however for memory, pleasant, enjoying music and fear they are 93%, 95.6%, 97.7% and 91.5% respectively. The proposed system is reliable that can be applied for practical real time implementation of human emotion based systems.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
The peer-reviewed International Journal of Engineering Inventions (IJEI) is started with a mission to encourage contribution to research in Science and Technology. Encourage and motivate researchers in challenging areas of Sciences and Technology.
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATIONijnlc
Text classification is a very important research area in machine learning. Artificial Intelligence is reshaping text classification techniques to better acquire knowledge. In spite of the growth and spread of AI in text mining research for various languages such as English, Japanese, Chinese, etc., its role with respect to Myanmar text is not well understood yet. The aim of this paper is comparative study of machine learning algorithms such as Naïve Bayes (NB), k-nearest neighbours (KNN), support vector machine (SVM) algorithms for Myanmar Language News classification. There is no comparative study of machine learning algorithms in Myanmar News. The news is classified into one of four categories (political, Business, Entertainment and Sport). Dataset is collected from 12,000 documents belongs to 4 categories. Well-known algorithms are applied on collected Myanmar language News dataset from websites. The goal of text classification is to classify documents into a certain number of pre-defined categories. News corpus is used for training and testing purpose of the classifier. Feature selection method, chi square algorithm achieves comparable performance across a number of classifiers. In this paper, the experimental results also show support vector machine is better accuracy to other classification algorithms employed in this research. Due to Myanmar Language is complex, it is more important to study and understand the nature of data before proceeding into mining.
Emotion detection on social media status in Myanmar language IJECEIAES
Many social media emerged and provided services during these years. Most people, especially in Myanmar, use them to express their emotions or moods, learn subjects, sell products, read up-to-date news, and communicate with each other. Emotion detection on social users makes critical tasks in the opinion mining and sentiment analysis. This paper presents the emotion detection system on social media (Facebook) user status or post written in Myanmar (Burmese) language. Before the emotion detection process, the user posts are pre-processed under segmentation, stemming, part-of-speech (POS) tagging, and stop word removal. The system then uses our preconstructed Myanmar word-emotion Lexicon, M-Lexicon, to extract the emotion words from the segmented POS post. The system provides six types of emotion such as surprise, disgust, fear, anger, sadness, and happiness. The system applies naïve Bayes (NB) emotion classifier to examine the emotion in the case of more than two words with different emotion values are extracted. The classifiers also classify the emotion of the users on their posts. The experiment shows that the system can detect 85% accuracy in NB based emotion detection while 86% in recurrent neural network (RNN).
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields collecting review from people about products and social and political events through the web. Currently, Sentiment Analysis concentrates for subjective statements or on subjectivity and overlook objective statements which carry sentiment(s). During the sentiment classification more challenging problem are faced due to the ambiguous sense of words, negation words and intensifier. Due to its importance the correct sense of target word is extracted and determined for which the similarity arise in WordNet Glosses. This paper presents a survey covering the techniques and methods in sentiment analysis and challenges appear in the field.
Creation of speech corpus for emotion analysis in Gujarati language and its e...IJECEIAES
In the last couple of years emotion recognition has proven its significance in the area of artificial intelligence and man machine communication. Emotion recognition can be done using speech and image (facial expression), this paper deals with SER (speech emotion recognition) only. For emotion recognition emotional speech database is essential. In this paper we have proposed emotional database which is developed in Gujarati language, one of the official’s language of India. The proposed speech corpus bifurcate six emotional states as: sadness, surprise, anger, disgust, fear, happiness. To observe effect of different emotions, analysis of proposed Gujarati speech database is carried out using efficient speech parameters like pitch, energy and MFCC using MATLAB Software.
Natural Language Processing Theory, Applications and Difficultiesijtsrd
The promise of a powerful computing device to help people in productivity as well as in recreation can only be realized with proper human machine communication. Automatic recognition and understanding of spoken language is the first step toward natural human machine interaction. Research in this field has produced remarkable results, leading to many exciting expectations and new challenges. This field is known as Natural language Processing. In this paper the natural language generation and Natural language understanding is discussed. Difficulties in NLU, applications and comparison with structured programming language are also discussed here. Mrs. Anjali Gharat | Mrs. Helina Tandel | Mr. Ketan Bagade "Natural Language Processing Theory, Applications and Difficulties" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-6 , October 2019, URL: https://www.ijtsrd.com/papers/ijtsrd28092.pdf Paper URL: https://www.ijtsrd.com/engineering/computer-engineering/28092/natural-language-processing-theory-applications-and-difficulties/mrs-anjali-gharat
Several attempts had been made to analyze emotion words in the fields of linguistics, psychology and sociology; with the advent of computers, the analyses of these words have taken a different dimension. Unfortunately, limited attempts have so far been made to using interval type-2 fuzzy logic (IT2FL) to analyze these words in native languages. This study used IT2FL to analyze Igbo emotion words. IT2F sets are computed using the interval approach method which is divided into two parts: the data part and the fuzzy set part. The data part preprocessed data and its statistics computed for the interval that survived the preprocessing stages while the fuzzy set part determined the nature of the footprint of uncertainty; the IT2F set mathematical models for each emotion characteristics of each emotion word is also computed. The data used in this work was collected from fifteen subjects who were asked to enter an interval for each of the emotion characteristics: Valence, Activation and Dominance on an interval survey of the thirty Igbo emotion words. With this, the words are being analyzed and can be used for the purposes of translation between vocabularies in consideration to context.
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...TELKOMNIKA JOURNAL
Sentiment analysis in short informal texts like product reviews is more challenging. Short texts are
sparse, noisy, and lack of context information. Traditional text classification methods may not be suitable
for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these
problems is to enrich the original texts with additional semantics to make it appear like a large document of
text. Then, traditional classification methods can be applied to it. In this study, we developed an automatic
sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based
Feature Expansion. The system consists of three main stages, preprocessing and normalization, features
expansion and classification. After preprocessing and normalization, we utilize Kateglo to find some
synonyms of every words in original texts and append them. Finally, the text is classified using Naïve
Bayes. The experiment shows that the proposed method can improve the performance of sentiment
analysis of short informal Indonesian product reviews. The best sentiment classification performance using
proposed feature expansion is obtained by accuracy of 98%.The experiment also show that feature
expansion will give higher improvement in small number of training data than in the large number of them.
Sentiment classification aims to detect information such as opinions, explicit , implicit feelings expressed
in text. The most existing approaches are able to detect either explicit expressions or implicit expressions of
sentiments in the text separately. In this proposed framework it will detect both Implicit and Explicit
expressions available in the meeting transcripts. It will classify the Positive, Negative, Neutral words and
also identify the topic of the particular meeting transcripts by using fuzzy logic. This paper aims to add
some additional features for improving the classification method. The quality of the sentiment classification
is improved using proposed fuzzy logic framework .In this fuzzy logic it includes the features like Fuzzy
rules and Fuzzy C-means algorithm.The quality of the output is evaluated using the parameters such as
precision, recall, f-measure. Here Fuzzy C-means Clustering technique measured in terms of Purity and
Entropy. The data set was validated using 10-fold cross validation method and observed 95% confidence
interval between the accuracy values .Finally, the proposed fuzzy logic method produced more than 85 %
accurate results and error rate is very less compared to existing sentiment classification techniques.
Business intelligence analytics using sentiment analysis-a surveyIJECEIAES
Sentiment analysis (SA) is the study and analysis of sentiments, appraisals and impressions by people about entities, person, happening, topics and services. SA uses text analysis techniques and natural language processing methods to locate and extract information from big data. As most of the people are networked themselves through social websites, they use to express their sentiments through these websites.These sentiments are proved fruitful to an individual, business, government for making decisions. The impressions posted on different available sources are being used by organization to know the market mood about the services they are providing. Analyzing huge moods expressed with different features, style have raised challenge for users. This paper focuses on understanding the fundamentals of sentiment analysis, the techniques used for sentiment extraction and analysis. These techniques are then compared for accuracy, advantages and limitations. Based on the accuracy for expexted approach, we may use the suitable technique.
An Improved sentiment classification for objective word.IJSRD
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields. Customer sentiments play a very important role in daily life. Currently, Sentiment classification focused on subjective statements and ignores objective statements which also carry sentiment. During the sentiment classification, problem is faced due to the ambiguous sense (meaning) of words and negation words. In word sense disambiguation method semantic scores calculated from SentiWordNet of WordNet glosses terms. The correct sense of the word is extracted and determined similarity in WordNet glosses terms. SentiWordNet extract first sense of word which used in general sense. This work aims at improving the sentiment classification by modifying the sentiment values returned by SentiWordNet and compare classification accuracy of support vector machine and naïve bays.
Mining Opinion Features in Customer ReviewsIJCERT JOURNAL
Now days, E-commerce systems have become extremely important. Large numbers of customers are choosing online shopping because of its convenience, reliability, and cost. Client generated information and especially item reviews are significant sources of data for consumers to make informed buy choices and for makers to keep track of customer’s opinions. It is difficult for customers to make purchasing decisions based on only pictures and short product descriptions. On the other hand, mining product reviews has become a hot research topic and prior researches are mostly based on pre-specified product features to analyse the opinions. Natural Language Processing (NLP) techniques such as NLTK for Python can be applied to raw customer reviews and keywords can be extracted. This paper presents a survey on the techniques used for designing software to mine opinion features in reviews. Elven IEEE papers are selected and a comparison is made between them. These papers are representative of the significant improvements in opinion mining in the past decade.
A SURVEY OF MACHINE LEARNING TECHNIQUES FOR SENTIMENT CLASSIFICATIONijcsa
Opinion Mining also called as Sentiment Analysis is a process that provides with the subjective informationfor the text provided. In other words we can say that it analyzes person’s opinion, evaluations, emotions,appraisals, etc. towards a particular product, event, issue, service, topic, etc. This paper focuses on the machine learning techniques used for sentiment analysis and opinion mining. These methods are furthercompared on the basis of their accuracy, advantages and limitations.
An Approach of Human Emotional States Classification and Modeling from EEGCSCJournals
In this paper, a new approach is proposed to model the emotional states from EEG signals with mathematical expressions based on wavelet analysis and trust region algorithm. EEG signals are collected in different emotional states and some salient features are extracted through temporal and spectral analysis to indicate the dispersion which will unify different states. The maximum classification accuracy of emotion is obtained for DWT analysis rather than FFT and statistical analysis. So DWT analysis is considered as the best suited for mathematical modeling of human emotions. The emotional states are modeled with different mathematical expressions using the obtained coefficients from trust region algorithm that can be compared with the sub-band wavelet coefficients of different states. The proposed approach is verified with the adjusted R-square percentage and the sum of square errors. The adjusted R- square percentage of the mathematical modeled states are 78.4% for relax, 77.18% for motor action; however for memory, pleasant, enjoying music and fear they are 93%, 95.6%, 97.7% and 91.5% respectively. The proposed system is reliable that can be applied for practical real time implementation of human emotion based systems.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
The peer-reviewed International Journal of Engineering Inventions (IJEI) is started with a mission to encourage contribution to research in Science and Technology. Encourage and motivate researchers in challenging areas of Sciences and Technology.
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATIONijnlc
Text classification is a very important research area in machine learning. Artificial Intelligence is reshaping text classification techniques to better acquire knowledge. In spite of the growth and spread of AI in text mining research for various languages such as English, Japanese, Chinese, etc., its role with respect to Myanmar text is not well understood yet. The aim of this paper is comparative study of machine learning algorithms such as Naïve Bayes (NB), k-nearest neighbours (KNN), support vector machine (SVM) algorithms for Myanmar Language News classification. There is no comparative study of machine learning algorithms in Myanmar News. The news is classified into one of four categories (political, Business, Entertainment and Sport). Dataset is collected from 12,000 documents belongs to 4 categories. Well-known algorithms are applied on collected Myanmar language News dataset from websites. The goal of text classification is to classify documents into a certain number of pre-defined categories. News corpus is used for training and testing purpose of the classifier. Feature selection method, chi square algorithm achieves comparable performance across a number of classifiers. In this paper, the experimental results also show support vector machine is better accuracy to other classification algorithms employed in this research. Due to Myanmar Language is complex, it is more important to study and understand the nature of data before proceeding into mining.
Emotion detection on social media status in Myanmar language IJECEIAES
Many social media emerged and provided services during these years. Most people, especially in Myanmar, use them to express their emotions or moods, learn subjects, sell products, read up-to-date news, and communicate with each other. Emotion detection on social users makes critical tasks in the opinion mining and sentiment analysis. This paper presents the emotion detection system on social media (Facebook) user status or post written in Myanmar (Burmese) language. Before the emotion detection process, the user posts are pre-processed under segmentation, stemming, part-of-speech (POS) tagging, and stop word removal. The system then uses our preconstructed Myanmar word-emotion Lexicon, M-Lexicon, to extract the emotion words from the segmented POS post. The system provides six types of emotion such as surprise, disgust, fear, anger, sadness, and happiness. The system applies naïve Bayes (NB) emotion classifier to examine the emotion in the case of more than two words with different emotion values are extracted. The classifiers also classify the emotion of the users on their posts. The experiment shows that the system can detect 85% accuracy in NB based emotion detection while 86% in recurrent neural network (RNN).
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Emotion Detection is one of the most emerging issues in human computer interaction. A sufficient amount
of work has been done by researchers to detect emotions from facial and audio information whereas
recognizing emotions from textual data is still a fresh and hot research area. This paper presented a
knowledge based survey on emotion detection based on textual data and the methods used for this purpose.
At the next step paper also proposed a new architecture for recognizing emotions from text document.
Proposed architecture is composed of two main parts, emotion ontology and emotion detector algorithm.
Proposed emotion detector system takes a text document and the emotion ontology as inputs and produces
one of the six emotion classes (i.e. love, joy, anger, sadness, fear and surprise) as the output.
A MODEL BASED ON SENTIMENTS ANALYSIS FOR STOCK EXCHANGE PREDICTION - CASE STU...csandit
Predicting the behavior of shares in the stock market is a complex problem, that involves variables not always known and can undergo various influences, from the collective emotion to high-profile news. Such volatility, can represent considerable financial losses for investors. In order to anticipate such changes in the market, it has been proposed various mechanisms to try to predict the behavior of an asset in the stock market, based on previously existing information.
Such mechanisms include statistical data only, without considering the collective feeling. This article, is going to use natural language processing algorithms (LPN) to determine the collective mood on assets and later with the help of the SVM algorithm to extract patterns in an attempt to predict the active behavior. Nevertheless it is important to note that such approach is not intended to be the main factor in the decision making process, but rather an aid tool, which combined with other information, can provide higher accuracy for the solution of this problem
A MODEL BASED ON SENTIMENTS ANALYSIS FOR STOCK EXCHANGE PREDICTION - CASE STU...cscpconf
Predicting the behavior of shares in the stock market is a complex problem, that involves variables not always known and can undergo various influences, from the collective emotion to the high-profile news. Such volatility can represent considerable financial losses for investors. In order to anticipate such changes in the market, it has been proposed various mechanisms to try to predict the behavior of an asset in the stock market, based on previously existing information. Such mechanisms include statistical data only, without considering the collective feeling. This article is going to use natural language processing algorithms (LPN) to determine the collective mood on assets and later with the help of the SVM algorithm to extract patterns in an
attempt to predict the active behavior. Nevertheless it is important to note that such approach is not intended to be the main factor in the decision making process, but rather an aid tool, which combined with other information, can provide higher accuracy for the solution of this problem.
Neural Network Based Context Sensitive Sentiment AnalysisEditor IJCATR
Social media communication is evolving more in these days. Social networking site is being rapidly increased in recent years, which provides platform to connect people all over the world and share their interests. The conversation and the posts available in social media are unstructured in nature. So sentiment analysis will be a challenging work in this platform. These analyses are mostly performed in machine learning techniques which are less accurate than neural network methodologies. This paper is based on sentiment classification using Competitive layer neural networks and classifies the polarity of a given text whether the expressed opinion in the text is positive or negative or neutral. It determines the overall topic of the given text. Context independent sentences and implicit meaning in the text are also considered in polarity classification.
A scalable, lexicon based technique for sentiment analysisijfcstjournal
Rapid increase in the volume of sentiment rich social media on the web has resulted in an increased
interest among researchers regarding Sentimental Analysis and opinion mining. However, with so much
social media available on the web, sentiment analysis is now considered as a big data task. Hence the
conventional sentiment analysis approaches fails to efficiently handle the vast amount of sentiment data
available now a days. The main focus of the research was to find such a technique that can efficiently
perform sentiment analysis on big data sets. A technique that can categorize the text as positive, negative
and neutral in a fast and accurate manner. In the research, sentiment analysis was performed on a large
data set of tweets using Hadoop and the performance of the technique was measured in form of speed and
accuracy. The experimental results shows that the technique exhibits very good efficiency in handling big
sentiment data sets.
Word and Sentence Level Emotion Analyzation in Telugu Blog and NewsIJCSEA Journal
Emotion analysis, a recent sub discipline at the crossroads of information retrieval and computational linguistics is becoming increasingly important from application viewpoints of affective computing.Emotion is crucial to identify as it is not open to any objective observation or verification. In this paper, emotion analysis on blog texts has been carried out for a less privileged language, Telugu and the same system has been applied on the English SemEval 2007 affect sensing corpus containing only news headlines. A set of six emotion tags, namely, happy ( ), sad ( ), anger ( ), fear ( ), surprise ( )and disgust ( ), have been selected towards this emotion detection task for reliable and semi-automatic annotation of blog and news data. Conditional Random Field (CRF) based classifier has been applied for recognizing six basic emotion tags for different words of a sentence. The classifier accuracy has been improved by arranging an equal distribution of emotional tags and non-emotional tag. A score based technique has been adopted to calculate and assign tag weights to each of the six emotion tags. A sense based scoring strategy has been applied to identify sentence level emotion scores for the six emotion tags based on the acquired word level emotion tags. Sentence level emotion tagging has been
carried out based on the maximum obtained sentence level emotion scores. Evaluation has been conducted for each emotion class separately on 200 test sentences from each of the Telugu blog and English news data. The system has resulted accuracies of 69.82% and 71.06% for happy, 70.24% and 66.42% for sad, 65.73% and 64.27% for anger, 76.01% and 69.90% for disgust, 72.19% and 73.59% for fear and 70.54% and 66.64% for surprise emotion classes on blog and news test data respectively.
Insights to Problems, Research Trend and Progress in Techniques of Sentiment ...IJECEIAES
The research-based implementations towards Sentiment analyses are about a decade old and have introduced many significant algorithms, techniques, and framework towards enhancing its performance. The applicability of sentiment analysis towards business and the political survey is quite immense. However, we strongly feel that existing progress in research towards Sentiment Analysis is not at par with the demand of massively increasing dynamic data over the pervasive environment. The degree of problems associated with opinion mining over such forms of data has been less addressed, and still, it leaves the certain major scope of research. This paper will brief about existing research trends, some important research implementation in recent times, and exploring some major open issues about sentiment analysis. We believe that this manuscript will give a progress report with the snapshot of effectiveness borne by the research techniques towards sentiment analysis to further assist the upcoming researcher to identify and pave their research work in a perfect direction towards considering research gap.
Dialectal Arabic sentiment analysis based on tree-based pipeline optimizatio...IJECEIAES
The heavy involvement of the Arabic internet users resulted in spreading data written in the Arabic language and creating a vast research area regarding natural language processing (NLP). Sentiment analysis is a growing field of research that is of great importance to everyone considering the high added potential for decision-making and predicting upcoming actions using the texts produced in social networks. Arabic used in microblogging websites, especially Twitter, is highly informal. It is not compliant with neither standards nor spelling regulations making it quite challenging for automatic machine-learning techniques. In this paper’s scope, we propose a new approach based on AutoML methods to improve the efficiency of the sentiment classification process for dialectal Arabic. This approach was validated through benchmarks testing on three different datasets that represent three vernacular forms of Arabic. The obtained results show that the presented framework has significantly increased accuracy than similar works in the literature.
Big five personality prediction based in Indonesian tweets using machine lea...IJECEIAES
The popularity of social media has drawn the attention of researchers who have conducted cross-disciplinary studies examining the relationship between personality traits and behavior on social media. Most current work focuses on personality prediction analysis of English texts, but Indonesian has received scant attention. Therefore, this research aims to predict user’s personalities based on Indonesian text from social media using machine learning techniques. This paper evaluates several machine learning techniques, including naive Bayes (NB), K-nearest neighbors (KNN), and support vector machine (SVM), based on semantic features including emotion, sentiment, and publicly available Twitter profile. We predict the personality based on the Big Five personality model, the most appropriate model for predicting user personality in social media. We examine the relationships between the semantic features and the Big Five personality dimensions. The experimental results indicate that the Big Five personality exhibit distinct emotional, sentimental, and social characteristics and that SVM outperformed NB and KNN for Indonesian. In addition, we observe several terms in Indonesian that specifically refer to each personality type, each of which has distinct emotional, sentimental, and social features.
Similar to Evaluation of Support Vector Machine and Decision Tree for Emotion Recognition of Malay Folklores (20)
Square transposition: an approach to the transposition process in block cipherjournalBEEI
The transposition process is needed in cryptography to create a diffusion effect on data encryption standard (DES) and advanced encryption standard (AES) algorithms as standard information security algorithms by the National Institute of Standards and Technology. The problem with DES and AES algorithms is that their transposition index values form patterns and do not form random values. This condition will certainly make it easier for a cryptanalyst to look for a relationship between ciphertexts because some processes are predictable. This research designs a transposition algorithm called square transposition. Each process uses square 8 × 8 as a place to insert and retrieve 64-bits. The determination of the pairing of the input scheme and the retrieval scheme that have unequal flow is an important factor in producing a good transposition. The square transposition can generate random and non-pattern indices so that transposition can be done better than DES and AES.
Hyper-parameter optimization of convolutional neural network based on particl...journalBEEI
Deep neural networks have accomplished enormous progress in tackling many problems. More specifically, convolutional neural network (CNN) is a category of deep networks that have been a dominant technique in computer vision tasks. Despite that these deep neural networks are highly effective; the ideal structure is still an issue that needs a lot of investigation. Deep Convolutional Neural Network model is usually designed manually by trials and repeated tests which enormously constrain its application. Many hyper-parameters of the CNN can affect the model performance. These parameters are depth of the network, numbers of convolutional layers, and numbers of kernels with their sizes. Therefore, it may be a huge challenge to design an appropriate CNN model that uses optimized hyper-parameters and reduces the reliance on manual involvement and domain expertise. In this paper, a design architecture method for CNNs is proposed by utilization of particle swarm optimization (PSO) algorithm to learn the optimal CNN hyper-parameters values. In the experiment, we used Modified National Institute of Standards and Technology (MNIST) database of handwritten digit recognition. The experiments showed that our proposed approach can find an architecture that is competitive to the state-of-the-art models with a testing error of 0.87%.
Supervised machine learning based liver disease prediction approach with LASS...journalBEEI
In this contemporary era, the uses of machine learning techniques are increasing rapidly in the field of medical science for detecting various diseases such as liver disease (LD). Around the globe, a large number of people die because of this deadly disease. By diagnosing the disease in a primary stage, early treatment can be helpful to cure the patient. In this research paper, a method is proposed to diagnose the LD using supervised machine learning classification algorithms, namely logistic regression, decision tree, random forest, AdaBoost, KNN, linear discriminant analysis, gradient boosting and support vector machine (SVM). We also deployed a least absolute shrinkage and selection operator (LASSO) feature selection technique on our taken dataset to suggest the most highly correlated attributes of LD. The predictions with 10 fold cross-validation (CV) made by the algorithms are tested in terms of accuracy, sensitivity, precision and f1-score values to forecast the disease. It is observed that the decision tree algorithm has the best performance score where accuracy, precision, sensitivity and f1-score values are 94.295%, 92%, 99% and 96% respectively with the inclusion of LASSO. Furthermore, a comparison with recent studies is shown to prove the significance of the proposed system.
A secure and energy saving protocol for wireless sensor networksjournalBEEI
The research domain for wireless sensor networks (WSN) has been extensively conducted due to innovative technologies and research directions that have come up addressing the usability of WSN under various schemes. This domain permits dependable tracking of a diversity of environments for both military and civil applications. The key management mechanism is a primary protocol for keeping the privacy and confidentiality of the data transmitted among different sensor nodes in WSNs. Since node's size is small; they are intrinsically limited by inadequate resources such as battery life-time and memory capacity. The proposed secure and energy saving protocol (SESP) for wireless sensor networks) has a significant impact on the overall network life-time and energy dissipation. To encrypt sent messsages, the SESP uses the public-key cryptography’s concept. It depends on sensor nodes' identities (IDs) to prevent the messages repeated; making security goals- authentication, confidentiality, integrity, availability, and freshness to be achieved. Finally, simulation results show that the proposed approach produced better energy consumption and network life-time compared to LEACH protocol; sensors are dead after 900 rounds in the proposed SESP protocol. While, in the low-energy adaptive clustering hierarchy (LEACH) scheme, the sensors are dead after 750 rounds.
Plant leaf identification system using convolutional neural networkjournalBEEI
This paper proposes a leaf identification system using convolutional neural network (CNN). This proposed system can identify five types of local Malaysia leaf which were acacia, papaya, cherry, mango and rambutan. By using CNN from deep learning, the network is trained from the database that acquired from leaf images captured by mobile phone for image classification. ResNet-50 was the architecture has been used for neural networks image classification and training the network for leaf identification. The recognition of photographs leaves requested several numbers of steps, starting with image pre-processing, feature extraction, plant identification, matching and testing, and finally extracting the results achieved in MATLAB. Testing sets of the system consists of 3 types of images which were white background, and noise added and random background images. Finally, interfaces for the leaf identification system have developed as the end software product using MATLAB app designer. As a result, the accuracy achieved for each training sets on five leaf classes are recorded above 98%, thus recognition process was successfully implemented.
Customized moodle-based learning management system for socially disadvantaged...journalBEEI
This study aims to develop Moodle-based LMS with customized learning content and modified user interface to facilitate pedagogical processes during covid-19 pandemic and investigate how teachers of socially disadvantaged schools perceived usability and technology acceptance. Co-design process was conducted with two activities: 1) need assessment phase using an online survey and interview session with the teachers and 2) the development phase of the LMS. The system was evaluated by 30 teachers from socially disadvantaged schools for relevance to their distance learning activities. We employed computer software usability questionnaire (CSUQ) to measure perceived usability and the technology acceptance model (TAM) with insertion of 3 original variables (i.e., perceived usefulness, perceived ease of use, and intention to use) and 5 external variables (i.e., attitude toward the system, perceived interaction, self-efficacy, user interface design, and course design). The average CSUQ rating exceeded 5.0 of 7 point-scale, indicated that teachers agreed that the information quality, interaction quality, and user interface quality were clear and easy to understand. TAM results concluded that the LMS design was judged to be usable, interactive, and well-developed. Teachers reported an effective user interface that allows effective teaching operations and lead to the system adoption in immediate time.
Understanding the role of individual learner in adaptive and personalized e-l...journalBEEI
Dynamic learning environment has emerged as a powerful platform in a modern e-learning system. The learning situation that constantly changing has forced the learning platform to adapt and personalize its learning resources for students. Evidence suggested that adaptation and personalization of e-learning systems (APLS) can be achieved by utilizing learner modeling, domain modeling, and instructional modeling. In the literature of APLS, questions have been raised about the role of individual characteristics that are relevant for adaptation. With several options, a new problem has been raised where the attributes of students in APLS often overlap and are not related between studies. Therefore, this study proposed a list of learner model attributes in dynamic learning to support adaptation and personalization. The study was conducted by exploring concepts from the literature selected based on the best criteria. Then, we described the results of important concepts in student modeling and provided definitions and examples of data values that researchers have used. Besides, we also discussed the implementation of the selected learner model in providing adaptation in dynamic learning.
Prototype mobile contactless transaction system in traditional markets to sup...journalBEEI
One way to prevent and reduce the spread of the covid-19 pandemic is through physical distancing program. This research aims to develop a prototype contactless transaction system using digital payment mechanisms and QR code technology that will be applied in traditional markets. The method used in the development of electronic market systems is a prototype approach. The application of QR code and digital payments are used as a solution to minimize money exchange contacts that are common in traditional markets. The results showed that the system built was able to accelerate and facilitate the buying and selling transaction process in traditional market environment. Alpha testing shows that all functional systems are running well. Meanwhile, beta testing shows that the user can very well accept the system that was built. The results of the study also show acceptance of the usefulness of the system being built, as well as the optimism of its users to be able to take advantage of this system both technologically and functionally, so its can be a part of the digital transformation of the traditional market to the electronic market and has become one of the solutions in reducing the spread of the current covid-19 pandemic.
Wireless HART stack using multiprocessor technique with laxity algorithmjournalBEEI
The use of a real-time operating system is required for the demarcation of industrial wireless sensor network (IWSN) stacks (RTOS). In the industrial world, a vast number of sensors are utilised to gather various types of data. The data gathered by the sensors cannot be prioritised ahead of time. Because all of the information is equally essential. As a result, a protocol stack is employed to guarantee that data is acquired and processed fairly. In IWSN, the protocol stack is implemented using RTOS. The data collected from IWSN sensor nodes is processed using non-preemptive scheduling and the protocol stack, and then sent in parallel to the IWSN's central controller. The real-time operating system (RTOS) is a process that occurs between hardware and software. Packets must be sent at a certain time. It's possible that some packets may collide during transmission. We're going to undertake this project to get around this collision. As a prototype, this project is divided into two parts. The first uses RTOS and the LPC2148 as a master node, while the second serves as a standard data collection node to which sensors are attached. Any controller may be used in the second part, depending on the situation. Wireless HART allows two nodes to communicate with each other.
Implementation of double-layer loaded on octagon microstrip yagi antennajournalBEEI
A double-layer loaded on the octagon microstrip yagi antenna (OMYA) at 5.8 GHz industrial, scientific and medical (ISM) Band is investigated in this paper. The double-layer consist of two double positive (DPS) substrates. The OMYA is overlaid with a double-layer configuration were simulated, fabricated and measured. A good agreement was observed between the computed and measured results of the gain for this antenna. According to comparison results, it shows that 2.5 dB improvement of the OMYA gain can be obtained by applying the double-layer on the top of the OMYA. Meanwhile, the bandwidth of the measured OMYA with the double-layer is 14.6%. It indicates that the double-layer can be used to increase the OMYA performance in term of gain and bandwidth.
The calculation of the field of an antenna located near the human headjournalBEEI
In this work, a numerical calculation was carried out in one of the universal programs for automatic electro-dynamic design. The calculation is aimed at obtaining numerical values for specific absorbed power (SAR). It is the SAR value that can be used to determine the effect of the antenna of a wireless device on biological objects; the dipole parameters will be selected for GSM1800. Investigation of the influence of distance to a cell phone on radiation shows that absorbed in the head of a person the effect of electromagnetic radiation on the brain decreases by three times this is a very important result the SAR value has decreased by almost three times it is acceptable results.
Exact secure outage probability performance of uplinkdownlink multiple access...journalBEEI
In this paper, we study uplink-downlink non-orthogonal multiple access (NOMA) systems by considering the secure performance at the physical layer. In the considered system model, the base station acts a relay to allow two users at the left side communicate with two users at the right side. By considering imperfect channel state information (CSI), the secure performance need be studied since an eavesdropper wants to overhear signals processed at the downlink. To provide secure performance metric, we derive exact expressions of secrecy outage probability (SOP) and and evaluating the impacts of main parameters on SOP metric. The important finding is that we can achieve the higher secrecy performance at high signal to noise ratio (SNR). Moreover, the numerical results demonstrate that the SOP tends to a constant at high SNR. Finally, our results show that the power allocation factors, target rates are main factors affecting to the secrecy performance of considered uplink-downlink NOMA systems.
Design of a dual-band antenna for energy harvesting applicationjournalBEEI
This report presents an investigation on how to improve the current dual-band antenna to enhance the better result of the antenna parameters for energy harvesting application. Besides that, to develop a new design and validate the antenna frequencies that will operate at 2.4 GHz and 5.4 GHz. At 5.4 GHz, more data can be transmitted compare to 2.4 GHz. However, 2.4 GHz has long distance of radiation, so it can be used when far away from the antenna module compare to 5 GHz that has short distance in radiation. The development of this project includes the scope of designing and testing of antenna using computer simulation technology (CST) 2018 software and vector network analyzer (VNA) equipment. In the process of designing, fundamental parameters of antenna are being measured and validated, in purpose to identify the better antenna performance.
Transforming data-centric eXtensible markup language into relational database...journalBEEI
eXtensible markup language (XML) appeared internationally as the format for data representation over the web. Yet, most organizations are still utilising relational databases as their database solutions. As such, it is crucial to provide seamless integration via effective transformation between these database infrastructures. In this paper, we propose XML-REG to bridge these two technologies based on node-based and path-based approaches. The node-based approach is good to annotate each positional node uniquely, while the path-based approach provides summarised path information to join the nodes. On top of that, a new range labelling is also proposed to annotate nodes uniquely by ensuring the structural relationships are maintained between nodes. If a new node is to be added to the document, re-labelling is not required as the new label will be assigned to the node via the new proposed labelling scheme. Experimental evaluations indicated that the performance of XML-REG exceeded XMap, XRecursive, XAncestor and Mini-XML concerning storing time, query retrieval time and scalability. This research produces a core framework for XML to relational databases (RDB) mapping, which could be adopted in various industries.
Key performance requirement of future next wireless networks (6G)journalBEEI
Given the massive potentials of 5G communication networks and their foreseeable evolution, what should there be in 6G that is not in 5G or its long-term evolution? 6G communication networks are estimated to integrate the terrestrial, aerial, and maritime communications into a forceful network which would be faster, more reliable, and can support a massive number of devices with ultra-low latency requirements. This article presents a complete overview of potential 6G communication networks. The major contribution of this study is to present a broad overview of key performance indicators (KPIs) of 6G networks that cover the latest manufacturing progress in the environment of the principal areas of research application, and challenges.
Noise resistance territorial intensity-based optical flow using inverse confi...journalBEEI
This paper presents the use of the inverse confidential technique on bilateral function with the territorial intensity-based optical flow to prove the effectiveness in noise resistance environment. In general, the image’s motion vector is coded by the technique called optical flow where the sequences of the image are used to determine the motion vector. But, the accuracy rate of the motion vector is reduced when the source of image sequences is interfered by noises. This work proved that the inverse confidential technique on bilateral function can increase the percentage of accuracy in the motion vector determination by the territorial intensity-based optical flow under the noisy environment. We performed the testing with several kinds of non-Gaussian noises at several patterns of standard image sequences by analyzing the result of the motion vector in a form of the error vector magnitude (EVM) and compared it with several noise resistance techniques in territorial intensity-based optical flow method.
Modeling climate phenomenon with software grids analysis and display system i...journalBEEI
This study aims to model climate change based on rainfall, air temperature, pressure, humidity and wind with grADS software and create a global warming module. This research uses 3D model, define, design, and develop. The results of the modeling of the five climate elements consist of the annual average temperature in Indonesia in 2009-2015 which is between 29oC to 30.1oC, the horizontal distribution of the annual average pressure in Indonesia in 2009-2018 is between 800 mBar to 1000 mBar, the horizontal distribution the average annual humidity in Indonesia in 2009 and 2011 ranged between 27-57, in 2012-2015, 2017 and 2018 it ranged between 30-60, during the East Monsoon, the wind circulation moved from northern Indonesia to the southern region Indonesia. During the west monsoon, the wind circulation moves from the southern part of Indonesia to the northern part of Indonesia. The global warming module for SMA/MA produced is feasible to use, this is in accordance with the value given by the validate of 69 which is in the appropriate category and the response of teachers and students through a 91% questionnaire.
An approach of re-organizing input dataset to enhance the quality of emotion ...journalBEEI
The purpose of this paper is to propose an approach of re-organizing input data to recognize emotion based on short signal segments and increase the quality of emotional recognition using physiological signals. MIT's long physiological signal set was divided into two new datasets, with shorter and overlapped segments. Three different classification methods (support vector machine, random forest, and multilayer perceptron) were implemented to identify eight emotional states based on statistical features of each segment in these two datasets. By re-organizing the input dataset, the quality of recognition results was enhanced. The random forest shows the best classification result among three implemented classification methods, with an accuracy of 97.72% for eight emotional states, on the overlapped dataset. This approach shows that, by re-organizing the input dataset, the high accuracy of recognition results can be achieved without the use of EEG and ECG signals.
Parking detection system using background subtraction and HSV color segmentationjournalBEEI
Manual system vehicle parking makes finding vacant parking lots difficult, so it has to check directly to the vacant space. If many people do parking, then the time needed for it is very much or requires many people to handle it. This research develops a real-time parking system to detect parking. The system is designed using the HSV color segmentation method in determining the background image. In addition, the detection process uses the background subtraction method. Applying these two methods requires image preprocessing using several methods such as grayscaling, blurring (low-pass filter). In addition, it is followed by a thresholding and filtering process to get the best image in the detection process. In the process, there is a determination of the ROI to determine the focus area of the object identified as empty parking. The parking detection process produces the best average accuracy of 95.76%. The minimum threshold value of 255 pixels is 0.4. This value is the best value from 33 test data in several criteria, such as the time of capture, composition and color of the vehicle, the shape of the shadow of the object’s environment, and the intensity of light. This parking detection system can be implemented in real-time to determine the position of an empty place.
Quality of service performances of video and voice transmission in universal ...journalBEEI
The universal mobile telecommunications system (UMTS) has distinct benefits in that it supports a wide range of quality of service (QoS) criteria that users require in order to fulfill their requirements. The transmission of video and audio in real-time applications places a high demand on the cellular network, therefore QoS is a major problem in these applications. The ability to provide QoS in the UMTS backbone network necessitates an active QoS mechanism in order to maintain the necessary level of convenience on UMTS networks. For UMTS networks, investigation models for end-to-end QoS, total transmitted and received data, packet loss, and throughput providing techniques are run and assessed and the simulation results are examined. According to the results, appropriate QoS adaption allows for specific voice and video transmission. Finally, by analyzing existing QoS parameters, the QoS performance of 4G/UMTS networks may be improved.
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdfKamal Acharya
The College Bus Management system is completely developed by Visual Basic .NET Version. The application is connect with most secured database language MS SQL Server. The application is develop by using best combination of front-end and back-end languages. The application is totally design like flat user interface. This flat user interface is more attractive user interface in 2017. The application is gives more important to the system functionality. The application is to manage the student’s details, driver’s details, bus details, bus route details, bus fees details and more. The application has only one unit for admin. The admin can manage the entire application. The admin can login into the application by using username and password of the admin. The application is develop for big and small colleges. It is more user friendly for non-computer person. Even they can easily learn how to manage the application within hours. The application is more secure by the admin. The system will give an effective output for the VB.Net and SQL Server given as input to the system. The compiled java program given as input to the system, after scanning the program will generate different reports. The application generates the report for users. The admin can view and download the report of the data. The application deliver the excel format reports. Because, excel formatted reports is very easy to understand the income and expense of the college bus. This application is mainly develop for windows operating system users. In 2017, 73% of people enterprises are using windows operating system. So the application will easily install for all the windows operating system users. The application-developed size is very low. The application consumes very low space in disk. Therefore, the user can allocate very minimum local disk space for this application.
Final project report on grocery store management system..pdfKamal Acharya
In today’s fast-changing business environment, it’s extremely important to be able to respond to client needs in the most effective and timely manner. If your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website, which retails various grocery products. This project allows viewing various products available enables registered users to purchase desired products instantly using Paytm, UPI payment processor (Instant Pay) and also can place order by using Cash on Delivery (Pay Later) option. This project provides an easy access to Administrators and Managers to view orders placed using Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of Technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming language (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. This is a project with the objective to develop a basic website where a consumer is provided with a shopping cart website and also to know about the technologies used to develop such a website.
This document will discuss each of the underlying technologies to create and implement an e- commerce website.
Event Management System Vb Net Project Report.pdfKamal Acharya
In present era, the scopes of information technology growing with a very fast .We do not see any are untouched from this industry. The scope of information technology has become wider includes: Business and industry. Household Business, Communication, Education, Entertainment, Science, Medicine, Engineering, Distance Learning, Weather Forecasting. Carrier Searching and so on.
My project named “Event Management System” is software that store and maintained all events coordinated in college. It also helpful to print related reports. My project will help to record the events coordinated by faculties with their Name, Event subject, date & details in an efficient & effective ways.
In my system we have to make a system by which a user can record all events coordinated by a particular faculty. In our proposed system some more featured are added which differs it from the existing system such as security.
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks
(CNN)s, to adversarial attacks and presents a proactive training technique designed to counter them. We
introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations.
When combined with 3D convolution and deep curriculum learning optimization (CLO), itsignificantly improves
the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach
using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10
and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing
accuracy improvements over previous techniques. The results indicate that the combination of the volumetric
input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating
adversary training.
Overview of the fundamental roles in Hydropower generation and the components involved in wider Electrical Engineering.
This paper presents the design and construction of hydroelectric dams from the hydrologist’s survey of the valley before construction, all aspects and involved disciplines, fluid dynamics, structural engineering, generation and mains frequency regulation to the very transmission of power through the network in the United Kingdom.
Author: Robbie Edward Sayers
Collaborators and co editors: Charlie Sims and Connor Healey.
(C) 2024 Robbie E. Sayers
Student information management system project report ii.pdfKamal Acharya
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Democratizing Fuzzing at Scale by Abhishek Aryaabh.arya
Presented at NUS: Fuzzing and Software Security Summer School 2024
This keynote talks about the democratization of fuzzing at scale, highlighting the collaboration between open source communities, academia, and industry to advance the field of fuzzing. It delves into the history of fuzzing, the development of scalable fuzzing platforms, and the empowerment of community-driven research. The talk will further discuss recent advancements leveraging AI/ML and offer insights into the future evolution of the fuzzing landscape.
Saudi Arabia stands as a titan in the global energy landscape, renowned for its abundant oil and gas resources. It's the largest exporter of petroleum and holds some of the world's most significant reserves. Let's delve into the top 10 oil and gas projects shaping Saudi Arabia's energy future in 2024.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Quality defects in TMT Bars, Possible causes and Potential Solutions.PrashantGoswami42
Maintaining high-quality standards in the production of TMT bars is crucial for ensuring structural integrity in construction. Addressing common defects through careful monitoring, standardized processes, and advanced technology can significantly improve the quality of TMT bars. Continuous training and adherence to quality control measures will also play a pivotal role in minimizing these defects.
2. ISSN: 2302-9285
BEEI, Vol. 7, No. 3, September 2018 : 479 – 486
480
folklores to be used in an automated storytelling speech synthesis for the Malay language. In our earlier work
of storytelling speech synthesis [8], [9], we have developed a storytelling speech synthesizer that is able to
synthesize stories from text using a specific storytelling model. However, the synthesizer lacks emotions and
also requires a specific storytelling model for a story. Therefore, an emotion recognition engine is essential
prior to storytelling speech synthesis to automatically generate the emotions for the synthesizer. Once the
emotion is classified, the synthesizer should then construct the emotion models to produce the emotions
intonation.
Most works of text-based emotion recognition were done using the English language. In another
work by [10], six languages that are English, Spanish, Czech, German, Czech2 and German2 are compared in
emotion recognition using text. Since emotion recognition using text is language dependent, a different
approach for text pre-processing and classification may be needed. As stated by [11]-[12], different
languages have different language structures and not all text pre-processing used in English may be
employed in other languages. Furthermore, even though some work on textual emotion recognition in the
Malay language has been done, most of them [9], [13]-[14] focused on sentiment analysis using informal
languages. Another known work by [15]-[16] used classical literature that is poems and proverbs. As children
folklores are consist of simple formal language, we intend to investigate the use of common text feature
extraction technique and classifiers to recognize the emotions. This paper is organized as such: Section 1
describes the motivation of this research supported by related literature in Section 2. In Section 3, the
emotion recognition methodology is presented followed by the results and discussion of findings in Section
4. Finally, a conclusion and further work are deliberated in Section 5.
2. RELATED WORK
Since the main aim of this paper is to investigate the use of popular methods of feature extraction
and classification for the task of textual emotion recognition in the Malay language, the literature on the
existing methods are reviewed.
2.1. Text Feature Extraction
Feature extraction is the most important process before classifying the emotion from the text
documents. Feature extraction techniques aim to represent the emotional value of the text that will help to
classify the emotions into the correct category. There are many textual feature extraction methods such as
sentiment analysis, Term Frequency-Inverse Document Frequency (TF-IDF), and unigrams.
Sentiment analysis is widely used to extract emotion recognition from a text document. It tries to
understand the attitudes, opinion and emotion in the text by classifying it into either positive, negative or
neutral. This technique is widely used to analyze attitudes, moods, and temperaments in social media, user
profiling, news articles, and forum discussions. In [17], a survey was conducted showing the popularity of
sentiment analysis used to extract emotions from text has been conducted regarding sentiment analysis.
Sentiment analysis has also been used at sentence-, document-, aspect-, and user-levels to help extract
opinions and emotions [18]. When sentiment analysis is used with natural language processing and machine
learning, accurate sentiment results can be achieved. However, sentiment analysis techniques are mostly used
to mine opinions from informal text documents comprising mainly spontaneous written speeches.
Term Frequency-Inverse Document Frequency (TF-IDF) is one of the most used text feature
extraction technique as it provides a good insight into the important features of the text documents. TF-IDF is
used by [19] to extract features from Malay poetry text documents. Other works that employed TF-IDF as
feature extraction techniques are [20] and [21] where several classifiers are compared to classify emotions
from Thai YouTube comments and Indonesian text documents. In [22], TF-IDF is also used to categorize
relevant words in text documents to enhance query retrieval. This simple feature extractor is favoured by
many due to its simplicity, robustness and is ideal for short text documents [20],[21]. Coupled with stop
words, TF-IDF has shown to improve the classification of emotions from text documents. In this paper, TF-
IDF is chosen as the feature extraction technique.
2.2. Text-based Emotion Classification
After the text features are extracted, the classification of these features is done to categorize the
emotions into several categories such as happy, angry, sad or fearful. In this section, we review several
classifiers that were used in textual emotion recognition in the literature. Li et al. [20] performed social
emotion detection on short texts of news headlines and sentences (less than 4 words) using the hybrid neural
network (HNN). Their method outperformed outperforms the baselines of SWAT used in SemEval-2007,
Emotion Term method, Emotion Topic model, Multi-label supervised topic model, Sentiment Latent Topic
3. BEEI ISSN: 2302-9285
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognition... (Mastura Md Saad)
481
model, and Affective topic model. Even though HNN shows promising results, the model is complicated and
difficult to implement given our limited data.
In [21], four classifiers which are Naïve Bayes, K-Nearest Neighbour (KNN), Support Vector
Machine (SVM) and Machine-Sequential Minimal Optimization (SVM-SMO) are compared to recognize
emotions in Indonesian folklore. One thousand documents ranging from 1-3 sentences are collected and the
emotion of each sentence is labelled using the WordNet Affect List. The highest accuracy is achieved by
SVM-SMO, followed by SVM, Naïve Bayes and KNN. Sarakit et al. [21] also compared three classification
methods to categorize emotions from Thai language YouTube comments. A total of 2,771 comments from
music videos and 3,077 comments from commercial advertisements are manually annotated and used as the
experimental datasets. Naïve Bayes, SVM and decision tree classifiers are further used to recognize the
emotions from the Thai comments. SVM outperformed Naïve Bayes and decision tree classifiers achieving
an accuracy rate of 82.28%. Further work such as [1] has also shown that SVM produced better accuracy in
the classification of emotions in 1000 news headlines from CNN and Google news. An SVM model is able to
outdo three other systems that participated in the SemEval 2007 emotion annotation task. Literature
suggested that SVM is a suitable classifier for emotion recognition from textual documents.
Another popular classifier is Decision Tree (DT) that is usually used in bioinformatics [6], data
mining [23], and capturing knowledge in the expert system. DT offers flexibility and robustness due to its
transparent nature by providing possible alternatives [24],[25]. The most important thing is that decision tree
classification can reduce the ambiguity in decision making which leads it to a better classification. In [24],
DT achieved an accuracy rate of 84.37%. In this paper, we compared SVM and DT performance to classify
emotions into four categories that are happy, angry, fearful and sad.
3. METHODOLOGY
The main stages of textual emotion recognition are data collection, text pre-processing, feature
extraction and emotion classification. Each stage is discussed further in this section.
3.1. Data Collection
The dataset used in this paper consists of Malay children short stories. The stories are collected from
"Ollie Si Gajah" and "200 Kisah Teladan Haiwan". Only stories in dialogue form are selected because
emotions are easily expressed in dialogue compared to narrations. A total of more than 200 short stories are
collected, each story ranging from 20-50 words. Examples of two short stories are given in Table 1. The short
stories are further broken down into sentences or phrases for emotion annotation. At this point onwards, each
sentence or phrase is referred to as a document.
Table 1. Examples of Short Stories in our Datasets
No. Short Story (in Malay) Short Story (in English) Story Title
1
Tolong! Tolong!
Tidak ada sesiapa yang mahu menolong saya.
Saya terpaksa tinggal disini sehingga beberapa
hari sehingga badan saya kurus.
Help! Help!
Nobody wants to help me.
I have to stay here for a few days
until I got thinner.
Musang yang Tamak
The Greedy Fox
2
Baik aku bersembunyi di dalam kandang lembu
itu.
Apakah yang kamu buat di sini?
Tolonglah saya
Saya diburu oleh seekor anjing pemburu
Saya ingin bersembunyi di dalam kandang kamu.
It’s better for me to hide in the cow
barn.
What are you doing here?
Help me
I was hunted by a dog hunter
I want to hide inside your barn
Rusa yang Malang
The Unlucky Deer
3.2. Pre-processing
Pre-processing stage involves stop-word removal and stemming. Stopword is a common pre-
processing process that filters out the meaningless or unnecessary words from each document [26]. Example
of stopwords in English is such as ‘is', ‘for', and ‘to'. Meanwhile, examples of Malay language stop words are
‘ada', ‘boleh', ‘tidak', ‘kamu', and ‘yang'. For our work, we added ‘si', ‘sang', ‘yang','adalah', ‘kau' and ‘aku'
into the collections of stop words done by [27]. Next, the documents are stemmed using a Malay language
stemmer to remove inflected words such as ‘an', ‘kan', ‘men','meng', ‘ter', ‘pe','per' and ‘ke', subsequently
producing root words. For example in English, ‘banks' is stemmed as ‘bank' while for the Malay word
‘termakan' is stemmed as ‘makan'. Figure 1 displays some examples of stopword removals and stemming
done on three documents. Once the stop words are removed from the documents and the words in the
documents are stemmed, emotion annotation and text feature extraction are done.
4. ISSN: 2302-9285
BEEI, Vol. 7, No. 3, September 2018 : 479 – 486
482
Figure 1. Stopword removal, stemming and emotion annotation
3.3. Emotion Annotation
The next step is to create a ground truth dataset for the classification experiment. For this purpose,
we hired a human annotator from a Language Academy to manually label the emotional states of all the
documents and selected 100 documents. Each document is labelled using the words contained in the
documents. For example, a document "Tiada sesiapa yang mahu menolong saya” is pre-processed producing
the words “siapa” dan “tolong". These words are categorized as sad, thus the document is labelled as a sad
emotion. If there are contradicting labelled emotions in the document, the highest frequencies of the
emotional labelled words are used to determine the sentence's emotion. Out of the 100 documents, 25
documents are classified as Sad, 25 as Fear, 25 as Angry and 25 as Happy emotions. In the last column of
Figure 2, the emotions of the documents are given. For classification purpose, eighty (80) % of the total
documents are used for training and another 20% of the collected documents are used for testing. Figure 2
shows examples of TF-IDF of several words in the document. The emotion category of each word is also
stated in the last column.
Figure 2. TF-IDF and its corresponding emotions
3.4. Text Feature Extraction
Term Frequency-Inverse Document Frequency (TF-IDF) is a text mining technique used to extract
features from a text. TF-IDF measures how important the words are in the documents. Calculation of TF-IDF
is shown as in equation 1. Term frequency will measure how frequent the words appear in a document. This
is because every document has different length of words while inverse document frequency is to measure
how frequent the word appears for all documents and all terms are considered important.
(1)
5. BEEI ISSN: 2302-9285
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognition... (Mastura Md Saad)
483
where; fij=frequency of term i in document
tfij=
dfi=No. of docs containing term i
idfi=idf of term i=log2 (
N=Total no. of docs
3.5. Classification
Support Vector Machine (SVM) is a supervised machine algorithm and commonly used in
classification and regression challenges. It plots each data items as a point in n-dimensional space which
represents the numbers of features. Then, it will use hyper-plane to differentiate between features and class of
emotion. The SVM model type that is used in this training is Fine Gaussian SVM, with kernel scale of 0.43
and box constraint level is 1. Figure 3 shows the Support Vector Machine basic flow diagram of the emotion
classification, where 80% is used as training dataset and 20% as testing dataset.
Figure 3. Support Vector Machine diagram
Decision Tree (DT) is a form of a tree structure used in classification and regression model. It works
by breaking down the datasets into smaller subsets, incrementally developed them into nodes and leaves. The
branches of the decision tree represent the category of the datasets. In this paper, we used DT of Complex
Tree model type with the maximum number of splits set to 100. Split criterion is Gini’s diversity index and
the surrogate decision splits if off. The decision tree is split into 4 emotion classes: happy, sad, angry and
fear, The goal of the decisions tree is to ensure it achieved maximum separation among classes at each level.
Figure 4 shows the decision tree framework which is applied to four categories of emotion classes.
Figure 4. Decision Tree diagram
Extract feature from each document, D
Form set of feature vectors document, D
Training set at 80% to build Multi-class SVM
Testing set at 20%
Multi-class SVM
Classified emotions
Emotion dataset
No
Yes
No
Class 1
No
Yes
Class 2 Class 3 Class 4
Yes No
No Yes No
Yes No Yes No
Yes No Yes No
Yes No Yes No
Yes
6. ISSN: 2302-9285
BEEI, Vol. 7, No. 3, September 2018 : 479 – 486
484
4. RESULTS AND ANALYSIS
Results of the SVM and DT classifications are presented based on training and testing datasets.
Training dataset comprises 80 documents with a total of 320 words, while testing dataset is consists of 20
documents of 80 words. In Table 2, the results are presented and the findings are discussed. In this paper, we
use recall, precision, F-measure and confusion matrix to measure the performance of the emotion
classification. Precision also called positive predictive value is the number of documents correctly labelled as
belonging to the positive class. On the other hand, recall or sensitivity is the number of documents which are
not labelled as belonging to the positive class but should have been. Another measurement that combines
recall and precision is F-measure. F-Measure indicates how precise the classifier is (how many instances are
correctly classified) as well as its robustness (it does not miss a significant number of instances). The final
measure is accuracy that refers to how well a given classifier works in classifying the document. Calculations
of all the measurements are given in Equation 2 to 5.
(2)
(3)
(4)
(4)
Table 2. Support Vector Machine and Decision Tree Classification Results
Classification Method Accuracy (%) Precision (%) Recall (%) F-measure (%)
Training Results
Support Vector Machine 36.9 36.11 32.5 34.44
Decision Tree 53.1 28.75 28.75 28.75
Testing Results
Support Vector Machine 30.0 14.41 12.5 17.65
Decision Tree 62.5 23.72 25 23.32
Overall, DT classified a document better than SVM both using training and testing datasets by
achieving 53.1% accuracy as compared to SVM at an accuracy of 36.9% during training and 62.5% and 30%
in testing, respectively. This indicates that DT classifies the documents better than SVM. Recall, precision
and F-measure are also calculated to further support the performance of the classification. As can be seen
from Table 2, recall, precision and F-measure of DT outperformed SVM during testing. This indicates that
DT has a higher sensitivity than SVM, precisely classified documents better and is more robust than SVM.
However, results of the training dataset interestingly showed SVM achieved a higher percentage for recall,
precision and F-measure. To understand the results better, we analyzed the emotions based on each emotion
classification on the testing dataset.
Table 3 shows the performance evaluations of DT and SVM based on each emotion class. Using
DT, happy emotion achieved the highest accuracy and performed moderately well for precision, recall and F-
measure. This is followed by angry, sad and fear emotions. Similar to happy emotion, precision and recall of
sad and angry emotions scored equally moderate. This implies that DT is able to correctly classify the happy,
angry and sad moderately. However, fear emotion has a higher recall rate but a low precision rate. This
indicates a high false positive rate for fear emotion. Upon further analysis of SVM, it shows that SVM
performed miserably for fear and angry emotions. Happy emotion achieved the highest accuracy rate
followed by a sad emotion. Their respective recall and precision rates are also equally moderate. In terms of
classifying emotions into their respective classes, fear emotion seemed to be the most difficult emotion.
However, for the other emotions, no conclusive findings can be drawn from the results. A confusion matrix is
constructed for the testing dataset to further understand the emotion classifications. The matrix is shown in
Table 4.
7. BEEI ISSN: 2302-9285
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognition... (Mastura Md Saad)
485
Table 3. Performance Evaluations based on Emotions
Emotion Accuracy (%) Precision (%) Recall (%) F-measure (%)
Decision Tree
Fear 52.5 25.0 45.0 32.14
Sad 62.5 14.29 10.0 11.77
Angry 65.0 16.67 10.0 12.5
Happy 70.0 38.9 35.0 36.85
Support Vector Machine
Fear 0 0 0 0
Sad 50.0 18.75 15.0 33.75
Angry 0 0 0 0
Happy 70 38.9 35.0 36.85
Table 4. Confusion Matrix of the Testing Dataset
Decision Tree Support Vector Machine
Fear 9 0 7 4 Fear 0 4 16 0
Sad 13 2 1 4 Sad 1 6 13 0
Angry 10 5 2 3 Angry 7 11 0 2
Happy 4 7 2 7 Happy 7 11 1 1
Fear Sad Angry Happy Fear Sad Angry Happy
Table 4 shows that DT wrongly classifies fear emotion as mostly sad and angry. The same scenario
can be seen for Support Vector Machine where 7 fear documents are classified as angry and another 7
documents as happy. DT also classifies 7 angry documents as fear, while 16 angry documents are classified
as fear using SVM.
5. CONCLUSION
Out of the four emotions, happy achieved the highest accuracy rate for both Decision Tree and
Support Vector Machine with a moderate rate of recall, precision and F-Measure. The overall emotion
classification of Malay folklores performed averagely showing DT achieved better results than SVM. Upon
analysis of each emotion, fear is the most complicated emotion to be classified. Even though SVM and DT
are proven to be a robust classifier for other datasets in previous work, they seem to perform rather miserably
producing inconsistent results making it difficult to reach a conclusive finding. We believed that the main
problem is the emotion annotation process. When the manual annotation is done by the human annotator, the
document is labelled based on the context of the document. For example, the word ‘tolong’ can be
categorized as sad or fear depending on the context of the document. This may reduce the precision of the
classifier. For further improvement of the text-based emotion classifier, semantic text feature extraction is
needed with a bigger dataset used for training.
REFERENCES
[1] Kirange D. K., “Emotion classification of news headlines using SVM,” Asian Journal of Computer Science &
Information Technology, vol/issue: 2(5), pp. 104-106, 2013.
[2] Hancock J. T., et al., “Expressing emotion in text-based communication,” ACM SIGCHI Conference on Human
factors in Computing Systems. San Jose, California, pp. 929-932, 2007.
[3] Shivhare S. N. and Khethawat S., “Emotion detection from text,” in Wyld D. C., et al., “Computer Science &
Information Technology (CS & IT),” pp. 371-377, 2012.
[4] Strapparava C. and Mihalcea R., “Learning to identify emotions in text,” ACM Symposium on Applied Computing,
Ceara, Brazil, pp. 1556-1560, 2008.
[5] Herzig J., et al., “Emotion detection from text via ensemble classification using word embeddings,” ACM SIGIR
International Conference on Theory of Information Retrieval, New York, pp. 269-272, 2017.
[6] Polaka I., et al., “Decision tree classifiers in bioinformatics,” Scientific Journal of Riga Technical University
Computer Sciences, vol/issue: 42(1), pp. 118-23, 2010.
[7] Kołakowska A., et al., “Emotion recognition and its application in software engineering,” 6th IEEE International
Conference on Human System Interaction (HSI), Sopot, Poland, pp. 532-539, 2013.
[8] Ramli I., et al., “The first Malay language storytelling text-to-speech (TTS) corpus for humanoid robot
storytellers,” Journal of Fundamental and Applied Sciences, vol/issue: 9(4S), pp. 340-58, 2017.
[9] I. Ramli, et al., “An improved pitch contour formulation for Malay language storytelling Text-to-Speech (TTS).
IEEE Industrial Electronics and Applications Conference (IEACon), Kota Kinabalu, Sabah, pp. 250-255, 2016.
[10] Agrawal A. and An A., “Unsupervised emotion detection from text using semantic and syntactic relations,” 2012
IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, Los
Alamitos, vol. 1, pp. 346-353, 2012.
8. ISSN: 2302-9285
BEEI, Vol. 7, No. 3, September 2018 : 479 – 486
486
[11] Jamal N., et al., “Poetry classification using support vector machines,” Journal of Computer Science, vol/issue:
8(9), pp. 1441, 2012.
[12] Mat A. N., et al., “Morphological differences between Bahasa Melayu and English: Constraints in students’
understanding,” The Second Biennial International Conference on Teaching and Learning of English in Asia
(TELiA2). Langkawi, pp. 1-11, 2007.
[13] Samsudin N., et al., “Bess or xbest: Mining the Malaysian online reviews,” IEEE 3rd Conference on Data Mining
and Optimization (DMO), Bangi, Selangor, pp. 38-43, 2011.
[14] Kaur J. and Saini J. R., “On classifying sentiments and mining opinions,” International Journal of Emerging
Technologies in Computational and Applied Sciences, pp. 210-214, 2014.
[15] Jamaluddin M. N., et al., “Classification and quantification of user's emotion on Malay language in social network
sites using Latent Semantic Analysis,” IEEE Conference on Open Systems (ICOS), Langkawi, Kedah, pp. 65-70,
2016.
[16] Noah S. A. and Ismail F., “Automatic classifications of Malay proverbs using Naïve Bayesian algorithm,”
Information Technology Journal, vol/issue: 7(7), pp. 1016-22, 2008.
[17] Povoda L, et al., “Sentiment analysis based on Support Vector Machine and Big Data,” 39th IEEE International
Conference on Telecommunications and Signal Processing (TSP), Vienna, Austria, pp. 543-545, 2016.
[18] Kaur H. and Mangat V., “A survey of sentiment analysis techniques,” 2017 International Conference on InI-SMAC
(IoT in Social, Mobile, Analytics and Cloud (I-SMAC), India, pp. 921-925, 2017.
[19] Ahmed K., et al., “Sentiment analysis over social Networks: An overview,” 2015 IEEE International Conference
on Systems, Man, and Cybernetics (SMC), Hong Kong, pp. 2174-2179, 2015.
[20] Li X., et al., “Hybrid neural networks for social emotion detection over short text,” 2016 International Joint
Conference on Neural Networks (IJCNN), Vancouver, Canada, pp. 537-544, 2016.
[21] Sarakit P., et al., “Classifying emotion in Thai youtube comments,” 2015 6th International Conference of
Information and Communication Technology for Embedded Systems (IC-ICTES), Hua-Hin, Thailand. Mac, pp. 1-5,
2015.
[22] Jamal N., et al., “Poetry classification using support vector machines,” Journal of Computer Science, vol/issue:
8(9), pp. 1441, 2012.
[23] Wilkinson P., “Affective educational games: Utilizing emotions in game-based learning,” 2013 5th International
Conference on Games and Virtual Worlds for Serious Applications (VS-GAMES), Bournemouth, United Kingdom,
pp. 1-8, 2013.
[24] Winarsih N. A. and Supriyanto C., “Evaluation of classification methods for Indonesian text emotion detection,”
IEEE International Seminar on Application for Technology of Information and Communication (ISemantic),
Semarang, Indonesia, pp. 130-133, 2016.
[25] Sriram S. and Yuan X., “An enhanced approach for classifying emotions using customized decision tree
algorithm,” IEEE Southeastcon, Orlando, Florida, pp. 1-6, 2012.
[26] Wilbur W. J. and Sirotkin K., “The automatic identification of stop words,” Journal of Information Science,
vol/issue: 18(1), pp. 45-55, 1992.
[27] Sadjirin R. and Rahman N. A., “Efficient retrieval of Malay language documents using latent semantic indexing,”
2010 International Symposium in Information Technology (ITSim), Kuala Lumpur, vol. 3, pp. 1410-1415, 2010.