Proliferation of internet and social media has greatly increased the popularity of text
communication. People convey their sentiment and emotion through text which promotes lively
communication. Consequently, a tremendous amount of emotional text is generated on different social
media and blogs in every moment. This has raised the necessity of automated tool for emotion mining from
text. There are various rule based approaches of emotion extraction form text based on emotion intensity
lexicon. However, creating emotion intensity lexicon is a time consuming and tedious process. Moreover,
there is no hard and fast rule for assigning emotion intensity to words. To solve these difficulties, we
propose a machine learning based approach of emotion extraction from text which relies on annotated
example rather emotion intensity lexicon. We investigated Multinomial Naïve Bayesian (MNB) Classifier,
Artificial Neural Network (ANN) and Support Vector Machine (SVM) for mining emotion from text. In our
setup, SVM outperformed other classifiers with promising accuracy.
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...journalBEEI
In this paper, the performance of Support Vector Machine (SVM) and Decision Tree (DT) in classifying emotions from Malay folklores is presented. This work is the continuation of our storytelling speech synthesis work to add emotions for a more natural storytelling. A total of 100 documents from children short stories are collected and used as the datasets of the text-based emotion recognition experiment. Term Frequency-Inverse Document Frequency (TF-IDF) is extracted from the text documents and classified using SVM and DT. Four types of common emotions, which are happy, angry, fearful and sad are classified using the two classifiers. Results showed that DT outperformed SVM by more than 22.2% accuracy rate. However, the overall emotion recognition is only at moderate rate suggesting an improvement is needed in future work. The accuracy of the emotion recognition should be improved in future studies by using semantic feature extractors or by incorporating deep learning for classification.
A scalable, lexicon based technique for sentiment analysisijfcstjournal
Rapid increase in the volume of sentiment rich social media on the web has resulted in an increased
interest among researchers regarding Sentimental Analysis and opinion mining. However, with so much
social media available on the web, sentiment analysis is now considered as a big data task. Hence the
conventional sentiment analysis approaches fails to efficiently handle the vast amount of sentiment data
available now a days. The main focus of the research was to find such a technique that can efficiently
perform sentiment analysis on big data sets. A technique that can categorize the text as positive, negative
and neutral in a fast and accurate manner. In the research, sentiment analysis was performed on a large
data set of tweets using Hadoop and the performance of the technique was measured in form of speed and
accuracy. The experimental results shows that the technique exhibits very good efficiency in handling big
sentiment data sets.
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...journalBEEI
In this paper, the performance of Support Vector Machine (SVM) and Decision Tree (DT) in classifying emotions from Malay folklores is presented. This work is the continuation of our storytelling speech synthesis work to add emotions for a more natural storytelling. A total of 100 documents from children short stories are collected and used as the datasets of the text-based emotion recognition experiment. Term Frequency-Inverse Document Frequency (TF-IDF) is extracted from the text documents and classified using SVM and DT. Four types of common emotions, which are happy, angry, fearful and sad are classified using the two classifiers. Results showed that DT outperformed SVM by more than 22.2% accuracy rate. However, the overall emotion recognition is only at moderate rate suggesting an improvement is needed in future work. The accuracy of the emotion recognition should be improved in future studies by using semantic feature extractors or by incorporating deep learning for classification.
A scalable, lexicon based technique for sentiment analysisijfcstjournal
Rapid increase in the volume of sentiment rich social media on the web has resulted in an increased
interest among researchers regarding Sentimental Analysis and opinion mining. However, with so much
social media available on the web, sentiment analysis is now considered as a big data task. Hence the
conventional sentiment analysis approaches fails to efficiently handle the vast amount of sentiment data
available now a days. The main focus of the research was to find such a technique that can efficiently
perform sentiment analysis on big data sets. A technique that can categorize the text as positive, negative
and neutral in a fast and accurate manner. In the research, sentiment analysis was performed on a large
data set of tweets using Hadoop and the performance of the technique was measured in form of speed and
accuracy. The experimental results shows that the technique exhibits very good efficiency in handling big
sentiment data sets.
Sentiment analysis or opinion mining is a process of categorizing and identifying the sentiment expressed in a particular text. The need of automatic sentiment retrieval of
the text is quite high as a number of reviews obtained from the Internet sources like Twitter are huge in number. These reviews or opinions on popular products or events help in determining the public opinion towards the issue. An averaged histogram model is proposed in the process that deals with text classification in continuous variable approach. After data cleaning and feature extraction from the reviews, average histograms are constructed for every class, containing a generalized feature representation in that particular class, namely positive and negative. Histograms of every test elements are then classified using k-NN, Bayesian Classifier and LSTM network. This work is then implemented in Android integrated with Twitter. The user will have to provide the topic for analysis. The Application will show the result as the percentage of positive review tweets in favor of the topic using Bayesian Classifier.
Sentiment Analysis/Opinion Mining of Twitter Data on Unigram/Bigram/Unigram+Bigram Model using:
1. Machine Learning
2. Lexical Scores
3. Emoticon Scores
YouTube Video: https://youtu.be/VuR16P87yPE
Link to the WebPage: http://akirato.github.io/Twitter-Sentiment-Analysis-Tool
Github Page: https://github.com/Akirato/Twitter-Sentiment-Analysis-Tool
Cloud Technologies providing Complete Solution for all
AcademicProjects Final Year/Semester Student Projects
For More Details,
Contact:
Mobile:- +91 8121953811,
whatsapp:- +91 8522991105,
Office:- 040-66411811
Email ID: cloudtechnologiesprojects@gmail.com
Sentiment analysis in twitter using python
NLP (Natural Language Processing) is a mechanism that helps computers to know natural languages like English. In general, computers can understand data, tables etc. which are well formed. But when it involves natural languages, it's unacceptable for computers to spot them. NLP helps to translate the tongue in such a fashion which will be easily processed by modern computers. Financial Tracker is an approach which will use NLP as a tool and can differentiate the user messages in various categories. the appliance of the approach will be seen at multiple levels. At a personal level, this permits users to filtrate useful financial messages from an large junk of text messages. On the opposite hand, from an industrial point of view, this can be useful in services like online loan disbursal, which are hitting the market nowadays. These services attempt to provide online loans to individuals in an exceedingly faster and quicker manner. But when it involves business view, loan recovery from customers becomes a really important & crucial aspect. As most such services can’t take strict legal actions against the fraud customers, it becomes a requirement that loan should be provided only to those customers who deserve it. At that time, this model can come under the image. As a business we will find the user’s messages from their inbox (after taking permission from the users). These messages are often filtered using NLP which might help to differentiate various types of messages within the user's inbox which might further be used as a content for further prediction and analysis on user’s behaviour in terms of cash related transactions.
Sentiment Analysis of Twitter tweets using supervised classification technique IJERA Editor
Making use of social media for analyzing the perceptions of the masses over a product, event or a person has
gained momentum in recent times. Out of a wide array of social networks, we chose Twitter for our analysis as
the opinions expressed their, are concise and bear a distinctive polarity. Here, we collect the most recent tweets
on users' area of interest and analyze them. The extracted tweets are then segregated as positive, negative and
neutral. We do the classification in following manner: collect the tweets using Twitter API; then we process the
collected tweets to convert all letters to lowercase, eliminate special characters etc. which makes the
classification more efficient; the processed tweets are classified using a supervised classification technique. We
make use of Naive Bayes classifier to segregate the tweets as positive, negative and neutral. We use a set of
sample tweets to train the classifier. The percentage of the tweets in each category is then computed and the
result is represented graphically. The result can be used further to gain an insight into the views of the people
using Twitter about a particular topic that is being searched by the user. It can help corporate houses devise
strategies on the basis of the popularity of their product among the masses. It may help the consumers to make
informed choices based on the general sentiment expressed by the Twitter users on a product.
Emotion detection on social media status in Myanmar language IJECEIAES
Many social media emerged and provided services during these years. Most people, especially in Myanmar, use them to express their emotions or moods, learn subjects, sell products, read up-to-date news, and communicate with each other. Emotion detection on social users makes critical tasks in the opinion mining and sentiment analysis. This paper presents the emotion detection system on social media (Facebook) user status or post written in Myanmar (Burmese) language. Before the emotion detection process, the user posts are pre-processed under segmentation, stemming, part-of-speech (POS) tagging, and stop word removal. The system then uses our preconstructed Myanmar word-emotion Lexicon, M-Lexicon, to extract the emotion words from the segmented POS post. The system provides six types of emotion such as surprise, disgust, fear, anger, sadness, and happiness. The system applies naïve Bayes (NB) emotion classifier to examine the emotion in the case of more than two words with different emotion values are extracted. The classifiers also classify the emotion of the users on their posts. The experiment shows that the system can detect 85% accuracy in NB based emotion detection while 86% in recurrent neural network (RNN).
Sentiment analysis or opinion mining is a process of categorizing and identifying the sentiment expressed in a particular text. The need of automatic sentiment retrieval of
the text is quite high as a number of reviews obtained from the Internet sources like Twitter are huge in number. These reviews or opinions on popular products or events help in determining the public opinion towards the issue. An averaged histogram model is proposed in the process that deals with text classification in continuous variable approach. After data cleaning and feature extraction from the reviews, average histograms are constructed for every class, containing a generalized feature representation in that particular class, namely positive and negative. Histograms of every test elements are then classified using k-NN, Bayesian Classifier and LSTM network. This work is then implemented in Android integrated with Twitter. The user will have to provide the topic for analysis. The Application will show the result as the percentage of positive review tweets in favor of the topic using Bayesian Classifier.
Sentiment Analysis/Opinion Mining of Twitter Data on Unigram/Bigram/Unigram+Bigram Model using:
1. Machine Learning
2. Lexical Scores
3. Emoticon Scores
YouTube Video: https://youtu.be/VuR16P87yPE
Link to the WebPage: http://akirato.github.io/Twitter-Sentiment-Analysis-Tool
Github Page: https://github.com/Akirato/Twitter-Sentiment-Analysis-Tool
Cloud Technologies providing Complete Solution for all
AcademicProjects Final Year/Semester Student Projects
For More Details,
Contact:
Mobile:- +91 8121953811,
whatsapp:- +91 8522991105,
Office:- 040-66411811
Email ID: cloudtechnologiesprojects@gmail.com
Sentiment analysis in twitter using python
NLP (Natural Language Processing) is a mechanism that helps computers to know natural languages like English. In general, computers can understand data, tables etc. which are well formed. But when it involves natural languages, it's unacceptable for computers to spot them. NLP helps to translate the tongue in such a fashion which will be easily processed by modern computers. Financial Tracker is an approach which will use NLP as a tool and can differentiate the user messages in various categories. the appliance of the approach will be seen at multiple levels. At a personal level, this permits users to filtrate useful financial messages from an large junk of text messages. On the opposite hand, from an industrial point of view, this can be useful in services like online loan disbursal, which are hitting the market nowadays. These services attempt to provide online loans to individuals in an exceedingly faster and quicker manner. But when it involves business view, loan recovery from customers becomes a really important & crucial aspect. As most such services can’t take strict legal actions against the fraud customers, it becomes a requirement that loan should be provided only to those customers who deserve it. At that time, this model can come under the image. As a business we will find the user’s messages from their inbox (after taking permission from the users). These messages are often filtered using NLP which might help to differentiate various types of messages within the user's inbox which might further be used as a content for further prediction and analysis on user’s behaviour in terms of cash related transactions.
Sentiment Analysis of Twitter tweets using supervised classification technique IJERA Editor
Making use of social media for analyzing the perceptions of the masses over a product, event or a person has
gained momentum in recent times. Out of a wide array of social networks, we chose Twitter for our analysis as
the opinions expressed their, are concise and bear a distinctive polarity. Here, we collect the most recent tweets
on users' area of interest and analyze them. The extracted tweets are then segregated as positive, negative and
neutral. We do the classification in following manner: collect the tweets using Twitter API; then we process the
collected tweets to convert all letters to lowercase, eliminate special characters etc. which makes the
classification more efficient; the processed tweets are classified using a supervised classification technique. We
make use of Naive Bayes classifier to segregate the tweets as positive, negative and neutral. We use a set of
sample tweets to train the classifier. The percentage of the tweets in each category is then computed and the
result is represented graphically. The result can be used further to gain an insight into the views of the people
using Twitter about a particular topic that is being searched by the user. It can help corporate houses devise
strategies on the basis of the popularity of their product among the masses. It may help the consumers to make
informed choices based on the general sentiment expressed by the Twitter users on a product.
Emotion detection on social media status in Myanmar language IJECEIAES
Many social media emerged and provided services during these years. Most people, especially in Myanmar, use them to express their emotions or moods, learn subjects, sell products, read up-to-date news, and communicate with each other. Emotion detection on social users makes critical tasks in the opinion mining and sentiment analysis. This paper presents the emotion detection system on social media (Facebook) user status or post written in Myanmar (Burmese) language. Before the emotion detection process, the user posts are pre-processed under segmentation, stemming, part-of-speech (POS) tagging, and stop word removal. The system then uses our preconstructed Myanmar word-emotion Lexicon, M-Lexicon, to extract the emotion words from the segmented POS post. The system provides six types of emotion such as surprise, disgust, fear, anger, sadness, and happiness. The system applies naïve Bayes (NB) emotion classifier to examine the emotion in the case of more than two words with different emotion values are extracted. The classifiers also classify the emotion of the users on their posts. The experiment shows that the system can detect 85% accuracy in NB based emotion detection while 86% in recurrent neural network (RNN).
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...mathsjournal
For one dimensional homogeneous, isotropic aquifer, without accretion the governing Boussinesq
equation under Dupuit assumptions is a nonlinear partial differential equation. In the present paper
approximate analytical solution of nonlinear Boussinesq equation is obtained using Homotopy
perturbation transform method(HPTM). The solution is compared with the exact solution. The
comparison shows that the HPTM is efficient, accurate and reliable. The analysis of two important aquifer
parameters namely viz. specific yield and hydraulic conductivity is studied to see the effects on the height
of water table. The results resemble well with the physical phenomena.
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISmlaij
Sentiment analysis and Opinion mining has emerged as a popular and efficient technique for information retrieval and web data analysis. The exponential growth of the user generated content has opened new horizons for research in the field of sentiment analysis. This paper proposes a model for sentiment analysis of movie reviews using a combination of natural language processing and machine learning approaches. Firstly, different data pre-processing schemes are applied on the dataset. Secondly, the behaviour of twoclassifiers, Naive Bayes and SVM, is investigated in combination with different feature selection schemes to
obtain the results for sentiment analysis. Thirdly, the proposed model for sentiment analysis is extended to
obtain the results for higher order n-grams.
Neural Network Based Context Sensitive Sentiment AnalysisEditor IJCATR
Social media communication is evolving more in these days. Social networking site is being rapidly increased in recent years, which provides platform to connect people all over the world and share their interests. The conversation and the posts available in social media are unstructured in nature. So sentiment analysis will be a challenging work in this platform. These analyses are mostly performed in machine learning techniques which are less accurate than neural network methodologies. This paper is based on sentiment classification using Competitive layer neural networks and classifies the polarity of a given text whether the expressed opinion in the text is positive or negative or neutral. It determines the overall topic of the given text. Context independent sentences and implicit meaning in the text are also considered in polarity classification.
A MODEL BASED ON SENTIMENTS ANALYSIS FOR STOCK EXCHANGE PREDICTION - CASE STU...csandit
Predicting the behavior of shares in the stock market is a complex problem, that involves variables not always known and can undergo various influences, from the collective emotion to high-profile news. Such volatility, can represent considerable financial losses for investors. In order to anticipate such changes in the market, it has been proposed various mechanisms to try to predict the behavior of an asset in the stock market, based on previously existing information.
Such mechanisms include statistical data only, without considering the collective feeling. This article, is going to use natural language processing algorithms (LPN) to determine the collective mood on assets and later with the help of the SVM algorithm to extract patterns in an attempt to predict the active behavior. Nevertheless it is important to note that such approach is not intended to be the main factor in the decision making process, but rather an aid tool, which combined with other information, can provide higher accuracy for the solution of this problem
A MODEL BASED ON SENTIMENTS ANALYSIS FOR STOCK EXCHANGE PREDICTION - CASE STU...cscpconf
Predicting the behavior of shares in the stock market is a complex problem, that involves variables not always known and can undergo various influences, from the collective emotion to the high-profile news. Such volatility can represent considerable financial losses for investors. In order to anticipate such changes in the market, it has been proposed various mechanisms to try to predict the behavior of an asset in the stock market, based on previously existing information. Such mechanisms include statistical data only, without considering the collective feeling. This article is going to use natural language processing algorithms (LPN) to determine the collective mood on assets and later with the help of the SVM algorithm to extract patterns in an
attempt to predict the active behavior. Nevertheless it is important to note that such approach is not intended to be the main factor in the decision making process, but rather an aid tool, which combined with other information, can provide higher accuracy for the solution of this problem.
Investigating human subjects is the goal of predicting human emotions in the
real world scenario. A significant number of psychological effects require
(feelings) to be produced, directly releasing human emotions. The
development of effect theory leads one to believe that one must be aware of
one's sentiments and emotions to forecast one's behavior. The proposed line
of inquiry focuses on developing a reliable model incorporating
neurophysiological data into actual feelings. Any change in emotional affect
will directly elicit a response in the body's physiological systems. This
approach is named after the notion of Gaussian mixture models (GMM). The
statistical reaction following data processing, quantitative findings on emotion
labels, and coincidental responses with training samples all directly impact the
outcomes that are accomplished. In terms of statistical parameters such as
population mean and standard deviation, the suggested method is evaluated
compared to a technique considered to be state-of-the-art. The proposed
system determines an individual's emotional state after a minimum of 6
iterative learning using the Gaussian expectation-maximization (GEM)
statistical model, in which the iterations tend to continue to zero error. Perhaps
each of these improves predictions while simultaneously increasing the
amount of value extracted.
Sentiment classification aims to detect information such as opinions, explicit , implicit feelings expressed
in text. The most existing approaches are able to detect either explicit expressions or implicit expressions of
sentiments in the text separately. In this proposed framework it will detect both Implicit and Explicit
expressions available in the meeting transcripts. It will classify the Positive, Negative, Neutral words and
also identify the topic of the particular meeting transcripts by using fuzzy logic. This paper aims to add
some additional features for improving the classification method. The quality of the sentiment classification
is improved using proposed fuzzy logic framework .In this fuzzy logic it includes the features like Fuzzy
rules and Fuzzy C-means algorithm.The quality of the output is evaluated using the parameters such as
precision, recall, f-measure. Here Fuzzy C-means Clustering technique measured in terms of Purity and
Entropy. The data set was validated using 10-fold cross validation method and observed 95% confidence
interval between the accuracy values .Finally, the proposed fuzzy logic method produced more than 85 %
accurate results and error rate is very less compared to existing sentiment classification techniques.
EPIDEMIC OUTBREAK PREDICTION USING ARTIFICIAL INTELLIGENCEijcsit
Intelligent Models for predicting diseases whether building a model to help the doctor or even preventing its spread in an area globally, is increasing day by day. Here we present a noble approach to predict the disease prone area using the power of Text Analysis and Machine Learning. Epidemic Search model using the power of the social network data analysis and then using this data to provide a probability score of the spread and to analyse the areas whether going to suffer from any epidemic spread-out, is the main focus of this work. We have tried to analyse and showcase how the model with different kinds of pre-processing and algorithms predict the output. We have used the combination of words-n grams, word embeddings and TFIDF with different data mining and deep learning algorithms like SVM, Naïve Bayes and RNN-LSTM. Naïve Bayes with TF-IDF performed better in comparison to others.
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
In recent times, the trend of online shopping through e-commerce stores and websites has grown to a huge extent. Whenever a product is purchased on an e-commerce platform, people leave their reviews about the product. These reviews are very helpful for the store owners and the product’s manufacturers for the betterment of their work process as well as product quality. An automated system is proposed in this work that operates on two datasets D1 and D2 obtained from Amazon. After certain preprocessing steps, N-gram and word embedding-based features are extracted using term frequency-inverse document frequency (TF-IDF), bag of words (BoW) and global vectors (GloVe), and Word2vec, respectively. Four machine learning (ML) models support vector machines (SVM), logistic regression (RF), logistic regression (LR), multinomial Naïve Bayes (MNB), two deep learning (DL) models convolutional neural network (CNN), long-short term memory (LSTM), and standalone bidirectional encoder representations (BERT) are used to classify reviews as either positive or negative. The results obtained by the standard ML, DL models and BERT are evaluated using certain performance evaluation measures. BERT turns out to be the best-performing model in the case of D1 with an accuracy of 90% on features derived by word embedding models while the CNN provides the best accuracy of 97% upon word embedding features in the case of D2. The proposed model shows better overall performance on D2 as compared to D1.
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
In this study, a microstrip patch antenna that works at 3.6 GHz was built and tested to see how well it works. In this work, Rogers RT/Duroid 5880 has been used as the substrate material, with a dielectric permittivity of 2.2 and a thickness of 0.3451 mm; it serves as the base for the examined antenna. The computer simulation technology (CST) studio suite is utilized to show the recommended antenna design. The goal of this study was to get a more extensive transmission capacity, a lower voltage standing wave ratio (VSWR), and a lower return loss, but the main goal was to get a higher gain, directivity, and efficiency. After simulation, the return loss, gain, directivity, bandwidth, and efficiency of the supplied antenna are found to be -17.626 dB, 9.671 dBi, 9.924 dBi, 0.2 GHz, and 97.45%, respectively. Besides, the recreation uncovered that the transfer speed side-lobe level at phi was much better than those of the earlier works, at -28.8 dB, respectively. Thus, it makes a solid contender for remote innovation and more robust communication.
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
In this paper, snake optimization algorithm (SOA) is used to find the optimal gains of an enhanced controller for controlling congestion problem in computer networks. M-file and Simulink platform is adopted to evaluate the response of the active queue management (AQM) system, a comparison with two classical controllers is done, all tuned gains of controllers are obtained using SOA method and the fitness function chose to monitor the system performance is the integral time absolute error (ITAE). Transient analysis and robust analysis is used to show the proposed controller performance, two robustness tests are applied to the AQM system, one is done by varying the size of queue value in different period and the other test is done by changing the number of transmission control protocol (TCP) sessions with a value of ± 20% from its original value. The simulation results reflect a stable and robust behavior and best performance is appeared clearly to achieve the desired queue size without any noise or any transmission problems.
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
Vehicular ad-hoc networks (VANETs) are wireless-equipped vehicles that form networks along the road. The security of this network has been a major challenge. The identity-based cryptosystem (IBC) previously used to secure the networks suffers from membership authentication security features. This paper focuses on improving the detection of intruders in VANETs with a modified identity-based cryptosystem (MIBC). The MIBC is developed using a non-singular elliptic curve with Lagrange interpolation. The public key of vehicles and roadside units on the network are derived from number plates and location identification numbers, respectively. Pseudo-identities are used to mask the real identity of users to preserve their privacy. The membership authentication mechanism ensures that only valid and authenticated members of the network are allowed to join the network. The performance of the MIBC is evaluated using intrusion detection ratio (IDR) and computation time (CT) and then validated with the existing IBC. The result obtained shows that the MIBC recorded an IDR of 99.3% against 94.3% obtained for the existing identity-based cryptosystem (EIBC) for 140 unregistered vehicles attempting to intrude on the network. The MIBC shows lower CT values of 1.17 ms against 1.70 ms for EIBC. The MIBC can be used to improve the security of VANETs.
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
Understanding the primary factors of internet banking (IB) acceptance is critical for both banks and users; nevertheless, our knowledge of the role of users’ perceived risk and trust in IB adoption is limited. As a result, we develop a conceptual model by incorporating perceived risk and trust into the technology acceptance model (TAM) theory toward the IB. The proper research emphasized that the most essential component in explaining IB adoption behavior is behavioral intention to use IB adoption. TAM is helpful for figuring out how elements that affect IB adoption are connected to one another. According to previous literature on IB and the use of such technology in Iraq, one has to choose a theoretical foundation that may justify the acceptance of IB from the customer’s perspective. The conceptual model was therefore constructed using the TAM as a foundation. Furthermore, perceived risk and trust were added to the TAM dimensions as external factors. The key objective of this work was to extend the TAM to construct a conceptual model for IB adoption and to get sufficient theoretical support from the existing literature for the essential elements and their relationships in order to unearth new insights about factors responsible for IB adoption.
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
The smart antennas are broadly used in wireless communication. The least mean square (LMS) algorithm is a procedure that is concerned in controlling the smart antenna pattern to accommodate specified requirements such as steering the beam toward the desired signal, in addition to placing the deep nulls in the direction of unwanted signals. The conventional LMS (C-LMS) has some drawbacks like slow convergence speed besides high steady state fluctuation error. To overcome these shortcomings, the present paper adopts an adaptive fuzzy control step size least mean square (FC-LMS) algorithm to adjust its step size. Computer simulation outcomes illustrate that the given model has fast convergence rate as well as low mean square error steady state.
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
This paper presents the design and implementation of a forest fire monitoring and warning system based on long range (LoRa) technology, a novel ultra-low power consumption and long-range wireless communication technology for remote sensing applications. The proposed system includes a wireless sensor network that records environmental parameters such as temperature, humidity, wind speed, and carbon dioxide (CO2) concentration in the air, as well as taking infrared photos.The data collected at each sensor node will be transmitted to the gateway via LoRa wireless transmission. Data will be collected, processed, and uploaded to a cloud database at the gateway. An Android smartphone application that allows anyone to easily view the recorded data has been developed. When a fire is detected, the system will sound a siren and send a warning message to the responsible personnel, instructing them to take appropriate action. Experiments in Tram Chim Park, Vietnam, have been conducted to verify and evaluate the operation of the system.
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
Cognitive radio is a smart radio that can change its transmitter parameter based on interaction with the environment in which it operates. The demand for frequency spectrum is growing due to a big data issue as many Internet of Things (IoT) devices are in the network. Based on previous research, most frequency spectrum was used, but some spectrums were not used, called spectrum hole. Energy detection is one of the spectrum sensing methods that has been frequently used since it is easy to use and does not require license users to have any prior signal understanding. But this technique is incapable of detecting at low signal-to-noise ratio (SNR) levels. Therefore, the wavelet-based sensing is proposed to overcome this issue and detect spectrum holes. The main objective of this work is to evaluate the performance of wavelet-based sensing and compare it with the energy detection technique. The findings show that the percentage of detection in wavelet-based sensing is 83% higher than energy detection performance. This result indicates that the wavelet-based sensing has higher precision in detection and the interference towards primary user can be decreased.
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
In this paper, we present the design of a new wide dual-band bandstop filter (DBBSF) using nonuniform transmission lines. The method used to design this filter is to replace conventional uniform transmission lines with nonuniform lines governed by a truncated Fourier series. Based on how impedances are profiled in the proposed DBBSF structure, the fractional bandwidths of the two 10 dB-down rejection bands are widened to 39.72% and 52.63%, respectively, and the physical size has been reduced compared to that of the filter with the uniform transmission lines. The results of the electromagnetic (EM) simulation support the obtained analytical response and show an improved frequency behavior.
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
A distributed denial of service (DDoS) attack is where one or more computers attack or target a server computer, by flooding internet traffic to the server. As a result, the server cannot be accessed by legitimate users. A result of this attack causes enormous losses for a company because it can reduce the level of user trust, and reduce the company’s reputation to lose customers due to downtime. One of the services at the application layer that can be accessed by users is a web-based lightweight directory access protocol (LDAP) service that can provide safe and easy services to access directory applications. We used a deep learning approach to detect DDoS attacks on the CICDDoS 2019 dataset on a complex computer network at the application layer to get fast and accurate results for dealing with unbalanced data. Based on the results obtained, it is observed that DDoS attack detection using a deep learning approach on imbalanced data performs better when implemented using synthetic minority oversampling technique (SMOTE) method for binary classes. On the other hand, the proposed deep learning approach performs better for detecting DDoS attacks in multiclass when implemented using the adaptive synthetic (ADASYN) method.
The appearance of uncertainties and disturbances often effects the characteristics of either linear or nonlinear systems. Plus, the stabilization process may be deteriorated thus incurring a catastrophic effect to the system performance. As such, this manuscript addresses the concept of matching condition for the systems that are suffering from miss-match uncertainties and exogeneous disturbances. The perturbation towards the system at hand is assumed to be known and unbounded. To reach this outcome, uncertainties and their classifications are reviewed thoroughly. The structural matching condition is proposed and tabulated in the proposition 1. Two types of mathematical expressions are presented to distinguish the system with matched uncertainty and the system with miss-matched uncertainty. Lastly, two-dimensional numerical expressions are provided to practice the proposed proposition. The outcome shows that matching condition has the ability to change the system to a design-friendly model for asymptotic stabilization.
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
Many systems, including digital signal processors, finite impulse response (FIR) filters, application-specific integrated circuits, and microprocessors, use multipliers. The demand for low power multipliers is gradually rising day by day in the current technological trend. In this study, we describe a 4×4 Wallace multiplier based on a carry select adder (CSA) that uses less power and has a better power delay product than existing multipliers. HSPICE tool at 16 nm technology is used to simulate the results. In comparison to the traditional CSA-based multiplier, which has a power consumption of 1.7 µW and power delay product (PDP) of 57.3 fJ, the results demonstrate that the Wallace multiplier design employing CSA with first zero finding logic (FZF) logic has the lowest power consumption of 1.4 µW and PDP of 27.5 fJ.
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
The flaw in 5G orthogonal frequency division multiplexing (OFDM) becomes apparent in high-speed situations. Because the doppler effect causes frequency shifts, the orthogonality of OFDM subcarriers is broken, lowering both their bit error rate (BER) and throughput output. As part of this research, we use a novel design that combines massive multiple input multiple output (MIMO) and weighted overlap and add (WOLA) to improve the performance of 5G systems. To determine which design is superior, throughput and BER are calculated for both the proposed design and OFDM. The results of the improved system show a massive improvement in performance ver the conventional system and significant improvements with massive MIMO, including the best throughput and BER. When compared to conventional systems, the improved system has a throughput that is around 22% higher and the best performance in terms of BER, but it still has around 25% less error than OFDM.
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
In this study, it is aimed to obtain two different asymmetric radiation patterns obtained from antennas in the shape of the cross-section of a parabolic reflector (fan blade type antennas) and antennas with cosecant-square radiation characteristics at two different frequencies from a single antenna. For this purpose, firstly, a fan blade type antenna design will be made, and then the reflective surface of this antenna will be completed to the shape of the reflective surface of the antenna with the cosecant-square radiation characteristic with the frequency selective surface designed to provide the characteristics suitable for the purpose. The frequency selective surface designed and it provides the perfect transmission as possible at 4 GHz operating frequency, while it will act as a band-quenching filter for electromagnetic waves at 5 GHz operating frequency and will be a reflective surface. Thanks to this frequency selective surface to be used as a reflective surface in the antenna, a fan blade type radiation characteristic at 4 GHz operating frequency will be obtained, while a cosecant-square radiation characteristic at 5 GHz operating frequency will be obtained.
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
A simple and low-cost fiber based optical sensor for iron detection is demonstrated in this paper. The sensor head consist of an unclad optical fiber with the unclad length of 1 cm and it has a straight structure. Results obtained shows a linear relationship between the output light intensity and iron concentration, illustrating the functionality of this iron optical sensor. Based on the experimental results, the sensitivity and linearity are achieved at 0.0328/ppm and 0.9824 respectively at the wavelength of 690 nm. With the same wavelength, other performance parameters are also studied. Resolution and limit of detection (LOD) are found to be 0.3049 ppm and 0.0755 ppm correspondingly. This iron sensor is advantageous in that it does not require any reagent for detection, enabling it to be simpler and cost-effective in the implementation of the iron sensing.
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
In place of the commercial Pt electrode used in quantum sensitized solar cells, the low-cost CuS cathode is created using electrophoresis. High resolution scanning electron microscopy and X-ray diffraction were used to analyze the structure and morphology of structural cubic samples with diameters ranging from 40 nm to 200 nm. The conversion efficiency of solar cells is significantly impacted by the calcination temperatures of cathodes at 100 °C, 120 °C, 150 °C, and 180 °C under vacuum. The fluorine doped tin oxide (FTO)/CuS cathode electrode reached a maximum efficiency of 3.89% when it was calcined at 120 °C. Compared to other temperature combinations, CuS nanoparticles crystallize at 120 °C, which lowers resistance while increasing electron lifetime.
In place of the commercial Pt electrode used in quantum sensitized solar cells, the low-cost CuS cathode is created using electrophoresis. High resolution scanning electron microscopy and X-ray diffraction were used to analyze the structure and morphology of structural cubic samples with diameters ranging from 40 nm to 200 nm. The conversion efficiency of solar cells is significantly impacted by the calcination temperatures of cathodes at 100 °C, 120 °C, 150 °C, and 180 °C under vacuum. The fluorine doped tin oxide (FTO)/CuS cathode electrode reached a maximum efficiency of 3.89% when it was calcined at 120 °C. Compared to other temperature combinations, CuS nanoparticles crystallize at 120 °C, which lowers resistance while increasing electron lifetime.
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
This article discusses the progressive learning for structural tolerance online sequential extreme learning machine (PSTOS-ELM). PSTOS-ELM can save robust accuracy while updating the new data and the new class data on the online training situation. The robustness accuracy arises from using the householder block exact QR decomposition recursive least squares (HBQRD-RLS) of the PSTOS-ELM. This method is suitable for applications that have data streaming and often have new class data. Our experiment compares the PSTOS-ELM accuracy and accuracy robustness while data is updating with the batch-extreme learning machine (ELM) and structural tolerance online sequential extreme learning machine (STOS-ELM) that both must retrain the data in a new class data case. The experimental results show that PSTOS-ELM has accuracy and robustness comparable to ELM and STOS-ELM while also can update new class data immediately.
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
This study aimed to develop a brain-computer interface that can control an electric wheelchair using electroencephalography (EEG) signals. First, we used the Mind Wave Mobile 2 device to capture raw EEG signals from the surface of the scalp. The signals were transformed into the frequency domain using fast Fourier transform (FFT) and filtered to monitor changes in attention and relaxation. Next, we performed time and frequency domain analyses to identify features for five eye gestures: opened, closed, blink per second, double blink, and lookup. The base state was the opened-eyes gesture, and we compared the features of the remaining four action gestures to the base state to identify potential gestures. We then built a multilayer neural network to classify these features into five signals that control the wheelchair’s movement. Finally, we designed an experimental wheelchair system to test the effectiveness of the proposed approach. The results demonstrate that the EEG classification was highly accurate and computationally efficient. Moreover, the average performance of the brain-controlled wheelchair system was over 75% across different individuals, which suggests the feasibility of this approach.
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
For image segmentation, level set models are frequently employed. It offer best solution to overcome the main limitations of deformable parametric models. However, the challenge when applying those models in medical images stills deal with removing blurs in image edges which directly affects the edge indicator function, leads to not adaptively segmenting images and causes a wrong analysis of pathologies wich prevents to conclude a correct diagnosis. To overcome such issues, an effective process is suggested by simultaneously modelling and solving systems’ two-dimensional partial differential equations (PDE). The first PDE equation allows restoration using Euler’s equation similar to an anisotropic smoothing based on a regularized Perona and Malik filter that eliminates noise while preserving edge information in accordance with detected contours in the second equation that segments the image based on the first equation solutions. This approach allows developing a new algorithm which overcome the studied model drawbacks. Results of the proposed method give clear segments that can be applied to any application. Experiments on many medical images in particular blurry images with high information losses, demonstrate that the developed approach produces superior segmentation results in terms of quantity and quality compared to other models already presented in previeous works.
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
Drug addiction is a complex neurobiological disorder that necessitates comprehensive treatment of both the body and mind. It is categorized as a brain disorder due to its impact on the brain. Various methods such as electroencephalography (EEG), functional magnetic resonance imaging (FMRI), and magnetoencephalography (MEG) can capture brain activities and structures. EEG signals provide valuable insights into neurological disorders, including drug addiction. Accurate classification of drug addiction from EEG signals relies on appropriate features and channel selection. Choosing the right EEG channels is essential to reduce computational costs and mitigate the risk of overfitting associated with using all available channels. To address the challenge of optimal channel selection in addiction detection from EEG signals, this work employs the shuffled frog leaping algorithm (SFLA). SFLA facilitates the selection of appropriate channels, leading to improved accuracy. Wavelet features extracted from the selected input channel signals are then analyzed using various machine learning classifiers to detect addiction. Experimental results indicate that after selecting features from the appropriate channels, classification accuracy significantly increased across all classifiers. Particularly, the multi-layer perceptron (MLP) classifier combined with SFLA demonstrated a remarkable accuracy improvement of 15.78% while reducing time complexity.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Cosmetic shop management system project report.pdfKamal Acharya
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
Explore the innovative world of trenchless pipe repair with our comprehensive guide, "The Benefits and Techniques of Trenchless Pipe Repair." This document delves into the modern methods of repairing underground pipes without the need for extensive excavation, highlighting the numerous advantages and the latest techniques used in the industry.
Learn about the cost savings, reduced environmental impact, and minimal disruption associated with trenchless technology. Discover detailed explanations of popular techniques such as pipe bursting, cured-in-place pipe (CIPP) lining, and directional drilling. Understand how these methods can be applied to various types of infrastructure, from residential plumbing to large-scale municipal systems.
Ideal for homeowners, contractors, engineers, and anyone interested in modern plumbing solutions, this guide provides valuable insights into why trenchless pipe repair is becoming the preferred choice for pipe rehabilitation. Stay informed about the latest advancements and best practices in the field.
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxR&R Consult
CFD analysis is incredibly effective at solving mysteries and improving the performance of complex systems!
Here's a great example: At a large natural gas-fired power plant, where they use waste heat to generate steam and energy, they were puzzled that their boiler wasn't producing as much steam as expected.
R&R and Tetra Engineering Group Inc. were asked to solve the issue with reduced steam production.
An inspection had shown that a significant amount of hot flue gas was bypassing the boiler tubes, where the heat was supposed to be transferred.
R&R Consult conducted a CFD analysis, which revealed that 6.3% of the flue gas was bypassing the boiler tubes without transferring heat. The analysis also showed that the flue gas was instead being directed along the sides of the boiler and between the modules that were supposed to capture the heat. This was the cause of the reduced performance.
Based on our results, Tetra Engineering installed covering plates to reduce the bypass flow. This improved the boiler's performance and increased electricity production.
It is always satisfying when we can help solve complex challenges like this. Do your systems also need a check-up or optimization? Give us a call!
Work done in cooperation with James Malloy and David Moelling from Tetra Engineering.
More examples of our work https://www.r-r-consult.dk/en/cases-en/
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks
(CNN)s, to adversarial attacks and presents a proactive training technique designed to counter them. We
introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations.
When combined with 3D convolution and deep curriculum learning optimization (CLO), itsignificantly improves
the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach
using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10
and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing
accuracy improvements over previous techniques. The results indicate that the combination of the volumetric
input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating
adversary training.
Student information management system project report ii.pdfKamal Acharya
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
Welcome to WIPAC Monthly the magazine brought to you by the LinkedIn Group Water Industry Process Automation & Control.
In this month's edition, along with this month's industry news to celebrate the 13 years since the group was created we have articles including
A case study of the used of Advanced Process Control at the Wastewater Treatment works at Lleida in Spain
A look back on an article on smart wastewater networks in order to see how the industry has measured up in the interim around the adoption of Digital Transformation in the Water Industry.
2. TELKOMNIKA ISSN: 1693-6930
Text to Emotion Extraction Using Supervised Machine... (Muhammad Anwarul Azim)
1395
augmented the exploratory data warehouse (DW) technology to their previous existing system
for enhancing their dataset [8]. All these systems relied on emotion intensity lexicon where each
word is assigned with a vector of intensity values of different emotions. However, creating
emotion intensity lexicon is very time consuming and tedious job. Moreover, there is no hard
and fast rule for assigning emotion intensity values to a word. Thus, intensity values vary from
person to person which introduces bias in the output. A Machine Learning based system would
solve the problems regarding the rule based approcah.
Machine Learning is an approach which is used to enable machine to learn from
empirical data validated by experts [9]. Machine learning task is divided into supervised learning
and unsupervised learning. In supervised learning, also known as classification, data tuples and
their corresponding classes are given to the system and a classification model is developed on
the train data. The system predicts the class level of new data based on the classification
model. On the other hand, in unsupervised learning known as clustering, no class level is given
and system forms different clusters from the data based on any similarity measure. The task of
emotion extraction from text can be represented as supervised machine learning problem.
Supervised Machine learning based approaches for sentiment analysis are investigated
by different researchers. Those works are confined to finding positivity, negativity and neutral
nature of text. Pang et al. proposed machine learning based approach for finding only positivity
and negativity in movie review domain [10]. They used the standard Bag of Features framework
with Naïve Bayesian classifier, Maximum Entropy and Support Vector Machines. Twitter [11] is
popular social media all over the world and great source of emotional data. People express their
status through short text form known as Tweets. Sentiment analysis of tweets on GSM services
is experimented in [12] where the authors used Multinomial Naïve Bayes for classification. A
methodology proposed in [13] is related to our work which proposed a system to identify the six
emotion category proposed by Ekman et al. [14]. However, extracting more emotion category
provides more insight about data. To the best of our knowledge, there is no machine learning
approach of extracting nine emotion categories defined by Izard [6] from text data.
Although machine learning based approach of emotion extraction from text which could
eliminate the problems of rule based approach, it was not taken into account. In that connection,
we proposed a machine leaning based approach for the task. Our system relied on text labeled
with emotion category rather emotion intensity lexicon and hence eliminate the tedious job of
creating emotion intensity lexicon. Labeling a text with an emotional marker is comparatively
easier than labeling a word with emotion intensity values. In our experiment, we used social
media data from Twitter [11]. We collected 4000 tweets from twitter and labeled them with
proper emotion category by human expert. We used some portion of them to train our system
and the other to test. Among the various emotional states we used the nine basic emotion
categories namely, ‘anger’, ‘disgust’, ‘fear’, ‘guilt’, ‘interest’, ‘joy’, ‘sad’, ‘shame’ and ‘surprise’
that can be identified by facial expression defined by Izard [6] along with ‘neutral’ category
which stood for absence of emotional content.
The unique contributions of this work are: 1) We showed a machine learning based
approach of extracting nine basic emotion categories from text which eliminated the necessity of
emotion intensity lexicon; 2) We showed a comparative study on performance of three most
common classifiers in the job of emotion extraction from tweets. This remainder of the paper is
organized as follows. In the Section 2, we describe the methodology of this research. The
outcomes of our experiment and the reasons behind them are discussed in Section 3. We
conclude out work and also keep pointer to future directions in Section 4.
2. Research Method
In this work, we followed a supervised machine learning approach to identify the nine
emotion categories proposed by Izard [6]. In supervised machine learning system, the system is
trained with examples for which class labels are known. After the training, the system is tested
on a data set with unknown class labels. The architecture of our system is shown in Figure 1. At
first, we prepare our dataset to make it suitable for training and testing. Preparation of data is
performed in the Data Preparation Module of our system. Emotion Extraction Module of our
system is responsible for extracting emotion from processed data.
3. ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018 : 1394-1401
1396
2.1. Data preparation module
In this module, data are processed so that they are ready to feed into the classifier
algorithm. We collect data from social networking site Twitter because it is a great source of
emotional text [11]. In this site, people express their emotion and opinion as a short form of text
known as Tweet. Twitter API is used to collect data from Twitter. Collected tweets contained
different types of noise. For this reason, we perform cleaning operation on the tweets. We
eliminate re-tweets along with the tweets containing links to other multimedia content. After that,
we manually label the tweets to one of the nine emotion categories defined by Izard [6]. Tweets
containing no emotional content are added to ‘neutral’ category. Furthermore, all parts of data
are not important for emotion extraction task. We apply preprocessing to eliminate uninteresting
and trivial parts of the data. Preprocessing is very important step of classification because the
accuracy of natural language processing system greatly depends on it [15].
Tokenization is the first step of preprocessing text. In tokenization, the text is chopped
into a stream of words, phrases, symbols or other meaningful units called tokens. Along with
decomposing text into tokens, tokenization may also eliminate some symbols such as
punctuations. Stopwords are the token or term those occur frequently and are essentially
meaningless in the sense that they contribute very less to the context and content of the text.
Thus they are not important for classification of text and hence are eliminated. Normalization is
a transformation process that transforms different representations of a word to one ideal form.
For example, USA and U.S.A. are normalized to USA. Different variations of a word convey
almost the same meaning and hence can be considered as the same word. We use stemming
to convert variants of a word to the base form of that word.
Figure 1. Architecture of our system
2.1. Emotion extraction module
The main task of emotion extraction from data is done by this module. In this module, to
extract the emotional category of text, a classification model is created based on the training
data. For this purpose, we extract important features from the text. We apply language modeling
technique for feature extraction. We use Vector Space Model (VSM) and Probabilistic Language
Model (PLM) for language modeling.
In VSM, a tweet is represented as a vector of terms in a common vector space.
A dictionary is formed by collecting all the terms in the domain of discourse or corpus. Each
term represents a direction and hence a unit vector in the document vector. For example, if a
corpus consists of two tweets, =‘I am sick today’ and =‘I am angry’, then the dictionary
formed by VSM modeling of the tweets is shown in Table 1.
4. TELKOMNIKA ISSN: 1693-6930
Text to Emotion Extraction Using Supervised Machine... (Muhammad Anwarul Azim)
1397
Table 1. Dictionary formed by VSM modeling of the tweets
Tw eet I am sick today angry
T1 1 1 1 1 0
T2 1 1 1 0 1
The dictionary contains the count of each term in each of the tweets. The VSM representations
of the tweets are as follows.
⃗⃗⃗ ̂ ̂ ̂ ̂ ̂
⃗⃗⃗ ̂ ̂ ̂ ̂ ̂
The number of occurrences or frequency of a term in document is called term
frequency ( ). The of the terms can be viewed as a quantitative representation of the
document. This model is known as Bag of Words model in the literature of language modeling.
However, provides the importance of term to a particular document i.e., local weight. The
problem of local weight is that it provides importance regarding a document rather the whole
corpus. That’s why we use global weight which provides importance of a term
according to the whole corpus. Global weight is defined as,
Where is called Inverse Document Frequency. If there are total N number of documents in
the corpus and represents the number of documents in the corpus that contain the term
then Inverse Document Frequency is defined as,
We use probabilistic language modeling which provides the probability of a sequence of words.
We use probabilistic N-gram modeling of language in our experiment. For a sequence of words,
w1,w2,w3,…..wi,…..wn the Gram rule of language modeling is defined as,
= ∏
If the value of k = 1, then the language model is called bigram and for k = 2, the model is called
trigram. Similarly, unigram model is defined as,
P(w1w2w3…wn) = P(w1)× P(w2)× P(w3)×.....× P(wn)
Unigram model doesn’t consider the conditioning context that’s why bigram modeling is used in
our experiment. To deal with Zero Probability problem, we use Add-1 smoothing technique
which adds 1 to each count so that no probability becomes zero. Important features extracted
from the train data are fed into the classifier algorithm to create the classification model. We
experiment Multinomial Naïve Bayesian (MNB) classifier, Support Vector Machines (SVM) and
Artificial Neural Network (ANN) for classification model creation.
The Multinomial Naïve Bayesian classifier is a probabilistic classifier that predicts the
class label based on the probability of a document to be of a particular class using Bayes’
theorem. It assumes that the effect of an attribute on the class label is independent of the effect
of other attributes [16]. The reason for using this classifier is that it found to faster that other
classifiers [17]. Let, denotes the training tweets and their associated class labels. Each tweet
is represented by an n-dimensional attribute vector, Here; attribute value
represents the measurement done on the attribute . We have a total of n attributes
and m classes . The classifier will predict that a tweet
belongs to the class for which the posterior probability, conditioned on is the highest i.e., the
tweet belongs to the class iff,
> ( | )
5. ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018 : 1394-1401
1398
Thus, to find the class label of we need to maximize . According to Bayes’ theorem,
The probability is constant for all classes, thus we only need to maximize .
The function provides the class for which the probability is the maximum. Therefore, we
need to find,
( )
The probability can be calculated as,
Where | | is the number of training tweets of class in . As the Multinomial Naïve Bayesian
classifier assumes class conditional independence, the value of can be calculated as,
∏
……. .
We can calculate the probabilities ……. from the
training tweets because, as stated earlier, refers to the value of attribute for tuple .
To predict the class label of tweet is calculated for each class . The function
will return the class if and only if,
.
The value of the function is the emotional class of a tweet. The Artificial Neural
Network is based on the modeling of neurons of human brain which can be considered as a
weighted directed graph in which neurons represent the nodes and weighted directed edges
can be considered as input and output connection of neuron [16]. A Neural Network is
necessarily a connected collection of simple units called perceptron. Each link has a weight
which defines the properties of the link. It has a bias input with weight . Each
perceptron computes the weighted sum of the inputs and then applies a sigmoid function to
generate the output. The Neural Network learns by iteratively processing each tweet of the
training set. It calculates the error by comparing the output of the network with given target
value. It than back propagates the error so that weight of connection links are adjusted to
reduce the mean-squared error between the network output and actual target value.
The Support Vector Machine is a supervised classification technique in which maps the
original data into a higher dimension space using nonlinear mapping. It then searches the
higher dimension for a decision boundary which separates the tuple of one class from another
which is called optimal separating hyperplane. SVM finds this hyperplane using support vectors.
Support vectors are also training tuple that are closest to the decision boundary and pushed up
by hyperplane [16]. The SVM is a binary classifier and its main function is to find a hyperplane,
known as Maximum Marginal Hyperplane (MMH) which maximizes the margin between classes.
A separating hyperplane can be written as,
Where weight vector of the form, { } is the number of attributes and
is a scalar quantity referred to as bias. The equation of the MMH can be written as a
constrained convex optimization problem which can be solvable using Lagrangian formulation.
After solving we find the following equation for MMH as decision boundary,
6. TELKOMNIKA ISSN: 1693-6930
Text to Emotion Extraction Using Supervised Machine... (Muhammad Anwarul Azim)
1399
∑
Where, is the class label of the support vector ; is a test tweet; and are numeric
parameters that are determined automatically by the SVM algorithm and is the number of
support vector. Given a test tweet we put in it the equation above. If the sign of the result is
positive then the tweet belongs to the positive class and if negative then to the negative class.
To use SVM in our multiclass classification problem, we took one class and put all other classes
altogether which is known as One vs. Rest. We applied the same procedure to all the emotion
categories.
3. Results and Analysis
To evaluate our system, we split our dataset into disjoint train set and test set. The train
set is used to train the system and the test is used to evaluate the performance of our system.
We use the following three performance criteria to test the system.
Recall =
Precision =
F-measure =
We perform holdout method and k-fold cross validation for evaluation. For holdout method, we
use one fifth of total data for testing and rest for training. We performed the test for three times
and calculated the average of various measures which are depicted in Table 2.
Table 2. Evaluation results in holdout method of our system
Language Model Classifier Accuracy Precision Recall F-Measure
Bag of Words
MNB 54.42 66 54 52
ANN 41.25 46 41 41
SVM 71.75 72 73 70
Bigram
MNB 55.85 67 56 53
ANN 42.75 47 44 43
SVM 74.20 73 74 71
It is clear from Table 2. That SVM performs better than the other two classifiers in all the
performance criteria. However, the result of holdout method may be biased by a dataset which
is not uniformly distributed. For this reason, we also perform 10-fold cross validation of our
system. Total 4000 tweets of the corpus are divided into 10 folds of each containing 400 tweets.
One fold is used for testing and other nine folds are used for training. The process is repeated
ten times on all the folds. The result of the 10-Fold cross validation is shown Table 3.
Table 3. Evaluation results of the 10-Fold cross validation of our system
Language Model Classifier Accuracy Precision Recall F-Measure
Bag of Words
MNB 52.98 66 52.9 46.6
ANN 45.95 51.1 46.0 47.4
SVM 75.23 75.7 75.2 71.7
Bigram
MNB 54.25 68 59 55
ANN 47.25 53.50 47.25 48
SVM 77.50 78 77.15 74.0
Comparing the result given in the Table 3, it is clear that in 10-fold cross validation also,
Support Vector Machine outperforms the other two classifiers, Multinomial Naïve Bayesian and
Artificial Neural Network algorithm. As there is not much difference in figures of holdout
validation and 10-fold cross validation, it is intuitive that our dataset is uniformly distributed.
7. ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018 : 1394-1401
1400
Moreover, SVM possesses high-dimensional input space and it is less affected by the overfitting
problem. As like other text categorization problem, in our problem there are only a few irrelevant
features and almost all features contain some considerable information that are necessary for
emotion extraction. The number of features of our dataset in Bag of Words and Bigram
modeling are 2754 and 8831 respectively. Because of this high-dimensional input space, SVM
can capture all the related features.
Another aspect that is noticeable from the both Table 2 and Table 3 is bigram modeling
provides better accuracy than Bag of Words modeling. This is because the Bag of Words
modeling doesn’t consider the context of a term in a tweet. However, the emotion of a term is
also related to the previous terms of the tweet. In bigram modeling, the relation between two
consecutive terms is also taken into account thus it provides better accuracy.
The reason of better accuracy of MNB classifier than that of ANN is that we used
weight and MNB performs better with it. On the other hand, the accuracy of ANN is
lower than expected. We used default setup for ANN classifier. The accuracy may be increased
by performing grid search method for optimal learning rate and changing the different
parameters such as the number of hidden layers, number of iteration etc. of the network.
Furthermore, increasing the size of dataset may increase the performance of all three
classifiers.
4. Conclusion
Extracting emotion from text has a lot of important applications, such as e-commerce,
product review analysis, personal or group level communication, automated motive detection for
robotics etc. Automatically extracting emotion from text can be done by various approaches. A
lot of rule based methods are established there. The main problem with these methods is
creating a reliable emotion intensity lexicon with less possible effort. We analyzed supervised
machine learning based approach to solve the problem. We experimented the Multinomial
Naïve Bayesian classifier, Support Vector Machine and Artificial Neural Network in this
research. In our setup, SVM outperformed other two classifiers with impressive accuracy for the
task. Out work can be used to eliminate currently available tedious systems of creating emotion
intensity lexicon. Although the text classification problem is very old, emotion extraction from
text using machine learning techniques is comparatively new. There are many scopes of work in
this sector. We experimented three supervised machine learning methods. Comparative study
of others classifiers will be helpful for new researchers of this domain.
As a pointer to the future work, there is a lack of established dataset for 9 emotion
categories proposed by Izard. Creating an established dataset will also be a great work in this
domain. We used bigram model for semantic analysis. More complex semantic analysis model
can be used with advanced feature extraction technique to increase the accuracy of the system.
References
[1] Asad MU, Afroz N, Dey L, Nath RP, Azim MA. Introducing active learning on text to emotion analyzer.
In Computer and Information Technology(ICCIT), 2014 17th IEEE International Conference on 2014:
35-40.
[2] Abrams D, Hogg MA, editors. Social identity and social cognition. Oxford: Blackwell; 1999.
[3] Neviarouskaya A, Prendinger H, Ishizuka M. Textual affect sensing for sociable and expres sive
online communication. Affective Computing and Intelligent Interaction. 2007: 218-29.
[4] Picard RW, Picard R. Affective computing. Cambridge: MIT press; 1997.
[5] Strapparava C, Valitutti A. WordNet Affect: an Affective Extension of WordNet. InLREC 2004; 4:
1083-1086.
[6] Izard CE. Human emotions. Springer Science & Business Media; 2013.
[7] Dey L, Asad MU, Afroz N, Nath RP. Emotion extraction from real time chat messenger. In
Informatics, Electronics & Vision (ICIEV), 2014 IEEE International Conference on 2014: 1-5.
[8] Afroz N, Asad MU, Dey L, Nath RP, Azim MA. An intelligent framework for text-to-emotion analyzer.
In Computer and Information Technology (ICCIT).2015 18th IEEE International Conference on 2015:
401-406.
[9] Subroto IM, Selamat A. Plagiarism Detection through Internet using Hybrid Artificial Neural Network
and Support Vectors Machine. TELKOMNIKA (Telecommunication Computing Electronics and
Control). 2014; 12(1): 209-18.
8. TELKOMNIKA ISSN: 1693-6930
Text to Emotion Extraction Using Supervised Machine... (Muhammad Anwarul Azim)
1401
[10] Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning
techniques. In Proceedings of the ACL-02 conference on Empirical methods in natural language
processing. Association for Computational Linguistics. 2002; 10: 79-86.
[11] https://twitter.com/
[12] Susanti AR, Djatna T, Kusuma WA. Twitter's SentimentAnalysis on Gsm Services using Multinomial
Naïve Bayes. Telkomnika. 2017; 15(3).
[13] Aman S, Szpakowicz S. Identifying expressions of emotion in text. In Text, speech and dialogue
2007: 196-205. Springer Berlin/Heidelberg.
[14] Ekman P. An argument for basic emotions. Cognition & em otion. 1992 May 1; 6(3-4): 169-200.
[15] Ruhwinaningsih L, Djatna T. A Sentiment Knowledge Discovery Model in Twitter’s TV Content Using
Stochastic Gradient Descent Algorithm. TELKOMNIKA (Telecommunication Computing Electronics
and Control). 2016 Sep 1; 14(3):1067-76.
[16] Han J, Pei J, Kamber M. Data mining: concepts and techniques. Elsevier; 2011.
[17] Sembodo JE, Setiawan EB, Baizal ZK. A Framework for Classifying Indonesian News Curator in
Twitter. TELKOMNIKA (Telecommunication Computing Electronics and Control). 2017; 15(1):
357-64.