This document presents a framework for Arabic concept-level sentiment analysis using SenticNet. It discusses existing sentiment analysis approaches and focuses on concept-level sentiment analysis, which classifies text based on semantics rather than syntax. The authors modify SenticNet to suit Arabic and test it on a multi-domain Arabic dataset. Syntactic patterns are used to extract concepts from sentences, which are then translated to English and matched to SenticNet concepts to determine polarity. An accuracy of 70% was obtained when testing the generated lexicon on the dataset. The lexicon containing 69k unique concepts covers reviews from multiple domains and is made publicly available.
One fundamental problem in sentiment analysis is categorization of sentiment polarity. Given a piece of written text, the problem is to categorize the text into one specific sentiment polarity, positive or negative (or neutral). Based on the scope of the text, there are three distinctions of sentiment polarity categorization, namely the document level, the sentence level, and the entity and aspect level. Consider a review “I like multimedia features but the battery life sucks.†This sentence has a mixed emotion. The emotion regarding multimedia is positive whereas that regarding battery life is negative. Hence, it is required to extract only those opinions relevant to a particular feature (like battery life or multimedia) and classify them, instead of taking the complete sentence and the overall sentiment. In this paper, we present a novel approach to identify pattern specific expressions of opinion in text.
Opinion mining on newspaper headlines using SVM and NLPIJECEIAES
Opinion Mining also known as Sentiment Analysis, is a technique or procedure which uses Natural Language processing (NLP) to classify the outcome from text. There are various NLP tools available which are used for processing text data. Multiple research have been done in opinion mining for online blogs, Twitter, Facebook etc. This paper proposes a new opinion mining technique using Support Vector Machine (SVM) and NLP tools on newspaper headlines. Relative words are generated using Stanford CoreNLP, which is passed to SVM using count vectorizer. On comparing three models using confusion matrix, results indicate that Tf-idf and Linear SVM provides better accuracy for smaller dataset. While for larger dataset, SGD and linear SVM model outperform other models.
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...TELKOMNIKA JOURNAL
Sentiment analysis in short informal texts like product reviews is more challenging. Short texts are
sparse, noisy, and lack of context information. Traditional text classification methods may not be suitable
for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these
problems is to enrich the original texts with additional semantics to make it appear like a large document of
text. Then, traditional classification methods can be applied to it. In this study, we developed an automatic
sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based
Feature Expansion. The system consists of three main stages, preprocessing and normalization, features
expansion and classification. After preprocessing and normalization, we utilize Kateglo to find some
synonyms of every words in original texts and append them. Finally, the text is classified using Naïve
Bayes. The experiment shows that the proposed method can improve the performance of sentiment
analysis of short informal Indonesian product reviews. The best sentiment classification performance using
proposed feature expansion is obtained by accuracy of 98%.The experiment also show that feature
expansion will give higher improvement in small number of training data than in the large number of them.
The big data phenomenon has confirmed the achievement of data access transformation. Sentiment analysis (SA) is one of the most exploited area and used for profit-making purpose through business intelligence applications. This paper reviews the trends in SA and relates the growth in the area with the big data era.
One fundamental problem in sentiment analysis is categorization of sentiment polarity. Given a piece of written text, the problem is to categorize the text into one specific sentiment polarity, positive or negative (or neutral). Based on the scope of the text, there are three distinctions of sentiment polarity categorization, namely the document level, the sentence level, and the entity and aspect level. Consider a review “I like multimedia features but the battery life sucks.†This sentence has a mixed emotion. The emotion regarding multimedia is positive whereas that regarding battery life is negative. Hence, it is required to extract only those opinions relevant to a particular feature (like battery life or multimedia) and classify them, instead of taking the complete sentence and the overall sentiment. In this paper, we present a novel approach to identify pattern specific expressions of opinion in text.
Opinion mining on newspaper headlines using SVM and NLPIJECEIAES
Opinion Mining also known as Sentiment Analysis, is a technique or procedure which uses Natural Language processing (NLP) to classify the outcome from text. There are various NLP tools available which are used for processing text data. Multiple research have been done in opinion mining for online blogs, Twitter, Facebook etc. This paper proposes a new opinion mining technique using Support Vector Machine (SVM) and NLP tools on newspaper headlines. Relative words are generated using Stanford CoreNLP, which is passed to SVM using count vectorizer. On comparing three models using confusion matrix, results indicate that Tf-idf and Linear SVM provides better accuracy for smaller dataset. While for larger dataset, SGD and linear SVM model outperform other models.
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...TELKOMNIKA JOURNAL
Sentiment analysis in short informal texts like product reviews is more challenging. Short texts are
sparse, noisy, and lack of context information. Traditional text classification methods may not be suitable
for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these
problems is to enrich the original texts with additional semantics to make it appear like a large document of
text. Then, traditional classification methods can be applied to it. In this study, we developed an automatic
sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based
Feature Expansion. The system consists of three main stages, preprocessing and normalization, features
expansion and classification. After preprocessing and normalization, we utilize Kateglo to find some
synonyms of every words in original texts and append them. Finally, the text is classified using Naïve
Bayes. The experiment shows that the proposed method can improve the performance of sentiment
analysis of short informal Indonesian product reviews. The best sentiment classification performance using
proposed feature expansion is obtained by accuracy of 98%.The experiment also show that feature
expansion will give higher improvement in small number of training data than in the large number of them.
The big data phenomenon has confirmed the achievement of data access transformation. Sentiment analysis (SA) is one of the most exploited area and used for profit-making purpose through business intelligence applications. This paper reviews the trends in SA and relates the growth in the area with the big data era.
"Knowing about the user’s feedback can come to a greater aid in knowing the user as well as improving the organization. Here an example of student’s data is taken for study purpose. Analyzing the student feedback will help to help to address student related problems and help to make teaching more student oriented. Prashali S. Shinde | Asmita R. Kanase | Rutuja S. Pawar | Yamini U. Waingankar ""Sentiment Analysis of Feedback Data"" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Special Issue | Fostering Innovation, Integration and Inclusion Through Interdisciplinary Practices in Management , March 2019, URL: https://www.ijtsrd.com/papers/ijtsrd23090.pdf
Paper URL: https://www.ijtsrd.com/other-scientific-research-area/other/23090/sentiment-analysis-of-feedback-data/prashali--s-shinde"
Lexicon Based Emotion Analysis on Twitter Dataijtsrd
This paper presents a system that extracts information from automatically annotated tweets using well known existing opinion lexicons and supervised machine learning approach. In this paper, the sentiment features are primarily extracted from novel high coverage tweet specific sentiment lexicons. These lexicons are automatically generated from tweets with sentiment word hashtags and from tweets with emoticons. The sentence level or tweet level classification is done based on these word level sentiment features by using Sequential Minimal Optimization SMO classifier. SemEval 2013 Twitter sentiment dataset is applied in this work. The ablation experiments show that this system gains in F Score of up to 6.8 absolute percentage points. Nang Noon Kham "Lexicon Based Emotion Analysis on Twitter Data" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd26566.pdfPaper URL: https://www.ijtsrd.com/computer-science/data-miining/26566/lexicon-based-emotion-analysis-on-twitter-data/nang-noon-kham
A scalable, lexicon based technique for sentiment analysisijfcstjournal
Rapid increase in the volume of sentiment rich social media on the web has resulted in an increased
interest among researchers regarding Sentimental Analysis and opinion mining. However, with so much
social media available on the web, sentiment analysis is now considered as a big data task. Hence the
conventional sentiment analysis approaches fails to efficiently handle the vast amount of sentiment data
available now a days. The main focus of the research was to find such a technique that can efficiently
perform sentiment analysis on big data sets. A technique that can categorize the text as positive, negative
and neutral in a fast and accurate manner. In the research, sentiment analysis was performed on a large
data set of tweets using Hadoop and the performance of the technique was measured in form of speed and
accuracy. The experimental results shows that the technique exhibits very good efficiency in handling big
sentiment data sets.
Sentiment Analysis also known as opinion mining and Emotional AI
Refers to the use of natural language processing, text analysis, computational linguistics and biometrics to systematically identify, extract, quantify and study affective states and subjective information.
widely used in
Reviews
Survey responses
Online and social media
Health care
Troubleshooting and Optimizing Named Entity Resolution Systems in the IndustryPanos Alexopoulos
Named Entity Resolution (NER) is an information extraction task that involves detecting mentions of named entities within texts and mapping them to
their corresponding entities in a given knowledge resource. Systems and frameworks for performing NER have been developed both by the academia and the industry with different features and capabilities. Nevertheless, what all approaches have in common is that their satisfactory performance in a given scenario does not constitute a trustworthy predictor of their performance in a different one, the reason being the scenario’s different characteristics (target entities, input texts, domain knowledge etc.). With that in mind, we describe a metric-based Diagnostic Framework that can be used to identify the causes behind the low performance of NER systems in industrial settings and take appropriate actions to increase it.
https://www.youtube.com/watch?v=nvlHJgRE3pU
Won ITAC Graduation Projects Competition, ITAC ID: GP2015.R10.75
A web application that analyze big volumes of product reviews, social networks posts and tweets related to a given product. Then, present these results of this big data analytical job in a user friendly, understandable, and easily interpreted manner that can be used by different customers for different purposes.
Technologies used:
1- Hadoop
2- Hadoop Streaming
3- R Statistical
4- PHP
5- Google Charts API
Complaint Analysis in Indonesian Language Using WPKE and RAKE Algorithm IJECEIAES
Social media provides convenience in communicating and can present twoway communication that allows companies to interact with their customer. Companies can use information obtained from social media to analyze how the communities respond to their services or products. The biggest challenge in processing information in social media like Twitter, is the unstructured sentences which could lead to incorrect text processing. However, this information is very important for companies’ survival. In this research, we proposed a method to extract keywords from tweets in Indonesian language, WPKE. We compared it with RAKE, an algorithm that is language independent and usually used for keyword extraction. Finally, we develop a method to do clustering to groups the topics of complaints with data set obtained from Twitter using the “komplain” hashtag. Our method can obtain the accuracy of 72.92% while RAKE can only obtain 35.42%.
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Prateek Singh
Sentiment mining paper presentation, database mining and business intelligence.
The Design and Implementation of an Internet PublicOpinion Monitoring and Analysing System
The phenomenon of vagueness, manifested by terms and concepts like Tall, Red, Modern, etc., is quite common in human knowledge and it is related to our inability to precisely determine the extensions of such terms due to their blurred applicability boundaries. In the context of Ontologies and Semantic Web, vagueness is primarily treated by means of Fuzzy Ontologies, namely extensions of classical ontologies that apply truth degrees to vague ontological elements in an effort to quantify their vagueness and reason with it. Nevertheless, while a number of fuzzy conceptual formalisms and fuzzy ontology language extensions for representing vagueness in ontologies have been proposed by the community, the methodological issues entailed within the development process of such ontologies have been rather neglected. In this talk we position vagueness within the overall lifecycle of semantic information management and we present IKARUS-Onto, a methodology for engineering fuzzy ontologies that covers all typical ontology development stages, from specification to validation.
Methods for Sentiment Analysis: A Literature Studyvivatechijri
Sentiment analysis is a trending topic, as everyone has an opinion on everything. The systematic
study of these opinions can lead to information which can prove to be valuable for many companies and
industries in future. A huge number of users are online, and they share their opinions and comments regularly,
this information can be mined and used efficiently. Various companies can review their own product using
sentiment analysis and make the necessary changes in future. The data is huge and thus it requires efficient
processing to collect this data and analyze it to produce required result.
In this paper, we will discuss the various methods used for sentiment analysis. It also covers various techniques
used for sentiment analysis such as lexicon based approach, SVM [10], Convolution neural network,
morphological sentence pattern model [1] and IML algorithm. This paper shows studies on various data sets
such as Twitter API, Weibo, movie review, IMDb, Chinese micro-blog database [9] and more. The paper shows
various accuracy results obtained by all the systems.
Increasing interpreting needs a more objective and automatic measurement. We hold a basic idea that 'translating means translating meaning' in that we can assessment interpretation quality by comparing the
meaning of the interpreting output with the source input. That is, a translation unit of a 'chunk' named
Frame which comes from frame semantics and its components named Frame Elements (FEs) which comes
from Frame Net are proposed to explore their matching rate between target and source texts. A case study in this paper verifies the usability of semi-automatic graded semantic-scoring measurement for human
simultaneous interpreting and shows how to use frame and FE matches to score. Experiments results show that the semantic-scoring metrics have a significantly correlation coefficient with human judgment.
It gives an overview of Sentiment Analysis, Natural Language Processing, Phases of Sentiment Analysis using NLP, brief idea of Machine Learning, Textblob API and related topics.
"Knowing about the user’s feedback can come to a greater aid in knowing the user as well as improving the organization. Here an example of student’s data is taken for study purpose. Analyzing the student feedback will help to help to address student related problems and help to make teaching more student oriented. Prashali S. Shinde | Asmita R. Kanase | Rutuja S. Pawar | Yamini U. Waingankar ""Sentiment Analysis of Feedback Data"" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Special Issue | Fostering Innovation, Integration and Inclusion Through Interdisciplinary Practices in Management , March 2019, URL: https://www.ijtsrd.com/papers/ijtsrd23090.pdf
Paper URL: https://www.ijtsrd.com/other-scientific-research-area/other/23090/sentiment-analysis-of-feedback-data/prashali--s-shinde"
Lexicon Based Emotion Analysis on Twitter Dataijtsrd
This paper presents a system that extracts information from automatically annotated tweets using well known existing opinion lexicons and supervised machine learning approach. In this paper, the sentiment features are primarily extracted from novel high coverage tweet specific sentiment lexicons. These lexicons are automatically generated from tweets with sentiment word hashtags and from tweets with emoticons. The sentence level or tweet level classification is done based on these word level sentiment features by using Sequential Minimal Optimization SMO classifier. SemEval 2013 Twitter sentiment dataset is applied in this work. The ablation experiments show that this system gains in F Score of up to 6.8 absolute percentage points. Nang Noon Kham "Lexicon Based Emotion Analysis on Twitter Data" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd26566.pdfPaper URL: https://www.ijtsrd.com/computer-science/data-miining/26566/lexicon-based-emotion-analysis-on-twitter-data/nang-noon-kham
A scalable, lexicon based technique for sentiment analysisijfcstjournal
Rapid increase in the volume of sentiment rich social media on the web has resulted in an increased
interest among researchers regarding Sentimental Analysis and opinion mining. However, with so much
social media available on the web, sentiment analysis is now considered as a big data task. Hence the
conventional sentiment analysis approaches fails to efficiently handle the vast amount of sentiment data
available now a days. The main focus of the research was to find such a technique that can efficiently
perform sentiment analysis on big data sets. A technique that can categorize the text as positive, negative
and neutral in a fast and accurate manner. In the research, sentiment analysis was performed on a large
data set of tweets using Hadoop and the performance of the technique was measured in form of speed and
accuracy. The experimental results shows that the technique exhibits very good efficiency in handling big
sentiment data sets.
Sentiment Analysis also known as opinion mining and Emotional AI
Refers to the use of natural language processing, text analysis, computational linguistics and biometrics to systematically identify, extract, quantify and study affective states and subjective information.
widely used in
Reviews
Survey responses
Online and social media
Health care
Troubleshooting and Optimizing Named Entity Resolution Systems in the IndustryPanos Alexopoulos
Named Entity Resolution (NER) is an information extraction task that involves detecting mentions of named entities within texts and mapping them to
their corresponding entities in a given knowledge resource. Systems and frameworks for performing NER have been developed both by the academia and the industry with different features and capabilities. Nevertheless, what all approaches have in common is that their satisfactory performance in a given scenario does not constitute a trustworthy predictor of their performance in a different one, the reason being the scenario’s different characteristics (target entities, input texts, domain knowledge etc.). With that in mind, we describe a metric-based Diagnostic Framework that can be used to identify the causes behind the low performance of NER systems in industrial settings and take appropriate actions to increase it.
https://www.youtube.com/watch?v=nvlHJgRE3pU
Won ITAC Graduation Projects Competition, ITAC ID: GP2015.R10.75
A web application that analyze big volumes of product reviews, social networks posts and tweets related to a given product. Then, present these results of this big data analytical job in a user friendly, understandable, and easily interpreted manner that can be used by different customers for different purposes.
Technologies used:
1- Hadoop
2- Hadoop Streaming
3- R Statistical
4- PHP
5- Google Charts API
Complaint Analysis in Indonesian Language Using WPKE and RAKE Algorithm IJECEIAES
Social media provides convenience in communicating and can present twoway communication that allows companies to interact with their customer. Companies can use information obtained from social media to analyze how the communities respond to their services or products. The biggest challenge in processing information in social media like Twitter, is the unstructured sentences which could lead to incorrect text processing. However, this information is very important for companies’ survival. In this research, we proposed a method to extract keywords from tweets in Indonesian language, WPKE. We compared it with RAKE, an algorithm that is language independent and usually used for keyword extraction. Finally, we develop a method to do clustering to groups the topics of complaints with data set obtained from Twitter using the “komplain” hashtag. Our method can obtain the accuracy of 72.92% while RAKE can only obtain 35.42%.
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Prateek Singh
Sentiment mining paper presentation, database mining and business intelligence.
The Design and Implementation of an Internet PublicOpinion Monitoring and Analysing System
The phenomenon of vagueness, manifested by terms and concepts like Tall, Red, Modern, etc., is quite common in human knowledge and it is related to our inability to precisely determine the extensions of such terms due to their blurred applicability boundaries. In the context of Ontologies and Semantic Web, vagueness is primarily treated by means of Fuzzy Ontologies, namely extensions of classical ontologies that apply truth degrees to vague ontological elements in an effort to quantify their vagueness and reason with it. Nevertheless, while a number of fuzzy conceptual formalisms and fuzzy ontology language extensions for representing vagueness in ontologies have been proposed by the community, the methodological issues entailed within the development process of such ontologies have been rather neglected. In this talk we position vagueness within the overall lifecycle of semantic information management and we present IKARUS-Onto, a methodology for engineering fuzzy ontologies that covers all typical ontology development stages, from specification to validation.
Methods for Sentiment Analysis: A Literature Studyvivatechijri
Sentiment analysis is a trending topic, as everyone has an opinion on everything. The systematic
study of these opinions can lead to information which can prove to be valuable for many companies and
industries in future. A huge number of users are online, and they share their opinions and comments regularly,
this information can be mined and used efficiently. Various companies can review their own product using
sentiment analysis and make the necessary changes in future. The data is huge and thus it requires efficient
processing to collect this data and analyze it to produce required result.
In this paper, we will discuss the various methods used for sentiment analysis. It also covers various techniques
used for sentiment analysis such as lexicon based approach, SVM [10], Convolution neural network,
morphological sentence pattern model [1] and IML algorithm. This paper shows studies on various data sets
such as Twitter API, Weibo, movie review, IMDb, Chinese micro-blog database [9] and more. The paper shows
various accuracy results obtained by all the systems.
Increasing interpreting needs a more objective and automatic measurement. We hold a basic idea that 'translating means translating meaning' in that we can assessment interpretation quality by comparing the
meaning of the interpreting output with the source input. That is, a translation unit of a 'chunk' named
Frame which comes from frame semantics and its components named Frame Elements (FEs) which comes
from Frame Net are proposed to explore their matching rate between target and source texts. A case study in this paper verifies the usability of semi-automatic graded semantic-scoring measurement for human
simultaneous interpreting and shows how to use frame and FE matches to score. Experiments results show that the semantic-scoring metrics have a significantly correlation coefficient with human judgment.
It gives an overview of Sentiment Analysis, Natural Language Processing, Phases of Sentiment Analysis using NLP, brief idea of Machine Learning, Textblob API and related topics.
The sarcasm detection with the method of logistic regressionEditorIJAERD
The prediction analysis is approach which may predict future possibilities. This research work is based on the
sarcasm detection from the text data. In the previous time SVM classification is applied for the sarcasm detection. The SVM
classifier classifies data based on the hyper plane which give low accuracy. To improve accuracy for sarcasm detection
logistic regression is applied during this work. The existing and proposed techniques are implemented in python and results
are analysed in terms of accuracy, execution time. The proposed approach has high accuracy and low execution time as
compared to SVM classifier for sarcasm detection.
Dialectal Arabic sentiment analysis based on tree-based pipeline optimizatio...IJECEIAES
The heavy involvement of the Arabic internet users resulted in spreading data written in the Arabic language and creating a vast research area regarding natural language processing (NLP). Sentiment analysis is a growing field of research that is of great importance to everyone considering the high added potential for decision-making and predicting upcoming actions using the texts produced in social networks. Arabic used in microblogging websites, especially Twitter, is highly informal. It is not compliant with neither standards nor spelling regulations making it quite challenging for automatic machine-learning techniques. In this paper’s scope, we propose a new approach based on AutoML methods to improve the efficiency of the sentiment classification process for dialectal Arabic. This approach was validated through benchmarks testing on three different datasets that represent three vernacular forms of Arabic. The obtained results show that the presented framework has significantly increased accuracy than similar works in the literature.
Twitter, has fast emerged as one of the most powerful social media sites which can
sway opinions. Sentiment or opinion analysis has of late emerged one of the most
researched and talked about subject in Natural Language Processing (NLP), thanks
mainly to sites like Twitter. In the past, sentiment analysis models using Twitter data have
been built to predict sales performance, rank products and merchants, public opinion
polls, predict election results, political standpoints, predict box-office revenues for movies
and even predict the stock market. This study proposes a general frame in R programming
language to act as a gateway for the analysis of the tweets that portray emotions in a
short and concentrated format. The target tweets include brief emotion descriptions and
words that are not used with a proper format or grammatical structure. Majority of the
work constituted in Turkish includes the data scope and the aim of preparing a data-set.
There is no concrete and usable work done on Turkish Tweet sentiment analysis as a
software client/web application. This study is a starting point on building up the next
steps. The aim is to compare five different common machine learning methods (support
vector machines, random forests, boosting, maximum entropy, and artificial neural
networks) to classify Twitters sentiments
Evaluating sentiment analysis and word embedding techniques on BrexitIAESIJAI
In this study, we investigate the effectiveness of pre-trained word embeddings for sentiment analysis on a real-world topic, namely Brexit. We compare the performance of several popular word embedding models such global vectors for word representation (GloVe), FastText, word to vec (word2vec), and embeddings from language models (ELMo) on a dataset of tweets related to Brexit and evaluate their ability to classify the sentiment of the tweets as positive, negative, or neutral. We find that pre-trained word embeddings provide useful features for sentiment analysis and can significantly improve the performance of machine learning models. We also discuss the challenges and limitations of applying these models to complex, real-world texts such as those related to Brexit.
Analyzing sentiment system to specify polarity by lexicon-basedjournalBEEI
Currently, sentiment analysis into positive or negative getting more attention from the researchers. With the rapid development of the internet and social media have made people express their views and opinion publicly. Analyzing the sentiment in people views and opinion impact many fields such as services and productions that companies offer. Movie reviewer needs many processing to be prepared to detect emotion, classify them and achieve high accuracy. The difficulties arise due of the structure and grammar of the language and manage the dictionary. We present a system that assigns scores indicating positive or negative opinion to each distinct entity in the text corpus. Propose an innovative formula to compute the polarity score for each word occurring in the text and find it in positive dictionary or negative dictionary we have to remove it from text. After classification, the words are stored in a list that will be used to calculate the accuracy. The results reveal that the system achieved the best results in accuracy of 76.585%.
APPLYING DISTRIBUTIONAL SEMANTICS TO ENHANCE CLASSIFYING EMOTIONS IN ARABIC T...cscpconf
Most of the recent researches have been carried out to analyse sentiment and emotions found in
English texts, where few studies have been conducted on Arabic contents, which have been
focused on analysing the sentiment as positive and negative, instead of the different emotions’
classes. Therefore this paper has focused on analysing different six emotions’ classes in Arabic
contents, especially Arabic tweets which have unstructured nature that make it challenging task
compared to the formal structured contents found in Arabic journals and books. On the other
hand, the recent developments in the distributional sematic models, have encouraged testing the
effect of the distributional measures on the classification process, which was not investigated by
any other classification-related studies for analysing Arabic texts. As a result, the model has
successfully improved the average accuracy to more than 86% using Support Vector Machine
(SVM) compared to the different sentiments and emotions studies for classifying Arabic texts
through the developed semi-supervised approach which has employed the contextual and the
co-occurrence information from a large amount of unlabelled dataset. In addition to the
different remarkable achieved results, the model has recorded a high average accuracy,
85.30%, after removing the labels from the unlabelled contextual information which was used in
the labelled dataset during the classification process. Moreover, due to the unstructured nature
of Twitter contents, a general set of pre-processing techniques for Arabic texts was found which
has resulted in increasing the accuracy of the six emotions’ classes to 85.95% while employing
the contextual information from the unlabelled dataset.
Insights to Problems, Research Trend and Progress in Techniques of Sentiment ...IJECEIAES
The research-based implementations towards Sentiment analyses are about a decade old and have introduced many significant algorithms, techniques, and framework towards enhancing its performance. The applicability of sentiment analysis towards business and the political survey is quite immense. However, we strongly feel that existing progress in research towards Sentiment Analysis is not at par with the demand of massively increasing dynamic data over the pervasive environment. The degree of problems associated with opinion mining over such forms of data has been less addressed, and still, it leaves the certain major scope of research. This paper will brief about existing research trends, some important research implementation in recent times, and exploring some major open issues about sentiment analysis. We believe that this manuscript will give a progress report with the snapshot of effectiveness borne by the research techniques towards sentiment analysis to further assist the upcoming researcher to identify and pave their research work in a perfect direction towards considering research gap.
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...mathsjournal
For one dimensional homogeneous, isotropic aquifer, without accretion the governing Boussinesq
equation under Dupuit assumptions is a nonlinear partial differential equation. In the present paper
approximate analytical solution of nonlinear Boussinesq equation is obtained using Homotopy
perturbation transform method(HPTM). The solution is compared with the exact solution. The
comparison shows that the HPTM is efficient, accurate and reliable. The analysis of two important aquifer
parameters namely viz. specific yield and hydraulic conductivity is studied to see the effects on the height
of water table. The results resemble well with the physical phenomena.
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISmlaij
Sentiment analysis and Opinion mining has emerged as a popular and efficient technique for information retrieval and web data analysis. The exponential growth of the user generated content has opened new horizons for research in the field of sentiment analysis. This paper proposes a model for sentiment analysis of movie reviews using a combination of natural language processing and machine learning approaches. Firstly, different data pre-processing schemes are applied on the dataset. Secondly, the behaviour of twoclassifiers, Naive Bayes and SVM, is investigated in combination with different feature selection schemes to
obtain the results for sentiment analysis. Thirdly, the proposed model for sentiment analysis is extended to
obtain the results for higher order n-grams.
Customer Opinions Evaluation: A Case Study on Arabic Tweets gerogepatton
This paper presents an automatic method for extracting, processing, and analysis of customer opinions
on Arabic social media. We present a four-step approach for mining of Arabic tweets. First, Natural
Language Processing (NLP) with different types of analyses had performed. Second, we present an
automatic and expandable lexicon for Arabic adjectives. The initial lexicon is built using 1350 adjectives
as seeds from processing of different datasets in Arabic language. The lexicon is automatically expanded
by collecting synonyms and morphemes of each word through Arabic resources and google translate.
Third, emotional analysis was considered by two different methods; Machine Learning (ML) and rulebased method. Finally, Feature Selection (FS) is also considered to enhance the mining results. The
experimental results reveal that the proposed method outperforms counterpart ones with an improvement
margin of up to 4% using F-Measure.
Similar to A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet (20)
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
Enhancing battery system identification: nonlinear autoregressive modeling fo...IJECEIAES
Precisely characterizing Li-ion batteries is essential for optimizing their
performance, enhancing safety, and prolonging their lifespan across various
applications, such as electric vehicles and renewable energy systems. This
article introduces an innovative nonlinear methodology for system
identification of a Li-ion battery, employing a nonlinear autoregressive with
exogenous inputs (NARX) model. The proposed approach integrates the
benefits of nonlinear modeling with the adaptability of the NARX structure,
facilitating a more comprehensive representation of the intricate
electrochemical processes within the battery. Experimental data collected
from a Li-ion battery operating under diverse scenarios are employed to
validate the effectiveness of the proposed methodology. The identified
NARX model exhibits superior accuracy in predicting the battery's behavior
compared to traditional linear models. This study underscores the
importance of accounting for nonlinearities in battery modeling, providing
insights into the intricate relationships between state-of-charge, voltage, and
current under dynamic conditions.
Smart grid deployment: from a bibliometric analysis to a surveyIJECEIAES
Smart grids are one of the last decades' innovations in electrical energy.
They bring relevant advantages compared to the traditional grid and
significant interest from the research community. Assessing the field's
evolution is essential to propose guidelines for facing new and future smart
grid challenges. In addition, knowing the main technologies involved in the
deployment of smart grids (SGs) is important to highlight possible
shortcomings that can be mitigated by developing new tools. This paper
contributes to the research trends mentioned above by focusing on two
objectives. First, a bibliometric analysis is presented to give an overview of
the current research level about smart grid deployment. Second, a survey of
the main technological approaches used for smart grid implementation and
their contributions are highlighted. To that effect, we searched the Web of
Science (WoS), and the Scopus databases. We obtained 5,663 documents
from WoS and 7,215 from Scopus on smart grid implementation or
deployment. With the extraction limitation in the Scopus database, 5,872 of
the 7,215 documents were extracted using a multi-step process. These two
datasets have been analyzed using a bibliometric tool called bibliometrix.
The main outputs are presented with some recommendations for future
research.
Use of analytical hierarchy process for selecting and prioritizing islanding ...IJECEIAES
One of the problems that are associated to power systems is islanding
condition, which must be rapidly and properly detected to prevent any
negative consequences on the system's protection, stability, and security.
This paper offers a thorough overview of several islanding detection
strategies, which are divided into two categories: classic approaches,
including local and remote approaches, and modern techniques, including
techniques based on signal processing and computational intelligence.
Additionally, each approach is compared and assessed based on several
factors, including implementation costs, non-detected zones, declining
power quality, and response times using the analytical hierarchy process
(AHP). The multi-criteria decision-making analysis shows that the overall
weight of passive methods (24.7%), active methods (7.8%), hybrid methods
(5.6%), remote methods (14.5%), signal processing-based methods (26.6%),
and computational intelligent-based methods (20.8%) based on the
comparison of all criteria together. Thus, it can be seen from the total weight
that hybrid approaches are the least suitable to be chosen, while signal
processing-based methods are the most appropriate islanding detection
method to be selected and implemented in power system with respect to the
aforementioned factors. Using Expert Choice software, the proposed
hierarchy model is studied and examined.
Enhancing of single-stage grid-connected photovoltaic system using fuzzy logi...IJECEIAES
The power generated by photovoltaic (PV) systems is influenced by
environmental factors. This variability hampers the control and utilization of
solar cells' peak output. In this study, a single-stage grid-connected PV
system is designed to enhance power quality. Our approach employs fuzzy
logic in the direct power control (DPC) of a three-phase voltage source
inverter (VSI), enabling seamless integration of the PV connected to the
grid. Additionally, a fuzzy logic-based maximum power point tracking
(MPPT) controller is adopted, which outperforms traditional methods like
incremental conductance (INC) in enhancing solar cell efficiency and
minimizing the response time. Moreover, the inverter's real-time active and
reactive power is directly managed to achieve a unity power factor (UPF).
The system's performance is assessed through MATLAB/Simulink
implementation, showing marked improvement over conventional methods,
particularly in steady-state and varying weather conditions. For solar
irradiances of 500 and 1,000 W/m2
, the results show that the proposed
method reduces the total harmonic distortion (THD) of the injected current
to the grid by approximately 46% and 38% compared to conventional
methods, respectively. Furthermore, we compare the simulation results with
IEEE standards to evaluate the system's grid compatibility.
Enhancing photovoltaic system maximum power point tracking with fuzzy logic-b...IJECEIAES
Photovoltaic systems have emerged as a promising energy resource that
caters to the future needs of society, owing to their renewable, inexhaustible,
and cost-free nature. The power output of these systems relies on solar cell
radiation and temperature. In order to mitigate the dependence on
atmospheric conditions and enhance power tracking, a conventional
approach has been improved by integrating various methods. To optimize
the generation of electricity from solar systems, the maximum power point
tracking (MPPT) technique is employed. To overcome limitations such as
steady-state voltage oscillations and improve transient response, two
traditional MPPT methods, namely fuzzy logic controller (FLC) and perturb
and observe (P&O), have been modified. This research paper aims to
simulate and validate the step size of the proposed modified P&O and FLC
techniques within the MPPT algorithm using MATLAB/Simulink for
efficient power tracking in photovoltaic systems.
Adaptive synchronous sliding control for a robot manipulator based on neural ...IJECEIAES
Robot manipulators have become important equipment in production lines, medical fields, and transportation. Improving the quality of trajectory tracking for
robot hands is always an attractive topic in the research community. This is a
challenging problem because robot manipulators are complex nonlinear systems
and are often subject to fluctuations in loads and external disturbances. This
article proposes an adaptive synchronous sliding control scheme to improve trajectory tracking performance for a robot manipulator. The proposed controller
ensures that the positions of the joints track the desired trajectory, synchronize
the errors, and significantly reduces chattering. First, the synchronous tracking
errors and synchronous sliding surfaces are presented. Second, the synchronous
tracking error dynamics are determined. Third, a robust adaptive control law is
designed,the unknown components of the model are estimated online by the neural network, and the parameters of the switching elements are selected by fuzzy
logic. The built algorithm ensures that the tracking and approximation errors
are ultimately uniformly bounded (UUB). Finally, the effectiveness of the constructed algorithm is demonstrated through simulation and experimental results.
Simulation and experimental results show that the proposed controller is effective with small synchronous tracking errors, and the chattering phenomenon is
significantly reduced.
Remote field-programmable gate array laboratory for signal acquisition and de...IJECEIAES
A remote laboratory utilizing field-programmable gate array (FPGA) technologies enhances students’ learning experience anywhere and anytime in embedded system design. Existing remote laboratories prioritize hardware access and visual feedback for observing board behavior after programming, neglecting comprehensive debugging tools to resolve errors that require internal signal acquisition. This paper proposes a novel remote embeddedsystem design approach targeting FPGA technologies that are fully interactive via a web-based platform. Our solution provides FPGA board access and debugging capabilities beyond the visual feedback provided by existing remote laboratories. We implemented a lab module that allows users to seamlessly incorporate into their FPGA design. The module minimizes hardware resource utilization while enabling the acquisition of a large number of data samples from the signal during the experiments by adaptively compressing the signal prior to data transmission. The results demonstrate an average compression ratio of 2.90 across three benchmark signals, indicating efficient signal acquisition and effective debugging and analysis. This method allows users to acquire more data samples than conventional methods. The proposed lab allows students to remotely test and debug their designs, bridging the gap between theory and practice in embedded system design.
Detecting and resolving feature envy through automated machine learning and m...IJECEIAES
Efficiently identifying and resolving code smells enhances software project quality. This paper presents a novel solution, utilizing automated machine learning (AutoML) techniques, to detect code smells and apply move method refactoring. By evaluating code metrics before and after refactoring, we assessed its impact on coupling, complexity, and cohesion. Key contributions of this research include a unique dataset for code smell classification and the development of models using AutoGluon for optimal performance. Furthermore, the study identifies the top 20 influential features in classifying feature envy, a well-known code smell, stemming from excessive reliance on external classes. We also explored how move method refactoring addresses feature envy, revealing reduced coupling and complexity, and improved cohesion, ultimately enhancing code quality. In summary, this research offers an empirical, data-driven approach, integrating AutoML and move method refactoring to optimize software project quality. Insights gained shed light on the benefits of refactoring on code quality and the significance of specific features in detecting feature envy. Future research can expand to explore additional refactoring techniques and a broader range of code metrics, advancing software engineering practices and standards.
Smart monitoring technique for solar cell systems using internet of things ba...IJECEIAES
Rapidly and remotely monitoring and receiving the solar cell systems status parameters, solar irradiance, temperature, and humidity, are critical issues in enhancement their efficiency. Hence, in the present article an improved smart prototype of internet of things (IoT) technique based on embedded system through NodeMCU ESP8266 (ESP-12E) was carried out experimentally. Three different regions at Egypt; Luxor, Cairo, and El-Beheira cities were chosen to study their solar irradiance profile, temperature, and humidity by the proposed IoT system. The monitoring data of solar irradiance, temperature, and humidity were live visualized directly by Ubidots through hypertext transfer protocol (HTTP) protocol. The measured solar power radiation in Luxor, Cairo, and El-Beheira ranged between 216-1000, 245-958, and 187-692 W/m 2 respectively during the solar day. The accuracy and rapidity of obtaining monitoring results using the proposed IoT system made it a strong candidate for application in monitoring solar cell systems. On the other hand, the obtained solar power radiation results of the three considered regions strongly candidate Luxor and Cairo as suitable places to build up a solar cells system station rather than El-Beheira.
An efficient security framework for intrusion detection and prevention in int...IJECEIAES
Over the past few years, the internet of things (IoT) has advanced to connect billions of smart devices to improve quality of life. However, anomalies or malicious intrusions pose several security loopholes, leading to performance degradation and threat to data security in IoT operations. Thereby, IoT security systems must keep an eye on and restrict unwanted events from occurring in the IoT network. Recently, various technical solutions based on machine learning (ML) models have been derived towards identifying and restricting unwanted events in IoT. However, most ML-based approaches are prone to miss-classification due to inappropriate feature selection. Additionally, most ML approaches applied to intrusion detection and prevention consider supervised learning, which requires a large amount of labeled data to be trained. Consequently, such complex datasets are impossible to source in a large network like IoT. To address this problem, this proposed study introduces an efficient learning mechanism to strengthen the IoT security aspects. The proposed algorithm incorporates supervised and unsupervised approaches to improve the learning models for intrusion detection and mitigation. Compared with the related works, the experimental outcome shows that the model performs well in a benchmark dataset. It accomplishes an improved detection accuracy of approximately 99.21%.
Developing a smart system for infant incubators using the internet of things ...IJECEIAES
This research is developing an incubator system that integrates the internet of things and artificial intelligence to improve care for premature babies. The system workflow starts with sensors that collect data from the incubator. Then, the data is sent in real-time to the internet of things (IoT) broker eclipse mosquito using the message queue telemetry transport (MQTT) protocol version 5.0. After that, the data is stored in a database for analysis using the long short-term memory network (LSTM) method and displayed in a web application using an application programming interface (API) service. Furthermore, the experimental results produce as many as 2,880 rows of data stored in the database. The correlation coefficient between the target attribute and other attributes ranges from 0.23 to 0.48. Next, several experiments were conducted to evaluate the model-predicted value on the test data. The best results are obtained using a two-layer LSTM configuration model, each with 60 neurons and a lookback setting 6. This model produces an R 2 value of 0.934, with a root mean square error (RMSE) value of 0.015 and a mean absolute error (MAE) of 0.008. In addition, the R 2 value was also evaluated for each attribute used as input, with a result of values between 0.590 and 0.845.
A review on internet of things-based stingless bee's honey production with im...IJECEIAES
Honey is produced exclusively by honeybees and stingless bees which both are well adapted to tropical and subtropical regions such as Malaysia. Stingless bees are known for producing small amounts of honey and are known for having a unique flavor profile. Problem identified that many stingless bees collapsed due to weather, temperature and environment. It is critical to understand the relationship between the production of stingless bee honey and environmental conditions to improve honey production. Thus, this paper presents a review on stingless bee's honey production and prediction modeling. About 54 previous research has been analyzed and compared in identifying the research gaps. A framework on modeling the prediction of stingless bee honey is derived. The result presents the comparison and analysis on the internet of things (IoT) monitoring systems, honey production estimation, convolution neural networks (CNNs), and automatic identification methods on bee species. It is identified based on image detection method the top best three efficiency presents CNN is at 98.67%, densely connected convolutional networks with YOLO v3 is 97.7%, and DenseNet201 convolutional networks 99.81%. This study is significant to assist the researcher in developing a model for predicting stingless honey produced by bee's output, which is important for a stable economy and food security.
A trust based secure access control using authentication mechanism for intero...IJECEIAES
The internet of things (IoT) is a revolutionary innovation in many aspects of our society including interactions, financial activity, and global security such as the military and battlefield internet. Due to the limited energy and processing capacity of network devices, security, energy consumption, compatibility, and device heterogeneity are the long-term IoT problems. As a result, energy and security are critical for data transmission across edge and IoT networks. Existing IoT interoperability techniques need more computation time, have unreliable authentication mechanisms that break easily, lose data easily, and have low confidentiality. In this paper, a key agreement protocol-based authentication mechanism for IoT devices is offered as a solution to this issue. This system makes use of information exchange, which must be secured to prevent access by unauthorized users. Using a compact contiki/cooja simulator, the performance and design of the suggested framework are validated. The simulation findings are evaluated based on detection of malicious nodes after 60 minutes of simulation. The suggested trust method, which is based on privacy access control, reduced packet loss ratio to 0.32%, consumed 0.39% power, and had the greatest average residual energy of 0.99 mJoules at 10 nodes.
Fuzzy linear programming with the intuitionistic polygonal fuzzy numbersIJECEIAES
In real world applications, data are subject to ambiguity due to several factors; fuzzy sets and fuzzy numbers propose a great tool to model such ambiguity. In case of hesitation, the complement of a membership value in fuzzy numbers can be different from the non-membership value, in which case we can model using intuitionistic fuzzy numbers as they provide flexibility by defining both a membership and a non-membership functions. In this article, we consider the intuitionistic fuzzy linear programming problem with intuitionistic polygonal fuzzy numbers, which is a generalization of the previous polygonal fuzzy numbers found in the literature. We present a modification of the simplex method that can be used to solve any general intuitionistic fuzzy linear programming problem after approximating the problem by an intuitionistic polygonal fuzzy number with n edges. This method is given in a simple tableau formulation, and then applied on numerical examples for clarity.
The performance of artificial intelligence in prostate magnetic resonance im...IJECEIAES
Prostate cancer is the predominant form of cancer observed in men worldwide. The application of magnetic resonance imaging (MRI) as a guidance tool for conducting biopsies has been established as a reliable and well-established approach in the diagnosis of prostate cancer. The diagnostic performance of MRI-guided prostate cancer diagnosis exhibits significant heterogeneity due to the intricate and multi-step nature of the diagnostic pathway. The development of artificial intelligence (AI) models, specifically through the utilization of machine learning techniques such as deep learning, is assuming an increasingly significant role in the field of radiology. In the realm of prostate MRI, a considerable body of literature has been dedicated to the development of various AI algorithms. These algorithms have been specifically designed for tasks such as prostate segmentation, lesion identification, and classification. The overarching objective of these endeavors is to enhance diagnostic performance and foster greater agreement among different observers within MRI scans for the prostate. This review article aims to provide a concise overview of the application of AI in the field of radiology, with a specific focus on its utilization in prostate MRI.
Seizure stage detection of epileptic seizure using convolutional neural networksIJECEIAES
According to the World Health Organization (WHO), seventy million individuals worldwide suffer from epilepsy, a neurological disorder. While electroencephalography (EEG) is crucial for diagnosing epilepsy and monitoring the brain activity of epilepsy patients, it requires a specialist to examine all EEG recordings to find epileptic behavior. This procedure needs an experienced doctor, and a precise epilepsy diagnosis is crucial for appropriate treatment. To identify epileptic seizures, this study employed a convolutional neural network (CNN) based on raw scalp EEG signals to discriminate between preictal, ictal, postictal, and interictal segments. The possibility of these characteristics is explored by examining how well timedomain signals work in the detection of epileptic signals using intracranial Freiburg Hospital (FH), scalp Children's Hospital Boston-Massachusetts Institute of Technology (CHB-MIT) databases, and Temple University Hospital (TUH) EEG. To test the viability of this approach, two types of experiments were carried out. Firstly, binary class classification (preictal, ictal, postictal each versus interictal) and four-class classification (interictal versus preictal versus ictal versus postictal). The average accuracy for stage detection using CHB-MIT database was 84.4%, while the Freiburg database's time-domain signals had an accuracy of 79.7% and the highest accuracy of 94.02% for classification in the TUH EEG database when comparing interictal stage to preictal stage.
Analysis of driving style using self-organizing maps to analyze driver behaviorIJECEIAES
Modern life is strongly associated with the use of cars, but the increase in acceleration speeds and their maneuverability leads to a dangerous driving style for some drivers. In these conditions, the development of a method that allows you to track the behavior of the driver is relevant. The article provides an overview of existing methods and models for assessing the functioning of motor vehicles and driver behavior. Based on this, a combined algorithm for recognizing driving style is proposed. To do this, a set of input data was formed, including 20 descriptive features: About the environment, the driver's behavior and the characteristics of the functioning of the car, collected using OBD II. The generated data set is sent to the Kohonen network, where clustering is performed according to driving style and degree of danger. Getting the driving characteristics into a particular cluster allows you to switch to the private indicators of an individual driver and considering individual driving characteristics. The application of the method allows you to identify potentially dangerous driving styles that can prevent accidents.
Hyperspectral object classification using hybrid spectral-spatial fusion and ...IJECEIAES
Because of its spectral-spatial and temporal resolution of greater areas, hyperspectral imaging (HSI) has found widespread application in the field of object classification. The HSI is typically used to accurately determine an object's physical characteristics as well as to locate related objects with appropriate spectral fingerprints. As a result, the HSI has been extensively applied to object identification in several fields, including surveillance, agricultural monitoring, environmental research, and precision agriculture. However, because of their enormous size, objects require a lot of time to classify; for this reason, both spectral and spatial feature fusion have been completed. The existing classification strategy leads to increased misclassification, and the feature fusion method is unable to preserve semantic object inherent features; This study addresses the research difficulties by introducing a hybrid spectral-spatial fusion (HSSF) technique to minimize feature size while maintaining object intrinsic qualities; Lastly, a soft-margins kernel is proposed for multi-layer deep support vector machine (MLDSVM) to reduce misclassification. The standard Indian pines dataset is used for the experiment, and the outcome demonstrates that the HSSF-MLDSVM model performs substantially better in terms of accuracy and Kappa coefficient.
Cosmetic shop management system project report.pdfKamal Acharya
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxR&R Consult
CFD analysis is incredibly effective at solving mysteries and improving the performance of complex systems!
Here's a great example: At a large natural gas-fired power plant, where they use waste heat to generate steam and energy, they were puzzled that their boiler wasn't producing as much steam as expected.
R&R and Tetra Engineering Group Inc. were asked to solve the issue with reduced steam production.
An inspection had shown that a significant amount of hot flue gas was bypassing the boiler tubes, where the heat was supposed to be transferred.
R&R Consult conducted a CFD analysis, which revealed that 6.3% of the flue gas was bypassing the boiler tubes without transferring heat. The analysis also showed that the flue gas was instead being directed along the sides of the boiler and between the modules that were supposed to capture the heat. This was the cause of the reduced performance.
Based on our results, Tetra Engineering installed covering plates to reduce the bypass flow. This improved the boiler's performance and increased electricity production.
It is always satisfying when we can help solve complex challenges like this. Do your systems also need a check-up or optimization? Give us a call!
Work done in cooperation with James Malloy and David Moelling from Tetra Engineering.
More examples of our work https://www.r-r-consult.dk/en/cases-en/
Student information management system project report ii.pdfKamal Acharya
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...Amil Baba Dawood bangali
Contact with Dawood Bhai Just call on +92322-6382012 and we'll help you. We'll solve all your problems within 12 to 24 hours and with 101% guarantee and with astrology systematic. If you want to take any personal or professional advice then also you can call us on +92322-6382012 , ONLINE LOVE PROBLEM & Other all types of Daily Life Problem's.Then CALL or WHATSAPP us on +92322-6382012 and Get all these problems solutions here by Amil Baba DAWOOD BANGALI
#vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore#blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #blackmagicforlove #blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #Amilbabainuk #amilbabainspain #amilbabaindubai #Amilbabainnorway #amilbabainkrachi #amilbabainlahore #amilbabaingujranwalan #amilbabainislamabad
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)MdTanvirMahtab2
This presentation is about the working procedure of Shahjalal Fertilizer Company Limited (SFCL). A Govt. owned Company of Bangladesh Chemical Industries Corporation under Ministry of Industries.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Water scarcity is the lack of fresh water resources to meet the standard water demand. There are two type of water scarcity. One is physical. The other is economic water scarcity.
Explore the innovative world of trenchless pipe repair with our comprehensive guide, "The Benefits and Techniques of Trenchless Pipe Repair." This document delves into the modern methods of repairing underground pipes without the need for extensive excavation, highlighting the numerous advantages and the latest techniques used in the industry.
Learn about the cost savings, reduced environmental impact, and minimal disruption associated with trenchless technology. Discover detailed explanations of popular techniques such as pipe bursting, cured-in-place pipe (CIPP) lining, and directional drilling. Understand how these methods can be applied to various types of infrastructure, from residential plumbing to large-scale municipal systems.
Ideal for homeowners, contractors, engineers, and anyone interested in modern plumbing solutions, this guide provides valuable insights into why trenchless pipe repair is becoming the preferred choice for pipe rehabilitation. Stay informed about the latest advancements and best practices in the field.
2. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 5, October 2018 : 4015 – 4022
4016
probability of being indicating a neutral affect as in ‗unforgettable accident‘ or ‗unforgettable party‘. As
stated these probabilities are the result of training corpora; so the bigger and the more general the corpora is
the more reliable and more realistic the probabilities are. This approach outperforms the keywords spotting
approach for giving words realistic polarities and not just plain positive, negative or neutral.
The third approach and the most used one too to create lexicon datasets is using the statistical
methods, such as the Naïve Bayes algorithm, K-Nearest Neighbor or Support vector machines(SVM). It
mainly depends on training a machine learning algorithm with features like words co-occurrence frequencies,
Stylistic features, etc., collected from annotated data and then test the accuracy of the algorithm used on a test
sample from the same data. It is language independent thus avoids ambiguity issues associated with Arabic
[3]. Yet, these methods make classification errors when tested on smaller text units such as clauses as
compared with determining the polarity on the document-level [4]. Gives a quick overview of English SA
research efforts from 2002 up to 2014 that are mostly made using the statistical methods. The authors also
presented some of the available tools and datasets. Furthermore, [5] discussed some of the open issues in the
area of SA including that there is more focus on classing the text into positive and negative only with no
deeper diving in the emotions.
Concept-level sentiment analysis was first introduced by Eric Cambira to classify text based on their
semantics rather than their syntactic through the use of semantic networks like ConceptNet [6] which consists
of nodes representing concepts and connected with edges labeled with common sense 'taken for granted'
information provided by volunteers on the internet. Cambira et al. [7] developed SenticNet, a semantic
resource that uses common sense reasoning techniques along with an emotion categorization model and an
ontology for describing human emotions to infer the polarity of different common sense concepts like
‗beautiful day‘ or ‗feel guilty‘. Each concept is assigned with one float polarity value ∈ [-1,1], followed by
SenticNet2 [8] where more concepts are added allowing a deeper and more multi-faceted analysis of text
while providing a four-dimensional vector (sentic vector) to each concept combined of Pleasantness,
Attention, Sensitivity, and Aptitude and presented as a float value ∈ [-1,1] along with its top-ten affectively
related concepts. Then SenticNet3 [9], which contains both common and common-sense knowledge in order
to boost sentiment analysis tasks such as feature spotting and polarity detection, respectively. Then
SenticNet4 [10], where both verb and noun concepts are linked to primitives so that, for example, concepts
such as attain-knowledge or acquire know-how or acquire-knowledge are generalized as get information. An
addition that allows processing different forms of a concept that otherwise raises a not found error. The idea
of using a generative word was used in other methods too. For example, [11] used synonyms lists for positive
and negative words and mapped the list to one word that already has a polarity value.
To this end, we tested SenticNet4 for the task of polarity detection on a multi-domain Arabic dataset
at the sentence-level and showed results outperforming other Arabic sentiment analysis works that mainly
rely on other approaches. The rest of this paper is organized as follows: the next section reviews research
efforts in the area of Arabic sentiment analysis; followed by a section proposes our framework to detect the
polarity using SenticNet; after which a section discusses the results obtained; finally some concluding
remarks and future work recommendations are presented.
2. LITERATURE REVIEW
This section reviews the contributions to the Arabic sentiment analysis research field. Arabic, which
is the formal language of over than 20 countries around the world and spoken by 300 million native speakers,
is considered under-researched compared to English in the field of sentiment analysis. See Figure 1 that
represents the number of Arabic/English publications per year as presented in [12], [13] and detailed in [14],
[15] respectively. And Arabic is also under-resourced with respect to the amount of data on the internet
knowing that Arabic has scored 4th on the number of web users after English, Chinese and Spanish ranking
the highest growth rate in terms of users with 185 million of users in June 2017 according to
internetwebstats's website: http://www.internetworldstats.com/stats7.htm.
Despite that recent progress, researchers focused on using the statistical methods with a special
focus on supervised machine learning classifiers. They share the same methodology presented in Figure 2
while using different pre-processing and selecting different features. SAMAR [16], a system for Arabic
Subjectivity and Sentiment Analysis, uses Multi-dialectal manually annotated data that covers (Maktoob
chats, tweets, Wikipedia talks and web forums sentences) and does Tokenization, lemmatization and POS
tagging in the Pre- processing step. Then, the system selects syntactic and stylistic features; (Unique: is set
for low frequency words, Polarity Lexicon: checks the presence of positive or negative adjectives, Dialect:
checks the dialects of the text, Gender: checks the gender of text whether it's male, female or unknown, User
ID: checks if the author is a person or an organization and Document ID). They also made experiments with
different combinations of features and the pre-processing tasks while classifying using SVMlight
[17]
3. Int J Elec & Comp Eng ISSN: 2088-8708
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet (Hend G. Hassan)
4017
classifier. Overall results reveal improvements over the baseline performance depending on the training data.
Later on, S. Ibrahim et al. [18] used a manually annotated data and performed normalization and stopwords
removal in the pre-processing step. Then, they selected linguistic and syntactic features like term frequency,
Polar word position, detecting (negation, intensifiers, questions, and supplication) terms along with using the
pattern [adjective + noun] while also classifying using SVMlight
classifier.
Figure 1. Number of Arabic/English publications per year
Figure 2. Supervised machine learning process
3. MODIFYING SENTICNET TO SUIT ARABIC
The framework proposed in [19] is modified to suit the task of Arabic sentences' polarity detection
because Arabic natural language processing tools are trained on and made for modern standard Arabic
(MSA) which is rarely used by internet users' compared to slang Arabic and other Arabic dialects. Figure 3
is an illustration of the proposed framework; Sentences are first decomposed into bi-grams then normalized
and labeled with the part of speech (POS) tags. Then Syntactic patterns like [adjective + noun] are matched
to extract concepts that are translated afterward into English to find a match to in SenticNet.
In order to show the effectiveness of the framework, A multi-domain public dataset is used:
http://bit.ly/1wXue3C, created by ElSahar and El-Beltagy [20], covering Attraction (ATT), Hotels (HTL),
Movies (MOV), Restaurants (RES#1, RES#2) and Products (PROD) reviews. We kept RES#1 as a test
sample. The statistics of the dataset is presented in Table 1 showing the total number of sentences and
concepts we extracted along with the number of positive, negative and mixed sentences number. Those
reviews were rated by their native reviewers then were normalized into the three classes: positive, negative
and mixed following the approach adopted by Pang et al. [21]. The main goal is to conclude the polarity of
each sentence and compare it to the normalized polarity. In order to do that, sentences must be decomposed
into concepts that have a match in SenticNet then the polarity of this match is read. In particular, sentences
are decomposed into bigrams. If a sentence consists of only a bi-gram or a uni-gram, then it is considered a
concept without further analysis. If not the concepts are extracted according to the flowchart proposed in
Figure 3.
0
200
400
600
800
2009 2010 2011 2012 2013 2014 2015
Arabic
English
4. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 5, October 2018 : 4015 – 4022
4018
Figure 3. Flow chart of the framework to use SenticNet
Before extracting the concepts, sentences must be normalized and structured. To this end, the
following pre-processing steps are followed:
1. Remove elongations.
2. Remove repetitions.
3. Remove punctuations.
4. Remove diacritics.
5. Normalize all Alef forms to ا.
6. Normalize ة haa to ي.
7. Normalize ِ yaa to ْ.
Table 1. Dataset Statistics
The part of speech (POS) tags of the Normalized text are then generated using MADAMIRA [22], a
shallow syntactic parser that does tokenization, part of speech tagging, and base phrase chunking, and also
combines some of the best aspects of MADA [23] and AMIRA [24]. By reviewing the noun phrases
extracted by MADAMIRA for ATT reviews, we found that around 60% of them are unigrams 20% of which
are a separation of 'انـ ' the Arabic definite article and about %13 are pronouns separation on the word-level
like ‘ي + حصمٕم’ (‗his + design‘). Asides to misclassification errors, those 73% are ineffective as concepts.
Thus we used a hand crafted syntactic patterns following the work of ElSahar and El-Beltagy [25] that
extracts slang terms (words/expressions) and transliterated English written in Arabic letters like 'أَفز' that is
transliterated from 'over'. Their work depends on creating a set of lexico-syntactic patterns by using standard
tags like Negator [Neg], person reference [PR], Personal Pronoun [PP], Demonstrative Pronoun [DP],
Intensifier [Ints], Conjunction [Conj], Strong subjective [SS] and the extracted Subjective
Expression is {SE}. For example, "Respectable and very {polite}" would match the pattern: [[SS] [Conj]
{SE} [Ints]], having 'polite' as the extracted term. They created 11 different patterns with a finite set of terms
in each tag and were able to extract 633 unique terms out of 7.5M twitter corpus. In order to be able to match
more patterns, we added to those tags the part of speech tags labeled by MADAMIRA comprising different
patterns as detailed in Table 2. For example [Adjective + [Ints]] would match 'very excellent'.
We also benefited from the fact that the Arabic language has embedded 'ال' in definite nouns and
two consecutive nouns are usually a concept like 'االخٕزة انسىُاث' ('recent years') that can be extracted easily
using the pattern [{} ال }{ ال]. Although using syntactic patterns is considered a heuristic method, it extracted
MOV ATT RES#2 PROD RES#1 HTL
#Sentences 1524 2154 2642 4272 8364 15572
#Positive sentences 969 2073 2109 3101 5946 10775
#Negative sentences 384 81 268 863 2418 2647
#Mixed sentences 171 0 265 308 0 2150
#unique concepts 18511 9100 5862 4654 26000 41046
5. Int J Elec & Comp Eng ISSN: 2088-8708
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet (Hend G. Hassan)
4019
better concepts than using MADAMIRA's noun phrases and verb phrases alone considering the ambiguities
associated with Arabic.
Table 2. Set of Patterns with Matched Examples
Pattern
number
Syntactic pattern
Pattern's
Exception
Example of a
match
Bing Translation Concept
Noun phrases
P1
انـ{ }انـ{ }
َانـ{ }انـ{ }
نال{ }انـ{ }
كبنـ{ }انـ{ }
فبنـ{ }انـ{ }
ببنـ{ }انـ{ }
نهـ{ }انـ{ }
Second word
in [ِانذ, ّانخ ,
ّانه, etc.]
األخٕزة انسىُاث
ْانقٍز َانُسُاس
انجدٔدة نألفكبر
انكُوٕت ُْكبنق
ٓانحكُم فبنمسئُل
ٓاألَن ببندرجت
انسزٔعت نهُجببث
recent years
and obsessive compulsive disorder
to new ideas
as cosmic powers
the government official
in first class
for fast food
recent_year
obsessive_compulsive_disorder
new_idea
cosmic_power
government_official
first_class
fast_food
P2
انـ{ }Adjective
َانـ{ }Adjective
فبنـ{ }Adjective
جدٔد انفٕهم
ٓعبن َانسعز
جٕد فبنفٕهم
New movie
And the price is high
It is a good movie
new_movie
price_high
good_movie
P3 { }ٔه{ }ٔه
[انذٔه,بٕه , مب
بٕه, etc.]
مخخهفٕه بمقبسٕه With different sizes different_size
P4 { }َن{ }َن مدربُن انعبمهُن Trained workers Trained
P5 { }ان{ }ان
[ان,كبن ,َان ,
etc.]
فزٔدحبن شخصٕخبن Unique personalities Unique
P6 Adjective + [Ints] ممخبسجدا Very excellent Excellent
P7 Noun + adjective سزٔعت بحزكبث Quick movements quick_movement
P8
[P_pron] + Adj.
[P_ref] + Adj.
[D_pron] + Adj.
Pattern
followed by
'مه'
محظُظ اوج
معٕىت وبس
األفضم ٌُ
Lucky you
Certain people
Is best
Lucky
certain
best
Verb Phrases
P9 Verb + noun َقج َقضٕج I spent time spend_time
Procedure: polarity detection
Input:
English translation of the patterns
Output:
Polarity, Pleasantness, Attention, Sensitivity, Aptitude, Semantics
Begin:
For each pattern in the sentence:
Remove punctuations
Remove stopwords
lemmatize the nouns
If the first word's tag is verb
Lemmatize the verb
Search for a match in SenticNet
End
Sum the polarities of each sentence
If classifying into three classes
If (sum >= 1)
Positive
Else if (sum <=- 1)
Negative
Else
Mixed
End
Else If classifying into two classes
If (sum >= 0)
Positive
Else if (sum <0)
Negative
End
End
End
Figure 4. Pseudo code for polarity detection
Next, we used Microsoft Bing translator to translate the matches to English. Having English on the
output side of the machine translation system and not translating concepts from SenticNet into Arabic avoids
the ambiguity of different dialect candidates and different sentence structures; Arabic is one of the languages
6. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 5, October 2018 : 4015 – 4022
4020
that has multiple sentence forms [subject-verb-object (SVO), verb-subject-object (VSO), verb-object-subject
(VOS) asides to the possibility of having a correct sentence dropping a verb or copula].
At last, the translated extracted concepts follow the steps presented in Figure 4 to match the same
form of SenticNet's concepts where nouns are singular and verbs are lemmatized. If a match is found in
SenticNet then the polarity value is read. If not a search for a match for the first word of the concept is done
as it is usually an adjective for example 'رائع 'فىدق (wonderful hotel) would extract 'wonderful' if the whole
concept 'wonderful_hotel' is not found in SenticNet.
4. RESULTS AND DISCUSSION
In order to properly evaluate the performance of the proposed framework, we used the leading
measuring methods in the NLP Classification process: precision, recall, f-measure and accuracy that are
shown by values in table3 respectively for each dataset in case of 2-class classification problem (positive and
negative) and 3-class classification problem (positive, negative and mixed).We highlighted the datasets with
the best scores revealing that the 2-class classification problem has better results than the 3-class
classification problem for the same dataset. The same result was also obtained in [20]. And to show the
difference between our method and existing ones, we compared these results with the ones obtained in [20] in
which they used the same dataset we used but while using the statistical methods (See Table 4). The reported
average accuracy in Table 3 is the average of all accuracies reported after using different lexicon based
features. Their reported accuracy is a result of training 80% of the data with a machine learning classifier and
calculating accuracy on a 20% test sample from the same data with the classifier. Furthermore, best accuracy
score in the 2-class classification happens for the ATT dataset. This could be explained by the fact that it has
more concepts extracted as compared to RES#2 that has more sentences but fewer concepts and for which it
scored the second best accuracy.
Table 3. The Value of Precision, Recall, F-measure and accuracy respectively for each Dataset
P R F1 Acc. ElSahar's work average accuracy
3-class
ATT There are no mixed polarity reviews in the ATT dataset Not mentioned
PROD .57 .45 .49 .45 0.51
MOV .51 .62 .54 .62 0.47
RES#2 .72 .72 .71 .72 0.57
HTL .55 .62 .58 .62 0.64
2-class
ATT .96 .86 .91 .89 Not mentioned
PROD .78 .91 .84 .73 0.74
MOV .73 .91 .81 .70 0.69
RES#2 .91 .91 .91 .85 0.81
HTL .81 .87 .84 .73 0.85
Table 4. Comparison between our Results and ElSahar's Results
Average accuracy
Dataset lexicon
3-class 2-class
ElSahar's work .56 .77 2k un normalized entries
Proposed framework .60 .75 96k unique entries
Figure 5 is a boxplot of the number of words and concepts for each dataset and it shows that ATT
has more words in the sentence than RES#2 causing more concepts. Although MOV dataset has relatively
more concepts, it scored last. That can be explained by the fact that it has the longest review length as it has
2530 words in one of the reviews. The 3-class classification problem has the same ranking order except for
the PROD dataset that has fallen behind as it has uni-gram reviews.
On the other hand, ElSahar 's lexicon [20] has around 2000 entries (uni-grams and bi-grams) that are
not normalized nor lemmatized;'أوصح' (I recommend) , 'بشزاء اوصح' (I recommend to buy) ,'ًب اوصح '
(recommend it) ,' بٍب اوصح ' (I recommend it/female pronoun),' اوصحكم ' (I recommend you ) are all entries and
all of them has the same lemma 'recommend' while we were able to extract around 69 k unique entries after
removing redundancy from the different datasets. Furthermore, we used a test sample from the dataset
(RES#1) in order to validate our lexicon by following the same steps in the framework while skipping the
translation step as shown in Figure 6. We were able to match 68% of the concepts extracted from RES#1 in
the lexicon. The accuracy obtained was 70% and the precision was 70% with a recall of 100% and an F-
measure of 82%.
7. Int J Elec & Comp Eng ISSN: 2088-8708
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet (Hend G. Hassan)
4021
Figure 5. Boxplot's section showing the number of words and concepts for each dataset
Figure 6. Flow chart for the testing framework of the lexicon dataset
5. CONCLUSION AND FUTURE WORK
A novel framework for concept-level sentiment analysis was introduced to detect the polarity of
Arabic sentences using Senticnet. The framework is created so that it can handle ambiguity issues associated
with Arabic including the fact that slang Arabic lacks syntactic rules and tools to deal with and it also doesn't
include using any machine learning algorithm.The framework was tested on a multi-domain dataset covering
public reviews scrapped from the internet. The results showed promising performance as the accuracy
reached 89%. it also outperformed other research works in terms of detecting the polarity of a sentence
without having prior annotated data. In the future, we plan on Handling Polarity inversion terms such as
negations and also defining the scope of each negating term along the sentence.
REFERENCES
[1] E. Cambira, et al., ―The CLSA Model: A Novel Framework for Concept-Level Sentiment Analysis‖, 16th
international Conference,CICLing 2015, Part II, pp. 3-22, 2015.
[2] E. Cambira, ―An Introduction to Concept-Level Sentiment Analysis‖, The 12th Mexican International Conference
on Artificial Intelligence, Part II, pp. 478-483, 2013.
[3] N. Y. Habash, ―Introduction to Arabic Natural Language Processing‖, Synthesis Lectures on Human Language
Technologies 3.1, 2010.
[4] A. Dhokrat, et al., ―Review on Opinion Mining for Fully Fledged System‖, Indonesian Journal of Electrical
Engineering and Informatics (IJEEI), vol. 4, no. 2, pp. 141-148, 2016.
[5] P. Kumar and S. Nandagopalan, ―Insights to Problems, Research Trend and Progress in Techniques of Sentiment
Analysis‖, International Journal of Electrical and Computer Engineering (IJECE), vol. 7, no. 5, pp. 2818-2822,
2017.
[6] H. Liu and P. Singh, ―ConceptNet — A Practical Common Sense Reasoning Tool-kit‖, BT Technology Journal,
vol. 22, no. 4, pp. 211-226, 2004.
[7] E. Cambria, et al., ―SenticNet: A Publicly Available Semantic Resource for Opinion Mining‖, AAAI Fall
Symposium: Commonsense Knowledge, vol. 10, 2010.
[8] E. Cambria, et al., ―SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis‖,
FLAIRS conference, pp. 202-207, 2012.
[9] E. Cambria, et al., ―SenticNet 3: A Common and Common-Sense Knowledge Base for Cognition-Driven Sentiment
Analysis‖, Twenty-eighth AAAI conference on artificial intelligence, 2014.
8. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 5, October 2018 : 4015 – 4022
4022
[10] E. Cambria, et al., ―SenticNet 4: A Semantic Resource for Sentiment Analysis based on Conceptual Primitives‖,
COLING, pp. 2666-2677, 2016.
[11] P. Arora, et al., ―An Approach for Big Data to Evolve the Auspicious Information from Cross-Domains‖,
International Journal of Electrical and Computer Engineering (IJECE), vol. 7, no. 2,pp. 967-974, 2017.
[12] A. AlOwisheq, et. al., ―Arabic Sentiment Analysis Resources: a Survey‖, Social Computing and Social Media: 8th
International Conference, pp. 267-278, 2016.
[13] K. Dashtipour, et al., ―Multilingual Sentiment Analysis: State of the Art and Independent Comparison of
Techniques Cognitive Computation‖, vol. 8, no. 4, pp. 757-771, 2016.
[14] Al-Twairesh, et al., ―Subjectivity and Sentiment Analysis of Arabic: Trends and Challenges‖, 11th International
Conference on Computer Systems and Applications (AICCSA), IEEE/ACS, pp. 148-155, 2014.
[15] K. Ravi and V. Ravi, ―A Survey on opinion mining and sentiment Analysis: Tasks, Approaches and Applications‖,
Knowledge-Based Systems, vol. 89, pp. 14-46, 2015.
[16] M. A. Mageed, et al., ―SAMAR: A System for Subjectivity and Sentiment Analysis of Arabic Social Media‖,
Computer Speech & Language, vol/issue: 28(1), pp. 20-37, 2014.
[17] T. Joachims, ―Svmlight: Support vector machine‖, Cornell University, 2008.
[18] H. Ibrahim, et al., ―Sentiment Analysis for Modern Standard Arabic and Colloquial‖, International Journal on
Natural Language Computing (IJNLC), vol. 4, no. 2, pp. 967-974, 2015.
[19] E. Cambria and A. Hussain, ―Sentic Computing A Common-Sense-Based Framework for Concept-Level Sentiment
Analysis‖, Springer, vol. 1, 2015.
[20] H. ElSahar and S. R. El-Beltagy, ―Building Large Arabic Multi-domain Resources for Sentiment Analysis‖, 16th
international Conference,CICLing 2015, Part II, pp. 23-34, 2015.
[21] B. Pang, et al., ―Thumbs up? Sentiment Classification using Machine Learning Techniques‖, Proceedings of the
ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79-86, 2002.
[22] A. Pasha, et. al., ―MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of
Arabic‖, LREC, 2014.
[23] N. Habash, et. al., ―MADA+TOKAN:A Toolkit for Arabic Tokenization, Diacritization, Morphological
Disambiguation, POS Tagging, Stemming and Lemmatization‖, Proceedings of the 2nd International Conference
on Arabic Language Resources and Tools(MEDAR), Cairo, Egypt, pp. 102-10.
[24] M. Diab, ―Second Generation AMIRA Tools for Arabic Processing Fast and Robust Tokenization, POS tagging,
and Base Phrase Chunking‖, Proceedings of KONVENS, pp. 39-52, 2012.
[25] H. ElSahar and S. R. El-Beltagy, ―A Fully Automated Approach for Arabic Slang Lexicon Extraction from
Microblogs‖, 15th International Conference, CICLing 2014, Part I, pp. 79-91, 2014.