SlideShare a Scribd company logo
1 of 6
Download to read offline
TELKOMNIKA, Vol.16, No.3, June 2018, pp. 1345~1350
ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013
DOI: 10.12928/TELKOMNIKA.v16i3.7751  1345
Received November 7, 2017; Revised March 25, 2018; Accepted April 3, 2018
Improving Sentiment Analysis of Short Informal
Indonesian Product Reviews using Synonym Based
Feature Expansion
M. Ali Fauzi*, Ro'i Fahreza Nur Firmansyah, Tri Afirianto
Faculty of Computer Science, Universitas Brawijaya, Jl. Veteran, Malang, Indonesia
*Corresponding author, e-mail: moch.ali.fauzi@ub.ac.id
Abstract
Sentimentanalysis in short informal texts like productreviews is more challenging. Shorttexts are
sparse, noisy, and lack of context information. Traditional text classification methods may not be suitable
for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these
problems is to enrich the original texts with additional semantics to make itappear like a large document of
text. Then, traditional classification methods can be applied to it. In this study, we developed an automatic
sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based
Feature Expansion. The system consists of three main stages, preprocessing and normalization, features
expansion and classification. After preprocessing and normalization, we utilize Kateglo to find some
synonyms of every words in original texts and append them. Finally, the text is classified using Naïve
Bayes. The experiment shows that the proposed method can improve the performance of sentiment
analysis of short informal Indonesian productreviews.The bestsentimentclassification performance using
proposed feature expansion is obtained by accuracy of 98%.The experiment also show that feature
expansion will give higher improvement in small number of training data than in the large number of them.
Keywords:productreview, sentimentanalysis,shorttext, feature expansion,classification
Copyright © 2018 Universitas Ahmad Dahlan. All rights reserved.
1. Introduction
Sentiment analysis is one of the fundamental problems in natural language
processing (NLP). Sentiment analysis involves analyzing people’s opinions towards entities
such as products, services, organizations, individuals, issues, events, topics, and their attributes
from a piece of text [1]. Sentiment Analysis has a number of applications, including ranking
products and merchants [2-3], predicting election results [4], predicting box-office revenues for
movies [5-7], predicting the stock market [8], characterizing social relations [9], and etc. The rise
of social media make us now dealing with much more short informal texts every day. Examples
are tweets, status updates, comments, and reviews from various social platforms. Working with
these short informal text genre is more challenging compared to traditional text genres because
there are many limitations in this genre. Thus, there is growing interest in sentiment analysis of
this kind of texts.
Sentiment classification is a special kind of text classification problem with two class,
positive and negative. Since it is a text classification problem, any existing supervised learning
method can be applied. In most cases, the use of statistical or machine learning techniques
such as Naive Bayes, Maximum Entropy, and Support Vector Machines has proven to be
successful in this field [10-16]. Some previous works also use another supervised learning such
as method decision tree and K-Nearest Neighbor (KNN) to analyze sentiment within
texts [17-19]. Those researches showed that standard machine learning methods using unigram
(bag of words) as features perform very well in this field.
Short texts are sparse, noisy, and lack of context information. Traditional text
classification methods like machine learning may not be suitable for analyzing sentiment of
short texts given all those difficulties. A common approach to overcome these problems for
analyzing sentiment of short texts is to enrich the original texts with additional semantics to
make it appear like a large document of text. Then, traditional classification methods can be
applied to it. Some of the previous works employ search engines to extract more information
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1345-1350
1346
about the short text [20-22]. The other works utilize external sources such as Wikipedia and
WordNet as background knowledge [23-26].
In this study, we conducted automatic sentiment analysis of short informal Indonesian
product reviews. This is very useful because it allows review to be aggregated without manual
intervention. Consumers can utilize this information to research products before buying.
Marketers can utilize this to research public opinion of their products. Organizations can also
utilize this to get critical feedback about problems in their newly released products. The reviews
are gathered from a social platforms that provides reviews from users about certain product.
Every review in this platform is a short informal Indonesian text that express positive or negative
opinion about the product. Most of the reviews are short texts with informal language, creative
spelling and punctuation, misspellings, and slang word. This paper aims to improve short text
bag of words representation for sentiment analysis. We developed automatic sentiment analysis
system of short informal Indonesian texts using Naïve Bayes and Synonym Based Feature
Expansion. In the first step, we counduct preprocessing normalizing misspellings and slang
words. In the next we use Kateglo API (kateglo.co.id) to find synonym of each word in texts to
enrich the original texts. Finally, we do classification using Naïve Bayes Classifier and bag of
words as the features.
2. Research Method
In general, as seen in Figure 1, the sentiment analysis system in this study consists of
three main stages, preprocessing and normalization, features expansion and classification. The
first stage involves several steps including tokenization, stopwords removal, stemming and
misspellings words normalization. In this stage, we also counduted negation convert. In the
feature expansion stage, we use Kateglo API to find synonym of each word in the review texts.
Then, the synonym will be added to the original texts. Finally, in the sentiment classification
stage, Naïve Bayes is trained using some training data and the expanded review texts will be
classified using Naïve Bayes and bag of words as its features.
Figure 1. System Main Flowchart
TELKOMNIKA ISSN: 1693-6930 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews… (M. Ali Fauzi)
1347
2.1. Preprocessing
Preprocessing involves tokenization, stopwords removal, stemming and misspellings
words normalization, and negation convert. Tokenization is an early process done to remove
punctuation, numbers, and characters other than the alphabet [27-30]. Also in this stage will be
coundected case folding, which is changing all capital letters into lowercase. Stopwords removal
or filtering is removing uninformative words referring to the existing stopword dictionary. In this
case, we use stoplist by Tala that have been used in [31]. Stemming is a process to convert
every words to its root. This process is done by removing affixes such as prefix, infix and suffix.
In this case, we use Nazief-Adriani Stemmer [32]. Misspellings words normalization is done by
changing the words into its formal form. Examples are the word “ga” become “tidak” and the
word “bisaaaa” become “bisa”. Negation convert is a process of converting negation words
contained in a sentence. The negation words has influence in changing the value of sentiment
in a sentence. The most used negation words in Indonesian Language are "tidak", "bukan",
"tak", "tanpa", "kurang" and "jangan". The negation convert is done by finding the antonym of
the word that been negated. For example, negation convert of “tidak bagus” is “jelek”.
2.2. Feature expansion
Feature expansion is process of enriching the original texts with additional semantics to
make it appear like a large document of text [33]. In this study, we utilize Kateglo to find some
synonyms of every words in original texts and append them. Kateglo is a dictionary website that
provides API for fetching word attributes such as lexical class, root form, synonym, antonym,
etc. Our system will find the synonyms of a certain word by sending some parameter to URL
http://kateglo.com/api.php?format=json&phrase=[word]. Then, our system will parse the Json
data received from the kateglo server and use them for feature expansion as seen in Figure 2.
Figure 2. Feature expansion steps
2.3. Classification using naïve bayes
Naïve Bayes is one of the most effective and efficient inductive learning algorithms for
machine learning and data mining. The performance of Naive Bayes is competitive in the
classification process although it uses the assumption of attribute independence. The
assumption of the independence of these attributes on the data is rare, but although the
assumption of attribute independence is violated, the performance of Naive Bayes classification
is quite high, as proven by various empirical studies [34-35].
The Naïve Bayes classification is incorporated into the Bayes learning algorithm
constructed by training data to estimate the probability of each category contained in the
characteristics of the testing document. In general, the classification process using the Naïve
Bayes method can be seen in equation 1.
)(
)|()(
)|(
i
jij
ij
wP
cwPcP
wcP

 (1)
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1345-1350
1348
where )|( ij wcP is posterior or the probability of class jc given word iw . )( jcP is prior or
the probability of class jc . )|( ji cwP is likelihood or the probability of word iw given class jc .
Then, )( iwP is evidence or the probability of word iw .
The probability of word iw actually can be eliminated in the classification process
because the probability of word iw will not affect the comparison of classification results
between categories. Thus, the process of classification can be simplified to be equation 2.
)|()()|( jijij cwPcPwcP  (2)
3. Results and Analysis
Dataset that was used in the experiment is mobile banking app reviews. There are 100
testing data consisting of 50 positive reviews and 50 negative reviews. Meanwhile, the training
data used varies from 50, 100, 400 to 1000 training data. This experiment was conducted to
determine the effect of feature expansion and the number of training data on the sentiment
classification performance. The evaluation method used in this experiment is accuracy.
Experiment results shown in Figure 3.
Figure 3. The experiment results
As displayed in Figure 3, the best sentiment classification accuracy is obtained when
the training data is 400 using feature expansion by 98%. The experiment results using feature
expansion on every number of training data always have better classification accuracy
compared to the ones that not using feature expansion. This results show that feature
expansion increase the sentiment classification performance. In short-text classification, many
words in the testing data never appear in the training data. It can damage the sentiment
classification performance. Using feature expansion, the system append some new words, in
this case the synonyms of each word in testing data, to the testing data before the classification
stage. Therefore, the vocabulary on the testing data will be richer and the probability of training
85%
90%
98%
96%
76%
82%
93% 93%
65%
70%
75%
80%
85%
90%
95%
100%
50 100 400 1000
Accuracy
The number of trainingdata
Sentment Analysis UsingNaiveBayes and Feature
Expansion
Naïve Bayes using Feature Expansion
Naïve Bayes without using Feature Expansion
TELKOMNIKA ISSN: 1693-6930 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews… (M. Ali Fauzi)
1349
data and testing data share the same words will increase. Thus, sentiment classification
process will produce better performance compared without using feature expansion.
Also from Figure 3, can be seen that the number of training data does have effect on
the sentiment classification performance either with feature expansion, or without feature
expansion. The more the training data, the higher accuracy obtained. The highest accuracy
difference between sentiment classifications using feature expansion and not using feature
expansion occurs when training data used is minimal. This difference will be closer along with
the increasing number of training data. This result show that feature expansion will have bigger
influence in small training data. In the large number of training data, the word in testing data that
will be expanded most likely has already appeared in the train data. Hence, in this case, using
feature expansion does not increase the sentiment classification performance significantly.
4. Conclusion and Future Works
The proposed method, Synonym based feature expansion, had been proven can
improve the performance of sentiment analysis of short informal Indonesian product reviews.
Based on the experiment, Naïve Bayes classifier that use feature expansion always have better
classification accuracy compared to the ones that not using feature expansion. The best
sentiment classification performance is obtained when the training data is 400 using feature
expansion by accuraty of 98%. The number of training data also affect on the sentiment
classification performance either with feature expansion, or without feature expansion. The
more the training data, the higher the accuracy obtained. The highest accuracy difference
between sentiment classifications using feature expansion and not using feature expansion
occurs when training data used is minimal. This difference is decreasing along with the
increasing number of training data. This result show that feature expansion will give bigger
improvement in small training data than in the large number of training data.
References
[1] Liu B. Sentimentanalysis and opinion mining. Synthesis lectures on human language technologies.
2012; 5(1):1-67.
[2] McGlohon M, Glance NS, Reiter Z. Star Quality: Aggregating Reviews to Rank Products and
Merchants. In ICWSM. 2010.
[3] Rosi F, Fauzi MA, Perdana RS. Prediksi Rating Pada Review Produk Kecantikan Menggunakan
Metode Naïve Bayes dan Categorical Proportional Difference (CPD). Jurnal Pengembangan
Teknologi Informasi dan Ilmu Komputer. 2018; 2(5): 1991-1997.
[4] Tumasjan A, Sprenger TO, Sandner PG, Welpe IM. Predicting elections with twitter: What 140
characters reveal about political sentiment. Icwsm. 2010;10(1):178-85.
[5] Asur S, Huberman BA. Predicting the future with social media. In Proceedings of the 2010
IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.
2010; 01:492-499.
[6] Joshi M, Das D, Gimpel K, Smith NA. Movie reviews and revenues:An experiment in text regression.
InHuman Language Technologies. The 2010 Annual Conference of the North American Chapter of
the Association for Computational Linguistics. 2010; :293-296.
[7] Sadikov E, Parameswaran AG, Venetis P. Blogs as Predictors of Movie Success. In ICWSM. 2009.
[8] Bollen J, Mao H, Zeng X. Twitter mood predicts the stock market. Journal of computational science.
2011; 2(1):1-8.
[9] Groh G, Hauffa J. Characterizing Social Relations Via NLP-Based Sentiment Analysis. In ICWSM.
2011.
[10] Aue, A. and Gamon, M. Customizing sentiment classifiers to new domains: A case study.
In Proceedings of recent advances in natural language processing (RANLP). 2005; 1(3.1):2-1.
[11] Tan, S., Wu, G., Tang, H. and Cheng, X. A novel scheme for domain-transfer problem in the context
of sentiment analysis. In Proceedings ofthe sixteenth ACM conference on Conference on information
and knowledge management. 2007; :979-982.
[12] Antinasari P, Perdana RS, Fauzi MA. Analisis Sentimen Tentang Opini Film Pada Dokumen Twitter
Berbahasa Indonesia Menggunakan Naive Bayes Dengan Perbaikan Kata Tidak Baku. Jurnal
Pengembangan Teknologi Informasi dan Ilmu Komputer. 2017; 1(12):1733-1741.
[13] Gunawan F, Fauzi MA, Adikara PP. Analisis Sentimen Pada Ulasan Aplikasi Mobile Menggunakan
Naive Bayes dan Normalisasi Kata Berbasis Levenshtein Distance (Studi Kasus Aplikasi BCA
Mobile). Systemic: Information System and Informatics Journal. 2017;3(2):1-6.
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1345-1350
1350
[14] Fanissa S, Fauzi MA, Adinugroho S. Analisis Sentimen Pariwisata di Kota Malang Menggunakan
Metode Naive Bayes dan Seleksi Fitur Query Expansion Ranking. Jurnal Pengembangan Teknologi
Informasi dan Ilmu Komputer.2018; 2(8): 2766 - 2770.
[15] Rofiqoh U, Perdana RS, Fauzi MA. Analisis Sentimen Tingkat Kepuasan Pengguna Penyedia
Layanan Telekomunikasi Seluler Indonesia Pada Twitter Dengan Metode Support Vector Machine
dan Lexicon Based Features. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2017;
1(12): 1725-1732.
[16] Lestari AR, Perdana RS, Fauzi MA. Analisis Sentimen Tentang Opini Pilkada Dki 2017 Pada
Dokumen Twitter Berbahasa Indonesia Menggunakan Näive Bayes dan Pembobotan Emoji. Jurnal
Pengembangan Teknologi Informasi dan Ilmu Komputer. 2017;1(12): 1718-1724.
[17] Nurjanah WE, Perdana RS, Fauzi MA. Analisis Sentimen Terhadap Tayangan Televisi Berdasarkan
Opini Masyarakat pada Media Sosial Twitter menggunakan Metode K-Nearest Neighbor dan
Pembobotan Jumlah Retweet. Jurnal Pengembangan Teknologi Informasi dan Ilmu
Komputer.2017;1(12):1750-1757.
[18] Mentari ND, Fauzi MA, Muflikhah L. Analisis Sentimen Kurikulum 2013 Pada Sosial Media Twitter
Menggunakan Metode K-NearestNeighbor dan Feature Selection Query Expansion Ranking. Jurnal
Pengembangan Teknologi Informasi dan Ilmu Komputer.2018;2(8): 2739 – 2743.
[19] Bilal, M., Israr, H., Shahid, M. and Khan, A. Sentiment classification of Roman-Urdu opinions using
Naïve Bayesian,Decision Tree and KNN classification techniques. Journal of King Saud University-
Computer and Information Sciences. 2016; 28(3):330-344.
[20] Bollegala, D., Matsuo, Y. and Ishizuka, M. Measuring semantic similarity between words using web
search engines. www. 2007; 7(757-766).
[21] Sahami, M. and Heilman, T.D. A web-based kernel function for measuring the similarity of short text
snippets. In Proceedings of the 15th international conference on World Wide Web. 2006; :377-386.
[22] Yih, W.T. and Meek, C. Improving similarity measures for short segments of text. In AAAI.
2007; 7(7):1489-1494.
[23] Chen, M., Jin, X. and Shen, D. Short text classification improved by learning multi-granularity topics.
In IJCAI. 2011; :1776-1781.
[24] Hu, X., Sun, N., Zhang, C. and Chua, T.S. Exploiting internal and external semantics for the
clustering of short texts using world knowledge. In Proceedings of the 18th ACM conference on
Information and knowledge management. 2009; :919-928.
[25] Phan, X.H., Nguyen, L.M. and Horiguchi, S. Learning to classify short and sparse text & web with
hidden topics from large-scale data collections. In Proceedings of the 17th international conference
on World Wide Web. 2008; :91-100.
[26] Jin, O., Liu, N.N., Zhao, K., Yu, Y. and Yang, Q. Transferring topical knowledge from auxiliary long
texts for short text clustering. In Proceedings ofthe 20th ACM international conference on Information
and knowledge management. 2011; :775-784.
[27] Fauzi, M.A., Arifin, A.Z. and Yuniarti, A. Arabic Book Retrieval using Class and Book Index Based
Term Weighting. International Journal of Electrical and Computer Engineering (IJECE). 2017; 7(6).
[28] Pramukantoro, E.S. and Fauzi, M.A. Comparative analysis of string similarity and corpus-based
similarity for automatic essay scoring system on e-learning gamification. In Advanced Computer
Science and Information Systems (ICACSIS), 2016 International Conference on. 2016; :149-155.
[29] Fauzi, M.A., Arifin, A. and Yuniarti, A. Term Weighting Berbasis Indeks Buku dan Kelas untuk
Perangkingan Dokumen Berbahasa Arab. Lontar Komputer: Jurnal Ilmiah Teknologi Informasi.
2013; 5(2).
[30] Fauzi, M.A., Utomo, D.C., Setiawan, B.D. and Pramukantoro, E.S. Automatic Essay Scoring System
Using N-Gram and Cosine Similarity for Gamification Based E-Learning. In Proceedings of the
International Conference on Advances in Image Processing. 2017; :151-155.
[31] Suharno CF, Fauzi MA, Perdana RS. Klasifikasi Teks Bahasa Indonesia Pada Dokumen Pengaduan
SambatOnline Menggunakan Metode K-NearestNeighbors Dan Chi-Square. Systemic: Information
System and Informatics Journal. 2017 Dec 7;3(1):25-32.
[32] Adriani, M., Asian, J., Nazief, B., Tahaghoghi, S.M. and Williams, H.E. Stemming Indonesian: A
confix-stripping approach. ACM Transactions on Asian Language Information Processing (TALIP).
2007; 6(4):1-33.
[33] Yusuf S, Fauzi MA, Brata KC. Sistem Temu Kembali Informasi Pasal-Pasal KUHP (Kitab Undang-
Undang Hukum Pidana) Berbasis Android Menggunakan Metode Synonym Recognition dan Cosine
Similarity. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2018;2(2):838-847.
[34] Qiang, G. An effective algorithm for improving the performance of Naïve Bayes for text classification.
In 2010 Second International Conference on Computer Research and Development. 2010.
[35] Fauzi MA, Arifin AZ, Gosaria SC. Indonesian News Classification Using Naïve Bayes and Two-Phase
Feature Selection Model. Indonesian Journal ofElectrical Engineering and Computer Science. 2017
Dec 1;8(3):610-5.

More Related Content

What's hot

SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWJournal For Research
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion dannyijwest
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysisZahid Azam
 
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...IRJET Journal
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment AnalysisRebecca Williams
 
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer EvaluationMachine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer Evaluationijnlc
 
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET Journal
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet IJECEIAES
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Studyvivatechijri
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsEditor IJMTER
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276IJMER
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmIJSRD
 
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...IRJET Journal
 
IRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisIRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisGangasagar Patil
 
Project sentiment analysis
Project sentiment analysisProject sentiment analysis
Project sentiment analysisBob Prieto
 
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word EmbeddingIRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word EmbeddingIRJET Journal
 

What's hot (20)

SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
 
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment Analysis
 
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer EvaluationMachine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
 
F0363942
F0363942F0363942
F0363942
 
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie Reviews
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
 
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
 
IRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisIRE2014-Sentiment Analysis
IRE2014-Sentiment Analysis
 
Novel Scoring System for Identify Accurate Answers for Factoid Questions
Novel Scoring System for Identify Accurate Answers for Factoid QuestionsNovel Scoring System for Identify Accurate Answers for Factoid Questions
Novel Scoring System for Identify Accurate Answers for Factoid Questions
 
2
22
2
 
Sentiment analysis on_unstructured_review-1
Sentiment analysis on_unstructured_review-1Sentiment analysis on_unstructured_review-1
Sentiment analysis on_unstructured_review-1
 
Project sentiment analysis
Project sentiment analysisProject sentiment analysis
Project sentiment analysis
 
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word EmbeddingIRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
 

Similar to Improving Sentiment Analysis of Short Informal Indonesian Product Reviews using Synonym Based Feature Expansion

Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewINFOGAIN PUBLICATION
 
Implementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain OntologyImplementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain OntologyIOSR Journals
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsKimberly Pulley
 
The sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionThe sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionEditorIJAERD
 
Sentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfSentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfOmSatpathy
 
Sentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online ReviewsSentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online Reviewsiosrjce
 
Lexicon Based Emotion Analysis on Twitter Data
Lexicon Based Emotion Analysis on Twitter DataLexicon Based Emotion Analysis on Twitter Data
Lexicon Based Emotion Analysis on Twitter Dataijtsrd
 
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageA Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageJeff Nelson
 
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar Text
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar TextSenti-Lexicon and Analysis for Restaurant Reviews of Myanmar Text
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar TextIJAEMSJORNAL
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisijfcstjournal
 
Hybrid Classifier for Sentiment Analysis using Effective Pipelining
Hybrid Classifier for Sentiment Analysis using Effective PipeliningHybrid Classifier for Sentiment Analysis using Effective Pipelining
Hybrid Classifier for Sentiment Analysis using Effective PipeliningIRJET Journal
 
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUESA SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUESJournal For Research
 
A fuzzy logic based on sentiment
A fuzzy logic based on sentimentA fuzzy logic based on sentiment
A fuzzy logic based on sentimentIJDKP
 
IRJET- A Survey on Graph based Approaches in Sentiment Analysis
IRJET- A Survey on Graph based Approaches in Sentiment AnalysisIRJET- A Survey on Graph based Approaches in Sentiment Analysis
IRJET- A Survey on Graph based Approaches in Sentiment AnalysisIRJET Journal
 
A survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion miningA survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion miningeSAT Publishing House
 
A survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion miningA survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion miningeSAT Journals
 
Evaluating sentiment analysis and word embedding techniques on Brexit
Evaluating sentiment analysis and word embedding techniques on BrexitEvaluating sentiment analysis and word embedding techniques on Brexit
Evaluating sentiment analysis and word embedding techniques on BrexitIAESIJAI
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 

Similar to Improving Sentiment Analysis of Short Informal Indonesian Product Reviews using Synonym Based Feature Expansion (20)

Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A Review
 
Implementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain OntologyImplementation of Semantic Analysis Using Domain Ontology
Implementation of Semantic Analysis Using Domain Ontology
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
 
The sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionThe sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regression
 
Sentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfSentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdf
 
L017358286
L017358286L017358286
L017358286
 
Sentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online ReviewsSentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online Reviews
 
Lexicon Based Emotion Analysis on Twitter Data
Lexicon Based Emotion Analysis on Twitter DataLexicon Based Emotion Analysis on Twitter Data
Lexicon Based Emotion Analysis on Twitter Data
 
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageA Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
 
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar Text
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar TextSenti-Lexicon and Analysis for Restaurant Reviews of Myanmar Text
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar Text
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
 
Hybrid Classifier for Sentiment Analysis using Effective Pipelining
Hybrid Classifier for Sentiment Analysis using Effective PipeliningHybrid Classifier for Sentiment Analysis using Effective Pipelining
Hybrid Classifier for Sentiment Analysis using Effective Pipelining
 
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUESA SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
 
A fuzzy logic based on sentiment
A fuzzy logic based on sentimentA fuzzy logic based on sentiment
A fuzzy logic based on sentiment
 
IRJET- A Survey on Graph based Approaches in Sentiment Analysis
IRJET- A Survey on Graph based Approaches in Sentiment AnalysisIRJET- A Survey on Graph based Approaches in Sentiment Analysis
IRJET- A Survey on Graph based Approaches in Sentiment Analysis
 
Aq35241246
Aq35241246Aq35241246
Aq35241246
 
A survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion miningA survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion mining
 
A survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion miningA survey on sentiment analysis and opinion mining
A survey on sentiment analysis and opinion mining
 
Evaluating sentiment analysis and word embedding techniques on Brexit
Evaluating sentiment analysis and word embedding techniques on BrexitEvaluating sentiment analysis and word embedding techniques on Brexit
Evaluating sentiment analysis and word embedding techniques on Brexit
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 

More from TELKOMNIKA JOURNAL

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesTELKOMNIKA JOURNAL
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
 

More from TELKOMNIKA JOURNAL (20)

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antenna
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fire
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio network
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bands
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertainties
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensor
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networks
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imaging
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
 

Recently uploaded

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 

Recently uploaded (20)

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 

Improving Sentiment Analysis of Short Informal Indonesian Product Reviews using Synonym Based Feature Expansion

  • 1. TELKOMNIKA, Vol.16, No.3, June 2018, pp. 1345~1350 ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013 DOI: 10.12928/TELKOMNIKA.v16i3.7751  1345 Received November 7, 2017; Revised March 25, 2018; Accepted April 3, 2018 Improving Sentiment Analysis of Short Informal Indonesian Product Reviews using Synonym Based Feature Expansion M. Ali Fauzi*, Ro'i Fahreza Nur Firmansyah, Tri Afirianto Faculty of Computer Science, Universitas Brawijaya, Jl. Veteran, Malang, Indonesia *Corresponding author, e-mail: moch.ali.fauzi@ub.ac.id Abstract Sentimentanalysis in short informal texts like productreviews is more challenging. Shorttexts are sparse, noisy, and lack of context information. Traditional text classification methods may not be suitable for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these problems is to enrich the original texts with additional semantics to make itappear like a large document of text. Then, traditional classification methods can be applied to it. In this study, we developed an automatic sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based Feature Expansion. The system consists of three main stages, preprocessing and normalization, features expansion and classification. After preprocessing and normalization, we utilize Kateglo to find some synonyms of every words in original texts and append them. Finally, the text is classified using Naïve Bayes. The experiment shows that the proposed method can improve the performance of sentiment analysis of short informal Indonesian productreviews.The bestsentimentclassification performance using proposed feature expansion is obtained by accuracy of 98%.The experiment also show that feature expansion will give higher improvement in small number of training data than in the large number of them. Keywords:productreview, sentimentanalysis,shorttext, feature expansion,classification Copyright © 2018 Universitas Ahmad Dahlan. All rights reserved. 1. Introduction Sentiment analysis is one of the fundamental problems in natural language processing (NLP). Sentiment analysis involves analyzing people’s opinions towards entities such as products, services, organizations, individuals, issues, events, topics, and their attributes from a piece of text [1]. Sentiment Analysis has a number of applications, including ranking products and merchants [2-3], predicting election results [4], predicting box-office revenues for movies [5-7], predicting the stock market [8], characterizing social relations [9], and etc. The rise of social media make us now dealing with much more short informal texts every day. Examples are tweets, status updates, comments, and reviews from various social platforms. Working with these short informal text genre is more challenging compared to traditional text genres because there are many limitations in this genre. Thus, there is growing interest in sentiment analysis of this kind of texts. Sentiment classification is a special kind of text classification problem with two class, positive and negative. Since it is a text classification problem, any existing supervised learning method can be applied. In most cases, the use of statistical or machine learning techniques such as Naive Bayes, Maximum Entropy, and Support Vector Machines has proven to be successful in this field [10-16]. Some previous works also use another supervised learning such as method decision tree and K-Nearest Neighbor (KNN) to analyze sentiment within texts [17-19]. Those researches showed that standard machine learning methods using unigram (bag of words) as features perform very well in this field. Short texts are sparse, noisy, and lack of context information. Traditional text classification methods like machine learning may not be suitable for analyzing sentiment of short texts given all those difficulties. A common approach to overcome these problems for analyzing sentiment of short texts is to enrich the original texts with additional semantics to make it appear like a large document of text. Then, traditional classification methods can be applied to it. Some of the previous works employ search engines to extract more information
  • 2.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1345-1350 1346 about the short text [20-22]. The other works utilize external sources such as Wikipedia and WordNet as background knowledge [23-26]. In this study, we conducted automatic sentiment analysis of short informal Indonesian product reviews. This is very useful because it allows review to be aggregated without manual intervention. Consumers can utilize this information to research products before buying. Marketers can utilize this to research public opinion of their products. Organizations can also utilize this to get critical feedback about problems in their newly released products. The reviews are gathered from a social platforms that provides reviews from users about certain product. Every review in this platform is a short informal Indonesian text that express positive or negative opinion about the product. Most of the reviews are short texts with informal language, creative spelling and punctuation, misspellings, and slang word. This paper aims to improve short text bag of words representation for sentiment analysis. We developed automatic sentiment analysis system of short informal Indonesian texts using Naïve Bayes and Synonym Based Feature Expansion. In the first step, we counduct preprocessing normalizing misspellings and slang words. In the next we use Kateglo API (kateglo.co.id) to find synonym of each word in texts to enrich the original texts. Finally, we do classification using Naïve Bayes Classifier and bag of words as the features. 2. Research Method In general, as seen in Figure 1, the sentiment analysis system in this study consists of three main stages, preprocessing and normalization, features expansion and classification. The first stage involves several steps including tokenization, stopwords removal, stemming and misspellings words normalization. In this stage, we also counduted negation convert. In the feature expansion stage, we use Kateglo API to find synonym of each word in the review texts. Then, the synonym will be added to the original texts. Finally, in the sentiment classification stage, Naïve Bayes is trained using some training data and the expanded review texts will be classified using Naïve Bayes and bag of words as its features. Figure 1. System Main Flowchart
  • 3. TELKOMNIKA ISSN: 1693-6930  Improving Sentiment Analysis of Short Informal Indonesian Product Reviews… (M. Ali Fauzi) 1347 2.1. Preprocessing Preprocessing involves tokenization, stopwords removal, stemming and misspellings words normalization, and negation convert. Tokenization is an early process done to remove punctuation, numbers, and characters other than the alphabet [27-30]. Also in this stage will be coundected case folding, which is changing all capital letters into lowercase. Stopwords removal or filtering is removing uninformative words referring to the existing stopword dictionary. In this case, we use stoplist by Tala that have been used in [31]. Stemming is a process to convert every words to its root. This process is done by removing affixes such as prefix, infix and suffix. In this case, we use Nazief-Adriani Stemmer [32]. Misspellings words normalization is done by changing the words into its formal form. Examples are the word “ga” become “tidak” and the word “bisaaaa” become “bisa”. Negation convert is a process of converting negation words contained in a sentence. The negation words has influence in changing the value of sentiment in a sentence. The most used negation words in Indonesian Language are "tidak", "bukan", "tak", "tanpa", "kurang" and "jangan". The negation convert is done by finding the antonym of the word that been negated. For example, negation convert of “tidak bagus” is “jelek”. 2.2. Feature expansion Feature expansion is process of enriching the original texts with additional semantics to make it appear like a large document of text [33]. In this study, we utilize Kateglo to find some synonyms of every words in original texts and append them. Kateglo is a dictionary website that provides API for fetching word attributes such as lexical class, root form, synonym, antonym, etc. Our system will find the synonyms of a certain word by sending some parameter to URL http://kateglo.com/api.php?format=json&phrase=[word]. Then, our system will parse the Json data received from the kateglo server and use them for feature expansion as seen in Figure 2. Figure 2. Feature expansion steps 2.3. Classification using naïve bayes Naïve Bayes is one of the most effective and efficient inductive learning algorithms for machine learning and data mining. The performance of Naive Bayes is competitive in the classification process although it uses the assumption of attribute independence. The assumption of the independence of these attributes on the data is rare, but although the assumption of attribute independence is violated, the performance of Naive Bayes classification is quite high, as proven by various empirical studies [34-35]. The Naïve Bayes classification is incorporated into the Bayes learning algorithm constructed by training data to estimate the probability of each category contained in the characteristics of the testing document. In general, the classification process using the Naïve Bayes method can be seen in equation 1. )( )|()( )|( i jij ij wP cwPcP wcP   (1)
  • 4.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1345-1350 1348 where )|( ij wcP is posterior or the probability of class jc given word iw . )( jcP is prior or the probability of class jc . )|( ji cwP is likelihood or the probability of word iw given class jc . Then, )( iwP is evidence or the probability of word iw . The probability of word iw actually can be eliminated in the classification process because the probability of word iw will not affect the comparison of classification results between categories. Thus, the process of classification can be simplified to be equation 2. )|()()|( jijij cwPcPwcP  (2) 3. Results and Analysis Dataset that was used in the experiment is mobile banking app reviews. There are 100 testing data consisting of 50 positive reviews and 50 negative reviews. Meanwhile, the training data used varies from 50, 100, 400 to 1000 training data. This experiment was conducted to determine the effect of feature expansion and the number of training data on the sentiment classification performance. The evaluation method used in this experiment is accuracy. Experiment results shown in Figure 3. Figure 3. The experiment results As displayed in Figure 3, the best sentiment classification accuracy is obtained when the training data is 400 using feature expansion by 98%. The experiment results using feature expansion on every number of training data always have better classification accuracy compared to the ones that not using feature expansion. This results show that feature expansion increase the sentiment classification performance. In short-text classification, many words in the testing data never appear in the training data. It can damage the sentiment classification performance. Using feature expansion, the system append some new words, in this case the synonyms of each word in testing data, to the testing data before the classification stage. Therefore, the vocabulary on the testing data will be richer and the probability of training 85% 90% 98% 96% 76% 82% 93% 93% 65% 70% 75% 80% 85% 90% 95% 100% 50 100 400 1000 Accuracy The number of trainingdata Sentment Analysis UsingNaiveBayes and Feature Expansion Naïve Bayes using Feature Expansion Naïve Bayes without using Feature Expansion
  • 5. TELKOMNIKA ISSN: 1693-6930  Improving Sentiment Analysis of Short Informal Indonesian Product Reviews… (M. Ali Fauzi) 1349 data and testing data share the same words will increase. Thus, sentiment classification process will produce better performance compared without using feature expansion. Also from Figure 3, can be seen that the number of training data does have effect on the sentiment classification performance either with feature expansion, or without feature expansion. The more the training data, the higher accuracy obtained. The highest accuracy difference between sentiment classifications using feature expansion and not using feature expansion occurs when training data used is minimal. This difference will be closer along with the increasing number of training data. This result show that feature expansion will have bigger influence in small training data. In the large number of training data, the word in testing data that will be expanded most likely has already appeared in the train data. Hence, in this case, using feature expansion does not increase the sentiment classification performance significantly. 4. Conclusion and Future Works The proposed method, Synonym based feature expansion, had been proven can improve the performance of sentiment analysis of short informal Indonesian product reviews. Based on the experiment, Naïve Bayes classifier that use feature expansion always have better classification accuracy compared to the ones that not using feature expansion. The best sentiment classification performance is obtained when the training data is 400 using feature expansion by accuraty of 98%. The number of training data also affect on the sentiment classification performance either with feature expansion, or without feature expansion. The more the training data, the higher the accuracy obtained. The highest accuracy difference between sentiment classifications using feature expansion and not using feature expansion occurs when training data used is minimal. This difference is decreasing along with the increasing number of training data. This result show that feature expansion will give bigger improvement in small training data than in the large number of training data. References [1] Liu B. Sentimentanalysis and opinion mining. Synthesis lectures on human language technologies. 2012; 5(1):1-67. [2] McGlohon M, Glance NS, Reiter Z. Star Quality: Aggregating Reviews to Rank Products and Merchants. In ICWSM. 2010. [3] Rosi F, Fauzi MA, Perdana RS. Prediksi Rating Pada Review Produk Kecantikan Menggunakan Metode Naïve Bayes dan Categorical Proportional Difference (CPD). Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2018; 2(5): 1991-1997. [4] Tumasjan A, Sprenger TO, Sandner PG, Welpe IM. Predicting elections with twitter: What 140 characters reveal about political sentiment. Icwsm. 2010;10(1):178-85. [5] Asur S, Huberman BA. Predicting the future with social media. In Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. 2010; 01:492-499. [6] Joshi M, Das D, Gimpel K, Smith NA. Movie reviews and revenues:An experiment in text regression. InHuman Language Technologies. The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2010; :293-296. [7] Sadikov E, Parameswaran AG, Venetis P. Blogs as Predictors of Movie Success. In ICWSM. 2009. [8] Bollen J, Mao H, Zeng X. Twitter mood predicts the stock market. Journal of computational science. 2011; 2(1):1-8. [9] Groh G, Hauffa J. Characterizing Social Relations Via NLP-Based Sentiment Analysis. In ICWSM. 2011. [10] Aue, A. and Gamon, M. Customizing sentiment classifiers to new domains: A case study. In Proceedings of recent advances in natural language processing (RANLP). 2005; 1(3.1):2-1. [11] Tan, S., Wu, G., Tang, H. and Cheng, X. A novel scheme for domain-transfer problem in the context of sentiment analysis. In Proceedings ofthe sixteenth ACM conference on Conference on information and knowledge management. 2007; :979-982. [12] Antinasari P, Perdana RS, Fauzi MA. Analisis Sentimen Tentang Opini Film Pada Dokumen Twitter Berbahasa Indonesia Menggunakan Naive Bayes Dengan Perbaikan Kata Tidak Baku. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2017; 1(12):1733-1741. [13] Gunawan F, Fauzi MA, Adikara PP. Analisis Sentimen Pada Ulasan Aplikasi Mobile Menggunakan Naive Bayes dan Normalisasi Kata Berbasis Levenshtein Distance (Studi Kasus Aplikasi BCA Mobile). Systemic: Information System and Informatics Journal. 2017;3(2):1-6.
  • 6.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1345-1350 1350 [14] Fanissa S, Fauzi MA, Adinugroho S. Analisis Sentimen Pariwisata di Kota Malang Menggunakan Metode Naive Bayes dan Seleksi Fitur Query Expansion Ranking. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer.2018; 2(8): 2766 - 2770. [15] Rofiqoh U, Perdana RS, Fauzi MA. Analisis Sentimen Tingkat Kepuasan Pengguna Penyedia Layanan Telekomunikasi Seluler Indonesia Pada Twitter Dengan Metode Support Vector Machine dan Lexicon Based Features. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2017; 1(12): 1725-1732. [16] Lestari AR, Perdana RS, Fauzi MA. Analisis Sentimen Tentang Opini Pilkada Dki 2017 Pada Dokumen Twitter Berbahasa Indonesia Menggunakan Näive Bayes dan Pembobotan Emoji. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2017;1(12): 1718-1724. [17] Nurjanah WE, Perdana RS, Fauzi MA. Analisis Sentimen Terhadap Tayangan Televisi Berdasarkan Opini Masyarakat pada Media Sosial Twitter menggunakan Metode K-Nearest Neighbor dan Pembobotan Jumlah Retweet. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer.2017;1(12):1750-1757. [18] Mentari ND, Fauzi MA, Muflikhah L. Analisis Sentimen Kurikulum 2013 Pada Sosial Media Twitter Menggunakan Metode K-NearestNeighbor dan Feature Selection Query Expansion Ranking. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer.2018;2(8): 2739 – 2743. [19] Bilal, M., Israr, H., Shahid, M. and Khan, A. Sentiment classification of Roman-Urdu opinions using Naïve Bayesian,Decision Tree and KNN classification techniques. Journal of King Saud University- Computer and Information Sciences. 2016; 28(3):330-344. [20] Bollegala, D., Matsuo, Y. and Ishizuka, M. Measuring semantic similarity between words using web search engines. www. 2007; 7(757-766). [21] Sahami, M. and Heilman, T.D. A web-based kernel function for measuring the similarity of short text snippets. In Proceedings of the 15th international conference on World Wide Web. 2006; :377-386. [22] Yih, W.T. and Meek, C. Improving similarity measures for short segments of text. In AAAI. 2007; 7(7):1489-1494. [23] Chen, M., Jin, X. and Shen, D. Short text classification improved by learning multi-granularity topics. In IJCAI. 2011; :1776-1781. [24] Hu, X., Sun, N., Zhang, C. and Chua, T.S. Exploiting internal and external semantics for the clustering of short texts using world knowledge. In Proceedings of the 18th ACM conference on Information and knowledge management. 2009; :919-928. [25] Phan, X.H., Nguyen, L.M. and Horiguchi, S. Learning to classify short and sparse text & web with hidden topics from large-scale data collections. In Proceedings of the 17th international conference on World Wide Web. 2008; :91-100. [26] Jin, O., Liu, N.N., Zhao, K., Yu, Y. and Yang, Q. Transferring topical knowledge from auxiliary long texts for short text clustering. In Proceedings ofthe 20th ACM international conference on Information and knowledge management. 2011; :775-784. [27] Fauzi, M.A., Arifin, A.Z. and Yuniarti, A. Arabic Book Retrieval using Class and Book Index Based Term Weighting. International Journal of Electrical and Computer Engineering (IJECE). 2017; 7(6). [28] Pramukantoro, E.S. and Fauzi, M.A. Comparative analysis of string similarity and corpus-based similarity for automatic essay scoring system on e-learning gamification. In Advanced Computer Science and Information Systems (ICACSIS), 2016 International Conference on. 2016; :149-155. [29] Fauzi, M.A., Arifin, A. and Yuniarti, A. Term Weighting Berbasis Indeks Buku dan Kelas untuk Perangkingan Dokumen Berbahasa Arab. Lontar Komputer: Jurnal Ilmiah Teknologi Informasi. 2013; 5(2). [30] Fauzi, M.A., Utomo, D.C., Setiawan, B.D. and Pramukantoro, E.S. Automatic Essay Scoring System Using N-Gram and Cosine Similarity for Gamification Based E-Learning. In Proceedings of the International Conference on Advances in Image Processing. 2017; :151-155. [31] Suharno CF, Fauzi MA, Perdana RS. Klasifikasi Teks Bahasa Indonesia Pada Dokumen Pengaduan SambatOnline Menggunakan Metode K-NearestNeighbors Dan Chi-Square. Systemic: Information System and Informatics Journal. 2017 Dec 7;3(1):25-32. [32] Adriani, M., Asian, J., Nazief, B., Tahaghoghi, S.M. and Williams, H.E. Stemming Indonesian: A confix-stripping approach. ACM Transactions on Asian Language Information Processing (TALIP). 2007; 6(4):1-33. [33] Yusuf S, Fauzi MA, Brata KC. Sistem Temu Kembali Informasi Pasal-Pasal KUHP (Kitab Undang- Undang Hukum Pidana) Berbasis Android Menggunakan Metode Synonym Recognition dan Cosine Similarity. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer. 2018;2(2):838-847. [34] Qiang, G. An effective algorithm for improving the performance of Naïve Bayes for text classification. In 2010 Second International Conference on Computer Research and Development. 2010. [35] Fauzi MA, Arifin AZ, Gosaria SC. Indonesian News Classification Using Naïve Bayes and Two-Phase Feature Selection Model. Indonesian Journal ofElectrical Engineering and Computer Science. 2017 Dec 1;8(3):610-5.