SlideShare a Scribd company logo
1 of 4
Download to read offline
International Journal of Trend in Scientific Research and Development (IJTSRD)
Volume 3 Issue 6, October 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470
@ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 332
An Empirical Study on Comment Classification
Shubham Derhgawen, Rajesh Tak, Himaja Gogineni, Subhasish Chatterjee
Department of Information Technology, Dhole Patil College of Engineering, Pune, Maharashtra, India
ABSTRACT
Due to increasing technologies in the interactive web applications, there has
been a lot of development in E commerce and online social networking
activities. The comments or the post always playsa vital roleinunderstanding
of the attitude towards a particular topic, product of the online users. Most of
the times these comments or posts help the other users to understand the
scenario and to take the right decision on the web platform. Machine learning
plays a vital role to understand and to estimate the accurate semantics of
these posts and comments. Natural language processing is widely used for
this, Most of the times natural language processing does not yield much
expected results in the classification of these comments due to thecomplexity
in the narration. These complexities generally arise either due to poor
narration of the comments or highly sarcastic contents in the comments.Soto
overcome these problems this paper broadly studies all the past work on
comment classification and try to find the new way of machine learning to get
the highly classified labels of the comments.
KEYWORDS: Natural Language Processing, Machine learning, Comment
classification, labels
How to cite this paper: Shubham
Derhgawen | Rajesh Tak | Subhasish
Chatterjee "An Empirical Study on
Comment Classification" Published in
International Journal
of Trend in Scientific
Research and
Development(ijtsrd),
ISSN: 2456-6470,
Volume-3 | Issue-6,
October 2019,
pp.332-335, URL:
https://www.ijtsrd.com/papers/ijtsrd280
53.pdf
Copyright © 2019 by author(s) and
International Journal of Trend inScientific
Research and DevelopmentJournal.Thisis
an Open Access article distributed under
the terms of the Creative Commons
Attribution License
(CC BY 4.0)
(http://creativecom
mons.org/licenses/by/4.0)
I. INTRODUCTION
Communication in humans has evolved drastically since the
early ages. Humans in the early stoneagedid notdevelopthe
extensive language that we are accustomed to today but
relied heavily on sign language assisted with a number of
different calls. These calls then slowly took the form of a
language, with various different words having a different
meaning.
The language was developed due to the need to convey the
thoughts and opinions. The language also helped our
ancestors to remain together in groups peacefully and also
help coordinate various tasks between them. It was highly
useful to hunter and gatherers which relied on language to
hunt effectively as a team and also gather useful resources
and avoid the harmful items.
The language was further refined through the ages and the
vocabulary has been ever-increasing to accommodate the
various emotions, opinions, and expertise. Language is very
inherent to humans and highly critical to our survival. It
helps convey variousmessagesandalsocontributetosociety
as a whole.
Language has enabled humans to achieve a lot of
advancements, this is through the accumulationofyearsand
generations of research that is availableintheformofbooks.
The books were only possible due to the invention of
language and have enabled humans to reach even further
heights through the knowledge acquired and spread across
the world.
There are a number of languages around the world, most of
them are very similar to one another and almost all of them
are composed of words which are in turn composed of
individual letters. This is due to the way a human brain
processes the information. Which is very different from a
computer which can only organize orunderstandstructured
data.
The computer can work with very high efficiency on
structured data such as tables in a database and
spreadsheets. Which is very different from the way humans
perceive information or language. Most oftheinformationin
the world consists of raw text in various languages which is
unstructured. Therefore, one of the most demandingtasksis
to translate this information or design a computer that can
understand and perceive unstructured languages such as
language and speech. This is a very challenging task to
achieve which can be done with the help of Artificial
Intelligence, which can emulate the various processes that
happen in a human brain while it processes information.
There has been a lot of interest in a number of programmers
since the advent of computers and computer languages to
develop programs that are capable of understandinghuman
language. As humans have been writing for thousands of
years now, it would be highly helpful if the computer could
understand the written text.
Computers still can’t effectively understand the English
language thoroughly like another human would be able to,
the understanding is highly limited due to the complex
IJTSRD28053
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 333
nature of the language and the various unspoken rules that
have been ingrained in the human understanding of the
language does not make it easier.
Therefore, NLP, which stands for Natural Language
processing, is very crucial as a field where there has been
ongoing research on this topic. Due to the fact that most of
the English language is highly inconsistent and sometimes
requires a lot of context and sub-context to understand a
given statement at any point in time. This is problematicand
not easy to solve as Natural Language Processing is a very
broad field.
This is helped by a number of researchersperforminga lotof
research into the field of Natural Language Processing.
Various updated libraries are now accessible and open for
public use that can be utilized to achieve Natural Language
Processing in any application. But due to thelack oflogicand
consistency in the English language, most of the time,
machine learning gives a confusing output.
This can be traced to the fact that the extraction of meaning
from a sentence written in English is extremely difficult.
Therefore, for machine learning, the complicated problem
needs to be broken down into smaller parts. These smaller
chunks then can be individually processed by the machine
learning algorithm by chaining, this process is called
Tokenization and help achieve efficient machine learning
models.
Sarcasm is one of the most difficult and almost impossibleto
detect by the traditional form of Natural Language
Processing applications. Sarcasm detection is a very
daunting task for a computer, due to the fact that not most
humans can still detect sarcasm effectively. Sarcasm in a
language is something that is used to express a bitter taunt
or a gibe aimed at something or someone. It is highly
challenging to ascertain the sarcasm in normal speech as it
can seem to pass by undetected.Mostofthesarcasmtendsto
be contradictory in nature and is usually of a polarizing
nature, either objectively negative or objectively positive.
This is usually very difficult to detect.
Another characteristic of the sarcastic comments that are
very complicated is that most of the sarcastic statements
refer to various different events at the same time. This taps
into the general knowledge, logical reasoningandanaphoras
resolution to fully grasp the underlying meaning of the
statement.
Sarcasm analysis is one of the most essential concepts to
develop an accurate model for the Natural Language
processing and the eventual classification of the comments
and limiting the amount of spam that occurs online,
especially on the social media websites. Humans tend to be
highly hateful and such poisonous acts need to be nipped in
the bud and an automated systemcapableofachievingthisis
very necessary.
This paper dedicates section 2 for analysis of past work as
literature survey and section 3 concludes the paper with
feasible statement of the literature study.
II. LITERATURE SURVEY
Siswanto [1] explains that motorsports such as the MotoGP
attract a lot of audience in the electronic and print media.
This is highly evident in the electronic media duetothelarge
advancements in the cyberspace ushering the new digital
era. This makes it easier for the people to access the
information through social media, the website or the
comments on certain trending topics on the internet.
Therefore, the researchers in this paper utilize the
comments on social media websites such as Twitter on the
topic of MotoGP with the help of naïve Bayes theorem and
Support Vector Machine to mine the text. The major
drawback of this research is that the accuracy has been
capped at 95.5% and cannot be further increased.
N. Chandra states that there has been increasing
contributions to the online community such as social media
websites and blogs. This increases the incidence of some
anti-social elements in society trying to pollute people’s
brains by commenting. This is a very malicious activity and
can even be highly racist towards certain communities.
Therefore, to reduce the instances of this activity, the
authors have applied machinelearningalgorithmstoclassify
the anti-social comments based on the K- NearestNeighbors
algorithm [2]. The major drawback in this technique is the
lack of conclusive experimental evidence of the superiority
of this technique.
A. Ikeda elaborates that there has been the addition of an
indispensable commodity in the common people’s lives,
videos. Videos form an irreplaceable component of a
person’s life where millions watch a really high number of
videos, most of these videos have a discussion section
dedicated towards active public comments.Theseparticular
comments provide a lot of information and insight into the
public opinion as anyonecan posta commentcontributingto
the discussion which is highly useful for providing
advertisements. Therefore, the authors have annotated the
comments on the Nico Douga website which posts video
streams [3]. The annotations aresuccessfullyusedtoclassify
the comments. The drawback inthisstudyisthattheauthors
have done annotation only on the referred contents, which
has room for further improvement.
F. Prabowo introduces the most popular social media
website around the world and especially in Indonesia,
Instagram. The Instagram social media caters to a large
number of users and is also home to a lot of sellers who are
actively displaying their content and selling items through
social media. All of the posts on the website can be
commented on and mostshopownerscommunicatethrough
the comment section with their customers. Therefore, the
researchers have utilized a statistical approachtoclassifying
the comments [4]. The authors have deployedSVM(Support
Vector Machine)andCNN (Convolutional Neural network)to
achieve the classification which is based on feature
extraction. The proposedmodel achievesanaccuracyof84%
which is one of the major drawbacks of this approach.
M. Takeda proposes an innovative technique for the
classification of comments posted on various web services
and social media websites such as twitter, amazon, etc.Most
of the time the research regarding the text classification is
done the most basic approach has been to use the Bag of
Words technique. The researchers in this paper have not
utilized this but instead, have used the tree kernels to
classify the tweets and other comments [5]. The authors
have also deviated from the research to create a video
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 334
retrieval service for tourism-related content and have left a
lot of room for improvement in the classification model.
M. Ibrahim informs that there has been a steady increase in
the amount of harassment online and casesofcyberbullying.
This is due to the large advancements in technology and
access to a social media and internet that has led to a rise in
these cases which is a cause of concern to a lot of online
communities and public forums [6]. Therefore, the authors
present an innovative technique for the classification of
comments using deep learning and data augmentation
techniques. The researchers utilized CNNs (Convolutional
Neural Networks), LSTM (Long Short-Term Memory) to
achieve promising results. The researchers only classified
the toxic comments on the website which is a very limited
approach to this classification task.
W. Jitsakul takes a different approach towards the comment
classification task, due to the fact that most reviews and
comments can be tagged and reported as spam,
inappropriate. Positive, negative, etc. this feedback
mechanism is utilized with the combination of a text
classifier. The authors in this study have attempted to
classify the comments by the customers on the basis of
positive or negative remarks. The system has been
demonstrated with the helo of experiments conducted on a
dataset and achieves an average classification accuracy of
80% which can be further improved. [7]
M. Andriansyah [8] explains that social media has been the
greatest source of most cyberbullying cases being reported.
This is mostly done on the discussion forums and comments
on various posts that have been posted on social media
websites. To ameliorate this effect, the authors have
designed a methodology for the classification on comments
on a social media website such as Instagram with the help of
SVM (Support Vector Machine). The researchers have
applied this technique on the Indonesian Instagram dataset
and achieved satisfactory results. But the majordrawback of
this research is that average classificationaccuracyachieved
is 70% which is very less.
J. Savigny elaborates that there has been anincreaseinvideo
streaming websites on the internet, especially YouTube
which has been very popular around the globe with an
extremely large user base. All of the videos posted on the
website have a discussion space where the users can post
their opinions in the form of comments. The researchers in
this study aimed to perform the emotion classificationofthe
comments based on labels such as fear, disgust, surprise,
angry, sad and happy [9]. The authors utilized Word
embedding to achieve their classification. The experimental
results on a YouTube database yielded an accuracy of 76%
which is very low for a classification technique.
T. Peng introduces phishing attacks which are one of the
most dangerous and common attacks that are being utilized
by attackers nowadays. Phishing attacks are also one of the
least defended security attacks in this day and age.
Therefore, to increase the security and provide a solution to
this problem, the authors present an innovative technique
based on NLP (Natural Language Processing) to analyze and
detect the signs of a phishing attack [10]. The researchers
perform a semantic analysis of the text on a website to
determine if it has been created with malicious intent to
steal consumer data. The major drawback in this technique
is that it has not been tested adequately to perform its
performance analysis.
H. Shen states that there has been an increase in the variety
as well as the number of logs being produced every day. As
there has been an increase in the production of logs of
various sizes and types, the processing accuracy and speed
of various technique utilized for the purpose of analysis of
logs has been decreasing. Therefore, the management and
querying of the logs have led to a significant increase in the
cost. Therefore, the authors in this paper present a unique
technique for the analysis of based on log layering and NLP
(Natural Language Processing). The experimental analysis
on a database of logs revealed that the proposed technique
has improved the system by 40%. The major drawback is
that the computational complexity has been reduced with
the help of NLP but it has not been fully utilized to its
potential. [11]
K. Sintoris explains thatbusinessprocessmodelsarea highly
useful resource and vital for the organization for its growth
and stability. Therefore, the authors in this paper propose a
technique for the extraction of Business Process model with
the implementation of NLP (Natural Language Processing)
techniques [12]. The researchers applied the Natural
Language Processing to generate various business process
models automatically based on the existing documentation.
This study is a good example of how NLP can be used to
extract a lot of information from a given text efficiently. The
major drawback in this paper is that this technique has not
been effectively experimented for performance evaluation.
P. Gupta elaborates the difficulties thatarisewhenmanaging
modern databases as most of them containa largeamount of
structured data which is highly problematictoanalyze.Butit
is essential to process all that information to gain valuable
insight into the process and the data. Thus, to eliminate this
problem, the authors in this paper have proposed an
Intelligent Querying system that deploysa Natural Language
Processing technique to achieve highly accurate processing
of the Queries and in turn the database [13]. The major
drawback in this technique is the high time and space
complexities of this system.
Z. Zong introduces Natural Language Processing as a system
for a machine-based automatictranslation.Theauthorsstate
that there has not been an efficient and accurate automatic
translation system that can translate the given text
accurately into another language. The researchers believe
that this gap in the system can be bridged with the help of
the implementation of the Natural Language Processing
technique. As NLP is one of the most powerful approachesto
understand the semantics of the human language, it would
be highly helpful in achieving high levels of accuracy if
applied in translation. The major drawback in this paper is
that it is a statistical study and not a complete
implementation of this technique. [14]
P. Rani [15] presents an innovative system for a voice-
controlled home automation system, which is reliant on IoT
(Internet of Things), NLP (Natural LanguageProcessing)and
AI (Artificial Intelligence) to achieve its goals. The authors
have utilized the Internet of Thingsplatformtoconnectmost
of the electronic appliances in the home. The Natural
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 335
Language Processing is used to interpretandunderstandthe
voice commands with the help of Artificial intelligence to
achieve Home Automation. The major drawback in this
technique is that the NLP application is highly limited and
can be improved further.
III. CONCLUSION
Classification of the comments which are posted on the web
is always a tough task. This is due to the complexities lies in
the narration of the comments by the different users, which
unable the artificial intelligence system to take much more
effort to create the classification labels. This classification of
the comments broadly helps to understand and undermine
the possible usage of the comments to the system or the
other users. This paper studies most of the traditional and
new arenas of the comment classification system to unleash
the backdrop of the major problems. As a solution for this ,
this paper extensively relays on the machine learning
concepts to handle the sarcasm andcomplex narrationof the
comments to classify them according to different labels like
negative, positive, neutral and many more.
REFERENCES
[1] Siswanto et al, “Classification Analysis of MotoGP
Comments on Media Social Twitter Using Algorithm
Support Vector Machine and Naive Bayes”,
International Conference on Applied Information
Technology and Innovation, 2018.
[2] N. Chandra et al, “Anti-Social Comment Classification
based on kNN Algorithm”, International Conferenceon
Reliability, Infocom Technologies and Optimization
(Trends and Future Directions) (ICRITO), 2017.
[3] A. Ikeda et al, “Classification of Comments on Nico
NicoDouga for Annotation Based on Referred
Contents”, 18th International Conference on Network-
Based Information Systems, 2015.
[4] F. Prabowo and A. Purwarianti, “Instagram Online
Shop’s Comment Classification using Statistical
Approach”, 2nd International Conferences on
Information Technology, Information Systems and
Electrical Engineering (ICITISEE), 2017.
[5] M. Takeda et al, “Classification of Comments by Tree
Kernels Using the Hierarchy of Wikipedia for Tree
Structures”, 5th IIAI International Congress on
Advanced Applied Informatics, 2016.
[6] M. Ibrahim et al, “Imbalanced Toxic Comments
Classification using Data Augmentation and Deep
Learning”, 17th IEEE International Conference on
Machine Learning and Applications (ICMLA), 2018.
[7] W. Jitsakul et al, “Enhancing Comment Feedback
Classification using Text Classifiers with Word
Centrality Measures”, 2nd International Conferenceon
Information Technology (INCIT), 2017.
[8] M. Andriansyah et al, “Cyberbullying Comment
Classification on Indonesian Selebgram Using Support
Vector Machine Method”, Second International
ConferenceonInformaticsandComputing(ICIC),2017.
[9] J. Savigny and A. Purwarianti, “Emotion Classification
on Youtube Comments using Word Embedding”,
International Conference on Advanced Informatics,
Concepts, Theory, and Applications (ICAICTA), 2017.
[10] T. Peng, I. Harris, and Y. Sawa, “Detecting Phishing
Attacks Using Natural Language Processing and
MachineLearning”,12thIEEEInternational Conference
on Semantic Computing, 2018.
[11] H. Shen et al, “Log Layering Based on Natural Language
Processing”, International Conference on Advanced
Communications Technology (ICACT), 2019.
[12] K. Sintoris and K. Vergidis, “Extracting Business
Process Models using Natural Language Processing
(NLP) Techniques”, IEEE 19th Conference on Business
Informatics, 2017.
[13] P. Gupta et al, “IQS- Intelligent Querying System using
Natural Language Processing”, International
Conference on Electronics, Communication and
Aerospace Technology, ICECA, 2017.
[14] Z. Zong and C. Hong, “On Application of Natural
Language Processing in Machine Translation”, 3rd
International Conference on Mechanical, Control and
Computer Engineering (ICMCCE), 2018.
[15] P. Rani et al, “Voice Controlled Home Automation
System Using Natural Language Processing (Nlp) and
Internet of Things (IoT)”, Third International
Conference on Science Technology Engineering &
Management (ICONSTEM), 2017.

More Related Content

Similar to Machine Learning Classifies Online Comments

Informal Essay Examples. Informal Interview Essay Interview Essays Free 30...
Informal Essay Examples. Informal Interview Essay  Interview  Essays  Free 30...Informal Essay Examples. Informal Interview Essay  Interview  Essays  Free 30...
Informal Essay Examples. Informal Interview Essay Interview Essays Free 30...Ashley Rosas
 
An Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep LearningAn Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep LearningIRJET Journal
 
Chatbot
ChatbotChatbot
Chatbotijtsrd
 
Questions On Natural Language Processing
Questions On Natural Language ProcessingQuestions On Natural Language Processing
Questions On Natural Language ProcessingAdriana Wilson
 
An Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingAn Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingScott Faria
 
IRJET- Hand Gesture based Recognition using CNN Methodology
IRJET- Hand Gesture based Recognition using CNN MethodologyIRJET- Hand Gesture based Recognition using CNN Methodology
IRJET- Hand Gesture based Recognition using CNN MethodologyIRJET Journal
 
isl mini project report Akshay
isl mini project report Akshayisl mini project report Akshay
isl mini project report AkshayAkshayKalapgar
 
English Essay Internet. Tulsa Community College
English Essay Internet. Tulsa Community CollegeEnglish Essay Internet. Tulsa Community College
English Essay Internet. Tulsa Community CollegeHeather Hughes
 
Natural Language Processing Theory, Applications and Difficulties
Natural Language Processing Theory, Applications and DifficultiesNatural Language Processing Theory, Applications and Difficulties
Natural Language Processing Theory, Applications and Difficultiesijtsrd
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsVahid Saffarian
 
A Survey Of Current Datasets For Code-Switching Research
A Survey Of Current Datasets For Code-Switching ResearchA Survey Of Current Datasets For Code-Switching Research
A Survey Of Current Datasets For Code-Switching ResearchJim Webb
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...write4
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...write5
 
IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...
IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...
IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...IRJET Journal
 
A Survey on Speech Recognition with Language Specification
A Survey on Speech Recognition with Language SpecificationA Survey on Speech Recognition with Language Specification
A Survey on Speech Recognition with Language Specificationijtsrd
 
How To Make Outline For Essay
How To Make Outline For EssayHow To Make Outline For Essay
How To Make Outline For EssayJulia Slater
 

Similar to Machine Learning Classifies Online Comments (20)

Informal Essay Examples. Informal Interview Essay Interview Essays Free 30...
Informal Essay Examples. Informal Interview Essay  Interview  Essays  Free 30...Informal Essay Examples. Informal Interview Essay  Interview  Essays  Free 30...
Informal Essay Examples. Informal Interview Essay Interview Essays Free 30...
 
An Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep LearningAn Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
 
SAMUEL FULL MSC PROJECT
SAMUEL FULL MSC PROJECTSAMUEL FULL MSC PROJECT
SAMUEL FULL MSC PROJECT
 
Chatbot
ChatbotChatbot
Chatbot
 
LuisValeroInterests
LuisValeroInterestsLuisValeroInterests
LuisValeroInterests
 
Questions On Natural Language Processing
Questions On Natural Language ProcessingQuestions On Natural Language Processing
Questions On Natural Language Processing
 
An Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingAn Overview Of Natural Language Processing
An Overview Of Natural Language Processing
 
IRJET- Hand Gesture based Recognition using CNN Methodology
IRJET- Hand Gesture based Recognition using CNN MethodologyIRJET- Hand Gesture based Recognition using CNN Methodology
IRJET- Hand Gesture based Recognition using CNN Methodology
 
isl mini project report Akshay
isl mini project report Akshayisl mini project report Akshay
isl mini project report Akshay
 
English Essay Internet. Tulsa Community College
English Essay Internet. Tulsa Community CollegeEnglish Essay Internet. Tulsa Community College
English Essay Internet. Tulsa Community College
 
Natural Language Processing Theory, Applications and Difficulties
Natural Language Processing Theory, Applications and DifficultiesNatural Language Processing Theory, Applications and Difficulties
Natural Language Processing Theory, Applications and Difficulties
 
Research 36. How to Write Significance. Code.601.pptx
Research 36. How to Write Significance.  Code.601.pptxResearch 36. How to Write Significance.  Code.601.pptx
Research 36. How to Write Significance. Code.601.pptx
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
A Survey Of Current Datasets For Code-Switching Research
A Survey Of Current Datasets For Code-Switching ResearchA Survey Of Current Datasets For Code-Switching Research
A Survey Of Current Datasets For Code-Switching Research
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...
 
IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...
IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...
IRJET- A Pragmatic Supervised Learning Methodology of Hate Speech Detection i...
 
A Survey on Speech Recognition with Language Specification
A Survey on Speech Recognition with Language SpecificationA Survey on Speech Recognition with Language Specification
A Survey on Speech Recognition with Language Specification
 
How To Make Outline For Essay
How To Make Outline For EssayHow To Make Outline For Essay
How To Make Outline For Essay
 
call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...
 

More from ijtsrd

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementationijtsrd
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...ijtsrd
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospectsijtsrd
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...ijtsrd
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...ijtsrd
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...ijtsrd
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Studyijtsrd
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...ijtsrd
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...ijtsrd
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...ijtsrd
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...ijtsrd
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...ijtsrd
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadikuijtsrd
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...ijtsrd
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...ijtsrd
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Mapijtsrd
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Societyijtsrd
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...ijtsrd
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...ijtsrd
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learningijtsrd
 

More from ijtsrd (20)

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Study
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Map
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Society
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learning
 

Recently uploaded

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 

Recently uploaded (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 

Machine Learning Classifies Online Comments

  • 1. International Journal of Trend in Scientific Research and Development (IJTSRD) Volume 3 Issue 6, October 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470 @ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 332 An Empirical Study on Comment Classification Shubham Derhgawen, Rajesh Tak, Himaja Gogineni, Subhasish Chatterjee Department of Information Technology, Dhole Patil College of Engineering, Pune, Maharashtra, India ABSTRACT Due to increasing technologies in the interactive web applications, there has been a lot of development in E commerce and online social networking activities. The comments or the post always playsa vital roleinunderstanding of the attitude towards a particular topic, product of the online users. Most of the times these comments or posts help the other users to understand the scenario and to take the right decision on the web platform. Machine learning plays a vital role to understand and to estimate the accurate semantics of these posts and comments. Natural language processing is widely used for this, Most of the times natural language processing does not yield much expected results in the classification of these comments due to thecomplexity in the narration. These complexities generally arise either due to poor narration of the comments or highly sarcastic contents in the comments.Soto overcome these problems this paper broadly studies all the past work on comment classification and try to find the new way of machine learning to get the highly classified labels of the comments. KEYWORDS: Natural Language Processing, Machine learning, Comment classification, labels How to cite this paper: Shubham Derhgawen | Rajesh Tak | Subhasish Chatterjee "An Empirical Study on Comment Classification" Published in International Journal of Trend in Scientific Research and Development(ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-6, October 2019, pp.332-335, URL: https://www.ijtsrd.com/papers/ijtsrd280 53.pdf Copyright © 2019 by author(s) and International Journal of Trend inScientific Research and DevelopmentJournal.Thisis an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0) (http://creativecom mons.org/licenses/by/4.0) I. INTRODUCTION Communication in humans has evolved drastically since the early ages. Humans in the early stoneagedid notdevelopthe extensive language that we are accustomed to today but relied heavily on sign language assisted with a number of different calls. These calls then slowly took the form of a language, with various different words having a different meaning. The language was developed due to the need to convey the thoughts and opinions. The language also helped our ancestors to remain together in groups peacefully and also help coordinate various tasks between them. It was highly useful to hunter and gatherers which relied on language to hunt effectively as a team and also gather useful resources and avoid the harmful items. The language was further refined through the ages and the vocabulary has been ever-increasing to accommodate the various emotions, opinions, and expertise. Language is very inherent to humans and highly critical to our survival. It helps convey variousmessagesandalsocontributetosociety as a whole. Language has enabled humans to achieve a lot of advancements, this is through the accumulationofyearsand generations of research that is availableintheformofbooks. The books were only possible due to the invention of language and have enabled humans to reach even further heights through the knowledge acquired and spread across the world. There are a number of languages around the world, most of them are very similar to one another and almost all of them are composed of words which are in turn composed of individual letters. This is due to the way a human brain processes the information. Which is very different from a computer which can only organize orunderstandstructured data. The computer can work with very high efficiency on structured data such as tables in a database and spreadsheets. Which is very different from the way humans perceive information or language. Most oftheinformationin the world consists of raw text in various languages which is unstructured. Therefore, one of the most demandingtasksis to translate this information or design a computer that can understand and perceive unstructured languages such as language and speech. This is a very challenging task to achieve which can be done with the help of Artificial Intelligence, which can emulate the various processes that happen in a human brain while it processes information. There has been a lot of interest in a number of programmers since the advent of computers and computer languages to develop programs that are capable of understandinghuman language. As humans have been writing for thousands of years now, it would be highly helpful if the computer could understand the written text. Computers still can’t effectively understand the English language thoroughly like another human would be able to, the understanding is highly limited due to the complex IJTSRD28053
  • 2. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 333 nature of the language and the various unspoken rules that have been ingrained in the human understanding of the language does not make it easier. Therefore, NLP, which stands for Natural Language processing, is very crucial as a field where there has been ongoing research on this topic. Due to the fact that most of the English language is highly inconsistent and sometimes requires a lot of context and sub-context to understand a given statement at any point in time. This is problematicand not easy to solve as Natural Language Processing is a very broad field. This is helped by a number of researchersperforminga lotof research into the field of Natural Language Processing. Various updated libraries are now accessible and open for public use that can be utilized to achieve Natural Language Processing in any application. But due to thelack oflogicand consistency in the English language, most of the time, machine learning gives a confusing output. This can be traced to the fact that the extraction of meaning from a sentence written in English is extremely difficult. Therefore, for machine learning, the complicated problem needs to be broken down into smaller parts. These smaller chunks then can be individually processed by the machine learning algorithm by chaining, this process is called Tokenization and help achieve efficient machine learning models. Sarcasm is one of the most difficult and almost impossibleto detect by the traditional form of Natural Language Processing applications. Sarcasm detection is a very daunting task for a computer, due to the fact that not most humans can still detect sarcasm effectively. Sarcasm in a language is something that is used to express a bitter taunt or a gibe aimed at something or someone. It is highly challenging to ascertain the sarcasm in normal speech as it can seem to pass by undetected.Mostofthesarcasmtendsto be contradictory in nature and is usually of a polarizing nature, either objectively negative or objectively positive. This is usually very difficult to detect. Another characteristic of the sarcastic comments that are very complicated is that most of the sarcastic statements refer to various different events at the same time. This taps into the general knowledge, logical reasoningandanaphoras resolution to fully grasp the underlying meaning of the statement. Sarcasm analysis is one of the most essential concepts to develop an accurate model for the Natural Language processing and the eventual classification of the comments and limiting the amount of spam that occurs online, especially on the social media websites. Humans tend to be highly hateful and such poisonous acts need to be nipped in the bud and an automated systemcapableofachievingthisis very necessary. This paper dedicates section 2 for analysis of past work as literature survey and section 3 concludes the paper with feasible statement of the literature study. II. LITERATURE SURVEY Siswanto [1] explains that motorsports such as the MotoGP attract a lot of audience in the electronic and print media. This is highly evident in the electronic media duetothelarge advancements in the cyberspace ushering the new digital era. This makes it easier for the people to access the information through social media, the website or the comments on certain trending topics on the internet. Therefore, the researchers in this paper utilize the comments on social media websites such as Twitter on the topic of MotoGP with the help of naïve Bayes theorem and Support Vector Machine to mine the text. The major drawback of this research is that the accuracy has been capped at 95.5% and cannot be further increased. N. Chandra states that there has been increasing contributions to the online community such as social media websites and blogs. This increases the incidence of some anti-social elements in society trying to pollute people’s brains by commenting. This is a very malicious activity and can even be highly racist towards certain communities. Therefore, to reduce the instances of this activity, the authors have applied machinelearningalgorithmstoclassify the anti-social comments based on the K- NearestNeighbors algorithm [2]. The major drawback in this technique is the lack of conclusive experimental evidence of the superiority of this technique. A. Ikeda elaborates that there has been the addition of an indispensable commodity in the common people’s lives, videos. Videos form an irreplaceable component of a person’s life where millions watch a really high number of videos, most of these videos have a discussion section dedicated towards active public comments.Theseparticular comments provide a lot of information and insight into the public opinion as anyonecan posta commentcontributingto the discussion which is highly useful for providing advertisements. Therefore, the authors have annotated the comments on the Nico Douga website which posts video streams [3]. The annotations aresuccessfullyusedtoclassify the comments. The drawback inthisstudyisthattheauthors have done annotation only on the referred contents, which has room for further improvement. F. Prabowo introduces the most popular social media website around the world and especially in Indonesia, Instagram. The Instagram social media caters to a large number of users and is also home to a lot of sellers who are actively displaying their content and selling items through social media. All of the posts on the website can be commented on and mostshopownerscommunicatethrough the comment section with their customers. Therefore, the researchers have utilized a statistical approachtoclassifying the comments [4]. The authors have deployedSVM(Support Vector Machine)andCNN (Convolutional Neural network)to achieve the classification which is based on feature extraction. The proposedmodel achievesanaccuracyof84% which is one of the major drawbacks of this approach. M. Takeda proposes an innovative technique for the classification of comments posted on various web services and social media websites such as twitter, amazon, etc.Most of the time the research regarding the text classification is done the most basic approach has been to use the Bag of Words technique. The researchers in this paper have not utilized this but instead, have used the tree kernels to classify the tweets and other comments [5]. The authors have also deviated from the research to create a video
  • 3. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 334 retrieval service for tourism-related content and have left a lot of room for improvement in the classification model. M. Ibrahim informs that there has been a steady increase in the amount of harassment online and casesofcyberbullying. This is due to the large advancements in technology and access to a social media and internet that has led to a rise in these cases which is a cause of concern to a lot of online communities and public forums [6]. Therefore, the authors present an innovative technique for the classification of comments using deep learning and data augmentation techniques. The researchers utilized CNNs (Convolutional Neural Networks), LSTM (Long Short-Term Memory) to achieve promising results. The researchers only classified the toxic comments on the website which is a very limited approach to this classification task. W. Jitsakul takes a different approach towards the comment classification task, due to the fact that most reviews and comments can be tagged and reported as spam, inappropriate. Positive, negative, etc. this feedback mechanism is utilized with the combination of a text classifier. The authors in this study have attempted to classify the comments by the customers on the basis of positive or negative remarks. The system has been demonstrated with the helo of experiments conducted on a dataset and achieves an average classification accuracy of 80% which can be further improved. [7] M. Andriansyah [8] explains that social media has been the greatest source of most cyberbullying cases being reported. This is mostly done on the discussion forums and comments on various posts that have been posted on social media websites. To ameliorate this effect, the authors have designed a methodology for the classification on comments on a social media website such as Instagram with the help of SVM (Support Vector Machine). The researchers have applied this technique on the Indonesian Instagram dataset and achieved satisfactory results. But the majordrawback of this research is that average classificationaccuracyachieved is 70% which is very less. J. Savigny elaborates that there has been anincreaseinvideo streaming websites on the internet, especially YouTube which has been very popular around the globe with an extremely large user base. All of the videos posted on the website have a discussion space where the users can post their opinions in the form of comments. The researchers in this study aimed to perform the emotion classificationofthe comments based on labels such as fear, disgust, surprise, angry, sad and happy [9]. The authors utilized Word embedding to achieve their classification. The experimental results on a YouTube database yielded an accuracy of 76% which is very low for a classification technique. T. Peng introduces phishing attacks which are one of the most dangerous and common attacks that are being utilized by attackers nowadays. Phishing attacks are also one of the least defended security attacks in this day and age. Therefore, to increase the security and provide a solution to this problem, the authors present an innovative technique based on NLP (Natural Language Processing) to analyze and detect the signs of a phishing attack [10]. The researchers perform a semantic analysis of the text on a website to determine if it has been created with malicious intent to steal consumer data. The major drawback in this technique is that it has not been tested adequately to perform its performance analysis. H. Shen states that there has been an increase in the variety as well as the number of logs being produced every day. As there has been an increase in the production of logs of various sizes and types, the processing accuracy and speed of various technique utilized for the purpose of analysis of logs has been decreasing. Therefore, the management and querying of the logs have led to a significant increase in the cost. Therefore, the authors in this paper present a unique technique for the analysis of based on log layering and NLP (Natural Language Processing). The experimental analysis on a database of logs revealed that the proposed technique has improved the system by 40%. The major drawback is that the computational complexity has been reduced with the help of NLP but it has not been fully utilized to its potential. [11] K. Sintoris explains thatbusinessprocessmodelsarea highly useful resource and vital for the organization for its growth and stability. Therefore, the authors in this paper propose a technique for the extraction of Business Process model with the implementation of NLP (Natural Language Processing) techniques [12]. The researchers applied the Natural Language Processing to generate various business process models automatically based on the existing documentation. This study is a good example of how NLP can be used to extract a lot of information from a given text efficiently. The major drawback in this paper is that this technique has not been effectively experimented for performance evaluation. P. Gupta elaborates the difficulties thatarisewhenmanaging modern databases as most of them containa largeamount of structured data which is highly problematictoanalyze.Butit is essential to process all that information to gain valuable insight into the process and the data. Thus, to eliminate this problem, the authors in this paper have proposed an Intelligent Querying system that deploysa Natural Language Processing technique to achieve highly accurate processing of the Queries and in turn the database [13]. The major drawback in this technique is the high time and space complexities of this system. Z. Zong introduces Natural Language Processing as a system for a machine-based automatictranslation.Theauthorsstate that there has not been an efficient and accurate automatic translation system that can translate the given text accurately into another language. The researchers believe that this gap in the system can be bridged with the help of the implementation of the Natural Language Processing technique. As NLP is one of the most powerful approachesto understand the semantics of the human language, it would be highly helpful in achieving high levels of accuracy if applied in translation. The major drawback in this paper is that it is a statistical study and not a complete implementation of this technique. [14] P. Rani [15] presents an innovative system for a voice- controlled home automation system, which is reliant on IoT (Internet of Things), NLP (Natural LanguageProcessing)and AI (Artificial Intelligence) to achieve its goals. The authors have utilized the Internet of Thingsplatformtoconnectmost of the electronic appliances in the home. The Natural
  • 4. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD28053 | Volume – 3 | Issue – 6 | September - October 2019 Page 335 Language Processing is used to interpretandunderstandthe voice commands with the help of Artificial intelligence to achieve Home Automation. The major drawback in this technique is that the NLP application is highly limited and can be improved further. III. CONCLUSION Classification of the comments which are posted on the web is always a tough task. This is due to the complexities lies in the narration of the comments by the different users, which unable the artificial intelligence system to take much more effort to create the classification labels. This classification of the comments broadly helps to understand and undermine the possible usage of the comments to the system or the other users. This paper studies most of the traditional and new arenas of the comment classification system to unleash the backdrop of the major problems. As a solution for this , this paper extensively relays on the machine learning concepts to handle the sarcasm andcomplex narrationof the comments to classify them according to different labels like negative, positive, neutral and many more. REFERENCES [1] Siswanto et al, “Classification Analysis of MotoGP Comments on Media Social Twitter Using Algorithm Support Vector Machine and Naive Bayes”, International Conference on Applied Information Technology and Innovation, 2018. [2] N. Chandra et al, “Anti-Social Comment Classification based on kNN Algorithm”, International Conferenceon Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), 2017. [3] A. Ikeda et al, “Classification of Comments on Nico NicoDouga for Annotation Based on Referred Contents”, 18th International Conference on Network- Based Information Systems, 2015. [4] F. Prabowo and A. Purwarianti, “Instagram Online Shop’s Comment Classification using Statistical Approach”, 2nd International Conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE), 2017. [5] M. Takeda et al, “Classification of Comments by Tree Kernels Using the Hierarchy of Wikipedia for Tree Structures”, 5th IIAI International Congress on Advanced Applied Informatics, 2016. [6] M. Ibrahim et al, “Imbalanced Toxic Comments Classification using Data Augmentation and Deep Learning”, 17th IEEE International Conference on Machine Learning and Applications (ICMLA), 2018. [7] W. Jitsakul et al, “Enhancing Comment Feedback Classification using Text Classifiers with Word Centrality Measures”, 2nd International Conferenceon Information Technology (INCIT), 2017. [8] M. Andriansyah et al, “Cyberbullying Comment Classification on Indonesian Selebgram Using Support Vector Machine Method”, Second International ConferenceonInformaticsandComputing(ICIC),2017. [9] J. Savigny and A. Purwarianti, “Emotion Classification on Youtube Comments using Word Embedding”, International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA), 2017. [10] T. Peng, I. Harris, and Y. Sawa, “Detecting Phishing Attacks Using Natural Language Processing and MachineLearning”,12thIEEEInternational Conference on Semantic Computing, 2018. [11] H. Shen et al, “Log Layering Based on Natural Language Processing”, International Conference on Advanced Communications Technology (ICACT), 2019. [12] K. Sintoris and K. Vergidis, “Extracting Business Process Models using Natural Language Processing (NLP) Techniques”, IEEE 19th Conference on Business Informatics, 2017. [13] P. Gupta et al, “IQS- Intelligent Querying System using Natural Language Processing”, International Conference on Electronics, Communication and Aerospace Technology, ICECA, 2017. [14] Z. Zong and C. Hong, “On Application of Natural Language Processing in Machine Translation”, 3rd International Conference on Mechanical, Control and Computer Engineering (ICMCCE), 2018. [15] P. Rani et al, “Voice Controlled Home Automation System Using Natural Language Processing (Nlp) and Internet of Things (IoT)”, Third International Conference on Science Technology Engineering & Management (ICONSTEM), 2017.