NEWS CLASSIFIER IN
REGIONAL LANGUAGE
Team
Anand Mohan - 20BCE0146
Vihith Arekatla - 20BCE2878
Amit Kumar - 20BCE0135
Abhishek Kumar - 20BCE0210
1. Abstract
Online news (e-news) is among the most widely read content in the world, and reading the news is part of
everyday life. India has many languages, but most available news classifiers target English, and classifiers
for regional languages such as Hindi, Telugu, Malayalam and Tamil are scarce. Our project trains a neural
network model to classify news articles into the following categories: Tamil Nadu, India, Cinema, Sports,
Politics and World. We use two kinds of neural network layers to achieve this: dense layers and Long
Short-Term Memory (LSTM) layers.
We use six individual binary neural networks built from LSTM layers, one for each category, and a voting
algorithm predicts the category that best fits a given news article. A simple Graphical User Interface (GUI)
takes the input from the user and returns the predicted category.
2. Problem Statement
In this project, we aim to train a simple neural network model to classify news articles
in the Tamil language into six distinct categories. We train six binary classifiers, one for
each of the six categories, and use a simple voting algorithm to get the final predicted
category. We also develop a simple graphical user interface that takes a news headline
from the user and returns the predicted category.
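The one-classifier-per-category scheme with a final vote can be sketched as follows. This is a minimal illustration, not the project's implementation: the slides do not specify the voting rule, so the sketch assumes the category whose binary classifier returns the highest score wins. The names `predict_category` and `keyword_scorer` are hypothetical, and the keyword scorers only stand in for the trained LSTM networks.

```python
from typing import Callable, Dict, Set

# The six categories named in the slides.
CATEGORIES = ["Tamil Nadu", "India", "Cinema", "Sports", "Politics", "World"]


def keyword_scorer(keywords: Set[str]) -> Callable[[str], float]:
    """Hypothetical stand-in for a trained binary classifier.

    Returns a function mapping a headline to a score in [0, 1]:
    the fraction of its words that appear in `keywords`.
    """
    def score(text: str) -> float:
        words = text.lower().split()
        if not words:
            return 0.0
        hits = sum(1 for word in words if word in keywords)
        return hits / len(words)
    return score


def predict_category(text: str,
                     classifiers: Dict[str, Callable[[str], float]]) -> str:
    """Assumed voting rule: pick the category with the highest score.

    Ties are broken by the order in which classifiers were registered.
    """
    scores = {name: clf(text) for name, clf in classifiers.items()}
    return max(scores, key=scores.get)


# Toy classifiers for two of the six categories.
toy_classifiers = {
    "Sports": keyword_scorer({"cricket", "match"}),
    "Cinema": keyword_scorer({"film", "actor"}),
}
prediction = predict_category("india wins cricket match", toy_classifiers)
```

In a real system each entry of the dictionary would wrap one trained binary network, and the GUI would pass the user's headline to `predict_category`.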
3. Literature review
Research Paper: 1. News Classification and Its Techniques: A Review
Authors: Gurmeet Kaur, Karan Bajaj
Methodology and Detailed Description: The paper describes how news classification can be
carried out using different techniques and the steps involved. In news tokenisation, a large
text is divided into small tokens, and the words in the news are treated as strings.
Advantages & Research Gap: The techniques are difficult to apply to larger corpora. The
algorithms could be improved to raise the efficiency of categorisation and then tried on
bigger corpora.
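The tokenisation step described in this entry can be illustrated with a short sketch. It is an assumption-laden example rather than the method from the reviewed paper: it splits on whitespace and strips ASCII punctuation from the ends of each token, and the function name `tokenize` is hypothetical. Whitespace splitting is used instead of a `\w+` regular expression because Tamil vowel signs are combining marks that `\w` may not match, which would break words apart.

```python
import string
from typing import List


def tokenize(text: str) -> List[str]:
    """Split text into word tokens.

    Splits on whitespace, strips leading and trailing ASCII
    punctuation from each piece, and drops empty results.
    """
    tokens = []
    for piece in text.split():
        token = piece.strip(string.punctuation)
        if token:
            tokens.append(token)
    return tokens


tokens = tokenize("India wins the match, again!")
```

A production pipeline would usually add language-specific steps such as stop-word removal, which this sketch leaves out.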
Research Paper: 2. Sentiment analysis of tweets in three Indian languages. In Proceedings
of the 6th Workshop on South and Southeast Asian Natural Language Processing
Authors: Phani, S., Lahiri, S., and Biswas
Methodology and Detailed Description: The paper covers sentiment analysis of tweets in
three Indian regional languages, namely Tamil, Hindi and Bengali, using the SAIL dataset
released in 2015.
Advantages & Research Gap: Not all classifiers could be used. The authors limited their
system to the multinomial Naive Bayes classifier in WEKA, because their experiments with
other classifiers gave poorer performance on the SAIL dataset.
Research Paper: 3. Graph Convolutional Network for Swahili News Classification
Authors: Alexandros Kastanos, Tyler Martin
Methodology and Detailed Description: Experiments are carried out in a sparsely labelled,
semi-supervised setting, which is representative of the practical constraints facing
low-resourced African languages.
Advantages & Research Gap: Alternative graph structures could be used instead. The authors
could also consider implementing inductive GNN methods for text.
Research Paper: 4. Categorization of Tamil News Articles using Pre-Trained Word2Vec
Embeddings with Convolutional Neural Network
Authors: Mr. Ramraj S, Arthi R
Methodology and Detailed Description: The convolutional neural network is designed with
three convolutions followed by a merge layer. The input to the convolutions is fed from the
embedding layer. Three types of convolution (3×3, 4×4 and 5×5) are used, through which the
features are formalized.
Advantages & Research Gap: The precision, recall and F1 score for the politics class are
low compared with the other two classes. This may be because more new tokens occur in the
politics test data than in the cinema and sports test data. In future, the same methodology
could be applied to other social media data as was done for news web data. The sentiment of
the data could also be analysed after topic categorization.
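The per-class precision, recall and F1 score discussed in this entry can be computed as in the sketch below. The labels and predictions are made-up toy data, not results from the paper, and the function name `per_class_metrics` is hypothetical.

```python
from typing import Sequence, Tuple


def per_class_metrics(y_true: Sequence[str],
                      y_pred: Sequence[str],
                      label: str) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for one class label.

    precision = TP / (TP + FP), recall = TP / (TP + FN),
    F1 = harmonic mean of precision and recall.
    Each value falls back to 0.0 when its denominator is zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Toy data for illustration only.
y_true = ["politics", "politics", "cinema", "sports"]
y_pred = ["politics", "cinema", "cinema", "politics"]
politics_scores = per_class_metrics(y_true, y_pred, "politics")
```

Computing the three values per class, rather than a single overall accuracy, is what makes a weak class such as politics visible.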
Research Paper: 5. A Deep Learning Approach for URL based Health Information Search
Authors: R. Rajalakshmi and S. Ramraj
Methodology and Detailed Description: A URL-based design is suggested to ease the task of
health information search. Content-based methods are not suitable, as they are
time-consuming and do not reflect dynamic changes on the web.
Advantages & Research Gap: This issue is eliminated by combining the outputs of two
individual CNN models. To examine the effectiveness of the proposed ensemble approach,
5-fold cross-validation was performed.
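The 5-fold cross-validation used to evaluate the ensemble in this entry can be sketched as an index split. This is a generic illustration without shuffling, not the paper's evaluation code, and the function name `k_fold_indices` is hypothetical.

```python
from typing import Iterator, List, Tuple


def k_fold_indices(n_samples: int,
                   k: int = 5) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_indices, test_indices) for each of k folds.

    Samples are split into k contiguous folds whose sizes differ by
    at most one. Each fold serves as the test set exactly once while
    the remaining folds form the training set.
    """
    indices = list(range(n_samples))
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


folds = list(k_fold_indices(10, k=5))
```

The model would be trained and scored once per fold, and the five scores averaged to estimate how well the approach generalises.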
Research Paper: 6. News Text Classification Method and Simulation Based on the Hybrid Deep
Learning Model
Authors: Ningfeng Sun and Chengye Du
Methodology and Detailed Description: The simulation based on the hybrid deep learning
model consists of four parts: news text preprocessing, word-vector-based news text
representation, news text feature extraction and classification, and evaluation of the
classification results.
Advantages & Research Gap: The paper examines how changes in the dropout parameter
influence the accuracy of news text classification, which could be improved by adopting
different methodologies.
Research Paper: 7. A systematic review of text classification research based on deep
learning models in Arabic language
Authors: Ahlam Wahdan, Sendeyah Hantoobi, Said A. Salloum, Khaled Shaalan
Methodology and Detailed Description: The paper discusses deep learning techniques for
classification and their types. Neural networks of various types, namely RNN, CNN, FFNN
and LSTM, are identified as the subject of study.
Advantages & Research Gap: The researchers did not indicate in detail the parameters used
in these networks or how they were tuned. Machine learning algorithms are usually tuned by
changing parameters and re-running the experiments to get significant results.
Research Paper: 8. Analyzing sentiment in Indian languages micro text using recurrent
neural networks. IIOAB Journal: A Journal of Multidisciplinary Science and Technology
Authors: S. Seshadri, A. K. Madasamy
Methodology and Detailed Description: The tweets are classified into three polarity
categories, namely positive, negative and neutral. Twitter data in three languages, namely
Tamil, Hindi and Bengali, was provided by the SAIL 2015 task organizers, as the authors
participated in the contest.
Advantages & Research Gap: The research is limited to only three languages. Even though
the accuracy is high, more languages could be added in future.
