SlideShare a Scribd company logo
Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019
ISSN 1943-023X 1423
Received: 16 July 2019/Accepted: 05 August 2019
Automated News Categorization Using
Machine Learning Techniques
V. Vijeya Kaveri, Associate Professor, Department of IT, Sathyabama Institute of Science and Technology.
J. Jabez, Associate Professor, Department of IT, Sathyabama Institute of Science and Technology.
P. Jeyanthi, Associate Professor, Department of IT, Sathyabama Institute of Science and Technology.
Abstract--- As time goes on, digitization of text has been increasing day by day and there is a need to classify
them. Text classification has become very essential in this situation. Text analysis is used for information extraction,
information retrieval and pattern recognition. The problem of text analysis is, it takes much amount of time to
complete work as various methods and various algorithms are being used in this process. Each method being used in
the classification has different accuracy rate. The idea of the project is to compare different machine learning
algorithms and to find the best possible solution for the highest accuracy. To evaluate the model efficacy, News
Articles dataset from kaggle has been used. Then, the input can be given as different News Articles and these News
Articles are filtered into different categories according to their genre i.e., sports news, business news, health news,
world news etc. The machine learning algorithms can be Naive Bayes’ classifier, Support Vector Machine and Neural
Networks which are being compared to get the accurate result. The above comparison can be represented using
graphs. This will help in searching the News, category wise.
Index Terms--- News Articles, Classification, Naive Bayes’ Classifier, Support Vector Machine, Neural Networks.
I. Introduction
In text classification, various statistical and machine learning techniques are carried out in order to automatically
assign one among the predefined labels to a given detail of the unlabeled document. For example if D is a set of
documents and d is a document within the entire set and C = {c1, c2, c3, …,cn } is the set of all of the categories, in the
text classification process one of the category ci will be assigned to the given document d.
There is a huge amount of data growing nowadays on the internet which is in the form of verity of news from
different sources of the world. In everyday scenarios, the ability to automatically classify documents into a fixed set of
categories is highly desirable; newspaper articles can be classified as 'features', 'sports' or 'news'. Other scenarios
involve classifying of documents as they are created. Natural language processing offers powerful techniques for
automatically classifying documents. These techniques are predicated on the hypothesis that documents in different
categories distinguish themselves by features of the natural language contained in each document. Salient features for
document classification may include word structure, word frequency, and natural language structure in each
document.
Text analysis involves information retrieval, lexical analysis to study word frequency distributions, pattern
recognition, annotation, information extraction, data mining techniques including the link and association analysis,
visualization, and predictive analytics.
The goal is, essentially, to turn text into data for analysis, via the application of natural language processing (NLP)
and analytical.
II. Literature Review
In this research, Vignesh Rao and Jayant Sachdev implemented machine learning techniques to classify news
articles belonging to a particular location. The location can be a city, state, country, etc. The news articles from various
websites like, Indian Express, Hindustan Times, Times of India, etc. are extracted to form their dataset. The
underlying structure of the Web Page is the HTML language.[1]
In 2018, there was an article published by M. Thangaraj and M. Sivakami on Text classification techniques. This
article is a literature review of various studies related to text classification approaches. Statistical topic modeling is
applied for multi-label document classification, where each document gets assigned to one or more classes.
Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019
ISSN 1943-023X 1424
Received: 16 July 2019/Accepted: 05 August 2019
It has been found more important to study and understand the nature of data before proceeding into mining. The
automation of text classification process is required, with the increasing amount of data and need for accuracy.[2]
In 2016, Victoria Bobicev and Moldova worked with the data collected from the forums and manually annotated
using several labels. The data and annotation were described. They obtained the data where each forum post was
considered as an annotation unit and annotated with up to three labels. Thus, the aim was to detect all of these labels
for every analyzed post.[3]
Text Classification (Spam) Using Machine Learning was done by Neetu Sharma, Gaganpreet Kaur and Ashish
Verma in the year 2014. The vector model for representation of texts has been offered here. In the elementary case, the
model assumes comparison to each document of a frequency spectrum of words. The dimension of space is reduced by
rejection of the most common words that increases thereby percent of the importance of the basic words in more
advanced models.[4]
The field of text analysis consists of many different sub-fields, that all require different approaches. There are
some problems in text and literature analysis systems that are universal to all approaches. The problem of text analysis
is, it takes much amount of time to complete a work as various methods and various algorithms are using for this
process. The methods and the analysis systems using for this process are very costly.
III.Different Machine Learning Algorithms
This section discuss about the different machine learning algorithms that are used .
A. Naive Bayes Classifier
Naive Bayes Classifier is a classification technique based on Bayes’ Theorem. It is a real time predictor and
learning classier which is used for making predictions in real time. A Naive Bayes classifier assumes that the presence
of a particular feature in a class is unrelated to the presence of any other feature. Naive Bayes model is easy to build
and particularly useful for very large data sets. We can predict the probability of multiple classes to target variable.
Naive Bayes classifier uses Machine learning and data mining techniques to filter unseen information and predict the
data.
B. Neural Networks
A neural network is a network of neurons which is made up of real biological neurons. It is a mathematical model
which is designed to behave like nervous system. Neural networks work in very similar manner. The model is used to
recognize complex patterns and relationships that exists within a labeled data. It takes several inputs and start
processing through multiple neurons from multiple hidden layers and returns the result using an output layer. This
result analysis process is known as Forward Propagation.
C. SVM Classifier
Support Vector Machine (SVM) is a supervised machine learning algorithm which can be used for both
classification and regression challenges. It is mostly used in classification problems. SVM are co-ordinates of
individual observation.
SVM is a frontier and the model extracts a best possible line (hyper-plane) that segregates the two classes. If
outlier occurs, SVM has a feature to ignore outliers. So, we can find the hyper-plane or line that maximize to classes.
We cannot have linear hyper-plane between two classes.
IV.Experimentation and Results
The goal of this paper is to find the best among the various system mastering algorithms mentioned in previous
segment for classifying news articles. With the help of WEKA 3.8[8] tool we have built different classifiers for these
algorithms.
These algorithms are directly applied to the dataset. WEKA tool provides the built in support for data
pre-processing, classification, regression, clustering, association rules, and visualization. New machine learning
schemes are also supported in WEKA. We have tested our dataset by applying 5-cross fold validation in order to
check the outcomes of each classifier for unknown instances. The effects of entire experimentation are proven in
Table 1. While Figure 1, suggests the pictorial illustration of efficiency of those algorithms for our dataset.
Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019
ISSN 1943-023X 1425
Received: 16 July 2019/Accepted: 05 August 2019
Table 1: Comparison of Different Machine Learning Algorithms
Algorithm Efficiency (%) Time Taken (Milliseconds) Mean Absolute Error
Naive Bayes 67.35 175 0.34
Neural Networks 71.67 327 0.37
Support Vector Machine 75.84 243 0.28
Figure 1: Efficiency of different Machine learning Algorithms
Figure 2: Execution Time of different Machine learning Algorithms
Figure 3: Mean Absolute Error of different Machine learning Algorithms
Figure1, Figure 2 & Figure 3 shows that Support Vector Machine algorithm performs better in terms of efficiency,
execution time and MAE compared to other two algorithms.
Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019
ISSN 1943-023X 1426
Received: 16 July 2019/Accepted: 05 August 2019
V. Conclusion
In this paper, we have compared the efficiency, execution time and MAE of 3 Machine Learning algorithms Navie
Bayes, SVM and Neural network for classifying news articles.
VI.Acknowledgment
This work has been carried out at DST-FIST sponsored Wireless Sensor Network and IOT Lab (order Sanction
No.: SR/FST/ETI-413/2018Dated: 08th February, 2018), Department of Computer Science and Engineering,
Sathyabama Institute of Science and Technology.
References
[1] B. Pendharkar, P. Ambekar, P. Godbole, S. Joshi, and S Abhyankar, ”Topic categorization of rss news feeds,”
Group vol. 4, p. 1, 2007.
[2] D. Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W. Ma, Web-page classification through
summarization, In Proceedings of the 27th annual international ACM SIGIR conference on Research and
development in information retrieval. ACM, 2004, pp. 242249.
[3] Leo Breiman, Random forests, Machine Learning. vol. 45, no. 1, pp.532, 2001.
[4] Graves, A. r. Mohamed, and G. Hinton. Speech recognition with deep recurrent neural networks. In 2013 IEEE
International Conference on Acoustics, Speech and Signal Processing, pages 6645–6649, May 2013.
[5] Moro, R. Navigli, WiSeNet: Building a Wikipedia-based semantic network with ontologies relations, In:
Proceedings of the 21st ACM Conference on Information and Knowledge Management, Maui, Hawaii, 2012.
[6] Pendharkar, P. Ambekar,P. Godbole, S. Joshi, and S. Abhyankar, ”Topic categorization of rss news feeds,”
Group vol. 4, p. 1, 2007.
[7] Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. Empirical evaluation of rectified activations in convolutional
network. CoRR, abs/1505.00853, 2015.
[8] Ee and P. Lim, Automated online news classification with person-alization.
[9] Rijsbergen, Information Retrieval, 2nd ed. London: Butter worths, 1979.
[10] Chunting Zhou, Chonglin Sun, Zhiyuan Liu, and Francis C. M. Lau. A C-LSTM neural network for text
classification. CoRR, abs/1511.08630, 2015.
[11] Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W.Ma, Web-page classification through summarization.
In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in
information retrieval. ACM, 2004, pp. 242249.
[12] H. a. K. S. Yu, “SVM tutorial: Classification, regression, and ranking,”
[13] R.Palaniappan, K. Sunderaj, S. Sundaraj, “A comparative study of the svm and k-nn machine learning algorithms
for the diagnosis of respiratory pathologies using pulmonary acoustic signals”, BMC io informatics, 15.1, pp.
1-8, 2014.
[14] H. Temurtas, N. Yumusak, and E. Temurtas, "A comparative study on diabetes disease diagnosis using neural
networks." Expert Systems with applications Vol. 36 No. 4, pp. 8610-8615, 2009.

More Related Content

What's hot

Machine Learning Tutorial for Beginners
Machine Learning Tutorial for BeginnersMachine Learning Tutorial for Beginners
Machine Learning Tutorial for Beginners
grinu
 
76 s201914
76 s20191476 s201914
76 s201914
IJRAT
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.docbutest
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
ijtsrd
 
IRJET- Personality Recognition using Multi-Label Classification
IRJET- Personality Recognition using Multi-Label ClassificationIRJET- Personality Recognition using Multi-Label Classification
IRJET- Personality Recognition using Multi-Label Classification
IRJET Journal
 
03 fauzi indonesian 9456 11nov17 edit septian
03 fauzi indonesian 9456 11nov17 edit septian03 fauzi indonesian 9456 11nov17 edit septian
03 fauzi indonesian 9456 11nov17 edit septian
IAESIJEECS
 
Knowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis SystemKnowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis System
ijtsrd
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithms
IJDKP
 
IJET-V2I6P32
IJET-V2I6P32IJET-V2I6P32
Evaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classificationEvaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classification
eSAT Journals
 
A novel ensemble modeling for intrusion detection system
A novel ensemble modeling for intrusion detection system A novel ensemble modeling for intrusion detection system
A novel ensemble modeling for intrusion detection system
IJECEIAES
 
Digital image hiding algorithm for secret communication
Digital image hiding algorithm for secret communicationDigital image hiding algorithm for secret communication
Digital image hiding algorithm for secret communication
eSAT Journals
 
Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for file
eSAT Publishing House
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Rahul Jain
 
11.software modules clustering an effective approach for reusability
11.software modules clustering an effective approach for  reusability11.software modules clustering an effective approach for  reusability
11.software modules clustering an effective approach for reusabilityAlexander Decker
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Test PDF
Test PDFTest PDF
Test PDFAlgnuD
 
IJET-V4I2P98
IJET-V4I2P98IJET-V4I2P98

What's hot (20)

Machine Learning Tutorial for Beginners
Machine Learning Tutorial for BeginnersMachine Learning Tutorial for Beginners
Machine Learning Tutorial for Beginners
 
76 s201914
76 s20191476 s201914
76 s201914
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.doc
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 
IRJET- Personality Recognition using Multi-Label Classification
IRJET- Personality Recognition using Multi-Label ClassificationIRJET- Personality Recognition using Multi-Label Classification
IRJET- Personality Recognition using Multi-Label Classification
 
03 fauzi indonesian 9456 11nov17 edit septian
03 fauzi indonesian 9456 11nov17 edit septian03 fauzi indonesian 9456 11nov17 edit septian
03 fauzi indonesian 9456 11nov17 edit septian
 
Knowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis SystemKnowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis System
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithms
 
IJET-V2I6P32
IJET-V2I6P32IJET-V2I6P32
IJET-V2I6P32
 
Evaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classificationEvaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classification
 
Poster
PosterPoster
Poster
 
A novel ensemble modeling for intrusion detection system
A novel ensemble modeling for intrusion detection system A novel ensemble modeling for intrusion detection system
A novel ensemble modeling for intrusion detection system
 
Digital image hiding algorithm for secret communication
Digital image hiding algorithm for secret communicationDigital image hiding algorithm for secret communication
Digital image hiding algorithm for secret communication
 
2-IJCSE-00536
2-IJCSE-005362-IJCSE-00536
2-IJCSE-00536
 
Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for file
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
11.software modules clustering an effective approach for reusability
11.software modules clustering an effective approach for  reusability11.software modules clustering an effective approach for  reusability
11.software modules clustering an effective approach for reusability
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Test PDF
Test PDFTest PDF
Test PDF
 
IJET-V4I2P98
IJET-V4I2P98IJET-V4I2P98
IJET-V4I2P98
 

Similar to Automated News Categorization Using Machine Learning Techniques

MACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXMACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOX
mlaij
 
Machine learning for text document classification-efficient classification ap...
Machine learning for text document classification-efficient classification ap...Machine learning for text document classification-efficient classification ap...
Machine learning for text document classification-efficient classification ap...
IAESIJAI
 
A scenario based approach for dealing with
A scenario based approach for dealing withA scenario based approach for dealing with
A scenario based approach for dealing with
ijcsa
 
76201910
7620191076201910
76201910
IJRAT
 
Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...
ijtsrd
 
Analysis Levels And Techniques A Survey
Analysis Levels And Techniques   A SurveyAnalysis Levels And Techniques   A Survey
Analysis Levels And Techniques A Survey
Liz Adams
 
Bx044461467
Bx044461467Bx044461467
Bx044461467
IJERA Editor
 
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET Journal
 
A Survey on Machine Learning Algorithms
A Survey on Machine Learning AlgorithmsA Survey on Machine Learning Algorithms
A Survey on Machine Learning Algorithms
AM Publications
 
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET Journal
 
Text classification supervised algorithms with term frequency inverse documen...
Text classification supervised algorithms with term frequency inverse documen...Text classification supervised algorithms with term frequency inverse documen...
Text classification supervised algorithms with term frequency inverse documen...
IJECEIAES
 
The sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionThe sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regression
EditorIJAERD
 
IRJET- Survey for Amazon Fine Food Reviews
IRJET- Survey for Amazon Fine Food ReviewsIRJET- Survey for Amazon Fine Food Reviews
IRJET- Survey for Amazon Fine Food Reviews
IRJET Journal
 
A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...
Editor Jacotech
 
Applying Soft Computing Techniques in Information Retrieval
Applying Soft Computing Techniques in Information RetrievalApplying Soft Computing Techniques in Information Retrieval
Applying Soft Computing Techniques in Information Retrieval
IJAEMSJORNAL
 
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI), International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions www.ijeijournal.com
 
View the Microsoft Word document.doc
View the Microsoft Word document.docView the Microsoft Word document.doc
View the Microsoft Word document.docbutest
 
View the Microsoft Word document.doc
View the Microsoft Word document.docView the Microsoft Word document.doc
View the Microsoft Word document.docbutest
 
View the Microsoft Word document.doc
View the Microsoft Word document.docView the Microsoft Word document.doc
View the Microsoft Word document.docbutest
 

Similar to Automated News Categorization Using Machine Learning Techniques (20)

MACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXMACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOX
 
Machine learning for text document classification-efficient classification ap...
Machine learning for text document classification-efficient classification ap...Machine learning for text document classification-efficient classification ap...
Machine learning for text document classification-efficient classification ap...
 
A scenario based approach for dealing with
A scenario based approach for dealing withA scenario based approach for dealing with
A scenario based approach for dealing with
 
76201910
7620191076201910
76201910
 
Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...
 
Analysis Levels And Techniques A Survey
Analysis Levels And Techniques   A SurveyAnalysis Levels And Techniques   A Survey
Analysis Levels And Techniques A Survey
 
Bx044461467
Bx044461467Bx044461467
Bx044461467
 
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
 
A Survey on Machine Learning Algorithms
A Survey on Machine Learning AlgorithmsA Survey on Machine Learning Algorithms
A Survey on Machine Learning Algorithms
 
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
 
Text classification supervised algorithms with term frequency inverse documen...
Text classification supervised algorithms with term frequency inverse documen...Text classification supervised algorithms with term frequency inverse documen...
Text classification supervised algorithms with term frequency inverse documen...
 
T0 numtq0n tk=
T0 numtq0n tk=T0 numtq0n tk=
T0 numtq0n tk=
 
The sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionThe sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regression
 
IRJET- Survey for Amazon Fine Food Reviews
IRJET- Survey for Amazon Fine Food ReviewsIRJET- Survey for Amazon Fine Food Reviews
IRJET- Survey for Amazon Fine Food Reviews
 
A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...
 
Applying Soft Computing Techniques in Information Retrieval
Applying Soft Computing Techniques in Information RetrievalApplying Soft Computing Techniques in Information Retrieval
Applying Soft Computing Techniques in Information Retrieval
 
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI), International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI),
 
View the Microsoft Word document.doc
View the Microsoft Word document.docView the Microsoft Word document.doc
View the Microsoft Word document.doc
 
View the Microsoft Word document.doc
View the Microsoft Word document.docView the Microsoft Word document.doc
View the Microsoft Word document.doc
 
View the Microsoft Word document.doc
View the Microsoft Word document.docView the Microsoft Word document.doc
View the Microsoft Word document.doc
 

More from Drjabez

The power of_deep_learning_models_applications
The power of_deep_learning_models_applicationsThe power of_deep_learning_models_applications
The power of_deep_learning_models_applications
Drjabez
 
Survey of various methods used for integrating machine learning into brain tu...
Survey of various methods used for integrating machine learning into brain tu...Survey of various methods used for integrating machine learning into brain tu...
Survey of various methods used for integrating machine learning into brain tu...
Drjabez
 
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Drjabez
 
A Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network Datasets
A Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network DatasetsA Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network Datasets
A Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network Datasets
Drjabez
 
Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...
Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...
Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...
Drjabez
 
Anomaly detection by using CFS subset and neural network with WEKA tools
Anomaly detection by using CFS subset and neural network with WEKA tools Anomaly detection by using CFS subset and neural network with WEKA tools
Anomaly detection by using CFS subset and neural network with WEKA tools
Drjabez
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
Drjabez
 

More from Drjabez (7)

The power of_deep_learning_models_applications
The power of_deep_learning_models_applicationsThe power of_deep_learning_models_applications
The power of_deep_learning_models_applications
 
Survey of various methods used for integrating machine learning into brain tu...
Survey of various methods used for integrating machine learning into brain tu...Survey of various methods used for integrating machine learning into brain tu...
Survey of various methods used for integrating machine learning into brain tu...
 
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
 
A Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network Datasets
A Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network DatasetsA Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network Datasets
A Study on Genetic-Fuzzy Based Automatic Intrusion Detection on Network Datasets
 
Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...
Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...
Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection A...
 
Anomaly detection by using CFS subset and neural network with WEKA tools
Anomaly detection by using CFS subset and neural network with WEKA tools Anomaly detection by using CFS subset and neural network with WEKA tools
Anomaly detection by using CFS subset and neural network with WEKA tools
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
 

Recently uploaded

一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

Automated News Categorization Using Machine Learning Techniques

  • 1. Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019 ISSN 1943-023X 1423 Received: 16 July 2019/Accepted: 05 August 2019 Automated News Categorization Using Machine Learning Techniques V. Vijeya Kaveri, Associate Professor, Department of IT, Sathyabama Institute of Science and Technology. J. Jabez, Associate Professor, Department of IT, Sathyabama Institute of Science and Technology. P. Jeyanthi, Associate Professor, Department of IT, Sathyabama Institute of Science and Technology. Abstract--- As time goes on, digitization of text has been increasing day by day and there is a need to classify them. Text classification has become very essential in this situation. Text analysis is used for information extraction, information retrieval and pattern recognition. The problem of text analysis is, it takes much amount of time to complete work as various methods and various algorithms are being used in this process. Each method being used in the classification has different accuracy rate. The idea of the project is to compare different machine learning algorithms and to find the best possible solution for the highest accuracy. To evaluate the model efficacy, News Articles dataset from kaggle has been used. Then, the input can be given as different News Articles and these News Articles are filtered into different categories according to their genre i.e., sports news, business news, health news, world news etc. The machine learning algorithms can be Naive Bayes’ classifier, Support Vector Machine and Neural Networks which are being compared to get the accurate result. The above comparison can be represented using graphs. This will help in searching the News, category wise. Index Terms--- News Articles, Classification, Naive Bayes’ Classifier, Support Vector Machine, Neural Networks. I. Introduction In text classification, various statistical and machine learning techniques are carried out in order to automatically assign one among the predefined labels to a given detail of the unlabeled document. For example if D is a set of documents and d is a document within the entire set and C = {c1, c2, c3, …,cn } is the set of all of the categories, in the text classification process one of the category ci will be assigned to the given document d. There is a huge amount of data growing nowadays on the internet which is in the form of verity of news from different sources of the world. In everyday scenarios, the ability to automatically classify documents into a fixed set of categories is highly desirable; newspaper articles can be classified as 'features', 'sports' or 'news'. Other scenarios involve classifying of documents as they are created. Natural language processing offers powerful techniques for automatically classifying documents. These techniques are predicated on the hypothesis that documents in different categories distinguish themselves by features of the natural language contained in each document. Salient features for document classification may include word structure, word frequency, and natural language structure in each document. Text analysis involves information retrieval, lexical analysis to study word frequency distributions, pattern recognition, annotation, information extraction, data mining techniques including the link and association analysis, visualization, and predictive analytics. The goal is, essentially, to turn text into data for analysis, via the application of natural language processing (NLP) and analytical. II. Literature Review In this research, Vignesh Rao and Jayant Sachdev implemented machine learning techniques to classify news articles belonging to a particular location. The location can be a city, state, country, etc. The news articles from various websites like, Indian Express, Hindustan Times, Times of India, etc. are extracted to form their dataset. The underlying structure of the Web Page is the HTML language.[1] In 2018, there was an article published by M. Thangaraj and M. Sivakami on Text classification techniques. This article is a literature review of various studies related to text classification approaches. Statistical topic modeling is applied for multi-label document classification, where each document gets assigned to one or more classes.
  • 2. Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019 ISSN 1943-023X 1424 Received: 16 July 2019/Accepted: 05 August 2019 It has been found more important to study and understand the nature of data before proceeding into mining. The automation of text classification process is required, with the increasing amount of data and need for accuracy.[2] In 2016, Victoria Bobicev and Moldova worked with the data collected from the forums and manually annotated using several labels. The data and annotation were described. They obtained the data where each forum post was considered as an annotation unit and annotated with up to three labels. Thus, the aim was to detect all of these labels for every analyzed post.[3] Text Classification (Spam) Using Machine Learning was done by Neetu Sharma, Gaganpreet Kaur and Ashish Verma in the year 2014. The vector model for representation of texts has been offered here. In the elementary case, the model assumes comparison to each document of a frequency spectrum of words. The dimension of space is reduced by rejection of the most common words that increases thereby percent of the importance of the basic words in more advanced models.[4] The field of text analysis consists of many different sub-fields, that all require different approaches. There are some problems in text and literature analysis systems that are universal to all approaches. The problem of text analysis is, it takes much amount of time to complete a work as various methods and various algorithms are using for this process. The methods and the analysis systems using for this process are very costly. III.Different Machine Learning Algorithms This section discuss about the different machine learning algorithms that are used . A. Naive Bayes Classifier Naive Bayes Classifier is a classification technique based on Bayes’ Theorem. It is a real time predictor and learning classier which is used for making predictions in real time. A Naive Bayes classifier assumes that the presence of a particular feature in a class is unrelated to the presence of any other feature. Naive Bayes model is easy to build and particularly useful for very large data sets. We can predict the probability of multiple classes to target variable. Naive Bayes classifier uses Machine learning and data mining techniques to filter unseen information and predict the data. B. Neural Networks A neural network is a network of neurons which is made up of real biological neurons. It is a mathematical model which is designed to behave like nervous system. Neural networks work in very similar manner. The model is used to recognize complex patterns and relationships that exists within a labeled data. It takes several inputs and start processing through multiple neurons from multiple hidden layers and returns the result using an output layer. This result analysis process is known as Forward Propagation. C. SVM Classifier Support Vector Machine (SVM) is a supervised machine learning algorithm which can be used for both classification and regression challenges. It is mostly used in classification problems. SVM are co-ordinates of individual observation. SVM is a frontier and the model extracts a best possible line (hyper-plane) that segregates the two classes. If outlier occurs, SVM has a feature to ignore outliers. So, we can find the hyper-plane or line that maximize to classes. We cannot have linear hyper-plane between two classes. IV.Experimentation and Results The goal of this paper is to find the best among the various system mastering algorithms mentioned in previous segment for classifying news articles. With the help of WEKA 3.8[8] tool we have built different classifiers for these algorithms. These algorithms are directly applied to the dataset. WEKA tool provides the built in support for data pre-processing, classification, regression, clustering, association rules, and visualization. New machine learning schemes are also supported in WEKA. We have tested our dataset by applying 5-cross fold validation in order to check the outcomes of each classifier for unknown instances. The effects of entire experimentation are proven in Table 1. While Figure 1, suggests the pictorial illustration of efficiency of those algorithms for our dataset.
  • 3. Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019 ISSN 1943-023X 1425 Received: 16 July 2019/Accepted: 05 August 2019 Table 1: Comparison of Different Machine Learning Algorithms Algorithm Efficiency (%) Time Taken (Milliseconds) Mean Absolute Error Naive Bayes 67.35 175 0.34 Neural Networks 71.67 327 0.37 Support Vector Machine 75.84 243 0.28 Figure 1: Efficiency of different Machine learning Algorithms Figure 2: Execution Time of different Machine learning Algorithms Figure 3: Mean Absolute Error of different Machine learning Algorithms Figure1, Figure 2 & Figure 3 shows that Support Vector Machine algorithm performs better in terms of efficiency, execution time and MAE compared to other two algorithms.
  • 4. Jour of Adv Research in Dynamical & Control Systems, Vol. 11, 07-Special Issue, 2019 ISSN 1943-023X 1426 Received: 16 July 2019/Accepted: 05 August 2019 V. Conclusion In this paper, we have compared the efficiency, execution time and MAE of 3 Machine Learning algorithms Navie Bayes, SVM and Neural network for classifying news articles. VI.Acknowledgment This work has been carried out at DST-FIST sponsored Wireless Sensor Network and IOT Lab (order Sanction No.: SR/FST/ETI-413/2018Dated: 08th February, 2018), Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology. References [1] B. Pendharkar, P. Ambekar, P. Godbole, S. Joshi, and S Abhyankar, ”Topic categorization of rss news feeds,” Group vol. 4, p. 1, 2007. [2] D. Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W. Ma, Web-page classification through summarization, In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2004, pp. 242249. [3] Leo Breiman, Random forests, Machine Learning. vol. 45, no. 1, pp.532, 2001. [4] Graves, A. r. Mohamed, and G. Hinton. Speech recognition with deep recurrent neural networks. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 6645–6649, May 2013. [5] Moro, R. Navigli, WiSeNet: Building a Wikipedia-based semantic network with ontologies relations, In: Proceedings of the 21st ACM Conference on Information and Knowledge Management, Maui, Hawaii, 2012. [6] Pendharkar, P. Ambekar,P. Godbole, S. Joshi, and S. Abhyankar, ”Topic categorization of rss news feeds,” Group vol. 4, p. 1, 2007. [7] Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. Empirical evaluation of rectified activations in convolutional network. CoRR, abs/1505.00853, 2015. [8] Ee and P. Lim, Automated online news classification with person-alization. [9] Rijsbergen, Information Retrieval, 2nd ed. London: Butter worths, 1979. [10] Chunting Zhou, Chonglin Sun, Zhiyuan Liu, and Francis C. M. Lau. A C-LSTM neural network for text classification. CoRR, abs/1511.08630, 2015. [11] Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W.Ma, Web-page classification through summarization. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2004, pp. 242249. [12] H. a. K. S. Yu, “SVM tutorial: Classification, regression, and ranking,” [13] R.Palaniappan, K. Sunderaj, S. Sundaraj, “A comparative study of the svm and k-nn machine learning algorithms for the diagnosis of respiratory pathologies using pulmonary acoustic signals”, BMC io informatics, 15.1, pp. 1-8, 2014. [14] H. Temurtas, N. Yumusak, and E. Temurtas, "A comparative study on diabetes disease diagnosis using neural networks." Expert Systems with applications Vol. 36 No. 4, pp. 8610-8615, 2009.