International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 07 | July 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2941
A review of Fake News Detection Methods
Sandhya S1, Arya J Nair2, Sreeja Kumari S3
1,2,3 Lecturer, Dept. of Computer Engineering, NSS Polytechnic College, Kerala, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – Fake news is false information, disseminated to
grab the audience's attention, that may seem credible at first
glance. It is frequently produced by people with personal,
political, or economic objectives in mind, and it is spread
online or through conventional media. Fake news can reach
anywhere from dozens to millions of people via social media,
even when it is labeled as fake. All efforts should therefore
be made to lessen the impact of misinformation on citizens
and to eliminate its sources. This paper examines and
evaluates several methods for identifying fake news.
Key Words: Convolutional Neural Network (CNN), Hybrid
Feature Learning Unit (HFLU), K-Nearest Neighbors (KNN),
Support Vector Machine (SVM), Naïve Bayes, Random
Forest, Regression Tree, Bidirectional Encoder
Representations from Transformers (BERT), Gated
Diffusive Unit (GDU), Long Short-Term Memory (LSTM)
1. INTRODUCTION
As the term implies, fake news is defined as "news items
that are deliberately and irrefutably false" [1] and that are
intended to deceive people about facts and assertions.
Spreading fake news has proven to be simple, particularly on
social networks. Fake news is typically created to influence
people's opinions, further a political agenda, or create
misunderstandings about a significant topic [2]. As a result,
elections may be influenced, ethnic conflict may increase,
and criticism may be stifled.
Fake news takes many forms, including targeted
misinformation, clickbait, fake headlines, fake viral posts,
and hoaxes. Targeted misinformation is untrue information
spread for personal gain [3]. Clickbait is most commonly
defined as false information created to grab readers'
attention and persuade them to click on a link to a particular
website. Fake headlines are headlines that fabricate content
to attract readers' attention; they are frequently used by
media outlets and newspapers with lower credibility. Hoaxes
are another kind of false information that purposefully
misleads readers, harming them and causing them financial
loss.
Various tactics to combat different kinds of false news are
now being actively investigated.
This paper reviews various methods of fake news
detection. The methods are presented in Section 3,
Section 4 compares them, and Section 5 concludes the
study.
2. LITERATURE REVIEW
A novel diffusive network-based fake news credibility
checker was introduced in [4]. The system uses a hybrid
feature learning unit (HFLU), which is trained to learn both
explicit and latent feature representations of news. The work
proposes a deep diffusive network model with a gated
diffusive unit to handle the diverse information in social
networks.
A content-based fake news classification system using a
voting ensemble classifier was proposed in [5]. The voting
classifiers used in this system are Logistic Regression,
Regression Tree, ensemble AdaBoost, Naïve Bayes, Fuzzy
Rule, KNN, Decision Tree, and PNN. The model's
classification accuracy for fake news detection was as high
as 95%.
The soft voting classifier in [6] combines four machine
learning techniques, including Naïve Bayes, Support Vector
Machine (SVM), and Logistic Regression, to classify news
items as fake or authentic. This approach achieves an
accuracy of 93% for fake news classification.
The classification of fake and real news using transfer
learning on the Bidirectional Encoder Representations from
Transformers (BERT) language model was carried out in [7].
An accuracy of 97.021% was achieved with this fine-tuned
BERT model on the NewsFN dataset. A hybrid classifier-based
fake news detection system is compared with an SVM-based
fake news classification system in [8]. The hybrid
classifier-based system achieves more than 90% accuracy and
the SVM-based system more than 84% accuracy on the fake and
real news classification task.
A deep learning network named BerConvoNet has been used
to classify fake and real news [11]. BerConvoNet combines a
CNN with BERT embeddings. In [13], a CNN and Long
Short-Term Memory (LSTM) were used to classify fake news
items in a multimodal approach. Here, the news was
categorized according to its source and its history. This
model classifies credible news items with a training
accuracy of 99.7% and a validation accuracy of 97.5%.
3. METHODOLOGY
3.1 Diffusive network-based system
The deep diffusive network model of FAKEDETECTOR
consists of two components: representation feature learning
and credibility label inference [4]. The model is used to
capture the relationships between news articles, authors, and
topics. Here, the Gated Diffusive Unit (GDU) accepts multiple
inputs from different sources simultaneously and passes its
learned hidden state h_i to the output layer and to other
units. The network model is trained with the backpropagation
algorithm.
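The idea of a unit that gates several inputs and emits one hidden state can be illustrated with a much-simplified sketch. This is not the GDU of [4]: the function name `gated_fusion`, the fixed gate logits, and the toy input vectors are all assumptions made for illustration. In a trained network the gate values would be computed from learned weights rather than passed in by hand.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(inputs, gate_logits):
    """Blend equally sized vectors; each one is scaled by a sigmoid gate in (0, 1).

    Hypothetical stand-in for a gated unit, not the GDU defined in [4].
    """
    gates = [sigmoid(g) for g in gate_logits]
    size = len(inputs[0])
    hidden = [0.0] * size
    for vector, gate in zip(inputs, gates):
        for k in range(size):
            hidden[k] += gate * vector[k]
    # tanh squashes the blended state into (-1, 1)
    return [math.tanh(value) for value in hidden]


# Toy inputs standing in for article features, an author state, and a topic state.
article, author, topic = [1.0, 0.0], [0.0, 1.0], [0.5, 0.5]
h_i = gated_fusion([article, author, topic], gate_logits=[2.0, 0.0, -2.0])
```

A large positive gate logit lets an input pass almost unchanged, while a large negative one nearly suppresses it, which is how such a unit can weight sources of different reliability.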
3.2 Modified Voting Ensemble-based system
The input module consists of the enhanced dataset
generated during dataset preparation. This dataset, which
contains articles, their authors, and other details, is fed to
the voting classifiers. The voting classifiers used in this
system are Logistic Regression, Regression Tree, ensemble
AdaBoost, Naïve Bayes, Fuzzy Rule, KNN, Decision Tree, and
PNN. The dataset is partitioned into training and testing
sets. The training data stream is fed to the learner nodes to
build the prediction models. The outputs from each scoring
node are collected by the "voting module" and sent to a
separate stream.
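The role of the voting module can be sketched as follows, assuming simple majority voting over the labels produced by each classifier. The modified scheme in [5] may weight or combine the votes differently; the function names and the sample labels below are hypothetical.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by the most classifiers for one article."""
    counts = Counter(predictions)
    # most_common sorts by count; equal counts keep first-seen order
    label, _ = counts.most_common(1)[0]
    return label


def vote_all(per_classifier_outputs):
    # per_classifier_outputs[c][i] is classifier c's label for article i
    return [majority_vote(column) for column in zip(*per_classifier_outputs)]


outputs = [
    ["fake", "real", "fake"],  # e.g. a logistic regression model
    ["fake", "fake", "real"],  # e.g. a decision tree
    ["real", "real", "fake"],  # e.g. a KNN model
]
final_labels = vote_all(outputs)
```

Each column of `outputs` holds the votes for one article, so the ensemble decision for an article depends only on how many classifiers agree, not on how confident each one is.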
3.3 Soft Voting Classifier based system
The dataset for fake news detection is collected from Kaggle
and then preprocessed. The preprocessing steps are label
encoding, tokenization, stop word removal, and stemming [6].
Label encoding marks real news as 1 and fake news as 0.
During tokenization, the text is split into tokens and
punctuation is discarded. Stop word removal drops
insignificant words from the text, which makes the machine
learning techniques more accurate and efficient. After
preprocessing, important features are extracted from the
text using a count vectorizer. GridSearchCV hyperparameter
tuning is performed to choose an appropriate set of features
for the learning model. The data is then fed to four
classifiers for the classification task. An ensemble approach
aggregates the estimates of two or more classifiers to
produce a model that makes more accurate predictions. The
voting ensemble classifier then predicts the output class
with the highest probability.
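The preprocessing steps and the soft voting rule described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the pipeline of [6]: the stop word list is a tiny placeholder, `crude_stem` is a stand-in for a real stemmer, lowercasing is added for convenience, and the probability values are invented. Soft voting is taken here to mean averaging the class probabilities of the classifiers and picking the class with the highest average.

```python
import re
from collections import Counter

STOP_WORDS = {"the", "a", "an", "is", "are", "of", "to", "in", "and"}  # illustrative only
LABELS = {"fake": 0, "real": 1}  # label encoding from the text: real = 1, fake = 0


def crude_stem(token):
    # Placeholder for a real stemmer: strips a few common suffixes only.
    for suffix in ("ing", "ed", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    # Tokenization keeps runs of letters and digits, so punctuation is dropped.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [crude_stem(t) for t in tokens if t not in STOP_WORDS]


def count_vector(tokens, vocabulary):
    # One count per vocabulary word, as a count vectorizer would produce.
    counts = Counter(tokens)
    return [counts[word] for word in vocabulary]


def soft_vote(probabilities):
    """probabilities[c] = [P(fake), P(real)] from classifier c."""
    n = len(probabilities)
    average = [sum(p[k] for p in probabilities) / n for k in range(2)]
    label = max(range(2), key=lambda k: average[k])
    return label, average


tokens = preprocess("The senators voted, and the bill passed!")
label, average = soft_vote([[0.9, 0.1], [0.4, 0.6], [0.3, 0.7]])
```

In the sample call, two of the three classifiers lean towards class 1 (real), but the averaged probability favors class 0 (fake) because the first classifier is far more confident. This is the practical difference between soft voting and a plain majority vote.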
3.4 Deep Bidirectional Transformers-based Language Model
News items were classified into two categories using a
pre-trained BERT language model [7]. The NewsFN dataset
contains the titles and content of political news stories
from a variety of news sources, each labeled as either
"Fake" or "Real." During preprocessing, outliers were
removed from the dataset to reduce the overall variation in
the data. The dataset was then partitioned into training and
testing sets and fed to the BERT model to classify real and
fake news.
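The partitioning step and the accuracy figure reported for this kind of system can be sketched as below. The 80/20 split ratio, the fixed seed, and the sample labels are assumptions for illustration; the source does not state the ratio used in [7], and the BERT fine-tuning itself is not shown.

```python
import random


def train_test_split(samples, test_fraction=0.2, seed=0):
    """Shuffle a copy of the samples and split off the last test_fraction."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)  # seeded for repeatable splits
    n_test = int(round(len(shuffled) * test_fraction))
    cut = len(shuffled) - n_test
    return shuffled[:cut], shuffled[cut:]


def accuracy(true_labels, predicted_labels):
    # Fraction of items whose predicted label matches the true label.
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


train, test = train_test_split(range(10))
score = accuracy(["Fake", "Real", "Real", "Fake"], ["Fake", "Real", "Fake", "Fake"])
```

Accuracy is computed on the held-out test portion only, so that the reported figure reflects news items the model has not seen during training.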
3.5 Hybrid Classifier based system
This system uses a dataset from Kaggle. Features are
extracted from the dataset using the K-fold approach. A
hybrid classifier is used for the real and fake news
classification problem. The hybrid classifier combines KNN
and the Random Forest classifier. The KNN method stores all
the available data and classifies each new point according
to its similarity to the stored points [9]. Random Forest
trains many decision trees on different subsets of the input
dataset and averages their results to improve predictive
accuracy [10].
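The KNN half of the hybrid classifier can be sketched in a few lines. The two-dimensional points, the Euclidean distance, and k = 3 are assumptions for illustration; the Random Forest half and the way [8] combines the two models are not shown here.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Label a query point by majority vote among its k nearest stored points."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest = [label for _, label in distances[:k]]
    return Counter(nearest).most_common(1)[0][0]


# Toy feature vectors: three "real" items near the origin, three "fake" items far away.
points = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
labels = ["real", "real", "real", "fake", "fake", "fake"]
prediction = knn_predict(points, labels, query=(0.5, 0.5))
```

KNN has no training phase beyond storing the data, which is why every prediction requires a distance computation against all stored points.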
3.6 BerConvoNet based system
First, the words of the input text are converted into
numerical representations. Word embedding is the most
popular technique for transforming a word into its vector
form. This is done with BERT, a pretrained unsupervised
model [12]. Multiscale feature blocks (MSFB) are used to
extract different attributes of a news story. The feature map
obtained from the MSFB is then fed to a fully connected
layer [11].
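One way to picture multiscale feature extraction is to slide windows of several widths over the sequence of word vectors and pool the result for each width. The sketch below assumes that "multiscale" means parallel windows of different widths; the internals of the MSFB in [11] are not described in this review. Plain averaging stands in for learned convolution kernels, and the toy vectors stand in for BERT embeddings.

```python
def window_features(embeddings, width):
    """Average each run of `width` consecutive token vectors, then max-pool over positions."""
    dims = len(embeddings[0])
    pooled = [float("-inf")] * dims
    for start in range(len(embeddings) - width + 1):
        window = embeddings[start:start + width]
        for d in range(dims):
            value = sum(vector[d] for vector in window) / width
            pooled[d] = max(pooled[d], value)
    return pooled


def multiscale_features(embeddings, widths=(1, 2, 3)):
    # Concatenate the pooled features from every window width.
    # Assumes the sequence has at least max(widths) tokens.
    features = []
    for width in widths:
        features.extend(window_features(embeddings, width))
    return features


toy_embeddings = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0]]  # three tokens, two dimensions
feature_map = multiscale_features(toy_embeddings)
```

The concatenated vector has one block of values per window width, and a vector of this kind is what would be passed on to a fully connected layer.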
3.7 Convolutional Neural Network and Long Short-Term Memory based system
This system uses a fake news dataset from Kaggle [14].
The dataset is organized into 12 distinct groups based on the
type of news. The four-layer model for fake news
identification consists of two CNN layers, one LSTM layer,
and one fully connected (FC) layer [13]. The CNN model is
particularly effective on locally related data, such as the
words in a phrase. The LSTM model has been found to be
helpful when one set of data is connected to another, as in
sequences. After the preprocessing steps (lowercase
conversion, tokenization, and padding), feature vectors are
generated using the GloVe embedding method. These feature
vectors are fed to the CNN+LSTM model for classification.
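The preprocessing steps listed above can be sketched as follows. The pad token, the maximum length, whitespace tokenization, and the tiny `vectors` dictionary are assumptions for illustration; in practice the dictionary would be loaded from pretrained GloVe vectors, and the CNN+LSTM model itself is not shown.

```python
PAD = "<pad>"  # hypothetical padding token


def to_padded_tokens(text, max_len):
    tokens = text.lower().split()  # lowercase conversion, then whitespace tokenization
    tokens = tokens[:max_len]      # truncate inputs that are too long
    return tokens + [PAD] * (max_len - len(tokens))  # pad inputs that are too short


def embed(tokens, vectors, dims):
    zero = [0.0] * dims
    # Unknown words and padding map to the zero vector.
    return [vectors.get(token, zero) for token in tokens]


vectors = {"news": [0.1, 0.2], "breaking": [0.3, 0.4]}  # toy stand-in for GloVe
tokens = to_padded_tokens("Breaking NEWS today", max_len=5)
features = embed(tokens, vectors, dims=2)
```

Padding gives every news item the same sequence length, which is what lets the items be batched together as fixed-size inputs to the network.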
4. COMPARISON
| No | Title | Method | Dataset | Accuracy |
|----|-------|--------|---------|----------|
| 1 | FAKEDETECTOR: Effective Fake News Detection with Deep Diffusive Neural Network [4] | Diffusive Network | Tweets posted by PolitiFact at its official Twitter account and fact-check articles on the PolitiFact website | 63% |
| 2 | Content-based fake news classification through modified voting ensemble [5] | Logistic Regression, Regression Tree, ensemble AdaBoost, Naïve Bayes, Fuzzy Rule, KNN, Decision Tree, and PNN | 1. Fact checks from Google Fact Check Explorer; 2. ISOT Fake News Dataset | 95% for both the primary and external datasets and 65% when using the random dataset |
| 3 | An Ensemble Machine Learning Approach for Fake News Detection and Classification Using a Soft Voting Classifier [6] | Naïve Bayes, Support Vector Machine, and Logistic Regression; Soft Voting ensemble classifier | Publicly available dataset from Kaggle | 93% |
| 4 | Classification of Fake News by Fine-tuning Deep Bidirectional Transformers-based Language Model [7] | Bidirectional Encoder Representations from Transformers (BERT) language model | NewsFN Dataset | 97.021% |
| 5 | Hybrid Text Classification Method for Fake News Detection [8] | SVM, Hybrid classifier, Random Forest | Publicly available dataset from Kaggle | 91.5% |
| 6 | BerConvoNet: A deep learning framework for fake news classification [11] | BerConvoNet framework, BERT | 1. George McIntire Dataset; 2. Dataset from Kaggle; 3. Dataset from GossipCop; 4. Dataset from PolitiFact | 94.25%, 97.45%, 75.18%, 90.27% |
| 7 | Fake News Classification Bimodal using Convolutional Neural Network and Long Short-Term Memory [13] | CNN, LSTM | Dataset from Kaggle [14] | 97.5% |
Table 1: Comparison of fake news detection methods
Table 1 compares the fake news detection methods reviewed in this paper.
5. CONCLUSION
Spreading false information to influence people's opinions
on a variety of issues is not a new practice, but
Internet-based information sharing has made it a more
pervasive and challenging problem. Some fake news stories
may not have a significant negative impact, but others can
encourage anti-democratic views. This paper has discussed
various methods of fake news classification. Among the
methods discussed, the system based on CNN and Long
Short-Term Memory reports the highest classification
accuracy, 97.5%, in distinguishing real from fake news.
REFERENCES
[1] H. Allcott and M. Gentzkow, “Social Media and Fake
News in the 2016 Election,” Journal of Economic
Perspectives, vol. 31, no. 2, pp. 211–236, May 2017
[2] https://examples.yourdictionary.com/reference/examples/what-is-fake-news.html
[3] https://www.ionos.com/digitalguide/onlinemarketing/social-media/what-is-fake-news/
[4] Zhang, J., Dong, B., & Yu, P. S. (2020, April).
FakeDetector: Effective fake news detection with deep
diffusive neural network. In 2020 IEEE 36th
International Conference on Data Engineering (ICDE)
(pp. 1826-1829). IEEE.
[5] Ribeiro Bezerra, J. F. (2021). Content-based fake news
classification through modified voting ensemble. Journal
of Information and Telecommunication, 5(4), 499-513.
[6] Lasotte, Y. B., Garba, E. J., Malgwi, Y. M., & Buhari, M. A.
(2022). An Ensemble Machine Learning Approach for
Fake News Detection and Classification Using a Soft
Voting Classifier. European Journal of Electrical
Engineering and Computer Science, 6(2), 1-7.
[7] Aggarwal, A., Chauhan, A., Kumar, D., Mittal, M., & Verma,
S. (2020). Classification of fake news by fine-tuning deep
bidirectional transformers-based language model. EAI
Endorsed Transactions on Scalable Information
Systems, 7(27).
[8] Kaur, P., Boparai, R. S., & Singh, D. (2019). Hybrid text
classification method for fake news detection.
International Journal of Engineering and Advanced
Technology (IJEAT), 8(5), 2388-2392.
[9] https://www.javatpoint.com/k-nearest-neighbor-algorithm-for-machine-learning
[10] https://www.javatpoint.com/machine-learning-random-forest-algorithm
[11] Choudhary, M., Chouhan, S. S., Pilli, E. S., & Vipparthi, S.
K. (2021). BerConvoNet: A deep learning framework for
fake news classification. Applied Soft Computing, 110,
107614.
[12] Lee, J. S., & Hsiang, J. (2020). Patent classification by
fine-tuning BERT language model. World Patent
Information, 61, 101965.
[13] Abdullah, A., Awan, M., Shehzad, M., & Ashraf, M. (2020).
Fake news classification bimodal using convolutional
neural network and long short-term memory. Int. J.
Emerg. Technol. Learn, 11, 209-212.
[14] G. Pontes, “Fake News Data-Set,” kaggle.
[15] Jadhav, S. S., & Thepade, S. D. (2019). Fake news
identification and classification using DSSM and
improved recurrent neural network classifier. Applied
Artificial Intelligence, 33(12), 1058-1068.
[16] https://www.techtarget.com/whatis/definition/fake-news

A review of Fake News Detection Methods

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 07 | July 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2941 A review of Fake News Detection Methods Sandhya S1, Arya J Nair2, Sreeja Kumari S3 1,2,3 Lecturer, Dept. of Computer Engineering, NSS Polytechnic College, Kerala, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract – False news is the dissemination of false information to grab the audience's attention, which, at first glance, may seem credible. False news is frequently produced by people with personal, political, or economic objectives in mind. It is frequently disseminated online or through conventional media. Fake news can reach dozens and even millions of people via social media, even if it's labeled as fake. In order to lessen the impact of spreading misinformation as well as to eliminate the sources of this false information, all efforts should be made to reduce its impact on citizens. This paper examines and evaluates the many methods for identifying false news. Key Words: ConvolutionalNeuralNetwork(CNN),Hybrid feature learning unit (HFLU), K NearestNeighbors(KNN), Support Vector Machine (SVM), Naïve Bayes, Random Forest, Regression Tree, Bidirectional Encoder Representations from Transformers (BERT), GDU (Gated Diffusive Unit), Long Short Term Memory(LSTM) 1. INTRODUCTION As the term implies, fake news is defined as "news items that are deliberately and irrefutably false" [1] to deceive people about facts, and assertions. Particularly on social networking networks, spreading fake news hasprovento be simple. Fake news is typically created to influence people's opinions, further a political agenda, or create misunderstandings about a significant topic [2]. 
Because of this news, elections may be impacted, ethnic conflict may increase, and criticism may be stifled. Fake news comes in many types: targeted misinformation, clickbait, fake headlines, fake viral posts, hoaxes, and so on. An untrue piece of information spread for personal gain is known as targeted misinformation [3]. Clickbait is most often defined as false information created to grab readers' attention and persuade them to click on a link to a certain website. Fake headlines are headlines that make things up in order to attract readers' attention; they are frequently used by media and newspapers with lower credibility. Hoaxes are another sort of false information that purposefully misleads readers, harming them and depleting them financially. Various tactics to combat different kinds of false news are now being actively investigated. This paper reviews various methods of fake news recognition. The different methods are presented in Chapter 3. Chapter 4 compares these methods. Chapter 5 concludes the study.
2. LITERATURE REVIEW
A novel diffusive network-based checker of news authenticity has been introduced in [4]. This system uses a hybrid feature learning unit (HFLU), which is trained to learn both explicit and implicit feature representations of news. The research proposes a novel deep diffusive network model with a gated diffusive unit to handle diverse information in social networks.
A content-based fake news classification system has been proposed in [5] using the voting ensemble classifier method. Logistic Regression, Regression tree, ensemble AdaBoost, Naïve Bayes, Fuzzy Rule, KNN, Decision tree, and PNN are the voting classifiers utilized in this system. The model's classification accuracy for false news recognition was as high as 95%.
The soft voting classifier in [6] incorporated four machine learning techniques, including Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression, to classify news items as false or authentic. This approach gives an accuracy of 93% for fake news classification.
The classification of false and true news using transfer learning on the Bidirectional Encoder Representations from Transformers (BERT) language model was done in [7]. An accuracy of 97.021% was achieved with this fine-tuned BERT model using NewsFN data.
A Hybrid Classifier-based fake news detection system is compared with an SVM-based fake news classification system in [8]. The Hybrid Classifier-based system gives more than 90% accuracy and the SVM-based system gives more than 84% accuracy for fake and real news classification tasks.
A deep learning network named BerConvoNet has been used to classify fake and real news [11]. CNN and BERT embedding techniques are combined to create BerConvoNet.
In [13], CNN and Long Short-Term Memory (LSTM) have been used to classify false news items in a multimodal strategy. Here, the news was categorized according to its source and its history. This model categorizes credible news items with a training accuracy of 99.7% and a validation accuracy of 97.5%.
3. METHODOLOGY
3.1 Diffusive network-based system
The deep diffusive network model of FAKEDETECTOR is made up of representation feature learning and credibility label inference [4]. This model has been used to simulate the relationship between news pieces, authors, and topics. Here, the GDU (Gated Diffusive Unit) model accepts multiple inputs from different sources simultaneously and passes its learned hidden state h_i to the output layer and to other unit models. In this system, a backpropagation algorithm is employed to train the network model.
3.2 Modified Voting Ensemble-based system
The input module is composed of the enhanced dataset generated during the dataset preparation procedure. This dataset, which consists of various articles, their authors, details, etc., is fed to the voting classifiers. The voting classifiers used in this system are Logistic Regression, Regression tree, ensemble AdaBoost, Naïve Bayes, Fuzzy Rule, KNN, Decision tree, and PNN. The dataset is partitioned into a training and a testing dataset. The training data stream is fed to the learning node to build the prediction model. The outputs from each scoring node are collected by the "voting module" and sent into a different stream.
3.3 Soft Voting Classifier based system
The dataset for fake news detection is collected from Kaggle. Then the data is preprocessed. Preprocessing steps include label encoding, tokenization, removal of stop words, and stemming [6]. Label encoding marks real news as 1 and false news as 0. During tokenization, textual data is broken up into tokens, without punctuation. In the stop word removal step, insignificant words are removed from the textual data, which makes the machine learning techniques more accurate and efficient.
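The preprocessing steps listed above (label encoding, tokenization, stop word removal, stemming) can be illustrated with a minimal sketch. This is not the implementation used in [6]: the stop-word list, the lowercasing step, the `LABELS` mapping keys, and the `crude_stem` suffix stripper are all hypothetical stand-ins chosen to keep the example self-contained; a real system would more likely rely on an established stemmer and a fuller stop-word list.

```python
import re

# Hypothetical, deliberately tiny stop-word list (assumption, not from [6]).
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "of", "to", "in", "and", "that"}

# Label encoding as described in the text: real news -> 1, false news -> 0.
# The string keys "real" and "fake" are assumed names for the raw labels.
LABELS = {"real": 1, "fake": 0}


def tokenize(text):
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"[a-z0-9]+", text.lower())


def crude_stem(token):
    """Strip a few common English suffixes (a stand-in for a real stemmer)."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, remove stop words, then stem the remaining tokens."""
    return [crude_stem(t) for t in tokenize(text) if t not in STOP_WORDS]


def encode_label(label):
    """Map a textual label to the integer class used for training."""
    return LABELS[label.lower()]
```

For example, `preprocess("The senator was voting in the elections!")` yields `['senator', 'vot', 'election']`, which shows both the effect of stop word removal and how rough a suffix-stripping stemmer can be.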
After preprocessing, important features are extracted from the textual data using the count vectorizer. To choose appropriate hyperparameters for the learning model, GridSearchCV tuning has been performed. Then the data is fed to four classifiers for the classification task. An ensemble approach aggregates the estimates of two or more classifiers in order to produce a model that makes more accurate judgments. The soft voting ensemble classifier then predicts the output class with the highest aggregated probability.
3.4 Deep Bidirectional Transformers-based Language Model
News items were classified into two categories using a pre-trained BERT language model [7]. The NewsFN Dataset contains the title and content of news stories about politics from a variety of news sources, categorized as either "Fake" or "Real." During the preprocessing step, misfit records have been eliminated from the dataset to reduce overall variation in the data. Then the dataset is partitioned into a training and a testing dataset. The dataset is then fed to the BERT model to classify the real and fake news.
3.5 Hybrid Classifier based system
In this system, the dataset from Kaggle has been used. Features are extracted from the dataset using the K-fold approach. Here, a hybrid classifier has been used for the real and fake news classification problem. KNN and the Random Forest Classifier Algorithm are combined to create the Hybrid Classifier. The K-NN method stores all the available data and classifies each new point according to how similar it is to the stored points [9]. Random Forest uses many decision trees on different subsets of the input dataset and averages their results to improve predictive accuracy [10].
3.6 BerConvoNet based system
At first, words from the input text dataset are converted into numerical representations. Word embedding is the most popular technique for transforming a word into its vector form.
This is done by a pretrained unsupervised model called BERT [12]. To extract the different attributes of a news story, multiscale feature blocks (MSFB) have been used. The feature map obtained from the MSFB is then fed to a fully connected layer [11].
3.7 Convolutional Neural Network and Long Short-Term Memory based system
A fake news dataset from Kaggle [14] is used in this system. The dataset is organized into 12 distinct groups based on the type of news. The four-layer model for the false news identification system is composed of two CNN layers, one LSTM layer, and one FC layer [13]. The CNN model is particularly effective when dealing with locally related data, like words in phrases. The LSTM model has been found to be quite helpful when dealing with sequential data, where one set of data is connected to another. After the preprocessing steps (conversion of uppercase letters to lowercase, tokenization, padding), feature vectors are generated using the GloVe embedding method. These feature vectors are fed to the CNN+LSTM model for the classification task.
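The preprocessing named in Section 3.7 (lowercasing, tokenization, padding) can be sketched as follows. This is an illustrative assumption about how such a step is commonly written, not the code used in [13]: the whitespace tokenizer, the reserved ids `PAD_ID` and `UNK_ID`, and the choice to pad on the right are all hypothetical. In a full system the resulting integer ids would typically be used to look up rows of a pretrained embedding matrix such as GloVe before being passed to the CNN+LSTM model.

```python
PAD_ID = 0  # assumed id reserved for padding positions
UNK_ID = 1  # assumed id for words not seen when the vocabulary was built


def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()


def build_vocab(texts):
    """Assign an integer id to every distinct token, in order of first appearance."""
    vocab = {}
    for text in texts:
        for token in tokenize(text):
            if token not in vocab:
                vocab[token] = len(vocab) + 2  # ids 0 and 1 are reserved
    return vocab


def encode(text, vocab, max_len):
    """Convert text to a fixed-length list of ids, truncating or right-padding."""
    ids = [vocab.get(token, UNK_ID) for token in tokenize(text)][:max_len]
    return ids + [PAD_ID] * (max_len - len(ids))
```

With `vocab = build_vocab(["Breaking News today", "news update"])`, the call `encode("News today", vocab, 4)` returns `[3, 4, 0, 0]`: every sequence ends up with the same length, which is what allows news items of different sizes to be batched together.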
4. COMPARISON
No | Title | Method | Dataset | Accuracy
1 | FAKEDETECTOR: Effective Fake News Detection with Deep Diffusive Neural Network [4] | Diffusive Network | Tweets posted by PolitiFact at its official Twitter account and fact-check articles on the PolitiFact website | 63%
2 | Content-based fake news classification through modified voting ensemble [5] | Logistic Regression, Regression tree, ensemble AdaBoost, Naïve Bayes, Fuzzy Rule, KNN, Decision tree, and PNN | (1) Fact checks from Google Fact Check Explorer; (2) ISOT Fake News Dataset | 95% for both the primary and external datasets; 65% when using the random dataset
3 | An Ensemble Machine Learning Approach for Fake News Detection and Classification Using a Soft Voting Classifier [6] | Naïve Bayes, Support Vector Machine, Logistic Regression, and Soft Voting ensemble classifier | Publicly available dataset from Kaggle | 93%
4 | Classification of Fake News by Fine-tuning Deep Bidirectional Transformers-based Language Model [7] | Bidirectional Encoder Representations from Transformers (BERT) language model | NewsFN Dataset | 97.021%
5 | Hybrid Text Classification Method for Fake News Detection [8] | SVM, Hybrid classifier, Random Forest | Publicly available dataset from Kaggle | 91.5%
6 | BerConvoNet: A deep learning framework for fake news classification [11] | BerConvoNet framework, BERT | (1) George McIntire Dataset; (2) Dataset from Kaggle; (3) Dataset from GossipCop; (4) Dataset from PolitiFact | (1) 94.25%; (2) 97.45%; (3) 75.18%; (4) 90.27%
7 | Fake News Classification Bimodal using Convolutional Neural Network and Long Short-Term Memory [13] | CNN, LSTM | Dataset from Kaggle [14] | 97.5%
Table 1: Comparison
The comparison of the various fake news detection methods is shown in Table 1.
5. CONCLUSION
Spreading false information to influence people's opinions on a variety of issues is not a new practice. Internet-based knowledge transfer has made this a more pervasive and challenging issue. Some false news stories may not have a significant negative impact, but others can stimulate anti-democratic views. Various methods of fake news classification have been discussed in this paper. Of the methods discussed, fake news classification based on CNN and Long Short-Term Memory is found to be the best system, giving the highest classification accuracy, 97.5%, in detecting real and fake news.
REFERENCES
[1] Allcott, H., & Gentzkow, M. (2017). Social media and fake news in the 2016 election. Journal of Economic Perspectives, 31(2), 211-236.
[2] https://examples.yourdictionary.com/reference/examples/what-is-fake-news.html
[3] https://www.ionos.com/digitalguide/onlinemarketing/social-media/what-is-fake-news/
[4] Zhang, J., Dong, B., & Yu, P. S. (2020, April). FakeDetector: Effective fake news detection with deep diffusive neural network. In 2020 IEEE 36th International Conference on Data Engineering (ICDE) (pp. 1826-1829). IEEE.
[5] Ribeiro Bezerra, J. F. (2021). Content-based fake news classification through modified voting ensemble. Journal of Information and Telecommunication, 5(4), 499-513.
[6] Lasotte, Y. B., Garba, E. J., Malgwi, Y. M., & Buhari, M. A. (2022). An ensemble machine learning approach for fake news detection and classification using a soft voting classifier. European Journal of Electrical Engineering and Computer Science, 6(2), 1-7.
[7] Aggarwal, A., Chauhan, A., Kumar, D., Mittal, M., & Verma, S. (2020). Classification of fake news by fine-tuning deep bidirectional transformers-based language model. EAI Endorsed Transactions on Scalable Information Systems, 7(27).
[8] Kaur, P., Boparai, R. S., & Singh, D. (2019). Hybrid text classification method for fake news detection. International Journal of Engineering and Advanced Technology (IJEAT), 8(5), 2388-2392.
[9] https://www.javatpoint.com/k-nearest-neighbor-algorithm-for-machine-learning
[10] https://www.javatpoint.com/machine-learning-random-forest-algorithm
[11] Choudhary, M., Chouhan, S. S., Pilli, E. S., & Vipparthi, S. K. (2021). BerConvoNet: A deep learning framework for fake news classification. Applied Soft Computing, 110, 107614.
[12] Lee, J. S., & Hsiang, J. (2020). Patent classification by fine-tuning BERT language model. World Patent Information, 61, 101965.
[13] Abdullah, A., Awan, M., Shehzad, M., & Ashraf, M. (2020). Fake news classification bimodal using convolutional neural network and long short-term memory. Int. J. Emerg. Technol. Learn, 11, 209-212.
[14] Pontes, G. Fake News Data-Set. Kaggle.
[15] Jadhav, S. S., & Thepade, S. D. (2019). Fake news identification and classification using DSSM and improved recurrent neural network classifier. Applied Artificial Intelligence, 33(12), 1058-1068.
[16] https://www.techtarget.com/whatis/definition/fake-news