SlideShare a Scribd company logo
1 of 16
DETECTING FAKE NEWS ON
SOCIAL MEDIA
Nafim Hassan Pourno(ID:201-15-3510)
Habibur Rahman Ziad(ID:201-15-3541)
Ahmed Nihal(ID:201-15-3491)
CONTENT
• Introduction
• Major Problem
• Purpose
• Architecture
• Methodology
• Techniques
• Result
• Conclusion
• References
INTRODUCTION
• Fake news exist way before social media but it multifold when social media was
introduced
• Fake news is a news designed to deliberately spread hoaxes, propagenda and
disinformation
• Fake News Stories usually spread through social media sites like Facebook,
Twitter etc
BACKGROUND
• Social media is used for news reading
• Source of the news
• Proffession used to distribute the news in the past
• Nowadays , everybody wants to be a journalist .
• People are profiting by clickbaits and publishing fake news on online
• More clicks contribute to more money for content publishers.
MAJOR PROBLEMS
• By clicking on clickbait, users are led to page that contains false information.
• Fake news influences people’s preceptions.
• The rise of Fake news has become a global problem that even major tech companies like
Facebook and google are struggling to solve.It can be difficult to determine whether a text is
factual without additional context and human judgement
PURPOSE
• This project aims to develop a method for detecting classifying the news stories
using Natural Language Processing.
• The main goal is to indentify fake news, which is aclassic text classification issue.
• We gathered our data , preprocessed the text, and translated our articles into
supervised model feautures.
• Our goal is to develop a model that classifies a given news article as either fake or
trure.
DELIMITATIONS
• Our system does not guarantee 100% accuracy
• The system is unable to test data that is unrelated to the training database
TYPES OF FAKE NEWS
Visual based type
Visual based are mainly photoshopped images and videos which are
posted in social medias
Linguistic based type
Linguistic based are mainly the manipulation of text and string
content/ This isssues is with blogs , news , or emails
DATA SET
 Datasets are collected reaseech purposes with help of kaggole websites
 Our data is preleveled with fake and real news
 30% of the data is used for training the Ml Model
 70 of the data is used to test the model
WORKFLOW
• The steps in this procedure are as follows:
• Data set loading
• Data pre processing (remove stop words,
streaming, Drop duplicate and remove
meaningless char from the text.)
• Feauture Selection
• Applying Classifiication and model Consruction
• Classifying the new data
CONFUSION MATRIX
A confusion matrix is a table that is used to
define the performance of a classification
algorithm. A confusion matrix visualizes and
summarizes the performance of a
classification algorithm.
METHODOLOGY
1. Data collection
2. Data preprocessing
3. Model implementation
DATA COLLECTION
• In this paper, we used a dataset which collected from Kaggle[28].There are two dataset.
One of them includes real data, while the other ones contain fake data. True dataset
consists of 21417 data and Fake dataset consists of 23481 data.Title, text, subject, and
date are the four features present in both datasets. Text, Title and Subject which
attributes are contain qualitative data means all is textual data and Subject represent
categorical data.
DATA PREPROCESSING
• Both organized and unstructured data might be included. Unstructured data is that which does not adhere to all of the
grammar rules and may contain typos and slang. Structured data adheres to proper grammar standards. Both organized and
unstructured data do not produce the greatest outcomes. It is advisable to utilize data with a semi-structure. Data that is
partially organized but not completely unstructured is referred to as semi-structured data. It stands in between the two.We use
NLP.
• Eliminate punctuation
• Tokenization
• Stopwords
• Stemming
MODEL IMPLEMENTATION
• Logistic Regression
• Decision Tree
• Random Forest
• Gradient Boosting
• Naïve Bayes
 At present, one of the most popular methods is machine learning. First we taken a dataset for
implementation .But we need to modify this dataset before applying classifiers. There are two
dataset. Purpose of our work ,we need to merge our two dataset and also add a categorical
features (class) which make sure the news is true or fake. So, we preprocessing the data which
will eventually increase the performance of our model and the text data is converted into a
numerical vector during vectorization. After completing we applied some classifiers--
RESULT
• For implementation and better results, we produced a dataset in the form of a CSV
file. We taken this dataset from Kaggle. And two dataset are merged for applying
classifiers and get best performance. Additionally, a google colab was developed to
put the ML program into practice. We have employed decision trees, random
forests, gradient boosting, logistic regression, and k-nearest neighbors. The
accuracy for Logistic Regression is 0.99%, Decision Tree, Naïve Bayes is 0.93 . For
Decision Tree, it was 0.99%, for Random Forest, it was 0.98%, and for Gradient
Boosting Classifier, it was 0.99%.
• Classifiers are models that are applied to both training and testing sets of data.
Accuracy is the proportion of times an algorithm successfully classifies a data point.
Precision is calculated as the ratio of correctly anticipated positive observations to
all positively predicted observations.
CONCLUSION
• All classifiers have these prediction values calculated, and the final percentage is determined
by averaging all of these prediction values. Using these metrics. In order to determine the
percentage of news that is accurate, we are setting a range. The news is being displayed first,
then the result, on the webserver we built. We used certain emoticons as a symbol to more
effectively display the results and draw users' attention. A platform-independent web server
has been developed. It implies that every configuration of the webserver will be device-
independent. We used Bootstrap to make our web server independent of hardware. The
resultant output takes the form of a message that varies depending on certain
percentages.These prediction values are computed for each classifier, and the final 97.8 % is
obtained by averaging all of these prediction values. These metrics are used. We are
establishing a range to determine the proportion of news that is accurate. On the webserver
we constructed, the news is displayed before the outcome. To capture users' attention and
more effectively convey the results, we employed specific emoticons as a sign. A web server
that works on any platform has been created. It indicates that each webserver setup will be
independent of the device. To make our web server independent of hardware, we used
Bootstrap. The output that results is a message that changes based on specific percentages.

More Related Content

What's hot

Sentiment Analysis using Twitter Data
Sentiment Analysis using Twitter DataSentiment Analysis using Twitter Data
Sentiment Analysis using Twitter DataHari Prasad
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in TwitterAyushi Dalmia
 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on TwitterNitish J Prabhu
 
Fake_News_Detection_1st_review[1] capstone project.pptx
Fake_News_Detection_1st_review[1] capstone project.pptxFake_News_Detection_1st_review[1] capstone project.pptx
Fake_News_Detection_1st_review[1] capstone project.pptxHarshMangal20
 
IRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic RegressionIRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic RegressionIRJET Journal
 
Fake news detection
Fake news detection Fake news detection
Fake news detection shalushamil
 
Twitter sentiment analysis project report
Twitter sentiment analysis project reportTwitter sentiment analysis project report
Twitter sentiment analysis project reportBharat Khanna
 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysisRahul Jha
 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter DataNurendra Choudhary
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysisAshish Mundra
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment AnalysisRebecca Williams
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment AnalysisAyush Khandelwal
 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on TwitterSmritiAgarwal26
 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonHetu Bhavsar
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using mlPravin Katiyar
 
Sentimental analysis
Sentimental analysisSentimental analysis
Sentimental analysisAnkit Khera
 
Sentiment Analysis Using Twitter
Sentiment Analysis Using TwitterSentiment Analysis Using Twitter
Sentiment Analysis Using Twitterpiya chauhan
 

What's hot (20)

Sentiment Analysis using Twitter Data
Sentiment Analysis using Twitter DataSentiment Analysis using Twitter Data
Sentiment Analysis using Twitter Data
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on Twitter
 
Fake_News_Detection_1st_review[1] capstone project.pptx
Fake_News_Detection_1st_review[1] capstone project.pptxFake_News_Detection_1st_review[1] capstone project.pptx
Fake_News_Detection_1st_review[1] capstone project.pptx
 
IRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic RegressionIRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic Regression
 
Twitter sentiment analysis ppt
Twitter sentiment analysis pptTwitter sentiment analysis ppt
Twitter sentiment analysis ppt
 
Fake news detection
Fake news detection Fake news detection
Fake news detection
 
Twitter sentiment analysis project report
Twitter sentiment analysis project reportTwitter sentiment analysis project report
Twitter sentiment analysis project report
 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysis
 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter Data
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysis
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment Analysis
 
Social Media Sentiment Analysis
Social Media Sentiment AnalysisSocial Media Sentiment Analysis
Social Media Sentiment Analysis
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on Twitter
 
Opinion Mining
Opinion Mining Opinion Mining
Opinion Mining
 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using python
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using ml
 
Sentimental analysis
Sentimental analysisSentimental analysis
Sentimental analysis
 
Sentiment Analysis Using Twitter
Sentiment Analysis Using TwitterSentiment Analysis Using Twitter
Sentiment Analysis Using Twitter
 

Similar to Detecting fake news .pptx

Hashtag Recommendation System in a P2P Social Networking Application
Hashtag Recommendation System in a P2P Social Networking ApplicationHashtag Recommendation System in a P2P Social Networking Application
Hashtag Recommendation System in a P2P Social Networking Applicationcsandit
 
ppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptx
ppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptxppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptx
ppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptxGeetha982072
 
Turbocharging your Google Analytics data
Turbocharging your Google Analytics dataTurbocharging your Google Analytics data
Turbocharging your Google Analytics dataNeil Barnes
 
CYBERBULLYING DETECTION USING MACHINE LEARNING-1 (1).pdf
CYBERBULLYING DETECTION USING              MACHINE LEARNING-1 (1).pdfCYBERBULLYING DETECTION USING              MACHINE LEARNING-1 (1).pdf
CYBERBULLYING DETECTION USING MACHINE LEARNING-1 (1).pdfKumbidiGaming
 
IRJET- Analysis of Brand Value Prediction based on Social Media Data
IRJET-  	  Analysis of Brand Value Prediction based on Social Media DataIRJET-  	  Analysis of Brand Value Prediction based on Social Media Data
IRJET- Analysis of Brand Value Prediction based on Social Media DataIRJET Journal
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data AnalyticsOsman Ali
 
fakenews_DBDA_Mar23.pptx
fakenews_DBDA_Mar23.pptxfakenews_DBDA_Mar23.pptx
fakenews_DBDA_Mar23.pptxdeepmitra8
 
Building an AI and ML Model Using KNIME and Python.pptx
Building an AI and ML Model Using KNIME and Python.pptxBuilding an AI and ML Model Using KNIME and Python.pptx
Building an AI and ML Model Using KNIME and Python.pptxssuser448ad3
 
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptxcyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptxSaiKiran101146
 
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...IRJET Journal
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesSlideTeam
 
BTech Final Project (1).pptx
BTech Final Project (1).pptxBTech Final Project (1).pptx
BTech Final Project (1).pptxSwarajPatel19
 
Mind Map Test Data Management Overview
Mind Map Test Data Management OverviewMind Map Test Data Management Overview
Mind Map Test Data Management Overviewdublinx
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Studyvivatechijri
 
Gender Classification based on Mouse Movement Activity on Computer
Gender Classification based on Mouse Movement Activity on ComputerGender Classification based on Mouse Movement Activity on Computer
Gender Classification based on Mouse Movement Activity on ComputerAmin Ismail
 
Data migration patterns special
Data migration patterns   specialData migration patterns   special
Data migration patterns specialManikandan Suresh
 

Similar to Detecting fake news .pptx (20)

Hashtag Recommendation System in a P2P Social Networking Application
Hashtag Recommendation System in a P2P Social Networking ApplicationHashtag Recommendation System in a P2P Social Networking Application
Hashtag Recommendation System in a P2P Social Networking Application
 
ppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptx
ppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptxppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptx
ppt_fak_newshhhhhhjjjjjjjhhjjjsjjsjjsj.pptx
 
Turbocharging your Google Analytics data
Turbocharging your Google Analytics dataTurbocharging your Google Analytics data
Turbocharging your Google Analytics data
 
CYBERBULLYING DETECTION USING MACHINE LEARNING-1 (1).pdf
CYBERBULLYING DETECTION USING              MACHINE LEARNING-1 (1).pdfCYBERBULLYING DETECTION USING              MACHINE LEARNING-1 (1).pdf
CYBERBULLYING DETECTION USING MACHINE LEARNING-1 (1).pdf
 
IRJET- Analysis of Brand Value Prediction based on Social Media Data
IRJET-  	  Analysis of Brand Value Prediction based on Social Media DataIRJET-  	  Analysis of Brand Value Prediction based on Social Media Data
IRJET- Analysis of Brand Value Prediction based on Social Media Data
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
fakenews_DBDA_Mar23.pptx
fakenews_DBDA_Mar23.pptxfakenews_DBDA_Mar23.pptx
fakenews_DBDA_Mar23.pptx
 
Building an AI and ML Model Using KNIME and Python.pptx
Building an AI and ML Model Using KNIME and Python.pptxBuilding an AI and ML Model Using KNIME and Python.pptx
Building an AI and ML Model Using KNIME and Python.pptx
 
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptxcyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
 
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation Slides
 
Rapid Miner
Rapid MinerRapid Miner
Rapid Miner
 
BTech Final Project (1).pptx
BTech Final Project (1).pptxBTech Final Project (1).pptx
BTech Final Project (1).pptx
 
Mind Map Test Data Management Overview
Mind Map Test Data Management OverviewMind Map Test Data Management Overview
Mind Map Test Data Management Overview
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
 
Data processing
Data processingData processing
Data processing
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
Gender Classification based on Mouse Movement Activity on Computer
Gender Classification based on Mouse Movement Activity on ComputerGender Classification based on Mouse Movement Activity on Computer
Gender Classification based on Mouse Movement Activity on Computer
 
Data migration patterns special
Data migration patterns   specialData migration patterns   special
Data migration patterns special
 

Recently uploaded

Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 

Recently uploaded (20)

Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 

Detecting fake news .pptx

  • 1. DETECTING FAKE NEWS ON SOCIAL MEDIA Nafim Hassan Pourno(ID:201-15-3510) Habibur Rahman Ziad(ID:201-15-3541) Ahmed Nihal(ID:201-15-3491)
  • 2. CONTENT • Introduction • Major Problem • Purpose • Architecture • Methodology • Techniques • Result • Conclusion • References
  • 3. INTRODUCTION • Fake news exist way before social media but it multifold when social media was introduced • Fake news is a news designed to deliberately spread hoaxes, propagenda and disinformation • Fake News Stories usually spread through social media sites like Facebook, Twitter etc
  • 4. BACKGROUND • Social media is used for news reading • Source of the news • Proffession used to distribute the news in the past • Nowadays , everybody wants to be a journalist . • People are profiting by clickbaits and publishing fake news on online • More clicks contribute to more money for content publishers.
  • 5. MAJOR PROBLEMS • By clicking on clickbait, users are led to page that contains false information. • Fake news influences people’s preceptions. • The rise of Fake news has become a global problem that even major tech companies like Facebook and google are struggling to solve.It can be difficult to determine whether a text is factual without additional context and human judgement
  • 6. PURPOSE • This project aims to develop a method for detecting classifying the news stories using Natural Language Processing. • The main goal is to indentify fake news, which is aclassic text classification issue. • We gathered our data , preprocessed the text, and translated our articles into supervised model feautures. • Our goal is to develop a model that classifies a given news article as either fake or trure.
  • 7. DELIMITATIONS • Our system does not guarantee 100% accuracy • The system is unable to test data that is unrelated to the training database
  • 8. TYPES OF FAKE NEWS Visual based type Visual based are mainly photoshopped images and videos which are posted in social medias Linguistic based type Linguistic based are mainly the manipulation of text and string content/ This isssues is with blogs , news , or emails
  • 9. DATA SET  Datasets are collected reaseech purposes with help of kaggole websites  Our data is preleveled with fake and real news  30% of the data is used for training the Ml Model  70 of the data is used to test the model
  • 10. WORKFLOW • The steps in this procedure are as follows: • Data set loading • Data pre processing (remove stop words, streaming, Drop duplicate and remove meaningless char from the text.) • Feauture Selection • Applying Classifiication and model Consruction • Classifying the new data
  • 11. CONFUSION MATRIX A confusion matrix is a table that is used to define the performance of a classification algorithm. A confusion matrix visualizes and summarizes the performance of a classification algorithm.
  • 12. METHODOLOGY 1. Data collection 2. Data preprocessing 3. Model implementation
  • 13. DATA COLLECTION • In this paper, we used a dataset which collected from Kaggle[28].There are two dataset. One of them includes real data, while the other ones contain fake data. True dataset consists of 21417 data and Fake dataset consists of 23481 data.Title, text, subject, and date are the four features present in both datasets. Text, Title and Subject which attributes are contain qualitative data means all is textual data and Subject represent categorical data. DATA PREPROCESSING • Both organized and unstructured data might be included. Unstructured data is that which does not adhere to all of the grammar rules and may contain typos and slang. Structured data adheres to proper grammar standards. Both organized and unstructured data do not produce the greatest outcomes. It is advisable to utilize data with a semi-structure. Data that is partially organized but not completely unstructured is referred to as semi-structured data. It stands in between the two.We use NLP. • Eliminate punctuation • Tokenization • Stopwords • Stemming
  • 14. MODEL IMPLEMENTATION • Logistic Regression • Decision Tree • Random Forest • Gradient Boosting • Naïve Bayes  At present, one of the most popular methods is machine learning. First we taken a dataset for implementation .But we need to modify this dataset before applying classifiers. There are two dataset. Purpose of our work ,we need to merge our two dataset and also add a categorical features (class) which make sure the news is true or fake. So, we preprocessing the data which will eventually increase the performance of our model and the text data is converted into a numerical vector during vectorization. After completing we applied some classifiers--
  • 15. RESULT • For implementation and better results, we produced a dataset in the form of a CSV file. We taken this dataset from Kaggle. And two dataset are merged for applying classifiers and get best performance. Additionally, a google colab was developed to put the ML program into practice. We have employed decision trees, random forests, gradient boosting, logistic regression, and k-nearest neighbors. The accuracy for Logistic Regression is 0.99%, Decision Tree, Naïve Bayes is 0.93 . For Decision Tree, it was 0.99%, for Random Forest, it was 0.98%, and for Gradient Boosting Classifier, it was 0.99%. • Classifiers are models that are applied to both training and testing sets of data. Accuracy is the proportion of times an algorithm successfully classifies a data point. Precision is calculated as the ratio of correctly anticipated positive observations to all positively predicted observations.
  • 16. CONCLUSION • All classifiers have these prediction values calculated, and the final percentage is determined by averaging all of these prediction values. Using these metrics. In order to determine the percentage of news that is accurate, we are setting a range. The news is being displayed first, then the result, on the webserver we built. We used certain emoticons as a symbol to more effectively display the results and draw users' attention. A platform-independent web server has been developed. It implies that every configuration of the webserver will be device- independent. We used Bootstrap to make our web server independent of hardware. The resultant output takes the form of a message that varies depending on certain percentages.These prediction values are computed for each classifier, and the final 97.8 % is obtained by averaging all of these prediction values. These metrics are used. We are establishing a range to determine the proportion of news that is accurate. On the webserver we constructed, the news is displayed before the outcome. To capture users' attention and more effectively convey the results, we employed specific emoticons as a sign. A web server that works on any platform has been created. It indicates that each webserver setup will be independent of the device. To make our web server independent of hardware, we used Bootstrap. The output that results is a message that changes based on specific percentages.