SlideShare a Scribd company logo
1 of 21
PEOPLE’S DEMOCRATIC REPUBLIC OFALGERIA
MINISTERY OF HIGHER EDUCATION AND SCIENTIFIC RESEARCH
UNIVERSITY OF MOHAMED BOUDIAF - M’SILA
DOMAIN: Mathematics and Computer
Science
FIELD: Computer Science
SUB-FIELD: Information Communication
Technologies
FACULTY: Mathematics and
Computer Science
DEPARTMENT of Computer
Science
A Dissertation in Fulfillment
For the Requirements of the Degree of Master
SUBJECT
Academic year: 2016 /2017
Mr: Y. ARIOUAT University of M’sila Chairman
Mr: TAHAR MEHENNI University of M’sila Supervisor
Mr: M. KAMEL University of M’sila Examiner
By: Yassine Bensaoucha
A data mining tool for the detection of
suicide in social networks
1
Introduction
Suicide
Social Network
Data mining
2
Naive Bayes Classifiers
Proposed classification
Implementation Of The System
Conclusion
plan
Introduction
At present, the suicide phenomenon is raising, having a relevant impact on our
society. each year millions of people die as a result of suicidal, behavior becoming
an economic social and human problem. on the other hand, the use of SN as a means
of communication is becoming extremely popular, and people find that writing about
their feeling and sharing it to the world through this SNs is much easier then talking
about it in real life. the increasing number of publications in SN has made the field
of data mining evolving.
Based on these considerations, we have developed a program that can
detect the suicidal tendencies of people via Twitter.
3
4
Suicide
Suicide is when people direct violence at themselves with the intent to end their
lives, and they die as a result of their actions. Suicide is a leading cause of death
in the United States.
According to the analysis carried out by the World Health Organization
(WHO) in 2012 every 40 seconds a person died by suicide, The WHO estimates
that suicide is the 13th leading cause of death in the world and the third one
between youth aged 15-44
5
The Social Network: is a network structure composed of social
entities (might be people, organisations or groups) as nodes. And
connections in the form of edges, that represent the connection or
relationship amongst these entities. Such connections might be
friendships, or business connection etc.
Social Network
Figure 1: Social Network
Data mining is the process of sorting through large datasets to identify patterns
and establish relationships to solve problems through data analysis.
Data mining
Text Mining
the process at which text is transferred into data that can be analysed. this
incurs the procedures of creating an index for the individual terms, based on the
location of the term within the original text, or based on other techniques or
protocols. The words and indexes can then be used for a variety of analysis
methods.
Sentiment Analysis
the process of determining the emotional tone behind a series of words, used to
gain an understanding of the the attitudes, opinions and emotions expressed
within an online mention.
6
Figure 2 : Machine learning techniques
7
Naive Bayes Classifiers
The Naive Bayes algorithm is a widely used algorithm for document
classification. Given a feature vector table, the algorithm computes the posterior
probability that the document belongs to different classes and assigns it to the
class with the highest posterior probability. There are two commonly used models
( multinomial model and multi-variate Bernoulli model) for using Naive Bayes
approach for text categorization.
8
Figure 3: trainining multinomial naive bayes
Figure 4: test multinomial naive bayes
9
Example multinomial naive Bayes classifier:
Figure 5 Example multinomial naive Bayes classifier
Decide:
whether document d5 belonging to class c=China?
10
Figure 6: resulut of the example multinomial naive Bayes classifier
11
Advantages of Naïve Bayes
Easy to implement
Requires a small amount of training data to estimate the parameters
Good results obtained in most of the cases
12
Data Pre-processing:
The goal behind preprocessing is to
represent each document as a
feature vector, to separate the text
into individual words.
Preprocessing text is called
tokenization or text
normalization.
Figure 7: Data Pre-processing
13
Proposed classification:
Each sentence is first preprocessed and then passed into three categories of
classifiers, each deciding whether the sentence belongs to the
corresponding category or not.
This category contains positive
and regular sentences
Contains negative sentences that do not contain suicidal
phrases like
"the taste of pizza today very bad".
contains suicidal phrases that are classifed as
emergency phrases in the previous search
“Each day, each hour, each minute is just torture. I want it to
end.”
“Waking up every day wishing I hadn’t “
“I feel like the only way to no longer carry this pain is to die.”
“I am weird and slow. Every social interaction is painfully
awkward. “
“I want to finish my life.”
“No one understands me in this life, I'm leaving.” 14
positive negative suicidal
Proposal Solution for Zero problem(document):
This happens in case the probability of each category is 0, that is when
we pass a new text (tweet) entirely on our classification and we do not
have any word from this text in the dataset
first solution: is switching each word from the new tweet (sentence )
with synonyms, then we re-categorize the new tweet (sentence with the
same meaning ) .
second solution:
the solution depends on the extraction of sentiment from the sentence
using the textblob library.
Sentiment(polarity,subjectivity).
The polarity score is a float within the
range [-1.0, 1.0].
The subjectivity is a float within the
range [0.0, 1.0]
if the number of “polarity” is greater than 0 Then positive else negative 15
Implementation
Of The System
download
tweets
pre-process
the tweets
Classification
Positive negative suicidal
16
Analysis of the results of
Classification
The user has
suicidal tendencies
The user has no
suicidal
tendencies
Username of the person
that we want to apply the
calassification on it
Username
Twitter
API
17
download
tweets
18
Analysis of the results of
Classification
Calculate the percentage of positiveTweets = positive
Calculate the percentage of negativeTweets = negative
Calculate the percentage of suicidalTweets = suicidal
elif suicidal > 20%
and negative > 60%
If suicidal > 70% The user has
suicidal
tendencies
else
The user has
no suicidal
tendencies
19
20
This work aims to create a program, which will be capable of detecting suicide
in social networks, so the results obtained from this study or this program were
acceptable.
While we have identified some interesting and promising results, future work
will be compare the results of this work with other classification. and also
merging our classification with decision tree classification.
CONCLUSION
21
thank you

More Related Content

What's hot

Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligenceu053675
 
Artificial intelligence - An Overview
Artificial intelligence - An OverviewArtificial intelligence - An Overview
Artificial intelligence - An OverviewGiri Dharan
 
Artificial Intelligence
Artificial Intelligence Artificial Intelligence
Artificial Intelligence XashAxel
 
Anpr based licence plate detection report
Anpr  based licence plate detection reportAnpr  based licence plate detection report
Anpr based licence plate detection reportsomchaturvedi
 
The 7 Biggest Ethical Challenges of Artificial Intelligence
The 7 Biggest Ethical Challenges of Artificial IntelligenceThe 7 Biggest Ethical Challenges of Artificial Intelligence
The 7 Biggest Ethical Challenges of Artificial IntelligenceBernard Marr
 
PPT on Artificial Intelligence(A.I.)
PPT on Artificial Intelligence(A.I.) PPT on Artificial Intelligence(A.I.)
PPT on Artificial Intelligence(A.I.) Aakanksh Nath
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligencevallibhargavi
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceGirish Naik
 
Social Media Meets Artificial Intelligence
Social Media Meets Artificial IntelligenceSocial Media Meets Artificial Intelligence
Social Media Meets Artificial IntelligenceHoracio González Duhart
 
A.I. and Robots in the Workplace
A.I. and Robots in the WorkplaceA.I. and Robots in the Workplace
A.I. and Robots in the WorkplaceManpreet Singh Bedi
 
final presentation fake news detection.pptx
final presentation fake news detection.pptxfinal presentation fake news detection.pptx
final presentation fake news detection.pptxRudraSaraswat6
 
ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEMidhuti
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligencevallibhargavi
 
Principles of Artificial Intelligence & Machine Learning
Principles of Artificial Intelligence & Machine LearningPrinciples of Artificial Intelligence & Machine Learning
Principles of Artificial Intelligence & Machine LearningJerry Lu
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceNeil Mathew
 
Artificial Intelligency ppt class 8
Artificial Intelligency ppt class 8Artificial Intelligency ppt class 8
Artificial Intelligency ppt class 8madhuyadav97
 

What's hot (20)

Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Aippt
AipptAippt
Aippt
 
Artificial intelligence - An Overview
Artificial intelligence - An OverviewArtificial intelligence - An Overview
Artificial intelligence - An Overview
 
Artificial Intelligence
Artificial Intelligence Artificial Intelligence
Artificial Intelligence
 
Anpr based licence plate detection report
Anpr  based licence plate detection reportAnpr  based licence plate detection report
Anpr based licence plate detection report
 
The 7 Biggest Ethical Challenges of Artificial Intelligence
The 7 Biggest Ethical Challenges of Artificial IntelligenceThe 7 Biggest Ethical Challenges of Artificial Intelligence
The 7 Biggest Ethical Challenges of Artificial Intelligence
 
PPT on Artificial Intelligence(A.I.)
PPT on Artificial Intelligence(A.I.) PPT on Artificial Intelligence(A.I.)
PPT on Artificial Intelligence(A.I.)
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligence
 
Machine Learning and Artificial Intelligence
Machine Learning and Artificial IntelligenceMachine Learning and Artificial Intelligence
Machine Learning and Artificial Intelligence
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Social Media Meets Artificial Intelligence
Social Media Meets Artificial IntelligenceSocial Media Meets Artificial Intelligence
Social Media Meets Artificial Intelligence
 
Chatbot ppt
Chatbot pptChatbot ppt
Chatbot ppt
 
A.I. and Robots in the Workplace
A.I. and Robots in the WorkplaceA.I. and Robots in the Workplace
A.I. and Robots in the Workplace
 
final presentation fake news detection.pptx
final presentation fake news detection.pptxfinal presentation fake news detection.pptx
final presentation fake news detection.pptx
 
ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCE
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligence
 
Principles of Artificial Intelligence & Machine Learning
Principles of Artificial Intelligence & Machine LearningPrinciples of Artificial Intelligence & Machine Learning
Principles of Artificial Intelligence & Machine Learning
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Artificial Intelligency ppt class 8
Artificial Intelligency ppt class 8Artificial Intelligency ppt class 8
Artificial Intelligency ppt class 8
 

Similar to Detecting Suicide Risk in Social Media Posts Using Data Mining

76201960
7620196076201960
76201960IJRAT
 
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...SaurabhMishra450
 
Predicting depression using deep learning and ensemble algorithms on raw twit...
Predicting depression using deep learning and ensemble algorithms on raw twit...Predicting depression using deep learning and ensemble algorithms on raw twit...
Predicting depression using deep learning and ensemble algorithms on raw twit...IJECEIAES
 
Hybrid sentiment and network analysis of social opinion polarization icoict
Hybrid sentiment and network analysis of social opinion polarization   icoictHybrid sentiment and network analysis of social opinion polarization   icoict
Hybrid sentiment and network analysis of social opinion polarization icoictAndry Alamsyah
 
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNINGMIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNINGCSEIJJournal
 
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
Mimeme Attribute Classification using LDV Ensemble Multimodel LearningMimeme Attribute Classification using LDV Ensemble Multimodel Learning
Mimeme Attribute Classification using LDV Ensemble Multimodel LearningCSEIJJournal
 
Automatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature ReviewAutomatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature ReviewDr. Amarjeet Singh
 
Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression ModelDepression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Modelijtsrd
 
Combating propaganda texts using transfer learning
Combating propaganda texts using transfer learningCombating propaganda texts using transfer learning
Combating propaganda texts using transfer learningIAESIJAI
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...IRJET Journal
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...IRJET Journal
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET Journal
 
Text to Emotion Extraction Using Supervised Machine Learning Techniques
Text to Emotion Extraction Using Supervised Machine Learning TechniquesText to Emotion Extraction Using Supervised Machine Learning Techniques
Text to Emotion Extraction Using Supervised Machine Learning TechniquesTELKOMNIKA JOURNAL
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisijfcstjournal
 
A review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptxA review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptxvoicemail1
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningIJSRD
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningIJSRD
 
Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...
Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...
Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...IRJET Journal
 

Similar to Detecting Suicide Risk in Social Media Posts Using Data Mining (20)

76201960
7620196076201960
76201960
 
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
 
Predicting depression using deep learning and ensemble algorithms on raw twit...
Predicting depression using deep learning and ensemble algorithms on raw twit...Predicting depression using deep learning and ensemble algorithms on raw twit...
Predicting depression using deep learning and ensemble algorithms on raw twit...
 
Hybrid sentiment and network analysis of social opinion polarization icoict
Hybrid sentiment and network analysis of social opinion polarization   icoictHybrid sentiment and network analysis of social opinion polarization   icoict
Hybrid sentiment and network analysis of social opinion polarization icoict
 
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNINGMIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
 
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
Mimeme Attribute Classification using LDV Ensemble Multimodel LearningMimeme Attribute Classification using LDV Ensemble Multimodel Learning
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
 
Automatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature ReviewAutomatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature Review
 
Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression ModelDepression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Model
 
Combating propaganda texts using transfer learning
Combating propaganda texts using transfer learningCombating propaganda texts using transfer learning
Combating propaganda texts using transfer learning
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...
 
Abstract
AbstractAbstract
Abstract
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
 
ashu ppt final.pptx
ashu ppt final.pptxashu ppt final.pptx
ashu ppt final.pptx
 
Text to Emotion Extraction Using Supervised Machine Learning Techniques
Text to Emotion Extraction Using Supervised Machine Learning TechniquesText to Emotion Extraction Using Supervised Machine Learning Techniques
Text to Emotion Extraction Using Supervised Machine Learning Techniques
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
 
A review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptxA review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptx
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion Mining
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion Mining
 
Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...
Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...
Semantic Sentiment Analysis using Machine Learning for Suicidal Tendency Pred...
 

Recently uploaded

GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 

Recently uploaded (20)

GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 

Detecting Suicide Risk in Social Media Posts Using Data Mining

  • 1. PEOPLE’S DEMOCRATIC REPUBLIC OFALGERIA MINISTERY OF HIGHER EDUCATION AND SCIENTIFIC RESEARCH UNIVERSITY OF MOHAMED BOUDIAF - M’SILA DOMAIN: Mathematics and Computer Science FIELD: Computer Science SUB-FIELD: Information Communication Technologies FACULTY: Mathematics and Computer Science DEPARTMENT of Computer Science A Dissertation in Fulfillment For the Requirements of the Degree of Master SUBJECT Academic year: 2016 /2017 Mr: Y. ARIOUAT University of M’sila Chairman Mr: TAHAR MEHENNI University of M’sila Supervisor Mr: M. KAMEL University of M’sila Examiner By: Yassine Bensaoucha A data mining tool for the detection of suicide in social networks 1
  • 2. Introduction Suicide Social Network Data mining 2 Naive Bayes Classifiers Proposed classification Implementation Of The System Conclusion plan
  • 3. Introduction At present, the suicide phenomenon is raising, having a relevant impact on our society. each year millions of people die as a result of suicidal, behavior becoming an economic social and human problem. on the other hand, the use of SN as a means of communication is becoming extremely popular, and people find that writing about their feeling and sharing it to the world through this SNs is much easier then talking about it in real life. the increasing number of publications in SN has made the field of data mining evolving. Based on these considerations, we have developed a program that can detect the suicidal tendencies of people via Twitter. 3
  • 4. 4 Suicide Suicide is when people direct violence at themselves with the intent to end their lives, and they die as a result of their actions. Suicide is a leading cause of death in the United States. According to the analysis carried out by the World Health Organization (WHO) in 2012 every 40 seconds a person died by suicide, The WHO estimates that suicide is the 13th leading cause of death in the world and the third one between youth aged 15-44
  • 5. 5 The Social Network: is a network structure composed of social entities (might be people, organisations or groups) as nodes. And connections in the form of edges, that represent the connection or relationship amongst these entities. Such connections might be friendships, or business connection etc. Social Network Figure 1: Social Network
  • 6. Data mining is the process of sorting through large datasets to identify patterns and establish relationships to solve problems through data analysis. Data mining Text Mining the process at which text is transferred into data that can be analysed. this incurs the procedures of creating an index for the individual terms, based on the location of the term within the original text, or based on other techniques or protocols. The words and indexes can then be used for a variety of analysis methods. Sentiment Analysis the process of determining the emotional tone behind a series of words, used to gain an understanding of the the attitudes, opinions and emotions expressed within an online mention. 6
  • 7. Figure 2 : Machine learning techniques 7
  • 8. Naive Bayes Classifiers The Naive Bayes algorithm is a widely used algorithm for document classification. Given a feature vector table, the algorithm computes the posterior probability that the document belongs to different classes and assigns it to the class with the highest posterior probability. There are two commonly used models ( multinomial model and multi-variate Bernoulli model) for using Naive Bayes approach for text categorization. 8
  • 9. Figure 3: trainining multinomial naive bayes Figure 4: test multinomial naive bayes 9
  • 10. Example multinomial naive Bayes classifier: Figure 5 Example multinomial naive Bayes classifier Decide: whether document d5 belonging to class c=China? 10
  • 11. Figure 6: resulut of the example multinomial naive Bayes classifier 11
  • 12. Advantages of Naïve Bayes Easy to implement Requires a small amount of training data to estimate the parameters Good results obtained in most of the cases 12
  • 13. Data Pre-processing: The goal behind preprocessing is to represent each document as a feature vector, to separate the text into individual words. Preprocessing text is called tokenization or text normalization. Figure 7: Data Pre-processing 13
  • 14. Proposed classification: Each sentence is first preprocessed and then passed into three categories of classifiers, each deciding whether the sentence belongs to the corresponding category or not. This category contains positive and regular sentences Contains negative sentences that do not contain suicidal phrases like "the taste of pizza today very bad". contains suicidal phrases that are classifed as emergency phrases in the previous search “Each day, each hour, each minute is just torture. I want it to end.” “Waking up every day wishing I hadn’t “ “I feel like the only way to no longer carry this pain is to die.” “I am weird and slow. Every social interaction is painfully awkward. “ “I want to finish my life.” “No one understands me in this life, I'm leaving.” 14 positive negative suicidal
  • 15. Proposal Solution for Zero problem(document): This happens in case the probability of each category is 0, that is when we pass a new text (tweet) entirely on our classification and we do not have any word from this text in the dataset first solution: is switching each word from the new tweet (sentence ) with synonyms, then we re-categorize the new tweet (sentence with the same meaning ) . second solution: the solution depends on the extraction of sentiment from the sentence using the textblob library. Sentiment(polarity,subjectivity). The polarity score is a float within the range [-1.0, 1.0]. The subjectivity is a float within the range [0.0, 1.0] if the number of “polarity” is greater than 0 Then positive else negative 15
  • 16. Implementation Of The System download tweets pre-process the tweets Classification Positive negative suicidal 16 Analysis of the results of Classification The user has suicidal tendencies The user has no suicidal tendencies
  • 17. Username of the person that we want to apply the calassification on it Username Twitter API 17 download tweets
  • 18. 18 Analysis of the results of Classification Calculate the percentage of positiveTweets = positive Calculate the percentage of negativeTweets = negative Calculate the percentage of suicidalTweets = suicidal elif suicidal > 20% and negative > 60% If suicidal > 70% The user has suicidal tendencies else The user has no suicidal tendencies
  • 19. 19
  • 20. 20 This work aims to create a program, which will be capable of detecting suicide in social networks, so the results obtained from this study or this program were acceptable. While we have identified some interesting and promising results, future work will be compare the results of this work with other classification. and also merging our classification with decision tree classification. CONCLUSION

Editor's Notes

  1. So we looked for the best technique to classify texts, and we found that the best technique is "yas" Which gives us good results, so we applied them and work to improve the result
  2. Priors and conditional probabilities of each word Likely houds
  3. But what if we don't find any word(synonym) that matches the text we have in the dataset?
  4. Of course we need to consult doctors in this proportion