Ranking Twitter Conversations
• Motivating Example
– Extract real-time information
– People's views on upcoming elections and products
– Extract user interests from conversation topics
• Problem Definition
– Rank Twitter conversations
– Generate a snippet for each ranked conversation
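As a rough illustration of the snippet-generation step, here is a minimal Python sketch. The overlap heuristic, the `generate_snippet` name, and the truncation length are assumptions for illustration, not the project's actual method.

```python
# Hypothetical snippet generator for a ranked conversation: pick the
# tweet with the most query-word overlap and truncate it if too long.
# This is an illustrative assumption, not the project's actual method.

def generate_snippet(conversation, query_words, max_len=60):
    """Return a short excerpt from the most query-relevant tweet."""
    def overlap(tweet):
        return len(set(tweet.lower().split()) & set(query_words))
    best = max(conversation, key=overlap)
    return best if len(best) <= max_len else best[:max_len].rstrip() + "..."

convo = ["who r u voting for", "I am voting in the election tomorrow"]
print(generate_snippet(convo, ["election", "voting"]))
# -> "I am voting in the election tomorrow"
```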
Related Work
• Wang, Hao, Zhengdong Lu, Hang Li, and Enhong Chen. "A
Dataset for Research on Short-Text Conversations." In
EMNLP, pp. 935-945. 2013.
• Key Idea
– A retrieval-based response model for short-text conversation
• Their solution
– Considered a few selected topics from Sina Weibo
– Semantic matching between post and response
– Post–response similarity
• Their results
– Mean average precision: 0.621
– Retrieval is fairly effective at capturing semantic relevance, but relatively weak at modeling logical consistency
Our Methodology
• Key Idea of our work
– Assign an importance score to each tweet based on its position, and to each user based on their appearances, in addition to using an inverted index
• Solution Description
– Filter tweets
– Create an inverted word index
– Handle SMS language
– Score tweets according to TF combined with tweet and user scores
– Weight the TF score by word type
• Hashtag, user mention, other words
– Generate snippets
• Our approach ranks Twitter conversations rather than just finding responses to tweets
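The scoring idea above can be sketched in Python. The word-type weights, the additive combination, and all function names are illustrative assumptions rather than the project's exact formulation.

```python
# Hypothetical sketch of the tweet-scoring step described above.
# The word-type weights and the combination formula are assumptions,
# not the exact values used in the project.

from collections import Counter

# Assumed weights per word type: hashtags and user mentions count more.
TYPE_WEIGHT = {"hashtag": 3.0, "mention": 2.0, "other": 1.0}

def word_type(word):
    """Classify a token as a hashtag, user mention, or ordinary word."""
    if word.startswith("#"):
        return "hashtag"
    if word.startswith("@"):
        return "mention"
    return "other"

def tf_score(tweet_words, query_words):
    """Type-weighted term-frequency score of a tweet for a query."""
    counts = Counter(tweet_words)
    return sum(counts[w] * TYPE_WEIGHT[word_type(w)]
               for w in query_words if w in counts)

def tweet_score(tweet_words, query_words, position_score, user_score):
    """Combine TF with the position-based tweet score and the user score
    (a simple weighted sum; the real combination may differ)."""
    return tf_score(tweet_words, query_words) + position_score + user_score

# Example: a hashtag match contributes 3.0, an ordinary word 1.0.
print(tf_score(["#election", "vote", "now"], ["#election", "vote"]))  # -> 4.0
```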
System pipeline:
1. Parse Twitter data
2. Filter valid tweets
3. Extract conversations
4. Remove stop words
5. Remove duplicate words in a tweet
6. Create inverted word index
7. Calculate user and tweet scores
8. Get query
9. Parse words in query
10. Expand SMS words
11. Calculate conversation score based on TF and tweet and user scores
12. Generate snippet and display the results
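The indexing steps in the flow above (stop-word removal, per-tweet de-duplication, inverted index construction) might look roughly like this. The tiny stop-word list and the tweet representation are assumptions for illustration.

```python
# Minimal sketch of the indexing steps in the pipeline above: remove
# stop words, drop duplicate words within a tweet, and build an inverted
# word index mapping each word to the tweet ids containing it.

from collections import defaultdict

# Assumed illustrative subset; the project used a list of 119 stop words.
STOP_WORDS = {"the", "a", "is", "in", "to", "of"}

def clean(tweet_text):
    """Lower-case, remove stop words, and de-duplicate words in a tweet."""
    seen = []
    for w in tweet_text.lower().split():
        if w not in STOP_WORDS and w not in seen:
            seen.append(w)
    return seen

def build_inverted_index(tweets):
    """Map each word to the set of tweet ids it appears in."""
    index = defaultdict(set)
    for tid, text in tweets.items():
        for w in clean(text):
            index[w].add(tid)
    return index

tweets = {1: "the election is coming", 2: "vote in the election election"}
index = build_inverted_index(tweets)
print(sorted(index["election"]))  # -> [1, 2]
```

Looking up a query word in `index` then returns the candidate tweets to score, which is the usual role of an inverted index.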
Dataset and Experimental Settings
• Dataset details (size and statistics)
– 12,077 tweets
– 4,521 conversations (length >= 2)
– 119 stop words
• Experimental settings
– Experimented with adding or removing the following constraints:
• Duplicate words
• Stop words
• Tweet/user score
– Expanded SMS words in the query
• Evaluation metric
– Evaluation was subjective and carried out iteratively
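The SMS-word expansion applied to queries could be sketched as follows. The abbreviation dictionary here is a made-up sample, not the project's actual mapping.

```python
# Illustrative sketch of expanding SMS shorthand in a query before
# matching it against the index. The abbreviation dictionary is an
# assumed sample, not the mapping used in the project.

SMS_EXPANSIONS = {
    "u": "you",
    "gr8": "great",
    "2day": "today",
    "pls": "please",
}

def expand_sms(query):
    """Replace known SMS abbreviations with their full forms."""
    return " ".join(SMS_EXPANSIONS.get(w, w) for w in query.lower().split())

print(expand_sms("u coming 2day"))  # -> "you coming today"
```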
Results and Summary
• Results and analysis
– The results are subjective in nature; accuracy could not be measured without knowing the context of each conversation
• What did you learn from this project?
– A basic understanding of how documents can be
ranked given a query
• Future work:
– Infer context of the conversation
– Calculate precision/recall by programmatically tagging
tweets