Summarizing Situational Tweets in Crisis Scenario

Scientist at the Qatar Computing Research Institute - Lead of the Crisis Computing group at QCRI
Aug. 16, 2016
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
1 of 25

More Related Content

Viewers also liked

#Membership renewal new logo#Membership renewal new logo
#Membership renewal new logopatcalhoun
ISCRAM 2013: Extracting Information Nuggets from  Disaster-Related Messages i...ISCRAM 2013: Extracting Information Nuggets from  Disaster-Related Messages i...
ISCRAM 2013: Extracting Information Nuggets from Disaster-Related Messages i...ISCRAM Events
01.사람의 이해와 UX/UI(시스템과 유저 인터페이스)01.사람의 이해와 UX/UI(시스템과 유저 인터페이스)
01.사람의 이해와 UX/UI(시스템과 유저 인터페이스)Billy Choi
Laundry Management SystemLaundry Management System
Laundry Management Systemcetrosoft
영화 미디어에 투영된 미래 분석 및 미래가전 시사점 퍼셉션영화 미디어에 투영된 미래 분석 및 미래가전 시사점 퍼셉션
영화 미디어에 투영된 미래 분석 및 미래가전 시사점 퍼셉션PERCEPTION
Concepto e importancia de la mercadotecniaConcepto e importancia de la mercadotecnia
Concepto e importancia de la mercadotecniaNombre Apellidos

Similar to Summarizing Situational Tweets in Crisis Scenario

Processing Social Media Messages in Mass Emergency: A SurveyProcessing Social Media Messages in Mass Emergency: A Survey
Processing Social Media Messages in Mass Emergency: A SurveyMuhammad Imran
OCHA Think Brief - Hashtag Standards for emergenciesOCHA Think Brief - Hashtag Standards for emergencies
OCHA Think Brief - Hashtag Standards for emergenciesJan Husar
Data Analytics and Industry-Academic Partnerships: An Irish PerspectiveData Analytics and Industry-Academic Partnerships: An Irish Perspective
Data Analytics and Industry-Academic Partnerships: An Irish PerspectiveJohn Breslin
Open analytics   social media frameworkOpen analytics   social media framework
Open analytics social media frameworkOpen Analytics
Twitter as a personalizable information service iiTwitter as a personalizable information service ii
Twitter as a personalizable information service iiKan-Han (John) Lu
Using Twitter as a Postgraduate ResearcherUsing Twitter as a Postgraduate Researcher
Using Twitter as a Postgraduate ResearcherSimon Bishop

Similar to Summarizing Situational Tweets in Crisis Scenario(20)

More from Muhammad Imran

Damage Assessment from Social Media Imagery Data During DisastersDamage Assessment from Social Media Imagery Data During Disasters
Damage Assessment from Social Media Imagery Data During DisastersMuhammad Imran
Image4Act: Online Social Media Image Processing for Disaster ResponseImage4Act: Online Social Media Image Processing for Disaster Response
Image4Act: Online Social Media Image Processing for Disaster ResponseMuhammad Imran
Real-Time Processing of Social Media Content for Social GoodReal-Time Processing of Social Media Content for Social Good
Real-Time Processing of Social Media Content for Social GoodMuhammad Imran
AIDR Tutorial (Artificial Intelligence for Disaster Response)AIDR Tutorial (Artificial Intelligence for Disaster Response)
AIDR Tutorial (Artificial Intelligence for Disaster Response)Muhammad Imran
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...Muhammad Imran
The Role of Social Media and Artificial Intelligence for Disaster ResponseThe Role of Social Media and Artificial Intelligence for Disaster Response
The Role of Social Media and Artificial Intelligence for Disaster ResponseMuhammad Imran

Recently uploaded

Colaboration Eco-system MNC Bank.pptxColaboration Eco-system MNC Bank.pptx
Colaboration Eco-system MNC Bank.pptxrezairsyadul
Dunning - SIGMOD - Data Economy.pptxDunning - SIGMOD - Data Economy.pptx
Dunning - SIGMOD - Data Economy.pptxTed Dunning
Data collection.pdfData collection.pdf
Data collection.pdfMuthuLakshmi124949
Data Warehouse with Fabric on data lakehouseData Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouseMarco Pozzan
International Observe the Moon Night 2023International Observe the Moon Night 2023
International Observe the Moon Night 2023VICTOR MAESTRE RAMIREZ
Essential numpy before you start your Machine Learning journey in python.pdfEssential numpy before you start your Machine Learning journey in python.pdf
Essential numpy before you start your Machine Learning journey in python.pdfSmrati Kumar Katiyar

Summarizing Situational Tweets in Crisis Scenario

Editor's Notes

  1. For an effective planning, we need as complete information as possible. Looking back over the events of 2004 and before, it is striking how many of the year’s disasters, deaths, medical emergencies etc. could have been avoided with better information and communication with the affected people. The opaqueness induced by disasters is overwhelming. Information could be a lifeline for everybody involved in a disasters situation. People need information as much as water, food, medicine or shelter. Lack of information make people victims of disasters and targets of aid. http://www.ifrc.org/Global/Publications/disasters/WDR/69001-WDR2005-english-LR.pdf
  2. A number of research studies reveal that social media platforms such as twitter can be very useful sources of information during disasters. For example, Twitter provide active communciation channels during crises. Affected people post sitatuational awareness and actionable information. Studies have seen reports of casualities, damages, donations offers and requests… We have also seen that Twitter for example, can be quicker than traditional channels in terms of reporting first hand information
  3. Social media data can be useful for rapid crisis response But, it requires real-time data analysis So the categorizations of each incoming item into different user-defined classes should be as soon as they For example, infrastructure damage, urgent needs
  4. Social media data can be useful for rapid crisis response But, it requires real-time data analysis So the categorizations of each incoming item into different user-defined classes should be as soon as they For example, infrastructure damage, urgent needs
  5. We use a trigram-based log-likelihood score using a language model as a dummy representative of the Readability of the generated content.
  6. For extractive summarization, we mainly focus on content words. For example, nouns, numerals, main verbs Studies show that content words play significant role to capture important events. Now the question is, how much of the content words we can capture after processing N number of words. For this purpose, we randomly sample same number of tweets for each of the three classes for Nepal EQ and other two events black-Friday and Chelsea match. It was observed that number of content words are very small in case of disaster events. BLF = blackfriday CHL = chelsea match Distinct content words posted during disaster are very less in number compared to generic events Within 1000 words limit, we can cover more than 80% of the content words in a disaster event.
  7. Challenges in abstractive summarization
  8. Given this graph, in a real-scenario, the number of generated tweet-paths can be several thousands, because there can be multiple points of merging across several tweets. Our goal is to generate sentences by selecting the best paths from these generated paths with the objective of generating a readable and informative summary. We set a minimum (10 words) and maximum (16 words) length for a sentence to be generated. We apply such constraints to avoid very long sentences that might be grammatically ill-formed and very short sentences that are often incomplete.
  9. The ILP-based technique optimizes based upon three factors - (i) Presence of content words (ii) Informativeness of a path i.e. importance of a path, and (iii) Linguistic Quality Score that captures the readability of a path using a trigram confidence score. Each sentence is represented as a TF-IDF vector. The centroid is basically the mean of the TF-IDF vectors of all the sentences.
  10. Optimization problem and constraints An integer programming problem is a mathematical optimization program in which some or all of the variables are restricted to be integers and the objective function and the constraints (other than the integer constraints) are linear. Integer programming is NP-hard.
  11. Comparison of different methods and their drawbacks, our improvements
  12. Comparison with baselines when summary length increases