SlideShare a Scribd company logo
1 of 16
Detection and Extracting of Emergency
          Knowledge from Twitter Streams

         Bernhard Klein, Xabier Laiseca, Diego Casado-
        Mansilla, Diego Lopez-de-Ipiña and Alejandro Prada
                             Nespral
6th International Conference on Ubiquitous Computing and Ambient Intelligence
     Session 10: Key application domains: eEmergency, eLearning, eTraining
                               5. December, 2012



                                           Social
                                           Awareness
                                           Based
                                           Emergency
                                           Situation
                                           Solver

     UCAmI 2012                     B. Klein                      1/17
Outline


1. Problem Description
2. Research Field
3. Architecture of Analysis Tool
4. Semantic Social Network Analysis
5. Recent Advances
6. Conclusions




  UCAmI 2012             B. Klein     2/17
Objective


   Trends Detection  Event Knowledge Extraction
     ≠ Counting of Keywords
      Aggregation + Interpretation of post content!


   Problems:
        Big data
        Noisy + short posts
        Real-time support




     UCAmI 2012                B. Klein                3/17
Twitter Examples

► Good    examples:

► Bad   examples:




► Crawling     reality:




  UCAmI 2012              B. Klein   4/17
Research Field
                                           • SensePlace2
     • Hacer and Muraki, 2011              • TweetTracker
     • Sudha et al., 2011                  • Twitcident
                                            Emergency
         Corpus Analysis                    Support
                                            Tools



                           Microblogging



   SNA-Techniques                          Clustering-Techniques
• Mendozza et al., 2010
                                           • Becker et al, 2011
                                           • Marcus et al, 2011
                        NLP-Techniques     • Pohl et al, 2012
                    • Sudha et al, 2011
                    • Abel et al, 2011

       UCAmI 2012                            B. Klein              5/17
SABESS Web system




 UCAmI 2012    B. Klein   6/17
Opensource Implementation
              • Emergency message filter based on emergency taxonomy
              • Language filter e.g. english or spanish
              • Slang reduction (punctation + letter repititions)




 UCAmI 2012                B. Klein                       7/17
Social Network Analysis

► Objective:   Filtering after tweet credibility




  UCAmI 2012                B. Klein               8/17
Observed Problems

   “Slow” Graph Calculations
     Replace   betweeness centrality with user data
       a) followers count ~ influence
       b) friend count ~ knowledge access
       c) number of posts ~ experience


   “Sparse” Social Network
     Replace  SNA with Sentiment Analysis:
       Punctation-, letter- and word repititions
        Tweet credibility < Informative tweet!
          (see also Sudha et al., 2011)



    UCAmI 2012                            B. Klein     9/17
Natural language procesing

► Objective:   Content enrichment




                        • Big Improvement with “slang reduction” !!

  UCAmI 2012              B. Klein                         10/17
Other Knowledge Sources

   Hierarchical Knowledge Structure
    1. Textual location
       a) Named Entity Location
       b) Regular Expression e.g. address
          (Requires reverse coding!)
    2. Tweet metadata
       a) GPS tagged tweets
       b) Place tagged tweets
          (Author location can be different!)
    3. User profile data
       a) Home location                    Increasing reliability!



    UCAmI 2012                   B. Klein                  11/17
Recent Advances: Event Detection

► Objective:     Group tweets into emergency events

   How to describe an emergency event?
       Emergency type, location (range), time (progress),
        person/organization data, text descriptions, number of
        tweets
       Global reporting standard “Common Alert Protocol”.

   Example:




    UCAmI 2012                 B. Klein               12/17
Recent Advances: Clustering

► Incremental         DBSCAN

                            SANDY HURRICANE RELIEF VOLUNTEER EFFORTS
                            SANDY HURRICANE VICTIMS VOLUNTEER
                            SANDY HURRICANE VICTIMS VOLUNTEER EFFORT GRASSROOTS
                            SANDY HURRICANE VICTIMS VOLUNTEER NEWS EFFORT GRASSROOTS
Sandy, 180                  SANDY HURRICANE VICTIMS POLICE
Fukuschima, 170             SANDY HURRICANE VICTIMS RELIEF
….                          SANDY HURRICANE VICTIMS             Locations:
Ambulance, 80               SANDY HURRICANE VICTIMS RELIEF DISASTER PrincetonHall
                                                                e.g.
…..                         SANDY HURRICANE RELIEF
                            SANDY HURRICANE RELIEF
                            SANDY HURRICANE RELIEF DISASTER
                            SANDY VOLUNTEER

Online
                                                                        Conversations:
Dictionary                                                              ConversationID=83
                  Hashtags:
                  #TylerPerryFire
                                           Attachments:
                                           http://t.co/kqF7Xy8t




    UCAmI 2012                             B. Klein                                13/17
Common Alert Protocol

  Whenever clusters become modified,
  generate new alert message??
                                            Alert



                                CAP
                                Info
                                            Place




                                            Urls,
                                            Figs
      Cluster of tweets


 UCAmI 2012                      B. Klein           14/17
Conclusions

   Real-time analysis of noisy tweets
    ► Big data problem, 2 phase analysis
        Emergency message filtering
        Slang and language filtering
    ► Semantic Social Network Analysis
        POS/Noun tags, NER/Location tags
        Community centrality/follower count tags
    ► Tweet clustering
        Group tweets after hashtags, attachments and
         conversations
        Group tweets after emergency specific keywords
    ► Common Alert Protocol

    UCAmI 2012              B. Klein              15/17
Contact:       Bernhard Klein,
                     Email: bernhard.klein@deusto.es
                     Deusto Intitute of Technology,
                     University of Deusto,
 th International Conference on Ubiquitous Computing and Ambient Intelligence
6
                     Avda. Universidades, 24 | 48007 Bilbao |
       Session 10: Key application domains: eEmergency, eLearning, eTraining
                     Spain        5. December, 2012




     UCAmI 2012                    B. Klein                    16/17

More Related Content

Similar to UCAmI 2012 - Detection and Extracting of Emergency Knowledge from Twitter Streams

NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
Maryam Farooq
 
CSCW in Times of Social Media
CSCW in Times of Social MediaCSCW in Times of Social Media
CSCW in Times of Social Media
Hendrik Drachsler
 
Everbridge Webinar - Ten Years After 9/11
Everbridge Webinar - Ten Years After 9/11Everbridge Webinar - Ten Years After 9/11
Everbridge Webinar - Ten Years After 9/11
Everbridge, Inc.
 
May 2009
May 2009May 2009
May 2009
linioti
 
Future of AI: Blockchain and Deep Learning
Future of AI: Blockchain and Deep LearningFuture of AI: Blockchain and Deep Learning
Future of AI: Blockchain and Deep Learning
Melanie Swan
 
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital ObjectsICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
Shawn Day
 

Similar to UCAmI 2012 - Detection and Extracting of Emergency Knowledge from Twitter Streams (20)

From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?
 
Identifying and Responding to Emerging Technologies
Identifying and Responding to Emerging TechnologiesIdentifying and Responding to Emerging Technologies
Identifying and Responding to Emerging Technologies
 
A Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationA Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and Participation
 
"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11
 
Module 1 - Data Around Us .pptx
Module 1 - Data Around Us .pptxModule 1 - Data Around Us .pptx
Module 1 - Data Around Us .pptx
 
Evolution of Open at University of Michigan
Evolution of Open at University of MichiganEvolution of Open at University of Michigan
Evolution of Open at University of Michigan
 
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
 
CSCW in Times of Social Media
CSCW in Times of Social MediaCSCW in Times of Social Media
CSCW in Times of Social Media
 
Shaping future research environments: digital challenges and opportunities
Shaping future research environments: digital challenges and opportunitiesShaping future research environments: digital challenges and opportunities
Shaping future research environments: digital challenges and opportunities
 
Everbridge Webinar - Ten Years After 9/11
Everbridge Webinar - Ten Years After 9/11Everbridge Webinar - Ten Years After 9/11
Everbridge Webinar - Ten Years After 9/11
 
May 2009
May 2009May 2009
May 2009
 
Content Architecture for Rapid Knowledge Reuse-congility2011
Content Architecture for Rapid Knowledge Reuse-congility2011Content Architecture for Rapid Knowledge Reuse-congility2011
Content Architecture for Rapid Knowledge Reuse-congility2011
 
ACM ICPC Regional Finals Talk re: drop.io, privacy, entrepreneurship by sam l...
ACM ICPC Regional Finals Talk re: drop.io, privacy, entrepreneurship by sam l...ACM ICPC Regional Finals Talk re: drop.io, privacy, entrepreneurship by sam l...
ACM ICPC Regional Finals Talk re: drop.io, privacy, entrepreneurship by sam l...
 
Future of AI: Blockchain and Deep Learning
Future of AI: Blockchain and Deep LearningFuture of AI: Blockchain and Deep Learning
Future of AI: Blockchain and Deep Learning
 
The Commons
The CommonsThe Commons
The Commons
 
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
 
Credibility and Relevance of User-Generated Content on Crisis Events
Credibility and Relevance of User-Generated Content on Crisis EventsCredibility and Relevance of User-Generated Content on Crisis Events
Credibility and Relevance of User-Generated Content on Crisis Events
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
 
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital ObjectsICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

UCAmI 2012 - Detection and Extracting of Emergency Knowledge from Twitter Streams

  • 1. Detection and Extracting of Emergency Knowledge from Twitter Streams Bernhard Klein, Xabier Laiseca, Diego Casado- Mansilla, Diego Lopez-de-Ipiña and Alejandro Prada Nespral 6th International Conference on Ubiquitous Computing and Ambient Intelligence Session 10: Key application domains: eEmergency, eLearning, eTraining 5. December, 2012 Social Awareness Based Emergency Situation Solver UCAmI 2012 B. Klein 1/17
  • 2. Outline 1. Problem Description 2. Research Field 3. Architecture of Analysis Tool 4. Semantic Social Network Analysis 5. Recent Advances 6. Conclusions UCAmI 2012 B. Klein 2/17
  • 3. Objective  Trends Detection  Event Knowledge Extraction ≠ Counting of Keywords  Aggregation + Interpretation of post content!  Problems:  Big data  Noisy + short posts  Real-time support UCAmI 2012 B. Klein 3/17
  • 4. Twitter Examples ► Good examples: ► Bad examples: ► Crawling reality: UCAmI 2012 B. Klein 4/17
  • 5. Research Field • SensePlace2 • Hacer and Muraki, 2011 • TweetTracker • Sudha et al., 2011 • Twitcident Emergency Corpus Analysis Support Tools Microblogging SNA-Techniques Clustering-Techniques • Mendozza et al., 2010 • Becker et al, 2011 • Marcus et al, 2011 NLP-Techniques • Pohl et al, 2012 • Sudha et al, 2011 • Abel et al, 2011 UCAmI 2012 B. Klein 5/17
  • 6. SABESS Web system UCAmI 2012 B. Klein 6/17
  • 7. Opensource Implementation • Emergency message filter based on emergency taxonomy • Language filter e.g. english or spanish • Slang reduction (punctation + letter repititions) UCAmI 2012 B. Klein 7/17
  • 8. Social Network Analysis ► Objective: Filtering after tweet credibility UCAmI 2012 B. Klein 8/17
  • 9. Observed Problems  “Slow” Graph Calculations  Replace betweeness centrality with user data a) followers count ~ influence b) friend count ~ knowledge access c) number of posts ~ experience  “Sparse” Social Network  Replace SNA with Sentiment Analysis: Punctation-, letter- and word repititions Tweet credibility < Informative tweet! (see also Sudha et al., 2011) UCAmI 2012 B. Klein 9/17
  • 10. Natural language procesing ► Objective: Content enrichment • Big Improvement with “slang reduction” !! UCAmI 2012 B. Klein 10/17
  • 11. Other Knowledge Sources  Hierarchical Knowledge Structure 1. Textual location a) Named Entity Location b) Regular Expression e.g. address (Requires reverse coding!) 2. Tweet metadata a) GPS tagged tweets b) Place tagged tweets (Author location can be different!) 3. User profile data a) Home location Increasing reliability! UCAmI 2012 B. Klein 11/17
  • 12. Recent Advances: Event Detection ► Objective: Group tweets into emergency events  How to describe an emergency event?  Emergency type, location (range), time (progress), person/organization data, text descriptions, number of tweets  Global reporting standard “Common Alert Protocol”.  Example: UCAmI 2012 B. Klein 12/17
  • 13. Recent Advances: Clustering ► Incremental DBSCAN SANDY HURRICANE RELIEF VOLUNTEER EFFORTS SANDY HURRICANE VICTIMS VOLUNTEER SANDY HURRICANE VICTIMS VOLUNTEER EFFORT GRASSROOTS SANDY HURRICANE VICTIMS VOLUNTEER NEWS EFFORT GRASSROOTS Sandy, 180 SANDY HURRICANE VICTIMS POLICE Fukuschima, 170 SANDY HURRICANE VICTIMS RELIEF …. SANDY HURRICANE VICTIMS Locations: Ambulance, 80 SANDY HURRICANE VICTIMS RELIEF DISASTER PrincetonHall e.g. ….. SANDY HURRICANE RELIEF SANDY HURRICANE RELIEF SANDY HURRICANE RELIEF DISASTER SANDY VOLUNTEER Online Conversations: Dictionary ConversationID=83 Hashtags: #TylerPerryFire Attachments: http://t.co/kqF7Xy8t UCAmI 2012 B. Klein 13/17
  • 14. Common Alert Protocol Whenever clusters become modified, generate new alert message?? Alert CAP Info Place Urls, Figs Cluster of tweets UCAmI 2012 B. Klein 14/17
  • 15. Conclusions  Real-time analysis of noisy tweets ► Big data problem, 2 phase analysis  Emergency message filtering  Slang and language filtering ► Semantic Social Network Analysis  POS/Noun tags, NER/Location tags  Community centrality/follower count tags ► Tweet clustering  Group tweets after hashtags, attachments and conversations  Group tweets after emergency specific keywords ► Common Alert Protocol UCAmI 2012 B. Klein 15/17
  • 16. Contact: Bernhard Klein, Email: bernhard.klein@deusto.es Deusto Intitute of Technology, University of Deusto, th International Conference on Ubiquitous Computing and Ambient Intelligence 6 Avda. Universidades, 24 | 48007 Bilbao | Session 10: Key application domains: eEmergency, eLearning, eTraining Spain 5. December, 2012 UCAmI 2012 B. Klein 16/17