Twitter floods when it rains: A case study of the UK floods in early 2014
Antonia Saravanou
University of Athens
Dimitrios Gunopulos
University of Athens
George Valkanas
Stevens Institute of Technology
Gennady Andrienko
Fraunhofer Institute IAIS, DE
Social Web for Disaster Management (WWW workshop 2015)
Florence, Italy
National and Kapodistrian
University of Athens
Outline
● Motivation
● Research Questions
● Methodology
○ Data Collection
○ Filtering Step: Flood-Related Lexicon
○ Clustering Step
○ Second Level Clustering
● Results
Twitter floods when it rains: A case study of the UK floods in early 2014 18 May 2015
Motivation
● Identify the event and the affected area early
● Monitor the evolution of the event
● Inform users about emergencies
● Resource allocation
● Immediate notification of special incident management units
Research questions
RQ1: How can we identify the areas that have been hit the most by an event?
- where to dispatch emergency response units
RQ2: How effective can we be in identifying these areas?
- robust and effective techniques on which to base decisions
RQ3: Can we identify areas that have been stricken by the event in a similar manner?
- transfer the same techniques to similarly affected areas
Data Collection
● Twitter - custom crawler
○ Streaming API
● Collection of public tweets
○ Bounding box that covers UK
○ Extract only tweets with GPS
● 13-17 January 2014
● > 2.3 million geotagged tweets
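The collection filter above can be sketched as follows. This is a minimal illustration, not the authors' crawler: the bounding-box values are approximate assumptions, and the `coordinates` field holding a `(lon, lat)` pair is a hypothetical tweet layout.

```python
# Assumed, approximate UK bounding box in degrees (not taken from the slides).
UK_BBOX = (-8.65, 49.86, 1.77, 60.86)  # (west, south, east, north)


def in_bbox(lon, lat, bbox=UK_BBOX):
    """Return True if the point lies inside the bounding box."""
    west, south, east, north = bbox
    return west <= lon <= east and south <= lat <= north


def geotagged_in_bbox(tweets, bbox=UK_BBOX):
    """Yield only tweets that carry GPS coordinates inside the bounding box.

    Each tweet is a dict; "coordinates" is a hypothetical field holding
    a (lon, lat) tuple, or None / missing when the tweet has no GPS fix.
    """
    for tweet in tweets:
        coords = tweet.get("coordinates")
        if coords is None:
            continue  # drop tweets without GPS
        lon, lat = coords
        if in_bbox(lon, lat, bbox):
            yield tweet
```

Filtering on the client side as well keeps the dataset clean even if the stream returns tweets matched by a coarser place tag rather than an exact GPS point.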
Flood Related Tweets
[Diagram: Entire Dataset (GPS location within UK bounding box) → ? → Flood-Related Tweets]
Filtering Step: Custom Flood-Related Lexicon
1. Initial seed set (13 tokens): rain, flood, weather, storm, showers, ...
2. From the entire dataset, collect the tokens that contain at least one word of the initial seed set as a substring (1546 tokens), e.g. raining, floods, #ukweather, etc.
3. Manually review each keyword and discard unrelated false positives, e.g. brain, train, etc.
4. Keep only the tokens related to the event (456 tokens)
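The lexicon construction can be sketched in a few lines. This is an illustrative sketch only: the function names are hypothetical, the vocabulary is a toy example, and the manual review step is represented by a hand-written set of rejected tokens.

```python
def expand_seed(seed, vocabulary):
    """Return the vocabulary tokens that contain a seed word as a substring."""
    seed = [s.lower() for s in seed]
    return {tok for tok in vocabulary if any(s in tok.lower() for s in seed)}


def clean(candidates, rejected):
    """Drop the tokens a human reviewer marked as unrelated (false positives)."""
    return candidates - set(rejected)


# Toy data: part of the seed set from the slide and a tiny vocabulary.
seed = ["rain", "flood", "weather", "storm", "showers"]
vocabulary = {"raining", "floods", "#ukweather", "brain", "train", "coffee"}

candidates = expand_seed(seed, vocabulary)   # includes "brain" and "train"
lexicon = clean(candidates, rejected={"brain", "train"})
```

Substring matching is deliberately permissive, which is why the manual pass is needed: "brain" and "train" both contain "rain".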
Original vs. Flood Related Lexicon
● Manual cleaning process is necessary
● Only 4 of the top-10 most frequent keywords in the original lexicon are flood-related
● Flood Lexicon is ⅓ of the Original
- Slow process
+ One time at the beginning
[Figure: Top-10 most frequent keywords]
Flood Related Tweets
● A tweet is flood-related if it has an exact match with at least one keyword from our flood-related lexicon
[Diagram: Entire Dataset + Flood-Related Lexicon → Flood-Related Tweets]
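A minimal sketch of the exact-match filter, assuming a simple regex tokenizer that keeps hashtags as single tokens. The tokenizer and the function names are assumptions; the slides do not describe how tweets were tokenized.

```python
import re

# Assumed tokenizer: words and hashtags, lowercased.
TOKEN_RE = re.compile(r"#?\w+")


def tokenize(text):
    """Split a tweet into lowercase word and hashtag tokens."""
    return TOKEN_RE.findall(text.lower())


def is_flood_related(text, lexicon):
    """True if any token of the tweet exactly matches a lexicon keyword."""
    return any(tok in lexicon for tok in tokenize(text))
```

Because matching is on whole tokens, a tweet containing only "train" or "brain" is not selected, unlike in the substring step used to build the lexicon.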
RQ1: Identifying flood-affected areas
● Why we care
○ where to dispatch emergency response units
○ notify citizens about areas with problems caused by floods
● From GPS to areas
○ Perform spatial clustering using the GPS coordinates
■ Convert GPS coordinates to Cartesian ones
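The slides do not state which conversion from GPS to Cartesian coordinates was used. One simple option, shown here purely as an assumption, is an equirectangular projection around a reference point, which is reasonably accurate over an area the size of the UK.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def to_cartesian(lat, lon, lat0, lon0):
    """Project (lat, lon) in degrees to (x, y) in km relative to (lat0, lon0).

    Equirectangular approximation: east-west distances are scaled by the
    cosine of the reference latitude so that both axes share the same unit.
    """
    x = EARTH_RADIUS_KM * math.radians(lon - lon0) * math.cos(math.radians(lat0))
    y = EARTH_RADIUS_KM * math.radians(lat - lat0)
    return x, y
```

Without such a conversion, Euclidean distance on raw degrees would overweight east-west separation at UK latitudes, where one degree of longitude is much shorter than one degree of latitude.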
Clustering Step: K-Means
K = 10, 100, 500, 1000
Generated clusters as Voronoi polygons
➔ more splits in the densely populated areas
[Figure: generated clusters for K = 10, 100, 500 and 1000]
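As a rough illustration of the clustering step, here is a small pure-Python version of Lloyd's K-Means algorithm on projected (x, y) points. It is a sketch, not the implementation used in the study; the random initialization and the fixed iteration count are assumptions.

```python
import random


def kmeans(points, k, iterations=20, seed=0):
    """Cluster 2-D points with Lloyd's algorithm.

    Returns (centroids, assignment), where assignment[i] is the index of
    the centroid nearest to points[i] at the last assignment step.
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # pick k distinct input points
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        for i, (px, py) in enumerate(points):
            assignment[i] = min(
                range(k),
                key=lambda c: (px - centroids[c][0]) ** 2 + (py - centroids[c][1]) ** 2,
            )
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = (
                    sum(x for x, _ in members) / len(members),
                    sum(y for _, y in members) / len(members),
                )
    return centroids, assignment
```

Assigning every location to its nearest centroid is what makes the resulting areas Voronoi polygons, and dense regions attract more centroids, hence more splits.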
Which areas are the most affected?
● Prioritize generated areas by their potential of being affected
Prioritization schemes by area a:
1. By total #tweets: baseline
2. By flood-related #tweets
3. By Signal-to-Noise Ratio: score(a) = (#flood-related tweets in a) / (#tweets in a)
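The three prioritization schemes can be compared with a short sketch. The input layout (one area label and one flood flag per tweet) and the alphabetical tie-breaking are assumptions made for illustration.

```python
from collections import Counter


def rank_areas(area_of_tweet, is_flood, top_n=None):
    """Rank areas under the three schemes.

    area_of_tweet[i] is the area (cluster) label of tweet i and
    is_flood[i] tells whether tweet i is flood-related.
    Ties are broken by area label to keep the output deterministic.
    """
    total = Counter(area_of_tweet)
    flood = Counter(a for a, f in zip(area_of_tweet, is_flood) if f)
    # Signal-to-noise ratio: share of an area's tweets that are flood-related.
    snr = {a: flood[a] / total[a] for a in total}

    def top(score):
        return sorted(total, key=lambda a: (-score[a], a))[:top_n]

    return {"total": top(total), "flood": top(flood), "snr": top(snr)}
```

In the toy case of a busy area with few flood tweets and a quiet area where every tweet is about flooding, the baseline ranks the busy area first while SNR ranks the quiet one first.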
Visualization of top-100 most affected areas
[Figure: maps of the top 100 areas for K-Means (K=500) under each scheme: 1. total #tweets, 2. flood-related #tweets, 3. SNR]
RQ2: Identification Effectiveness
Ground Truth
- MetOffice
1. Likert Scale [1-5]: to specify the degree to which an area has been affected
a. 1 = “normal levels of rainfall”
b. 5 = “completely flooded”
2. Running Average Likert:
Results
Results (k = 100)
● Baseline < Flood, SNR
● Flood ~ SNR
Results (k = 500)
● Baseline << Flood < SNR
● #tweets is not a good proxy
● #flood-related tweets is a better one
Results (k = 1000)
● SNR the best metric (especially top20)
● how many users talk about the specific event
RQ3: Similarly affected areas
Identify areas that behave similarly over time in the way the flooding event was perceived by Twitter users
Underlying connection:
● population level, e.g., similar posting patterns
● other variable, e.g., a nearby river
Second Level Clustering: Attributes
Features that show the temporal evolution of the event in an area
1. Number of tweets in day d: count(d)
2. Ratio of day d for area a: ratio(d) = count(d) / Σ count(d’), summed over all days d’
3. Speed of day d: speed(d) = ratio(d) - ratio(d-1)
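The three features can be computed per area from its daily tweet counts. The sketch below follows the formulas above; setting the speed of the first day to 0.0 is an assumption, since ratio(d-1) is undefined there.

```python
def temporal_features(counts):
    """Compute ratio(d) and speed(d) for one area.

    counts[d] is the number of tweets from the area on day d.
    Returns (ratios, speeds), two lists of the same length as counts.
    """
    total = sum(counts)
    # ratio(d): share of the area's tweets that were posted on day d.
    ratios = [c / total if total else 0.0 for c in counts]
    # speed(d): day-over-day change of the ratio; first day set to 0.0 by assumption.
    speeds = [0.0] + [ratios[d] - ratios[d - 1] for d in range(1, len(ratios))]
    return ratios, speeds
```

Using ratios rather than raw counts makes areas with very different populations comparable, so the second-level clustering groups them by the shape of their activity rather than its volume.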
Second Level Clustering: Areas from 2 clusters
[Figure: map of areas from cluster 1 and cluster 2]
● Speed feature
● Red cluster: Scotland, Liverpool and Ireland, mostly unaffected
● Purple cluster: Midlands, affected
● Red speed decreases
● Purple speed increases
● Verification with historical data
The INSIGHT project
Intelligent Synthesis and Real-time Response using Massive Streaming of Heterogeneous Data
Detecting Events:
- sensors on road network
- sensors on buses
- Twitter data
http://www.insight-ict.eu/
Conclusions
● Analysis on Twitter data
○ emergencies, disaster management & relief
● Experimental analysis on floods
○ establishment of a “flood related lexicon”
○ division of the entire UK into affected areas
○ identification of flood-stricken areas with high accuracy
● Comparison with ground truth data
○ quality evaluation
Future Work
● Collect more data from similar flooding events and test our approach on larger datasets
○ generalize to other areas
○ test with a larger timespan
● Develop online clustering approaches (1ier)
● Incorporate our approach into the INSIGHT tool
Thank you!
Acknowledgements:
MMD - Mining Mobility Data
INSIGHT - Intelligent Synthesis and Real-time Response using Massive Streaming of Heterogeneous Data

More Related Content

Viewers also liked

Kashmir floods 2014
Kashmir floods 2014Kashmir floods 2014
Kashmir floods 2014ravilbsnaa
 
Jammu and Kashmir - Then and Now
Jammu and Kashmir - Then and NowJammu and Kashmir - Then and Now
Jammu and Kashmir - Then and NowVasantha Gullapalli
 
Data Mining on Twitter
Data Mining on TwitterData Mining on Twitter
Data Mining on TwitterPulkit Goyal
 
How has England been affected by Floods?
How has England been affected by Floods?How has England been affected by Floods?
How has England been affected by Floods?Hydro Cleansing
 
Flooding in bangladesh
Flooding in bangladeshFlooding in bangladesh
Flooding in bangladeshambermckenzie
 
Mozambique Floods 2000
Mozambique Floods 2000Mozambique Floods 2000
Mozambique Floods 2000missm
 
Bangladesh Flooding
Bangladesh FloodingBangladesh Flooding
Bangladesh Floodingsamuel valko
 
Urban Flood Risk from Flood Plains to Floor Drains
Urban Flood Risk from Flood Plains to Floor DrainsUrban Flood Risk from Flood Plains to Floor Drains
Urban Flood Risk from Flood Plains to Floor DrainsRobert Muir
 
Google project soli report
Google project soli reportGoogle project soli report
Google project soli reportSunil Havani
 
Project Soli by Google ATAP
Project Soli by Google ATAPProject Soli by Google ATAP
Project Soli by Google ATAPrguptarrr
 
Urban Flooding causes and Management Dr.Reddy
Urban Flooding causes and Management Dr.ReddyUrban Flooding causes and Management Dr.Reddy
Urban Flooding causes and Management Dr.ReddySai Bhaskar Reddy Nakka
 
Chennai flood 2015, The Disaster, The Challenges and The Solutions
Chennai flood 2015, The Disaster, The Challenges and The SolutionsChennai flood 2015, The Disaster, The Challenges and The Solutions
Chennai flood 2015, The Disaster, The Challenges and The SolutionsBharathi
 
Clock Synchronization in Distributed Systems
Clock Synchronization in Distributed SystemsClock Synchronization in Distributed Systems
Clock Synchronization in Distributed SystemsZbigniew Jerzak
 
URBAN FLOODS an opportunity for water conservation
URBAN FLOODS an opportunity for water conservationURBAN FLOODS an opportunity for water conservation
URBAN FLOODS an opportunity for water conservationSai Bhaskar Reddy Nakka
 

Viewers also liked (20)

Kashmir floods 2014
Kashmir floods 2014Kashmir floods 2014
Kashmir floods 2014
 
Jammu and Kashmir - Then and Now
Jammu and Kashmir - Then and NowJammu and Kashmir - Then and Now
Jammu and Kashmir - Then and Now
 
Data Mining on Twitter
Data Mining on TwitterData Mining on Twitter
Data Mining on Twitter
 
How has England been affected by Floods?
How has England been affected by Floods?How has England been affected by Floods?
How has England been affected by Floods?
 
Flooding in bangladesh
Flooding in bangladeshFlooding in bangladesh
Flooding in bangladesh
 
Mozambique Floods 2000
Mozambique Floods 2000Mozambique Floods 2000
Mozambique Floods 2000
 
Project soli
Project soliProject soli
Project soli
 
Project soli
Project  soliProject  soli
Project soli
 
Bangladesh Flooding
Bangladesh FloodingBangladesh Flooding
Bangladesh Flooding
 
Urban Flood Risk from Flood Plains to Floor Drains
Urban Flood Risk from Flood Plains to Floor DrainsUrban Flood Risk from Flood Plains to Floor Drains
Urban Flood Risk from Flood Plains to Floor Drains
 
Google project soli report
Google project soli reportGoogle project soli report
Google project soli report
 
Project Soli by Google ATAP
Project Soli by Google ATAPProject Soli by Google ATAP
Project Soli by Google ATAP
 
Bangladesh floods
Bangladesh floodsBangladesh floods
Bangladesh floods
 
Urban Flooding causes and Management Dr.Reddy
Urban Flooding causes and Management Dr.ReddyUrban Flooding causes and Management Dr.Reddy
Urban Flooding causes and Management Dr.Reddy
 
Chennai flood 2015, The Disaster, The Challenges and The Solutions
Chennai flood 2015, The Disaster, The Challenges and The SolutionsChennai flood 2015, The Disaster, The Challenges and The Solutions
Chennai flood 2015, The Disaster, The Challenges and The Solutions
 
Clock Synchronization in Distributed Systems
Clock Synchronization in Distributed SystemsClock Synchronization in Distributed Systems
Clock Synchronization in Distributed Systems
 
Mumbai floods 2005
Mumbai floods 2005Mumbai floods 2005
Mumbai floods 2005
 
project Soli ppt
project Soli pptproject Soli ppt
project Soli ppt
 
URBAN FLOODS an opportunity for water conservation
URBAN FLOODS an opportunity for water conservationURBAN FLOODS an opportunity for water conservation
URBAN FLOODS an opportunity for water conservation
 
Flood Management in Bangladesh
Flood Management in Bangladesh Flood Management in Bangladesh
Flood Management in Bangladesh
 

Similar to UK Flood Case Study: Identifying Affected Areas from Twitter Data

Social media use in the queensland floods
Social media use in the queensland floodsSocial media use in the queensland floods
Social media use in the queensland floodsEidos Australia
 
Social Media Use in the Queensland Floods
Social Media Use in the Queensland FloodsSocial Media Use in the Queensland Floods
Social Media Use in the Queensland FloodsAxel Bruns
 
IAHR 2015 - Managing flood risk in coastal cities through an integrated model...
IAHR 2015 - Managing flood risk in coastal cities through an integrated model...IAHR 2015 - Managing flood risk in coastal cities through an integrated model...
IAHR 2015 - Managing flood risk in coastal cities through an integrated model...Deltares
 
Twitter & mobility disruptions
Twitter & mobility disruptionsTwitter & mobility disruptions
Twitter & mobility disruptionsHolly Anne
 
New York City Case Study
New York City Case StudyNew York City Case Study
New York City Case StudyAlbert Chen
 
DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...
DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...
DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...Deltares
 
11. Forecasting Tools - Sandra Mancini & Michael Jones
11. Forecasting Tools - Sandra Mancini & Michael Jones11. Forecasting Tools - Sandra Mancini & Michael Jones
11. Forecasting Tools - Sandra Mancini & Michael JonesEasternOntarioCropConference
 
Final PresentationRodent Baiting
Final PresentationRodent BaitingFinal PresentationRodent Baiting
Final PresentationRodent BaitingSanchit Khandelwal
 
Results, calculations, and assumptions of the resilience.io WASH sector in GA...
Results, calculations, and assumptions of the resilience.io WASH sector in GA...Results, calculations, and assumptions of the resilience.io WASH sector in GA...
Results, calculations, and assumptions of the resilience.io WASH sector in GA...Ecological Sequestration Trust
 
Modeling Water Demand in Droughts (in England & Wales)
Modeling Water Demand in Droughts (in England & Wales)Modeling Water Demand in Droughts (in England & Wales)
Modeling Water Demand in Droughts (in England & Wales) Ben Anderson
 
Public crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media dataPublic crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media dataAlfonso Crisci
 
DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...
DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...
DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...Deltares
 
The Future of Water in New York
The Future of Water in New YorkThe Future of Water in New York
The Future of Water in New YorkCarter Craft
 
Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...
Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...
Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...Servizi a rete
 
Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...
Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...
Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...Ioannis Katakis
 
DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...
DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...
DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...Deltares
 
Smart Water nella Città del Futuro - Anders Lynggaard-Jensen: Real time sew...
 Smart Water nella Città del Futuro -  Anders Lynggaard-Jensen: Real time sew... Smart Water nella Città del Futuro -  Anders Lynggaard-Jensen: Real time sew...
Smart Water nella Città del Futuro - Anders Lynggaard-Jensen: Real time sew...Servizi a rete
 

Similar to UK Flood Case Study: Identifying Affected Areas from Twitter Data (20)

Social media use in the queensland floods
Social media use in the queensland floodsSocial media use in the queensland floods
Social media use in the queensland floods
 
Social Media Use in the Queensland Floods
Social Media Use in the Queensland FloodsSocial Media Use in the Queensland Floods
Social Media Use in the Queensland Floods
 
IAHR 2015 - Managing flood risk in coastal cities through an integrated model...
IAHR 2015 - Managing flood risk in coastal cities through an integrated model...IAHR 2015 - Managing flood risk in coastal cities through an integrated model...
IAHR 2015 - Managing flood risk in coastal cities through an integrated model...
 
Twitter & mobility disruptions
Twitter & mobility disruptionsTwitter & mobility disruptions
Twitter & mobility disruptions
 
New York City Case Study
New York City Case StudyNew York City Case Study
New York City Case Study
 
DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...
DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...
DSD-INT 2014 - Delft-FEWS Users Meeting - Extending FEWS with Floodtags - soc...
 
11. Forecasting Tools - Sandra Mancini & Michael Jones
11. Forecasting Tools - Sandra Mancini & Michael Jones11. Forecasting Tools - Sandra Mancini & Michael Jones
11. Forecasting Tools - Sandra Mancini & Michael Jones
 
Final PresentationRodent Baiting
Final PresentationRodent BaitingFinal PresentationRodent Baiting
Final PresentationRodent Baiting
 
Results, calculations, and assumptions of the resilience.io WASH sector in GA...
Results, calculations, and assumptions of the resilience.io WASH sector in GA...Results, calculations, and assumptions of the resilience.io WASH sector in GA...
Results, calculations, and assumptions of the resilience.io WASH sector in GA...
 
Modeling Water Demand in Droughts (in England & Wales)
Modeling Water Demand in Droughts (in England & Wales)Modeling Water Demand in Droughts (in England & Wales)
Modeling Water Demand in Droughts (in England & Wales)
 
Public crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media dataPublic crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media data
 
ENACTS: A New Technical Innovation to Meet Climate Information Needs
ENACTS: A New Technical Innovation to Meet Climate Information NeedsENACTS: A New Technical Innovation to Meet Climate Information Needs
ENACTS: A New Technical Innovation to Meet Climate Information Needs
 
DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...
DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...
DSD-INT 2020 Beyond the Forecast - Communicating Flood - Risk in the Toronto ...
 
The Future of Water in New York
The Future of Water in New YorkThe Future of Water in New York
The Future of Water in New York
 
4 intersucho zalud
4 intersucho zalud4 intersucho zalud
4 intersucho zalud
 
Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...
Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...
Smart Water nella Città del Futuro - Michele Romano: Event Recognition System...
 
Local Memory Project
Local Memory ProjectLocal Memory Project
Local Memory Project
 
Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...
Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...
Mining Twitter Data with Resource Constraints - IEEE/ACM Conference on Web In...
 
DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...
DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...
DSD-INT 2018 Improvement of hazard and damage estimations from tropical cyclo...
 
Smart Water nella Città del Futuro - Anders Lynggaard-Jensen: Real time sew...
 Smart Water nella Città del Futuro -  Anders Lynggaard-Jensen: Real time sew... Smart Water nella Città del Futuro -  Anders Lynggaard-Jensen: Real time sew...
Smart Water nella Città del Futuro - Anders Lynggaard-Jensen: Real time sew...
 

Recently uploaded

NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfWadeK3
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 

Recently uploaded (20)

NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 

UK Flood Case Study: Identifying Affected Areas from Twitter Data

  • 1. Twitter floods when it rains: A case study of the UK floods in early 2014 Antonia Saravanou University of Athens Dimitrios Gunopulos University of Athens George Valkanas Stevens Institute of Technology Gennady Andrienko Fraunhofer Institute IAIS, DE Social Web for Disaster Management (WWW workshop 2015) Florence, Italy National and Kapodistrian University of Athens
  • 2. Outline ● Motivation ● Research Questions ● Methodology ○ Data Collection ○ Filtering Step: Flood-Related Lexicon ○ Clustering Step ○ Second Level Clustering ● Results Twitter floods when it rains: A case study of the UK floods in early 2014 18 May 2015
  • 3. Motivation
  • 4. Motivation
  • 5. Motivation
  • 6. Motivation ● Identify the event and the affected area early ● Monitor the evolution of the event ● Inform users about emergencies ● Allocate resources ● Immediately notify special incident management units
  • 7. Research questions RQ1: How can we identify the areas that have been hit the most by an event? - where to dispatch emergency response units RQ2: How effective can we be in identifying these areas? - robust and effective techniques to base decisions on RQ3: Can we identify areas that have been stricken by the event in a similar manner? - transfer the same techniques to similarly affected areas
  • 8. Data Collection ● Twitter - custom crawler ○ Streaming API ● Collection of public tweets ○ Bounding box that covers UK ○ Extract only tweets with GPS ● 13-17 January 2014 ● > 2.3 million geotagged tweets
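The geographic filter on the slide above can be sketched as follows. This is a minimal illustration, not the authors' crawler: the bounding-box values, the `coords` field name and the sample tweets are all assumptions made for the example.

```python
# Hypothetical bounding box roughly covering the UK.
# These values are assumed for illustration; the slides do not give them.
UK_BBOX = {"min_lon": -8.7, "min_lat": 49.8, "max_lon": 1.8, "max_lat": 60.9}


def in_bbox(lat, lon, bbox=UK_BBOX):
    """Return True if the point lies inside the bounding box."""
    return (bbox["min_lat"] <= lat <= bbox["max_lat"]
            and bbox["min_lon"] <= lon <= bbox["max_lon"])


def keep_geotagged(tweets, bbox=UK_BBOX):
    """Keep only tweets that carry GPS coordinates inside the box."""
    kept = []
    for tweet in tweets:
        coords = tweet.get("coords")  # assumed: (lat, lon) tuple or None
        if coords is not None and in_bbox(coords[0], coords[1], bbox):
            kept.append(tweet)
    return kept


sample = [
    {"text": "Flooding on the Thames", "coords": (51.5, -0.12)},  # London
    {"text": "No GPS here", "coords": None},                      # dropped
    {"text": "Bonjour", "coords": (48.85, 2.35)},                 # Paris, outside
]
uk_tweets = keep_geotagged(sample)
```

Only the first sample tweet survives: the second has no GPS fix and the third falls outside the assumed box.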
  • 9. Flood Related Tweets [diagram: Entire Dataset (GPS location within UK bounding box) → ? → Flood-Related Tweets]
  • 10. Filtering Step: Custom Flood-Related lexicon ● Initial seed set (13 tokens): rain, flood, weather, storm, showers, ... ● Candidate tokens from the entire dataset (1546 tokens): tokens that contain at least one word of the initial seed set as a substring ● Manually review each keyword and discard unrelated false positives, e.g. brain, train, etc. ● Final lexicon (456 tokens): only tokens related to the event, e.g. raining, floods, #ukweather, etc.
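The lexicon-building step above might look roughly like this. It is a sketch under assumptions: the seed list holds only the five words named on the slide, the vocabulary is made up, and the manual review is modelled as a hand-written set of rejected tokens.

```python
# The five seed words named on the slide (the full seed set has 13 tokens).
SEED = ["rain", "flood", "weather", "storm", "showers"]


def candidate_tokens(tokens, seed=SEED):
    """Tokens containing at least one seed word as a substring."""
    return sorted({t.lower() for t in tokens
                   if any(s in t.lower() for s in seed)})


def apply_review(candidates, rejected):
    """Drop tokens that a human reviewer marked as unrelated."""
    return [t for t in candidates if t not in rejected]


# Made-up vocabulary for illustration.
vocab = ["raining", "brain", "train", "floods", "#ukweather", "coffee"]
candidates = candidate_tokens(vocab)

# Stand-in for the manual review: "brain" and "train" match "rain"
# as a substring but have nothing to do with the weather.
lexicon = apply_review(candidates, rejected={"brain", "train"})
```

Substring matching is deliberately generous, which is why the review pass is needed: it trades a one-off manual effort for high recall on variants and hashtags.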
  • 11. Original vs. Flood Related Lexicon ● Manual cleaning process is necessary ● Only 4 keywords in the original lexicon are flood-related ● Flood Lexicon is ⅓ of the Original - Slow process + Done once, at the beginning [table: Top-10 most frequent keywords]
  • 12. Flood Related Tweets: tweets from the entire dataset with an exact match with at least one keyword from our flood related lexicon [diagram: Entire Dataset + Flood Related Lexicon → Flood-Related Tweets]
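Exact-match filtering differs from the substring step used to build the lexicon. A minimal sketch, assuming a simple regex tokenizer and a three-word lexicon (both are illustrative choices, not taken from the slides):

```python
import re

# Tiny illustrative lexicon; the real one has 456 tokens.
LEXICON = {"raining", "floods", "#ukweather"}

# Assumed tokenizer: words, optionally prefixed by '#'.
TOKEN_RE = re.compile(r"#?\w+")


def tokenize(text):
    """Lowercase the text and split it into word and hashtag tokens."""
    return TOKEN_RE.findall(text.lower())


def is_flood_related(text, lexicon=LEXICON):
    """True if at least one token matches a lexicon keyword exactly."""
    return any(token in lexicon for token in tokenize(text))
```

With this lexicon, "Stuck on the train" is rejected even though "train" contains "rain", because whole tokens are compared rather than substrings.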
  • 13. RQ1: Identifying flood-affected areas ● Why we care ○ where to dispatch emergency response units ○ notify citizens about areas with problems caused by floods ● From GPS to areas ○ Perform spatial clustering using the GPS coordinates ■ Convert GPS coordinates to Cartesian ones
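The slides do not say which projection was used for the GPS-to-Cartesian conversion. One simple option is an equirectangular projection around a reference point, sketched below under the assumptions of a spherical Earth and a hypothetical reference point in the middle of the UK.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, spherical approximation


def to_cartesian(lat, lon, ref_lat, ref_lon):
    """Project (lat, lon) in degrees to local (x, y) in kilometres.

    Equirectangular approximation around (ref_lat, ref_lon); this is an
    assumed choice, not necessarily the projection used by the authors.
    """
    x = (EARTH_RADIUS_KM * math.radians(lon - ref_lon)
         * math.cos(math.radians(ref_lat)))
    y = EARTH_RADIUS_KM * math.radians(lat - ref_lat)
    return x, y


# Hypothetical reference point roughly in the middle of the UK.
REF_LAT, REF_LON = 54.0, -2.0
x, y = to_cartesian(55.0, -1.0, REF_LAT, REF_LON)
```

After projection, Euclidean distance between points approximates ground distance in kilometres, which is what a distance-based clustering algorithm needs.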
  • 14. Clustering Step: K-Means ● K = 10, 100, 500, 1000 ● Generated clusters shown as Voronoi polygons ➔ more splits in the densely populated areas [maps: 10 clusters, 100 clusters, 500 clusters, 1000 clusters]
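For readers unfamiliar with K-Means, a bare-bones version of Lloyd's algorithm on 2-D points is sketched below. It is a teaching sketch with an assumed random initialisation, not the implementation used in the study; a library implementation would be the practical choice at 2.3 million points.

```python
import random


def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd's algorithm on 2-D points; returns (centroids, assignment)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # assumed init: k random data points
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        assignment = [
            min(range(k),
                key=lambda c: (p[0] - centroids[c][0]) ** 2
                              + (p[1] - centroids[c][1]) ** 2)
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                new_centroids.append(
                    (sum(m[0] for m in members) / len(members),
                     sum(m[1] for m in members) / len(members)))
            else:
                new_centroids.append(centroids[c])  # keep empty cluster in place
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return centroids, assignment


# Two well-separated toy groups of points.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, assignment = kmeans(points, k=2)
```

Because every point belongs to its nearest centroid, the cluster regions form the Voronoi cells of the centroids, and dense regions attract more centroids, which matches the slide's note about more splits in densely populated areas.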
  • 15. Which areas are the most affected? ● Prioritize generated areas by their potential of being affected ● Prioritization schemes by area a: 1. By total #tweets: baseline 2. By flood-related #tweets 3. By Signal-to-Noise Ratio: score(a) = (#flood-related tweets in a) / (#tweets in a)
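The three prioritization schemes can be compared on a toy example. The area names and counts below are invented for illustration only.

```python
def rank_areas(stats, scheme):
    """Rank area ids, most affected first.

    stats maps area id -> (total_tweets, flood_related_tweets).
    scheme is one of "total" (baseline), "flood" or "snr".
    """
    def score(item):
        total, flood = item[1]
        if scheme == "total":
            return total
        if scheme == "flood":
            return flood
        if scheme == "snr":
            return flood / total if total else 0.0
        raise ValueError(f"unknown scheme: {scheme}")

    return [area for area, _ in sorted(stats.items(), key=score, reverse=True)]


# Invented counts: A is a busy city, C a small but heavily affected area.
stats = {"A": (10000, 40), "B": (800, 50), "C": (200, 30)}

by_total = rank_areas(stats, "total")
by_flood = rank_areas(stats, "flood")
by_snr = rank_areas(stats, "snr")
```

Here the baseline puts the busy area A first, the flood count favours B, and the ratio promotes the small area C, where 15% of all tweets mention the flood.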
  • 16. Visualization of top-100 most affected areas ● Top 100 for K-Means (K=500) [maps: 1. total #tweets, 2. flood-related #tweets, 3. SNR]
  • 17. RQ2: Identification Effectiveness ● Ground Truth - MetOffice 1. Likert Scale [1-5]: specifies the degree to which an area has been affected a. 1 = “normal levels of rainfall” b. 5 = “completely flooded” 2. Running Average Likert: [formula image]
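The running-average formula is not preserved in this transcript. One plausible reading, given that the results discuss top-k rankings, is the mean Likert score over the first k areas of a ranking, for each k. The sketch below implements that assumed definition and should not be taken as the authors' exact metric.

```python
def running_average(likert_in_rank_order):
    """Mean Likert score of the top-k areas, for k = 1..n.

    Assumed definition: the input is the ground-truth Likert score of
    each area, listed in the order produced by a prioritization scheme.
    """
    averages = []
    total = 0
    for k, score in enumerate(likert_in_rank_order, start=1):
        total += score
        averages.append(total / k)
    return averages


# Hypothetical Likert scores for the top three areas of some ranking.
curve = running_average([5, 4, 3])
```

Under this reading, a scheme that ranks badly flooded areas first keeps its curve close to 5 for small k.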
  • 18. Results
  • 19. Results (k = 100) ● Baseline < Flood, SNR ● Flood ~ SNR
  • 20. Results (k = 500) ● Baseline << Flood < SNR ● #tweets is not a good proxy ● #flood-related tweets is a better one
  • 21. Results (k = 1000) ● SNR is the best metric (especially for the top 20) ● it reflects how many users talk about the specific event
  • 22. RQ3: Similarly affected areas ● Identify areas that behave similarly over time in the way the flooding event was perceived by Twitter users ● Underlying connection: ○ population level, e.g., similar posting patterns ○ other variable, e.g., a nearby river
  • 23. Second Level Clustering: Attributes ● Features that show the temporal evolution of the event in an area 1. Number of tweets in day d, count(d) 2. Ratio of day d from area a, ratio(d) = count(d) / Σ count(d’), over all days d’ 3. Speed of day d, speed(d) = ratio(d) - ratio(d-1)
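The ratio and speed features defined above are straightforward to compute per area. In this sketch the daily counts are invented, and speed is left undefined for the first day since it has no predecessor.

```python
def temporal_features(counts):
    """Compute ratio(d) and speed(d) from one area's daily tweet counts.

    ratio(d) = count(d) / sum of count(d') over all days d'
    speed(d) = ratio(d) - ratio(d-1), defined from the second day on
    """
    total = sum(counts)
    ratios = [c / total if total else 0.0 for c in counts]
    speeds = [ratios[i] - ratios[i - 1] for i in range(1, len(ratios))]
    return ratios, speeds


# Invented counts for one area over the five collection days.
counts = [10, 10, 20, 30, 30]
ratios, speeds = temporal_features(counts)
```

Normalising by the area's own total makes busy and quiet areas comparable, so the second-level clustering groups areas by the shape of their activity rather than by its volume.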
  • 24. Second Level Clustering: Areas from 2 clusters ● Speed feature ● Red cluster: Scotland, Liverpool and Ireland, mostly unaffected ● Purple cluster: Midlands, affected ● Red speed decreases ● Purple speed increases ● Verification with historical data [map legend: cluster 1, cluster 2]
  • 25. The INSIGHT project: Intelligent Synthesis and Real-time Response using Massive Streaming of Heterogeneous Data ● Detecting Events: - sensors on road network - sensors on buses - Twitter data ● http://www.insight-ict.eu/
  • 26. Conclusions ● Analysis on Twitter data ○ emergencies, disaster management & relief ● Experimental analysis on floods ○ establishment of “flood related lexicon” ○ division of the entire UK into affected areas ○ identification of flood-stricken areas with high accuracy ● Comparison with ground truth data ○ quality evaluation
  • 27. Future Work ● Collect more data of similar flooding events and test our approach on larger datasets ○ generalize to other areas ○ test with a longer timespan ● Develop online clustering approaches (1ier) ● Incorporate into the INSIGHT tool
  • 28. Thank you! Acknowledgements: MMD - Mining Mobility Data; INSIGHT - Intelligent Synthesis and Real-time Response using Massive Streaming of Heterogeneous Data