Multi-Scale and Multi-Modal Streaming Data Aggregation and Processing for Decision Support during Natural Disasters

rebrand.ly/HazardSEE
DisasterRecord: Disaster Response and Relief Coordination Pipeline
Shruti Kar
M.S. Thesis Defense
COMMITTEE MEMBERS:
Dr. Krishnaprasad Thirunarayan (ADVISOR)
Dr. Amit Sheth
Dr. Valerie L. Shalin
MENTOR:
Hussein S. Al-Olimat
Fall 2018

Problem Definition
Given a disaster event with a known spatial context, can we extract
knowledge from multimodal data (targeted crowd-sourced data, the Twitter
stream, satellite imagery and pre-disaster data) and tie it to locations
on the map to provide decision support for real-time situational
awareness?
Part of this work was published in ACM SIGSPATIAL ARIC 2018 Workshop.
Shruti Kar, Hussein S. Al-Olimat, Krishnaprasad Thirunarayan, Valerie Shalin, Amit Sheth, and Srinivasan Parthasarathy. D-record: Disaster
response and relief coordination pipeline. In Proceedings of the ACM SIGSPATIAL International Workshop on Advances in Resilient and
Intelligent Cities (ARIC 2018). Association for Computing Machinery, 2018.

Location-centric Ontology
Use-Case
Eg: “I am stuck at ‘XYZ’.. Help me!”
Need Location OSM Locations
Need Matching
Need Classification Location Affordability
Need type? Location type?
Flood Mapping

Data Sources
Data
● Streaming and crowd-sourced data
    ■ Twitter
        Training set: CrisisLexT26, CrisisNLP
        Twitris collection: Chennai Flood in 2015, Houston Flood in 2016
    ■ Excel sheets from volunteers (http://rebrand.ly/reliefspreadsheet)
● Satellite imagery
● Pre-disaster data

Data Sources - Streaming data
● Twitter Data
■ LOCATIONS - Extract location names from text (LNEx)
■ NEEDS - Extract need types from text

Disaster Text Classification - Related Literature
▪ Olteanu et al., 2015
▪ CREES
▪ Cameron et al., 2012 - Binary classifier for infrastructure damage

Location Name Extraction
● User profile location vs. location name mentioned?
● Location names mentioned in unstructured text streams
● Intertwine with geo-coordinates
LOCATIONS
(github.com/halolimat/LNEx)
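
To make the idea of spotting location names in text and tying them to coordinates concrete, here is a deliberately naive sketch. It is not LNEx and does not use the LNEx API: it only looks up word n-grams in a small hand-made gazetteer. The gazetteer entries, the placeholder coordinates and the function name `extract_locations` are all assumptions for illustration.

```python
import re

# Hypothetical mini-gazetteer (name -> (lat, lon)); the coordinates are
# placeholders, not real survey data. A real system would build this from
# OpenStreetMap for the known disaster area.
GAZETTEER = {
    "kindred hospital": (29.70, -95.40),
    "houston": (29.76, -95.37),
}

def extract_locations(text, gazetteer, max_len=3):
    """Return (name, coordinates) pairs for gazetteer names found in text."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    found = []
    i = 0
    while i < len(tokens):
        match = None
        # Prefer the longest n-gram that starts at position i.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in gazetteer:
                match = (candidate, gazetteer[candidate], n)
                break
        if match:
            found.append(match[:2])
            i += match[2]  # skip the tokens consumed by the match
        else:
            i += 1
    return found

mentions = extract_locations(
    "I am stuck near Kindred Hospital in Houston, help me!", GAZETTEER
)
```

Exact lookup like this misses abbreviations, misspellings and ambiguous names, which is the gap a dedicated extractor is meant to close.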

Need Classification
Our Need Class               CrisisLexT26, CrisisNLP Classes
Shelter/Food/Supplies Need   Donation_needs_or_offers_or_volunteering_services, displaced_people_and_evacuations
Medical/Rescue Help Need     Missing_trapped_or_found_people, deaths_reports, injured_or_dead_people, affected_people
Twitter training set: CrisisLexT26, CrisisNLP
Twitris collection: Chennai Flood in 2015, Houston Flood in 2016

Need Classification
● Preprocessing - Stemming, case folding and removal of noise.
● Semantics - Feature Engineering
■ TF-IDF vectors
■ Gensim’s word2vec embeddings
● Class Imbalance - SVM-SMOTE synthetically oversampled the minority class
● Gradient Boosting Classifier
Feature set          Without SVM-SMOTE      With SVM-SMOTE
                     RF     SVM    GB       RF     SVM    GB
tf-idf               0.50   0.57   0.58     0.55   0.61   0.63
word2vec             0.53   0.59   0.60     0.58   0.66   0.68
tf-idf + word2vec    0.60   0.64   0.71     0.67   0.72   0.77
(RF = Random Forest, SVM = Support Vector Machine, GB = Gradient Boosting)
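
The class-imbalance step can be illustrated with a simplified SMOTE-style interpolation. This is a sketch only: it implements plain SMOTE interpolation between a minority sample and one of its nearest minority neighbours. It is not SVM-SMOTE itself, which additionally picks its seed points from minority samples close to an SVM decision boundary. The function name, the parameter `k` and the toy vectors are assumptions for illustration.

```python
import random

def smote_like_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between minority samples.

    Requires at least two minority samples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (excluding itself).
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        other = rng.choice(neighbours)
        lam = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([a + lam * (b - a) for a, b in zip(base, other)])
    return synthetic

# Toy minority-class feature vectors (hypothetical values).
minority_vectors = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
new_points = smote_like_oversample(minority_vectors, n_new=4)
```

Each synthetic point lies on the segment between two real minority points, so the oversampled class stays inside the region the original samples already cover.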

Location-Centric Ontology
Event Ontology &
Situational Data
Need Classification
OSM Location
Affordability

Data Sources - Crowd Sourced data
● Excel sheets from volunteers

Data Sources - Pre-Disaster
Houston
● OpenStreetMap Location features

Location-centric Ontological Modeling
▪ Matching Need and availability
▪ Event Classification
Event Ontology &
Situational Data
Seekers / Providers
Need Classification
Matching Need
and Availability

▪ Matching Need and availability - Relationships with Location features
▪ Event Classification - Concepts with Lexicon based features
Event Ontology &
Situational Data
Seekers / Providers
Need Classification
Matching Need
and Availability
Concepts with Lexicon
features
Location features

▪ Matching Need and availability - Relationships with Location features
▪ Event Classification - Concepts with Lexicon based features
Event Ontology &
Situational Data
Eg: “I broke my leg.. Help me!” - Medical Need (from Lexicon features)
Kindred Hospital - Medical Help Availability (from Location features)

Disaster-centric Ontologies - Related Literature
Competency Questions
▪ Coverage - depth and breadth
▪ Spatial details
▪ Temporal details
▪ Thematic details
▪ Response details
▪ CrisisNLP
▪ CrisisLex
▪ CREES
▪ MOAC - the Management of Crisis Vocabulary
▪ HXL - Humanitarian eXchange Language
▪ SOCC - SOCS crisis ontology
▪ EDXL-RESCUER
▪ SMEM - Social Media and Emergency Management
▪ Wang et al., 2014

CrisisNLP, CrisisLex, CREES - Lexicons for classification
▪ Spatial details

▪ Spatial details
MOAC - the Management of Crisis Vocabulary
Answers - WHO? WHAT? WHERE?
▪ Spatial details
HXL - Humanitarian eXchange Language
Framework for Interoperability

▪ Spatial details
SOCC - SOCS crisis ontology
Extended HXL and MOAC
▪ Spatial details
EDXL-RESCUER
Data exchange with legacy systems

▪ Spatial details
SMEM - Social Media and Emergency Management
Combines crisis domain with social media
▪ Spatial details
Wang et al., 2014 - Lacks hierarchical relationships between concepts

▪ Uses information from past disasters and risk reports from ACAPS.
▪ Distinguishes between the response and relief phases of a disaster
▪ Concepts
□ Needs - Topic modeling
    □ Shelter/Food/Supplies
    □ Medical/Rescue Help
□ Availability - OpenStreetMap features
▪ Relevance score of each term for its Need class - used during text classification

▪ Topic Modeling of Concepts of Interest:
Shelter/Food/Supplies: relief, donate, food, fund, assist, shelter, evacuate, volunteer, money, blood, contribute
Medical/Rescue Help: death, injure, missing, collapse, trapped, rubble, police, found, search, killed, blast
▪ Collected, cleaned and processed data.
▪ Latent Dirichlet Allocation (LDA) - A probabilistic topic model
▪ p(topic t | document d) × p(word w | topic t)
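
The product p(topic t | document d) × p(word w | topic t) is the per-topic term that LDA sums over all topics to obtain the probability of a word in a document. A small worked sketch of that sum follows. The topic names and all probabilities are made-up toy values, not output of the trained model, and `word_probability` is a hypothetical helper.

```python
def word_probability(word, doc_topics, topic_words):
    """p(word | doc) = sum over topics of p(topic | doc) * p(word | topic)."""
    return sum(
        p_topic * topic_words[topic].get(word, 0.0)
        for topic, p_topic in doc_topics.items()
    )

# Toy distributions (hypothetical values for illustration only).
doc_topics = {"relief": 0.75, "rescue": 0.25}          # p(topic | document)
topic_words = {                                         # p(word | topic)
    "relief": {"donate": 0.02, "food": 0.01},
    "rescue": {"trapped": 0.04, "food": 0.002},
}

# 0.75 * 0.01 + 0.25 * 0.002 = 0.008
p_food = word_probability("food", doc_topics, topic_words)
```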

▪ Topic Modeling of Concepts of Interest:
Completely Justified Quite Justified To be reviewed Remove
rubble Collapse help deadli
police Rescue baby control
army Team old find
relief Caution water emergency
power hospital response
devast blast benefit
aid clean force
evacuation loss factory

▪ Relevance Probability:
‘0.028*“relief” + 0.023*“donat” + 0.012*“aid” + 0.007*“rescu” + 0.007*“food” + 0.006*“send” +
0.005*“fund” + 0.005*“assist” + 0.005*“provid” + 0.004*“shelter” + 0.004*“evacu” + 0.004*“armi” +
0.004*“team” + 0.003*“free” + 0.003*“money” + 0.003*“volunt” + 0.002*“blood” + 0.002*“devast” +
0.002*“contribut” + 0.001*“humanitarian” + 0.001*“taxi” + 0.001*“chariti” + 0.001*“offer” +
0.001*“truck” + 0.001*“materi” + 0.001*“cloth” + 0.001*“import” + 0.001*“commun” + 0.001*“collect”
+ 0.001*“center” + 0.001*“rice” + 0.001*“avail” + 0.001*“polic”’
‘1.659*“relief” + 1.537*“donat” + 1.268*“aid” + 1.146*“rescu” + 1.146*“food” + 1.122*“send” +
1.098*“fund” + 1.098*“assist” + 1.098*“provid” + 1.073*“shelter” + 1.073*“evacu” + 1.073*“armi” +
1.073*“team” + 1.049*“free” + 1.049*“money” + 1.049*“volunt” + 1.024*“blood” + 1.024*“devast” +
1.024*“contribut” + 1*“humanitarian” + 1*“taxi” + 1*“chariti” + 1*“offer” + 1*“truck” + 1*“materi” +
1*“cloth” + 1*“import” + 1*“commun” + 1*“collect” + 1*“center” + 1*“rice” + 1*“avail” + 1*“polic”’
Normalized Relevance Score
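
The normalization formula is not stated, but the two lists above are numerically consistent with score = 1 + (w - 0.001) / 0.041, where 0.001 is the smallest weight shown. The divisor 0.041 is inferred from the numbers and is an assumption (it may be the weight range across all topics). The sketch below reproduces the listed values under that assumption; `normalize`, `W_MIN` and `W_RANGE` are hypothetical names.

```python
W_MIN = 0.001    # smallest raw weight in the list above
W_RANGE = 0.041  # inferred divisor, NOT stated in the source

def normalize(weight, w_min=W_MIN, w_range=W_RANGE):
    """Shifted min-max style scaling, so the least relevant term scores 1."""
    return round(1 + (weight - w_min) / w_range, 3)

# Raw weights copied from the relevance-probability list.
raw = {"relief": 0.028, "donat": 0.023, "aid": 0.012, "rescu": 0.007, "rice": 0.001}
normalized = {term: normalize(w) for term, w in raw.items()}
```

Under this reading every listed term keeps a score of at least 1, so a term's presence always counts and more relevant terms count more.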

Usage in text classification
▪ Concepts
□ Availability - OpenStreetMap features

Need Classification Results
Feature set                                   Without SVM-SMOTE      With SVM-SMOTE
                                              RF     SVM    GB       RF     SVM    GB
tf-idf                                        0.50   0.57   0.58     0.55   0.61   0.63
word2vec                                      0.53   0.59   0.60     0.58   0.66   0.68
tf-idf + word2vec                             0.60   0.64   0.71     0.67   0.72   0.77
tf-idf + word2vec + lexicon based features    0.69   0.71   0.74     0.71   0.80   0.81
(RF = Random Forest, SVM = Support Vector Machine, GB = Gradient Boosting)
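
One plausible way to turn the lexicon into classifier features is to sum, per need class, the normalized relevance scores of the terms that occur in a tweet. This is a sketch under that assumption, not the thesis implementation. The Shelter/Food/Supplies scores are copied from the normalized list in this deck; the Medical/Rescue Help scores are placeholders, since the deck does not list them.

```python
# Lexicon: need class -> {stemmed term: relevance score}.
LEXICON = {
    "Shelter/Food/Supplies": {
        "relief": 1.659, "donat": 1.537, "food": 1.146, "shelter": 1.073,
    },
    # Placeholder scores (assumed); only the terms come from the topic lists.
    "Medical/Rescue Help": {"trapped": 1.0, "injure": 1.0, "missing": 1.0},
}

def lexicon_features(stemmed_tokens, lexicon=LEXICON):
    """Return one summed relevance score per need class, in lexicon order."""
    return [
        sum(scores.get(token, 0.0) for token in stemmed_tokens)
        for scores in lexicon.values()
    ]

features = lexicon_features(["need", "food", "shelter"])
```

The resulting short vector can be concatenated with the tf-idf and word2vec vectors of the same tweet before training.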

OSM Location Affordability Encoding
▪ Concepts
features

▪ Spatial details

Need Matching
Needs Classified Location Affordance

Need Matching - Related Literature
▪ Purohit et al., 2014 - Uses hand-crafted rules to match “Seekers” and “Suppliers”.
▪ Currion et al., 2007 - Uses structured input from users and keeps moderators in the
loop to match.
▪ Palomares et al., 2015 - Quantifies the degree of impact of a disaster and prioritizes the
matching accordingly.
▪ Murali et al., 2016 - Provides online and offline solutions for requesting, providing and
coordinating resources.
▪ Limitations:
□ Faulty assumption that all routes are available for matching during a disaster.
□ The matching problem is solved at a different level of granularity.

Need Matching
Seekers / Providers
Need Classification
+
Need
Concepts
Location
Affordability
+
Need Locations Available Locations
Flood Mapping
Need Matching

Data Sources - Satellite Imagery
● Flood mapping
Liang et al. "Human-Guided Flood Mapping: From Experts to the Crowd." WWW 2018.

Flood Mapping
▪ Liang et al., 2018 - Interpolates satellite imagery onto a map to determine flooded
geo-coordinates
□ Eliminate flooded OSM Locations
□ Prune away flooded/closed routes
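
The two pruning steps above can be sketched on a toy road graph. This is an illustration under simplifying assumptions: flooding is modelled on nodes (locations or intersections) rather than on road segments, all roads have equal cost, and the graph, node names and function names are hypothetical.

```python
from collections import deque

def prune_flooded(graph, flooded):
    """Drop flooded nodes and every road that leads into them."""
    return {
        node: [nbr for nbr in nbrs if nbr not in flooded]
        for node, nbrs in graph.items()
        if node not in flooded
    }

def shortest_route(graph, start, goal):
    """Breadth-first search; returns the node list of a shortest route or None."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for nbr in graph.get(path[-1], []):
            if nbr not in seen:
                seen.add(nbr)
                queue.append(path + [nbr])
    return None

# Toy road network as an adjacency list (hypothetical).
roads = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A", "D"], "D": ["B", "C"]}
route = shortest_route(prune_flooded(roads, {"B"}), "A", "D")
```

If every route to the goal is flooded the search returns `None`, which signals that this provider should not be offered as a match.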

Data Visualization - Related Literature
▪ Wang et al., 2014 - News report
▪ Cameron et al., 2012 - Infrastructure needs

Data Visualization - Related Literature
▪ MacEachren et al., 2011 - Places geo-tagged tweets on a map; heatmap of tweet
frequency (spatially); ranked, sorted relevant tweets
▪ Neis et al., 2010 - User’s structured input.
▪ Júnior et al., 2015 - Showed effectiveness of layered visualization systems.

Data Visualization
▪ Handles the requirements of different users.
□ Community-Level users
□ Individual-Level users

Data Visualization
▪ Community-Level of Analysis
□ Location-specific textual data
    □ Categorizing tweets into need types
    □ Aggregating tweets with respect to location vicinity
Data Visualization
▪ Community-Level of Analysis
□ Location-specific image features
    □ Filtering for flooded images
    □ Detecting objects of interest
□ Location-specific available help
□ Location-specific thematic profile

Data Visualization
▪ Individual-Level of Analysis
□ Flooded areas in their vicinity
□ Available help around their location that matches their needs
□ Route guidance along non-flooded routes for relief workers and
individuals seeking help.

Results and Evaluation
Datasets                         Chennai Flood 2015       Houston Flood 2016
Original number of tweets        169,838                  415,057
Locations extracted with LNEx    85,564 (23,401 needs)    241,684 (60,421 needs)
OSM-featured Location            1,395                    2,826

Conclusion and Future Work
● Our multi-scale and multi-modal streaming data aggregation and processing system supports
individual and aggregated level analysis for better-informed decision support during natural
disasters.
● A domain-specific location-centric event ontology is crucial for situation awareness and
disaster response.
● DisasterRecord is a modularized pipeline that takes multi-modal data as input and
demonstrates need-offer matching.
● A finer-grained classifier can be designed for more flexible and specific matching.
● Other background knowledge, such as weather data and storm surge models, can help in
preparedness in addition to response.
● Online and offline infrastructure support can be provided.
● Studying geo-tagged locations in sync with location name mentions in the text to infer trust.
● Weighting the edges of the road network with flood mapping to reduce ETAs while solving the
matching problem.

References
● Muhammad Imran, Prasenjit Mitra, and Carlos Castillo. Twitter as a lifeline: Human-annotated twitter corpora for nlp of crisis-related messages. arXiv preprint
arXiv:1605.05894, 2016.
● SOCS:Social Media Enhanced Organizational Sensemaking in Emergency Response. http://knoesis.org/projects/socs. Accessed: 2018-02-22.
● Hussein S. Al-Olimat, Krishnaprasad Thirunarayan, Valerie Shalin, and Amit Sheth. Location name extraction from targeted text streams using gazetteer-based
statistical language models. In Proceedings of the 27th International Conference on Computational Linguistics, pages 1986–1997. Association for Computational
Linguistics, 2018.
● Rebeca Barros, Pedro Kislansky, Laís Salvador, Reinaldo Almeida, Matthias Breyer, and Laia Gasparin Pedraza. Edxl-rescuer ontology: Conceptual model for semantic
integration. In ISCRAM, 2015.
● Shreyansh P Bhatt, Hemant Purohit, Andrew Hampton, Valerie Shalin, Amit Sheth, and John Flach. Assisting coordination during crisis: a domain ontology based
approach to infer resource needs from tweets. In Proceedings of the 2014 ACM conference on Web science, pages 297–298. ACM, 2014.
● Paul Currion, Chamindra de Silva, and Bartel Van de Walle. Open source software for disaster management. Communications of the ACM, 50(3):61–65, 2007.
● Zhi-Hua Hu. A container multimodal transportation scheduling approach based on immune affinity model for emergency relief. Expert Systems with Applications,
38(3):2632–2639, 2011.
● Paulo Simões Júnior, Renato Novais, Vaninha Vieira, Laia G Pedraza, Manoel Mendonça, and Karina Villela. Visualization mechanisms for crowdsourcing information
in emergency coordination. In Proceedings of the 14th Brazilian Symposium on Human Factors in Computing Systems, page 35. ACM, 2015.
● Carsten Keßler and Chad Hendrix. The humanitarian exchange language: coordinating disaster response with semantic web technologies. Semantic Web, 6(1):5–21, 2015.
● Jiongqian Liang, Peter Jacobs, and Srinivasan Parthasarathy. Human-guided flood mapping: From experts to the crowd. In WWW 2018, pages 291–298, 2018.
● Alan M MacEachren, Anuj Jaiswal, Anthony C Robinson, Scott Pezanowski, Alexander Savelyev, Prasenjit Mitra, Xiao Zhang, and Justine Blanford. Senseplace2:
Geotwitter analytics support for situational awareness. In Visual analytics science and technology (VAST), 2011 IEEE conference on, pages 181–190. IEEE, 2011.
● Smriti Murali, V Krishnapriya, and Aadhiya Thomas. Crowdsourcing for disaster relief: A multi-platform model. In Distributed Computing, VLSI, Electrical Circuits and
Robotics (DISCOVER), IEEE, pages 264–268. IEEE, 2016.
● Pascal Neis, Peter Singler, and Alexander Zipf. Collaborative mapping and emergency routing for disaster logistics–case studies from the haiti earthquake and the UN
Portal for Afrika. na, 2010.
● Alexandra Olteanu, Sarah Vieweg, and Carlos Castillo. What to expect when the unexpected happens: Social media communications across crises. In Proceedings of
the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, pages 994–1009. ACM, 2015.
● Iván Palomares, Leo Galway, Martin Haran, Martijn Neef, Conor Woods, and Hui Wang. A knowledge management and need-capacity matching approach
for community-based disaster management and recovery. In ISK, pages 389–396. IEEE, 2015.
● Hemant Purohit, Andrew Hampton, Shreyansh Bhatt, Valerie L Shalin, Amit P Sheth, and John M Flach. Identifying seekers and suppliers in social media communities
to support crisis coordination. CSCW, 23(4-6):513–545, 2014.
● Hemant Purohit, Nikhita Vedula, Krishnaprasad Thirunarayan, and Srinivasan Parthasarathy. Transportation uncertainty in matching help seekers and
suppliers during disasters. In First SIGIR Workshop on Intelligent Transportation Informatics. ACM, 2018.

Resources
● Paper - http://knoesis.org/node/2915
● Video - https://goo.gl/wNp3S2
● DisasterRecord wiki page - http://wiki.knoesis.org/index.php/DisasterRecord
● Github repo - https://github.com/shrutikar/DisasterRecord
● Ontology link - https://goo.gl/k344EH

Acknowledgement
TK. Prasad
Hussein Al-Olimat
Amit Sheth
Valerie Shalin
Srinivasan Parthasarathy

Acknowledgement
Team Members of DisasterRecord :
○ Michael Partin, MSc Student of Computer Engineering
○ Dipesh Kadariya, MSc Student of Computer Science
○ Hussein Al-Olimat, PhD Student of Computer Science
Jeremy Brunn for helping with UI design
Alan Smith for collaboration to make the tool real-time.
TO THE ENTIRE KNOESIS FAMILY
Our collaborators Jiongqian (Albert) Liang, Jiayong Liang (Jay), Desheng Liu, and Nikhita Vedula
from Ohio State University for providing flood mapping data.
This research was supported by the NSF award EAR-1520870 “Hazards SEES: Social and Physical
Sensing Enabled Decision Support for Disaster Management and Response”. All views are those of
the researchers and do not necessarily reflect the views of the sponsor.
