SlideShare a Scribd company logo
1 of 41
Download to read offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Machine Learning for Improving Disaster Management and Response
Doina Caragea
Professor of Computer Science
Kansas State University
W P S 3 1 3
Sanjay Padhi
AWS Research Initiatives, WWPS
Amazon Web Services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Acknowledgements
BIGDATA: IA: Collaborative Research: Domain Adaptation
Approaches for Classifying Crisis Related Data on Social Media
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deadly disasters happen all the time
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Year 2017―the costliest for natural disasters in US
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
9-1-1 lines can be overwhelmed
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Social media to the rescue
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Responders faced with information overload
Direct Twitter search: noisy, non-
relevant tweets retrieved
Keyword-based search:
“harvey hurricane”, #harveyhurricane
Location-based search
postings containing geographical
coordinates inside the affected areas
Manual selection: time consuming
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Streaming
Crisis Data
(Twitter)
Human
Analyst
Amazon Machine
Learning
(Amazon ML)
Historical Crisis Data and Models
Crisis Affected
Community
Response
Organizations
Unlabeled
Unlabeled
Unlabeled
Labeled
Labeled
Labeled
911
Dispatcher
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Methodology
Data Collection
JSON tweets
Data Extraction
Tweet id, create time, text
Data Processing
Stop words, special
characters, URLs, Emails
Topics Modeling
Streaming Corpus
Latent Dirichlet Allocation
Analysis
Preparedness, During
Hurricane, Aftermath
Hurricane timeline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
• Use Twitter Streaming API to crawl tweets posted during crisis events
• Parse the tweet JSON objects to extract tweet text, hashtags, media
information, user information, and geo-location (when available)
• Perform text classification, natural language processing and text analytics on
the tweet text
Data collection and analysis
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Tweet classification
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Classes of machine learning algorithms
Supervised learning [Imran et al., 2013; Ashktorab et al., 2014; Caragea et al., 2014; Imran et al.,
2018]
• Labeled tweets needed, but not readily available for an emergent disaster
Domain adaptation [Li et al., 2015; Li et al., 2017, Alam et al., 2018, Mazloom et al., 2018]
• Knowledge from a prior source disaster is transferred to a target disaster
Unsupervised learning, e.g., topic modeling [Resch et al., 2017]
• Topic modeling can help associate topics/categories with tweets
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Large road signs down over I-37 near Corpus. #ksatwx #harvey
#HurricaneHarvey At least one dead in Texas, more casualties feared
Currently stuck on Monroe.... R.I.P my truck... #HurricaneHarvey
I’ve got a water stain the size of Texas on my shirt so that’s cool
10/30-11/2 Water Infrastructure Conference happening in #Houston
Relevant
Relevant
Relevant
Irrelevant
Irrelevant
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Relevant
Irrelevant
Classifier
Large road signs down over I-37 near Corpus. #ksatwx #harvey
#HurricaneHarvey At least one dead in Texas, more casualties feared
Currently stuck on Monroe.... R.I.P my truck... #HurricaneHarvey
I’ve got a water stain the size of Texas on my shirt so that’s cool
10/30-11/2 Water Infrastructure Conference happening in #Houston
Relevant
Relevant
Relevant
Irrelevant
Irrelevant
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Classes of machine learning algorithms
Supervised learning [Imran et al., 2013; Ashktorab et al., 2014; Caragea et al., 2014; Imran et al.,
2018]
• Labeled tweets needed, but not readily available for an emergent disaster
Domain adaptation [Li et al., 2015; Li et al., 2017, Alam et al., 2018, Mazloom et al., 2018]
• Knowledge from a prior source disaster is transferred to a target disaster
Unsupervised learning, e.g., topic modeling [Resch et al., 2017]
• Topic modeling can help associate topics/categories with tweets
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Labeled Source Data
Unlabeled Target Data
Classifier for Target
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Classes of machine learning algorithms
Supervised learning [Imran et al., 2013; Ashktorab et al., 2014; Caragea et al., 2014; Imran et al.,
2018]
• Labeled tweets needed, but not readily available for an emergent disaster
Domain adaptation [Li et al., 2015; Li et al., 2017, Alam et al., 2018, Mazloom et al., 2018]
• Knowledge from a prior source disaster is transferred to a target disaster
Unsupervised learning, e.g., topic modeling [Resch et al., 2017]
• Topic modeling can help associate topics/categories with tweets
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Topic modeling using Latent Dirichlet Allocation (LDA)
A Document is a Mixture of Topics
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Topic modeling using Latent Dirichlet Allocation (LDA)
LDA finds topics in a collection of documents/tweets LDA tags each document/tweet with one or more topics
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Damage Donations Recovery Storm Utilities
Document: topic distribution
0
0.005
0.01
0.015
0.02
0.025
0.03
0.035
area flood heavy morning thursday tropical wind
Topic: word distribution
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Comprehend workflow
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
How can Amazon Comprehend help disaster response?
Identify the language of a tweet
Identify popular key phrases to facilitate keyword search
Identify entities in actionable tweets
• Determine: who (person, organization), where (location), when (date), etc.
Identify the sentiment of the tweets to enhance situational awareness
Build custom classifiers to categorize tweets
Identify topics in a collection of tweets to categorize tweets, find trends and patterns
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
“We're at 11115 Sageview in Houston. Water is swallowing us up. Please
try calling 911 for rescue. Please send help”
“People dying, widespread destruction #HurricaneMaria #HurricaineIrma
and all top stories are @realDonaldTrump. Sad. Very sad. Disappointing”
“Awful seeing whats going on in the Caribbean. THE most resolute people
but these countries wont recover w/o sufficient aid #HurricaneMaria”
“Escaping Hurricane Irma and driving to South Bend with @emhen10 and
our 3 furry kids... #GoIrish #BeatBulldogs https://t.co/DFc0LeaOIt”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Topic modeling
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Comprehend topic modeling output
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Example: topics
Hurricane degenerates Hurricane regenerates
Harvey degenerates into tropical wave
RT Hurricane Hunter plane finds that the
remnants of Harvey have not regenerated
into tropical cyclone
RT Harvey showing signs of regeneration over western Caribbean Sea
amp will produce heavy rainfall storms this week hit
RT Active tropics continue redvlpmt of Harvey likely as well as
the potential for two new tropical cyclones next days on
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Comprehend custom classifiers
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Comprehend custom classifiers - Output
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Using Amazon Comprehend results to get aggregate statistics
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Using Amazon Comprehend results to determine frequent entities
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Using locations identified by Amazon Comprehend to track hurricane path
Source: Wikipedia
Source: Weather Channel
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Using locations identified by Amazon Comprehend to track hurricane
path
Source: Weather Channel
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Doina Caragea – dcaragea@ksu.edu
Sanjay Padhi – sanpadhi@amazon.com
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

More Related Content

What's hot

Uttrakhand disaster
Uttrakhand disasterUttrakhand disaster
Uttrakhand disaster
Indira Kundu
 

What's hot (20)

Matrix Factorization
Matrix FactorizationMatrix Factorization
Matrix Factorization
 
Vulnerability analysis and experience of vulnerability in india
Vulnerability analysis and experience of vulnerability in indiaVulnerability analysis and experience of vulnerability in india
Vulnerability analysis and experience of vulnerability in india
 
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioSummarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
 
Uttrakhand disaster
Uttrakhand disasterUttrakhand disaster
Uttrakhand disaster
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Captcha
CaptchaCaptcha
Captcha
 
Image Classification Data (Fashion-MNIST)
Image Classification Data (Fashion-MNIST)Image Classification Data (Fashion-MNIST)
Image Classification Data (Fashion-MNIST)
 
CS6010 Social Network Analysis Unit III
CS6010 Social Network Analysis   Unit IIICS6010 Social Network Analysis   Unit III
CS6010 Social Network Analysis Unit III
 
Probabilistic Programming: Why, What, How, When?
Probabilistic Programming: Why, What, How, When?Probabilistic Programming: Why, What, How, When?
Probabilistic Programming: Why, What, How, When?
 
Cloud Adoption in the Enterprise
Cloud Adoption in the EnterpriseCloud Adoption in the Enterprise
Cloud Adoption in the Enterprise
 
Cyclone
CycloneCyclone
Cyclone
 
Disaster management flood
Disaster management  flood Disaster management  flood
Disaster management flood
 
Microservices Workshop - Craft Conference
Microservices Workshop - Craft ConferenceMicroservices Workshop - Craft Conference
Microservices Workshop - Craft Conference
 
AI Algorithms
AI AlgorithmsAI Algorithms
AI Algorithms
 
Adversarial Attacks on A.I. Systems — NextCon, Jan 2019
Adversarial Attacks on A.I. Systems — NextCon, Jan 2019Adversarial Attacks on A.I. Systems — NextCon, Jan 2019
Adversarial Attacks on A.I. Systems — NextCon, Jan 2019
 
Indian ocean tsunami case study
Indian ocean tsunami case studyIndian ocean tsunami case study
Indian ocean tsunami case study
 
Futures for the Web - Reclaim 2023.pptx
Futures for the Web - Reclaim 2023.pptxFutures for the Web - Reclaim 2023.pptx
Futures for the Web - Reclaim 2023.pptx
 
Web application development - The past, the present, the future
Web application development - The past, the present, the futureWeb application development - The past, the present, the future
Web application development - The past, the present, the future
 
Cloud Computing Fundamentals
Cloud Computing FundamentalsCloud Computing Fundamentals
Cloud Computing Fundamentals
 
Natural Disasters
Natural DisastersNatural Disasters
Natural Disasters
 

Similar to Machine Learning for Improving Disaster Management and Response (WPS313) - AWS re:Invent 2018

Similar to Machine Learning for Improving Disaster Management and Response (WPS313) - AWS re:Invent 2018 (20)

Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
 
Introduction to AWS ML Application Services - BDA202 - Toronto AWS Summit
Introduction to AWS ML Application Services - BDA202 - Toronto AWS SummitIntroduction to AWS ML Application Services - BDA202 - Toronto AWS Summit
Introduction to AWS ML Application Services - BDA202 - Toronto AWS Summit
 
Container Security and Avoiding the 2 A.M. Call (CON303-R1) - AWS re:Invent 2018
Container Security and Avoiding the 2 A.M. Call (CON303-R1) - AWS re:Invent 2018Container Security and Avoiding the 2 A.M. Call (CON303-R1) - AWS re:Invent 2018
Container Security and Avoiding the 2 A.M. Call (CON303-R1) - AWS re:Invent 2018
 
[REPEAT] Better Analytics Through Natural Language Processing (AIM405-R) - AW...
[REPEAT] Better Analytics Through Natural Language Processing (AIM405-R) - AW...[REPEAT] Better Analytics Through Natural Language Processing (AIM405-R) - AW...
[REPEAT] Better Analytics Through Natural Language Processing (AIM405-R) - AW...
 
Transform the Modern Contact Center Using Machine Learning and Analytics (AIM...
Transform the Modern Contact Center Using Machine Learning and Analytics (AIM...Transform the Modern Contact Center Using Machine Learning and Analytics (AIM...
Transform the Modern Contact Center Using Machine Learning and Analytics (AIM...
 
AI Today
AI TodayAI Today
AI Today
 
Keynote: What Transformation Really Means for the Enterprise - AWS Transforma...
Keynote: What Transformation Really Means for the Enterprise - AWS Transforma...Keynote: What Transformation Really Means for the Enterprise - AWS Transforma...
Keynote: What Transformation Really Means for the Enterprise - AWS Transforma...
 
Create Advanced Text Analytics Solutions with NLP - BDA310 - Chicago AWS Summit
Create Advanced Text Analytics Solutions with NLP - BDA310 - Chicago AWS SummitCreate Advanced Text Analytics Solutions with NLP - BDA310 - Chicago AWS Summit
Create Advanced Text Analytics Solutions with NLP - BDA310 - Chicago AWS Summit
 
Enabling Sustainable Research Platforms in the Cloud
Enabling Sustainable Research Platforms in the CloudEnabling Sustainable Research Platforms in the Cloud
Enabling Sustainable Research Platforms in the Cloud
 
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
 
Going Graph With Amazon Neptune - AWS Summit Sydney 2018
Going Graph With Amazon Neptune - AWS Summit Sydney 2018Going Graph With Amazon Neptune - AWS Summit Sydney 2018
Going Graph With Amazon Neptune - AWS Summit Sydney 2018
 
How Do I Know I Need an Amazon Neptune Graph Database? (DAT316) - AWS re:Inve...
How Do I Know I Need an Amazon Neptune Graph Database? (DAT316) - AWS re:Inve...How Do I Know I Need an Amazon Neptune Graph Database? (DAT316) - AWS re:Inve...
How Do I Know I Need an Amazon Neptune Graph Database? (DAT316) - AWS re:Inve...
 
Meetup Niort Data - AWS Intelligence Artificielle
Meetup Niort Data - AWS Intelligence ArtificielleMeetup Niort Data - AWS Intelligence Artificielle
Meetup Niort Data - AWS Intelligence Artificielle
 
Security Observability: Democratizing Security in the Cloud (DEV206-S) - AWS ...
Security Observability: Democratizing Security in the Cloud (DEV206-S) - AWS ...Security Observability: Democratizing Security in the Cloud (DEV206-S) - AWS ...
Security Observability: Democratizing Security in the Cloud (DEV206-S) - AWS ...
 
Resolving NLP Problems Using Amazon SageMaker Algorithms (GPSCT305) - AWS re:...
Resolving NLP Problems Using Amazon SageMaker Algorithms (GPSCT305) - AWS re:...Resolving NLP Problems Using Amazon SageMaker Algorithms (GPSCT305) - AWS re:...
Resolving NLP Problems Using Amazon SageMaker Algorithms (GPSCT305) - AWS re:...
 
Network Foundations on AWS (GPSCT409) - AWS re:Invent 2018
Network Foundations on AWS (GPSCT409) - AWS re:Invent 2018Network Foundations on AWS (GPSCT409) - AWS re:Invent 2018
Network Foundations on AWS (GPSCT409) - AWS re:Invent 2018
 
AWS Transformation Day 2018 - Charlotte NC
AWS Transformation Day 2018 - Charlotte NCAWS Transformation Day 2018 - Charlotte NC
AWS Transformation Day 2018 - Charlotte NC
 
Deep Dive on Amazon Neptune - AWS Online Tech Talks
Deep Dive on Amazon Neptune - AWS Online Tech TalksDeep Dive on Amazon Neptune - AWS Online Tech Talks
Deep Dive on Amazon Neptune - AWS Online Tech Talks
 
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
 
From Idea to Customers: Developing Modern Cloud-Enabled Apps with AWS (MOB201...
From Idea to Customers: Developing Modern Cloud-Enabled Apps with AWS (MOB201...From Idea to Customers: Developing Modern Cloud-Enabled Apps with AWS (MOB201...
From Idea to Customers: Developing Modern Cloud-Enabled Apps with AWS (MOB201...
 

More from Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Machine Learning for Improving Disaster Management and Response (WPS313) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Machine Learning for Improving Disaster Management and Response Doina Caragea Professor of Computer Science Kansas State University W P S 3 1 3 Sanjay Padhi AWS Research Initiatives, WWPS Amazon Web Services
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Acknowledgements BIGDATA: IA: Collaborative Research: Domain Adaptation Approaches for Classifying Crisis Related Data on Social Media
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deadly disasters happen all the time
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Year 2017―the costliest for natural disasters in US
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. 9-1-1 lines can be overwhelmed
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Social media to the rescue
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Responders faced with information overload Direct Twitter search: noisy, non- relevant tweets retrieved Keyword-based search: “harvey hurricane”, #harveyhurricane Location-based search postings containing geographical coordinates inside the affected areas Manual selection: time consuming
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Streaming Crisis Data (Twitter) Human Analyst Amazon Machine Learning (Amazon ML) Historical Crisis Data and Models Crisis Affected Community Response Organizations Unlabeled Unlabeled Unlabeled Labeled Labeled Labeled 911 Dispatcher
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Methodology Data Collection JSON tweets Data Extraction Tweet id, create time, text Data Processing Stop words, special characters, URLs, Emails Topics Modeling Streaming Corpus Latent Dirichlet Allocation Analysis Preparedness, During Hurricane, Aftermath Hurricane timeline
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. • Use Twitter Streaming API to crawl tweets posted during crisis events • Parse the tweet JSON objects to extract tweet text, hashtags, media information, user information, and geo-location (when available) • Perform text classification, natural language processing and text analytics on the tweet text Data collection and analysis
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Tweet classification
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Classes of machine learning algorithms Supervised learning [Imran et al., 2013; Ashktorab et al., 2014; Caragea et al., 2014; Imran et al., 2018] • Labeled tweets needed, but not readily available for an emergent disaster Domain adaptation [Li et al., 2015; Li et al., 2017, Alam et al., 2018, Mazloom et al., 2018] • Knowledge from a prior source disaster is transferred to a target disaster Unsupervised learning, e.g., topic modeling [Resch et al., 2017] • Topic modeling can help associate topics/categories with tweets
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Large road signs down over I-37 near Corpus. #ksatwx #harvey #HurricaneHarvey At least one dead in Texas, more casualties feared Currently stuck on Monroe.... R.I.P my truck... #HurricaneHarvey I’ve got a water stain the size of Texas on my shirt so that’s cool 10/30-11/2 Water Infrastructure Conference happening in #Houston Relevant Relevant Relevant Irrelevant Irrelevant
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Relevant Irrelevant Classifier Large road signs down over I-37 near Corpus. #ksatwx #harvey #HurricaneHarvey At least one dead in Texas, more casualties feared Currently stuck on Monroe.... R.I.P my truck... #HurricaneHarvey I’ve got a water stain the size of Texas on my shirt so that’s cool 10/30-11/2 Water Infrastructure Conference happening in #Houston Relevant Relevant Relevant Irrelevant Irrelevant
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Classes of machine learning algorithms Supervised learning [Imran et al., 2013; Ashktorab et al., 2014; Caragea et al., 2014; Imran et al., 2018] • Labeled tweets needed, but not readily available for an emergent disaster Domain adaptation [Li et al., 2015; Li et al., 2017, Alam et al., 2018, Mazloom et al., 2018] • Knowledge from a prior source disaster is transferred to a target disaster Unsupervised learning, e.g., topic modeling [Resch et al., 2017] • Topic modeling can help associate topics/categories with tweets
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Labeled Source Data Unlabeled Target Data Classifier for Target
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Classes of machine learning algorithms Supervised learning [Imran et al., 2013; Ashktorab et al., 2014; Caragea et al., 2014; Imran et al., 2018] • Labeled tweets needed, but not readily available for an emergent disaster Domain adaptation [Li et al., 2015; Li et al., 2017, Alam et al., 2018, Mazloom et al., 2018] • Knowledge from a prior source disaster is transferred to a target disaster Unsupervised learning, e.g., topic modeling [Resch et al., 2017] • Topic modeling can help associate topics/categories with tweets
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Topic modeling using Latent Dirichlet Allocation (LDA) A Document is a Mixture of Topics
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Topic modeling using Latent Dirichlet Allocation (LDA) LDA finds topics in a collection of documents/tweets LDA tags each document/tweet with one or more topics 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Damage Donations Recovery Storm Utilities Document: topic distribution 0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 area flood heavy morning thursday tropical wind Topic: word distribution
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Comprehend workflow
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. How can Amazon Comprehend help disaster response? Identify the language of a tweet Identify popular key phrases to facilitate keyword search Identify entities in actionable tweets • Determine: who (person, organization), where (location), when (date), etc. Identify the sentiment of the tweets to enhance situational awareness Build custom classifiers to categorize tweets Identify topics in a collection of tweets to categorize tweets, find trends and patterns
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. “We're at 11115 Sageview in Houston. Water is swallowing us up. Please try calling 911 for rescue. Please send help” “People dying, widespread destruction #HurricaneMaria #HurricaineIrma and all top stories are @realDonaldTrump. Sad. Very sad. Disappointing” “Awful seeing whats going on in the Caribbean. THE most resolute people but these countries wont recover w/o sufficient aid #HurricaneMaria” “Escaping Hurricane Irma and driving to South Bend with @emhen10 and our 3 furry kids... #GoIrish #BeatBulldogs https://t.co/DFc0LeaOIt”
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Topic modeling
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Comprehend topic modeling output
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Example: topics Hurricane degenerates Hurricane regenerates Harvey degenerates into tropical wave RT Hurricane Hunter plane finds that the remnants of Harvey have not regenerated into tropical cyclone RT Harvey showing signs of regeneration over western Caribbean Sea amp will produce heavy rainfall storms this week hit RT Active tropics continue redvlpmt of Harvey likely as well as the potential for two new tropical cyclones next days on
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Comprehend custom classifiers
  • 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Comprehend custom classifiers - Output
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 35. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Using Amazon Comprehend results to get aggregate statistics
  • 37. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Using Amazon Comprehend results to determine frequent entities
  • 38. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Using locations identified by Amazon Comprehend to track hurricane path Source: Wikipedia Source: Weather Channel
  • 39. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Using locations identified by Amazon Comprehend to track hurricane path Source: Weather Channel
  • 40. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Doina Caragea – dcaragea@ksu.edu Sanjay Padhi – sanpadhi@amazon.com
  • 41. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.