SlideShare a Scribd company logo
1 of 4
Download to read offline
AIDR: Artificial Intelligence for Disaster Response
Muhammad Imran
Qatar Computing
Research Institute
Doha, Qatar
mimran@qf.org.qa
Carlos Castillo
Qatar Computing
Research Institute
Doha, Qatar
chato@acm.org
Ji Lucas
Qatar Computing
Research Institute
Doha, Qatar
jlucas@qf.org.qa
Patrick Meier
Qatar Computing
Research Institute
Doha, Qatar
patrick@irevolution.net
Sarah Vieweg
Qatar Computing
Research Institute
Doha, Qatar
svieweg@qf.org.qa
ABSTRACT
We present AIDR (Artificial Intelligence for Disaster Re-
sponse), a platform designed to perform automatic classi-
fication of crisis-related microblog communications. AIDR
enables humans and machines to work together to apply hu-
man intelligence to large-scale data at high speed.
The objective of AIDR is to classify messages that peo-
ple post during disasters into a set of user-defined categories
of information (e.g., “needs”, “damage”, etc.) For this pur-
pose, the system continuously ingests data from Twitter,
processes it (i.e., using machine learning classification tech-
niques) and leverages human-participation (through crowd-
sourcing) in real-time. AIDR has been successfully tested to
classify informative vs. non-informative tweets posted dur-
ing the 2013 Pakistan Earthquake. Overall, we achieved a
classification quality (measured using AUC) of 80%. AIDR
is available at http://aidr.qcri.org/.
Categories and Subject Descriptors
H.4 [Information Systems Applications]: Miscellaneous;
D.2.2 [Software Engineering]: Design Tools and Tech-
niques
Keywords
Stream processing; Crowdsourcing; Classification; Online
Machine learning
1. INTRODUCTION
Information overload during disasters can be as paralyz-
ing to humanitarian response as the absence of information.
During disasters, microblogging platforms like Twitter re-
Copyright is held by the International World Wide Web Conference Com-
mittee (IW3C2). IW3C2 reserves the right to provide a hyperlink to the
author’s site if the Material is used in electronic media.
WWW’14 Companion, April 7–11, 2014, Seoul, Korea.
ACM 978-1-4503-2745-9/14/04.
http://dx.doi.org/10.1145/2567948.2577034.
Trainer
Figure 1: AIDR: overall approach
ceive an overwhelming amount of situation-sensitive infor-
mation that people post in the form of textual messages, im-
ages, and videos. Despite the fact that social media streams
contain a significant amount of noise, much research [9, 4]
has shown that these same streams of information also in-
clude relevant, tactical information (e.g., regarding infras-
tructure damage, needs, donations). Because social media
communications provide a rich trove of information, it is pos-
sible that even a small amount of relevant information can
greatly enhance situational awareness and help responders
and other concerned parties make more informed decision.
Finding tactical and actionable information in real time
within a rapidly growing stack of information is challeng-
ing for many reasons. For instance, performing information
extraction on short bursts of text (e.g., on 140-character
tweets) is significantly more difficult than performing the
same task on large documents such as blog posts pr news
articles [6]. Moreover, research has shown that pre-trained
classifiers significantly drop in classification accuracy when
used in different but similar disasters [3]. This requires
learning and training new classifiers using fresh training data
every time a disaster strikes.
Considering the amount of information that flows on Twit-
ter, it is challenging for emergency managers and other stake-
holders to investigate each individual tweet in real-time to
look for useful information. Therefore, our goal is to lever-
age different machine learning techniques (e.g., information
classification, and extraction) to perform the job automati-
cally. Moreover, we want humans (i.e. volunteers) to label
part of the incoming data to be used for the training pur-
poses of machine learning algorithms. Above all, the whole
process must be ingesting, processing and producing only
credible information in real-time, or with low latency [5].
The rest of the paper is organized as follows: In the next
section, we describe domain challenges in crisis response. In
section 3, we present an overview of AIDR from an end-user
perspective, as well as an evaluation. Section 4 presents
AIDR’s architecture and implementation. A demonstration
storyboard is described in section 5, followed by the conclu-
sion in section 6.
2. DOMAIN CHALLENGES IN
CRISIS RESPONSE
During disasters, social media messages provide real-time
or low-latency situational awareness information that can
enable crisis responders to be more effective in their relief
efforts [8]. However, different emergency response agen-
cies are interested in different types of messages. For in-
stance, reports of damage to infrastructures should be di-
rected to some agencies, while reports about shortages of
water and/or food should be directed to others.1
Moreover, disaster response in its various types can be
applied during the various phases of disaster such as prepa-
ration, response and recovery. During each phase, disaster
responders require different information. In our previous
work [1], we observed that social media response to disasters
follows the same pattern, that is, messages posted on social
media during the early phases of a disaster talk about cau-
tion & warnings, whereas messages posted during the later
phases report infrastructure damage, casualties, donations
required or available, etc.
Below, we discuss the roles of automatic computation, hu-
man computation, and the combination of the two in the
processing of social media streams.
Role of machine intelligence: Traditional information
processing cannot be employed in this model, as disaster
responders cannot wait to collect information, and then cu-
rate and classify it offline. Instead, responders and other
stakeholders require real-time insight and intelligence as the
disaster unfolds. To this end, we aim to ingest and classify
social media streams in real-time through automated means
with the help of human intervention.
Role of human intelligence: When attempting to per-
form non-trivial tasks, machines alone are not capable of
great accuracy. Human intervention is required to verify,
teach, and/or correct the machine output [2]. Use of human
intelligence fills the gap for the tasks that cannot be auto-
mated, for example, providing input labels (i.e., for initial
training), correcting or validating the machine’s output (i.e.,
for performance optimization) are among the types of human
interventions. In AIDR, we aim to find a right balance so
that the human intelligence can be used in an effective way.
Combined intelligence: Relying solely on humans to in-
vestigate each individual message is challenging due to the
1
The United Nations organizes its agencies into clusters:
http://business.un.org/en/documents/6852.
scale of information posted on Twitter, which goes beyond
the processing capacity of humans. To this end, an au-
tomatic approach is required that can intelligently crowd-
source messages to obtain training examples when needed,
and additionally, the system should effectively use crowd-
sourcing workers both in terms of time (i.e., for volunteers)
and cost (i.e., for paid workers).
3. SYSTEMS OVERVIEW
The purpose of AIDR (Artificial Intelligence for Disaster
Response),2
is to filter and classify messages posted to social
media during humanitarian crises in real time.
Specifically, AIDR collects crisis-related messages from
Twitter3
(“tweets”), asks a crowd to label a sub-set of those
messages, and trains an automatic classifier based on the
labels. It also improves the classifier as more labels become
available. Automatic classification using pre-existing train-
ing data is not a satisfactory solution because although crises
have elements in common, they also have specific aspects
which make domain adaptation difficult. Crisis-specific la-
bels lead to higher accuracy than labels from past disas-
ters [3].
3.1 AIDR in action: end-user perspective
AIDR users begin by creating a collection process by en-
tering a set of keywords or a geographical region that will
be used to filter the Twitter stream, as shown in Figure
2(a). The user can monitor the collection status (e.g., to-
tal processed items, last processed item, time elapsed, etc.)
using dashboard as shown in Figure 2 (b). Next, a crowd
of annotators provide training examples: a system-selected
message plus a human-assigned label, as shown in Figure
2(c), which are then used to train classifiers for incoming
items, as shown in Figure 2(d).
Finally, an output of messages sorted into categories is
generated, which can be collected and used to create crisis
maps and other types of reports. An example consumer
application is the current version of CrisisTracker,4
which
uses AIDR to enable users to slice the data by categories of
interest, which vary by deployment scenario to include for
instance eyewitness accounts, reports of violence, or reports
of damage infrastructure.
3.2 Evaluation
AIDR was successfully tested during a recent earthquake
in Pakistan in 2013. We set AIDR up to collect tweets using
the hashtags (#Pakistan, #Awaran, #Balochistan, #earth-
quake, #ReliefPK) on September 25, 2013 at 20:20:09 AST5
on a request of UN Office for the Coordination of Humani-
tarian Affairs (OCHA). Within a few hours, SBTF (Standby
Task Force) 6
volunteers were asked to label whether a given
tweet was informative (i.e., if the tweet reports infrastruc-
ture damage, casualties, donation offered or needed, etc.).
They tagged about 1,000 tweets approximately within 6
hours. Though the prevalence of the negative class (“not
informative”) was high, the system was able to learn from
≈200 informative labeled tweets. In this setup, we achieved
2
http://aidr.qcri.org/
3
http://twitter.com/
4
http://ufn.virtues.fi/~jakob/yolanda/
5
Arabian Standard Time
6
http://blog.standbytaskforce.com/
(a) AIDR Collector - create collection UI
allows users to create their collections
by specifying keywords, geographical
regions, language filters etc.
(b) AIDR Collector - for a running collection
this UI shows various parameters like,
downloaded items, time period,
configuration history overtime etc.
(d) AIDR Tagger - this UI shows the
results of automatic tagging in terms
of training elements , classified
elements and quality etc.
(c) AIDR Trainer - a public crowd sourcing UI
used by volunteers to help providing labels for
tweets which are then used by the automatic
tagger for auto classification.
Figure 2: AIDR Screenshots: showing (a) collector (creation screens), (b) collector (monitoring screen), (c)
trainer, and (d) tagger.
a maximum classification quality (in terms of AUC) up to
80%. AIDR success during the initial tests was featured
by Wired UK7
on 30 September 2013, and by CBC8
on 18
December 2013.
4. ARCHITECTURE & IMPLEMENTATION
The general architecture of AIDR is shown in Figure 3.
AIDR is a free software platform that can be run as a web
application, or downloaded to create your own instance.9
It consists of three core components; collector, tagger, and
trainer. The collector performs edge adaptation [7], and is
responsible for data collection. For instance, in our current
setup it collects messages from Twitter using the Twitter
streaming API. The collected tweets are then passed to the
tagger for further processing. The tagger is responsible for
the classification of each individual tweet. The tagger is
comprised of three modules: feature extractor, learner, and
classifier. First, the feature extractor receives a tweet, it
extracts features (e.g., uni-grams and bi-grams), and passes
it to the classifier. Second, the classifier’s job is to assign
one of the user-defined categories (e.g., donations, damage,
7
http://www.webcitation.org/6N9iZuG1E
8
http://fw.to/QM0Lqnl
9
https://github.com/qcri-social/AIDR
casualties, etc.) to the tweet. To do so, the classifier uses the
learner module, which requires sufficient training examples
to learn about each user-defined category.
The training examples required by the system can be ob-
tained either using internal web-based interface or by call-
ing an external crowdsourcing platform. The former aims
at enabling the collection owner to provide trusted training
examples, whereas the latter collects training examples us-
ing public crowdsourcing with the help of volunteers. We
assume that there is a fixed budget of crowdsourcing work,
but even if that is not the case, we see this as a problem of
cost effectiveness. To ensure quality, training examples are
obtained in a way that maximizes marginal quality gains per
human label. The maximization of quality gains per label is
done by performing intelligent task generation by selecting
a small set of messages to be labeled by humans. Details
on AIDR crowdsourcing part and task generation strategies
are discussed in detail in our additional research [2].
The output of AIDR (i.e., classified tweets) can be ac-
cessed through output adapters, which are exposed as an
API. To show real-time classified items on a map or any
other visualization widget, one can use AIDR’s live stream
output adapter. Moreover, to fulfill various visualization de-
mands, AIDR includes APIs to retrieve the k-latest items or
to subscribe to a live data feed.
Tweets collector
query
Administrator UI
API API
Trainer
(PyBossa)
API
Features
extractor
Classifier
Learner Trainer
(for trusted users)
API
Crowdsourcing UI
PyBossa DB
Collector DB
Tagger DB
client
server
Output
adapters
tweets
OAuth
crowdsourcing workersadministrator
AIDR Tagger
AIDR Trainer
- Collections
- Users & roles
...
- Training data
- Test data
...
TwitterstreamingAPI
AIDR Collector
- Task info
- Workers
...
Figure 3: AIDR architecture shows Collector, Trainer, and Tagger
Implementation: AIDR comprised of a client-side and
three server-side applications. Mainly, the client-side ap-
plication has been developed using the Sencha ExtJS frame-
work10
, and the server-side implementation is developed us-
ing Java and the Springs 3.0 framework for the main applica-
tion logic. We use PyBossa for the crowdsourcing processing
purposes, and REDIS11
for the communication flows. AIDR
is an open-source platform, and its source code is available
at this repository12
.
5. DEMONSTRATION
A live demo will be presented starting from an introduc-
tion of the crisis computing domain and motivation behind
the development of AIDR platform. A guided walk-through
of the platform will be presented to introduce how different
components of AIDR work. After demonstrating how to cre-
ate collections, perform training, and enable an automatic
classification process, we ask our reader to try the tool and
create their own collection and perform classification with-
out using any knowledge of machine learning.
6. CONCLUSIONS
Social media platforms like Twitter receive an overwhelm-
ing amount of situational awareness information. For emer-
gency response, real-time disaster insights are important.
Finding actionable and tactical information in real-time poses
serious challenges. Effective coordination of human and ma-
chine intelligence can improve disaster response efforts. In
this paper, we have described AIDR, a platform to clas-
sify Twitter messages into a set of user-defined situational
awareness categories in real-time. The platform combines
human and machine intelligence to obtain labels of a subset
of messages and trains an automatic classifier to classify fur-
ther posts. The platform uses active learning approach to
select potential messages to tag, and learns continuously to
increase classification accuracy when new training examples
are available.
10
http://www.sencha.com/products/extjs/
11
http://redis.io/
12
https://github.com/qcri-social/AIDR
7. REFERENCES
[1] S. R. Chowdhury, M. Imran, M. R. Asghar,
S. Amer-Yahia, and C. Castillo. Tweet4act: Using
incident-specific profiles for classifying crisis-related
messages. In Proc. of ISCRAM, Baden-Baden,
Germany, 2013.
[2] M. Imran, C. Castillo, J. Lucas, M. Patrick, and
J. Rogstadius. Coordinating human and machine
intelligence to classify microblog communications in
crises. Proc. of ISCRAM, 2014.
[3] M. Imran, S. Elbassuoni, C. Castillo, F. Diaz, and
P. Meier. Practical extraction of disaster-relevant
information from social media. In Proc. of Workshop on
Social Media Data for Disaster Management, WWW
’13 Companion, pages 1021–1024. ACM/IW3C2, 2013.
[4] M. Imran, S. M. Elbassuoni, C. Castillo, F. Diaz, and
P. Meier. Extracting information nuggets from
disaster-related messages in social media. In Proc. of
ISCRAM, Baden-Baden, Germany, 2013.
[5] M. Imran, I. Lykourentzou, and C. Castillo.
Engineering crowdsourced stream processing systems.
arXiv preprint arXiv:1310.5463, 2013.
[6] C. Li, J. Weng, Q. He, Y. Yao, A. Datta, A. Sun, and
B.-S. Lee. Twiner: named entity recognition in targeted
twitter stream. In Proceedings of the 35th international
ACM SIGIR conference on Research and development
in information retrieval, pages 721–730. ACM, 2012.
[7] D. Turaga, H. Andrade, B. Gedik, C. Venkatramani,
O. Verscheure, J. D. Harris, J. Cox, W. Szewczyk, and
P. Jones. Design principles for developing stream
processing applications. Software: Practice and
Experience, 40(12):1073–1104, 2010.
[8] S. Vieweg. Microblogged contributions to the
emergency arena: Discovery, interpretation and
implications. In Proc. of CSCW, February 2010.
[9] S. E. Vieweg. Situational awareness in mass emergency:
A behavioral and linguistic analysis of microblogged
communications. 2012.

More Related Content

Viewers also liked (7)

A S O F 2011
A S  O F 2011A S  O F 2011
A S O F 2011
 
Salon
SalonSalon
Salon
 
45 Lessons of Life
45 Lessons of Life45 Lessons of Life
45 Lessons of Life
 
Nutricionppt
NutricionpptNutricionppt
Nutricionppt
 
Denver Airport Baggage Handling System
Denver Airport Baggage Handling SystemDenver Airport Baggage Handling System
Denver Airport Baggage Handling System
 
Materias primas
Materias primasMaterias primas
Materias primas
 
TATA Nano sales ppt
TATA Nano sales pptTATA Nano sales ppt
TATA Nano sales ppt
 

Similar to Demo 2014 aidr_artificial_intelligence_disaster_response

Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...ijtsrd
 
Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...
Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...
Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...JayaramB11
 
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...IRJET Journal
 
Classification of Disastrous Tweets on Twitter using BERT Model
Classification of Disastrous Tweets on Twitter using BERT ModelClassification of Disastrous Tweets on Twitter using BERT Model
Classification of Disastrous Tweets on Twitter using BERT ModelIRJET Journal
 
Multimodal Combination.pdf
Multimodal Combination.pdfMultimodal Combination.pdf
Multimodal Combination.pdfclientmentailai
 
Event detection in twitter using text and image fusion
Event detection in twitter using text and image fusionEvent detection in twitter using text and image fusion
Event detection in twitter using text and image fusioncsandit
 
Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...
Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...
Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...CSEIJJournal
 
EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...
EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...
EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...CSEIJJournal
 
Pestle based event detection and classification
Pestle based event detection and classificationPestle based event detection and classification
Pestle based event detection and classificationeSAT Journals
 
Controlling Anti Ragging System using Android
Controlling Anti Ragging System using AndroidControlling Anti Ragging System using Android
Controlling Anti Ragging System using AndroidIRJET Journal
 
IRJET- Competitive Analysis of Attacks on Social Media
IRJET-  	 Competitive Analysis of Attacks on Social MediaIRJET-  	 Competitive Analysis of Attacks on Social Media
IRJET- Competitive Analysis of Attacks on Social MediaIRJET Journal
 
Iaetsd real time event detection and alert system using sensors
Iaetsd real time event detection and alert system using sensorsIaetsd real time event detection and alert system using sensors
Iaetsd real time event detection and alert system using sensorsIaetsd Iaetsd
 
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...Connie White
 
Activity Streaming as Information X-Docking
Activity Streaming as Information X-DockingActivity Streaming as Information X-Docking
Activity Streaming as Information X-DockingKai Riemer
 
Empowering Cyber Threat Intelligence with AI
Empowering Cyber Threat Intelligence with AIEmpowering Cyber Threat Intelligence with AI
Empowering Cyber Threat Intelligence with AIIJCI JOURNAL
 
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...cscpconf
 

Similar to Demo 2014 aidr_artificial_intelligence_disaster_response (20)

Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...
 
Akram.pptx
Akram.pptxAkram.pptx
Akram.pptx
 
Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...
Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...
Blockchain-Based_Event_Detection_and_Trust_Verification_Using_Natural_Languag...
 
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
 
Classification of Disastrous Tweets on Twitter using BERT Model
Classification of Disastrous Tweets on Twitter using BERT ModelClassification of Disastrous Tweets on Twitter using BERT Model
Classification of Disastrous Tweets on Twitter using BERT Model
 
Multimodal Combination.pdf
Multimodal Combination.pdfMultimodal Combination.pdf
Multimodal Combination.pdf
 
RITMAN2012-kun
RITMAN2012-kunRITMAN2012-kun
RITMAN2012-kun
 
Event detection in twitter using text and image fusion
Event detection in twitter using text and image fusionEvent detection in twitter using text and image fusion
Event detection in twitter using text and image fusion
 
IJET-V3I1P3
IJET-V3I1P3IJET-V3I1P3
IJET-V3I1P3
 
O3496103
O3496103O3496103
O3496103
 
Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...
Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...
Exploratory Data Analysis and Feature Selection for Social Media Hackers Pred...
 
EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...
EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...
EXPLORATORY DATA ANALYSIS AND FEATURE SELECTION FOR SOCIAL MEDIA HACKERS PRED...
 
Pestle based event detection and classification
Pestle based event detection and classificationPestle based event detection and classification
Pestle based event detection and classification
 
Controlling Anti Ragging System using Android
Controlling Anti Ragging System using AndroidControlling Anti Ragging System using Android
Controlling Anti Ragging System using Android
 
IRJET- Competitive Analysis of Attacks on Social Media
IRJET-  	 Competitive Analysis of Attacks on Social MediaIRJET-  	 Competitive Analysis of Attacks on Social Media
IRJET- Competitive Analysis of Attacks on Social Media
 
Iaetsd real time event detection and alert system using sensors
Iaetsd real time event detection and alert system using sensorsIaetsd real time event detection and alert system using sensors
Iaetsd real time event detection and alert system using sensors
 
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
 
Activity Streaming as Information X-Docking
Activity Streaming as Information X-DockingActivity Streaming as Information X-Docking
Activity Streaming as Information X-Docking
 
Empowering Cyber Threat Intelligence with AI
Empowering Cyber Threat Intelligence with AIEmpowering Cyber Threat Intelligence with AI
Empowering Cyber Threat Intelligence with AI
 
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
 

Recently uploaded

What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'
What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'
What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'Euromotive Performance
 
如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一
如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一
如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一avy6anjnd
 
如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一
如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一
如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一opyff
 
01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc
01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc
01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.docazrfdstgdgdfh
 
JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...
JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...
JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...Excavator
 
在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一
在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一
在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一qh1ao5mm
 
Electronic Stability Program. (ESP).pptx
Electronic Stability Program. (ESP).pptxElectronic Stability Program. (ESP).pptx
Electronic Stability Program. (ESP).pptxmohamedAabdeltwab
 
一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证
一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证
一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证62qaf0hi
 
Illustrative History and Influence of Board Games - Thesis.pptx
Illustrative History and Influence of Board Games - Thesis.pptxIllustrative History and Influence of Board Games - Thesis.pptx
Illustrative History and Influence of Board Games - Thesis.pptxHenriSandoval
 
T.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptx
T.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptxT.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptx
T.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptxBernardLongboan
 
John Deere Tractors 5415 Diagnostic Repair Service Manual.pdf
John Deere Tractors 5415 Diagnostic Repair Service Manual.pdfJohn Deere Tractors 5415 Diagnostic Repair Service Manual.pdf
John Deere Tractors 5415 Diagnostic Repair Service Manual.pdfExcavator
 
SEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVE
SEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVESEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVE
SEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVEZhandosBuzheyev
 
Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...gajnagarg
 
如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一
如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一
如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一opyff
 
如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一
如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一
如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一avy6anjnd
 
John deere 7200r 7230R 7260R Problems Repair Manual
John deere 7200r 7230R 7260R Problems Repair ManualJohn deere 7200r 7230R 7260R Problems Repair Manual
John deere 7200r 7230R 7260R Problems Repair ManualExcavator
 
如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一
如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一
如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一avy6anjnd
 
如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一
如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一
如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一avy6anjnd
 
Mercedes Check Engine Light Solutions Precision Service for Peak Performance
Mercedes Check Engine Light Solutions Precision Service for Peak PerformanceMercedes Check Engine Light Solutions Precision Service for Peak Performance
Mercedes Check Engine Light Solutions Precision Service for Peak PerformanceMotronix
 

Recently uploaded (20)

What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'
What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'
What Does It Mean When Mercedes Says 'ESP Inoperative See Owner's Manual'
 
如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一
如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一
如何办理(Waterloo毕业证书)滑铁卢大学毕业证毕业证成绩单原版一比一
 
如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一
如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一
如何办理新西兰林肯大学毕业证(Lincoln毕业证书)成绩单原版一比一
 
01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc
01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc
01552_14_01306_8.0_EPS_CMP_SW_VC2_Notebook.doc
 
JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...
JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...
JOHN DEERE 7200R 7215R 7230R 7260R 7280R TECHNICAL SERVICE PDF MANUAL 2680PGS...
 
Abortion pills Dubai (+918133066128) Cytotec 200mg pills UAE Abudhabi
Abortion pills Dubai (+918133066128) Cytotec 200mg pills UAE AbudhabiAbortion pills Dubai (+918133066128) Cytotec 200mg pills UAE Abudhabi
Abortion pills Dubai (+918133066128) Cytotec 200mg pills UAE Abudhabi
 
在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一
在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一
在线定制(UBC毕业证书)英属哥伦比亚大学毕业证成绩单留信学历认证原版一比一
 
Electronic Stability Program. (ESP).pptx
Electronic Stability Program. (ESP).pptxElectronic Stability Program. (ESP).pptx
Electronic Stability Program. (ESP).pptx
 
一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证
一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证
一比一原版(Deakin毕业证书)迪肯大学毕业证成绩单留信学历认证
 
Illustrative History and Influence of Board Games - Thesis.pptx
Illustrative History and Influence of Board Games - Thesis.pptxIllustrative History and Influence of Board Games - Thesis.pptx
Illustrative History and Influence of Board Games - Thesis.pptx
 
T.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptx
T.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptxT.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptx
T.L.E 5S's (Seiri, Seiton, Seiso, Seiketsu, Shitsuke).pptx
 
John Deere Tractors 5415 Diagnostic Repair Service Manual.pdf
John Deere Tractors 5415 Diagnostic Repair Service Manual.pdfJohn Deere Tractors 5415 Diagnostic Repair Service Manual.pdf
John Deere Tractors 5415 Diagnostic Repair Service Manual.pdf
 
SEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVE
SEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVESEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVE
SEM 922 MOTOR GRADER PARTS LIST, ALL WHEEL DRIVE
 
Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In dewas [ 7014168258 ] Call Me For Genuine Models We ...
 
如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一
如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一
如何办理多伦多大学毕业证(UofT毕业证书)成绩单原版一比一
 
如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一
如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一
如何办理美国华盛顿大学毕业证(UW毕业证书)毕业证成绩单原版一比一
 
John deere 7200r 7230R 7260R Problems Repair Manual
John deere 7200r 7230R 7260R Problems Repair ManualJohn deere 7200r 7230R 7260R Problems Repair Manual
John deere 7200r 7230R 7260R Problems Repair Manual
 
如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一
如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一
如何办理伦敦商学院毕业证(LBS毕业证)毕业证成绩单原版一比一
 
如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一
如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一
如何办理(NCL毕业证书)纽卡斯尔大学毕业证毕业证成绩单原版一比一
 
Mercedes Check Engine Light Solutions Precision Service for Peak Performance
Mercedes Check Engine Light Solutions Precision Service for Peak PerformanceMercedes Check Engine Light Solutions Precision Service for Peak Performance
Mercedes Check Engine Light Solutions Precision Service for Peak Performance
 

Demo 2014 aidr_artificial_intelligence_disaster_response

  • 1. AIDR: Artificial Intelligence for Disaster Response Muhammad Imran Qatar Computing Research Institute Doha, Qatar mimran@qf.org.qa Carlos Castillo Qatar Computing Research Institute Doha, Qatar chato@acm.org Ji Lucas Qatar Computing Research Institute Doha, Qatar jlucas@qf.org.qa Patrick Meier Qatar Computing Research Institute Doha, Qatar patrick@irevolution.net Sarah Vieweg Qatar Computing Research Institute Doha, Qatar svieweg@qf.org.qa ABSTRACT We present AIDR (Artificial Intelligence for Disaster Re- sponse), a platform designed to perform automatic classi- fication of crisis-related microblog communications. AIDR enables humans and machines to work together to apply hu- man intelligence to large-scale data at high speed. The objective of AIDR is to classify messages that peo- ple post during disasters into a set of user-defined categories of information (e.g., “needs”, “damage”, etc.) For this pur- pose, the system continuously ingests data from Twitter, processes it (i.e., using machine learning classification tech- niques) and leverages human-participation (through crowd- sourcing) in real-time. AIDR has been successfully tested to classify informative vs. non-informative tweets posted dur- ing the 2013 Pakistan Earthquake. Overall, we achieved a classification quality (measured using AUC) of 80%. AIDR is available at http://aidr.qcri.org/. Categories and Subject Descriptors H.4 [Information Systems Applications]: Miscellaneous; D.2.2 [Software Engineering]: Design Tools and Tech- niques Keywords Stream processing; Crowdsourcing; Classification; Online Machine learning 1. INTRODUCTION Information overload during disasters can be as paralyz- ing to humanitarian response as the absence of information. During disasters, microblogging platforms like Twitter re- Copyright is held by the International World Wide Web Conference Com- mittee (IW3C2). IW3C2 reserves the right to provide a hyperlink to the author’s site if the Material is used in electronic media. WWW’14 Companion, April 7–11, 2014, Seoul, Korea. ACM 978-1-4503-2745-9/14/04. http://dx.doi.org/10.1145/2567948.2577034. Trainer Figure 1: AIDR: overall approach ceive an overwhelming amount of situation-sensitive infor- mation that people post in the form of textual messages, im- ages, and videos. Despite the fact that social media streams contain a significant amount of noise, much research [9, 4] has shown that these same streams of information also in- clude relevant, tactical information (e.g., regarding infras- tructure damage, needs, donations). Because social media communications provide a rich trove of information, it is pos- sible that even a small amount of relevant information can greatly enhance situational awareness and help responders and other concerned parties make more informed decision. Finding tactical and actionable information in real time within a rapidly growing stack of information is challeng- ing for many reasons. For instance, performing information extraction on short bursts of text (e.g., on 140-character tweets) is significantly more difficult than performing the same task on large documents such as blog posts pr news articles [6]. Moreover, research has shown that pre-trained classifiers significantly drop in classification accuracy when used in different but similar disasters [3]. This requires learning and training new classifiers using fresh training data every time a disaster strikes. Considering the amount of information that flows on Twit- ter, it is challenging for emergency managers and other stake- holders to investigate each individual tweet in real-time to
  • 2. look for useful information. Therefore, our goal is to lever- age different machine learning techniques (e.g., information classification, and extraction) to perform the job automati- cally. Moreover, we want humans (i.e. volunteers) to label part of the incoming data to be used for the training pur- poses of machine learning algorithms. Above all, the whole process must be ingesting, processing and producing only credible information in real-time, or with low latency [5]. The rest of the paper is organized as follows: In the next section, we describe domain challenges in crisis response. In section 3, we present an overview of AIDR from an end-user perspective, as well as an evaluation. Section 4 presents AIDR’s architecture and implementation. A demonstration storyboard is described in section 5, followed by the conclu- sion in section 6. 2. DOMAIN CHALLENGES IN CRISIS RESPONSE During disasters, social media messages provide real-time or low-latency situational awareness information that can enable crisis responders to be more effective in their relief efforts [8]. However, different emergency response agen- cies are interested in different types of messages. For in- stance, reports of damage to infrastructures should be di- rected to some agencies, while reports about shortages of water and/or food should be directed to others.1 Moreover, disaster response in its various types can be applied during the various phases of disaster such as prepa- ration, response and recovery. During each phase, disaster responders require different information. In our previous work [1], we observed that social media response to disasters follows the same pattern, that is, messages posted on social media during the early phases of a disaster talk about cau- tion & warnings, whereas messages posted during the later phases report infrastructure damage, casualties, donations required or available, etc. Below, we discuss the roles of automatic computation, hu- man computation, and the combination of the two in the processing of social media streams. Role of machine intelligence: Traditional information processing cannot be employed in this model, as disaster responders cannot wait to collect information, and then cu- rate and classify it offline. Instead, responders and other stakeholders require real-time insight and intelligence as the disaster unfolds. To this end, we aim to ingest and classify social media streams in real-time through automated means with the help of human intervention. Role of human intelligence: When attempting to per- form non-trivial tasks, machines alone are not capable of great accuracy. Human intervention is required to verify, teach, and/or correct the machine output [2]. Use of human intelligence fills the gap for the tasks that cannot be auto- mated, for example, providing input labels (i.e., for initial training), correcting or validating the machine’s output (i.e., for performance optimization) are among the types of human interventions. In AIDR, we aim to find a right balance so that the human intelligence can be used in an effective way. Combined intelligence: Relying solely on humans to in- vestigate each individual message is challenging due to the 1 The United Nations organizes its agencies into clusters: http://business.un.org/en/documents/6852. scale of information posted on Twitter, which goes beyond the processing capacity of humans. To this end, an au- tomatic approach is required that can intelligently crowd- source messages to obtain training examples when needed, and additionally, the system should effectively use crowd- sourcing workers both in terms of time (i.e., for volunteers) and cost (i.e., for paid workers). 3. SYSTEMS OVERVIEW The purpose of AIDR (Artificial Intelligence for Disaster Response),2 is to filter and classify messages posted to social media during humanitarian crises in real time. Specifically, AIDR collects crisis-related messages from Twitter3 (“tweets”), asks a crowd to label a sub-set of those messages, and trains an automatic classifier based on the labels. It also improves the classifier as more labels become available. Automatic classification using pre-existing train- ing data is not a satisfactory solution because although crises have elements in common, they also have specific aspects which make domain adaptation difficult. Crisis-specific la- bels lead to higher accuracy than labels from past disas- ters [3]. 3.1 AIDR in action: end-user perspective AIDR users begin by creating a collection process by en- tering a set of keywords or a geographical region that will be used to filter the Twitter stream, as shown in Figure 2(a). The user can monitor the collection status (e.g., to- tal processed items, last processed item, time elapsed, etc.) using dashboard as shown in Figure 2 (b). Next, a crowd of annotators provide training examples: a system-selected message plus a human-assigned label, as shown in Figure 2(c), which are then used to train classifiers for incoming items, as shown in Figure 2(d). Finally, an output of messages sorted into categories is generated, which can be collected and used to create crisis maps and other types of reports. An example consumer application is the current version of CrisisTracker,4 which uses AIDR to enable users to slice the data by categories of interest, which vary by deployment scenario to include for instance eyewitness accounts, reports of violence, or reports of damage infrastructure. 3.2 Evaluation AIDR was successfully tested during a recent earthquake in Pakistan in 2013. We set AIDR up to collect tweets using the hashtags (#Pakistan, #Awaran, #Balochistan, #earth- quake, #ReliefPK) on September 25, 2013 at 20:20:09 AST5 on a request of UN Office for the Coordination of Humani- tarian Affairs (OCHA). Within a few hours, SBTF (Standby Task Force) 6 volunteers were asked to label whether a given tweet was informative (i.e., if the tweet reports infrastruc- ture damage, casualties, donation offered or needed, etc.). They tagged about 1,000 tweets approximately within 6 hours. Though the prevalence of the negative class (“not informative”) was high, the system was able to learn from ≈200 informative labeled tweets. In this setup, we achieved 2 http://aidr.qcri.org/ 3 http://twitter.com/ 4 http://ufn.virtues.fi/~jakob/yolanda/ 5 Arabian Standard Time 6 http://blog.standbytaskforce.com/
  • 3. (a) AIDR Collector - create collection UI allows users to create their collections by specifying keywords, geographical regions, language filters etc. (b) AIDR Collector - for a running collection this UI shows various parameters like, downloaded items, time period, configuration history overtime etc. (d) AIDR Tagger - this UI shows the results of automatic tagging in terms of training elements , classified elements and quality etc. (c) AIDR Trainer - a public crowd sourcing UI used by volunteers to help providing labels for tweets which are then used by the automatic tagger for auto classification. Figure 2: AIDR Screenshots: showing (a) collector (creation screens), (b) collector (monitoring screen), (c) trainer, and (d) tagger. a maximum classification quality (in terms of AUC) up to 80%. AIDR success during the initial tests was featured by Wired UK7 on 30 September 2013, and by CBC8 on 18 December 2013. 4. ARCHITECTURE & IMPLEMENTATION The general architecture of AIDR is shown in Figure 3. AIDR is a free software platform that can be run as a web application, or downloaded to create your own instance.9 It consists of three core components; collector, tagger, and trainer. The collector performs edge adaptation [7], and is responsible for data collection. For instance, in our current setup it collects messages from Twitter using the Twitter streaming API. The collected tweets are then passed to the tagger for further processing. The tagger is responsible for the classification of each individual tweet. The tagger is comprised of three modules: feature extractor, learner, and classifier. First, the feature extractor receives a tweet, it extracts features (e.g., uni-grams and bi-grams), and passes it to the classifier. Second, the classifier’s job is to assign one of the user-defined categories (e.g., donations, damage, 7 http://www.webcitation.org/6N9iZuG1E 8 http://fw.to/QM0Lqnl 9 https://github.com/qcri-social/AIDR casualties, etc.) to the tweet. To do so, the classifier uses the learner module, which requires sufficient training examples to learn about each user-defined category. The training examples required by the system can be ob- tained either using internal web-based interface or by call- ing an external crowdsourcing platform. The former aims at enabling the collection owner to provide trusted training examples, whereas the latter collects training examples us- ing public crowdsourcing with the help of volunteers. We assume that there is a fixed budget of crowdsourcing work, but even if that is not the case, we see this as a problem of cost effectiveness. To ensure quality, training examples are obtained in a way that maximizes marginal quality gains per human label. The maximization of quality gains per label is done by performing intelligent task generation by selecting a small set of messages to be labeled by humans. Details on AIDR crowdsourcing part and task generation strategies are discussed in detail in our additional research [2]. The output of AIDR (i.e., classified tweets) can be ac- cessed through output adapters, which are exposed as an API. To show real-time classified items on a map or any other visualization widget, one can use AIDR’s live stream output adapter. Moreover, to fulfill various visualization de- mands, AIDR includes APIs to retrieve the k-latest items or to subscribe to a live data feed.
  • 4. Tweets collector query Administrator UI API API Trainer (PyBossa) API Features extractor Classifier Learner Trainer (for trusted users) API Crowdsourcing UI PyBossa DB Collector DB Tagger DB client server Output adapters tweets OAuth crowdsourcing workersadministrator AIDR Tagger AIDR Trainer - Collections - Users & roles ... - Training data - Test data ... TwitterstreamingAPI AIDR Collector - Task info - Workers ... Figure 3: AIDR architecture shows Collector, Trainer, and Tagger Implementation: AIDR comprised of a client-side and three server-side applications. Mainly, the client-side ap- plication has been developed using the Sencha ExtJS frame- work10 , and the server-side implementation is developed us- ing Java and the Springs 3.0 framework for the main applica- tion logic. We use PyBossa for the crowdsourcing processing purposes, and REDIS11 for the communication flows. AIDR is an open-source platform, and its source code is available at this repository12 . 5. DEMONSTRATION A live demo will be presented starting from an introduc- tion of the crisis computing domain and motivation behind the development of AIDR platform. A guided walk-through of the platform will be presented to introduce how different components of AIDR work. After demonstrating how to cre- ate collections, perform training, and enable an automatic classification process, we ask our reader to try the tool and create their own collection and perform classification with- out using any knowledge of machine learning. 6. CONCLUSIONS Social media platforms like Twitter receive an overwhelm- ing amount of situational awareness information. For emer- gency response, real-time disaster insights are important. Finding actionable and tactical information in real-time poses serious challenges. Effective coordination of human and ma- chine intelligence can improve disaster response efforts. In this paper, we have described AIDR, a platform to clas- sify Twitter messages into a set of user-defined situational awareness categories in real-time. The platform combines human and machine intelligence to obtain labels of a subset of messages and trains an automatic classifier to classify fur- ther posts. The platform uses active learning approach to select potential messages to tag, and learns continuously to increase classification accuracy when new training examples are available. 10 http://www.sencha.com/products/extjs/ 11 http://redis.io/ 12 https://github.com/qcri-social/AIDR 7. REFERENCES [1] S. R. Chowdhury, M. Imran, M. R. Asghar, S. Amer-Yahia, and C. Castillo. Tweet4act: Using incident-specific profiles for classifying crisis-related messages. In Proc. of ISCRAM, Baden-Baden, Germany, 2013. [2] M. Imran, C. Castillo, J. Lucas, M. Patrick, and J. Rogstadius. Coordinating human and machine intelligence to classify microblog communications in crises. Proc. of ISCRAM, 2014. [3] M. Imran, S. Elbassuoni, C. Castillo, F. Diaz, and P. Meier. Practical extraction of disaster-relevant information from social media. In Proc. of Workshop on Social Media Data for Disaster Management, WWW ’13 Companion, pages 1021–1024. ACM/IW3C2, 2013. [4] M. Imran, S. M. Elbassuoni, C. Castillo, F. Diaz, and P. Meier. Extracting information nuggets from disaster-related messages in social media. In Proc. of ISCRAM, Baden-Baden, Germany, 2013. [5] M. Imran, I. Lykourentzou, and C. Castillo. Engineering crowdsourced stream processing systems. arXiv preprint arXiv:1310.5463, 2013. [6] C. Li, J. Weng, Q. He, Y. Yao, A. Datta, A. Sun, and B.-S. Lee. Twiner: named entity recognition in targeted twitter stream. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, pages 721–730. ACM, 2012. [7] D. Turaga, H. Andrade, B. Gedik, C. Venkatramani, O. Verscheure, J. D. Harris, J. Cox, W. Szewczyk, and P. Jones. Design principles for developing stream processing applications. Software: Practice and Experience, 40(12):1073–1104, 2010. [8] S. Vieweg. Microblogged contributions to the emergency arena: Discovery, interpretation and implications. In Proc. of CSCW, February 2010. [9] S. E. Vieweg. Situational awareness in mass emergency: A behavioral and linguistic analysis of microblogged communications. 2012.