BIG CRISIS DATA
An Open Invitation
CARLOS CASTILLO
@BigCrisisData
Manaus, Brazil, October 2015
BigCrisis Data — Carlos Castillo 2
This talk is about ...
● Disasters and time-critical situations
– Natural, social, or technological hazards
– Mass convergence events
● Social media
– Particularly microtext
● Computing
– Applications of many fields including NLP, ML, IR
http://www.youtube.com/watch?v=0UFsJhYBxzY
An earthquake hits a Twitter user
http://xkcd.com/723/
● When an earthquake strikes, the first tweets are posted 20-30 seconds later
● Damaging seismic waves travel at 3-5 km/s, while network traffic moves at near light speed over fiber/copper, plus latency
● Beyond ~100 km from the epicenter, seismic waves may be overtaken by tweets about them
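The ~100 km figure follows from the two numbers above. A minimal sketch of the arithmetic, assuming network transit time is negligible next to the 20-30 second posting delay (the function names are illustrative, not from the talk):

```python
def overtaking_distance_km(tweet_delay_s: float, wave_speed_km_s: float) -> float:
    """Distance from the epicenter beyond which a tweet can arrive first.

    Assumption: the tweet itself travels in negligible time, so the wave
    only has a head start equal to the posting delay.
    """
    return tweet_delay_s * wave_speed_km_s


def seconds_of_warning(distance_km: float, tweet_delay_s: float,
                       wave_speed_km_s: float) -> float:
    """Time between the tweet arriving and the wave arriving (negative = no warning)."""
    return distance_km / wave_speed_km_s - tweet_delay_s


if __name__ == "__main__":
    # Ranges quoted on the slide: 20-30 s delay, 3-5 km/s wave speed.
    print(overtaking_distance_km(20, 3), "km (fast tweet, slow wave)")
    print(overtaking_distance_km(30, 5), "km (slow tweet, fast wave)")
    print(seconds_of_warning(200, 25, 4), "s of warning at 200 km")
```

With mid-range values (25 s, 4 km/s) the crossover lands at 100 km, which matches the slide's estimate.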
January 2010
How/when did it start for me?
Humanitarian Computing
At least 775 publications:
● Crisis Analysis (55)
● Crisis Management (309)
● Situational Awareness (67)
● Social Media (231)
● Mobile Phones (74)
● Crowdsourcing (116)
● Software and Tools (97)
● Human-Computer Interaction (28) 
● Natural Language Processing (33) 
● Trust and Security (33)
● Geographical Analysis (53)
Source: http://humanitariancomp.referata.com/
Humanitarian Computing Topics
Fertile grounds for applied research
✔ Problems of global significance
✔ Solved with labor-intensive methods
✔ Better solution provides a public good
✔ Large and noisy data sets available
✔ Engage volunteer communities
Relevance to practitioners?
Recent collaborators
Patrick Meier – QCRI
Sarah Vieweg – QCRI
Muhammad Imran – QCRI
Irina Temnikova – QCRI
Alexandra Olteanu – EPFL
Aditi Gupta – IIIT Delhi
“P.K.” Kumaraguru – IIIT Delhi
Fernando Diaz – Microsoft
Outline
Volume
Vagueness
Visualization
Volunteering
Values
Disaster Communications and Scale
Crises and disasters
● Crises are unstable situations
– May or may not lead to a disaster
● Disasters are social phenomena
– Disruptions of routines
Temporal and Spatial Dimensions
Examples
REEL LIFE OR REAL LIFE?
https://www.youtube.com/watch?v=MylI8HmgMBk
In Real Life ...
● Some people panic, most people don't
● People gather information from familiar sources
● People quickly decide whether to flee, take cover, or take action
● People improvise complex rescue operations on the spot
Devon, UK, June 2014; London, UK, May 2015; San José Boquerón, Paraguay, Oct 2013
Example Disaster-Related Messages
“OMG! The fire seems out of control: It’s running down the hills!”
Bush fire near Marseilles, France, in 2009 [Longueville et al. 2009]
“Red River at East Grand Forks is 48.70 feet, +20.7 feet of flood stage, -5.65 feet of 1997 crest. #flood09”
Red River Valley floods in 2009 [Starbird et al. 2010]
“My moms backyard in Hatteras. That dock is usually about 3 feet above water [photo]”
Hurricane Sandy 2012 [Leavitt and Clark 2014]
“Sirens going off now!! Take cover...be safe!”
Moore Tornado 2013 [Blanford et al. 2014]
“There is shooting at Utøya, my little sister is there and just called home!”
2011 attacks in Norway [Perng et al. 2013]
Social media usage during disasters
● Interpersonal (horizontal)
– Stay in touch with family and friends
● Citizen sensing (bottom-up)
– Read/Write reports on ground situation
● Official communications (top-down)
– E.g. advice, warnings, or evacuation orders
Scale: Tweets per Second
Requirements
● Typical users
– Emergency response services
– Humanitarian relief agencies
– Journalists and the Public
● Underspecified requirements that vary over time
● Usually a combination of:
1) Capture the “Big Picture”
2) Obtain “Actionable Insights”
Understanding, Classifying and Extracting
Example
“Media must report about d alleged 20k RSS chaps off 2 #Nepal.here’s a pic coz d 1 @ShainaNC shared isn’t true.. ;)”
Social media messages
● Social media is more like a transcript of a conversation than like
text meant to stand on its own
– Awkward entry methods:
● Fragmented language and incomplete sentences
● Many typographic and grammatical errors
– Conversational:
● Little or no context (hard to comprehend in isolation)
● Code switching and borrowing
● Internet slang
Slang
Classification
● Input: filtered tweets
● By information type: Caution & Advice, Information Sources, Damage & Casualties, Donations, ...
● By source: Gov, Eyewitness, Media, NGO, Outsider, ...
Classification Axes
● By usefulness (application-dependent!)
– Not related, Related but useless, Useful
● By factual, subjective, or emotional content
● By information provided
● By information source
– Government, NGOs, media, eyewitnesses, etc.
● By humanitarian clusters
Humanitarian Clusters
Humanitarian Clusters (cont.)
Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: "What to Expect When the Unexpected Happens: Social Media Communications Across Crises." In CSCW 2015.
A large-scale study of crisis tweets
● Collect tweets from 26 disasters
● Classify according to:
– Informative / Not informative
– Information provided
– Information source
● Several iterations required to write the “right” instructions
Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: "What to Expect
When the Unexpected Happens: Social Media Communications Across
Crises" In CSCW 2015, 14-18 March in Vancouver, Canada. ACM Press.
Information Provided in Crisis Tweets
N=26; Data available at http://crisislex.org/
What do people tweet about?
● Affected individuals
– 20% on average (min. 5%, max. 57%)
– most prevalent in human-induced, focalized & instantaneous
events
● Sympathy and emotional support
– 20% on average (min. 3%, max. 52%)
– most prevalent in instantaneous events
● Other useful information
– 32% on average (min. 7%, max. 59%)
– least prevalent in diffused events
What do people tweet about? (cont.)
● Infrastructure and utilities
– 7% on average (min. 0%, max. 22%)
– most prevalent in diffused events, in particular floods
● Caution and advice
– 10% on average (min. 0%, max. 34%)
– least prevalent in instantaneous & human-induced events
● Donations and volunteering
– 10% on average (min. 0%, max. 44%)
– most prevalent in natural hazards
Distribution over information sources
Distribution over time
Dataset
CrisisLexT26
www.crisislex.org
Information Extraction
● Input: classified tweets
● Example: “@JimFreund: Apparently we have no choice. There is a tornado watch in effect tonight.”
Extraction
● #hashtags, @user mentions, URLs, etc.
– Regular expressions
– Text library from Twitter
● Temporal expressions
– Part-of-speech tagger + heuristics
– Natty library
● Supervised learning
Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz
and Patrick Meier: Practical Extraction of Disaster-Relevant Information
from Social Media. Social Web and Disaster Management (SWDM)
workshop. Rio de Janeiro, Brazil, 2013.
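The regular-expression route for hashtags, mentions, and URLs can be sketched with Python's standard `re` module. The patterns below are simplified assumptions for illustration; they are not the rules used by Twitter's text library, which handles many more edge cases.

```python
import re

# Simplified, illustrative patterns (assumptions, not Twitter's official rules).
URL_RE = re.compile(r"https?://\S+")
HASHTAG_RE = re.compile(r"#(\w+)")
MENTION_RE = re.compile(r"@(\w{1,15})")


def extract_entities(text: str) -> dict:
    """Return the hashtags, @mentions, and URLs found in one message."""
    urls = URL_RE.findall(text)
    # Blank out URLs first so that fragments such as ".../map#zone1"
    # are not mistaken for hashtags.
    without_urls = URL_RE.sub(" ", text)
    return {
        "hashtags": HASHTAG_RE.findall(without_urls),
        "mentions": MENTION_RE.findall(without_urls),
        "urls": urls,
    }


if __name__ == "__main__":
    tweet = ("RT @twc_hurricane: Wind gusts over 60 mph are being reported at "
             "Central Park and JFK airport in #NYC this hour. #Sandy")
    print(extract_entities(tweet))
```

Temporal expressions and free-text nuggets are harder to capture this way, which is where the tagger-plus-heuristics and supervised approaches listed above come in.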
Labels for extraction
● Type-dependent instruction
● Ask evaluators to copy-paste a word/phrase from each tweet
Learning: Conditional Random Fields
● Extends HMM to incorporate more possible dependencies
● Used extensively in NLP for part-of-speech tagging and
information extraction
Figure: HMM vs. linear-chain CRF, each with hidden and observed variables
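For reference, the two models can be written side by side in their standard textbook form. The notation is an assumption of this note, not taken from the slides: x is the observed token sequence, y the hidden label sequence, f_k are feature functions, and lambda_k their learned weights.

```latex
% HMM: a generative model of the joint distribution; each observation
% depends only on its own hidden state.
p(\mathbf{x}, \mathbf{y}) = \prod_{t=1}^{T} p(y_t \mid y_{t-1}) \, p(x_t \mid y_t)

% Linear-chain CRF: a discriminative model of the conditional distribution;
% each feature may inspect the whole observed sequence.
p(\mathbf{y} \mid \mathbf{x}) = \frac{1}{Z(\mathbf{x})}
  \exp\Big( \sum_{t=1}^{T} \sum_{k} \lambda_k \, f_k(y_{t-1}, y_t, \mathbf{x}, t) \Big),
\qquad
Z(\mathbf{x}) = \sum_{\mathbf{y}'} \exp\Big( \sum_{t=1}^{T} \sum_{k} \lambda_k \, f_k(y'_{t-1}, y'_t, \mathbf{x}, t) \Big)
```

The "more possible dependencies" are visible in the arguments of f_k: a feature at position t can look at any part of x, whereas the HMM emission term sees only x_t.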
Tool
● CMU ARK Twitter NLP
– Tokenization
– Feature extraction
– CRF learning
● Very easy to use
– simply change the training set (part-of-speech tags),
– then re-train
Output examples
RT @weatherchannel: .@NYGovCuomo orders closing of NYC bridges. Only Staten Island bridges unaffected at this time. Bridges must close by 7pm. #Sandy #NYC
Wow what a mess #Sandy has made. Be sure to check on the elderly and homeless please! Thoughts and prayers to all affected
RT @twc_hurricane: Wind gusts over 60 mph are being reported at Central Park and JFK airport in #NYC this hour. #Sandy
RT @mitchellreports: Red Cross tells us grateful for Romney donation but prefer people send money or donate blood dont collect goods NOT best way to help #Sandy
Extractor evaluation
Setting                                     Recall   Precision
Train 2/3 Joplin, Test 1/3 Joplin           78%      90%
Train 2/3 Sandy, Test 1/3 Sandy             41%      79%
Train Joplin, Test Sandy                    11%      78%
Train Joplin + 10% Sandy, Test 90% Sandy    21%      81%
● Precision counts an extraction as correct if it has one word or more in common with what humans extracted
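The lenient precision criterion above can be sketched as follows. This is an illustration under assumptions: whitespace tokenization, case-insensitive comparison, and hypothetical function names; the evaluation code behind the reported numbers is not shown in the talk.

```python
def shares_a_word(predicted: str, gold: str) -> bool:
    """True if the two phrases have at least one (lowercased) word in common."""
    return bool(set(predicted.lower().split()) & set(gold.lower().split()))


def lenient_precision(pairs) -> float:
    """Fraction of non-empty extractions that overlap the human extraction.

    `pairs` is a list of (predicted_phrase, human_phrase) strings, one per
    tweet. Tweets where the extractor produced nothing are skipped, since
    they affect recall rather than precision.
    """
    scored = [(p, g) for p, g in pairs if p]
    if not scored:
        return 0.0
    hits = sum(shares_a_word(p, g) for p, g in scored)
    return hits / len(scored)


if __name__ == "__main__":
    example = [
        ("tornado watch", "a tornado watch in effect"),   # overlaps
        ("60 mph", "Wind gusts over 60 mph"),              # overlaps
        ("Central Park", "closing of NYC bridges"),        # no overlap
        ("", "donate blood"),                              # nothing extracted
    ]
    print(lenient_precision(example))
```

Because a single shared word is enough, this measure is generous; the drop in recall when training on Joplin and testing on Sandy is the more telling signal in the table.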
Donations matching
● Identify and match requests/offers for donations
– Money, clothing, food, shelter, volunteers, blood
● Method
– Classify
– Determine key aspects
– Extract key aspects
– Per-aspect matching
Hemant Purohit, Amit Sheth, Carlos Castillo, Patrick Meier, Fernando
Diaz: Emergency-Relief Coordination on Social Media: Automatically
Matching Resource Requests and Offers. First Monday 19 (1), January
2014.
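The final matching step can be sketched as a ranking problem. The code below is a toy illustration, not the method of the cited paper: it assumes messages are already classified as requests or offers and tagged with a resource type (the hypothetical "resource" field stands in for an extracted aspect), then ranks offers by Jaccard word overlap.

```python
def jaccard(a: str, b: str) -> float:
    """Word-level Jaccard similarity between two short texts."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def match_offers(request: dict, offers: list, top_k: int = 3) -> list:
    """Rank offers for one request.

    Per-aspect step: keep only offers with the same resource type.
    Text step: order the remaining offers by similarity to the request.
    """
    candidates = [o for o in offers if o["resource"] == request["resource"]]
    ranked = sorted(candidates,
                    key=lambda o: jaccard(request["text"], o["text"]),
                    reverse=True)
    return ranked[:top_k]


if __name__ == "__main__":
    request = {"resource": "clothing",
               "text": "need warm clothing for kids in Queens"}
    offers = [
        {"resource": "clothing", "text": "clothing drive in Boston"},
        {"resource": "clothing", "text": "donating warm clothing in Queens"},
        {"resource": "blood", "text": "blood donors needed in Queens"},
    ]
    for offer in match_offers(request, offers):
        print(offer["text"])
```

Filtering on an aspect before comparing text is one simple way to go beyond pure text similarity.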
Donations matching (cont.)
Average precision = 0.21 (0.16 if only text similarity is used)
Crisis maps from social media
Patrick Meier, Social Innovation Director @ QCRI – http://irevolution.net/
“What can speed humanitarian
response to tsunami-ravaged coasts?
Expose human rights atrocities?
Launch helicopters to rescue
earthquake victims? Outwit corrupt
regimes?
A map.”
Crisis mapping goes mainstream (2011)
Automatic Mapping (floods)
● Top: hydrological data
● Bottom: tweet density
● Broad match with affected areas
● Many biases towards places with
higher density of smartphones
De Albuquerque, João Porto, Herfort, Benjamin, Brenning, Alexander, and
Zipf, Alexander. 2015. A geographic approach for combining social media
and authoritative data towards identifying useful information for disaster
management. International Journal of Geographical Information Science,
29(4), 667–689.
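A tweet-density layer of the kind compared against hydrological data can be approximated by counting geotagged messages per grid cell. This is a minimal sketch under assumptions (a fixed-size latitude/longitude grid, made-up coordinates, hypothetical function name); it is not the procedure of the cited study.

```python
import math
from collections import Counter


def tweet_density(points, cell_deg: float = 0.1) -> Counter:
    """Count geotagged tweets per grid cell.

    `points` is an iterable of (latitude, longitude) pairs in degrees.
    Each cell is identified by the integer index of its lower-left corner.
    """
    counts = Counter()
    for lat, lon in points:
        cell = (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
        counts[cell] += 1
    return counts


if __name__ == "__main__":
    # Made-up coordinates for illustration only.
    sample = [(51.05, 13.74), (51.06, 13.73), (51.31, 12.37)]
    for cell, n in tweet_density(sample).most_common():
        print(cell, n)
```

Raw counts inherit the bias noted above: cells with more smartphone users produce more tweets regardless of flood severity, so some normalization (for example by baseline tweet volume per cell) would likely be needed before comparing with authoritative data.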
Automatic Mapping (Dengue)
Gomide, Janaina and Veloso, Adriano and Meira, Wagner and Almeida,
Virgilio and Benevenuto, Fabricio and Ferraz, Fernanda and Teixeira,
Mauro (2011) Dengue surveillance based on a computational model of
spatio-temporal locality of Twitter. pp. 1-8. In: Proceedings of the ACM
WebSci'11, June 14-17 2011, Koblenz, Germany.
● Top: official reports
● Bottom: tweets
Current Approach
Hybrid real-time systems
MicroMappers
Manual processing: crowdsourcing
Automatic processing: machine learning
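One common way to combine the two kinds of processing is to let the classifier handle messages it is confident about and send the rest to volunteers. The sketch below illustrates that idea only; the confidence threshold, the toy classifier, and the function names are assumptions, and the talk does not describe how MicroMappers routes messages.

```python
def route(messages, classify, threshold: float = 0.8):
    """Split messages between automatic labeling and the crowd.

    `classify` is any function returning (label, confidence) for a text.
    Messages at or above `threshold` keep the automatic label; the rest
    are queued for human annotation.
    """
    auto_labeled, for_crowd = [], []
    for text in messages:
        label, confidence = classify(text)
        if confidence >= threshold:
            auto_labeled.append((text, label))
        else:
            for_crowd.append(text)
    return auto_labeled, for_crowd


def toy_classify(text: str):
    """Stand-in for a trained model (hypothetical rule, for illustration)."""
    if "take cover" in text.lower():
        return "caution_and_advice", 0.9
    return "unknown", 0.3


if __name__ == "__main__":
    stream = ["Sirens going off now!! Take cover...be safe!",
              "Wow what a mess"]
    auto, crowd = route(stream, toy_classify)
    print("automatic:", auto)
    print("to volunteers:", crowd)
```

In such a loop, labels produced by volunteers can also be fed back as training data for the automatic side.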
http://newsbeatsocial.com/watch/0_s6xxcr3p
https://www.youtube.com/watch?v=uKgE3yWJ0_I
Volunteering and Values
Volunteering is a constant
● Integral part of how communities react to disasters
● Organizational types:
– Existing – Extending – Expanding – Emerging
● Emergent organizations a mixed blessing for existing ones
● New scenario: digital volunteering
– E.g. volunteer annotations, including crisis mapping
Why do people volunteer?
Altruism is key, but it's one of many reasons
Privacy and Ethics
● Protect the privacy of individuals
– ICRC Data Protection Guidelines
– UN Guidelines on Cyber Security
● Protect victims and responders during armed attacks
● Protect volunteers from distal exposure
● Protect citizen reporters from danger and retaliation
● Give back and share results and data
“I'm dying, they are tweeting”
Digital Voyeurism
CONCLUSIONS
● Good projects in this space are:
– Computationally feasible
– Supported by data
– Useful
● Temptation! Danger!
– Poorly planned projects :-(
– AI-complete problems
Interdisciplinary Research
● Like many things, it has Good, Bad, and Ugly aspects
● Good
– You learn a lot, and it's the only way of supporting claims of practical utility in applied research
● Bad
– Formal response organizations can be very difficult to engage with; relationships should be established between operations
● Ugly
– Working software and 24/7 support for a critical need now vs advanced proof-of-concept later
Possibility of large impact by using
computer science to support
humanitarian work
=
Applied computing at its best
References
● Carlos Castillo: "Big Crisis Data." Cambridge University Press, 2016 (forthcoming).
● Muhammad Imran, Carlos Castillo, Fernando Diaz, Sarah Vieweg: "Processing Social Media Messages in Mass Emergency: A Survey." In ACM Computing Surveys, Volume 47, Issue 4, June 2015.
● Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: "What to Expect When the Unexpected Happens: Social Media Communications Across Crises." In CSCW 2015, 14-18 March in Vancouver, Canada. ACM Press.
● Muhammad Imran, Ioanna Lykourentzou, Yannick Naudet and Carlos Castillo: "Engineering Crowdsourced Stream Processing Systems." Technical report, 2015.
● Hemant Purohit, Amit Sheth, Carlos Castillo, Patrick Meier, Fernando Diaz: "Emergency-Relief Coordination on Social Media: Automatically Matching Resource Requests and Offers." First Monday 19 (1), January 2014.
● Sarah Vieweg, Carlos Castillo and Muhammad Imran: "Integrating Social Media Communications into the Rapid Assessment of Sudden Onset Disasters." SocInfo 2014.
● Alexandra Olteanu, Carlos Castillo, Fernando Diaz and Sarah Vieweg: "CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises." In ICWSM. Ann Arbor, MI, USA. June 2014.
● Carlos Castillo, Marcelo Mendoza, Barbara Poblete: "Predicting Information Credibility in Time-Sensitive Social Media (+Supplementary Material)." In Internet Research, Vol. 23, Issue 5, Special issue on The Predictive Power of Social Media, pp. 560-588. October 2013.
● Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: "Practical Extraction of Disaster-Relevant Information from Social Media." Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013.
● Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: "Extracting Information Nuggets from Disaster-Related Messages in Social Media." In ISCRAM. Baden-Baden, Germany, 2013. Best paper award.
Thank you!
Follow @BigCrisisData

More Related Content

What's hot

Public Health Crisis Analytics for Gender Violence
Public Health Crisis Analytics for Gender ViolencePublic Health Crisis Analytics for Gender Violence
Public Health Crisis Analytics for Gender ViolenceHemant Purohit
 
Snowden-final-report-for-publication
Snowden-final-report-for-publicationSnowden-final-report-for-publication
Snowden-final-report-for-publicationZarte Siempre
 
New media and democratic society 1117 presentation2
New media and democratic society 1117 presentation2New media and democratic society 1117 presentation2
New media and democratic society 1117 presentation2Tina Moore
 
The Internet & The Cloud - Socio-economic Impact on Citizens
The Internet & The Cloud - Socio-economic Impact on CitizensThe Internet & The Cloud - Socio-economic Impact on Citizens
The Internet & The Cloud - Socio-economic Impact on CitizensLSP / PSL
 
New media and democratic society 1117 presentation
New media and democratic society 1117 presentationNew media and democratic society 1117 presentation
New media and democratic society 1117 presentationTina Moore
 
The roadmap to abolish aging by 2040
The roadmap to abolish aging by 2040The roadmap to abolish aging by 2040
The roadmap to abolish aging by 2040David Wood
 
Typhoon pablo bopha activation
Typhoon pablo bopha activationTyphoon pablo bopha activation
Typhoon pablo bopha activationCatherine Graham
 
Progressive ethics in the digital age
Progressive ethics in the digital ageProgressive ethics in the digital age
Progressive ethics in the digital ageDavid Wood
 
Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...
Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...
Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...Axel Bruns
 
Pipelines: 2052-James Breaux, Centurion Pipeline Co.
Pipelines: 2052-James Breaux, Centurion Pipeline Co.Pipelines: 2052-James Breaux, Centurion Pipeline Co.
Pipelines: 2052-James Breaux, Centurion Pipeline Co.Energy Network marcus evans
 
Offline Activism - How successful activism facilitate social media
Offline Activism - How successful activism facilitate social mediaOffline Activism - How successful activism facilitate social media
Offline Activism - How successful activism facilitate social mediaWilson Fung
 
Government 2.0 Defined
Government 2.0 DefinedGovernment 2.0 Defined
Government 2.0 DefinedWalter Schwabe
 

What's hot (12)

Public Health Crisis Analytics for Gender Violence
Public Health Crisis Analytics for Gender ViolencePublic Health Crisis Analytics for Gender Violence
Public Health Crisis Analytics for Gender Violence
 
Snowden-final-report-for-publication
Snowden-final-report-for-publicationSnowden-final-report-for-publication
Snowden-final-report-for-publication
 
New media and democratic society 1117 presentation2
New media and democratic society 1117 presentation2New media and democratic society 1117 presentation2
New media and democratic society 1117 presentation2
 
The Internet & The Cloud - Socio-economic Impact on Citizens
The Internet & The Cloud - Socio-economic Impact on CitizensThe Internet & The Cloud - Socio-economic Impact on Citizens
The Internet & The Cloud - Socio-economic Impact on Citizens
 
New media and democratic society 1117 presentation
New media and democratic society 1117 presentationNew media and democratic society 1117 presentation
New media and democratic society 1117 presentation
 
The roadmap to abolish aging by 2040
The roadmap to abolish aging by 2040The roadmap to abolish aging by 2040
The roadmap to abolish aging by 2040
 
Typhoon pablo bopha activation
Typhoon pablo bopha activationTyphoon pablo bopha activation
Typhoon pablo bopha activation
 
Progressive ethics in the digital age
Progressive ethics in the digital ageProgressive ethics in the digital age
Progressive ethics in the digital age
 
Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...
Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...
Beyond the Bubble: A Critical Review of the Evidence for Echo Chambers and Fi...
 
Pipelines: 2052-James Breaux, Centurion Pipeline Co.
Pipelines: 2052-James Breaux, Centurion Pipeline Co.Pipelines: 2052-James Breaux, Centurion Pipeline Co.
Pipelines: 2052-James Breaux, Centurion Pipeline Co.
 
Offline Activism - How successful activism facilitate social media
Offline Activism - How successful activism facilitate social mediaOffline Activism - How successful activism facilitate social media
Offline Activism - How successful activism facilitate social media
 
Government 2.0 Defined
Government 2.0 DefinedGovernment 2.0 Defined
Government 2.0 Defined
 

Viewers also liked

Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Carlos Castillo (ChaTo)
 
Dr. Searcher and Mr. Browser: A unified hyperlink-click graph
Dr. Searcher and Mr. Browser: A unified hyperlink-click graphDr. Searcher and Mr. Browser: A unified hyperlink-click graph
Dr. Searcher and Mr. Browser: A unified hyperlink-click graphCarlos Castillo (ChaTo)
 
The Effects of Time on Query Flow Graph-based Models for Query Suggestion
The Effects of Time on Query Flow Graph-based Models for Query SuggestionThe Effects of Time on Query Flow Graph-based Models for Query Suggestion
The Effects of Time on Query Flow Graph-based Models for Query SuggestionCarlos Castillo (ChaTo)
 
Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...Carlos Castillo (ChaTo)
 
Information Verification During Natural Disasters
Information Verification During Natural DisastersInformation Verification During Natural Disasters
Information Verification During Natural DisastersCarlos Castillo (ChaTo)
 
Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
 Social Media News Communities: Gatekeeping, Coverage, and Statement Bias Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
Social Media News Communities: Gatekeeping, Coverage, and Statement BiasMounia Lalmas-Roelleke
 
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...IIIT Hyderabad
 
Kdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-iKdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-iLaks Lakshmanan
 
Kdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-ivKdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-ivLaks Lakshmanan
 
Kdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-iiKdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-iiLaks Lakshmanan
 
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaExtracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaMuhammad Imran
 
What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...Carlos Castillo (ChaTo)
 
Emotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of WikipediaEmotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of WikipediaDavid Laniado
 
Kdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iiiKdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iiiLaks Lakshmanan
 

Viewers also liked (17)

Crisis Computing
Crisis ComputingCrisis Computing
Crisis Computing
 
Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)
 
Dr. Searcher and Mr. Browser: A unified hyperlink-click graph
Dr. Searcher and Mr. Browser: A unified hyperlink-click graphDr. Searcher and Mr. Browser: A unified hyperlink-click graph
Dr. Searcher and Mr. Browser: A unified hyperlink-click graph
 
The Effects of Time on Query Flow Graph-based Models for Query Suggestion
The Effects of Time on Query Flow Graph-based Models for Query SuggestionThe Effects of Time on Query Flow Graph-based Models for Query Suggestion
The Effects of Time on Query Flow Graph-based Models for Query Suggestion
 
Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...
 
Information Verification During Natural Disasters
Information Verification During Natural DisastersInformation Verification During Natural Disasters
Information Verification During Natural Disasters
 
Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
 Social Media News Communities: Gatekeeping, Coverage, and Statement Bias Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
 
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
 
Kdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-iKdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-i
 
Kdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-ivKdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-iv
 
Kdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-iiKdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-ii
 
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaExtracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social Media
 
What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...
 
Fairness-Aware Data Mining
Fairness-Aware Data MiningFairness-Aware Data Mining
Fairness-Aware Data Mining
 
Emotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of WikipediaEmotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of Wikipedia
 
Kdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iiiKdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iii
 
Discrimination Discovery
Discrimination DiscoveryDiscrimination Discovery
Discrimination Discovery
 

Similar to Keynote talk: Big Crisis Data, an Open Invitation

Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Carlos Castillo (ChaTo)
 
InfoCrisis.Social - Design Process
InfoCrisis.Social - Design ProcessInfoCrisis.Social - Design Process
InfoCrisis.Social - Design ProcessJavier Velasco, PhD
 
Middlebury Institute May 2016
Middlebury Institute May 2016Middlebury Institute May 2016
Middlebury Institute May 2016Catherine Graham
 
Disarm vanguards 2022-02-25 (3)
Disarm vanguards 2022-02-25 (3)Disarm vanguards 2022-02-25 (3)
Disarm vanguards 2022-02-25 (3)SaraJayneTerp
 
Transforming Social Big Data into Timely Decisions and Actions for Crisis Mi...
Transforming Social Big Data into Timely Decisions  and Actions for Crisis Mi...Transforming Social Big Data into Timely Decisions  and Actions for Crisis Mi...
Transforming Social Big Data into Timely Decisions and Actions for Crisis Mi...Amit Sheth
 
Addressing post-truth - GSK - 10/05/2019
Addressing post-truth - GSK - 10/05/2019Addressing post-truth - GSK - 10/05/2019
Addressing post-truth - GSK - 10/05/2019Denys Malengreau
 
Why pandemics and climate change are hard to understand, and can we help?
Why pandemics and climate change are hard to understand, and can we help?Why pandemics and climate change are hard to understand, and can we help?
Why pandemics and climate change are hard to understand, and can we help?Alan Dix
 
The Filter in Our (?) Heads: Digital Media and Polarisation
Keynote talk: Big Crisis Data, an Open Invitation

  • 1. BIG CRISIS DATA: An Open Invitation. Carlos Castillo, @BigCrisisData. Manaus, Brazil, October 2015
  • 2. This talk is about ...
      ● Disasters and time-critical situations
          – Natural, social, or technological hazards
          – Mass convergence events
      ● Social media
          – Particularly microtext
      ● Computing
          – Applications of many fields including NLP, ML, IR
  • 3. http://www.youtube.com/watch?v=0UFsJhYBxzY
  • 4. An earthquake hits a Twitter user (http://xkcd.com/723/)
      ● When an earthquake strikes, the first tweets are posted 20-30 seconds later
      ● Damaging seismic waves travel at 3-5 km/s, while network communications travel at close to light speed over fiber/copper, plus latency
      ● Beyond ~100 km from the epicenter, seismic waves may be overtaken by tweets about them
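The "~100 km" figure on slide 4 can be checked with a rough back-of-the-envelope calculation. The sketch below assumes that network latency is negligible next to the 20-30 second posting delay, so a tweet reaches a reader at about the moment it is posted; the function name is illustrative, not from the talk.

```python
def crossover_distance_km(wave_speed_km_s: float, tweet_delay_s: float) -> float:
    """Distance at which a tweet posted `tweet_delay_s` seconds after the quake
    arrives at the same moment as the seismic wave.

    Assumption: network latency is ignored, so the tweet "arrives" when posted.
    """
    return wave_speed_km_s * tweet_delay_s


# Slowest wave with the fastest tweet, and fastest wave with the slowest tweet,
# using the ranges quoted on the slide (3-5 km/s, 20-30 s).
low = crossover_distance_km(3.0, 20.0)
high = crossover_distance_km(5.0, 30.0)
print(f"Tweets can outrun the wave beyond roughly {low:.0f}-{high:.0f} km")
```

Under these assumptions the crossover falls between 60 and 150 km, which is consistent with the slide's rounded figure of about 100 km.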
  • 5. January 2010: How/when did it start for me?
  • 6. Humanitarian Computing. At least 775 publications:
      ● Crisis Analysis (55)
      ● Crisis Management (309)
      ● Situational Awareness (67)
      ● Social Media (231)
      ● Mobile Phones (74)
      ● Crowdsourcing (116)
      ● Software and Tools (97)
      ● Human-Computer Interaction (28)
      ● Natural Language Processing (33)
      ● Trust and Security (33)
      ● Geographical Analysis (53)
      Source: http://humanitariancomp.referata.com/
  • 7. Humanitarian Computing Topics
  • 10. Fertile grounds for applied research
      ✔ Problems of global significance
      ✔ Solved with labor-intensive methods
      ✔ Better solution provides a public good
      ✔ Large and noisy data sets available
      ✔ Engage volunteer communities
  • 11. Fertile grounds for applied research (cont.): Relevance to practitioners?
  • 12. Recent collaborators: Patrick Meier; Sarah Vieweg – QCRI; Muhammad Imran – QCRI; Irina Temnikova – QCRI; Alexandra Olteanu – EPFL; Aditi Gupta – IIIT Delhi; “P.K.” Kumaraguru – IIIT Delhi; Fernando Diaz – Microsoft
  • 13. Outline: Volume, Vagueness, Visualization, Volunteering, Values
  • 14. Disaster Communications and Scale
  • 15. Crises and disasters
      ● Crises are unstable situations
          – May or may not lead to a disaster
      ● Disasters are social phenomena
          – Disruptions of routines
  • 16. Temporal and Spatial Dimensions
  • 17. Examples
  • 18. REEL LIFE OR REAL LIFE?
  • 19. REEL LIFE OR REAL LIFE?
  • 20. https://www.youtube.com/watch?v=MylI8HmgMBk
  • 21. In Real Life ...
      ● Some people panic, most people don't
      ● People gather information from familiar sources
      ● People quickly decide whether to flee, take cover, or take action
      ● People improvise complex rescue operations on the spot
      Devon, UK, June 2014; London, UK, May 2015; San José Boquerón, Paraguay, Oct 2013
  • 22. Example Disaster-Related Messages
      ● “OMG! The fire seems out of control: It’s running down the hills!” Bush fire near Marseilles, France, in 2009 [Longueville et al. 2009]
      ● “Red River at East Grand Forks is 48.70 feet, +20.7 feet of flood stage, -5.65 feet of 1997 crest. #flood09” Red River Valley floods in 2009 [Starbird et al. 2010]
      ● “My moms backyard in Hatteras. That dock is usually about 3 feet above water [photo]” Hurricane Sandy 2012 [Leavitt and Clark 2014]
      ● “Sirens going off now!! Take cover...be safe!” Moore Tornado 2013 [Blanford et al. 2014]
      ● “There is shooting at Utøya, my little sister is there and just called home!” 2011 attacks in Norway [Perng et al. 2013]
  • 23. Social media usage during disasters
      ● Interpersonal (horizontal)
          – Stay in touch with family and friends
      ● Citizen sensing (bottom-up)
          – Read/Write reports on ground situation
      ● Official communications (top-down)
          – E.g. advice, warnings, or evacuation orders
  • 24. Scale: Tweets per Second
  • 25. Requirements
      ● Typical users
          – Emergency response services
          – Humanitarian relief agencies
          – Journalists and the Public
      ● Underspecified requirements that vary over time
      ● Usually a combination of:
          1) Capture the “Big Picture”
          2) Obtain “Actionable Insights”
  • 26. Understanding, Classifying and Extracting
  • 27. Example: “Media must report about d alleged 20k RSS chaps off 2 #Nepal.here’s a pic coz d 1 @ShainaNC shared isn’t true.. ;)”
  • 28. Social media messages
      ● Social media is more like a transcript of a conversation than like text meant to stand on its own
          – Awkward entry methods:
              ● Fragmented language and incomplete sentences
              ● Many typographic and grammatical errors
          – Conversational:
              ● Little or no context (hard to comprehend in isolation)
              ● Code switching and borrowing
              ● Internet slang
  • 29. Slang
  • 30. Classification. Diagram labels: Filtered tweets; Caution & Advice, Information Sources, Damage & Casualties, Donations, ...; Gov, Eyewitness, Media, NGO, Outsider, ...
  • 31. Classification Axes
      ● By usefulness (application-dependent!)
          – Not related, Related but useless, Useful
      ● By factual, subjective, or emotional content
      ● By information provided
      ● By information source
          – Government, NGOs, media, eyewitnesses, etc.
      ● By humanitarian clusters
  • 32. Humanitarian Clusters
  • 33. Humanitarian Clusters (cont.)
      Source: Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: "What to Expect When the Unexpected Happens: Social Media Communications Across Crises." In CSCW 2015.
  • 34. A large-scale study of crisis tweets
      ● Collect tweets from 26 disasters
      ● Classify according to:
          – Informative / Not informative
          – Information provided
          – Information source
      ● Several iterations required to write the “right” instructions
      Source: Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: "What to Expect When the Unexpected Happens: Social Media Communications Across Crises." In CSCW 2015, 14-18 March in Vancouver, Canada. ACM Press.
  • 35. Information Provided in Crisis Tweets. N=26; data available at http://crisislex.org/
  • 36. What do people tweet about?
      ● Affected individuals
          – 20% on average (min. 5%, max. 57%)
          – most prevalent in human-induced, focalized & instantaneous events
      ● Sympathy and emotional support
          – 20% on average (min. 3%, max. 52%)
          – most prevalent in instantaneous events
      ● Other useful information
          – 32% on average (min. 7%, max. 59%)
          – least prevalent in diffused events
  • 37. What do people tweet about? (cont.)
      ● Infrastructure and utilities
          – 7% on average (min. 0%, max. 22%)
          – most prevalent in diffused events, in particular floods
      ● Caution and advice
          – 10% on average (min. 0%, max. 34%)
          – least prevalent in instantaneous & human-induced events
      ● Donations and volunteering
          – 10% on average (min. 0%, max. 44%)
          – most prevalent in natural hazards
  • 38. Distribution over information sources
  • 39. Distribution over time
  • 40. Dataset: CrisisLexT26, www.crisislex.org
  • 41. Information Extraction (from classified tweets). Example: “@JimFreund: Apparently we have no choice. There is a tornado watch in effect tonight.”
  • 42. Extraction
      ● #hashtags, @user mentions, URLs, etc.
          – Regular expressions
          – Text library from Twitter
      ● Temporal expressions
          – Part-of-speech tagger + heuristics
          – Natty library
      ● Supervised learning
      Source: Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: "Practical Extraction of Disaster-Relevant Information from Social Media." Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013.
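The regular-expression route mentioned on slide 42 can be sketched in a few lines. The patterns below are deliberately simplified assumptions and do not reproduce the rules of Twitter's own text library, which handles many more edge cases (Unicode, punctuation, domain validation). The function name `extract_entities` is illustrative.

```python
import re

# Simplified patterns (assumptions, not Twitter's official rules).
HASHTAG = re.compile(r"#(\w+)")
MENTION = re.compile(r"(?<!\w)@(\w+)")   # lookbehind skips e-mail addresses
URL = re.compile(r"https?://\S+")


def extract_entities(tweet: str) -> dict:
    """Return hashtags, user mentions, and URLs found in a tweet."""
    urls = URL.findall(tweet)
    # Blank out URLs first so that fragments such as "page#top" are not
    # mistaken for hashtags.
    text = URL.sub(" ", tweet)
    return {
        "hashtags": HASHTAG.findall(text),
        "mentions": MENTION.findall(text),
        "urls": urls,
    }


# Tweet taken from the output examples on slide 46.
tweet = ("RT @twc_hurricane: Wind gusts over 60 mph are being reported at "
         "Central Park and JFK airport in #NYC this hour. #Sandy")
print(extract_entities(tweet))
```

Removing URLs before searching for hashtags is a design choice of this sketch; a production extractor would more likely rely on a maintained tokenizer.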
  • 43. Labels for extraction
      ● Type-dependent instruction
      ● Ask evaluators to copy-paste a word/phrase from each tweet
  • 44. Learning: Conditional Random Fields
      ● Extends HMM to incorporate more possible dependencies
      ● Used extensively in NLP for part-of-speech tagging and information extraction
      Diagram: HMM vs. linear-chain CRF, with hidden and observed nodes
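For readers who want the contrast on slide 44 in symbols, the standard textbook forms of the two models are given below. The notation is an assumption of this note rather than something shown on the slide: $x_t$ are the observed tokens, $y_t$ the hidden labels, $f_k$ feature functions with weights $\lambda_k$, and $p(y_1 \mid y_0)$ stands for the initial label distribution.

```latex
% HMM: a generative model of the joint distribution; each observation
% depends only on its own hidden label.
p(\mathbf{x}, \mathbf{y}) = \prod_{t=1}^{T} p(y_t \mid y_{t-1}) \, p(x_t \mid y_t)

% Linear-chain CRF: a discriminative model of the labels given the whole
% observed sequence; features may inspect any part of \mathbf{x}.
p(\mathbf{y} \mid \mathbf{x}) = \frac{1}{Z(\mathbf{x})}
  \exp\!\Big( \sum_{t=1}^{T} \sum_{k} \lambda_k \, f_k(y_{t-1}, y_t, \mathbf{x}, t) \Big),
\qquad
Z(\mathbf{x}) = \sum_{\mathbf{y}'} \exp\!\Big( \sum_{t=1}^{T} \sum_{k} \lambda_k \, f_k(y'_{t-1}, y'_t, \mathbf{x}, t) \Big)
```

The "more possible dependencies" of the slide correspond to the feature functions $f_k$, which can condition on the entire input sequence rather than on a single emitted token.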
  • 45. Tool
      ● CMU ARK Twitter NLP
          – Tokenization
          – Feature extraction
          – CRF learning
      ● Very easy to use
          – simply change the training set (part-of-speech tags),
          – then re-train
  • 46. Output examples
      ● RT @weatherchannel: .@NYGovCuomo orders closing of NYC bridges. Only Staten Island bridges unaffected at this time. Bridges must close by 7pm. #Sandy #NYC
      ● Wow what a mess #Sandy has made. Be sure to check on the elderly and homeless please! Thoughts and prayers to all affected
      ● RT @twc_hurricane: Wind gusts over 60 mph are being reported at Central Park and JFK airport in #NYC this hour. #Sandy
      ● RT @mitchellreports: Red Cross tells us grateful for Romney donation but prefer people send money or donate blood dont collect goods NOT best way to help #Sandy
  • 47. Extractor evaluation
      Setting                                     Recall   Precision
      Train 2/3 Joplin, Test 1/3 Joplin           78%      90%
      Train 2/3 Sandy, Test 1/3 Sandy             41%      79%
      Train Joplin, Test Sandy                    11%      78%
      Train Joplin + 10% Sandy, Test 90% Sandy    21%      81%
      ● For precision, an extraction counts as correct if it has one word or more in common with what humans extracted
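The lenient precision criterion on slide 47 (one word or more in common with the human extraction) can be sketched as follows. The slide does not define recall, so the recall function here is an assumption made for illustration only, and the predicted and reference phrases in the example are hypothetical, not taken from the study.

```python
def overlaps(predicted: str, reference: str) -> bool:
    """True if the two phrases share at least one lowercased word."""
    return bool(set(predicted.lower().split()) & set(reference.lower().split()))


def lenient_precision(predictions, references) -> float:
    """Share of extractor outputs that overlap the human extraction.

    Both arguments are parallel lists with one entry per tweet;
    None means that nothing was extracted for that tweet.
    """
    made = [(p, r) for p, r in zip(predictions, references) if p is not None]
    if not made:
        return 0.0
    correct = sum(1 for p, r in made if r is not None and overlaps(p, r))
    return correct / len(made)


def lenient_recall(predictions, references) -> float:
    """ASSUMPTION (not defined on the slide): share of human extractions
    for which the extractor produced an overlapping phrase."""
    labeled = [(p, r) for p, r in zip(predictions, references) if r is not None]
    if not labeled:
        return 0.0
    found = sum(1 for p, r in labeled if p is not None and overlaps(p, r))
    return found / len(labeled)


# Hypothetical extractor outputs and human labels for four tweets.
predictions = ["closing of NYC bridges", None, "Central Park", "Romney"]
references = ["NYC bridges", "check on the elderly",
              "Wind gusts over 60 mph", "send money or donate blood"]
print(lenient_precision(predictions, references))
print(lenient_recall(predictions, references))
```

Because a single shared word is enough, this criterion is generous to the extractor, which may partly explain why precision stays high in the table even when recall drops sharply across events.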
  • 48. Donations matching
      ● Identify and match requests/offers for donations
          – Money, clothing, food, shelter, volunteers, blood
      ● Method
          – Classify
          – Determine key aspects
          – Extract key aspects
          – Per-aspect matching
      Source: Hemant Purohit, Amit Sheth, Carlos Castillo, Patrick Meier, Fernando Diaz: "Emergency-Relief Coordination on Social Media: Automatically Matching Resource Requests and Offers." First Monday 19 (1), January 2014.
  • 49. Donations matching: average precision = 0.21 (0.16 if only text similarity is used)
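A toy illustration of per-aspect matching: the sketch below filters offers by a single aspect (the resource type) and then ranks the remaining candidates by Jaccard word overlap. This is a stand-in technique chosen for brevity, not the method of the cited paper; the record fields (`resource`, `text`) and the sample messages are hypothetical.

```python
def jaccard(a: str, b: str) -> float:
    """Jaccard similarity between the lowercased word sets of two texts."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def match_offers(request: dict, offers: list) -> list:
    """Keep offers with the same resource type as the request, then rank
    them by text similarity to the request, most similar first."""
    candidates = [o for o in offers if o["resource"] == request["resource"]]
    return sorted(candidates,
                  key=lambda o: jaccard(request["text"], o["text"]),
                  reverse=True)


# Hypothetical request and offers.
request = {"resource": "clothing",
           "text": "need warm clothing for families in Staten Island"}
offers = [
    {"resource": "clothing",
     "text": "have coats and warm clothing to donate in Staten Island"},
    {"resource": "clothing", "text": "donating kids shoes"},
    {"resource": "blood", "text": "want to donate blood in Staten Island"},
]
for offer in match_offers(request, offers):
    print(offer["text"])
```

Filtering on the resource type before comparing text mirrors the idea behind the slide's result, where using text similarity alone gave lower average precision than matching with extracted aspects.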
  • 50. Crisis maps from social media
  • 53. Patrick Meier, Social Innovation Director @ QCRI (http://irevolution.net/): “What can speed humanitarian response to tsunami-ravaged coasts? Expose human rights atrocities? Launch helicopters to rescue earthquake victims? Outwit corrupt regimes? A map.”
  • 54. Crisis mapping goes mainstream (2011)
  • 60. Automatic Mapping (floods)
      ● Top: hydrological data
      ● Bottom: tweet density
      ● Broad match with affected areas
      ● Many biases towards places with higher density of smartphones
      Source: De Albuquerque, João Porto, Herfort, Benjamin, Brenning, Alexander, and Zipf, Alexander. 2015. A geographic approach for combining social media and authoritative data towards identifying useful information for disaster management. International Journal of Geographical Information Science, 29(4), 667–689.
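The "tweet density" layer on slide 60 can be approximated by counting geotagged tweets per grid cell. The sketch below is a minimal illustration under stated assumptions: a fixed cell size in degrees, made-up coordinates, and no correction for the population or smartphone bias noted on the slide. It is not the method of the cited study.

```python
import math
from collections import Counter


def tweet_density(points, cell_deg: float = 0.1) -> Counter:
    """Count geotagged tweets per grid cell.

    points: iterable of (latitude, longitude) pairs in decimal degrees.
    cell_deg: cell size in degrees (an arbitrary choice for this sketch).
    Returns a Counter keyed by (row, column) cell indices.
    """
    counts = Counter()
    for lat, lon in points:
        cell = (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
        counts[cell] += 1
    return counts


# Made-up coordinates: two tweets fall in the same cell, one elsewhere.
points = [(51.05, 13.74), (51.06, 13.73), (51.34, 12.37)]
density = tweet_density(points)
print(density.most_common(1))
```

To reduce the bias mentioned on the slide, the raw counts could be normalized by an estimate of how many users are active in each cell, but that requires data not described in the talk.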
  • 61. Automatic Mapping (Dengue)
      ● Top: official reports
      ● Bottom: tweets
      Source: Gomide, Janaina and Veloso, Adriano and Meira, Wagner and Almeida, Virgilio and Benevenuto, Fabricio and Ferraz, Fernanda and Teixeira, Mauro (2011) Dengue surveillance based on a computational model of spatio-temporal locality of Twitter. pp. 1-8. In: Proceedings of the ACM WebSci'11, June 14-17 2011, Koblenz, Germany.
  • 62. Current Approach: hybrid real-time systems
      ● MicroMappers
      ● Manual processing: crowdsourcing
      ● Automatic processing: machine learning
  • 63. http://newsbeatsocial.com/watch/0_s6xxcr3p
  • 66. https://www.youtube.com/watch?v=uKgE3yWJ0_I
  • 67. Volunteering and Values
  • 68. Volunteering is a constant
      ● Integral part of how communities react to disasters
      ● Organizational types:
          – Existing
          – Extending
          – Expanding
          – Emerging
      ● Emergent organizations are a mixed blessing for existing ones
      ● New scenario: digital volunteering
          – E.g. volunteer annotations, including crisis mapping
  • 69. Why do people volunteer? Altruism is key, but it's one of many reasons
  • 70. Privacy and Ethics
      ● Protect the privacy of individuals
          – ICRC Data Protection Guidelines
          – UN Guidelines on Cyber Security
      ● Protect victims and responders during armed attacks
      ● Protect volunteers from distal exposure
      ● Protect citizen reporters from danger and retaliation
      ● Give back and share results and data
  • 71. “I'm dying, they are tweeting”: Digital Voyeurism
  • 72. CONCLUSIONS
  • 74. Diagram labels: Computationally feasible; Supported by data; Useful; Good projects in this space; Temptation!; Danger!; Poorly planned projects :-(; AI-complete problems
  • 75. Interdisciplinary Research
      ● Like many things, it has Good, Bad, and Ugly aspects
      ● Good – You learn a lot, and it's the only way of supporting claims of practical utility in applied research
      ● Bad – Formal response organizations can be very difficult to engage with; relationships should be established between operations
      ● Ugly – Working software and 24/7 support for a critical need now vs. advanced proof-of-concept later
  • 76. Possibility of large impact by using computer science to support humanitarian work = Applied computing at its best
  • 77. References
      ● Carlos Castillo: "Big Crisis Data." Cambridge University Press, 2016 (forthcoming).
      ● Muhammad Imran, Carlos Castillo, Fernando Diaz, Sarah Vieweg: "Processing Social Media Messages in Mass Emergency: A Survey." In ACM Computing Surveys, Volume 47, Issue 4, June 2015.
      ● Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: "What to Expect When the Unexpected Happens: Social Media Communications Across Crises." In CSCW 2015, 14-18 March in Vancouver, Canada. ACM Press.
      ● Muhammad Imran, Ioanna Lykourentzou, Yannick Naudet and Carlos Castillo: "Engineering Crowdsourced Stream Processing Systems." Technical report, 2015.
      ● Hemant Purohit, Amit Sheth, Carlos Castillo, Patrick Meier, Fernando Diaz: "Emergency-Relief Coordination on Social Media: Automatically Matching Resource Requests and Offers." First Monday 19 (1), January 2014.
      ● Sarah Vieweg, Carlos Castillo and Muhammad Imran: "Integrating Social Media Communications into the Rapid Assessment of Sudden Onset Disasters." SocInfo 2014.
      ● Alexandra Olteanu, Carlos Castillo, Fernando Diaz and Sarah Vieweg: "CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises." In ICWSM. Ann Arbor, MI, USA. June 2014.
      ● Carlos Castillo, Marcelo Mendoza, Barbara Poblete: "Predicting Information Credibility in Time-Sensitive Social Media." In Internet Research, Vol. 23, Issue 5, Special issue on The Predictive Power of Social Media, pp. 560-588. October 2013.
      ● Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: "Practical Extraction of Disaster-Relevant Information from Social Media." Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013.
      ● Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: "Extracting Information Nuggets from Disaster-Related Messages in Social Media." In ISCRAM. Baden-Baden, Germany, 2013. Best paper award.
  • 78. Thank you! Follow @BigCrisisData