SlideShare a Scribd company logo
Ph.D. in Information Retrieval, University of Chile
Computer scientist seeking to address issues of social significance
through data mining and interdisciplinary research.
Director of Research for Data Science
Eurecat @Eurecat_news
Big Crisis Data,
Towards Participatory
Data Mining
Carlos Castillo
@ChaToX
Trigger warning: this talk contains footage from recent disaster situations
Topic of this short talk: CRISIS INFORMATICS
Patrick Meier
QCRI →
Muhammad Imran
QCRI
Irina Temnikova
QCRI
Aditi Gupta
IIIT Delhi →
P. K. Kumaraguru
IIIT Delhi
Alexandra Olteanu
EPFL →
Ji Lucas
QCRI
Ferda Ofli
QCRI
Hemant Purohit
Wright State George Mason→
Book (2016) from
Cambridge University PressWork done (2012-2015) at Qatar Computing Research Institute
Sarah Vieweg
QCRI
Fernando Diaz
Microsoft
4
Two months ago (March 22, 2016)
Police asks the public:
1) to use social media, not phone;
2) to reduce video/audio streaming;
3) to avoid sharing real-time information
about police actions
Attacks in the airport and a metro station
in Brussels kill 35 and injure 340
5
2,800-words Wikipedia article
In the first 8 hours after the attacks ...
Reddit post with 17,000 comments
700+ YouTube videos per hour
Facebook pages and Safety Check
Tweets and photos
6 BIGCRISISDATA.ORG
A Common Pattern
Disaster or mass-convergence event
People have increased communication needs
People are familiar with social media
Internet is not bullet-proof but fairly resilient
Emergency agencies encourage social media usage
Intensive usage of social media by the public for
emergency communications
7 BIGCRISISDATA.ORG
ExampleS from #SMEM papers
“OMG! The fire seems out of control: It’s running down the hills!”
Bush fire near Marseilles, France, in 2009 [Longueville et al. 2009]
“Red River at East Grand Forks is 48.70 feet, +20.7 feet of flood stage... #flood09”
Red River Valley floods in 2009 [Starbird et al. 2010]
“My moms backyard in Hatteras. That dock is usually about 3 feet above water [photo]”
Hurricane Sandy 2013 [Leavitt and Clark 2014]
“Sirens going off now!! Take cover...be safe!”
Moore Tornado 2013 [Blanford et al. 2014].
“There is shooting at Utøya, my little sister is there and just called home!”
2011 attacks in Norway [Perng et al. 2013]
8 BIGCRISISDATA.ORG
Data miner reflex: to classify and cluster
Caution &
Advice
Information
Sources
Damage &
Casualties
Donations
Gov
Eyewitness
Media
NGO
Outsider
...
...
9 BIGCRISISDATA.ORG
A study of Twitter on 26 crises
Results from Olteanu et al. CSCW 2015. Data available at http://crisislex.org/
11 BIGCRISISDATA.ORG
Temporal progression
Peak
12 hr 24 hr 36 hr 48 hr ... several days
Caution and
advice
Sympathy and
support
Affected
individuals
Infrastructure
and utilities
Other specific
information
Donations and
volunteering
Results from Olteanu et al. CSCW 2015. Data available at http://crisislex.org/
12
Information Extraction + donations matching
...
Classified
tweets
@TheNGO looking for blood donors at the
Riverside Stadium
@APerson do you know where can I donate
blood near Middlesbrough?
See Purohit et al. 2013 for automatic donations matching.
13Facebook: Visualizing Crisis Relief in Nepal (September 2015)..
The geography of an event 2015 Nepal earthquake
15 BIGCRISISDATA.ORG
Community-powered crisis mapping
16 BIGCRISISDATA.ORG
Crowdsourced Mapping
See Ofli et al. 2016 for details.
17 BIGCRISISDATA.ORG
Automatic mapping
Floods in Germany
De Albuquerque et al. 2015
Dengue in Brazil
Gomide et al. 2015
Earthquakes in Italy
Cresci et al. 2014
18 BIGCRISISDATA.ORG
hybrid mapping: AIDR + MICROMAPPERS
Manual processing:
crowdsourcing
Automatic processing:
machine learning
See Imran et al. 2014 for details on AIDR. Find out more at http://aidr.qcri.org/
19See Imran et al. 2014 for details on AIDR. Find out more at http://aidr.qcri.org/
20 BIGCRISISDATA.ORG
The future: Real-Time Crowdsourced Mining
● Mapping disaster-affected
areas using UAVs
● Is crowdsourced stream
mining possible?
See Patrick Meier's blog post from Nov. 2015 for details.
21Visit the UAViators community for more information on video clickers.
22 BIGCRISISDATA.ORG
Computationally
feasible
Supported by
data
Useful
Good projects in this space
Temptation! Danger!
Poorly planned
projects :-(
AI-complete
problems
Thank YOU!
Patrick Meier
QCRI →
Muhammad Imran
QCRI
Irina Temnikova
QCRI
Aditi Gupta
IIIT Delhi →
P. K. Kumaraguru
IIIT Delhi
Alexandra Olteanu
EPFL →
Ji Lucas
QCRI
Ferda Ofli
QCRI
Hemant Purohit
Wright State George Mason→
Sarah Vieweg
QCRI
Fernando Diaz
Microsoft
chato@acm.org
BIGCRISISDATA.ORG
24
25

More Related Content

What's hot

The case for integrating crisis response with social media
The case for integrating crisis response with social media The case for integrating crisis response with social media
The case for integrating crisis response with social media
American Red Cross
 
Pacific Endeavor 2012 Presentation
Pacific Endeavor 2012 PresentationPacific Endeavor 2012 Presentation
Pacific Endeavor 2012 Presentation
Catherine Graham
 
Knowing Your Place - Smart Education - Schools - AC18
Knowing Your Place - Smart Education - Schools - AC18Knowing Your Place - Smart Education - Schools - AC18
Knowing Your Place - Smart Education - Schools - AC18
Esri UK
 
The Human Factor in Disaster Risk Reduction
The Human Factor in Disaster Risk ReductionThe Human Factor in Disaster Risk Reduction
The Human Factor in Disaster Risk Reduction
Prof. David E. Alexander (UCL)
 
New media and democratic society 1117 presentation
New media and democratic society 1117 presentationNew media and democratic society 1117 presentation
New media and democratic society 1117 presentation
Tina Moore
 
Typhoon pablo bopha activation
Typhoon pablo bopha activationTyphoon pablo bopha activation
Typhoon pablo bopha activation
Catherine Graham
 
Safecast long version oct 2015
Safecast long version oct 2015Safecast long version oct 2015
Safecast long version oct 2015
Safecast
 
Disasters 2.0: Real Time Collaboration: Documentation and Mapping
Disasters 2.0: Real Time Collaboration: Documentation and MappingDisasters 2.0: Real Time Collaboration: Documentation and Mapping
Disasters 2.0: Real Time Collaboration: Documentation and Mapping
Connie White
 
Government 2.0 Defined
Government 2.0 DefinedGovernment 2.0 Defined
Government 2.0 Defined
Walter Schwabe
 
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
Connie White
 
Technology and inequality
Technology and inequalityTechnology and inequality
Technology and inequality
David Wood
 
Open Foreste Italiane - Crisis Camp Europe -
Open Foreste Italiane - Crisis Camp Europe - Open Foreste Italiane - Crisis Camp Europe -
Open Foreste Italiane - Crisis Camp Europe -
Elena Rapisardi
 
Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...
Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...
Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...
Nalaka Gunawardene
 
CrisisCampUk: Where next for UK crisis crowdsourcing
CrisisCampUk: Where next for UK crisis crowdsourcingCrisisCampUk: Where next for UK crisis crowdsourcing
CrisisCampUk: Where next for UK crisis crowdsourcing
Sara-Jayne Terp
 
#EMAG2011 Use Social Media Now for Emergency Management
#EMAG2011 Use Social Media Now for Emergency Management#EMAG2011 Use Social Media Now for Emergency Management
#EMAG2011 Use Social Media Now for Emergency Management
Connie White
 
Humanitarian Informatics Approach for Cooperation between Citizens and Organi...
Humanitarian Informatics Approach for Cooperation between Citizens and Organi...Humanitarian Informatics Approach for Cooperation between Citizens and Organi...
Humanitarian Informatics Approach for Cooperation between Citizens and Organi...
Hemant Purohit
 
ATHack! Inc. - Social Good Hackathons
ATHack! Inc. - Social Good HackathonsATHack! Inc. - Social Good Hackathons
ATHack! Inc. - Social Good Hackathons
Ehb Teng
 
Introduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster ResponseIntroduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster Response
Muhammad Imran
 
HunchWorks: Combining Human Expertise and Big Data
HunchWorks: Combining Human Expertise and Big DataHunchWorks: Combining Human Expertise and Big Data
HunchWorks: Combining Human Expertise and Big Data
Dane Petersen
 
Progressive ethics in the digital age
Progressive ethics in the digital ageProgressive ethics in the digital age
Progressive ethics in the digital age
David Wood
 

What's hot (20)

The case for integrating crisis response with social media
The case for integrating crisis response with social media The case for integrating crisis response with social media
The case for integrating crisis response with social media
 
Pacific Endeavor 2012 Presentation
Pacific Endeavor 2012 PresentationPacific Endeavor 2012 Presentation
Pacific Endeavor 2012 Presentation
 
Knowing Your Place - Smart Education - Schools - AC18
Knowing Your Place - Smart Education - Schools - AC18Knowing Your Place - Smart Education - Schools - AC18
Knowing Your Place - Smart Education - Schools - AC18
 
The Human Factor in Disaster Risk Reduction
The Human Factor in Disaster Risk ReductionThe Human Factor in Disaster Risk Reduction
The Human Factor in Disaster Risk Reduction
 
New media and democratic society 1117 presentation
New media and democratic society 1117 presentationNew media and democratic society 1117 presentation
New media and democratic society 1117 presentation
 
Typhoon pablo bopha activation
Typhoon pablo bopha activationTyphoon pablo bopha activation
Typhoon pablo bopha activation
 
Safecast long version oct 2015
Safecast long version oct 2015Safecast long version oct 2015
Safecast long version oct 2015
 
Disasters 2.0: Real Time Collaboration: Documentation and Mapping
Disasters 2.0: Real Time Collaboration: Documentation and MappingDisasters 2.0: Real Time Collaboration: Documentation and Mapping
Disasters 2.0: Real Time Collaboration: Documentation and Mapping
 
Government 2.0 Defined
Government 2.0 DefinedGovernment 2.0 Defined
Government 2.0 Defined
 
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
A Framework to Identify Best Practices: Social Media and Web 2.0 Technologies...
 
Technology and inequality
Technology and inequalityTechnology and inequality
Technology and inequality
 
Open Foreste Italiane - Crisis Camp Europe -
Open Foreste Italiane - Crisis Camp Europe - Open Foreste Italiane - Crisis Camp Europe -
Open Foreste Italiane - Crisis Camp Europe -
 
Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...
Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...
Social Media in Sri Lanka: Do Science and Reason Stand a Chance? - Nalaka Gun...
 
CrisisCampUk: Where next for UK crisis crowdsourcing
CrisisCampUk: Where next for UK crisis crowdsourcingCrisisCampUk: Where next for UK crisis crowdsourcing
CrisisCampUk: Where next for UK crisis crowdsourcing
 
#EMAG2011 Use Social Media Now for Emergency Management
#EMAG2011 Use Social Media Now for Emergency Management#EMAG2011 Use Social Media Now for Emergency Management
#EMAG2011 Use Social Media Now for Emergency Management
 
Humanitarian Informatics Approach for Cooperation between Citizens and Organi...
Humanitarian Informatics Approach for Cooperation between Citizens and Organi...Humanitarian Informatics Approach for Cooperation between Citizens and Organi...
Humanitarian Informatics Approach for Cooperation between Citizens and Organi...
 
ATHack! Inc. - Social Good Hackathons
ATHack! Inc. - Social Good HackathonsATHack! Inc. - Social Good Hackathons
ATHack! Inc. - Social Good Hackathons
 
Introduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster ResponseIntroduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster Response
 
HunchWorks: Combining Human Expertise and Big Data
HunchWorks: Combining Human Expertise and Big DataHunchWorks: Combining Human Expertise and Big Data
HunchWorks: Combining Human Expertise and Big Data
 
Progressive ethics in the digital age
Progressive ethics in the digital ageProgressive ethics in the digital age
Progressive ethics in the digital age
 

Viewers also liked

Observational studies in social media
Observational studies in social mediaObservational studies in social media
Observational studies in social media
Carlos Castillo (ChaTo)
 
Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)
Carlos Castillo (ChaTo)
 
What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...
Carlos Castillo (ChaTo)
 
Fairness-Aware Data Mining
Fairness-Aware Data MiningFairness-Aware Data Mining
Fairness-Aware Data Mining
Carlos Castillo (ChaTo)
 
Crisis Computing
Crisis ComputingCrisis Computing
Crisis Computing
Carlos Castillo (ChaTo)
 
Discrimination Discovery
Discrimination DiscoveryDiscrimination Discovery
Discrimination Discovery
Carlos Castillo (ChaTo)
 
K-Means Algorithm
K-Means AlgorithmK-Means Algorithm
K-Means Algorithm
Carlos Castillo (ChaTo)
 
Clustering, k means algorithm
Clustering, k means algorithmClustering, k means algorithm
Clustering, k means algorithm
Junyoung Park
 

Viewers also liked (8)

Observational studies in social media
Observational studies in social mediaObservational studies in social media
Observational studies in social media
 
Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)
 
What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...
 
Fairness-Aware Data Mining
Fairness-Aware Data MiningFairness-Aware Data Mining
Fairness-Aware Data Mining
 
Crisis Computing
Crisis ComputingCrisis Computing
Crisis Computing
 
Discrimination Discovery
Discrimination DiscoveryDiscrimination Discovery
Discrimination Discovery
 
K-Means Algorithm
K-Means AlgorithmK-Means Algorithm
K-Means Algorithm
 
Clustering, k means algorithm
Clustering, k means algorithmClustering, k means algorithm
Clustering, k means algorithm
 

Similar to Databeers: Big Crisis Data

The age of analytics
The age of analyticsThe age of analytics
The age of analytics
bis_foresight
 
Big Data Paper
Big Data PaperBig Data Paper
Big Data Paper
Andile Ngcaba
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
Liliana Bounegru
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
Jonathan Gray
 
Presentation ISCRAM 2012
Presentation ISCRAM 2012Presentation ISCRAM 2012
Presentation ISCRAM 2012
Twittercrisis
 
Invasion Of Privacy In Canadian Media
Invasion Of Privacy In Canadian MediaInvasion Of Privacy In Canadian Media
Invasion Of Privacy In Canadian Media
Kelly Ratkovic
 
Data! Action! Data journalism issues to watch in the next 10 years
Data! Action! Data journalism issues to watch in the next 10 yearsData! Action! Data journalism issues to watch in the next 10 years
Data! Action! Data journalism issues to watch in the next 10 years
Paul Bradshaw
 
Role of Data Accessibility During Pandemic
Role of Data Accessibility During PandemicRole of Data Accessibility During Pandemic
Role of Data Accessibility During Pandemic
Databricks
 
Fake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sitesFake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sites
Petter Bae Brandtzæg
 
Oxford Internet Institute - Twitter predicts epidemics
Oxford Internet Institute - Twitter predicts epidemicsOxford Internet Institute - Twitter predicts epidemics
Oxford Internet Institute - Twitter predicts epidemics
Patty Kostkova
 
Wedf brochure (september)
Wedf brochure (september)Wedf brochure (september)
Wedf brochure (september)
Morne Olivier
 
Human Rights Council Study Guide
Human Rights Council Study GuideHuman Rights Council Study Guide
Human Rights Council Study Guide
dudasings
 
Mac201 spies and whistleblowers lecture
Mac201 spies and whistleblowers lectureMac201 spies and whistleblowers lecture
Mac201 spies and whistleblowers lecture
Rob Jewitt
 
Volta05_0 (3)
Volta05_0 (3)Volta05_0 (3)
Volta05_0 (3)
Hanneke Teunissen
 
Internet of Things - IoT Webinar 2013
Internet of Things - IoT Webinar 2013Internet of Things - IoT Webinar 2013
Internet of Things - IoT Webinar 2013
Desiree Miloshevic
 
Med312 spies and whistleblowers lecture
Med312 spies and whistleblowers lectureMed312 spies and whistleblowers lecture
Med312 spies and whistleblowers lecture
Rob Jewitt
 
Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015
CSI Piemonte
 
Io t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @Des
Io t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @DesIo t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @Des
Io t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @Des
Desiree Miloshevic
 
Open Data Sources for Disaster Management
Open Data Sources for Disaster ManagementOpen Data Sources for Disaster Management
Open Data Sources for Disaster Management
Michal Bodnar
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Darlene Cavalier
 

Similar to Databeers: Big Crisis Data (20)

The age of analytics
The age of analyticsThe age of analytics
The age of analytics
 
Big Data Paper
Big Data PaperBig Data Paper
Big Data Paper
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
 
Presentation ISCRAM 2012
Presentation ISCRAM 2012Presentation ISCRAM 2012
Presentation ISCRAM 2012
 
Invasion Of Privacy In Canadian Media
Invasion Of Privacy In Canadian MediaInvasion Of Privacy In Canadian Media
Invasion Of Privacy In Canadian Media
 
Data! Action! Data journalism issues to watch in the next 10 years
Data! Action! Data journalism issues to watch in the next 10 yearsData! Action! Data journalism issues to watch in the next 10 years
Data! Action! Data journalism issues to watch in the next 10 years
 
Role of Data Accessibility During Pandemic
Role of Data Accessibility During PandemicRole of Data Accessibility During Pandemic
Role of Data Accessibility During Pandemic
 
Fake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sitesFake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sites
 
Oxford Internet Institute - Twitter predicts epidemics
Oxford Internet Institute - Twitter predicts epidemicsOxford Internet Institute - Twitter predicts epidemics
Oxford Internet Institute - Twitter predicts epidemics
 
Wedf brochure (september)
Wedf brochure (september)Wedf brochure (september)
Wedf brochure (september)
 
Human Rights Council Study Guide
Human Rights Council Study GuideHuman Rights Council Study Guide
Human Rights Council Study Guide
 
Mac201 spies and whistleblowers lecture
Mac201 spies and whistleblowers lectureMac201 spies and whistleblowers lecture
Mac201 spies and whistleblowers lecture
 
Volta05_0 (3)
Volta05_0 (3)Volta05_0 (3)
Volta05_0 (3)
 
Internet of Things - IoT Webinar 2013
Internet of Things - IoT Webinar 2013Internet of Things - IoT Webinar 2013
Internet of Things - IoT Webinar 2013
 
Med312 spies and whistleblowers lecture
Med312 spies and whistleblowers lectureMed312 spies and whistleblowers lecture
Med312 spies and whistleblowers lecture
 
Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015
 
Io t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @Des
Io t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @DesIo t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @Des
Io t malta_2013 Internet of Things IoT Webinar Dec 2013 #iot @Des
 
Open Data Sources for Disaster Management
Open Data Sources for Disaster ManagementOpen Data Sources for Disaster Management
Open Data Sources for Disaster Management
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
 

More from Carlos Castillo (ChaTo)

Finding High Quality Content in Social Media
Finding High Quality Content in Social MediaFinding High Quality Content in Social Media
Finding High Quality Content in Social Media
Carlos Castillo (ChaTo)
 
When no clicks are good news
When no clicks are good newsWhen no clicks are good news
When no clicks are good news
Carlos Castillo (ChaTo)
 
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Carlos Castillo (ChaTo)
 
Natural experiments
Natural experimentsNatural experiments
Natural experiments
Carlos Castillo (ChaTo)
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
Carlos Castillo (ChaTo)
 
Link prediction
Link predictionLink prediction
Link prediction
Carlos Castillo (ChaTo)
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Carlos Castillo (ChaTo)
 
Graph Partitioning and Spectral Methods
Graph Partitioning and Spectral MethodsGraph Partitioning and Spectral Methods
Graph Partitioning and Spectral Methods
Carlos Castillo (ChaTo)
 
Finding Dense Subgraphs
Finding Dense SubgraphsFinding Dense Subgraphs
Finding Dense Subgraphs
Carlos Castillo (ChaTo)
 
Graph Evolution Models
Graph Evolution ModelsGraph Evolution Models
Graph Evolution Models
Carlos Castillo (ChaTo)
 
Link-Based Ranking
Link-Based RankingLink-Based Ranking
Link-Based Ranking
Carlos Castillo (ChaTo)
 
Text Indexing / Inverted Indices
Text Indexing / Inverted IndicesText Indexing / Inverted Indices
Text Indexing / Inverted Indices
Carlos Castillo (ChaTo)
 
Indexing
IndexingIndexing
Text Summarization
Text SummarizationText Summarization
Text Summarization
Carlos Castillo (ChaTo)
 
Hierarchical Clustering
Hierarchical ClusteringHierarchical Clustering
Hierarchical Clustering
Carlos Castillo (ChaTo)
 
Clustering
ClusteringClustering
Text similarity and the vector space model
Text similarity and the vector space modelText similarity and the vector space model
Text similarity and the vector space model
Carlos Castillo (ChaTo)
 
Intro to Creative Commons (May 2015)
Intro to Creative Commons (May 2015)Intro to Creative Commons (May 2015)
Intro to Creative Commons (May 2015)
Carlos Castillo (ChaTo)
 
Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...
Carlos Castillo (ChaTo)
 
Crisis Informatics (November 2013)
Crisis Informatics (November 2013)Crisis Informatics (November 2013)
Crisis Informatics (November 2013)
Carlos Castillo (ChaTo)
 

More from Carlos Castillo (ChaTo) (20)

Finding High Quality Content in Social Media
Finding High Quality Content in Social MediaFinding High Quality Content in Social Media
Finding High Quality Content in Social Media
 
When no clicks are good news
When no clicks are good newsWhen no clicks are good news
When no clicks are good news
 
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
 
Natural experiments
Natural experimentsNatural experiments
Natural experiments
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
 
Link prediction
Link predictionLink prediction
Link prediction
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Graph Partitioning and Spectral Methods
Graph Partitioning and Spectral MethodsGraph Partitioning and Spectral Methods
Graph Partitioning and Spectral Methods
 
Finding Dense Subgraphs
Finding Dense SubgraphsFinding Dense Subgraphs
Finding Dense Subgraphs
 
Graph Evolution Models
Graph Evolution ModelsGraph Evolution Models
Graph Evolution Models
 
Link-Based Ranking
Link-Based RankingLink-Based Ranking
Link-Based Ranking
 
Text Indexing / Inverted Indices
Text Indexing / Inverted IndicesText Indexing / Inverted Indices
Text Indexing / Inverted Indices
 
Indexing
IndexingIndexing
Indexing
 
Text Summarization
Text SummarizationText Summarization
Text Summarization
 
Hierarchical Clustering
Hierarchical ClusteringHierarchical Clustering
Hierarchical Clustering
 
Clustering
ClusteringClustering
Clustering
 
Text similarity and the vector space model
Text similarity and the vector space modelText similarity and the vector space model
Text similarity and the vector space model
 
Intro to Creative Commons (May 2015)
Intro to Creative Commons (May 2015)Intro to Creative Commons (May 2015)
Intro to Creative Commons (May 2015)
 
Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...Characterizing the Life Cycle of Online News Stories Using Social Media React...
Characterizing the Life Cycle of Online News Stories Using Social Media React...
 
Crisis Informatics (November 2013)
Crisis Informatics (November 2013)Crisis Informatics (November 2013)
Crisis Informatics (November 2013)
 

Recently uploaded

Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Tatiana Kojar
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
GDSC PJATK
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 

Recently uploaded (20)

Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 

Databeers: Big Crisis Data

  • 1. Ph.D. in Information Retrieval, University of Chile Computer scientist seeking to address issues of social significance through data mining and interdisciplinary research. Director of Research for Data Science Eurecat @Eurecat_news Big Crisis Data, Towards Participatory Data Mining Carlos Castillo @ChaToX Trigger warning: this talk contains footage from recent disaster situations
  • 2. Topic of this short talk: CRISIS INFORMATICS Patrick Meier QCRI → Muhammad Imran QCRI Irina Temnikova QCRI Aditi Gupta IIIT Delhi → P. K. Kumaraguru IIIT Delhi Alexandra Olteanu EPFL → Ji Lucas QCRI Ferda Ofli QCRI Hemant Purohit Wright State George Mason→ Book (2016) from Cambridge University PressWork done (2012-2015) at Qatar Computing Research Institute Sarah Vieweg QCRI Fernando Diaz Microsoft
  • 3. 4 Two months ago (March 22, 2016) Police asks the public: 1) to use social media, not phone; 2) to reduce video/audio streaming; 3) to avoid sharing real-time information about police actions Attacks in the airport and a metro station in Brussels kill 35 and injure 340
  • 4. 5 2,800-words Wikipedia article In the first 8 hours after the attacks ... Reddit post with 17,000 comments 700+ YouTube videos per hour Facebook pages and Safety Check Tweets and photos
  • 5. 6 BIGCRISISDATA.ORG A Common Pattern Disaster or mass-convergence event People have increased communication needs People are familiar with social media Internet is not bullet-proof but fairly resilient Emergency agencies encourage social media usage Intensive usage of social media by the public for emergency communications
  • 6. 7 BIGCRISISDATA.ORG ExampleS from #SMEM papers “OMG! The fire seems out of control: It’s running down the hills!” Bush fire near Marseilles, France, in 2009 [Longueville et al. 2009] “Red River at East Grand Forks is 48.70 feet, +20.7 feet of flood stage... #flood09” Red River Valley floods in 2009 [Starbird et al. 2010] “My moms backyard in Hatteras. That dock is usually about 3 feet above water [photo]” Hurricane Sandy 2013 [Leavitt and Clark 2014] “Sirens going off now!! Take cover...be safe!” Moore Tornado 2013 [Blanford et al. 2014]. “There is shooting at Utøya, my little sister is there and just called home!” 2011 attacks in Norway [Perng et al. 2013]
  • 7. 8 BIGCRISISDATA.ORG Data miner reflex: to classify and cluster Caution & Advice Information Sources Damage & Casualties Donations Gov Eyewitness Media NGO Outsider ... ...
  • 8. 9 BIGCRISISDATA.ORG A study of Twitter on 26 crises Results from Olteanu et al. CSCW 2015. Data available at http://crisislex.org/
  • 9. 11 BIGCRISISDATA.ORG Temporal progression Peak 12 hr 24 hr 36 hr 48 hr ... several days Caution and advice Sympathy and support Affected individuals Infrastructure and utilities Other specific information Donations and volunteering Results from Olteanu et al. CSCW 2015. Data available at http://crisislex.org/
  • 10. 12 Information Extraction + donations matching ... Classified tweets @TheNGO looking for blood donors at the Riverside Stadium @APerson do you know where can I donate blood near Middlesbrough? See Purohit et al. 2013 for automatic donations matching.
  • 11. 13Facebook: Visualizing Crisis Relief in Nepal (September 2015).. The geography of an event 2015 Nepal earthquake
  • 13. 16 BIGCRISISDATA.ORG Crowdsourced Mapping See Ofli et al. 2016 for details.
  • 14. 17 BIGCRISISDATA.ORG Automatic mapping Floods in Germany De Albuquerque et al. 2015 Dengue in Brazil Gomide et al. 2015 Earthquakes in Italy Cresci et al. 2014
  • 15. 18 BIGCRISISDATA.ORG hybrid mapping: AIDR + MICROMAPPERS Manual processing: crowdsourcing Automatic processing: machine learning See Imran et al. 2014 for details on AIDR. Find out more at http://aidr.qcri.org/
  • 16. 19See Imran et al. 2014 for details on AIDR. Find out more at http://aidr.qcri.org/
  • 17. 20 BIGCRISISDATA.ORG The future: Real-Time Crowdsourced Mining ● Mapping disaster-affected areas using UAVs ● Is crowdsourced stream mining possible? See Patrick Meier's blog post from Nov. 2015 for details.
  • 18. 21Visit the UAViators community for more information on video clickers.
  • 19. 22 BIGCRISISDATA.ORG Computationally feasible Supported by data Useful Good projects in this space Temptation! Danger! Poorly planned projects :-( AI-complete problems
  • 20. Thank YOU! Patrick Meier QCRI → Muhammad Imran QCRI Irina Temnikova QCRI Aditi Gupta IIIT Delhi → P. K. Kumaraguru IIIT Delhi Alexandra Olteanu EPFL → Ji Lucas QCRI Ferda Ofli QCRI Hemant Purohit Wright State George Mason→ Sarah Vieweg QCRI Fernando Diaz Microsoft chato@acm.org BIGCRISISDATA.ORG
  • 21. 24
  • 22. 25