SlideShare a Scribd company logo
Big Data for
Social Good
Nuria Oliver, PhD
Scientific Director
Telefonica R&D
7+ billion mobile phones worldwide
97% of world’s population (ITU)
Mobile penetration of 120% to 89% of population (ITU)
Emerging and developed regions
More time spent on our phones than watching TV or with our with
our partner (US and UK)
Mobile Network Infrastructure
● A cell tower (also known as BTS or base station) provides an
approximate location
● Spacing between towers is 500m-1km in urban areas, 2-3 km in
rural areas
● Each tower has a “Voronoi cell”; this is the area to which the
tower provides best coverage i.e. tower is closest
Typical Mobile Data
• CDR
• SMS
Consumption Social Network Mobility
Call duration In/Out Degree Radius of gyration
N. Events Delta w.r.t time window Travelled distance
Lapse between events Unique Calls per day
Rate of popular
antennas
Reciprocated events Unique SMS per day
Regularity of popular
antennas
… … …
HR_ORG TLFN_A TLFN_B CD_GEO_A CD_GEO_B DT_ORG CD_SNTD CD_ERB CD_CCC QT_DUR
20:05:31 XXX YYY 3 11 20140519 2 1562 568 33
… … … … … … … … … …
HR_ORG TLFN_A TLFN_B CD_GEO_A CD_GEO_B DT_ORG CD_SNTD QT_TRFG
15:53:54 XXX ZZZ 3 25 20140506 2 1
… … … … … … … …
Cell
Phone
Data
BEHAVIORAL
INSIGHTS
Urban
Planning
Tools
Official
Statistics
Tools
TELEFONICA RESEARCH INSTITUTIONS & POLICY MAKERS
Crisis
Management
Tools
Public
Health Tools
Big Data for Social Good @ Telefonica
Crime Prediction
Analysis of impact of floods
http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver
70% accuracy
Natural Disasters
Flooding Through the Lens of Mobile
Phone Activity
David Pastor-Escuredo1, Alfredo Morales-Guzmán1, Yolanda
Torres-Fernández1, Jean-Martin Bauer2, Amit Wadhwa2, Carlos
Castro-Correa3, Liudmyla Romanoff4, Jong Gun Lee4, Alex
Rutherford4, Vanessa Frias-Martinez5, Nuria Oliver6, Enrique Frias-
Martinez6, Miguel Luengo-Oroz4
1. Universidad Politécnica de Madrid
2. Vulnerability Analysis and Mapping, World Food Programme
3. Coordinación de Estrategia Digital Nacional, Presidencia de la República de México
4. United Nations Global Pulse
5. University Maryland
6. Telefónica Research
UN Global Pulse is an innovation initiative of the United Nations Secretary-
General, driving a big data revolution for development.
Its vision is to see big data used safely and responsibly for public good; its
mission to accelerate discovery, development and adoption of big data
and real-time analytics for sustainable development and humanitarian
action.
Tabasco Floods
• Tabasco state frequently suffers flooding
• 800mm of rain 31st October – 3rd November 2009 (4x
normal)
• 214,000 people affected
• State of emergency declared
Reconstructing the Floods
• We combined mobile data with
NASA LANDSAT satellite imagery
data, governmental surveys and
census data
• NASA LANDSAT  geolocation of
the most affected areas
• Census data  representativeness
of mobile data
• Precipitations data and civil
protection reports  timeline of
people’s awareness and response
CDR Representativeness
● Mobile user
representativeness
validated against census
data
● Population estimates
based on CDRs are strongly
correlated with official
Census data and
population statistics
● Mobile data  proxy of
population distribution in
areas where other data
sources are not available
or reliable
R2=0.97
Event Detection
x(t) is raw number of unique phones
placing or receiving calls in each
antennae per day
● Quantify impact of floods by
comparing behavior of BTS against
baseline activity using z-score like
metric
● BTS with higher variations in the
number of calls made during the
floods were located in the most
affected regions
● Special events (e.g. Christmas) led
to similar changes in behavior wrt
baseline, but across the entire
region as opposed to in specific
locations
Actionable Insights
● Patterns in BTS activity can be
used to measure the impact of
floodings
● Behavioral insights are important
for emergency services
● People communicated more as a
result of the initial impacts of the
floodings, rather than following the
recommendations of public
protection
● Civil protection warnings did not
seem to be an effective way to
raise awareness
● Spikes in activity are observed only
after the situation was critical
Big Data for Social Good @ Telefonica
Crime Prediction
Analysis of impact of floods
http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver
70% accuracy
Pandemics
Vanessa Frias-Martinez, Enrique Frias-Martinez et al
Data Analysis
• Call Records from 1st Jan till 31st May 2009
• Compute mobility as different number of BTS visited
• Stages
• Medical Alert - Stage 1 (17th-27th April)
• Closing Schools - Stage 2 (28th-1st May)
• Suspension of Essential Activities - Stage 3 (1st May-6th May)
• Baselines
• same periods, different year (2008)
80% 55%0%
Agent-based Disease Model
Mobility
Model
Social
Network
Model
Disease
Model
Impact on disease propagation
10%
40h
Impact on disease propagation
Big Data for Social Good @ Telefonica
Crime Prediction
Analysis of impact of floods
http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver
70% accuracy
Crime
Work with Bogomolov, A., Lepri, B., Staiano, J., Pianesi, F., Pentland, A.
 Affects quality of life and economic development both
at the national and local level
 Several studies explore relationships between crime
and socio-economic variables: education, income,
unemployment, ethnicity, …
 Several studies have shown significant concentrations
of crime in small geographical areas: crime hotspots
Crime
 T1: Natural surveillance as key deterrent for crime:
people moving around are eyes on the street
(Jacobs, 1961)
 high diversity among the population and high
number of visitors -> less crime
 T2: Defensible space theory (Newman, 1972)
 high mix of people -> more crime
Crime and Urban Environment
People-centric perspective vs Place-
centric perspective
 people-centric perspective used for
individual or collective criminal profiling
 place-centric perspective used for
predicting crime hotspots
Crime Prediction
Data-driven and place-centric approach to
crime prediction
 Multimodal approach: people dynamics
derived from mobile network data and
demographics
European metropolis: London
Prediction of crime hotspots and not criminals
profiling
Our Approach
Smartsteps Dataset:
 for each of the Smartsteps cells a variety of demographic and
human dynamics variables were computed every hour for 3
weeks (from December 9 to December 15, 2012 and from
December 23, 2012 to January 5, 2013)
Criminal Cases Dataset:
 criminal cases for December 2012 and for January 2013
London Borough Profiles Dataset:
 open dataset containing 68 metrics about the population of a
particular geographic area
Multimodal Approach: Data
• Footfall count: Shows the trend in footfall in a
specified area hourly, daily, weekly and monthly.
Provides a basic profile of the crowd.
• Catchment area: Shows which postal sectors are
your customers coming from by hour, day, week
and month. Shows the “battleground” for two sites.
• Transport mode: Shows flows of crowds from any
two points, segmented by road, air, train, etc.
SmartSteps
For each cell and for each hour the dataset contains:
 an estimation of how many people are in the cell
 the percentage of these people at home, at work or
just visiting the cell
 the gender splits (male vs. female)
 the age splits (0-20 years, 21-30 years, 31-40 years, …)
SmartSteps Data
 Crime geolocation for 2 months (December 2012 –
January 2013)
 All reported crimes in UK specifying month and year and
not specific day/time
 Median crime value (=5) used as threshold
 Spatial granularity of borough profiles is at LSOA levels:
LSOA are small geographical areas defined by UK Office
for National Statistics (mean population: 1500)
Crime Data
 68 metrics about the population of a specific geographical area:
demographics, households, migrant population, employment,
earnings, life expectancy, happiness levels, house prices, etc.
 Spatial granularity of borough profiles is at LSOA levels:
 LSOA are small geographical areas defined by UK Office for
National Statistics (mean population: 1500)
London Borough Profiles Data
From Smartsteps data we extract
 1st order features (mean, median, min., max.,
entropy, etc.)
 2nd order features on sliding windows of variable
length (1 hour, 4 hours, 1 day, etc.) to account for
temporal patterns
Feature Extraction
Feature Selection
 Mean decrease in Gini coefficient of inequality
 the feature with maximum mean decrease in Gini
coefficient is expected to have the maximum
influence in minimizing the out-of-the-bag error
 The feature selection process produced a
reduced subset of 68 features (from an initial pool
of about 6000 features)
Classification Approach
 Binary classification task: high crime area vs low crime
area
 10-fold cross-validation approach
 Classifier: Random Forest (RF)
 RF overcomes logistic regression, support vector
machines, neural networks, decision trees
Smartsteps-based classifier significantly outperforms baseline
majority and borough profiles-based classifiers
Experimental Results
ground-truth
Experimental Results
~70% accuracy in predicting crime hotspots
predictions
 Features encoding daily dynamics have more predictive power
than features extracted on a monthly basis
 Relevance of high number of residents to predict crime areas
 increased ratio of residents -> more crime (in contrast with
Newman’s thesis)
 Entropy-based features are useful for predicting the crime
hotspots
 high diversity of functions (home vs work) and high diversity of
people (gender and age) act as eyes on street decreasing
crime (in line with Jacobs’ thesis)
Relevant Features
 Only 6 out of 68 features in the joint model are
London Borough features, namely
%working population claiming out of work benefits
Largest migrant population
% overseas nationals entering the UK
% resident population born abroad
Relevant Features
 Our method captures the dynamics of a place rather
than making extrapolations from previous crime
histories. We can use it in areas where people are less
inclined to report crimes
Our method provides new ways of describing
geographical areas: novel risk-inducing or risk-reducing
features of geographical areas
Implications
Relevant Publications
“A gender-centric analysis of calling behavior in a developing country using
CDRS”, Vanessa Frias-Martinez, Enrique Frias-Martinez and Oliver, N.,
Proceedings of AAAI, 2009
"Prediction of Socioeconomic Levels using Cell Phone Records", Victor Soto
and Vanessa Frias-Martinez and Jesus Virseda and Enrique Frias-Martinez,
International Conference on User Modeling, Adaptation and Personalization,
UMAP'11, Industrial Track, Girona, Spain, 2011
"An Agent-Based Model Of Epidemic Spread Using Human Mobility and Social
Network Information", Vanessa Frias-Martinez et al, 3rd International
Conference on Social Computing, SocialCom '11, Boston, USA, 2011.
Talk at WIRED 2013. London UK
http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver
Relevant Publications
“Moves on the street: Predicting Crime Hotspots using aggregated
anonymized data on people dynamics” - A. Bogomolov, B. Lepri, J. Staiano,
Leouze, E., N. Oliver, F. Pianesi, A. Pentland Journal of Big Data (Big Data
Journal 2015)
“Once Upon a Crime: Towards Crime Prediction from Demographics and
Mobile Data” - A. Bogomolov, B. Lepri, J. Staiano, N. Oliver, F. Pianesi, A.
Pentland 16th ACM International Conference on Multimodal Interaction
(ICMI 2014)
"Flooding through the Lens of Mobile Phone Activity"
Pastor-Escuredo, D., Torres Fernandez, Y., Bauer, J.M., Wadhwa, A., Castro-
Correa, C., Romanoff, L., Lee, J.G., Rutherford, A., Frias-Martinez, V., Oliver,
N., Frias-Martinez, E. and Luengo-Oroz, M. Proceedins of IEEE Global
Humanitarian Technology Conference, GHTC 2014, Silicon Valley, CA, Oct
2014
Opportunities
Temporal and Spatial Granularity
• Big Data can be available in real-time or if not in real time much more frequently than how
data is typically collected (every 5-10 years for example for census data);
• Some kinds of Big Data (e.g. data about the city, collected by sensors placed in the urban
infrastructure) can be collected with significantly finer grained spatial granularity than with
traditional methods;
Cost and Effort
Accuracy
• It could be argued that some kinds of data that are relevant for the public sector (e.g.
migrations) can be collected more accurately by automatic means through Big Data
platforms than by manual means as it is the state of the art.
• In addition, given that there isn’t a human-in-the-loop, the data is less prone to human errors
and potential biases introduced by humans.
• Most of the Big Data that could be used for the public sector is data that
has been collected already for other purposes. In addition, Big Data is
typically collected by automatic means which makes its collection very
cost-efficient;
Challenges
Risk/Benefit Analysis
• Even while adopting all sorts of precautions, if by any chance there is an error in the data
extraction such that there could be a privacy leak, the consequences for the business could
be devastating
• Hence, the potential benefit would have to be really large to compensate for the risk.
• As this is an emergent research area, the benefits are still to be defined
Regulatory/Social
• Lack of updated regulation
• Lack of clear guidelines regarding safe data handling, processing and sharing for
humanitarian purposes
• Risk of potential unintended social consequences
• Risk of creating a digital divide, unbalanced access to data and-or expertise on how to
analyze it and make sense of it
Internal Barriers
• Big Data for Development/Social Good projects are typically not part of
any business unit in the Telcos;
Technical
• Representativeness of the data, generalization
• Combination of data from multiple sources
• Real-time analysis and prediction
• Lack of ground truth  intervention to validate
• Significant vs substantially significant
• Correlations vs causality
Privacy/Security
• Potential privacy risks need to be minimized and understood.
Control and transparency
• Security and traceability of the data
• Clear code of conduct and ethical principles when dealing
with data
• Strict access control when appropriate
Framework for Data Sharing
Remote
Access
Question and
Answers
Limited
License
Pre-computed
Indicators &
Synthetic
Data
Security
Secure access to the data
Anonymization
Limited or aggregated data release
Dataprotectedthrough
ApplicativeExploratory
Low to medium
number of users
Medium to large number
of users and open data
Development stage
What can we all do to
responsibly turn this
opportunity into a reality ?
Thanks!
nuria.oliver@telefonica.com
@nuriaoliver

More Related Content

Viewers also liked

Big data luiss Facebook and epistemology
Big data luiss Facebook and epistemologyBig data luiss Facebook and epistemology
Big data luiss Facebook and epistemologyTeresa Numerico
 
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...Sheryl Grant
 
Big Data and its emergence
Big Data and its emergenceBig Data and its emergence
Big Data and its emergencekoolkalpz
 
Anticipatory Intelligence
Anticipatory IntelligenceAnticipatory Intelligence
Anticipatory IntelligenceAbe Usher
 
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...Han Woo PARK
 
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...mwe400
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeJosh Cowls
 
Internet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam UniversityInternet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam Universitymwe400
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESMicah Altman
 
Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...Eric Meyer
 
Big Data and the Social Sciences
Big Data and the Social SciencesBig Data and the Social Sciences
Big Data and the Social SciencesAbe Usher
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?BalaBit
 
Big data in social sciences and IT developments (ethics considerations)
Big data in social sciences and IT developments (ethics considerations)Big data in social sciences and IT developments (ethics considerations)
Big data in social sciences and IT developments (ethics considerations)Efthimios Tambouris
 
Big Data and Social Sciences
Big Data and Social SciencesBig Data and Social Sciences
Big Data and Social SciencesDavid De Roure
 
Big Data Challenges for the Social Sciences
Big Data Challenges for the Social SciencesBig Data Challenges for the Social Sciences
Big Data Challenges for the Social SciencesDavid De Roure
 
Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)
Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)
Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)Rich Heimann
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Academia Sinica
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Lauri Eloranta
 
5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentation5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentationInfobrandz
 

Viewers also liked (20)

Big data luiss Facebook and epistemology
Big data luiss Facebook and epistemologyBig data luiss Facebook and epistemology
Big data luiss Facebook and epistemology
 
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
 
Big Data and its emergence
Big Data and its emergenceBig Data and its emergence
Big Data and its emergence
 
Anticipatory Intelligence
Anticipatory IntelligenceAnticipatory Intelligence
Anticipatory Intelligence
 
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
 
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...Big Data? Big Issues:  Degradation in Longitudinal Data and Implications for ...
Big Data? Big Issues: Degradation in Longitudinal Data and Implications for ...
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Internet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam UniversityInternet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam University
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
 
Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...
 
Big Data and the Social Sciences
Big Data and the Social SciencesBig Data and the Social Sciences
Big Data and the Social Sciences
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?
 
Big data in social sciences and IT developments (ethics considerations)
Big data in social sciences and IT developments (ethics considerations)Big data in social sciences and IT developments (ethics considerations)
Big data in social sciences and IT developments (ethics considerations)
 
Big Data and Social Sciences
Big Data and Social SciencesBig Data and Social Sciences
Big Data and Social Sciences
 
Big Data Challenges for the Social Sciences
Big Data Challenges for the Social SciencesBig Data Challenges for the Social Sciences
Big Data Challenges for the Social Sciences
 
Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)
Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)
Big Social Data: The Spatial Turn in Big Data (Video available soon on YouTube)
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 
Seminário - Apresentação do artigo - Ontologies methodologies-and-new-uses-of...
Seminário - Apresentação do artigo - Ontologies methodologies-and-new-uses-of...Seminário - Apresentação do artigo - Ontologies methodologies-and-new-uses-of...
Seminário - Apresentação do artigo - Ontologies methodologies-and-new-uses-of...
 
5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentation5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentation
 

Similar to The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs Connect

Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...
Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...
Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...Enrique Frias-Martinez
 
Using Mobile Phone Activity for Disaster Management During Floods - Project O...
Using Mobile Phone Activity for Disaster Management During Floods - Project O...Using Mobile Phone Activity for Disaster Management During Floods - Project O...
Using Mobile Phone Activity for Disaster Management During Floods - Project O...UN Global Pulse
 
Listening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoringListening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoringbetterplace lab
 
CitizenReporting_for_Crime_Analysis
CitizenReporting_for_Crime_AnalysisCitizenReporting_for_Crime_Analysis
CitizenReporting_for_Crime_AnalysisPatrick Floto
 
Cybercrime & global mapping
Cybercrime & global mappingCybercrime & global mapping
Cybercrime & global mappingFred Zimmerman
 
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary SlidedeckBig Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary SlidedeckUN Global Pulse
 
Big Data - Janaka Abeysinghe
Big Data - Janaka AbeysingheBig Data - Janaka Abeysinghe
Big Data - Janaka AbeysingheSTS FORUM 2016
 
Measuring the impact of epidemic alerts on human mobility using cell-phone ne...
Measuring the impact of epidemic alerts on human mobility using cell-phone ne...Measuring the impact of epidemic alerts on human mobility using cell-phone ne...
Measuring the impact of epidemic alerts on human mobility using cell-phone ne...Enrique Frias-Martinez
 
Big Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case StudiesBig Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case StudiesJohn Palfreyman
 
Transport Technology Forum Show Version
Transport Technology Forum Show VersionTransport Technology Forum Show Version
Transport Technology Forum Show VersionJennifer Wilson
 
Disaster Risk Management in the Information Age
Disaster Risk Management in the Information AgeDisaster Risk Management in the Information Age
Disaster Risk Management in the Information Ageglobal
 
Activate Urban Blood for Smarter Cities
Activate Urban Blood for Smarter CitiesActivate Urban Blood for Smarter Cities
Activate Urban Blood for Smarter CitiesTakuro Yonezawa
 
Gis and gps in plant biosecurity
Gis and gps in plant biosecurityGis and gps in plant biosecurity
Gis and gps in plant biosecurityNageshb11
 
Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...ijtsrd
 
Opportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis InformaticsOpportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis InformaticsLea Shanley
 
Michael holland ppt
Michael holland pptMichael holland ppt
Michael holland pptLaura Manley
 

Similar to The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs Connect (20)

Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...
Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...
Characterizing Social Response to Urban Earthquakes using Cell-Phone Network ...
 
Using Mobile Phone Activity for Disaster Management During Floods - Project O...
Using Mobile Phone Activity for Disaster Management During Floods - Project O...Using Mobile Phone Activity for Disaster Management During Floods - Project O...
Using Mobile Phone Activity for Disaster Management During Floods - Project O...
 
Listening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoringListening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoring
 
CitizenReporting_for_Crime_Analysis
CitizenReporting_for_Crime_AnalysisCitizenReporting_for_Crime_Analysis
CitizenReporting_for_Crime_Analysis
 
Cybercrime & global mapping
Cybercrime & global mappingCybercrime & global mapping
Cybercrime & global mapping
 
It and safety
It and safetyIt and safety
It and safety
 
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary SlidedeckBig Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
 
asi_22876_Rev
asi_22876_Revasi_22876_Rev
asi_22876_Rev
 
Big Data - Janaka Abeysinghe
Big Data - Janaka AbeysingheBig Data - Janaka Abeysinghe
Big Data - Janaka Abeysinghe
 
Measuring the impact of epidemic alerts on human mobility using cell-phone ne...
Measuring the impact of epidemic alerts on human mobility using cell-phone ne...Measuring the impact of epidemic alerts on human mobility using cell-phone ne...
Measuring the impact of epidemic alerts on human mobility using cell-phone ne...
 
Big Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case StudiesBig Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case Studies
 
J017555559
J017555559J017555559
J017555559
 
Transport Technology Forum Show Version
Transport Technology Forum Show VersionTransport Technology Forum Show Version
Transport Technology Forum Show Version
 
Disaster Risk Management in the Information Age
Disaster Risk Management in the Information AgeDisaster Risk Management in the Information Age
Disaster Risk Management in the Information Age
 
Activate Urban Blood for Smarter Cities
Activate Urban Blood for Smarter CitiesActivate Urban Blood for Smarter Cities
Activate Urban Blood for Smarter Cities
 
Gis and gps in plant biosecurity
Gis and gps in plant biosecurityGis and gps in plant biosecurity
Gis and gps in plant biosecurity
 
Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...
 
CCTNS & Homeland Security
CCTNS & Homeland SecurityCCTNS & Homeland Security
CCTNS & Homeland Security
 
Opportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis InformaticsOpportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis Informatics
 
Michael holland ppt
Michael holland pptMichael holland ppt
Michael holland ppt
 

More from PAPIs.io

Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...PAPIs.io
 
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017PAPIs.io
 
Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...PAPIs.io
 
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...PAPIs.io
 
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...PAPIs.io
 
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...PAPIs.io
 
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...PAPIs.io
 
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...PAPIs.io
 
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...PAPIs.io
 
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016PAPIs.io
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectReal-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectPAPIs.io
 
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...PAPIs.io
 
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...PAPIs.io
 
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs ConnectDemystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs ConnectPAPIs.io
 
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs ConnectPredictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs ConnectPAPIs.io
 
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs ConnectMicrodecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs ConnectPAPIs.io
 
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...PAPIs.io
 
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...PAPIs.io
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectHow to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectPAPIs.io
 
Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...
Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...
Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...PAPIs.io
 

More from PAPIs.io (20)

Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
 
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
 
Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...
 
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
 
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
 
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
 
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
 
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
 
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
 
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectReal-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
 
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
 
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
 
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs ConnectDemystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
 
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs ConnectPredictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
 
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs ConnectMicrodecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
 
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
 
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectHow to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
 
Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...
Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...
Automating Machine Learning Workflows: A Report from the Trenches - Jose A. O...
 

Recently uploaded

AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...Product School
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxAbida Shariff
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxDavid Michel
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Alison B. Lowndes
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityScyllaDB
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...CzechDreamin
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...CzechDreamin
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoTAnalytics
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...Elena Simperl
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupCatarinaPereira64715
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backElena Simperl
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
 

Recently uploaded (20)

AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 

The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs Connect

  • 1. Big Data for Social Good Nuria Oliver, PhD Scientific Director Telefonica R&D
  • 2. 7+ billion mobile phones worldwide 97% of world’s population (ITU) Mobile penetration of 120% to 89% of population (ITU) Emerging and developed regions More time spent on our phones than watching TV or with our with our partner (US and UK)
  • 3.
  • 4.
  • 5. Mobile Network Infrastructure ● A cell tower (also known as BTS or base station) provides an approximate location ● Spacing between towers is 500m-1km in urban areas, 2-3 km in rural areas ● Each tower has a “Voronoi cell”; this is the area to which the tower provides best coverage i.e. tower is closest
  • 6.
  • 7. Typical Mobile Data • CDR • SMS Consumption Social Network Mobility Call duration In/Out Degree Radius of gyration N. Events Delta w.r.t time window Travelled distance Lapse between events Unique Calls per day Rate of popular antennas Reciprocated events Unique SMS per day Regularity of popular antennas … … … HR_ORG TLFN_A TLFN_B CD_GEO_A CD_GEO_B DT_ORG CD_SNTD CD_ERB CD_CCC QT_DUR 20:05:31 XXX YYY 3 11 20140519 2 1562 568 33 … … … … … … … … … … HR_ORG TLFN_A TLFN_B CD_GEO_A CD_GEO_B DT_ORG CD_SNTD QT_TRFG 15:53:54 XXX ZZZ 3 25 20140506 2 1 … … … … … … … …
  • 8.
  • 9.
  • 11. Big Data for Social Good @ Telefonica Crime Prediction Analysis of impact of floods http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver 70% accuracy
  • 13. Flooding Through the Lens of Mobile Phone Activity David Pastor-Escuredo1, Alfredo Morales-Guzmán1, Yolanda Torres-Fernández1, Jean-Martin Bauer2, Amit Wadhwa2, Carlos Castro-Correa3, Liudmyla Romanoff4, Jong Gun Lee4, Alex Rutherford4, Vanessa Frias-Martinez5, Nuria Oliver6, Enrique Frias- Martinez6, Miguel Luengo-Oroz4 1. Universidad Politécnica de Madrid 2. Vulnerability Analysis and Mapping, World Food Programme 3. Coordinación de Estrategia Digital Nacional, Presidencia de la República de México 4. United Nations Global Pulse 5. University Maryland 6. Telefónica Research
  • 14. UN Global Pulse is an innovation initiative of the United Nations Secretary- General, driving a big data revolution for development. Its vision is to see big data used safely and responsibly for public good; its mission to accelerate discovery, development and adoption of big data and real-time analytics for sustainable development and humanitarian action.
  • 15. Tabasco Floods • Tabasco state frequently suffers flooding • 800mm of rain 31st October – 3rd November 2009 (4x normal) • 214,000 people affected • State of emergency declared
  • 16. Reconstructing the Floods • We combined mobile data with NASA LANDSAT satellite imagery data, governmental surveys and census data • NASA LANDSAT  geolocation of the most affected areas • Census data  representativeness of mobile data • Precipitations data and civil protection reports  timeline of people’s awareness and response
  • 17. CDR Representativeness ● Mobile user representativeness validated against census data ● Population estimates based on CDRs are strongly correlated with official Census data and population statistics ● Mobile data  proxy of population distribution in areas where other data sources are not available or reliable R2=0.97
  • 18. Event Detection x(t) is raw number of unique phones placing or receiving calls in each antennae per day ● Quantify impact of floods by comparing behavior of BTS against baseline activity using z-score like metric ● BTS with higher variations in the number of calls made during the floods were located in the most affected regions ● Special events (e.g. Christmas) led to similar changes in behavior wrt baseline, but across the entire region as opposed to in specific locations
  • 19.
  • 20. Actionable Insights ● Patterns in BTS activity can be used to measure the impact of floodings ● Behavioral insights are important for emergency services ● People communicated more as a result of the initial impacts of the floodings, rather than following the recommendations of public protection ● Civil protection warnings did not seem to be an effective way to raise awareness ● Spikes in activity are observed only after the situation was critical
  • 21. Big Data for Social Good @ Telefonica Crime Prediction Analysis of impact of floods http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver 70% accuracy
  • 23.
  • 24.
  • 25.
  • 26.
  • 27. Data Analysis • Call Records from 1st Jan till 31st May 2009 • Compute mobility as different number of BTS visited • Stages • Medical Alert - Stage 1 (17th-27th April) • Closing Schools - Stage 2 (28th-1st May) • Suspension of Essential Activities - Stage 3 (1st May-6th May) • Baselines • same periods, different year (2008)
  • 30. Impact on disease propagation
  • 32. Big Data for Social Good @ Telefonica Crime Prediction Analysis of impact of floods http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver 70% accuracy
  • 33. Crime Work with Bogomolov, A., Lepri, B., Staiano, J., Pianesi, F., Pentland, A.
  • 34.  Affects quality of life and economic development both at the national and local level  Several studies explore relationships between crime and socio-economic variables: education, income, unemployment, ethnicity, …  Several studies have shown significant concentrations of crime in small geographical areas: crime hotspots Crime
  • 35.  T1: Natural surveillance as key deterrent for crime: people moving around are eyes on the street (Jacobs, 1961)  high diversity among the population and high number of visitors -> less crime  T2: Defensible space theory (Newman, 1972)  high mix of people -> more crime Crime and Urban Environment
  • 36. People-centric perspective vs Place- centric perspective  people-centric perspective used for individual or collective criminal profiling  place-centric perspective used for predicting crime hotspots Crime Prediction
  • 37. Data-driven and place-centric approach to crime prediction  Multimodal approach: people dynamics derived from mobile network data and demographics European metropolis: London Prediction of crime hotspots and not criminals profiling Our Approach
  • 38. Smartsteps Dataset:  for each of the Smartsteps cells a variety of demographic and human dynamics variables were computed every hour for 3 weeks (from December 9 to December 15, 2012 and from December 23, 2012 to January 5, 2013) Criminal Cases Dataset:  criminal cases for December 2012 and for January 2013 London Borough Profiles Dataset:  open dataset containing 68 metrics about the population of a particular geographic area Multimodal Approach: Data
  • 39. • Footfall count: Shows the trend in footfall in a specified area hourly, daily, weekly and monthly. Provides a basic profile of the crowd. • Catchment area: Shows which postal sectors are your customers coming from by hour, day, week and month. Shows the “battleground” for two sites. • Transport mode: Shows flows of crowds from any two points, segmented by road, air, train, etc. SmartSteps
  • 40. For each cell and for each hour the dataset contains:  an estimation of how many people are in the cell  the percentage of these people at home, at work or just visiting the cell  the gender splits (male vs. female)  the age splits (0-20 years, 21-30 years, 31-40 years, …) SmartSteps Data
  • 41.  Crime geolocation for 2 months (December 2012 – January 2013)  All reported crimes in UK specifying month and year and not specific day/time  Median crime value (=5) used as threshold  Spatial granularity of borough profiles is at LSOA levels: LSOA are small geographical areas defined by UK Office for National Statistics (mean population: 1500) Crime Data
  • 42.  68 metrics about the population of a specific geographical area: demographics, households, migrant population, employment, earnings, life expectancy, happiness levels, house prices, etc.  Spatial granularity of borough profiles is at LSOA levels:  LSOA are small geographical areas defined by UK Office for National Statistics (mean population: 1500) London Borough Profiles Data
  • 43. From Smartsteps data we extract  1st order features (mean, median, min., max., entropy, etc.)  2nd order features on sliding windows of variable length (1 hour, 4 hours, 1 day, etc.) to account for temporal patterns Feature Extraction
  • 44. Feature Selection  Mean decrease in Gini coefficient of inequality  the feature with maximum mean decrease in Gini coefficient is expected to have the maximum influence in minimizing the out-of-the-bag error  The feature selection process produced a reduced subset of 68 features (from an initial pool of about 6000 features)
  • 45. Classification Approach  Binary classification task: high crime area vs low crime area  10-fold cross-validation approach  Classifier: Random Forest (RF)  RF overcomes logistic regression, support vector machines, neural networks, decision trees
  • 46. Smartsteps-based classifier significantly outperforms baseline majority and borough profiles-based classifiers Experimental Results
  • 47. ground-truth Experimental Results ~70% accuracy in predicting crime hotspots predictions
  • 48.  Features encoding daily dynamics have more predictive power than features extracted on a monthly basis  Relevance of high number of residents to predict crime areas  increased ratio of residents -> more crime (in contrast with Newman’s thesis)  Entropy-based features are useful for predicting the crime hotspots  high diversity of functions (home vs work) and high diversity of people (gender and age) act as eyes on street decreasing crime (in line with Jacobs’ thesis) Relevant Features
  • 49.  Only 6 out of 68 features in the joint model are London Borough features, namely %working population claiming out of work benefits Largest migrant population % overseas nationals entering the UK % resident population born abroad Relevant Features
  • 50.  Our method captures the dynamics of a place rather than making extrapolations from previous crime histories. We can use it in areas where people are less inclined to report crimes Our method provides new ways of describing geographical areas: novel risk-inducing or risk-reducing features of geographical areas Implications
  • 51. Relevant Publications “A gender-centric analysis of calling behavior in a developing country using CDRS”, Vanessa Frias-Martinez, Enrique Frias-Martinez and Oliver, N., Proceedings of AAAI, 2009 "Prediction of Socioeconomic Levels using Cell Phone Records", Victor Soto and Vanessa Frias-Martinez and Jesus Virseda and Enrique Frias-Martinez, International Conference on User Modeling, Adaptation and Personalization, UMAP'11, Industrial Track, Girona, Spain, 2011 "An Agent-Based Model Of Epidemic Spread Using Human Mobility and Social Network Information", Vanessa Frias-Martinez et al, 3rd International Conference on Social Computing, SocialCom '11, Boston, USA, 2011. Talk at WIRED 2013. London UK http://www.wired.co.uk/news/archive/2013-10/17/nuria-oliver
  • 52. Relevant Publications “Moves on the street: Predicting Crime Hotspots using aggregated anonymized data on people dynamics” - A. Bogomolov, B. Lepri, J. Staiano, Leouze, E., N. Oliver, F. Pianesi, A. Pentland Journal of Big Data (Big Data Journal 2015) “Once Upon a Crime: Towards Crime Prediction from Demographics and Mobile Data” - A. Bogomolov, B. Lepri, J. Staiano, N. Oliver, F. Pianesi, A. Pentland 16th ACM International Conference on Multimodal Interaction (ICMI 2014) "Flooding through the Lens of Mobile Phone Activity" Pastor-Escuredo, D., Torres Fernandez, Y., Bauer, J.M., Wadhwa, A., Castro- Correa, C., Romanoff, L., Lee, J.G., Rutherford, A., Frias-Martinez, V., Oliver, N., Frias-Martinez, E. and Luengo-Oroz, M. Proceedins of IEEE Global Humanitarian Technology Conference, GHTC 2014, Silicon Valley, CA, Oct 2014
  • 54. Temporal and Spatial Granularity • Big Data can be available in real-time or if not in real time much more frequently than how data is typically collected (every 5-10 years for example for census data); • Some kinds of Big Data (e.g. data about the city, collected by sensors placed in the urban infrastructure) can be collected with significantly finer grained spatial granularity than with traditional methods; Cost and Effort Accuracy • It could be argued that some kinds of data that are relevant for the public sector (e.g. migrations) can be collected more accurately by automatic means through Big Data platforms than by manual means as it is the state of the art. • In addition, given that there isn’t a human-in-the-loop, the data is less prone to human errors and potential biases introduced by humans. • Most of the Big Data that could be used for the public sector is data that has been collected already for other purposes. In addition, Big Data is typically collected by automatic means which makes its collection very cost-efficient;
  • 56.
  • 57. Risk/Benefit Analysis • Even while adopting all sorts of precautions, if by any chance there is an error in the data extraction such that there could be a privacy leak, the consequences for the business could be devastating • Hence, the potential benefit would have to be really large to compensate for the risk. • As this is an emergent research area, the benefits are still to be defined Regulatory/Social • Lack of updated regulation • Lack of clear guidelines regarding safe data handling, processing and sharing for humanitarian purposes • Risk of potential unintended social consequences • Risk of creating a digital divide, unbalanced access to data and-or expertise on how to analyze it and make sense of it Internal Barriers • Big Data for Development/Social Good projects are typically not part of any business unit in the Telcos;
  • 58. Technical • Representativeness of the data, generalization • Combination of data from multiple sources • Real-time analysis and prediction • Lack of ground truth  intervention to validate • Significant vs substantially significant • Correlations vs causality Privacy/Security • Potential privacy risks need to be minimized and understood. Control and transparency • Security and traceability of the data • Clear code of conduct and ethical principles when dealing with data • Strict access control when appropriate
  • 59. Framework for Data Sharing Remote Access Question and Answers Limited License Pre-computed Indicators & Synthetic Data Security Secure access to the data Anonymization Limited or aggregated data release Dataprotectedthrough ApplicativeExploratory Low to medium number of users Medium to large number of users and open data Development stage
  • 60. What can we all do to responsibly turn this opportunity into a reality ?