SlideShare a Scribd company logo
Martin Hilbert
Department of Communication
hilbert@UCDavis.edu
Data for Development
the value of data for research and society
Hilbert & López (2011).
The world’s technological
capacity to store,
communicate and compute
information.
Science, 332, 6025, 60-65
www.martinhilbert.net/WorldInfoCapacity.html
World’s Info Storage Capacity
in optimally compressed MB
Stored digital information has doubled every 2.5 years
≈ 5 ZB in 2014 (5 x 10 21 Bytes)
4,500 piles of printed books
Gillings, Hilbert & Kemp (2016)
Information in the Biosphere:
Biological and Digital Worlds.
Trends in Ecology & Evolution,
31(3), 180–189
www.martinhilbert.net/information-in-the-biosphere/
7.2 bn humans * 6.2 bn nucleotides = 1 x 1019 Bytes vs.
All DNA on Earth = 1.3 x 1037 Bytes
≈> 100 years!
5 x 1021 Bytes
https://maps.google.com/locationhistory
Digital
Footprint
Source: http://www.youtube.com/watch?v=wqjKTW3wJZ8
Big Data as Commodity
La vulnerabilidad financiera de consumidores:
 "Socialmente influyente"
 "Rural y apenas sobreviven"
 “Etnia minoría luchando en segunda ciudad"
 “Jubilado en vacío: Singles"
 "Inicio duro: Padres jóvenes single"
 "Mayores buscado oportunidades: ancianos
en busca de maneras de hacer dinero"
 "Oldies but Goodies: ancianos ingenuos,
quieren creer su suerte puede cambiar”
US Senate. A Review of the Data Broker Industry: Collect, Use, and Sale of Consumer Data for Marketing Purposes, 2013)
TED-Ed. Jer Thorp(2013).
Visualizing the world’s
Twitter data;
The Economist. (2014, ).
Off the map.
Digital
Footprint
8am
9am
10am
there be
dragons
96.2
38.8
16.5
29.5
9.8
0
10
20
30
40
50
60
70
80
90
100
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012* 2013*
Per100inhabitants
Global ICT developments, 2001-2013
Mobile-cellular telephone subscriptions
Individuals using the Internet
Fixed-telephone subscriptions
Active mobile-broadband subscriptions
Fixed (wired)-broadband subscriptions
Access to electricity
Note: * Estimate
Source: ITU World Telecommunication /ICT Indicators database
sample = universe
Using “meta-data”
records like call
duration and call
frequency, one can
predict socio-
economic,
demographic, and
other behavioral
trades with 80-85%
accuracy.
75 % among those with less than US$1 per day!
Naef, et al. (2014). Using Mobile Data for Development
Sources: Raento, M., Oulasvirta, A., & Eagle, N. (2009). Smartphones: An Emerging Tool for Social Scientists. Sociol. Methods & Research, 37(3), 426–454.
Frias-Martinez, V., & Frias-Martinez, E. (2014). Spectral clustering for sensing urban land use using Twitter activity. Engin. Appl. of Artificial Intell., 35, 237–245.
Frias-Martinez, V., & Virseda, J. (2013). Cell Phone Analytics: Scaling Human Behavior Studies into the Millions. ITID, 9(2), pp. 35–50.
Frias-Martinez, V., Frias-Martinez, E., & Oliver, N. (2010). A Gender-centric Analysis of Calling Behavior…. AAAI 201 Artificial Intelligence for Development.
Blumenstock, J. E., Gillick, D., & Eagle, N. (2010). Who’s Calling? Demographics of Mobile Phone Use in Rwanda. AAAI 201 Artificial Intelligence for Development.
Source: Youyou, W., Kosinski, M., & Stillwell, D. (2015). Computer-based personality judgments are more accurate than those made by humans. PNAS, 201418680.
Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits and attributes are predictable from digital records of human behavior. PNAS, 110(15), 5802–5805.
“[100-250] Facebook Likes, can be used
to automatically and accurately
predict…: sexual orientation, ethnicity,
religious and political views, personality
traits, intelligence, happiness, use of
addictive substances, parental
separation, age, and gender…”
Machine learning knows us better than we ourselves
Source: Stephens-Davidowitz, S. (2015). Searching for Sex. The New York Times.
2015, January 24. Rudder, C. Dataclysm: Who We Are. (Crown, 2014).
social science
3.5 million active
users in 2010
SDG Goal 8.7: …eradicate forced
labor, end modern slavery…
FRDM (Forced Labor Risk Determination & Mitigation)
54,000 products, services, and commodities:
- Services
- Materials
- Component parts
+ Origin
+ qualitative peer-reviewed reports
+ Ariba Network: largest B2B platform > 2 million companies
https://vimeo.com/92259979
International Center for Tropical
Agriculture; Colombian Government
Agriculture and Food Security; Colombia’s
National Federation of Rice Growers
[weather data] + [RICE crops data] +
+ algorithms from neuroscience / biology =>
=> climate change
Results localized for towns:
Saldaña: solar radiation during the grain-ripening stage
Espinal: sensitivity to warm nights
Low cost solutions: sowing crops in right period of time
Impact: 170 farmers avoided direct losses of $ 3.6 million +
productivity from 1 to 3 tons per hectare.
….now being scaled out through Colombia, Argentina, Nicaragua, Peru and Uruguay.
SDG Goal 2.4: …ensure sustainable food
production systems and implement resilient
agricultural practices that increase productivity
U.S. Bureau of Labor Statistics
o > 100s staff visiting > 90 cities
o 80,000 prices
o Cost: US$ 250 million/year
Real-time
Sources: http://www.pricestats.com/about-us/meet-the-team ; www.economist.com/node/21548242 ;
http://www.inflacionverdadera.com
PriceStats
o 17 staff
o 300 online retailers; > 70 countries
o daily 5,000,000 prices
http://www.eloyalty.com ; http://www.mattersight.com/ ; http://www.fastcompany.com/1706766/how-personality-test-designed-pick-astronauts-taking-pain-out-customer-support ; http://www.ssca.com/resources/articles/104-
the-history-of-the-process-communication-model-in-astronaut-selection ; http://www.forbes.com/forbes/2011/0214/entrepreneurs-kelly-conway-software-eloyalty-your-pain.html
Cook, Scott (October 2013). "Personality Matters: Behavioral analytics is now a reality in contact centres". Direct Marketing Magazine 26 (3): 5.
EMOTIONS-DRIVEN (30% of the population)
THOUGHTS-DRIVEN (25%)
REACTIONS-DRIVEN (20%)
OPINIONS-DRIVEN (10%)
REFLECTIONS-DRIVEN (10%)
ACTIONS-DRIVEN (5%)
Matching Personality Types:
 Call average from 10 min to 5 min
 Customer Satisfaction from 47 % to 92%
Data
o US$1 billion investment; core group of 40 engineers
(from Twitter, Google, Facebook, Craigslist, stem cell, professional poker players…)
o Project Narwhal: 16 million unique voter profiles:
email sign-ups, zip codes, profession, voter registrations, volunteering
& donation record, Tweets, Facebook postings and network ties, TV
Watching behavior through 20 million set-top boxes, etc.
o Ranking the 20% of Obama’s 2008 vote that shifted to
undecided on a 0-10 persuasion score
o 62,000 computer simulations of likely voter behavior
Outcome
o Obama paid 35% less per broadcast commercial than opponent Romney
(40,000 more spots on the air, spending $90 million less!)
o Present tailor made campaign promises (agreeable adds; etc)
o Guide volunteers in phone and door-to-door campaigns
o Email donation requests, raising $181 million/month
o Predict States voting outcome at an accuracy of 0.5 percent
o Change voting behavior of 78 % of targeted undecided voters through Facebook
Obama 2012 campaign
Sources: Woodie, A. (2013, June 7). Big Data Analytics Give Electoral Edge. Datanami. Kolb, J., & Kolb, J. (2013). The Big Data Revolution. CreateSpace Independent Publishing Platform. Madrigal, A. C. (2012,
November 16). When the Nerds Go Marching In. The Atlantic. Rutenberg (2013), Data You Can Believe In The Obama Campaign’s Digital Masterminds Cash In; NYT.
Meaning ≠ Meaningful
Correlation ≠ Causation
Past ≠ Future
Data ≠ Reality
Homicide Parole candidates
o 60 – 70 % correct who commits homicide
Berk, R., Sherman, L., Barnes, G., Kurtz, E., & Ahlman, L. (2009). Forecasting murder within a population of probationers and parolees: a high
stakes application of statistical learning. Journal of the Royal Stat.Soc.: Series A, 172(1), 191–211. http://spectrum.ieee.org/podcast/at-
work/innovation/can-software-predict-repeat-offenders ; http://www.spiegel.de/netzwelt/web/in-santa-cruz-sagen-computer-verbrechen-
voraus-a-899422.html ;http://www.sfgate.com/default/article/Sci-fi-policing-predicting-crime-before-it-occurs-3725708.php ; Wikipedia
Commons; Scahill, J., & Greenwald, G. (2014). The NSA’s Secret Role in the U.S. Assassination Program. The Intercept.
JSOC drone operator: “Por supuesto, se supone que el
teléfono pertenece a un ser humano que es nefasto y
considerado un ‘combatiente enemigo ilegal.’”
Aquí es donde se pone muy turbio…”
"We kill people based on metadata"
Data ≠ Reality
“rey”
Meaning ≠ Meaningful
Caliskan-Islam, Bryson, Narayanan (2016). Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187 [Cs]
“executive, management,
professional, corporation, salary,
office, business, career.”
“Amy,
Lisa,
Sarah,
Diana,
Ann”
“home, parents, children, family,
marriage, wedding, relatives”
“John,
Paul,
Mike,
Kevin,
Bill”
Meaning ≠ Meaningful
Meaning ≠ Meaningful
“Harry,
Katie,
Jonathan,
Nancy,
Emily”
“freedom, health, love, peace,
friend, heaven, gentle, loyal, lucky,
diploma, happy, laughter, vacation”
Caliskan-Islam, Bryson, Narayanan (2016). Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187 [Cs]
Meaning ≠ Meaningful
“Jerome,
Ebony,
Jasmine,
Latisha,
Tia”
Caliskan-Islam, Bryson, Narayanan (2016). Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187 [Cs]
“abuse, filth, sickness, accident,
death, grief, poison, assault,
poverty, ugly, evil, agony, prison.”
29
30
31
32
33
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
400
450
500
550
600
650
700
750
Civil engineering doctorates
Consumption of cheese
correlation coeff: 0.974
http://tylervigen.com/discover
Correlation ≠ Causation
7
9
11
13
15
17
19
21
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Movies with Nicolas Cage
Deaths being hit by sports equipment
correlation coeff: 0.762
http://tylervigen.com/discover , Wikipedia Commons
Correlation ≠ Causation
2300
2500
2700
2900
3100
3300
3500
3700
3900
4.50E+07
5.00E+07
5.50E+07
6.00E+07
6.50E+07
7.00E+07
7.50E+07
8.00E+07
8.50E+07
2001 2003 2005 2007 2009 2011
Mobile phone subscriptions
Death by falls
correlation coeff: 0.980
Correlation ≠ Causation
?
?
Past ≠ Future
Wikipedia Commons
2013
1989Past ≠ Future
Computational Social Science
Sources: Bohemia Interactive Simulations, http://youtu.be/G9P9bUTCdpA ; TRANSIMS: http://www.youtube.com/watch?v=mN7kq0ITAys ; Epstein, http://www.youtube.com/watch?v=wZZJCIGtVkw
“…any change in policy will systematically
alter the structure of econometric models”
(1976)
Correlation ≠ Causation
Meaning ≠ Meaningful
Past ≠ Future
Data ≠ Reality
Main References:
> Hilbert (2016). Big Data for Development: A Review of Promises and
Challenges. Development Policy Review, 34(1), 135–174.
https://doi.org/10.1111/dpr.12142
> Hilbert, M. (2015). ICT4ICTD: Computational Social Science for Digital
Development. 48th (HICSS) (pp. 2145–2157). IEEE Computer Society.
https://doi.org/10.1109/HICSS.2015.258
> Gillings, Hilbert & Kemp (2016). Information in the Biosphere: Biological
and Digital Worlds. Trends in Ecology & Evolution, 31(3), 180–189
www.martinhilbert.net/information-in-the-biosphere/
> Hilbert & López (2011). The world’s technological capacity to store,
communicate and compute information. Science, 332, 6025, 60-65
www.martinhilbert.net/WorldInfoCapacity.html
> Hilbert (2016). The bad news is that the digital access divide is here to stay:
Domestically installed bandwidths among 172 countries for 1986–2014.
Telecommunications Policy. www.martinhilbert.net/the-bad-news-
is-that-the-digital-access-divide-is-here-to-stay

More Related Content

Similar to “Data for Development – the value of data for research and society” by Dr. Martin Hilbert

Big Data Paper
Big Data PaperBig Data Paper
Big Data Paper
Andile Ngcaba
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
suresh sood
 
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Matthias Stürmer
 
Listening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoringListening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoring
betterplace lab
 
20220103 jim spohrer hicss v9
20220103 jim spohrer hicss v920220103 jim spohrer hicss v9
20220103 jim spohrer hicss v9
ISSIP
 
G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....
G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....
G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....
CASI, Arizona State University
 
GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2
crooksAndrew
 
Digital Trails Dave King 1 5 10 Part 1 D3
Digital Trails   Dave King   1 5 10   Part 1 D3Digital Trails   Dave King   1 5 10   Part 1 D3
Digital Trails Dave King 1 5 10 Part 1 D3
Dave King
 
Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015
CSI Piemonte
 
AI Challenges for Non-Profits, Small Business and Government
AI Challenges for Non-Profits, Small Business and GovernmentAI Challenges for Non-Profits, Small Business and Government
AI Challenges for Non-Profits, Small Business and Government
Michael Bryan
 
Contextual Analysis
Contextual AnalysisContextual Analysis
Contextual Analysis
Md Tajul Islam
 
NOAH Berlin '17 - Venture Capital Trends in Digital Health
NOAH Berlin '17 - Venture Capital Trends in Digital HealthNOAH Berlin '17 - Venture Capital Trends in Digital Health
NOAH Berlin '17 - Venture Capital Trends in Digital Health
Min-Sung Sean Kim
 
Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...
Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...
Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...
Global Business Events
 
Towards a More Open World
Towards a More Open WorldTowards a More Open World
Towards a More Open World
Alexander Howard
 
IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...
IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...
IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...
IRJET Journal
 
Exploring digital fake news phenomenon in indonesia cpr south_short_pdf
Exploring digital fake news phenomenon in indonesia cpr south_short_pdfExploring digital fake news phenomenon in indonesia cpr south_short_pdf
Exploring digital fake news phenomenon in indonesia cpr south_short_pdf
Riri Kusumarani
 
Almaden may 6th 2014 gilbert
Almaden may 6th 2014 gilbertAlmaden may 6th 2014 gilbert
Almaden may 6th 2014 gilbert
Jack Gilbert
 
A Review Paper on Big Data: Technologies, Tools and Trends
A Review Paper on Big Data: Technologies, Tools and TrendsA Review Paper on Big Data: Technologies, Tools and Trends
A Review Paper on Big Data: Technologies, Tools and Trends
IRJET Journal
 
2017 11 cascd
2017 11 cascd2017 11 cascd
2017 11 cascd
Johannes Keizer
 
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASESAPPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
ABHISEK RATH
 

Similar to “Data for Development – the value of data for research and society” by Dr. Martin Hilbert (20)

Big Data Paper
Big Data PaperBig Data Paper
Big Data Paper
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
 
Listening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoringListening to the community. Using ICTs for program monitoring
Listening to the community. Using ICTs for program monitoring
 
20220103 jim spohrer hicss v9
20220103 jim spohrer hicss v920220103 jim spohrer hicss v9
20220103 jim spohrer hicss v9
 
G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....
G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....
G. Poste. Security, Sensors, Surveillance, Smart Machines and Supercomputers....
 
GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2GIS and Agent-based modeling: Part 2
GIS and Agent-based modeling: Part 2
 
Digital Trails Dave King 1 5 10 Part 1 D3
Digital Trails   Dave King   1 5 10   Part 1 D3Digital Trails   Dave King   1 5 10   Part 1 D3
Digital Trails Dave King 1 5 10 Part 1 D3
 
Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015Rasetti fondazioneisi 29_06_2015
Rasetti fondazioneisi 29_06_2015
 
AI Challenges for Non-Profits, Small Business and Government
AI Challenges for Non-Profits, Small Business and GovernmentAI Challenges for Non-Profits, Small Business and Government
AI Challenges for Non-Profits, Small Business and Government
 
Contextual Analysis
Contextual AnalysisContextual Analysis
Contextual Analysis
 
NOAH Berlin '17 - Venture Capital Trends in Digital Health
NOAH Berlin '17 - Venture Capital Trends in Digital HealthNOAH Berlin '17 - Venture Capital Trends in Digital Health
NOAH Berlin '17 - Venture Capital Trends in Digital Health
 
Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...
Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...
Jason Witty, SVP & CISO at US Bank - Next eneration information security meet...
 
Towards a More Open World
Towards a More Open WorldTowards a More Open World
Towards a More Open World
 
IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...
IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...
IRJET- Big Data Analytics of Boko Haram Insurgency Attacks Menace in Nigeria ...
 
Exploring digital fake news phenomenon in indonesia cpr south_short_pdf
Exploring digital fake news phenomenon in indonesia cpr south_short_pdfExploring digital fake news phenomenon in indonesia cpr south_short_pdf
Exploring digital fake news phenomenon in indonesia cpr south_short_pdf
 
Almaden may 6th 2014 gilbert
Almaden may 6th 2014 gilbertAlmaden may 6th 2014 gilbert
Almaden may 6th 2014 gilbert
 
A Review Paper on Big Data: Technologies, Tools and Trends
A Review Paper on Big Data: Technologies, Tools and TrendsA Review Paper on Big Data: Technologies, Tools and Trends
A Review Paper on Big Data: Technologies, Tools and Trends
 
2017 11 cascd
2017 11 cascd2017 11 cascd
2017 11 cascd
 
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASESAPPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
 

More from LEARN Project

Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
LEARN Project
 
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM PolicyLEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Project
 
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM ToolkitLEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Project
 
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3mResearch Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
LEARN Project
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of Chile
LEARN Project
 
LEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Final Conference: Tutorial Group | How To Engage Early Career ResearchersLEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Project
 
LEARN Final Conference: Tutorial Group | Costing RDM
LEARN Final Conference: Tutorial Group | Costing RDMLEARN Final Conference: Tutorial Group | Costing RDM
LEARN Final Conference: Tutorial Group | Costing RDM
LEARN Project
 
Paolo Budroni at COAR Annual Meeting
Paolo Budroni at COAR Annual MeetingPaolo Budroni at COAR Annual Meeting
Paolo Budroni at COAR Annual Meeting
LEARN Project
 
LEARN Webinar
LEARN WebinarLEARN Webinar
LEARN Webinar
LEARN Project
 
Developing a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management ProtocolsDeveloping a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management Protocols
LEARN Project
 
The Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARNThe Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARN
LEARN Project
 
Opening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and ChallengesOpening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and Challenges
LEARN Project
 
About Data From A Machine Learning Perspective
About Data From A Machine Learning PerspectiveAbout Data From A Machine Learning Perspective
About Data From A Machine Learning Perspective
LEARN Project
 
LEARN Carribean Workshop Opening Remarks
LEARN Carribean Workshop Opening RemarksLEARN Carribean Workshop Opening Remarks
LEARN Carribean Workshop Opening Remarks
LEARN Project
 
Managing Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challengesManaging Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challenges
LEARN Project
 
LEARN Project: The Story So Far
LEARN Project: The Story So FarLEARN Project: The Story So Far
LEARN Project: The Story So Far
LEARN Project
 
The Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research OrganisationsThe Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research Organisations
LEARN Project
 
Data for Development in the Caribbean
Data for Development in the CaribbeanData for Development in the Caribbean
Data for Development in the Caribbean
LEARN Project
 
Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López
LEARN Project
 
CENTRO DE DATOS
CENTRO DE DATOSCENTRO DE DATOS
CENTRO DE DATOS
LEARN Project
 

More from LEARN Project (20)

Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
 
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM PolicyLEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
 
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM ToolkitLEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
 
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3mResearch Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of Chile
 
LEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Final Conference: Tutorial Group | How To Engage Early Career ResearchersLEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
 
LEARN Final Conference: Tutorial Group | Costing RDM
LEARN Final Conference: Tutorial Group | Costing RDMLEARN Final Conference: Tutorial Group | Costing RDM
LEARN Final Conference: Tutorial Group | Costing RDM
 
Paolo Budroni at COAR Annual Meeting
Paolo Budroni at COAR Annual MeetingPaolo Budroni at COAR Annual Meeting
Paolo Budroni at COAR Annual Meeting
 
LEARN Webinar
LEARN WebinarLEARN Webinar
LEARN Webinar
 
Developing a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management ProtocolsDeveloping a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management Protocols
 
The Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARNThe Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARN
 
Opening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and ChallengesOpening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and Challenges
 
About Data From A Machine Learning Perspective
About Data From A Machine Learning PerspectiveAbout Data From A Machine Learning Perspective
About Data From A Machine Learning Perspective
 
LEARN Carribean Workshop Opening Remarks
LEARN Carribean Workshop Opening RemarksLEARN Carribean Workshop Opening Remarks
LEARN Carribean Workshop Opening Remarks
 
Managing Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challengesManaging Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challenges
 
LEARN Project: The Story So Far
LEARN Project: The Story So FarLEARN Project: The Story So Far
LEARN Project: The Story So Far
 
The Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research OrganisationsThe Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research Organisations
 
Data for Development in the Caribbean
Data for Development in the CaribbeanData for Development in the Caribbean
Data for Development in the Caribbean
 
Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López
 
CENTRO DE DATOS
CENTRO DE DATOSCENTRO DE DATOS
CENTRO DE DATOS
 

Recently uploaded

The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
deeptiverma2406
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
chanes7
 
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdfMASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
goswamiyash170123
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
kimdan468
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 

Recently uploaded (20)

The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
 
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdfMASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 

“Data for Development – the value of data for research and society” by Dr. Martin Hilbert

  • 1. Martin Hilbert Department of Communication hilbert@UCDavis.edu Data for Development the value of data for research and society
  • 2. Hilbert & López (2011). The world’s technological capacity to store, communicate and compute information. Science, 332, 6025, 60-65 www.martinhilbert.net/WorldInfoCapacity.html World’s Info Storage Capacity in optimally compressed MB Stored digital information has doubled every 2.5 years ≈ 5 ZB in 2014 (5 x 10 21 Bytes) 4,500 piles of printed books
  • 3. Gillings, Hilbert & Kemp (2016) Information in the Biosphere: Biological and Digital Worlds. Trends in Ecology & Evolution, 31(3), 180–189 www.martinhilbert.net/information-in-the-biosphere/ 7.2 bn humans * 6.2 bn nucleotides = 1 x 1019 Bytes vs. All DNA on Earth = 1.3 x 1037 Bytes ≈> 100 years! 5 x 1021 Bytes
  • 5. Source: http://www.youtube.com/watch?v=wqjKTW3wJZ8 Big Data as Commodity La vulnerabilidad financiera de consumidores:  "Socialmente influyente"  "Rural y apenas sobreviven"  “Etnia minoría luchando en segunda ciudad"  “Jubilado en vacío: Singles"  "Inicio duro: Padres jóvenes single"  "Mayores buscado oportunidades: ancianos en busca de maneras de hacer dinero"  "Oldies but Goodies: ancianos ingenuos, quieren creer su suerte puede cambiar” US Senate. A Review of the Data Broker Industry: Collect, Use, and Sale of Consumer Data for Marketing Purposes, 2013)
  • 6. TED-Ed. Jer Thorp(2013). Visualizing the world’s Twitter data; The Economist. (2014, ). Off the map. Digital Footprint 8am 9am 10am there be dragons
  • 7. 96.2 38.8 16.5 29.5 9.8 0 10 20 30 40 50 60 70 80 90 100 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012* 2013* Per100inhabitants Global ICT developments, 2001-2013 Mobile-cellular telephone subscriptions Individuals using the Internet Fixed-telephone subscriptions Active mobile-broadband subscriptions Fixed (wired)-broadband subscriptions Access to electricity Note: * Estimate Source: ITU World Telecommunication /ICT Indicators database sample = universe Using “meta-data” records like call duration and call frequency, one can predict socio- economic, demographic, and other behavioral trades with 80-85% accuracy. 75 % among those with less than US$1 per day! Naef, et al. (2014). Using Mobile Data for Development Sources: Raento, M., Oulasvirta, A., & Eagle, N. (2009). Smartphones: An Emerging Tool for Social Scientists. Sociol. Methods & Research, 37(3), 426–454. Frias-Martinez, V., & Frias-Martinez, E. (2014). Spectral clustering for sensing urban land use using Twitter activity. Engin. Appl. of Artificial Intell., 35, 237–245. Frias-Martinez, V., & Virseda, J. (2013). Cell Phone Analytics: Scaling Human Behavior Studies into the Millions. ITID, 9(2), pp. 35–50. Frias-Martinez, V., Frias-Martinez, E., & Oliver, N. (2010). A Gender-centric Analysis of Calling Behavior…. AAAI 201 Artificial Intelligence for Development. Blumenstock, J. E., Gillick, D., & Eagle, N. (2010). Who’s Calling? Demographics of Mobile Phone Use in Rwanda. AAAI 201 Artificial Intelligence for Development.
  • 8. Source: Youyou, W., Kosinski, M., & Stillwell, D. (2015). Computer-based personality judgments are more accurate than those made by humans. PNAS, 201418680. Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits and attributes are predictable from digital records of human behavior. PNAS, 110(15), 5802–5805. “[100-250] Facebook Likes, can be used to automatically and accurately predict…: sexual orientation, ethnicity, religious and political views, personality traits, intelligence, happiness, use of addictive substances, parental separation, age, and gender…” Machine learning knows us better than we ourselves
  • 9. Source: Stephens-Davidowitz, S. (2015). Searching for Sex. The New York Times. 2015, January 24. Rudder, C. Dataclysm: Who We Are. (Crown, 2014). social science 3.5 million active users in 2010
  • 10. SDG Goal 8.7: …eradicate forced labor, end modern slavery… FRDM (Forced Labor Risk Determination & Mitigation) 54,000 products, services, and commodities: - Services - Materials - Component parts + Origin + qualitative peer-reviewed reports + Ariba Network: largest B2B platform > 2 million companies https://vimeo.com/92259979
  • 11. International Center for Tropical Agriculture; Colombian Government Agriculture and Food Security; Colombia’s National Federation of Rice Growers [weather data] + [RICE crops data] + + algorithms from neuroscience / biology => => climate change Results localized for towns: Saldaña: solar radiation during the grain-ripening stage Espinal: sensitivity to warm nights Low cost solutions: sowing crops in right period of time Impact: 170 farmers avoided direct losses of $ 3.6 million + productivity from 1 to 3 tons per hectare. ….now being scaled out through Colombia, Argentina, Nicaragua, Peru and Uruguay. SDG Goal 2.4: …ensure sustainable food production systems and implement resilient agricultural practices that increase productivity
  • 12. U.S. Bureau of Labor Statistics o > 100s staff visiting > 90 cities o 80,000 prices o Cost: US$ 250 million/year Real-time Sources: http://www.pricestats.com/about-us/meet-the-team ; www.economist.com/node/21548242 ; http://www.inflacionverdadera.com PriceStats o 17 staff o 300 online retailers; > 70 countries o daily 5,000,000 prices
  • 13. http://www.eloyalty.com ; http://www.mattersight.com/ ; http://www.fastcompany.com/1706766/how-personality-test-designed-pick-astronauts-taking-pain-out-customer-support ; http://www.ssca.com/resources/articles/104- the-history-of-the-process-communication-model-in-astronaut-selection ; http://www.forbes.com/forbes/2011/0214/entrepreneurs-kelly-conway-software-eloyalty-your-pain.html Cook, Scott (October 2013). "Personality Matters: Behavioral analytics is now a reality in contact centres". Direct Marketing Magazine 26 (3): 5. EMOTIONS-DRIVEN (30% of the population) THOUGHTS-DRIVEN (25%) REACTIONS-DRIVEN (20%) OPINIONS-DRIVEN (10%) REFLECTIONS-DRIVEN (10%) ACTIONS-DRIVEN (5%) Matching Personality Types:  Call average from 10 min to 5 min  Customer Satisfaction from 47 % to 92%
  • 14. Data o US$1 billion investment; core group of 40 engineers (from Twitter, Google, Facebook, Craigslist, stem cell, professional poker players…) o Project Narwhal: 16 million unique voter profiles: email sign-ups, zip codes, profession, voter registrations, volunteering & donation record, Tweets, Facebook postings and network ties, TV Watching behavior through 20 million set-top boxes, etc. o Ranking the 20% of Obama’s 2008 vote that shifted to undecided on a 0-10 persuasion score o 62,000 computer simulations of likely voter behavior Outcome o Obama paid 35% less per broadcast commercial than opponent Romney (40,000 more spots on the air, spending $90 million less!) o Present tailor made campaign promises (agreeable adds; etc) o Guide volunteers in phone and door-to-door campaigns o Email donation requests, raising $181 million/month o Predict States voting outcome at an accuracy of 0.5 percent o Change voting behavior of 78 % of targeted undecided voters through Facebook Obama 2012 campaign Sources: Woodie, A. (2013, June 7). Big Data Analytics Give Electoral Edge. Datanami. Kolb, J., & Kolb, J. (2013). The Big Data Revolution. CreateSpace Independent Publishing Platform. Madrigal, A. C. (2012, November 16). When the Nerds Go Marching In. The Atlantic. Rutenberg (2013), Data You Can Believe In The Obama Campaign’s Digital Masterminds Cash In; NYT.
  • 15. Meaning ≠ Meaningful Correlation ≠ Causation Past ≠ Future Data ≠ Reality
  • 16. Homicide Parole candidates o 60 – 70 % correct who commits homicide Berk, R., Sherman, L., Barnes, G., Kurtz, E., & Ahlman, L. (2009). Forecasting murder within a population of probationers and parolees: a high stakes application of statistical learning. Journal of the Royal Stat.Soc.: Series A, 172(1), 191–211. http://spectrum.ieee.org/podcast/at- work/innovation/can-software-predict-repeat-offenders ; http://www.spiegel.de/netzwelt/web/in-santa-cruz-sagen-computer-verbrechen- voraus-a-899422.html ;http://www.sfgate.com/default/article/Sci-fi-policing-predicting-crime-before-it-occurs-3725708.php ; Wikipedia Commons; Scahill, J., & Greenwald, G. (2014). The NSA’s Secret Role in the U.S. Assassination Program. The Intercept. JSOC drone operator: “Por supuesto, se supone que el teléfono pertenece a un ser humano que es nefasto y considerado un ‘combatiente enemigo ilegal.’” Aquí es donde se pone muy turbio…” "We kill people based on metadata" Data ≠ Reality
  • 18. Caliskan-Islam, Bryson, Narayanan (2016). Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187 [Cs] “executive, management, professional, corporation, salary, office, business, career.” “Amy, Lisa, Sarah, Diana, Ann” “home, parents, children, family, marriage, wedding, relatives” “John, Paul, Mike, Kevin, Bill” Meaning ≠ Meaningful
  • 19. Meaning ≠ Meaningful “Harry, Katie, Jonathan, Nancy, Emily” “freedom, health, love, peace, friend, heaven, gentle, loyal, lucky, diploma, happy, laughter, vacation” Caliskan-Islam, Bryson, Narayanan (2016). Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187 [Cs]
  • 20. Meaning ≠ Meaningful “Jerome, Ebony, Jasmine, Latisha, Tia” Caliskan-Islam, Bryson, Narayanan (2016). Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187 [Cs] “abuse, filth, sickness, accident, death, grief, poison, assault, poverty, ugly, evil, agony, prison.”
  • 21. 29 30 31 32 33 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 400 450 500 550 600 650 700 750 Civil engineering doctorates Consumption of cheese correlation coeff: 0.974 http://tylervigen.com/discover Correlation ≠ Causation
  • 22. 7 9 11 13 15 17 19 21 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Movies with Nicolas Cage Deaths being hit by sports equipment correlation coeff: 0.762 http://tylervigen.com/discover , Wikipedia Commons Correlation ≠ Causation
  • 23. 2300 2500 2700 2900 3100 3300 3500 3700 3900 4.50E+07 5.00E+07 5.50E+07 6.00E+07 6.50E+07 7.00E+07 7.50E+07 8.00E+07 8.50E+07 2001 2003 2005 2007 2009 2011 Mobile phone subscriptions Death by falls correlation coeff: 0.980 Correlation ≠ Causation
  • 26. Computational Social Science Sources: Bohemia Interactive Simulations, http://youtu.be/G9P9bUTCdpA ; TRANSIMS: http://www.youtube.com/watch?v=mN7kq0ITAys ; Epstein, http://www.youtube.com/watch?v=wZZJCIGtVkw “…any change in policy will systematically alter the structure of econometric models” (1976)
  • 27. Correlation ≠ Causation Meaning ≠ Meaningful Past ≠ Future Data ≠ Reality
  • 28. Main References: > Hilbert (2016). Big Data for Development: A Review of Promises and Challenges. Development Policy Review, 34(1), 135–174. https://doi.org/10.1111/dpr.12142 > Hilbert, M. (2015). ICT4ICTD: Computational Social Science for Digital Development. 48th (HICSS) (pp. 2145–2157). IEEE Computer Society. https://doi.org/10.1109/HICSS.2015.258 > Gillings, Hilbert & Kemp (2016). Information in the Biosphere: Biological and Digital Worlds. Trends in Ecology & Evolution, 31(3), 180–189 www.martinhilbert.net/information-in-the-biosphere/ > Hilbert & López (2011). The world’s technological capacity to store, communicate and compute information. Science, 332, 6025, 60-65 www.martinhilbert.net/WorldInfoCapacity.html > Hilbert (2016). The bad news is that the digital access divide is here to stay: Domestically installed bandwidths among 172 countries for 1986–2014. Telecommunications Policy. www.martinhilbert.net/the-bad-news- is-that-the-digital-access-divide-is-here-to-stay