Martin Hilbert is a professor who studies the value of data for research and society. Some key points from the document are:
- Digital data storage has doubled every 2.5 years, with about 5 zettabytes stored globally as of 2014.
- Big data can be used to predict personal attributes and behaviors with accuracy, but may also reflect underlying human biases.
- While data may show correlations, correlations do not necessarily prove causation, and past patterns do not guarantee future outcomes. Data is not a perfect reflection of reality.
Big Data: Impact on Global Health and Clinical Decision MakingBedirhan Ustun
A primer on Big Data and some warnings:
Big Data is not a FAD
YOU are already using it…
It is here to stay
Big Data has Minimal Structure
Big Data Is usually Raw Data
It is NOT like a typical Relational Database
Big Data is available - and Less Expensive
Big Data is not collected for a purpose - has no map
It is your business – your time and money is at work
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...PAPIs.io
We live in a world of data, of big data. A big portion of this data has been generated by humans, and particularly through their mobile phones. In fact, there are almost as many mobile phones in the world as humans. The mobile phone is the piece of technology with the highest levels of adoption in human history. We carry them with us all through the day (and night, in many cases), leaving digital traces of our physical interactions. Mobile phones have become sensors of human activity in the large scale and also the most personal devices.
In my talk, I will present some of the work that we are doing at Telefonica Research in the area of human behavior understanding from data captured with mobile phones, and particularly our work in the area of Big Data for Social Good. I will highlight opportunities but also challenges that we would need to address in order to truly leverage this opportunity.
Nuria Oliver is a computer scientist and Scientific Director at Telefónica. She holds a Ph.D. from the Media Lab at MIT. She is one of the most cited female computer scientist in Spain, with her research having been cited by more than 8900 publications. She is well known for her work in computational models of human behavior, human computer-interaction, intelligent user interfaces, mobile computing and big data for social good.
Big Data: Impact on Global Health and Clinical Decision MakingBedirhan Ustun
A primer on Big Data and some warnings:
Big Data is not a FAD
YOU are already using it…
It is here to stay
Big Data has Minimal Structure
Big Data Is usually Raw Data
It is NOT like a typical Relational Database
Big Data is available - and Less Expensive
Big Data is not collected for a purpose - has no map
It is your business – your time and money is at work
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...PAPIs.io
We live in a world of data, of big data. A big portion of this data has been generated by humans, and particularly through their mobile phones. In fact, there are almost as many mobile phones in the world as humans. The mobile phone is the piece of technology with the highest levels of adoption in human history. We carry them with us all through the day (and night, in many cases), leaving digital traces of our physical interactions. Mobile phones have become sensors of human activity in the large scale and also the most personal devices.
In my talk, I will present some of the work that we are doing at Telefonica Research in the area of human behavior understanding from data captured with mobile phones, and particularly our work in the area of Big Data for Social Good. I will highlight opportunities but also challenges that we would need to address in order to truly leverage this opportunity.
Nuria Oliver is a computer scientist and Scientific Director at Telefónica. She holds a Ph.D. from the Media Lab at MIT. She is one of the most cited female computer scientist in Spain, with her research having been cited by more than 8900 publications. She is well known for her work in computational models of human behavior, human computer-interaction, intelligent user interfaces, mobile computing and big data for social good.
Convergence Partners has released its latest research report on big data and its meaning for Africa. The report argues that big data poses a threat to those it overlooks, namely a large percentage of Africa’s populace, who remain on big data’s periphery.
Data Science Innovations : Democratisation of Data and Data Science suresh sood
Data Science Innovations : Democratisation of Data and Data Science covers the opportunity of citizen data science lying at the convergence of natural language generation and discoveries in data made by the professions, not data scientists.
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Matthias Stürmer
The global datasphere is growing from 60 Zettabytes today to 175 Zettabytes in 2025. Much of this data and software is privately controlled by American and Chinese corporations with enormous market power. Only the seven largest big tech companies such as Microsoft, Facebook, Alibaba or Tencent already have a market capitalization of over USD 8700 billion, which is almost three times India's GDP. This trend is called data colonialism of the cyber space. What problems arise from this and how can they be solved? The concept of digitale sustainability addresses this challenge by presenting a new pathway towards greater data sovereignty.
Dati: La "quinta" rivoluzione dell'information technology - intervento di Mario Rasetti, Fondazione ISI, al Lunch Seminar "Big Data e Internet of Things" del 29 giugno 2015, organizzato dal CSI-Piemonte
NOAH Berlin '17 - Venture Capital Trends in Digital HealthMin-Sung Sean Kim
At the 2017 NOAH Conference in Berlin, Min-Sung Sean Kim provided attendees with an insight into the trends and opportunities driving forward the digital transformation of healthcare.
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASESABHISEK RATH
this explained how artificial intelligence can be used in agriculture and especially in plant pathology i.e., tracking plant diseases, use of robotics, drone in applying chemicals and other aspects.
Convergence Partners has released its latest research report on big data and its meaning for Africa. The report argues that big data poses a threat to those it overlooks, namely a large percentage of Africa’s populace, who remain on big data’s periphery.
Data Science Innovations : Democratisation of Data and Data Science suresh sood
Data Science Innovations : Democratisation of Data and Data Science covers the opportunity of citizen data science lying at the convergence of natural language generation and discoveries in data made by the professions, not data scientists.
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Matthias Stürmer
The global datasphere is growing from 60 Zettabytes today to 175 Zettabytes in 2025. Much of this data and software is privately controlled by American and Chinese corporations with enormous market power. Only the seven largest big tech companies such as Microsoft, Facebook, Alibaba or Tencent already have a market capitalization of over USD 8700 billion, which is almost three times India's GDP. This trend is called data colonialism of the cyber space. What problems arise from this and how can they be solved? The concept of digitale sustainability addresses this challenge by presenting a new pathway towards greater data sovereignty.
Dati: La "quinta" rivoluzione dell'information technology - intervento di Mario Rasetti, Fondazione ISI, al Lunch Seminar "Big Data e Internet of Things" del 29 giugno 2015, organizzato dal CSI-Piemonte
NOAH Berlin '17 - Venture Capital Trends in Digital HealthMin-Sung Sean Kim
At the 2017 NOAH Conference in Berlin, Min-Sung Sean Kim provided attendees with an insight into the trends and opportunities driving forward the digital transformation of healthcare.
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASESABHISEK RATH
this explained how artificial intelligence can be used in agriculture and especially in plant pathology i.e., tracking plant diseases, use of robotics, drone in applying chemicals and other aspects.
Similar to “Data for Development – the value of data for research and society” by Dr. Martin Hilbert (20)
Presented by Ms Diane Quarless, Director, ECLAC subregional headquarters for the Caribbean, at the LEARN Caribbean Research Data Workshop. http://learn-rdm.eu/en/workshops/eclac-mini-workshops/3rd-mini-workshop
Presented by Ms Bernadette Lewis, Secretary General, Caribbean Telecommunications Union at the LEARN Caribbean Research Data Workshop. http://learn-rdm.eu/en/workshops/eclac-mini-workshops/3rd-mini-workshop
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
Francesca Gottschalk - How can education support child empowerment.pptxEduSkills OECD
Francesca Gottschalk from the OECD’s Centre for Educational Research and Innovation presents at the Ask an Expert Webinar: How can education support child empowerment?
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Safalta Digital marketing institute in Noida, provide complete applications that encompass a huge range of virtual advertising and marketing additives, which includes search engine optimization, virtual communication advertising, pay-per-click on marketing, content material advertising, internet analytics, and greater. These university courses are designed for students who possess a comprehensive understanding of virtual marketing strategies and attributes.Safalta Digital Marketing Institute in Noida is a first choice for young individuals or students who are looking to start their careers in the field of digital advertising. The institute gives specialized courses designed and certification.
for beginners, providing thorough training in areas such as SEO, digital communication marketing, and PPC training in Noida. After finishing the program, students receive the certifications recognised by top different universitie, setting a strong foundation for a successful career in digital marketing.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
“Data for Development – the value of data for research and society” by Dr. Martin Hilbert
1. Martin Hilbert
Department of Communication
hilbert@UCDavis.edu
Data for Development
the value of data for research and society
2. Hilbert & López (2011).
The world’s technological
capacity to store,
communicate and compute
information.
Science, 332, 6025, 60-65
www.martinhilbert.net/WorldInfoCapacity.html
World’s Info Storage Capacity
in optimally compressed MB
Stored digital information has doubled every 2.5 years
≈ 5 ZB in 2014 (5 x 10 21 Bytes)
4,500 piles of printed books
3. Gillings, Hilbert & Kemp (2016)
Information in the Biosphere:
Biological and Digital Worlds.
Trends in Ecology & Evolution,
31(3), 180–189
www.martinhilbert.net/information-in-the-biosphere/
7.2 bn humans * 6.2 bn nucleotides = 1 x 1019 Bytes vs.
All DNA on Earth = 1.3 x 1037 Bytes
≈> 100 years!
5 x 1021 Bytes
5. Source: http://www.youtube.com/watch?v=wqjKTW3wJZ8
Big Data as Commodity
La vulnerabilidad financiera de consumidores:
"Socialmente influyente"
"Rural y apenas sobreviven"
“Etnia minoría luchando en segunda ciudad"
“Jubilado en vacío: Singles"
"Inicio duro: Padres jóvenes single"
"Mayores buscado oportunidades: ancianos
en busca de maneras de hacer dinero"
"Oldies but Goodies: ancianos ingenuos,
quieren creer su suerte puede cambiar”
US Senate. A Review of the Data Broker Industry: Collect, Use, and Sale of Consumer Data for Marketing Purposes, 2013)
6. TED-Ed. Jer Thorp(2013).
Visualizing the world’s
Twitter data;
The Economist. (2014, ).
Off the map.
Digital
Footprint
8am
9am
10am
there be
dragons
7. 96.2
38.8
16.5
29.5
9.8
0
10
20
30
40
50
60
70
80
90
100
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012* 2013*
Per100inhabitants
Global ICT developments, 2001-2013
Mobile-cellular telephone subscriptions
Individuals using the Internet
Fixed-telephone subscriptions
Active mobile-broadband subscriptions
Fixed (wired)-broadband subscriptions
Access to electricity
Note: * Estimate
Source: ITU World Telecommunication /ICT Indicators database
sample = universe
Using “meta-data”
records like call
duration and call
frequency, one can
predict socio-
economic,
demographic, and
other behavioral
trades with 80-85%
accuracy.
75 % among those with less than US$1 per day!
Naef, et al. (2014). Using Mobile Data for Development
Sources: Raento, M., Oulasvirta, A., & Eagle, N. (2009). Smartphones: An Emerging Tool for Social Scientists. Sociol. Methods & Research, 37(3), 426–454.
Frias-Martinez, V., & Frias-Martinez, E. (2014). Spectral clustering for sensing urban land use using Twitter activity. Engin. Appl. of Artificial Intell., 35, 237–245.
Frias-Martinez, V., & Virseda, J. (2013). Cell Phone Analytics: Scaling Human Behavior Studies into the Millions. ITID, 9(2), pp. 35–50.
Frias-Martinez, V., Frias-Martinez, E., & Oliver, N. (2010). A Gender-centric Analysis of Calling Behavior…. AAAI 201 Artificial Intelligence for Development.
Blumenstock, J. E., Gillick, D., & Eagle, N. (2010). Who’s Calling? Demographics of Mobile Phone Use in Rwanda. AAAI 201 Artificial Intelligence for Development.
8. Source: Youyou, W., Kosinski, M., & Stillwell, D. (2015). Computer-based personality judgments are more accurate than those made by humans. PNAS, 201418680.
Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits and attributes are predictable from digital records of human behavior. PNAS, 110(15), 5802–5805.
“[100-250] Facebook Likes, can be used
to automatically and accurately
predict…: sexual orientation, ethnicity,
religious and political views, personality
traits, intelligence, happiness, use of
addictive substances, parental
separation, age, and gender…”
Machine learning knows us better than we ourselves
9. Source: Stephens-Davidowitz, S. (2015). Searching for Sex. The New York Times.
2015, January 24. Rudder, C. Dataclysm: Who We Are. (Crown, 2014).
social science
3.5 million active
users in 2010
10. SDG Goal 8.7: …eradicate forced
labor, end modern slavery…
FRDM (Forced Labor Risk Determination & Mitigation)
54,000 products, services, and commodities:
- Services
- Materials
- Component parts
+ Origin
+ qualitative peer-reviewed reports
+ Ariba Network: largest B2B platform > 2 million companies
https://vimeo.com/92259979
11. International Center for Tropical
Agriculture; Colombian Government
Agriculture and Food Security; Colombia’s
National Federation of Rice Growers
[weather data] + [RICE crops data] +
+ algorithms from neuroscience / biology =>
=> climate change
Results localized for towns:
Saldaña: solar radiation during the grain-ripening stage
Espinal: sensitivity to warm nights
Low cost solutions: sowing crops in right period of time
Impact: 170 farmers avoided direct losses of $ 3.6 million +
productivity from 1 to 3 tons per hectare.
….now being scaled out through Colombia, Argentina, Nicaragua, Peru and Uruguay.
SDG Goal 2.4: …ensure sustainable food
production systems and implement resilient
agricultural practices that increase productivity
12. U.S. Bureau of Labor Statistics
o > 100s staff visiting > 90 cities
o 80,000 prices
o Cost: US$ 250 million/year
Real-time
Sources: http://www.pricestats.com/about-us/meet-the-team ; www.economist.com/node/21548242 ;
http://www.inflacionverdadera.com
PriceStats
o 17 staff
o 300 online retailers; > 70 countries
o daily 5,000,000 prices
13. http://www.eloyalty.com ; http://www.mattersight.com/ ; http://www.fastcompany.com/1706766/how-personality-test-designed-pick-astronauts-taking-pain-out-customer-support ; http://www.ssca.com/resources/articles/104-
the-history-of-the-process-communication-model-in-astronaut-selection ; http://www.forbes.com/forbes/2011/0214/entrepreneurs-kelly-conway-software-eloyalty-your-pain.html
Cook, Scott (October 2013). "Personality Matters: Behavioral analytics is now a reality in contact centres". Direct Marketing Magazine 26 (3): 5.
EMOTIONS-DRIVEN (30% of the population)
THOUGHTS-DRIVEN (25%)
REACTIONS-DRIVEN (20%)
OPINIONS-DRIVEN (10%)
REFLECTIONS-DRIVEN (10%)
ACTIONS-DRIVEN (5%)
Matching Personality Types:
Call average from 10 min to 5 min
Customer Satisfaction from 47 % to 92%
14. Data
o US$1 billion investment; core group of 40 engineers
(from Twitter, Google, Facebook, Craigslist, stem cell, professional poker players…)
o Project Narwhal: 16 million unique voter profiles:
email sign-ups, zip codes, profession, voter registrations, volunteering
& donation record, Tweets, Facebook postings and network ties, TV
Watching behavior through 20 million set-top boxes, etc.
o Ranking the 20% of Obama’s 2008 vote that shifted to
undecided on a 0-10 persuasion score
o 62,000 computer simulations of likely voter behavior
Outcome
o Obama paid 35% less per broadcast commercial than opponent Romney
(40,000 more spots on the air, spending $90 million less!)
o Present tailor made campaign promises (agreeable adds; etc)
o Guide volunteers in phone and door-to-door campaigns
o Email donation requests, raising $181 million/month
o Predict States voting outcome at an accuracy of 0.5 percent
o Change voting behavior of 78 % of targeted undecided voters through Facebook
Obama 2012 campaign
Sources: Woodie, A. (2013, June 7). Big Data Analytics Give Electoral Edge. Datanami. Kolb, J., & Kolb, J. (2013). The Big Data Revolution. CreateSpace Independent Publishing Platform. Madrigal, A. C. (2012,
November 16). When the Nerds Go Marching In. The Atlantic. Rutenberg (2013), Data You Can Believe In The Obama Campaign’s Digital Masterminds Cash In; NYT.
16. Homicide Parole candidates
o 60 – 70 % correct who commits homicide
Berk, R., Sherman, L., Barnes, G., Kurtz, E., & Ahlman, L. (2009). Forecasting murder within a population of probationers and parolees: a high
stakes application of statistical learning. Journal of the Royal Stat.Soc.: Series A, 172(1), 191–211. http://spectrum.ieee.org/podcast/at-
work/innovation/can-software-predict-repeat-offenders ; http://www.spiegel.de/netzwelt/web/in-santa-cruz-sagen-computer-verbrechen-
voraus-a-899422.html ;http://www.sfgate.com/default/article/Sci-fi-policing-predicting-crime-before-it-occurs-3725708.php ; Wikipedia
Commons; Scahill, J., & Greenwald, G. (2014). The NSA’s Secret Role in the U.S. Assassination Program. The Intercept.
JSOC drone operator: “Por supuesto, se supone que el
teléfono pertenece a un ser humano que es nefasto y
considerado un ‘combatiente enemigo ilegal.’”
Aquí es donde se pone muy turbio…”
"We kill people based on metadata"
Data ≠ Reality
26. Computational Social Science
Sources: Bohemia Interactive Simulations, http://youtu.be/G9P9bUTCdpA ; TRANSIMS: http://www.youtube.com/watch?v=mN7kq0ITAys ; Epstein, http://www.youtube.com/watch?v=wZZJCIGtVkw
“…any change in policy will systematically
alter the structure of econometric models”
(1976)
28. Main References:
> Hilbert (2016). Big Data for Development: A Review of Promises and
Challenges. Development Policy Review, 34(1), 135–174.
https://doi.org/10.1111/dpr.12142
> Hilbert, M. (2015). ICT4ICTD: Computational Social Science for Digital
Development. 48th (HICSS) (pp. 2145–2157). IEEE Computer Society.
https://doi.org/10.1109/HICSS.2015.258
> Gillings, Hilbert & Kemp (2016). Information in the Biosphere: Biological
and Digital Worlds. Trends in Ecology & Evolution, 31(3), 180–189
www.martinhilbert.net/information-in-the-biosphere/
> Hilbert & López (2011). The world’s technological capacity to store,
communicate and compute information. Science, 332, 6025, 60-65
www.martinhilbert.net/WorldInfoCapacity.html
> Hilbert (2016). The bad news is that the digital access divide is here to stay:
Domestically installed bandwidths among 172 countries for 1986–2014.
Telecommunications Policy. www.martinhilbert.net/the-bad-news-
is-that-the-digital-access-divide-is-here-to-stay