SlideShare a Scribd company logo
1 of 50
HOW
How to follow actors through their traces
Exploiting digital traceability
Tommaso Venturini
tommaso.venturini@sciences-po.fr
The quali/quantitative divide
rich data, small populations
large populations, poor data
The problems with
either methods
Traditional quantitative methods:
• data collection:
standard discourses’ collection risks to hide the heterogeneity
• data treatment:
statistical comparison risks to hide divergences
Traditional qualitative methods:
•data collection:
risk of not being representative (beyond small controversies)
•data treatment:
problem of weighting different discourse
The problem with both
methods
rich data, small populations
large populations, poor data
Wide angle VS. telephoto
Follow the White Rabbit
why controversy mapping (and digital methods) will change
everything you know about sociology
Tommaso Venturini
tommaso.venturini@sciences-po.fr
The methodological strabismus
of social sciences…
Photo credit – tarout_sun via Flickr - ©
… is reified in social
theory
The collective self is not a simple epiphenomenon of
its morphologic base, precisely as the individual self is
not a simple efflorescence of the nervous system.
For the collective self to appear, a sui generis synthesis
of individual self has to be produced. This synthesis
creates a world of feelings, ideas, images that, once
come to life, follow their own laws.
Emile Durkheim, 1912
Le formes élémentaires de la vie
religieuse
Emergence
The emergent is unlike its components insofar as
these are incommensurable, and it cannot be reduced
to their sum or their difference (p. 412)
George Henry Lewes, 1875
Problems of Life and Mind
Cats and mice Jack Cohen, 2000
The Collapse of Chaos:
Discovering Simplicity in a Complex World
The amazing dictyostelium
discoideum Evelyn Fox Keller
“morphogenesis”
God save the ant
Queen
Theraulaz, G. & Bonabeau, E. (1999)
A brief history of stigmergy
Artificial Life, 5, 97–116
The Bootstrapping of
life
The bootstrapping of
intelligence
The bootstrapping of
society Thomas Hobbes, 1651
The Leviathan
Gulliver
sociology
Gulliver's Travels
Jonathan Swift, 1726
Diving in magma T. Venturini (2010)
Public Understanding of Science 19(3)
The Tarde vs Durkheim
controversy
Gabriel Tarde vs Emile Durkheim
Against emergence
It is surprising to see the men of sciences, so ready to
repeat that nothing is ever created from nothing,
admitting implicitly (as if it was self-evident) that the
connections among different beings can become beings
themselves (p. 67)
Tarde, 1893
Monadologie et sociologie
Against emergence
Supposons pour un instant qu'un de nos États humains, composé non de quelques milliers
mais de quelques quatrillions ou quintillions d'hommes hermétiquement clos et inaccessibles
individuellement (sorte de Chine infiniment plus populeuse encore et plus fermée) nous soit
simplement connu par les données de ses statisticiens, dont les chiffres portant sur de très
grands nombres se reproduiraient avec une extrême régularité. Quand une révolution
politique ou sociale, qui nous serait révélée par un grossissement ou un affaissement
brusques de certains de ces chiffres, se produirait dans cet État, nous aurions beau être
certains qu'il s'agit là d'un fait causé par des idées et des passions individuelles, nous
éviterions de nous perdre en conjectures superflues sur la nature de ces causes seules vraies,
mais impénétrables, et le plus sage nous paraîtrait d'expliquer tant bien que mal les chiffres
anormaux par des comparaisons ingénieuses avec les chiffres normaux habilement maniés.
Nous atteindrions ainsi au moins des résultats clairs et des vérités symboliques. Toutefois, il
importerait de temps en temps de nous rappeler le caractère purement symbolique de ces
vérités.
Tarde, 1893
Monadologie et sociologie
How to overcome the
quali-quantitative divide?
Inscriptions
as traces Callon, M., Law, J., & Rip, A. (1986)
Mapping the Dynamics of Science & Technology
Inscriptions
as traces Callon, M., Law, J., & Rip, A. (1986)
Mapping the Dynamics of Science & Technology
And then the web arrived…
<a href="http://www.medialab.sciencespo.fr/index.php"> click here </a>
And then the web arrived…
and Google with it
Brin, S., & Page, P. (1998).
The Anatomy of a Large-Scale Hypertextual Web Search Engine.
Computer Networks and ISDN Systems, 30(1-7), 107–117
Digital
traceability Latour, B. (2007). Beware your Imagination Leave Digital Traces.
Times Higher Literary Supplement.
Owen Gingerich, the great historian of astronomy, spent a life-time retrieving all the
annotations of all the copies of Copernicus’s first edition. He could thus give a precise
meaning to the rather empty notion of “Copernican revolution” and could show which parts
of the book everyone had read and misinterpreted. Nowadays, any scientist can do the same
for each portion of each article he or she has published so long as the local library has
bought a good package of digital data banks. But what is more extraordinary is that any
journalist can do so as well for the latest Madonna video or the dirtiest rumour about Prince
Harry’s love affairs.
In other words, the former distinction between the circulation of facts and the
dissemination of opinions has been erased in such a way that they are both graduating to
the same type of visibility — not a small advantage if we wish to disentangle the mixture
of facts and opinions that has become our usual diet of information
Digital
traceability
Once you can get information as bores, bytes, modem, sockets,
cables and so on, you have actually a more material way of looking
at what happens in Society.
Virtual Society thus, is not a thing of the future, it’s the
materialisation, the traceability of society. It renders visible because
of the obsessive necessity of materialising information into cables,
into data.
Latour, B. 1998
“Thought Experiments in Social Science: from the Social
Contract to Virtual Society”
From digital
traceability …
Bruno Latour (1998), argued that the Web is mainly of importance to
social science insofar as it makes possible new types of descriptions of
social life. According to Latour, the social integration of the Web
constitutes an event for social science because the social link becomes
traceable in this medium. Thus, social relations are established in a
tangible form as a material network connection. We take Latour’s claim of
the tangibility of the social as a point of departure in our search (p. 342).
Rogers, R., and Marres, N. 2002
“Frenchs candals on the Web, and on the streets:
A small experiment in stretching the limits of reported reality.”
Asian Journal of Social Science 66: 339-353.
… to digital
methods
The Internet is employed as a site of research for far more than just
online culture. The issue no longer is how much of society and
culture is online, but rather how to diagnose cultural change and
societal conditions with the Internet. The conceptual point of
departure for the research program is the recognition that the
Internet is not only an object of study, but also a source.
Rogers, R. 2009
The End of the Virtual: Digital Methods. Amsterdam
University Press.
Quali-quantitative
methods
Top 50 US blogs
Ben Fry, 2006
http://nymag.com/news/media/15972/
Datascapes exploration
Linkscape
Linkscape©
by Linkfluence©
médialab tools
Alice sociology Alice's Adventures in Wonderland
Lewis Carroll, 1865
Building on faults T. Venturini (2012)
Public Understanding of Science 21(7)
Beware!
1. More data means more noise
2. Digital data is not your data
Beware: more data
means more noise!
Taking “data mining”
seriously Yanacocha Gold Mine,
Cajamarca, Peru
An (pseudo-) exhaustive map
of the Web http://internet-map.net
Compulsive hoarding
A good
map of the Web politicosphere.blog.lemonde.fr
A good
map of the Web politicosphere.blog.lemonde.fr
How to search/query
Bisphenol
http://en.wikipedia.org/wiki/Bisphenol_A
Bisphenol heart diseases controversy
http://www.foxnews.com/health/2012/03/07/bpa-chemical-may-be-tied-to-heart-disease/
Bisphenol Melzer controversy
http://www.thelancet.com/journals/lancet/article/PIIS0140-6736(12)60496-6/fulltext
How to search/query
Bisphenol
http://en.wikipedia.org/wiki/Bisphenol_A
Bisphenol heart diseases controversy
http://www.foxnews.com/health/2012/03/07/bpa-chemical-may-be-tied-to-heart-disease/
Bisphenol Melzer controversy
http://www.thelancet.com/journals/lancet/article/PIIS0140-6736(12)60496-6/fulltext
BPA
Polycarbonate
endocrine disruptor
hearth desease
David Melzer
Food and Drug Administration
coronary artery disease
Monica Lind
Jeremy Pearson
Steven Hentges
Polycarbonate Global Group
Subject-specific keywords
• Proper names
• Name of institutions
• Toponyms
• Scientific/technical terminology
• Scientific references
• …
Improving your
query
• Exploit linguistic differences
• Go advanced (use search fields)
• Limit the time span
• Use search operators
• “exact” / -exclude / ~synonyms / * / OR / AND
Beware: digital data
is not your data!
Whose data
is this?
• Proliferation of new devices, genres and formats for the documentation
of social life… explosion of digital technologies that enable people to
report and comment upon social life.
• Routine generation of data about social life as part of social life. ‘Social
media’ platforms… embed the process of social data generation in
everyday practices.
• Development of online platforms and tools for the analysis of digital
social data. These days, most online platforms come with ‘analytics’
attached: a set of tools and services facilitating the analysis of the data
generated by said platforms.
Marres, N. (2011).
Re-distributing Methods:
Interventions in Digital Social Research.
Redistribution of
research methods
• Methods as usual (ex. Andrew Abbott, )
The techniques used by digital platforms have been long used in social sciences.
• Big methods (ex. Newman et al, 2007)
Digital traceability increases the quantity of social data thereby demanding use of
mathematical techniques of analysis.
• Virtual methods (ex. Christine Hine, 2000, 2005)
Digital media transform the quality of social practices and demand therefore increased
efforts of observations and interpretation.
• Digital methods (ex. Richard Rogers, 2009)
Digital platforms have their own methods that need to be understood and re-purposed
for social research.
• Re-mediation of methods (ex. Nortje Marres, 2011)
The techniques used by digital platforms have been long used in social sciences, but are
radically transformed the new context of their use.
Marres, N. (2011).
Re-distributing Methods:
Interventions in Digital Social Research.
More
redistribution
Less
Natural
experiments
Snow, J. (1855). On the Mode of
Communication of Cholera
Natural
experiments
Snow, J. (1855). On the Mode of
Communication of Cholera
Natural
experiments
Snow, J. (1855). On the Mode of
Communication of Cholera
tommaso.venturini@sciences-po.fr

More Related Content

What's hot

Dancing Together: the Fluidification of the Modern Mind
Dancing Together: the Fluidification of the Modern MindDancing Together: the Fluidification of the Modern Mind
Dancing Together: the Fluidification of the Modern MindINRIA - ENS Lyon
 
Kim Solez Singularity explained and promoted winter 2014
Kim Solez Singularity explained and promoted winter 2014Kim Solez Singularity explained and promoted winter 2014
Kim Solez Singularity explained and promoted winter 2014Kim Solez ,
 
Sovereignty and the_ufo
Sovereignty and the_ufoSovereignty and the_ufo
Sovereignty and the_ufogorin2008
 
Super-humans: Super-language?
Super-humans: Super-language?Super-humans: Super-language?
Super-humans: Super-language?Vasil Penchev
 
Techno-Utopianism in Silicon Valley
Techno-Utopianism in Silicon ValleyTechno-Utopianism in Silicon Valley
Techno-Utopianism in Silicon ValleyThomas Klaffke
 
[Challenge:Future] Transhumanism
[Challenge:Future] Transhumanism[Challenge:Future] Transhumanism
[Challenge:Future] TranshumanismChallenge:Future
 
Paradigms: International Council of Museums (ICOM) Committee on Documentation...
Paradigms: International Council of Museums (ICOM) Committee on Documentation...Paradigms: International Council of Museums (ICOM) Committee on Documentation...
Paradigms: International Council of Museums (ICOM) Committee on Documentation...Tom Moritz
 
Deeper Insights: Appendix 3
Deeper Insights: Appendix 3Deeper Insights: Appendix 3
Deeper Insights: Appendix 3528Hz TRUTH
 

What's hot (9)

Dancing Together: the Fluidification of the Modern Mind
Dancing Together: the Fluidification of the Modern MindDancing Together: the Fluidification of the Modern Mind
Dancing Together: the Fluidification of the Modern Mind
 
Kim Solez Singularity explained and promoted winter 2014
Kim Solez Singularity explained and promoted winter 2014Kim Solez Singularity explained and promoted winter 2014
Kim Solez Singularity explained and promoted winter 2014
 
Sovereignty and the_ufo
Sovereignty and the_ufoSovereignty and the_ufo
Sovereignty and the_ufo
 
Super-humans: Super-language?
Super-humans: Super-language?Super-humans: Super-language?
Super-humans: Super-language?
 
Techno-Utopianism in Silicon Valley
Techno-Utopianism in Silicon ValleyTechno-Utopianism in Silicon Valley
Techno-Utopianism in Silicon Valley
 
[Challenge:Future] Transhumanism
[Challenge:Future] Transhumanism[Challenge:Future] Transhumanism
[Challenge:Future] Transhumanism
 
Paradigms: International Council of Museums (ICOM) Committee on Documentation...
Paradigms: International Council of Museums (ICOM) Committee on Documentation...Paradigms: International Council of Museums (ICOM) Committee on Documentation...
Paradigms: International Council of Museums (ICOM) Committee on Documentation...
 
Deeper Insights: Appendix 3
Deeper Insights: Appendix 3Deeper Insights: Appendix 3
Deeper Insights: Appendix 3
 
Transhumanism
TranshumanismTranshumanism
Transhumanism
 

Similar to How to follow actors through their traces. Exploiting digital traceability

Quantified Self and Philosophy
Quantified Self and PhilosophyQuantified Self and Philosophy
Quantified Self and PhilosophyJoerg Blumtritt
 
111What Is the Elephant in the Digital RoomAny hi.docx
111What Is the Elephant in the Digital RoomAny hi.docx111What Is the Elephant in the Digital RoomAny hi.docx
111What Is the Elephant in the Digital RoomAny hi.docxmoggdede
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Darlene Cavalier
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileLEARN Project
 
The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)
The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)
The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)medialabSciencesPo
 
Keynote speech at the Digitale Praxen conference at Frankfurt University
Keynote speech at the Digitale Praxen conference at Frankfurt UniversityKeynote speech at the Digitale Praxen conference at Frankfurt University
Keynote speech at the Digitale Praxen conference at Frankfurt UniversityINRIA - ENS Lyon
 
Computer Technology Essay
Computer Technology EssayComputer Technology Essay
Computer Technology EssayDonna Harvey
 
Funding Truth in the Media with its Fake News
Funding Truth in the Media with its Fake NewsFunding Truth in the Media with its Fake News
Funding Truth in the Media with its Fake NewsPaul H. Carr
 
A Trip to Flatland: mapping or modeling in the social sciences
A Trip to Flatland: mapping or modeling in the social sciencesA Trip to Flatland: mapping or modeling in the social sciences
A Trip to Flatland: mapping or modeling in the social sciencesINRIA - ENS Lyon
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightJulien Houssiere
 
Escaping greatdivide coimbra
Escaping greatdivide coimbraEscaping greatdivide coimbra
Escaping greatdivide coimbraINRIA - ENS Lyon
 
Digital Scholarship Seminar: Implications of Data for the 21st-century Humanist
Digital Scholarship Seminar: Implications of Data for the 21st-century HumanistDigital Scholarship Seminar: Implications of Data for the 21st-century Humanist
Digital Scholarship Seminar: Implications of Data for the 21st-century HumanistRebecca Davis
 
Quantum computing key words for orientation (2017)
Quantum computing   key words for orientation (2017)Quantum computing   key words for orientation (2017)
Quantum computing key words for orientation (2017)Research Impulses
 
Digital Anthropology
Digital AnthropologyDigital Anthropology
Digital AnthropologyFifi Stahlman
 
LECTURE 7 - Cyberculture
LECTURE 7 - CybercultureLECTURE 7 - Cyberculture
LECTURE 7 - CybercultureKim Flintoff
 
Social Machines Paradigm
Social Machines ParadigmSocial Machines Paradigm
Social Machines ParadigmDavid De Roure
 
Social Machines Democratization
Social Machines DemocratizationSocial Machines Democratization
Social Machines DemocratizationDavid De Roure
 

Similar to How to follow actors through their traces. Exploiting digital traceability (20)

The Digital Divide
The Digital DivideThe Digital Divide
The Digital Divide
 
Quantified Self and Philosophy
Quantified Self and PhilosophyQuantified Self and Philosophy
Quantified Self and Philosophy
 
111What Is the Elephant in the Digital RoomAny hi.docx
111What Is the Elephant in the Digital RoomAny hi.docx111What Is the Elephant in the Digital RoomAny hi.docx
111What Is the Elephant in the Digital RoomAny hi.docx
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of Chile
 
Taking IT for Granted - David De Roure
Taking IT for Granted - David De RoureTaking IT for Granted - David De Roure
Taking IT for Granted - David De Roure
 
The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)
The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)
The Web and its Publics (by Tommaso Venturini & Jean-Philippe Cointet)
 
Keynote speech at the Digitale Praxen conference at Frankfurt University
Keynote speech at the Digitale Praxen conference at Frankfurt UniversityKeynote speech at the Digitale Praxen conference at Frankfurt University
Keynote speech at the Digitale Praxen conference at Frankfurt University
 
Computer Technology Essay
Computer Technology EssayComputer Technology Essay
Computer Technology Essay
 
Funding Truth in the Media with its Fake News
Funding Truth in the Media with its Fake NewsFunding Truth in the Media with its Fake News
Funding Truth in the Media with its Fake News
 
A Trip to Flatland: mapping or modeling in the social sciences
A Trip to Flatland: mapping or modeling in the social sciencesA Trip to Flatland: mapping or modeling in the social sciences
A Trip to Flatland: mapping or modeling in the social sciences
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media Spotlight
 
Escaping greatdivide coimbra
Escaping greatdivide coimbraEscaping greatdivide coimbra
Escaping greatdivide coimbra
 
Digital Scholarship Seminar: Implications of Data for the 21st-century Humanist
Digital Scholarship Seminar: Implications of Data for the 21st-century HumanistDigital Scholarship Seminar: Implications of Data for the 21st-century Humanist
Digital Scholarship Seminar: Implications of Data for the 21st-century Humanist
 
Quantum computing key words for orientation (2017)
Quantum computing   key words for orientation (2017)Quantum computing   key words for orientation (2017)
Quantum computing key words for orientation (2017)
 
Digital Anthropology
Digital AnthropologyDigital Anthropology
Digital Anthropology
 
LECTURE 7 - Cyberculture
LECTURE 7 - CybercultureLECTURE 7 - Cyberculture
LECTURE 7 - Cyberculture
 
Taking IT for Granted
Taking IT for GrantedTaking IT for Granted
Taking IT for Granted
 
Social Machines Paradigm
Social Machines ParadigmSocial Machines Paradigm
Social Machines Paradigm
 
Social Machines Democratization
Social Machines DemocratizationSocial Machines Democratization
Social Machines Democratization
 

More from INRIA - ENS Lyon

Actor-Network Theory as a Theory of Action
Actor-Network Theory as a Theory of ActionActor-Network Theory as a Theory of Action
Actor-Network Theory as a Theory of ActionINRIA - ENS Lyon
 
Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...
Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...
Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...INRIA - ENS Lyon
 
Dr. Jekyll and Mr. Hyde IPCC and the Double Logic of International Expertise
Dr. Jekyll and Mr. Hyde IPCC and the Double Logic of International ExpertiseDr. Jekyll and Mr. Hyde IPCC and the Double Logic of International Expertise
Dr. Jekyll and Mr. Hyde IPCC and the Double Logic of International ExpertiseINRIA - ENS Lyon
 
Digital methods - 1 : Introduction
Digital methods - 1 : IntroductionDigital methods - 1 : Introduction
Digital methods - 1 : IntroductionINRIA - ENS Lyon
 
Contropedia, and the question of analytically separating the medium and the m...
Contropedia, and the question of analytically separating the medium and the m...Contropedia, and the question of analytically separating the medium and the m...
Contropedia, and the question of analytically separating the medium and the m...INRIA - ENS Lyon
 
Climaps by EMAPS et Europeana2015
Climaps by EMAPS et Europeana2015Climaps by EMAPS et Europeana2015
Climaps by EMAPS et Europeana2015INRIA - ENS Lyon
 
On Continuity in Social Sciences
On Continuity in Social SciencesOn Continuity in Social Sciences
On Continuity in Social SciencesINRIA - ENS Lyon
 
From Before the Cradle: mapping online debates on c-section and family planning
From Before the Cradle: mapping online debates on c-section and family planningFrom Before the Cradle: mapping online debates on c-section and family planning
From Before the Cradle: mapping online debates on c-section and family planningINRIA - ENS Lyon
 
Mapping connectionswithheatmaps
Mapping connectionswithheatmapsMapping connectionswithheatmaps
Mapping connectionswithheatmapsINRIA - ENS Lyon
 
Journal of Digital Social Sciences
Journal of Digital Social SciencesJournal of Digital Social Sciences
Journal of Digital Social SciencesINRIA - ENS Lyon
 
Controversies Communication
Controversies CommunicationControversies Communication
Controversies CommunicationINRIA - ENS Lyon
 
Designing Controversies for the Public
Designing Controversies for the PublicDesigning Controversies for the Public
Designing Controversies for the PublicINRIA - ENS Lyon
 

More from INRIA - ENS Lyon (19)

Actor-Network Theory as a Theory of Action
Actor-Network Theory as a Theory of ActionActor-Network Theory as a Theory of Action
Actor-Network Theory as a Theory of Action
 
Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...
Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...
Actor‐Network Theory VS Network Analysis VS Digital Networks Are We Talking A...
 
Dr. Jekyll and Mr. Hyde IPCC and the Double Logic of International Expertise
Dr. Jekyll and Mr. Hyde IPCC and the Double Logic of International ExpertiseDr. Jekyll and Mr. Hyde IPCC and the Double Logic of International Expertise
Dr. Jekyll and Mr. Hyde IPCC and the Double Logic of International Expertise
 
Digital methods - 1 : Introduction
Digital methods - 1 : IntroductionDigital methods - 1 : Introduction
Digital methods - 1 : Introduction
 
Contropedia, and the question of analytically separating the medium and the m...
Contropedia, and the question of analytically separating the medium and the m...Contropedia, and the question of analytically separating the medium and the m...
Contropedia, and the question of analytically separating the medium and the m...
 
A Tale of Two Cities
A Tale of Two CitiesA Tale of Two Cities
A Tale of Two Cities
 
What isa border_kings
What isa border_kingsWhat isa border_kings
What isa border_kings
 
Climaps by EMAPS et Europeana2015
Climaps by EMAPS et Europeana2015Climaps by EMAPS et Europeana2015
Climaps by EMAPS et Europeana2015
 
On Continuity in Social Sciences
On Continuity in Social SciencesOn Continuity in Social Sciences
On Continuity in Social Sciences
 
From Before the Cradle: mapping online debates on c-section and family planning
From Before the Cradle: mapping online debates on c-section and family planningFrom Before the Cradle: mapping online debates on c-section and family planning
From Before the Cradle: mapping online debates on c-section and family planning
 
Mapping connectionswithheatmaps
Mapping connectionswithheatmapsMapping connectionswithheatmaps
Mapping connectionswithheatmaps
 
Welcome to Flatland
Welcome to FlatlandWelcome to Flatland
Welcome to Flatland
 
Why We Love Networks
Why We Love NetworksWhy We Love Networks
Why We Love Networks
 
Second Degree Objectivity
Second Degree ObjectivitySecond Degree Objectivity
Second Degree Objectivity
 
Journal of Digital Social Sciences
Journal of Digital Social SciencesJournal of Digital Social Sciences
Journal of Digital Social Sciences
 
Visual Network Analysis
Visual Network AnalysisVisual Network Analysis
Visual Network Analysis
 
Controversies Communication
Controversies CommunicationControversies Communication
Controversies Communication
 
Changer de Focale
Changer de FocaleChanger de Focale
Changer de Focale
 
Designing Controversies for the Public
Designing Controversies for the PublicDesigning Controversies for the Public
Designing Controversies for the Public
 

Recently uploaded

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesExploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesSanjay Willie
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Recently uploaded (20)

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesExploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

How to follow actors through their traces. Exploiting digital traceability

  • 1. HOW How to follow actors through their traces Exploiting digital traceability Tommaso Venturini tommaso.venturini@sciences-po.fr
  • 2. The quali/quantitative divide rich data, small populations large populations, poor data
  • 3. The problems with either methods Traditional quantitative methods: • data collection: standard discourses’ collection risks to hide the heterogeneity • data treatment: statistical comparison risks to hide divergences Traditional qualitative methods: •data collection: risk of not being representative (beyond small controversies) •data treatment: problem of weighting different discourse
  • 4. The problem with both methods rich data, small populations large populations, poor data Wide angle VS. telephoto
  • 5. Follow the White Rabbit why controversy mapping (and digital methods) will change everything you know about sociology Tommaso Venturini tommaso.venturini@sciences-po.fr The methodological strabismus of social sciences… Photo credit – tarout_sun via Flickr - ©
  • 6. … is reified in social theory The collective self is not a simple epiphenomenon of its morphologic base, precisely as the individual self is not a simple efflorescence of the nervous system. For the collective self to appear, a sui generis synthesis of individual self has to be produced. This synthesis creates a world of feelings, ideas, images that, once come to life, follow their own laws. Emile Durkheim, 1912 Le formes élémentaires de la vie religieuse
  • 7. Emergence The emergent is unlike its components insofar as these are incommensurable, and it cannot be reduced to their sum or their difference (p. 412) George Henry Lewes, 1875 Problems of Life and Mind
  • 8. Cats and mice Jack Cohen, 2000 The Collapse of Chaos: Discovering Simplicity in a Complex World
  • 9. The amazing dictyostelium discoideum Evelyn Fox Keller “morphogenesis”
  • 10. God save the ant Queen Theraulaz, G. & Bonabeau, E. (1999) A brief history of stigmergy Artificial Life, 5, 97–116
  • 13. The bootstrapping of society Thomas Hobbes, 1651 The Leviathan
  • 15. Diving in magma T. Venturini (2010) Public Understanding of Science 19(3)
  • 16. The Tarde vs Durkheim controversy Gabriel Tarde vs Emile Durkheim
  • 17. Against emergence It is surprising to see the men of sciences, so ready to repeat that nothing is ever created from nothing, admitting implicitly (as if it was self-evident) that the connections among different beings can become beings themselves (p. 67) Tarde, 1893 Monadologie et sociologie
  • 18. Against emergence Supposons pour un instant qu'un de nos États humains, composé non de quelques milliers mais de quelques quatrillions ou quintillions d'hommes hermétiquement clos et inaccessibles individuellement (sorte de Chine infiniment plus populeuse encore et plus fermée) nous soit simplement connu par les données de ses statisticiens, dont les chiffres portant sur de très grands nombres se reproduiraient avec une extrême régularité. Quand une révolution politique ou sociale, qui nous serait révélée par un grossissement ou un affaissement brusques de certains de ces chiffres, se produirait dans cet État, nous aurions beau être certains qu'il s'agit là d'un fait causé par des idées et des passions individuelles, nous éviterions de nous perdre en conjectures superflues sur la nature de ces causes seules vraies, mais impénétrables, et le plus sage nous paraîtrait d'expliquer tant bien que mal les chiffres anormaux par des comparaisons ingénieuses avec les chiffres normaux habilement maniés. Nous atteindrions ainsi au moins des résultats clairs et des vérités symboliques. Toutefois, il importerait de temps en temps de nous rappeler le caractère purement symbolique de ces vérités. Tarde, 1893 Monadologie et sociologie
  • 19. How to overcome the quali-quantitative divide?
  • 20. Inscriptions as traces Callon, M., Law, J., & Rip, A. (1986) Mapping the Dynamics of Science & Technology
  • 21. Inscriptions as traces Callon, M., Law, J., & Rip, A. (1986) Mapping the Dynamics of Science & Technology
  • 22. And then the web arrived… <a href="http://www.medialab.sciencespo.fr/index.php"> click here </a>
  • 23. And then the web arrived… and Google with it Brin, S., & Page, P. (1998). The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems, 30(1-7), 107–117
  • 24. Digital traceability Latour, B. (2007). Beware your Imagination Leave Digital Traces. Times Higher Literary Supplement. Owen Gingerich, the great historian of astronomy, spent a life-time retrieving all the annotations of all the copies of Copernicus’s first edition. He could thus give a precise meaning to the rather empty notion of “Copernican revolution” and could show which parts of the book everyone had read and misinterpreted. Nowadays, any scientist can do the same for each portion of each article he or she has published so long as the local library has bought a good package of digital data banks. But what is more extraordinary is that any journalist can do so as well for the latest Madonna video or the dirtiest rumour about Prince Harry’s love affairs. In other words, the former distinction between the circulation of facts and the dissemination of opinions has been erased in such a way that they are both graduating to the same type of visibility — not a small advantage if we wish to disentangle the mixture of facts and opinions that has become our usual diet of information
  • 25. Digital traceability Once you can get information as bores, bytes, modem, sockets, cables and so on, you have actually a more material way of looking at what happens in Society. Virtual Society thus, is not a thing of the future, it’s the materialisation, the traceability of society. It renders visible because of the obsessive necessity of materialising information into cables, into data. Latour, B. 1998 “Thought Experiments in Social Science: from the Social Contract to Virtual Society”
  • 26. From digital traceability … Bruno Latour (1998), argued that the Web is mainly of importance to social science insofar as it makes possible new types of descriptions of social life. According to Latour, the social integration of the Web constitutes an event for social science because the social link becomes traceable in this medium. Thus, social relations are established in a tangible form as a material network connection. We take Latour’s claim of the tangibility of the social as a point of departure in our search (p. 342). Rogers, R., and Marres, N. 2002 “Frenchs candals on the Web, and on the streets: A small experiment in stretching the limits of reported reality.” Asian Journal of Social Science 66: 339-353.
  • 27. … to digital methods The Internet is employed as a site of research for far more than just online culture. The issue no longer is how much of society and culture is online, but rather how to diagnose cultural change and societal conditions with the Internet. The conceptual point of departure for the research program is the recognition that the Internet is not only an object of study, but also a source. Rogers, R. 2009 The End of the Virtual: Digital Methods. Amsterdam University Press.
  • 28. Quali-quantitative methods Top 50 US blogs Ben Fry, 2006 http://nymag.com/news/media/15972/
  • 31. Alice sociology Alice's Adventures in Wonderland Lewis Carroll, 1865
  • 32. Building on faults T. Venturini (2012) Public Understanding of Science 21(7)
  • 33. Beware! 1. More data means more noise 2. Digital data is not your data
  • 34. Beware: more data means more noise!
  • 35. Taking “data mining” seriously Yanacocha Gold Mine, Cajamarca, Peru
  • 36. An (pseudo-) exhaustive map of the Web http://internet-map.net
  • 38. A good map of the Web politicosphere.blog.lemonde.fr
  • 39. A good map of the Web politicosphere.blog.lemonde.fr
  • 40. How to search/query Bisphenol http://en.wikipedia.org/wiki/Bisphenol_A Bisphenol heart diseases controversy http://www.foxnews.com/health/2012/03/07/bpa-chemical-may-be-tied-to-heart-disease/ Bisphenol Melzer controversy http://www.thelancet.com/journals/lancet/article/PIIS0140-6736(12)60496-6/fulltext
  • 41. How to search/query Bisphenol http://en.wikipedia.org/wiki/Bisphenol_A Bisphenol heart diseases controversy http://www.foxnews.com/health/2012/03/07/bpa-chemical-may-be-tied-to-heart-disease/ Bisphenol Melzer controversy http://www.thelancet.com/journals/lancet/article/PIIS0140-6736(12)60496-6/fulltext BPA Polycarbonate endocrine disruptor hearth desease David Melzer Food and Drug Administration coronary artery disease Monica Lind Jeremy Pearson Steven Hentges Polycarbonate Global Group
  • 42. Subject-specific keywords • Proper names • Name of institutions • Toponyms • Scientific/technical terminology • Scientific references • …
  • 43. Improving your query • Exploit linguistic differences • Go advanced (use search fields) • Limit the time span • Use search operators • “exact” / -exclude / ~synonyms / * / OR / AND
  • 44. Beware: digital data is not your data!
  • 45. Whose data is this? • Proliferation of new devices, genres and formats for the documentation of social life… explosion of digital technologies that enable people to report and comment upon social life. • Routine generation of data about social life as part of social life. ‘Social media’ platforms… embed the process of social data generation in everyday practices. • Development of online platforms and tools for the analysis of digital social data. These days, most online platforms come with ‘analytics’ attached: a set of tools and services facilitating the analysis of the data generated by said platforms. Marres, N. (2011). Re-distributing Methods: Interventions in Digital Social Research.
  • 46. Redistribution of research methods • Methods as usual (ex. Andrew Abbott, ) The techniques used by digital platforms have been long used in social sciences. • Big methods (ex. Newman et al, 2007) Digital traceability increases the quantity of social data thereby demanding use of mathematical techniques of analysis. • Virtual methods (ex. Christine Hine, 2000, 2005) Digital media transform the quality of social practices and demand therefore increased efforts of observations and interpretation. • Digital methods (ex. Richard Rogers, 2009) Digital platforms have their own methods that need to be understood and re-purposed for social research. • Re-mediation of methods (ex. Nortje Marres, 2011) The techniques used by digital platforms have been long used in social sciences, but are radically transformed the new context of their use. Marres, N. (2011). Re-distributing Methods: Interventions in Digital Social Research. More redistribution Less
  • 47. Natural experiments Snow, J. (1855). On the Mode of Communication of Cholera
  • 48. Natural experiments Snow, J. (1855). On the Mode of Communication of Cholera
  • 49. Natural experiments Snow, J. (1855). On the Mode of Communication of Cholera

Editor's Notes

  1. 27/08/12
  2. 27/08/12
  3. On the one hand, social sciences could use quantitative methods (surveys and statistics) to collect data on large population, but the data they collect would necessarily be relatively poor and superficial. On the other hand, they could use qualitative methods (interviews, focus group, observations) to collect rich and detailed data, but they were then forced to limit their investigation to small populations.
  4. 27/08/12
  5. Social science could observe many thing from far away (quantitative methods = wide angle) or have a close look to few things (qualitative methods = telephoto). Never could they maintain the span and the focus of their observation at the same time, nor change their focal length continuosly.
  6. Up until now, social sciences cannot use natural experiments either, because this type of experiments requires a detailed knowledge of a large number of subjects (Snow, for instance, had the complete map of the water distribution system of London, which allowed him to know which water company was serving each specific household). Unfortunately, these two conditions are seldom reunited in social sciences. Since their foundation, social sciences have always had to deal with a sort of methodological strabismus .
  7. To use another metaphor, this is what I call the ‘Gulliver sociology’.
  8. 27/08/12 In the previous unit we learnt how difficult is to study controversies. In this unit, we will discover that, luckily, there is at least one thing that can help us in this otherwise impossible mission. The one thing that can make the task of controversy mapping less helpless.
  9. 27/08/12 Hop-o&apos;-My-Thumb
  10. 27/08/12
  11. 27/08/12
  12. 27/08/12
  13. 27/08/12
  14. 27/08/12
  15. But this situation has started to change as soon as social scientists have stopped considering media (and electronic media in particular) just as an object of study…
  16. … and started considering them also as a possible source of data. Digital media have, in fact, a very interesting feature: all the interactions that they mediate becomes easily traceable and is often easily traced. Though these traces are not collected for the sake of social science (but for surveillance, marketing or for technical optimiszation), they can nonetheless be exploited by social scientists. Giving social sciences, for the first time in their history, access to plenty of data.
  17. These data concerns huge population as about one third of world population has access to the Internet and about half of it owns a mobile phone. Digital media are spreading like a immense carbon paper, tracing social phenomena to an extent that has never been possible before. As a proof of concept, in the image in the slide Paul Butler showed how it is possible to generate a very detailed map of the world by mapping friendships connections in Facebook.
  18. At the same time, this data are also as rich than the data collected with qualitative data. As a proof of concept, see the documentary on the life of American On Line user 711391. Drawing on a an accidental leak of AOL data, the documentary reports the three month complete search history of this user. The sequence of her queries (and nothing else) allows disturbingly intimate access to the life of this ”religious middle-aged and somewhat obese middle-aged lady from Houston Texas who is looking for a way to rejuvenate her sex life” (as we come to discover).
  19. What is most important, thanks to digital traceability is now possible to collect data that are rich and concerning large population at the same time , as convincingly demonstrated by the famous Google study on the detection of flu epidemics.
  20. In this study Google engineers identified the 45 search queries that best matched the flu curves released by the U.S. Centers for Disease Control (CDC). Then they combined the curves of this 45 queries and built and indicator that has an increadible mean correlation of 0.97 with CDC data.
  21. With the advantage that whereas the CDC needs about two weeks to collect and release the data on US flu epidemics, Google can calculate its indicator every day.
  22. (Google also made the same type of research possible to anyone and on any subject through Google Insight for Search and Google Correlate)
  23. (Google also made the same type of research possible to anyone and on any subject through Google Insight for Search and Google Correlate)
  24. (Google also made the same type of research possible to anyone and on any subject through Google Insight for Search and Google Correlate)
  25. (Google also made the same type of research possible to anyone and on any subject through Google Insight for Search and Google Correlate)
  26. From the point of view of social science, the change is dramatic. For the first time, it is possible to start imagining methods having both a large scope and a detailed focus, thereby overcoming the limitations of both quantitative and qualitative methods. The image in the slide is a good proof of concept. In this map of the US blogosphere in 2006 realized by Ben Fry, it is possible to observe zoom out to see the big picture and observe large-scale patterns (like the fact the the more visible websites link to the less visible one, but not the other way around – the so called preferential attachment), but also to zoom in and observe each individual connection. A new generation of quali-quantitative methods becomes therefore possible …
  27. This is a map digital tools and methods that we use at the médialab of Sciences Po. In this course (and in particular in the second semester) you will lean to use most of them.
  28. … and it becomes possible to move from the sociology of Gulliver to the sociology of Alice (as you know in her trip to Wonderland Alice can change her size at her will by drinking a magical potion and eating a magical cookie).
  29. 27/08/12
  30. 27/08/12
  31. 27/08/12
  32. 27/08/12
  33. 27/08/12
  34. 27/08/12
  35. 27/08/12
  36. The first challenge consists in taking the data mining metaphor seriously. Everyone who ever visited a gold mine knows well that what is striking about this type of landscape is the feeling of absence that dominate them. Where a mountain is supposed to be, there is a huge hole instead. Describing mining as the act of collecting gold and other precious materials is mistaking the aim for the practice. 0.1% of mining is about collecting precious substances, 99,9% of it is about removing tons and tons of rocks, sand and earth. Gold is the product of such absence, what is left when everything else is gone. The same is true for information mining: it is not about collecting as much data as possible; it is about getting rid of most of it. This is important, because the current ‘data deluge’ ideology, obsessed as it is with the question of collecting, storing, exploiting data, forgets that the careful selection of data is most important part of all scientific protocol.
  37. 27/08/12 An example will make our argument clear. The so-called Internet map is, to our knowledge, the largest publicly available map of the Web. As you can see, very little knowledge can be extracted from this map. All that we can see is that the Web is polarized by language (the color of the nodes) and that some nodes are (far) more connected than the other (size of the nodes). None of this is a surprise.
  38. 27/08/12 Beautiful and breathtaking as they may be, this kind of maps is useless for research purposes. This is not data mining, this is compulsive hoarding: a syndrome that is growing more and more serious among the data deluge fans.
  39. A good map of the Web is always limited in its ambition: it tries to represent a limited portion of the Web and the better this portion is delimited, the better is the map. In the example an interesting map of the French political blogosphere, realized by Linkfluence (a research partner of the médialab).
  40. Because the selection of the websites has been done carefully it is possible to use this map as a research tool and discover for example, that the extreme left and the extreme right have two very different position in French online politics: the first being little, spread out and central; the second being massive, clusterized and eccentric.
  41. 0.1% of Web-crawling is about collecting relevant websites, 99,9% of it is about removing irrelevant ones. That is why the most important button in all the crawling tools that we develop at the médialab (in the slide you see the old Navicrawler and the soon-to-be-release Hyphe) is the one allowing the exclusion of one website from the corpus. Providing us tools for filtering, delimiting, sieving data is the first contribution that we would like to have from CHI experts.
  42. 27/08/12 The first skill is ‘searching’ that is to say using a search engine. This is, by far, the most common way of finding information on the Web. All of you have already used search engines millions of times. And yet, it is important (and not only for the sake of controversy mapping) for you to understand the very specific movement of search engine querying. Contrarily to what you may think, this movement should not aim at expansion (finding more information), but at reduction. The problem with search engines is not that they return too little information, but that they return too much (and most of it is not relevant). Improving one’s queries is therefore an effort in finding more and more specific words capable to reduce the information reduce by the search engine.
  43. 27/08/12 In fact, the movement just described need to be precised. The aim of the research, of course, is not to reduce the quantity of information found, but to reduce the irrelevant information and increase the relevant one. This movement of concentration (or distillation) requires identifying a number of ‘specific keywords’ clearly focused on the subject of the research.
  44. 27/08/12 This subject-specific keywords can include proper names, name of institutions, toponyms, scientific/technical terminology, scientific references and in general all words or expression that are not polysemic or vague.
  45. 27/08/12 And here are some other advices on how to improve your queries
  46. In order to understand the revolution brought by digital traceability in controversy mapping and, more generally, to social science, we have to go back to a famous research conducted by the British epidemiologist John Snow at the middle of the XIX century. John Snow was trying to understand the mechanisms of diffusion of the cholera (one of the main death cause in UK). At the time, the dominant theory was that cholera was caused by pollution or a noxious form of &quot;bad air”. Snow, however, criticized by this theory and claimed instead that cholera germs were transported by infected water. Snow first tried to prove its theory by showing that one particularly severe cholera outbreak in London was centered around a particular water pump located in broad street. But how how to prove that these particular observation could be generalized to all cholera epidemics.
  47. Snow, of course, could not prove his theory by direct experiments on human beings and yet an experimental evidence was exactly he needed to convince the scientific community. Trying to solve this conundrum, Snow came up with the idea of ‘natural experiment’. First of all, he observed that the mortality rate in different households was strongly correlated with the company that provided them water. In particular, the houses supplied by the Southwark Company the mortality was almost six times higher that in the houses supplied by the Lambeth Company.
  48. But this proof was not sufficient, as other differences between the households could have explained the difference. Snow however had at his disposal the detailed map of the London water system and observed that the distribution network of Southwark and Lambeth intermingled in central London. Since in these district the households supplied by the two water company were side by side, Snow can easily assume that all other conditions were equal. In other words, it was as if London population had been divided randomly in an experimental group and a control group, a perfect experimental setting except that Snow had not prepared it himself, but just found it in ‘nature’.
  49. One of the main difference between natural science and social science is that the latter cannot reproduce the phenomena that they study in the controlled setting of the laboratory. Social sciences cannot rely on controlled experiments to investigate collective dynamics (and this is why the comics in the slide are funny). But can social sciences employ at least natural experiments?