SlideShare a Scribd company logo
1 of 24
Big Data
and the
Question of Objectivity
Federica Russo
Philosophy | Humanities | Amsterdam
russofederica.wordpress.com | @federicarusso
Previous work
• Plantin JC and Russo F. (2016) D’abord les données,
ensuite la méthode? Big data et déterminisme en
sciences sociales. Socio 06, 97-115
• Plantin, JC (2018) Data cleaners for pristine datasets:
visibility and invisibility of data processors in social
science. Science, Technology and Human Values. DOI:
10.1177/0162243918781268
2
Overview
• The social sciences go big
– Quantitative social science
• A big question: what conception(s) of objectivity in
big-data social science practices?
– What practices?
– What objectivity?
• Why does it matter?
– Conceptual issues
– Practical implications
3
The social sciences go big
4
Big and quantitative
• The ‘first’ big data revolution in social science:
– Positivism and the birth of quantitative social science
• Possibility of analysing more data, using the tools of statistics
• Going quantitative helped the social science reach
the ‘realm of the sciences’
– And yet questions related to the objectivity of the social
sciences didn’t settle
5
Big and problematic
Current debates around big data and scholarship
Borgman (2015):
• Don’t conflate “ease of acquisition [of data] for ease of
analysis”
• Need theoretical as well as methodological framework
A choir of old and new data-philosophers (e.g. Sellars, Floridi,
Leonelli …)
• Data are not given
• Data, information, knowledge are not all the same
• Data are relational
• …
6
Lots of questions already asked
• How big / fast is ‘big’?
• How much theory in big automated
algorithms?
• What kind of reasoning? Inductive?
• What implications does the ‘big’ have at social
/ technical / scholarly level?
• ...
7
The question
What exactly do data curators want to achieve with big-data
practices?
Based on Plantin’s ethnographic study, a two-step answer:
1. Analysis of big-data practices in social science
2. Problematisation of 2 aspects of big-data practices:
a. Making the data curator visible / invisible [(in)visibility]
b. Standardisation of processes for data curation [standardisation]
In a nutshell: [a-b] force us to re-think the notion of objectivity
8
Big-data practices in social science
9
The manual processing
‘pipeline’
10
A ‘factory metaphor’
“We're more of an assembly line, and so it's
production type of work” Paul, Archive Manager
Employment conditions that characterize “invisible technicians” in science
(Shapin, 1989; Barley, Bechky, 1994; Star, Strauss, 1999)
– Strict division of roles
– Rhythm of work
– No skills development
– Short term employment and turn over
– Highly standardized work, routine
– Invisible contribution
• Why all this??
11
Making data ‘pristine’
“We want [the datasets] to be right, and everything
to read properly […] Trying to get that, so that the
future users when they get [the datasets], they get
everything in a pristine manner.” Paul, Archive Manager
12
Data processing and [in]visible labor
• Complete invisibility outside the archive
– No critique allowed of the datasets: “Don’t get carried away”
– Contacting the PI only as last resort
– Strict formatting for standardized output
• Complete visibility inside the archive
– Making all processing techniques explicit
– Processing history file + Quality check
– Homogenization of practices
13
‘Pristineness’
• Cleaning data twice: traces of original context + traces of
cleaning
• Reproduces erroneous conception of ‘raw data’ (Gitelman,
2013)
• Conceals contributions of data processors: protocol work
(Downey, 2014) data packaging (Leonelli, 2016)
14
The question of objectivity
15
Two main issues
[(in)visibility] and [standardisation]:
• Re-introduce old ideas about objectivity
• Exemplify some more recent ideas about
objectivity
• But also: ideas pulling in opposite
directions
16
The data curator must be invisible
from the outside
• Data users don’t know / need to know about the
process
• Focus on ‘end product’ (rather than process)
 Data are objectively clean, ready to (re)use
• No (interfering) curator behind data curation
• Objectivity is a property of data, not of the process
 An old ideal of objectivity: objects’ objectivity
 Kitcher: the Legend View of Neopositivism
17
The data curator must be visible
from the inside
• At any time in the data curation process, who the data
curator is and what s/he does must be visible, traceable,
transparent
• The process is objective as long as procedures are respected
• The curator is present at all times
 Objectivity lies in the procedure
• A more recent idea of procedural objectivity
– Montuschi, Little, Cardano, … :
• the social sciences can attain objectivity;
• objectivity is in the process, not in the object of science
18
Procedural objectivity: pulling in
opposite directions?
A good tool to have in the kit
• Liberate social sciences from
inferiority complex
• Can value role of data curators
• Helps understand where the
process can go wrong
• Increases objectivity of the ‘end
product’ by self-reflectively
work on process
• ...
A ‘procedural drift’ towards
obsessive standardisation?
• Can / should we be flexible
about procedure
• If so, do we lose or gain on
objectivity?
• Is objectivity just a matter of
procedure?
• What role is left to the data
curator then?
• ...
• What else does ‘obsessive
procedural objectivity’
presuppose?
19
Strong procedures and data
pristineness
• Much of [(in)visibility] and of [standardisation] rest
on the myth of raw data and of clean data
• Pristineness: data are cleaned twice (original PI and
of traces of cleaning)
• Here, along with the choir of data-philosophers
– No, data is not raw or clean
– No, you can’t just assume their cleanness abstracting from curation
procedures
– No, maybe they shouldn’t be cleaned up so much after all
– Yes, perhaps the social science need somewhat dirty data
20
To sum up and conclude
21
Social science practices go big
• The social sciences grew big already since Positivism
– Introduction and development of quantitative methods
– Demography and sociology; understanding and acting on
social phenomena
• In the era of big data, they grow even bigger
– More data: social media provide tons
– More practices: data curation and automated data
analyses
22
Big-data practices strive for
objectivity
Two relevant aspects of these practices:
[(in)visbility] and [standardisation]
Two notions of objectivity at play:
• [invisible] curators: the objects are objective
• [visible] curators: the procedures are objective
• [standardisation] of procedures: procedures are
objectives … too objective?
23
Why does it matter?
• An interesting ‘philosophy of science in practice’
question
– From the practice, bottom up crucial philosophical issues
– Objectivity, an evergreen of phil sci. But what new is at stake with big
data?
• Beyond scholarly questions
– Open data and open science
• Can we abstract from the alleged objectivity of these practices?
• When are data objective enough to be safely re-used?
– If [standardisation] doesn’t ensure it, what does?
– Should we strive for that kind of objectivity?
24

More Related Content

What's hot

What's hot (7)

NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
 
Data Science and its impact on society
Data Science and its impact on societyData Science and its impact on society
Data Science and its impact on society
 
Isolating values from big data with the help of four v’s
Isolating values from big data with the help of four v’sIsolating values from big data with the help of four v’s
Isolating values from big data with the help of four v’s
 
CODATA International Training Workshop in Big Data for Science for Researcher...
CODATA International Training Workshop in Big Data for Science for Researcher...CODATA International Training Workshop in Big Data for Science for Researcher...
CODATA International Training Workshop in Big Data for Science for Researcher...
 
DRIVE 2017 | 26 October - RESEARCH THROUGH DESIGN & MORE - Societal Challenge...
DRIVE 2017 | 26 October - RESEARCH THROUGH DESIGN & MORE - Societal Challenge...DRIVE 2017 | 26 October - RESEARCH THROUGH DESIGN & MORE - Societal Challenge...
DRIVE 2017 | 26 October - RESEARCH THROUGH DESIGN & MORE - Societal Challenge...
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 

Similar to Big data and the question of objectivity

Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyer
oiisdp
 

Similar to Big data and the question of objectivity (20)

Big data and the question of objectivity
Big data and  the question of objectivityBig data and  the question of objectivity
Big data and the question of objectivity
 
Data Science-1 (1).ppt
Data Science-1 (1).pptData Science-1 (1).ppt
Data Science-1 (1).ppt
 
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1
 
TL_Thompson.pptx.ppt
TL_Thompson.pptx.pptTL_Thompson.pptx.ppt
TL_Thompson.pptx.ppt
 
Mapping Movements: Social movement research and big data: critiques and alter...
Mapping Movements: Social movement research and big data: critiques and alter...Mapping Movements: Social movement research and big data: critiques and alter...
Mapping Movements: Social movement research and big data: critiques and alter...
 
Big data, new epistemologies and paradigm shifts
Big data, new epistemologies and paradigm shiftsBig data, new epistemologies and paradigm shifts
Big data, new epistemologies and paradigm shifts
 
Social physics
Social physicsSocial physics
Social physics
 
Internet Research Ethics CSSWS2015 Tutorial
Internet Research Ethics CSSWS2015 TutorialInternet Research Ethics CSSWS2015 Tutorial
Internet Research Ethics CSSWS2015 Tutorial
 
Social Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentSocial Graphs for Better Drug Development
Social Graphs for Better Drug Development
 
DBMS
DBMSDBMS
DBMS
 
"I always feel it must be great to be a hacker"
"I always feel it must be great to be a hacker" "I always feel it must be great to be a hacker"
"I always feel it must be great to be a hacker"
 
Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyer
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
50 Years of Data Science
50 Years of Data Science50 Years of Data Science
50 Years of Data Science
 
Ethical research in the era of Digital Studies : the French point of view
Ethical research in the era of Digital Studies : the French point of view Ethical research in the era of Digital Studies : the French point of view
Ethical research in the era of Digital Studies : the French point of view
 
Learning Analytics – Ethical questions and dilemmas
Learning Analytics  – Ethical questions and dilemmasLearning Analytics  – Ethical questions and dilemmas
Learning Analytics – Ethical questions and dilemmas
 
The Hidden Data of Social Media Rearch_CSS-winter-symposium
The Hidden Data of Social Media Rearch_CSS-winter-symposiumThe Hidden Data of Social Media Rearch_CSS-winter-symposium
The Hidden Data of Social Media Rearch_CSS-winter-symposium
 
Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014
 

More from University of Amsterdam and University College London

More from University of Amsterdam and University College London (20)

H-AI-BRID - Thinking and designing Human-AI systems
H-AI-BRID - Thinking and designing Human-AI systemsH-AI-BRID - Thinking and designing Human-AI systems
H-AI-BRID - Thinking and designing Human-AI systems
 
Time in QCA: a philosopher’s perspective
Time in QCA: a philosopher’s perspectiveTime in QCA: a philosopher’s perspective
Time in QCA: a philosopher’s perspective
 
Interconnected health-environmental challenges: Between the implosion of the ...
Interconnected health-environmental challenges: Between the implosion of the ...Interconnected health-environmental challenges: Between the implosion of the ...
Interconnected health-environmental challenges: Between the implosion of the ...
 
Trusting AI-generated contents: a techno-scientific approach
Trusting AI-generated contents: a techno-scientific approachTrusting AI-generated contents: a techno-scientific approach
Trusting AI-generated contents: a techno-scientific approach
 
Interconnected health-environmental challenges, Health and the Environment: c...
Interconnected health-environmental challenges, Health and the Environment: c...Interconnected health-environmental challenges, Health and the Environment: c...
Interconnected health-environmental challenges, Health and the Environment: c...
 
Who Needs “Philosophy of Techno- Science”?
Who Needs “Philosophy of Techno- Science”?Who Needs “Philosophy of Techno- Science”?
Who Needs “Philosophy of Techno- Science”?
 
Philosophy of Techno-Science: Whence and Whither
Philosophy of Techno-Science: Whence and WhitherPhilosophy of Techno-Science: Whence and Whither
Philosophy of Techno-Science: Whence and Whither
 
Charting the explanatory potential of network models/network modeling in psyc...
Charting the explanatory potential of network models/network modeling in psyc...Charting the explanatory potential of network models/network modeling in psyc...
Charting the explanatory potential of network models/network modeling in psyc...
 
The implosion of medical evidence: emerging approaches for diverse practices ...
The implosion of medical evidence: emerging approaches for diverse practices ...The implosion of medical evidence: emerging approaches for diverse practices ...
The implosion of medical evidence: emerging approaches for diverse practices ...
 
On the epistemic and normative benefits of methodological pluralism
On the epistemic and normative benefits of methodological pluralismOn the epistemic and normative benefits of methodological pluralism
On the epistemic and normative benefits of methodological pluralism
 
Socio-markers and information transmission
Socio-markers and information transmissionSocio-markers and information transmission
Socio-markers and information transmission
 
Disease causation and public health interventions
Disease causation and public health interventionsDisease causation and public health interventions
Disease causation and public health interventions
 
The life-world of health and disease and the design of public health interven...
The life-world of health and disease and the design of public health interven...The life-world of health and disease and the design of public health interven...
The life-world of health and disease and the design of public health interven...
 
Towards and epistemological and ethical XAI
Towards and epistemological and ethical XAITowards and epistemological and ethical XAI
Towards and epistemological and ethical XAI
 
Value-promoting concepts in the health sciences and public health
Value-promoting concepts in the health sciences and public healthValue-promoting concepts in the health sciences and public health
Value-promoting concepts in the health sciences and public health
 
Connecting the epistemology and ethics of AI
Connecting the epistemology and ethics of AIConnecting the epistemology and ethics of AI
Connecting the epistemology and ethics of AI
 
How is Who. Empowering evidence for sustainability and public health interven...
How is Who. Empowering evidence for sustainability and public health interven...How is Who. Empowering evidence for sustainability and public health interven...
How is Who. Empowering evidence for sustainability and public health interven...
 
High technologized justice – The road map for policy & regulation. Legaltech ...
High technologized justice – The road map for policy & regulation. Legaltech ...High technologized justice – The road map for policy & regulation. Legaltech ...
High technologized justice – The road map for policy & regulation. Legaltech ...
 
Connecting the epistemology and ethics of AI
Connecting the epistemology and ethics of AIConnecting the epistemology and ethics of AI
Connecting the epistemology and ethics of AI
 
Science and values. A two-way relations
Science and values. A two-way relationsScience and values. A two-way relations
Science and values. A two-way relations
 

Recently uploaded

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Recently uploaded (20)

Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 

Big data and the question of objectivity

  • 1. Big Data and the Question of Objectivity Federica Russo Philosophy | Humanities | Amsterdam russofederica.wordpress.com | @federicarusso
  • 2. Previous work • Plantin JC and Russo F. (2016) D’abord les données, ensuite la méthode? Big data et déterminisme en sciences sociales. Socio 06, 97-115 • Plantin, JC (2018) Data cleaners for pristine datasets: visibility and invisibility of data processors in social science. Science, Technology and Human Values. DOI: 10.1177/0162243918781268 2
  • 3. Overview • The social sciences go big – Quantitative social science • A big question: what conception(s) of objectivity in big-data social science practices? – What practices? – What objectivity? • Why does it matter? – Conceptual issues – Practical implications 3
  • 5. Big and quantitative • The ‘first’ big data revolution in social science: – Positivism and the birth of quantitative social science • Possibility of analysing more data, using the tools of statistics • Going quantitative helped the social science reach the ‘realm of the sciences’ – And yet questions related to the objectivity of the social sciences didn’t settle 5
  • 6. Big and problematic Current debates around big data and scholarship Borgman (2015): • Don’t conflate “ease of acquisition [of data] for ease of analysis” • Need theoretical as well as methodological framework A choir of old and new data-philosophers (e.g. Sellars, Floridi, Leonelli …) • Data are not given • Data, information, knowledge are not all the same • Data are relational • … 6
  • 7. Lots of questions already asked • How big / fast is ‘big’? • How much theory in big automated algorithms? • What kind of reasoning? Inductive? • What implications does the ‘big’ have at social / technical / scholarly level? • ... 7
  • 8. The question What exactly do data curators want to achieve with big-data practices? Based on Plantin’s ethnographic study, a two-step answer: 1. Analysis of big-data practices in social science 2. Problematisation of 2 aspects of big-data practices: a. Making the data curator visible / invisible [(in)visibility] b. Standardisation of processes for data curation [standardisation] In a nutshell: [a-b] force us to re-think the notion of objectivity 8
  • 9. Big-data practices in social science 9
  • 11. A ‘factory metaphor’ “We're more of an assembly line, and so it's production type of work” Paul, Archive Manager Employment conditions that characterize “invisible technicians” in science (Shapin, 1989; Barley, Bechky, 1994; Star, Strauss, 1999) – Strict division of roles – Rhythm of work – No skills development – Short term employment and turn over – Highly standardized work, routine – Invisible contribution • Why all this?? 11
  • 12. Making data ‘pristine’ “We want [the datasets] to be right, and everything to read properly […] Trying to get that, so that the future users when they get [the datasets], they get everything in a pristine manner.” Paul, Archive Manager 12
  • 13. Data processing and [in]visible labor • Complete invisibility outside the archive – No critique allowed of the datasets: “Don’t get carried away” – Contacting the PI only as last resort – Strict formatting for standardized output • Complete visibility inside the archive – Making all processing techniques explicit – Processing history file + Quality check – Homogenization of practices 13
  • 14. ‘Pristineness’ • Cleaning data twice: traces of original context + traces of cleaning • Reproduces erroneous conception of ‘raw data’ (Gitelman, 2013) • Conceals contributions of data processors: protocol work (Downey, 2014) data packaging (Leonelli, 2016) 14
  • 15. The question of objectivity 15
  • 16. Two main issues [(in)visibility] and [standardisation]: • Re-introduce old ideas about objectivity • Exemplify some more recent ideas about objectivity • But also: ideas pulling in opposite directions 16
  • 17. The data curator must be invisible from the outside • Data users don’t know / need to know about the process • Focus on ‘end product’ (rather than process)  Data are objectively clean, ready to (re)use • No (interfering) curator behind data curation • Objectivity is a property of data, not of the process  An old ideal of objectivity: objects’ objectivity  Kitcher: the Legend View of Neopositivism 17
  • 18. The data curator must be visible from the inside • At any time in the data curation process, who the data curator is and what s/he does must be visible, traceable, transparent • The process is objective as long as procedures are respected • The curator is present at all times  Objectivity lies in the procedure • A more recent idea of procedural objectivity – Montuschi, Little, Cardano, … : • the social sciences can attain objectivity; • objectivity is in the process, not in the object of science 18
  • 19. Procedural objectivity: pulling in opposite directions? A good tool to have in the kit • Liberate social sciences from inferiority complex • Can value role of data curators • Helps understand where the process can go wrong • Increases objectivity of the ‘end product’ by self-reflectively work on process • ... A ‘procedural drift’ towards obsessive standardisation? • Can / should we be flexible about procedure • If so, do we lose or gain on objectivity? • Is objectivity just a matter of procedure? • What role is left to the data curator then? • ... • What else does ‘obsessive procedural objectivity’ presuppose? 19
  • 20. Strong procedures and data pristineness • Much of [(in)visibility] and of [standardisation] rest on the myth of raw data and of clean data • Pristineness: data are cleaned twice (original PI and of traces of cleaning) • Here, along with the choir of data-philosophers – No, data is not raw or clean – No, you can’t just assume their cleanness abstracting from curation procedures – No, maybe they shouldn’t be cleaned up so much after all – Yes, perhaps the social science need somewhat dirty data 20
  • 21. To sum up and conclude 21
  • 22. Social science practices go big • The social sciences grew big already since Positivism – Introduction and development of quantitative methods – Demography and sociology; understanding and acting on social phenomena • In the era of big data, they grow even bigger – More data: social media provide tons – More practices: data curation and automated data analyses 22
  • 23. Big-data practices strive for objectivity Two relevant aspects of these practices: [(in)visbility] and [standardisation] Two notions of objectivity at play: • [invisible] curators: the objects are objective • [visible] curators: the procedures are objective • [standardisation] of procedures: procedures are objectives … too objective? 23
  • 24. Why does it matter? • An interesting ‘philosophy of science in practice’ question – From the practice, bottom up crucial philosophical issues – Objectivity, an evergreen of phil sci. But what new is at stake with big data? • Beyond scholarly questions – Open data and open science • Can we abstract from the alleged objectivity of these practices? • When are data objective enough to be safely re-used? – If [standardisation] doesn’t ensure it, what does? – Should we strive for that kind of objectivity? 24