SlideShare a Scribd company logo
OPEN EXPERIMENTS AND OPEN
SOURCE SOFTWARE
Jonathan Peirce
University of Nottingham
OPEN-SOURCE SOFTWARE
 Is free
 Is often more feature-rich/advanced
 Allows us to examine/change all the code
 Buggy?
     Young packages can be, but developers are usually
      very responsive to fixing bugs
     Mature packages aren’t (e.g. see Firefox, Thunderbird,
      GIMP, Linux, Python…)
   Unsustainable?
       Not once they reach critical mass
 Open-source software is good for science
 What about open-sourcing experiments?
A REPOSITORY OF OPEN-EXPERIMENTS?
   Goals:
     Reproducibility: Rather than running your interpretation of a
      study from its methods section, fetch the actual experiment
     Publicity: draw attention to your experiment as people
      browse the repository
     Education: a starting point to build an experiment for new
      users of a piece of software


   One-stop location to up/download entire experiments or
    components of them
   Platform/package independent (PsychoPy, PTB,
    Presentation…)
   Easy to upload, easy to browse
SIMILAR REPOSITORIES
 MatlabCentral File Exchange (proprietary)
 Figshare.com (for data)
SIMILAR REPOSITORIES
 MatlabCentral File Exchange (proprietary)
 Figshare.com (for data)

 Viperlib (for demos)

 RunMyCode.org (computational economics)
     Create a 'Companion website' for your paper
     The model code can be run directly on the site(!)
SIMILAR REPOSITORIES
 MatlabCentral File Exchange (proprietary)
 Figshare.com (for data)

 Viperlib (for demos)

 RunMyCode.org (computational economics)

 OpenScienceFramework.org (new)
       "The Open Science Framework (OSF) is an
        infrastructure for documenting, archiving, logging,
        sharing, and registering scientific projects. Tools are
        being designed to integrate open practices with a
        scientist's daily workflow rather than appending them ex
        post facto."
       See the OSF goals here:
           http://openscienceframework.org/project/4znZP/wiki/home
POTENTIAL CONCERNS
 People don’t want others to see their code
 People might run studies that they didn't actually
  understand
 Errors in studies might propagate more

 Why should someone else benefit from the hours I
  spent coding that experiment/stimulus?
 We don’t need this resource; we can make web
  pages and use code repositories (e.g. github)
 People will never use such a resource

 Someone will have to set it up and run it
PEOPLE DON’T WANT OTHERS TO SEE THEIR
CODE

   Why not?
     Most people write code for themselves, not for others to
      see
     Cleaning/documenting your code takes time
     Maybe you’re a little worried about someone finding a
      bug in your code?

   On the other hand
     Writing neat, clear code is good; it means fewer bugs
      and more-reusable code for yourself!
     Although we don’t like people finding our bugs, it is
      actually a good thing for science
     Some tools provide graphical interfaces which should
      reduce the anxiety
PEOPLE MIGHT RUN STUDIES THAT THEY
DIDN'T UNDERSTAND

   How?
     They might not realise some critical part of the setup
      (e.g a calibrated monitor)
     They might make an inappropriate change or use
      settings that aren't possible


   On the other hand
     Should we really be setting programming ability as a
      hurdle to running studies?
     Providing the base code (and some notes including
      some of the caveats) will reduce this problem
     Maybe the resource should point out that code does not
      replace the need for good supervision/education
ERRORS IN STUDIES MIGHT PROPAGATE MORE
   How?
       If a study contains a bug in code, and is re-used by
        another lab, the bug will tend to remain. If they re-wrote
        the code from scratch it would be gone


   On the other hand
     In reality, if the latter study finds a different result to the
      former, it just fails to get published because we don't
      know why the 2 studies differ. No advantage.
     If there were a bug and the code were available we
      would stand some chance of finding
WHY SHOULD SOMEONE ELSE BENEFIT?
 You've put a lot of effort into your building study
 Why should someone else just download it and use
  it for free?! Let them think of their own study!

   On the other hand;
       (Thank goodness the open-source developers don't
        think like that!)
       You would get to benefit from other people's work.
        Science benefits
       You should want people to build on your studies. That is
        in your interest
WE DON’T NEED THIS RESOURCE
   Why not?
       We could use code repositories (e.g. sourceforge,
        github etc) or our institutional websites


   But recall the goals:
     Replicability
     Publicity
     Education

   Open-source repositories are mostly designed for
    technically very literate, which limits the contributors
PEOPLE WILL NEVER USE SUCH A RESOURCE
   Really?
       Lots of 'do-gooders' have set up data repositories, but
        they're empty


   OK, so how would we get people to use an open-
    science repository?
     Encourage people that it really is good for them if
      people can extend their study easily
     Make it compulsory (e.g. via the journals)?
PEOPLE WILL NEVER USE SUCH A RESOURCE
   Really?:
       Lots of 'do-gooders' have set up data repositories, but they're
        empty

   OK, so how would we get people to use an open-
    science repository?
     Encourage people that it really is good for them if people can
      extend their study easily
     Make it compulsory (e.g. via the journals)?
     Provide a kite-mark, via the journals, for articles that can be
      fully replicated

    [since giving the talk I have discovered that 'kite-mark' is a
    purely British concept. It refers to a non-compulsory badge, from
    the British Standards Institute, showing that a product meats
    high quality standards]
REPRODUCIBLE RESEARCH STANDARD
   Stodden (2009) Enabling reproducible research:
    licensing scientific innovation. International Journal of
    Communications Law and Policy
       Potentially different levels of compliance with the standard:
           Verified: has already been verified in an independent lab
           Verifiable: the compendium (full set of research materials) is
            available to fully reproduce the study
           Semi-verifiable: not all materials have been released but the
            description of the work should allow replication
           Non-verifiable: the work requires materials or apparatus that are
            not typically available
       “Efforts are currently under way for the RRS to be an official
        mark of Science Commons. This would provide an easily
        identifiable logo and a clear definition for each level of
        reproducibility.”
IT WILL TAKE TIME AND EFFORT TO IMPLEMENT
 There will be some development time to building a
  site
 There might be further time needed to
  manage/screen the contributions
 (I'm too busy with PsychoPy)



   On the other hand;
     There are open-source tools already available to build
      academic repositories
     We might be able to piggy-back on another site
     Maybe the Open Science Framework will do all we want
SUMMARY
   Open-source software has improved scientific
       Productivity


   Open-source experiments could improve scientific
     Reproducability
     Education
     Productivity


   But we need;
       buy-in from the scientists (and possibly the journals)
       user-friendly resources
OPINIONS?

More Related Content

Viewers also liked

Photosynthesis
Photosynthesis Photosynthesis
Photosynthesis
Ahmed Aly
 
Gta liberty city stories
Gta liberty city storiesGta liberty city stories
Gta liberty city storiesdarurat33
 
Swimming Pool concept
Swimming Pool concept Swimming Pool concept
Swimming Pool concept
Ajay Jain
 
INTERIM REPORT REQUIREMENTS
INTERIM REPORT REQUIREMENTSINTERIM REPORT REQUIREMENTS
INTERIM REPORT REQUIREMENTSMd Azree
 
Demandware tco webinar final 12122012
Demandware tco webinar final 12122012Demandware tco webinar final 12122012
Demandware tco webinar final 12122012Thomas Roider
 
business ethics presentation 2002.ppts
business ethics presentation 2002.pptsbusiness ethics presentation 2002.ppts
business ethics presentation 2002.ppts
roniezee
 

Viewers also liked (7)

Pelan permainan dalaman 2013
Pelan permainan dalaman 2013Pelan permainan dalaman 2013
Pelan permainan dalaman 2013
 
Photosynthesis
Photosynthesis Photosynthesis
Photosynthesis
 
Gta liberty city stories
Gta liberty city storiesGta liberty city stories
Gta liberty city stories
 
Swimming Pool concept
Swimming Pool concept Swimming Pool concept
Swimming Pool concept
 
INTERIM REPORT REQUIREMENTS
INTERIM REPORT REQUIREMENTSINTERIM REPORT REQUIREMENTS
INTERIM REPORT REQUIREMENTS
 
Demandware tco webinar final 12122012
Demandware tco webinar final 12122012Demandware tco webinar final 12122012
Demandware tco webinar final 12122012
 
business ethics presentation 2002.ppts
business ethics presentation 2002.pptsbusiness ethics presentation 2002.ppts
business ethics presentation 2002.ppts
 

Similar to Open experiments and open-source

STM Innovations Seminar London
STM Innovations Seminar LondonSTM Innovations Seminar London
STM Innovations Seminar London
Philip Bourne
 
Creativity vs Best Practices
Creativity vs Best PracticesCreativity vs Best Practices
Creativity vs Best Practices
Supun Dissanayake
 
Research Objects for FAIRer Science
Research Objects for FAIRer Science Research Objects for FAIRer Science
Research Objects for FAIRer Science
Carole Goble
 
Scott Edmunds: Using FAIR principles for more Open & Democratic Science
Scott Edmunds: Using FAIR principles for more Open & Democratic ScienceScott Edmunds: Using FAIR principles for more Open & Democratic Science
Scott Edmunds: Using FAIR principles for more Open & Democratic Science
GigaScience, BGI Hong Kong
 
'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa
Nathan Shammah
 
2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group
Stian Soiland-Reyes
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
Carole Goble
 
Do you speak open science
Do you speak open science Do you speak open science
Do you speak open science
Paola Chiara Masuzzo
 
Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...
Open Knowledge Maps
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research Environment
David De Roure
 
Open PHACTS April 2017 Science webinar Workflow tools
Open PHACTS April 2017 Science webinar Workflow toolsOpen PHACTS April 2017 Science webinar Workflow tools
Open PHACTS April 2017 Science webinar Workflow tools
open_phacts
 
The Art Of Documentation for Open Source Projects
The Art Of Documentation for Open Source ProjectsThe Art Of Documentation for Open Source Projects
The Art Of Documentation for Open Source Projects
Ben Hall
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
Oscar Corcho
 
Reproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesReproducibility: 10 Simple Rules
Reproducibility: 10 Simple Rules
Annika Eriksson
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century Research
Ross Mounce
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
Carole Goble
 
Lab Hackathon Opening Presentation
Lab Hackathon Opening PresentationLab Hackathon Opening Presentation
Lab Hackathon Opening Presentation
Lab Hackaton Zimbabwe
 
Libraries meet research 2.0
Libraries meet research 2.0Libraries meet research 2.0
Libraries meet research 2.0
Guus van den Brekel
 

Similar to Open experiments and open-source (20)

STM Innovations Seminar London
STM Innovations Seminar LondonSTM Innovations Seminar London
STM Innovations Seminar London
 
Creativity vs Best Practices
Creativity vs Best PracticesCreativity vs Best Practices
Creativity vs Best Practices
 
Research Objects for FAIRer Science
Research Objects for FAIRer Science Research Objects for FAIRer Science
Research Objects for FAIRer Science
 
Scott Edmunds: Using FAIR principles for more Open & Democratic Science
Scott Edmunds: Using FAIR principles for more Open & Democratic ScienceScott Edmunds: Using FAIR principles for more Open & Democratic Science
Scott Edmunds: Using FAIR principles for more Open & Democratic Science
 
'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa
 
2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
Do you speak open science
Do you speak open science Do you speak open science
Do you speak open science
 
Promberger_paper
Promberger_paperPromberger_paper
Promberger_paper
 
Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research Environment
 
Promberger_paper
Promberger_paperPromberger_paper
Promberger_paper
 
Open PHACTS April 2017 Science webinar Workflow tools
Open PHACTS April 2017 Science webinar Workflow toolsOpen PHACTS April 2017 Science webinar Workflow tools
Open PHACTS April 2017 Science webinar Workflow tools
 
The Art Of Documentation for Open Source Projects
The Art Of Documentation for Open Source ProjectsThe Art Of Documentation for Open Source Projects
The Art Of Documentation for Open Source Projects
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
 
Reproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesReproducibility: 10 Simple Rules
Reproducibility: 10 Simple Rules
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century Research
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Lab Hackathon Opening Presentation
Lab Hackathon Opening PresentationLab Hackathon Opening Presentation
Lab Hackathon Opening Presentation
 
Libraries meet research 2.0
Libraries meet research 2.0Libraries meet research 2.0
Libraries meet research 2.0
 

Recently uploaded

From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 

Recently uploaded (20)

From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 

Open experiments and open-source

  • 1. OPEN EXPERIMENTS AND OPEN SOURCE SOFTWARE Jonathan Peirce University of Nottingham
  • 2. OPEN-SOURCE SOFTWARE  Is free  Is often more feature-rich/advanced  Allows us to examine/change all the code  Buggy?  Young packages can be, but developers are usually very responsive to fixing bugs  Mature packages aren’t (e.g. see Firefox, Thunderbird, GIMP, Linux, Python…)  Unsustainable?  Not once they reach critical mass  Open-source software is good for science  What about open-sourcing experiments?
  • 3. A REPOSITORY OF OPEN-EXPERIMENTS?  Goals:  Reproducibility: Rather than running your interpretation of a study from its methods section, fetch the actual experiment  Publicity: draw attention to your experiment as people browse the repository  Education: a starting point to build an experiment for new users of a piece of software  One-stop location to up/download entire experiments or components of them  Platform/package independent (PsychoPy, PTB, Presentation…)  Easy to upload, easy to browse
  • 4. SIMILAR REPOSITORIES  MatlabCentral File Exchange (proprietary)  Figshare.com (for data)
  • 5. SIMILAR REPOSITORIES  MatlabCentral File Exchange (proprietary)  Figshare.com (for data)  Viperlib (for demos)  RunMyCode.org (computational economics)  Create a 'Companion website' for your paper  The model code can be run directly on the site(!)
  • 6. SIMILAR REPOSITORIES  MatlabCentral File Exchange (proprietary)  Figshare.com (for data)  Viperlib (for demos)  RunMyCode.org (computational economics)  OpenScienceFramework.org (new)  "The Open Science Framework (OSF) is an infrastructure for documenting, archiving, logging, sharing, and registering scientific projects. Tools are being designed to integrate open practices with a scientist's daily workflow rather than appending them ex post facto."  See the OSF goals here:  http://openscienceframework.org/project/4znZP/wiki/home
  • 7. POTENTIAL CONCERNS  People don’t want others to see their code  People might run studies that they didn't actually understand  Errors in studies might propagate more  Why should someone else benefit from the hours I spent coding that experiment/stimulus?  We don’t need this resource; we can make web pages and use code repositories (e.g. github)  People will never use such a resource  Someone will have to set it up and run it
  • 8. PEOPLE DON’T WANT OTHERS TO SEE THEIR CODE  Why not?  Most people write code for themselves, not for others to see  Cleaning/documenting your code takes time  Maybe you’re a little worried about someone finding a bug in your code?  On the other hand  Writing neat, clear code is good; it means fewer bugs and more-reusable code for yourself!  Although we don’t like people finding our bugs, it is actually a good thing for science  Some tools provide graphical interfaces which should reduce the anxiety
  • 9. PEOPLE MIGHT RUN STUDIES THAT THEY DIDN'T UNDERSTAND  How?  They might not realise some critical part of the setup (e.g a calibrated monitor)  They might make an inappropriate change or use settings that aren't possible  On the other hand  Should we really be setting programming ability as a hurdle to running studies?  Providing the base code (and some notes including some of the caveats) will reduce this problem  Maybe the resource should point out that code does not replace the need for good supervision/education
  • 10. ERRORS IN STUDIES MIGHT PROPAGATE MORE  How?  If a study contains a bug in code, and is re-used by another lab, the bug will tend to remain. If they re-wrote the code from scratch it would be gone  On the other hand  In reality, if the latter study finds a different result to the former, it just fails to get published because we don't know why the 2 studies differ. No advantage.  If there were a bug and the code were available we would stand some chance of finding
  • 11. WHY SHOULD SOMEONE ELSE BENEFIT?  You've put a lot of effort into your building study  Why should someone else just download it and use it for free?! Let them think of their own study!  On the other hand;  (Thank goodness the open-source developers don't think like that!)  You would get to benefit from other people's work. Science benefits  You should want people to build on your studies. That is in your interest
  • 12. WE DON’T NEED THIS RESOURCE  Why not?  We could use code repositories (e.g. sourceforge, github etc) or our institutional websites  But recall the goals:  Replicability  Publicity  Education  Open-source repositories are mostly designed for technically very literate, which limits the contributors
  • 13. PEOPLE WILL NEVER USE SUCH A RESOURCE  Really?  Lots of 'do-gooders' have set up data repositories, but they're empty  OK, so how would we get people to use an open- science repository?  Encourage people that it really is good for them if people can extend their study easily  Make it compulsory (e.g. via the journals)?
  • 14. PEOPLE WILL NEVER USE SUCH A RESOURCE  Really?:  Lots of 'do-gooders' have set up data repositories, but they're empty  OK, so how would we get people to use an open- science repository?  Encourage people that it really is good for them if people can extend their study easily  Make it compulsory (e.g. via the journals)?  Provide a kite-mark, via the journals, for articles that can be fully replicated [since giving the talk I have discovered that 'kite-mark' is a purely British concept. It refers to a non-compulsory badge, from the British Standards Institute, showing that a product meats high quality standards]
  • 15. REPRODUCIBLE RESEARCH STANDARD  Stodden (2009) Enabling reproducible research: licensing scientific innovation. International Journal of Communications Law and Policy  Potentially different levels of compliance with the standard:  Verified: has already been verified in an independent lab  Verifiable: the compendium (full set of research materials) is available to fully reproduce the study  Semi-verifiable: not all materials have been released but the description of the work should allow replication  Non-verifiable: the work requires materials or apparatus that are not typically available  “Efforts are currently under way for the RRS to be an official mark of Science Commons. This would provide an easily identifiable logo and a clear definition for each level of reproducibility.”
  • 16. IT WILL TAKE TIME AND EFFORT TO IMPLEMENT  There will be some development time to building a site  There might be further time needed to manage/screen the contributions  (I'm too busy with PsychoPy)  On the other hand;  There are open-source tools already available to build academic repositories  We might be able to piggy-back on another site  Maybe the Open Science Framework will do all we want
  • 17. SUMMARY  Open-source software has improved scientific  Productivity  Open-source experiments could improve scientific  Reproducability  Education  Productivity  But we need;  buy-in from the scientists (and possibly the journals)  user-friendly resources

Editor's Notes

  1. This talk was given by Jonathan Peirceas part of the Open Science Symposium at ECVP 2012, organised by Lee de Wit. It may be distributed freely under creative commons (CC BY 3.0) http://creativecommons.org/licenses/by/3.0/
  2. See also Stodden (2009) The Legal Framework for Reproducible Scientific Research. IEEE Computing in Science & engineeringReproducible conditions:The full compendium is available on the InternetThe media components, including the original selection and arrangement of the data, are licensed under CC BY or released to the public domain under CC0The code components are licensed under one of Apache 2.0, the MIT License, or the Modified BSD license, or released to the public domain under CC0The data have been released into the public d. omain according to the Science Commons Open Data Protocol.