Software in the scientific literature:
Software mentions and a
provocative proposal
James Howison
Information School
University of Texas at Austin
This material is based upon work supported by the National
Science Foundation under Grant No. SMA-1064209.
@jameshowison
What does a citation do, anyway?
• Gives credit for contribution
– A key reward that drives activity in science
– Sits alongside publications, grants, promotions,
and prizes
– Rewards drive type of artifacts and collaboration
• Explains the method used
– Citations assist in knowing what was done
– Provenance
– Replication and extension
How problematic are current
practices?
• How is software mentioned in papers?
• How accessible and reusable is the software
mentioned?
• How well do these mentions perform the
functions of citation?
github.com/jameshowison/softcite
DOI: 10.6084/m9.figshare.1146366
Howison, J., & Bullard, J. (2015). Software in the scientific literature: Problems with
seeing, finding, and using software mentioned in the biology literature. Journal of the
Association for Information Science and Technology (JASIST), doi: 10.1002/asi.23538
Sample and Method
• 90 randomly selected articles from the biology
literature, published between 2000 and 2010.
• Journals stratified across Journal Impact
Factor to balance coverage with influence
Content analysis scheme
Manual content analysis (3 coders, Kappa)
1. Identify mentions
– Read article, locate a mention of a piece of software
2. Identify in-text characteristics of mention
– Name of software? URL? Date? Version number? In
bibliography? Cite to paper/manual/webpage?
3. Functions of mention
– Identifiable? Findable? Accessible? Source? Match
preferred citation?
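The slide reports agreement between coders only as "Kappa" and does not say which variant was used. As a minimal sketch, assuming Cohen's kappa computed for one pair of coders, agreement beyond chance could be calculated as follows. The label names and the two coders' codes are hypothetical, not data from the study.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two coders who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Share of items on which the two coders agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each coder's label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both coders used one identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical codes assigned by two coders to six mentions.
coder_1 = ["publication", "instrument", "name_only", "url", "instrument", "name_only"]
coder_2 = ["publication", "instrument", "name_only", "name_only", "instrument", "name_only"]
kappa = cohen_kappa(coder_1, coder_2)
print(f"kappa = {kappa:.2f}")
```

With three coders, one option is to report this value for each pair of coders; a multi-rater statistic such as Fleiss' kappa is another.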
https://github.com/jameshowison/softcite/blob/master/data/software-citation-coding.ttl
How many mentions?
• 59 articles mentioned software, 31 did not.
• There were 286 distinct mentions of software.
• Those mentions referred to 146 distinct pieces of
software.
– This includes general purpose (e.g., Microsoft
Excel) and science-specific software (e.g., DENZO,
BLAST).
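The counts above imply a few simple ratios. A small sketch of the arithmetic follows; the variable names are illustrative and the only inputs are the figures reported on the slide.

```python
# Counts reported on the slide.
articles_with_software = 59
articles_without_software = 31
mentions = 286
packages = 146

articles_total = articles_with_software + articles_without_software

# Derived ratios.
share_mentioning = articles_with_software / articles_total
mentions_per_article = mentions / articles_with_software
mentions_per_package = mentions / packages

print(f"{articles_total} articles in the sample")
print(f"{share_mentioning:.0%} of articles mention software")
print(f"{mentions_per_article:.1f} mentions per article that mentions software")
print(f"{mentions_per_package:.1f} mentions per distinct piece of software")
```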
Types of mentions
Mention Type | Example
Cite to Publication | … was calculated using biosys (Swofford & Selander 1981).
Cite to Project Name or Website | … using the program Autodecay version 4.0.29 PPC (Eriksson 1998). Reference List has: ERIKSSON, T. 1998. Autodecay, vers. 4.0.29 Stockholm: Department of Botany.
Like Instrument | … calculated by t-test using the Prism 3.0 software (GraphPad Software, San Diego, CA, USA).
URL in text | … freely available from http://www.cibiv.at/software/pda/.
In-text name mention only | … were analyzed using MapQTL (4.0) software.
Not even name mentioned | … was carried out using software implemented in the Java programming language.
Types of Mentions
Simpler Mention Kinds
By Strata?
What sort of software mentioned?
Proprietary software more likely to be
mentioned “like instrument”
How useful are these mentions?
Not much change across strata
Do mention types work differently?
Other findings
• Only 24% of journals had policies that
mentioned software, declining by strata.
– Rarely mention versions.
– Not clear that these are followed.
• Only 13–30% of packages make a
specific request for a particular type of
citation
– 32% of mentions didn’t follow the requested citation.
Visible citation formats as “nudge”
• Some disagreement about how important the
text of a publication is:
– Should effort focus on machine-readable “meta-data”
in publication repositories (not in the paper)?
– Or focus on human-readable formats in the paper?
• My position is that human-readable formats will
influence practice more quickly
• Formal, well-structured formats and policies act
as a “nudge” to shape how authors mention
software.
Software archiving
• Strong finding that many pieces of software were
not findable.
– 1 in 10 packages could not be found at all
– For only 1 in 20 packages could the specific version
be found (combination of missing version info and
missing versions online)
• Analogous to link-rot for URLs in publications
(Koehler, 1999)
• Need to influence how software is archived
– Is that a role for publishers? Escrow for non-open
software?
Part 2
But what are we working to incentivize anyway?
[Diagram of BLAST variants: NCBI BLAST, WU-BLAST, BLAST+, GPU-BLAST, CUDA-BLAST, AB-BLAST, CS-BLAST, Mac OS X Port, Compaq mods, Apple (A/G) BLAST, FSA-BLAST]
Howison, J., & Herbsleb, J. D. (2013). Incentives and
integration in scientific software production. In
Proceedings of the ACM Conference on Computer
Supported Cooperative Work (pp. 459–470). San
Antonio, TX.
Citation and collaboration
• What is the impact on collaboration of credit-
giving through citations?
• Can a citation (of any kind) incentivize an
ongoing collaboration able to do the work
needed to keep a piece of software
scientifically functional?
• Could a standard undermine collaboration
further?
Can citation incentivize maintenance?
• Software relies on other software
– Dependencies all the way down
– Software stacks change quickly (new opportunities,
new problems, new libraries)
• Scientists seek to extend the work of others, not
just re-execute it.
• Many re-implementations come from frustration
with poorly maintained software
– Software that wasn’t adjusted as its dependencies
changed
– Software that wasn’t updated with newer techniques
A modest proposal
1. Papers have full workflow available
2. Workflows have regression tests running on a
continuous integration system
3. Integration system pulls all new versions of
dependencies, executes regression tests.
4. On fail (build or tests) the paper is retracted.
Howison, J. (2014). Retract bit-rotten publications: Aligning incentives for sustaining scientific software.
In Working towards Sustainable Software for Science: Practice and Experiences (SuperComputing 2014
Workshop). New Orleans.
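The four steps of the proposal can be read as one cycle of a continuous integration job. The sketch below is only an illustration of that reading: `PaperStatus`, `check_paper`, and the two callables are hypothetical names, and how a real system would fetch dependencies or run a workflow is left to the caller.

```python
from enum import Enum
from typing import Callable


class PaperStatus(Enum):
    PUBLISHED = "published"
    RETRACTED = "retracted"


def check_paper(build: Callable[[], bool], run_tests: Callable[[], bool]) -> PaperStatus:
    """One CI cycle for a paper's workflow (steps 2 to 4 of the proposal).

    `build` is assumed to pull the newest versions of all dependencies and
    rebuild the workflow; `run_tests` is assumed to execute the regression
    tests. Each returns True on success.
    """
    if not build():
        return PaperStatus.RETRACTED  # build failed against new dependencies
    if not run_tests():
        return PaperStatus.RETRACTED  # regression tests failed
    return PaperStatus.PUBLISHED


# Stand-in callables; a real job would invoke the build and test tooling.
print(check_paper(build=lambda: True, run_tests=lambda: True).value)
print(check_paper(build=lambda: True, run_tests=lambda: False).value)
```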
Uh …
• Retraction too strong, you say?
Ok, let’s revisit step 4:
• On fail, the paper is marked “provisionally
non-extendable” and authors have some
period to fix before marked as “retired”.
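The revised step 4 describes a small state machine with a grace period. A minimal sketch follows, assuming a 90-day grace period (the slide only says "some period") and assuming that a passing run restores a paper to extendable status at any time. The names `Status`, `GRACE_PERIOD`, and `next_status` are hypothetical.

```python
from datetime import date, timedelta
from enum import Enum
from typing import Optional, Tuple


class Status(Enum):
    EXTENDABLE = "extendable"
    PROVISIONAL = "provisionally non-extendable"
    RETIRED = "retired"


# Assumption: the slide does not specify the length of the period.
GRACE_PERIOD = timedelta(days=90)


def next_status(
    ci_passed: bool, first_failure: Optional[date], today: date
) -> Tuple[Status, Optional[date]]:
    """Return the paper's status and the date its current failure began."""
    if ci_passed:
        # Assumption: any passing run clears the failure record.
        return Status.EXTENDABLE, None
    started = first_failure or today
    if today - started > GRACE_PERIOD:
        return Status.RETIRED, started
    return Status.PROVISIONAL, started


status, since = next_status(False, None, date(2015, 1, 1))
print(status.value, since)
```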
Could others fix papers?
• Why must the original authors be the ones to
fix maintenance issues?
– Attract new resources, motivate integration.
• Re-write Step 4 again:
– On fail, workflow is marked as “needing work”
– Anyone can contribute that work
• Those extending the work, grad students, citizen
scientists
– Anyone that succeeds is added as an Author
Added as an author??!?
• Just for fixing a bug?
Ok, fine. Let’s re-write the second half of step 4
again:
– Anyone maintaining a workflow and returning a
publication to full extendable status is:
• Added to paper as an acknowledgement
• Invited to a conference, given a prize
• Credited in a visible, public, system (think github
profile)
Takeaways
• Software citation practice is diverse and often fails
the functions of citation:
– “Like instrument” and “cite to publication” citations
give credit but fail to provide version information
– Other, informal mentions are better at giving versions
but often fail to give credit
• Software is frequently inaccessible
• Collaboration is counter-motivated by publication
• Bit-rotten papers should create opportunities to
earn reputation for scientific contribution.
Extras
Software packages found

Slide deck title: Software Citation and a Proposal (NSF workshop at Harvard Medical School)

Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in softwareDaniel S. Katz
 
Software Citation: Principles, Implementation, and Impact
Software Citation:  Principles, Implementation, and ImpactSoftware Citation:  Principles, Implementation, and Impact
Software Citation: Principles, Implementation, and ImpactDaniel S. Katz
 
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...James Howison
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panelDaniel S. Katz
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and PracticeDaniel S. Katz
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasMerce Crosas
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCarole Goble
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesTao Xie
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-HongLancaster University Library
 
Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?James Howison
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesDaniel S. Katz
 
Scientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityScientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityJames Howison
 
Open source vs. open data
Open source vs. open dataOpen source vs. open data
Open source vs. open datadata publica
 
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...CS, NcState
 
Research software susainability
Research software susainabilityResearch software susainability
Research software susainabilityDaniel S. Katz
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
InstructionsA SWOT analysis is used as a strategic planning tech.docx
InstructionsA SWOT analysis is used as a strategic planning tech.docxInstructionsA SWOT analysis is used as a strategic planning tech.docx
InstructionsA SWOT analysis is used as a strategic planning tech.docxpauline234567
 
MAT111–Spring2020 Name__________________________.docx
MAT111–Spring2020     Name__________________________.docxMAT111–Spring2020     Name__________________________.docx
MAT111–Spring2020 Name__________________________.docxalfredacavx97
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsAndrea Wiggins
 
Teaching Bioinformatics
Teaching BioinformaticsTeaching Bioinformatics
Teaching Bioinformaticsguest705f7e
 

Similar to Software Citation and a Proposal (NSF workshop at Havard Medical School) (20)

Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in software
 
Software Citation: Principles, Implementation, and Impact
Software Citation:  Principles, Implementation, and ImpactSoftware Citation:  Principles, Implementation, and Impact
Software Citation: Principles, Implementation, and Impact
 
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panel
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and Practice
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosas
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and Challenges
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong
 
Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
Scientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityScientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexity
 
Open source vs. open data
Open source vs. open dataOpen source vs. open data
Open source vs. open data
 
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
 
Research software susainability
Research software susainabilityResearch software susainability
Research software susainability
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
InstructionsA SWOT analysis is used as a strategic planning tech.docx
InstructionsA SWOT analysis is used as a strategic planning tech.docxInstructionsA SWOT analysis is used as a strategic planning tech.docx
InstructionsA SWOT analysis is used as a strategic planning tech.docx
 
MAT111–Spring2020 Name__________________________.docx
MAT111–Spring2020     Name__________________________.docxMAT111–Spring2020     Name__________________________.docx
MAT111–Spring2020 Name__________________________.docx
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna Workflows
 
Teaching Bioinformatics
Teaching BioinformaticsTeaching Bioinformatics
Teaching Bioinformatics
 

More from James Howison

How to Ask questions people want to answer
How to Ask questions people want to answerHow to Ask questions people want to answer
How to Ask questions people want to answerJames Howison
 
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...James Howison
 
Open Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentOpen Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentJames Howison
 
Howison CAREER journey - NSF CISE CAREER Workshop 2016
Software Citation and a Proposal (NSF workshop at Harvard Medical School)

  • 1. Software in the scientific literature: Software mentions and a provocative proposal James Howison Information School University of Texas at Austin This material is based upon work supported by the National Science Foundation under Grant No. SMA-1064209. @jameshowison
  • 2. What does a citation do, anyway? • Gives credit for contribution – A key reward that drives activity in science – Sits alongside publications, grants, promotions, and prizes – Rewards drive type of artifacts and collaboration • Explains the method used – Citations assist in knowing what was done – Provenance – Replication and extension @jameshowison
  • 3. How problematic are current practices? • How is software mentioned in papers? • How accessible and reusable is the software mentioned? • How well do these mentions perform the functions of citation? github.com/jameshowison/softcite DOI: 10.6084/m9.figshare.1146366 Howison, J., & Bullard, J. (2015). Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature. Journal of the Association for Information Science and Technology (JASIST), doi: 10.1002/asi.23538 @jameshowison
  • 4. Sample and Method • 90 randomly selected articles from biology literature, articles published between 2000 and 2010. • Journals stratified across Journal Impact Factor to balance coverage with influence @jameshowison
  • 5. Content analysis scheme Manual content analysis (3 coders, Kappa) 1. Identifying mentions – Read article, locate a mention of a piece of software 2. Identify in-text characteristics of mention – Name of software? URL? Date? Version number? In bibliography? Cite to paper/manual/webpage? 3. Functions of mention – Identifiable? Findable? Accessible? Source? Match preferred citation? @jameshowison
  • 7. How many mentions? • 59 articles mentioned software, 31 did not. • There were 286 distinct mentions of software. • Those mentions were to 146 distinct pieces of software. – This includes general purpose (e.g., Microsoft Excel) and science-specific software (e.g., DENZO, BLAST). @jameshowison
  • 8. Types of mentions (mention type: example)
      – Cite to Publication: “… was calculated using biosys (Swofford & Selander 1981).”
      – Cite to Project Name or Website: “… using the program Autodecay version 4.0.29 PPC (Eriksson 1998).” Reference list has: “ERIKSSON, T. 1998. Autodecay, vers. 4.0.29 Stockholm: Department of Botany.”
      – Like Instrument: “… calculated by t-test using the Prism 3.0 software (GraphPad Software, San Diego, CA, USA).”
      – URL in text: “… freely available from http://www.cibiv.at/software/pda/ .”
      – In-text name mention only: “… were analyzed using MapQTL (4.0) software.”
      – Not even name mentioned: “… was carried out using software implemented in the Java programming language.”
    @jameshowison
  • 12. What sort of software mentioned? @jameshowison
  • 13. Proprietary software more likely to be mentioned “like instrument” @jameshowison
  • 14. How useful are these mentions? @jameshowison
  • 15. Not much change across strata @jameshowison
  • 16. Do mention types work differently? @jameshowison
  • 17. Other findings • Only 24% of journals had policies that mentioned software, declining by strata. – Rarely mention versions. – Not clear that these are followed. • Only 13–30% of packages make a specific request for a particular type of citation – 32% of mentions didn’t follow the requested citation. @jameshowison
  • 18. Visible citation formats as “nudge” • Some disagreement about how important the text of a publication is: – Should effort focus on machine readable “meta-data” in publication repositories (not in paper)? – Or focus on human readable formats in the paper? • My position is that human readable will influence practice more quickly • Formal, well-structured formats and policies act as a “nudge” to shape how authors mention software. @jameshowison
  • 19. Software archiving • Strong finding that many pieces of software were not findable. – 1 in 10 packages could not be found at all – For only 1 in 20 packages could the specific version be found (combination of missing version info and missing versions online) • Analogous to link-rot for URLs in publications (Koehler, 1999) • Need to influence how software is archived – Is that a role for publishers? Escrow for non-open software? @jameshowison
  • 20. Part 2 But what are we working to incentivize anyway? @jameshowison
  • 21. NCBI BLAST, WU-BLAST, BLAST+, GPU-BLAST, CUDA-BLAST, AB-BLAST, CS-BLAST, Mac OS X Port, Compaq mods, Apple (A/G) BLAST, FSA-BLAST @jameshowison Howison, J., & Herbsleb, J. D. (2013). Incentives and integration in scientific software production. In Proceedings of the ACM Conference on Computer Supported Cooperative Work (pp. 459–470). San Antonio, TX.
  • 22. Citation and collaboration • What is the impact on collaboration of credit-giving through citations? • Can a citation (of any kind) incentivize an ongoing collaboration able to do the work needed to keep a piece of software scientifically functional? • Could a standard undermine collaboration further? @jameshowison
  • 23. Can citation incentivize maintenance? • Software relies on other software – Dependencies all the way down – Software stacks change quickly (new opportunities, new problems, new libraries) • Scientists seek to extend the work of others, not just re-execute it. • Many re-implementations come from frustration with poorly maintained software – Software that wasn’t adjusted as its dependencies changed – Software that wasn’t updated with newer techniques @jameshowison
  • 24. A modest proposal 1. Papers have full workflow available 2. Workflows have regression tests running on a continuous integration system 3. Integration system pulls all new versions of dependencies, executes regression tests. 4. On fail (build or tests) the paper is retracted. @jameshowison Howison, J. (2014). Retract bit-rotten publications: Aligning incentives for sustaining scientific software. In Working towards Sustainable Software for Science: Practice and Experiences (SuperComputing 2014 Workshop). New Orleans.
  • 25. Uh … • Retraction too strong, you say? Ok, let’s revisit step 4: • On fail, the paper is marked “provisionally non-extendable” and authors have some period to fix before marked as “retired”. @jameshowison
  • 26. Could others fix papers? • Why must the original authors be the ones to fix maintenance issues? – Attract new resources, motivate integration. • Re-write Step 4 again: – On fail, workflow is marked as “needing work” – Anyone can contribute that work • Those extending the work, grad students, citizen scientists – Anyone that succeeds is added as an Author @jameshowison
  • 27. Added as an author??!? • Just for fixing a bug? Ok, fine. Let’s re-write the second half of step 4 again: – Anyone maintaining a workflow and returning a publication to full extendable status is: • Added to paper as an acknowledgement • Invited to a conference, Given a prize • Credited in a visible, public, system (think github profile) @jameshowison
  • 28. Takeaways • Software citation is diverse and fails functions: – “Like instrument” and “cite to publication” citations give credit but fail to provide version information – Other, informal mentions, better at versions but often fail to give credit • Software is frequently inaccessible • Collaboration is counter-motivated by publication • Bit-rotten papers should create opportunities to earn reputation for scientific contribution. @jameshowison
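
The successive rewrites of step 4 in slides 24 to 27 amount to a small status machine for a paper whose workflow is re-tested against fresh dependencies. The following Python is only a minimal sketch of that idea, not an implementation described in the talk. The names `Paper`, `Status`, `record_ci_run`, and the 90-day `GRACE_DAYS` value are all hypothetical (the slides say only "some period"), and letting a retired paper be restored by a later fix is likewise an assumption.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Status(Enum):
    EXTENDABLE = "extendable"
    PROVISIONALLY_NON_EXTENDABLE = "provisionally non-extendable"
    RETIRED = "retired"


# Hypothetical grace period; the slides only say "some period".
GRACE_DAYS = 90


@dataclass
class Paper:
    title: str
    status: Status = Status.EXTENDABLE
    days_failing: int = 0
    credited_maintainers: List[str] = field(default_factory=list)


def record_ci_run(paper: Paper, passed: bool,
                  fixed_by: Optional[str] = None) -> Paper:
    """Update a paper after one daily CI run against new dependency versions."""
    if passed:
        # A passing run restores full status. If the paper had been failing,
        # credit whoever repaired the workflow (slide 27).
        if paper.status is not Status.EXTENDABLE and fixed_by:
            paper.credited_maintainers.append(fixed_by)
        paper.status = Status.EXTENDABLE
        paper.days_failing = 0
        return paper

    # A failing build or test run starts (or continues) the grace period.
    paper.days_failing += 1
    if paper.days_failing > GRACE_DAYS:
        paper.status = Status.RETIRED
    else:
        paper.status = Status.PROVISIONALLY_NON_EXTENDABLE
    return paper


if __name__ == "__main__":
    paper = Paper("Example workflow paper")
    record_ci_run(paper, passed=False)
    print(paper.status.value)
    record_ci_run(paper, passed=True, fixed_by="a grad student")
    print(paper.status.value, paper.credited_maintainers)
```

Keeping the credit list separate from the author list mirrors the final rewrite of step 4, where maintainers are acknowledged and publicly credited rather than added as authors.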

Editor's Notes

  1. Differences in the sample, tending to informalism in lower impact factor journals, but confidence intervals overlap across strata.
  2. Combine our codes to identify different kinds of software (at least for those that we could find). Note that “non-commercial” largely means “written for scientists but not released as open source”; “Open source” combines code written for scientists and general purpose open source tools.
  3. Wide diversity, but “like instrument” mentions much more common for proprietary (commercial) software, while cite to publication much more likely for non-commercial and open source
  4. Overall the practices of mentioning code are useful for identifying and mostly for finding software, but useless for anything requiring a version (important in replication and extension). Happily, around 80% of mentions make some effort to credit those responsible for providing the software.
  5. Neither cite to publication nor like instrument mentions do a good job on versioning: the templates for these types of citations simply don’t include this information; authors can include it but aren’t driven to do so. Note that the “other” mention types (informal mentions, like name in text or URL in footnote) work hard to identify the software but are significantly worse at crediting the relevant authors.
  6. “Social proof” and “demonstration effects”: there is some evidence for this in the way that different kinds of mentions performed different functions better or worse.
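
The headline counts on slides 4 and 7 can be turned into a few simple ratios, and note 1's remark about overlapping confidence intervals can be illustrated with an interval for a single proportion. The sketch below uses only the counts stated in the slides. The choice of a Wilson interval and the function name `wilson_interval` are assumptions, since neither the slides nor the notes say how the intervals were computed.

```python
import math

ARTICLES = 90          # sampled articles (slide 4)
ARTICLES_WITH_SW = 59  # articles that mentioned software (slide 7)
MENTIONS = 286         # distinct mentions of software (slide 7)
PACKAGES = 146         # distinct pieces of software (slide 7)


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple:
    """Wilson score interval for a proportion (z = 1.96 is roughly 95%)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    center = (p + z ** 2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return center - half, center + half


share_with_software = ARTICLES_WITH_SW / ARTICLES
mentions_per_article = MENTIONS / ARTICLES_WITH_SW
mentions_per_package = MENTIONS / PACKAGES
low, high = wilson_interval(ARTICLES_WITH_SW, ARTICLES)

if __name__ == "__main__":
    print(f"Share of articles mentioning software: {share_with_software:.1%}")
    print(f"Assumed Wilson interval: {low:.1%} to {high:.1%}")
    print(f"Mentions per mentioning article: {mentions_per_article:.2f}")
    print(f"Mentions per distinct package: {mentions_per_package:.2f}")
```

The same function could be applied per impact-factor stratum to check whether intervals overlap, but the per-stratum counts are not given in this transcript.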