SlideShare a Scribd company logo
1 of 29
Software Repositories for
Research
-- An environmental scan
Micah Altman
MIT Libraries
Prepared for
Digital Preservation 2016
Milwaukee
November 2016
Disclaimer
These opinions are my own, they are not the opinions of MIT, any of
the project funders, nor (with the exception of co-authored
previously published work) my collaborators
Secondary disclaimer:
“It’s tough to make predictions, especially about the future!”
-- Attributed to Woody Allen, Yogi Berra, Niels Bohr, Vint Cerf, Winston Churchill, Confucius, Disreali [sic], Freeman Dyson,
Cecil B. Demille, Albert Einstein, Enrico Fermi, Edgar R. Fiedler, Bob Fourer, Sam Goldwyn, Allan Lamport, Groucho Marx,
Dan Quayle, George Bernard Shaw, Casey Stengel, Will Rogers, M. Taub, Mark Twain, Kerr L. White, etc.
2
Related Publications
• Altman M, Jackman S. “Nineteen Ways of Looking at Statistical
Software”. Journal of Statistical Software. 2011;42.
• Altman, Micah, and Gary King. "A proposed standard for the
scholarly citation of quantitative data." D-lib 13, no. 3 (2007):
• Altman, M., Gill, J. and McDonald, M.P., 2004. Numerical issues in
statistical computing for the social scientist. John Wiley & Sons.
Reprints available from:
informatics.mit.edu 3
Today’s Perspectives
* Motivations *
* Methods *
* Measures *
* Musings *
* Merit *
4
Motivations
5
Why Software?
6
What is Software
Working definition:
“Part of a computer system that consists of encoded information or computer
instructions” (wikipedia) that is directly executable within a system.
Corollaries
Software generally is composed of instantiations of algorithms, heuristics,
and fixed information (internal data).
The behavior and output of software generally depends on the execution
context: execution environment (software, hardware, network, networked
resources), configuration parameters, and dynamic inputs.
7
Some Caution About Definitions
Software is often tightly coupled to data
Boundaries among software objects and systems are fuzzy & permeable
Usefulness of software is strongly dependent on the intent of the user,
knowledge and capabilities of user (documentation matters), and
execution context.
8
"... if they [philosophers] do ask and they want a definition, they do not want the most natural definition, e.g. of 'chair' they
do not want the definition 'something to sit on'. Why are they not satisfied with the normal definition of chair, or, to put the
question in another way, why do they wish to ask for the definition of a physical object?"
Source: "From the Minutes of the Moral Science Club, 23.2.1939" in Wittgenstein in Cambridge (2008)
Research Questions
Characterizing Research Software Repositories and Related Practices
How is software related to research formally disseminated?
Which “repositories” (points for mid/long term publishing/access of
software) are recognized at the discipline level?
What is the relative prevalence and affordances of “repositories” for
software as compared to other established disciplinary repositories?
What practices, requirements, or standards for software curation and
preservation are recognized at the disciplinary level?
9
Methods
10
Literature Review
Data Curation, Publication and Citation
Software significant properties, use cases
Software repositories
Software & scientific reproducibility
Software Engineering Methodology
11
Web Research - Practice
Review of research repositories
Sources: OpenDOAR, Re3Data, Sherpajuliet
Goals: Estimate prevalence of repositories that accept research software; identify exemplar
repositories, characterize feature sets by repository category
Methods: term-based queries; descriptive statistics; stratified content case studies
Review of Software Directories
Sources: OpenHub, OSDir, DMOZ
Goals: Identify additional software repositories used in research
Methods: qualitative text analysis; descriptive statistics
12
Web Research - Policies
Review of funder policies
Sources: Roarmap; US Federal Agency Websites
Goals: Estimate prevalence of funder policies on software curation; identify exemplar
policies; identify recommended repositories
Methods: qualitative text analysis; descriptive statistics
Review of Journal Policies
Source: Google Scholar, WoS, DOAJ, Software Sustainability Institute Index
Goals: Estimate prevalence of journals that publish research software; prevalence of
software policies at journals exemplar policies; identify recommended repositories
Methods: qualitative text analysis; descriptive statistics
13
Measures
14
Typical Prevalence of Software Repositories
15
We Got Nothing
RE3 SherpaJuliet
16
And the Nothing We got is Not that Great
17
Some Exemplars and Promising Initiatives
• Citation and publisher policies
FORCE 11 Software Citation Principles
www.force11.org/software-citation-
principles
ACM New Publication Policies on
Software Reproducibility and
Contributorship
www.acm.org/publications/policies
PLOS
http://journals.plos.org/plosone/s/m
aterials-and-software-sharing
18
• Long Term Access:
- www.softwareheritage.org
- www.softwarepreservationnetwork.org/
- guides.github.com/activities/citable-code/
- archive.org/details/softwarelibrary
• Software Journals:
- www.journals.elsevier.com/softwarex/
- www.jstatsoft.org/
-
http://openresearchsoftware.metajnl.com/
Musings
19
20
21
22
Use Cases and Motivating Value
23
Historic / cultural - historical scholarship
- “intrinsic value”
Replication and reproducibility - check claims made in research
- reduced deliberate research fraud
- check reliability (robustness) of results
- check validity (accuracy)
Reuse - efficiency
- increase speed of development
- standards compliance
- apply methodology to a different corpus
- increased quality and dependability
Render other digital objects - renders other objects meaningful
- see digital preservation use cases
Legal - record of licensing, ownership, copyright
- manage legal risks/accountability
- compliance with laws/funding mandates
- reduce barriers to long-term access for other historic use, replication, reuse, rendering
Citation and attribution - track individual academic career
- track software development/history
- track institutional outputs
- track funder outputs
Repository Affordances
24
Authoring/
Development
Discovery/
Access
Collection Preservation Legal
creator Language specific
authoring tools
Build environment
integration
Versioning
Documentation
Project management
Collaboration
Attribution Backups
Commitment to long-
term access
Access control
License templating
curator Project management
License template
Monitoring
Collaboration
Browsing
Searching
Persistent Identifiers
Version Ids
Collection Policy
Peer Review
Selection
Annotation
Metdata
Preservation policy
Documentation
Format management
Access control
License standardization
Legal guidance
institution Author, Funder
Identifiers
Metrics
Author, Funder
Identifiers
Metrics
Author, Funder
Identifiers
Metrics
Compliance
Attribution
Preservation Policy
Preservation replication
Auditability
Certification
License standardization
Privacy Management
end-user Browsing
Searching
Search engine
integration
ersistent Identifiers
Selection criteria
Annotation
Quality Measures
Documentation Open licensing
License discoverability
Merit
25
Preliminary findings: State of Software Curation
1. No comprehensive indices of software archives
2. Orders of magnitude fewer software archives than data archives.
( Corollary: Institutional repositories offer little functionality for software
archiving, even when nominally supported )
3. Very small proportion of funders have policies addressing software curation
4. There is little available advice for researchers who wish to curate, cite, &
preserve software
5. Substantial reproducibility reproducibility failures related to software continue
to be reported
26
“Nothing Exists” - Parmenides (ca. 500 BCE)
Contrast with Data Curation
-- Lack of Progress• Compliance
– Funder: data management plans, open data
– Publishers: data access/archiving/citation
• Norms & practices
– Joint data citation principles
– Recognition of data in funder biosketches
– Increased recognition of reproducibility gaps
– Increased recognition of open data/open science
• Technical infrastructure
– Data repository directories
– Data citation indices
– ORCID researcher identifier and registry
• Recognition
– Data citation indices
– Virtual branded archives
– High-profile data publications
27
Summing it all up…
Software curation looks a lot like data curation a decade ago…
28
“How much slower would scientific progress be if the near universal standards for scholarly citation of articles and books had
never been developed? Suppose shortly after publication only some printed works could be reliably found by other scholars; or
if researchers were only permitted to read an article if they first committed not to criticize it, or were required to coauthor with
the original author any work that built on the original. How many discoveries would never have been made if the titles of books
and articles in libraries changed unpredictably, with no link back to the old title; if printed works existed in different libraries
under different titles; if researchers routinely redistributed modified versions of other authors' works without changing the title
or author listed; or if publishing new editions of books meant that earlier editions were destroyed? …
“Unfortunately, no such universal standards exist for citing quantitative data software, and so all the problems listed above
exist now. Practices vary from field to field, archive to archive, and often from article to article.
The data software cited may no longer exist, may not be available publicly, or may have never been held by anyone but the
investigator. Data software listed as available from the author are unlikely to be available for long and will not be available after
the author retires or dies. … Data software are sometimes listed in the bibliography, sometimes in the text, sometimes not at
all, and rarely with enough information to guarantee future access to the identical data set. Replicating published tables and
figures even without having to rerun the original experiment, is often difficult or impossible”
-- Altman & King 2007
Questions?
Web:
Informatics.mit.edu
Email:
escience@mit.edu
29

More Related Content

What's hot

Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Janifer Gatenby
 
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)TimelessFuture
 
The Future of Finding: Resource Discovery @ The University of Oxford
The Future of Finding: Resource Discovery @ The University of OxfordThe Future of Finding: Resource Discovery @ The University of Oxford
The Future of Finding: Resource Discovery @ The University of OxfordChristine Madsen
 
Best Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingBest Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingOCLC
 
Internationalising South African Scholarly Journals
Internationalising South African Scholarly Journals Internationalising South African Scholarly Journals
Internationalising South African Scholarly Journals KidsintheCloud
 
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)TimelessFuture
 
The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014Richard Wallis
 
Linked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureLinked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureEmily Nimsakont
 
Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?OCLC
 
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceBIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceOCLC
 
Peer Council 2016 Keynote Address with John Chapman
Peer Council 2016 Keynote Address with John ChapmanPeer Council 2016 Keynote Address with John Chapman
Peer Council 2016 Keynote Address with John ChapmanAndrea Coffin
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemWiLS
 
ER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations PanelER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations PanelRobert H. McDonald
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationRachel Vacek
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
Role of libraries in accelerating research
Role of libraries in accelerating researchRole of libraries in accelerating research
Role of libraries in accelerating researchNikesh Narayanan
 
Current and emerging trends in library services
Current and emerging trends in library servicesCurrent and emerging trends in library services
Current and emerging trends in library servicesNikesh Narayanan
 

What's hot (20)

ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19
 
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
 
The Future of Finding: Resource Discovery @ The University of Oxford
The Future of Finding: Resource Discovery @ The University of OxfordThe Future of Finding: Resource Discovery @ The University of Oxford
The Future of Finding: Resource Discovery @ The University of Oxford
 
Best Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingBest Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web Archiving
 
Butler - Security Lessons Learned from an Ezproxy Admin
Butler - Security Lessons Learned from an Ezproxy AdminButler - Security Lessons Learned from an Ezproxy Admin
Butler - Security Lessons Learned from an Ezproxy Admin
 
Internationalising South African Scholarly Journals
Internationalising South African Scholarly Journals Internationalising South African Scholarly Journals
Internationalising South African Scholarly Journals
 
Connecting the Dots: Constellations in the Linked Data Universe
Connecting the Dots: Constellations in the Linked Data UniverseConnecting the Dots: Constellations in the Linked Data Universe
Connecting the Dots: Constellations in the Linked Data Universe
 
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
 
The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014
 
Linked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureLinked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the Future
 
Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?
 
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceBIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
 
Peer Council 2016 Keynote Address with John Chapman
Peer Council 2016 Keynote Address with John ChapmanPeer Council 2016 Keynote Address with John Chapman
Peer Council 2016 Keynote Address with John Chapman
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library System
 
ER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations PanelER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations Panel
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post Implementation
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
Role of libraries in accelerating research
Role of libraries in accelerating researchRole of libraries in accelerating research
Role of libraries in accelerating research
 
Current and emerging trends in library services
Current and emerging trends in library servicesCurrent and emerging trends in library services
Current and emerging trends in library services
 

Viewers also liked

Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Micah Altman
 
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALSBROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALSMicah Altman
 
Redistricting Algorithms
Redistricting AlgorithmsRedistricting Algorithms
Redistricting AlgorithmsMicah Altman
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenaryMicah Altman
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESMicah Altman
 
BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...
BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...
BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...Micah Altman
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...Micah Altman
 

Viewers also liked (7)

Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...
 
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALSBROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
 
Redistricting Algorithms
Redistricting AlgorithmsRedistricting Algorithms
Redistricting Algorithms
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenary
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
 
BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...
BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...
BROWN BAG TALK WITH CHAOQUN NI- TRANSFORMATIVE INTERACTIONS IN THE SCIENTIFIC...
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
 

Similar to Software Repositories for Research-- An Environmental Scan

Software Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental ScanSoftware Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental ScanMicah Altman
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCarole Goble
 
Software as a Well-Formed Research Object
Software as a Well-Formed Research ObjectSoftware as a Well-Formed Research Object
Software as a Well-Formed Research ObjectYasmin AlNoamany, PhD
 
Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523ORCID, Inc
 
Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in softwareDaniel S. Katz
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panelDaniel S. Katz
 
Introduction to Altmetrics for Medical and Special Librarians
Introduction to Altmetrics for Medical and Special LibrariansIntroduction to Altmetrics for Medical and Special Librarians
Introduction to Altmetrics for Medical and Special LibrariansLinda Galloway
 
UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...
UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...
UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...UKSG: connecting the knowledge community
 
Data Citation Rewards and Incentives
 Data Citation Rewards and Incentives Data Citation Rewards and Incentives
Data Citation Rewards and IncentivesMicah Altman
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterprisePhilip Bourne
 
IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...
IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...
IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...OCLC Research
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesMicah Altman
 
Panel: Our Scholarly Recognition System Doesn’t Still Work
Panel: Our Scholarly Recognition System Doesn’t Still WorkPanel: Our Scholarly Recognition System Doesn’t Still Work
Panel: Our Scholarly Recognition System Doesn’t Still WorkDaniel S. Katz
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataSusanna-Assunta Sansone
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and PracticeDaniel S. Katz
 

Similar to Software Repositories for Research-- An Environmental Scan (20)

Software Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental ScanSoftware Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental Scan
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
 
Software as a Well-Formed Research Object
Software as a Well-Formed Research ObjectSoftware as a Well-Formed Research Object
Software as a Well-Formed Research Object
 
Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523Borgman orcid dryadsymposiumoxford20130523
Borgman orcid dryadsymposiumoxford20130523
 
Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in software
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panel
 
Lern, june 2016, digital media slides
Lern, june 2016, digital media slidesLern, june 2016, digital media slides
Lern, june 2016, digital media slides
 
Introduction to Altmetrics for Medical and Special Librarians
Introduction to Altmetrics for Medical and Special LibrariansIntroduction to Altmetrics for Medical and Special Librarians
Introduction to Altmetrics for Medical and Special Librarians
 
UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...
UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...
UKSG Conference 2016 Breakout Session - Institutional insights: adopting new ...
 
Data Citation Rewards and Incentives
 Data Citation Rewards and Incentives Data Citation Rewards and Incentives
Data Citation Rewards and Incentives
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
BLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, SymplecticBLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, Symplectic
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Lern, jan 2015, digital media slides
Lern, jan 2015, digital media slidesLern, jan 2015, digital media slides
Lern, jan 2015, digital media slides
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital Enterprise
 
IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...
IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...
IR Strangelove or: How I Learned to Stop Worrying and Love the Institutional ...
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use Cases
 
Panel: Our Scholarly Recognition System Doesn’t Still Work
Panel: Our Scholarly Recognition System Doesn’t Still WorkPanel: Our Scholarly Recognition System Doesn’t Still Work
Panel: Our Scholarly Recognition System Doesn’t Still Work
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and Practice
 

More from Micah Altman

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesMicah Altman
 
Well-Being - A Sunset Conversation
Well-Being - A Sunset ConversationWell-Being - A Sunset Conversation
Well-Being - A Sunset ConversationMicah Altman
 
Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Micah Altman
 
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Micah Altman
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset ConversationMicah Altman
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer ReviewMicah Altman
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer ReviewMicah Altman
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An OverviewMicah Altman
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral DistrictingMicah Altman
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk Micah Altman
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Micah Altman
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Micah Altman
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsMicah Altman
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...Micah Altman
 
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...Micah Altman
 
Agenda's for Preservation Research
Agenda's for Preservation ResearchAgenda's for Preservation Research
Agenda's for Preservation ResearchMicah Altman
 
July IAP: Confidential Information - Storage, Sharing, & Publication - with M...
July IAP: Confidential Information - Storage, Sharing, & Publication - with M...July IAP: Confidential Information - Storage, Sharing, & Publication - with M...
July IAP: Confidential Information - Storage, Sharing, & Publication - with M...Micah Altman
 
Can computers be feminist? Program on Information Science Talk by Gillian Smith
Can computers be feminist? Program on Information Science Talk by Gillian SmithCan computers be feminist? Program on Information Science Talk by Gillian Smith
Can computers be feminist? Program on Information Science Talk by Gillian SmithMicah Altman
 
BROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKEL
BROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKELBROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKEL
BROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKELMicah Altman
 
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...Micah Altman
 

More from Micah Altman (20)

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategies
 
Well-Being - A Sunset Conversation
Well-Being - A Sunset ConversationWell-Being - A Sunset Conversation
Well-Being - A Sunset Conversation
 
Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...
 
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset Conversation
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer Review
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer Review
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An Overview
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral Districting
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
 
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
 
Agenda's for Preservation Research
Agenda's for Preservation ResearchAgenda's for Preservation Research
Agenda's for Preservation Research
 
July IAP: Confidential Information - Storage, Sharing, & Publication - with M...
July IAP: Confidential Information - Storage, Sharing, & Publication - with M...July IAP: Confidential Information - Storage, Sharing, & Publication - with M...
July IAP: Confidential Information - Storage, Sharing, & Publication - with M...
 
Can computers be feminist? Program on Information Science Talk by Gillian Smith
Can computers be feminist? Program on Information Science Talk by Gillian SmithCan computers be feminist? Program on Information Science Talk by Gillian Smith
Can computers be feminist? Program on Information Science Talk by Gillian Smith
 
BROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKEL
BROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKELBROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKEL
BROWN BAG: THE VISUAL COMPONENT: MORE THAN PRETTY PICTURES - WITH FELICE FRANKEL
 
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
 

Recently uploaded

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 

Recently uploaded (20)

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 

Software Repositories for Research-- An Environmental Scan

  • 1. Software Repositories for Research -- An environmental scan Micah Altman MIT Libraries Prepared for Digital Preservation 2016 Milwaukee November 2016
  • 2. Disclaimer These opinions are my own, they are not the opinions of MIT, any of the project funders, nor (with the exception of co-authored previously published work) my collaborators Secondary disclaimer: “It’s tough to make predictions, especially about the future!” -- Attributed to Woody Allen, Yogi Berra, Niels Bohr, Vint Cerf, Winston Churchill, Confucius, Disreali [sic], Freeman Dyson, Cecil B. Demille, Albert Einstein, Enrico Fermi, Edgar R. Fiedler, Bob Fourer, Sam Goldwyn, Allan Lamport, Groucho Marx, Dan Quayle, George Bernard Shaw, Casey Stengel, Will Rogers, M. Taub, Mark Twain, Kerr L. White, etc. 2
  • 3. Related Publications • Altman M, Jackman S. “Nineteen Ways of Looking at Statistical Software”. Journal of Statistical Software. 2011;42. • Altman, Micah, and Gary King. "A proposed standard for the scholarly citation of quantitative data." D-lib 13, no. 3 (2007): • Altman, M., Gill, J. and McDonald, M.P., 2004. Numerical issues in statistical computing for the social scientist. John Wiley & Sons. Reprints available from: informatics.mit.edu 3
  • 4. Today’s Perspectives * Motivations * * Methods * * Measures * * Musings * * Merit * 4
  • 7. What is Software Working definition: “Part of a computer system that consists of encoded information or computer instructions” (wikipedia) that is directly executable within a system. Corollaries Software generally is composed of instantiations of algorithms, heuristics, and fixed information (internal data). The behavior and output of software generally depends on the execution context: execution environment (software, hardware, network, networked resources), configuration parameters, and dynamic inputs. 7
  • 8. Some Caution About Definitions Software is often tightly coupled to data Boundaries among software objects and systems are fuzzy & permeable Usefulness of software is strongly dependent on the intent of the user, knowledge and capabilities of user (documentation matters), and execution context. 8 "... if they [philosophers] do ask and they want a definition, they do not want the most natural definition, e.g. of 'chair' they do not want the definition 'something to sit on'. Why are they not satisfied with the normal definition of chair, or, to put the question in another way, why do they wish to ask for the definition of a physical object?" Source: "From the Minutes of the Moral Science Club, 23.2.1939" in Wittgenstein in Cambridge (2008)
  • 9. Research Questions Characterizing Research Software Repositories and Related Practices How is software related to research formally disseminated? Which “repositories” (points for mid/long term publishing/access of software) are recognized at the discipline level? What is the relative prevalence and affordances of “repositories” for software as compared to other established disciplinary repositories? What practices, requirements, or standards for software curation and preservation are recognized at the disciplinary level? 9
  • 11. Literature Review Data Curation, Publication and Citation Software significant properties, use cases Software repositories Software & scientific reproducibility Software Engineering Methodology 11
  • 12. Web Research - Practice Review of research repositories Sources: OpenDOAR, Re3Data, Sherpajuliet Goals: Estimate prevalence of repositories that accept research software; identify exemplar repositories, characterize feature sets by repository category Methods: term-based queries; descriptive statistics; stratified content case studies Review of Software Directories Sources: OpenHub, OSDir, DMOZ Goals: Identify additional software repositories used in research Methods: qualitative text analysis; descriptive statistics 12
  • 13. Web Research - Policies Review of funder policies Sources: Roarmap; US Federal Agency Websites Goals: Estimate prevalence of funder policies on software curation; identify exemplar policies; identify recommended repositories Methods: qualitative text analysis; descriptive statistics Review of Journal Policies Source: Google Scholar, WoS, DOAJ, Software Sustainability Institute Index Goals: Estimate prevalence of journals that publish research software; prevalence of software policies at journals exemplar policies; identify recommended repositories Methods: qualitative text analysis; descriptive statistics 13
  • 15. Typical Prevalence of Software Repositories 15
  • 16. We Got Nothing RE3 SherpaJuliet 16
  • 17. And the Nothing We got is Not that Great 17
  • 18. Some Exemplars and Promising Initiatives • Citation and publisher policies FORCE 11 Software Citation Principles www.force11.org/software-citation- principles ACM New Publication Policies on Software Reproducibility and Contributorship www.acm.org/publications/policies PLOS http://journals.plos.org/plosone/s/m aterials-and-software-sharing 18 • Long Term Access: - www.softwareheritage.org - www.softwarepreservationnetwork.org/ - guides.github.com/activities/citable-code/ - archive.org/details/softwarelibrary • Software Journals: - www.journals.elsevier.com/softwarex/ - www.jstatsoft.org/ - http://openresearchsoftware.metajnl.com/
  • 20. 20
  • 21. 21
  • 22. 22
  • 23. Use Cases and Motivating Value 23 Historic / cultural - historical scholarship - “intrinsic value” Replication and reproducibility - check claims made in research - reduced deliberate research fraud - check reliability (robustness) of results - check validity (accuracy) Reuse - efficiency - increase speed of development - standards compliance - apply methodology to a different corpus - increased quality and dependability Render other digital objects - renders other objects meaningful - see digital preservation use cases Legal - record of licensing, ownership, copyright - manage legal risks/accountability - compliance with laws/funding mandates - reduce barriers to long-term access for other historic use, replication, reuse, rendering Citation and attribution - track individual academic career - track software development/history - track institutional outputs - track funder outputs
  • 24. Repository Affordances 24 Authoring/ Development Discovery/ Access Collection Preservation Legal creator Language specific authoring tools Build environment integration Versioning Documentation Project management Collaboration Attribution Backups Commitment to long- term access Access control License templating curator Project management License template Monitoring Collaboration Browsing Searching Persistent Identifiers Version Ids Collection Policy Peer Review Selection Annotation Metdata Preservation policy Documentation Format management Access control License standardization Legal guidance institution Author, Funder Identifiers Metrics Author, Funder Identifiers Metrics Author, Funder Identifiers Metrics Compliance Attribution Preservation Policy Preservation replication Auditability Certification License standardization Privacy Management end-user Browsing Searching Search engine integration ersistent Identifiers Selection criteria Annotation Quality Measures Documentation Open licensing License discoverability
  • 26. Preliminary findings: State of Software Curation 1. No comprehensive indices of software archives 2. Orders of magnitude fewer software archives than data archives. ( Corollary: Institutional repositories offer little functionality for software archiving, even when nominally supported ) 3. Very small proportion of funders have policies addressing software curation 4. There is little available advice for researchers who wish to curate, cite, & preserve software 5. Substantial reproducibility reproducibility failures related to software continue to be reported 26 “Nothing Exists” - Parmenides (ca. 500 BCE)
  • 27. Contrast with Data Curation -- Lack of Progress• Compliance – Funder: data management plans, open data – Publishers: data access/archiving/citation • Norms & practices – Joint data citation principles – Recognition of data in funder biosketches – Increased recognition of reproducibility gaps – Increased recognition of open data/open science • Technical infrastructure – Data repository directories – Data citation indices – ORCID researcher identifier and registry • Recognition – Data citation indices – Virtual branded archives – High-profile data publications 27
  • 28. Summing it all up… Software curation looks a lot like data curation a decade ago… 28 “How much slower would scientific progress be if the near universal standards for scholarly citation of articles and books had never been developed? Suppose shortly after publication only some printed works could be reliably found by other scholars; or if researchers were only permitted to read an article if they first committed not to criticize it, or were required to coauthor with the original author any work that built on the original. How many discoveries would never have been made if the titles of books and articles in libraries changed unpredictably, with no link back to the old title; if printed works existed in different libraries under different titles; if researchers routinely redistributed modified versions of other authors' works without changing the title or author listed; or if publishing new editions of books meant that earlier editions were destroyed? … “Unfortunately, no such universal standards exist for citing quantitative data software, and so all the problems listed above exist now. Practices vary from field to field, archive to archive, and often from article to article. The data software cited may no longer exist, may not be available publicly, or may have never been held by anyone but the investigator. Data software listed as available from the author are unlikely to be available for long and will not be available after the author retires or dies. … Data software are sometimes listed in the bibliography, sometimes in the text, sometimes not at all, and rarely with enough information to guarantee future access to the identical data set. Replicating published tables and figures even without having to rerun the original experiment, is often difficult or impossible” -- Altman & King 2007