SlideShare a Scribd company logo
Prepared for
Second Open Economics International Workshop
June 2013
“Not-bad” Practices for Sharing
Economics Data
Dr. Micah Altman
<escience@mit.edu>
Director of Research, MIT Libraries
Non-Resident Senior Fellow, The Brookings Institution
DISCLAIMER
These opinions are my own, they are not the opinions
of MIT, Brookings, any of the project funders, nor (with
the exception of co-authored previously published
work) my collaborators
Secondary disclaimer:
“It’s tough to make predictions, especially about the
future!”
-- Attributed to Woody Allen, Yogi Berra, Niels Bohr, Vint Cerf, Winston Churchill,
Confucius, Disreali [sic], Freeman Dyson, Cecil B. Demille, Albert Einstein, Enrico Fermi,
Edgar R. Fiedler, Bob Fourer, Sam Goldwyn, Allan Lamport, Groucho Marx, Dan Quayle,
George Bernard Shaw, Casey Stengel, Will Rogers, M. Taub, Mark Twain, Kerr L. White,
etc.
“Not-bad” Practices for Sharing Economics Data 2
Collaborators & Co-Conspirators
• Jonathan Crabtree, Merce Crosas, Gary
King, Michael McDonald, Nancy
McGovern, Salil Vadhan & many others
• Research Support
Thanks to the Library of Congress, the National
Science Foundation, IMLS, the Sloan
Foundation, the Joyce Foundation, the
Massachusetts Institute of Technology, &
Harvard University.
“Not-bad” Practices for Sharing Economics Data 3
Related Work
• Altman (2013) Data Citation in The Dataverse Network ®,. In Developing Data
Attribution and Citation Practices and Standards: Report from an International
Workshop.
• National Digital Stewardship Alliance, 2013 (Forthcoming), 2014 National
Agenda for Digital Stewardship.
• M. Altman, Adams, M., Crabtree, J., Donakowski, D., Maynard, M., Pienta, A., & Young,
C. 2009. "Digital preservation through archival collaboration: The Data Preservation
Alliance for the Social Sciences." The American Archivist. 72(1): 169-182
• M. Altman, 2008, "A Fingerprint Method for Verification of Scientific Data" in,
Advances in Systems, Computing Sciences and Software Engineering, (Proceedings of
the International Conference on Systems, Computing Sciences and Software
Engineering 2007) , Springer-Verlag.
• M. Altman and G. King. 2007. “A Proposed Standard for the Scholarly Citation of
Quantitative Data”, D-Lib, 13, 3/4 (March/April).
Most reprints available from:
informatics.mit.edu
“Not-bad” Practices for Sharing Economics Data 4
“Not-bad” Practices for Sharing Economics Data
„Not Bad‟
Practices
5
Some Trends
Shifting Evidence Base
High Performance Collaboration
(here comes everybody…)
More Data
Publish, then Filter
More Learners
6
More Open
The Lifecycle and Institutional Ecology of Data
Why not ‘best’ practices?
• Few models for systematic valuation of data
– how much will data X be worth to community Y at time Z?
See: National Digital Stewardship Alliance, 2013 (Forthcoming), 2014
National Agenda for Digital Stewardship. Library of Congress
• Optimality of practices are generally strongly dependent on operational
context
• Context of data sharing very dynamic
– change in publication models
– change in evidence base
– change in data management methodologies
– change in policies
• Paucity of evidence to establish data practices as best:
– Descriptive: adoption, compliance
– Predictive: association of best practices &desired outcomes
– Causal: intervention with best practices linked to improvement
“Not-bad” Practices for Sharing Economics Data 7
Best practices neither best nor practiced.
Why ‘not bad’ practices?
• Avoid clearly bad practices
• Document operational and tacit knowledge
• Elicit assumptions
• Provide basis for auditing, evaluation, and
improvement
“Not-bad” Practices for Sharing Economics Data 8
“Not-bad” Practices for Sharing Economics Data
Probably Not Bad
Practices
9
Types of Practices
• Analytic practices
– Lifecycle analysis
– Requirements analysis
• Policy practices
– Data dissemination policies
– Data citation policies
– Reproducibility policies
• Technical practices
– Sharing technologies
– Reproducibility technologies
“Not-bad” Practices for Sharing Economics Data 10
Core Dimensions of Shared Information Infrastructure
“Not-bad” Practices for Sharing Economics Data
• Stakeholder incentives
– recognition; citation; payment; compliance; services
• Dissemination
– access to metadata; documentation; data
• Access control
– authentication; authorization; rights management
• Provenance
– chain of control; verification of metadata, bits, semantic content
• Persistence
– bits; semantic content; use
• Legal protection
– rights management; consent; record keeping;
• Usability for…
– discovery; deposit; curation; administration; annotation; collaboration
• Economic model
– valuation models; cost models; business models
• Trust model
– verification; transparency; enforcement
See: King 2007; ICSU 2004; NSB 2005; Schneier 2011
11
Creation/C
ollection
Storage/
Ingest
Processing
Internal
Sharing
Analysis
External
dissemination/
publication
Re-use
Long-
term
access
Stakeholders
Scholarly
Publishers
Researchers
Data
Archives/
Publisher
Research
Sponsors
Data
Sources/Su
bjects
Consumers
Service/Infras
tructure
Providers
Research
Organizations
“Not-bad” Practices for Sharing Economics Data12
Modeling
Legal Constraints
Contract Intellectual Property
Access
Rights Confidentiality
Copyright
Fair Use
DMCA
Database Rights
Moral Rights
Intellectual
Attribution
Trade Secret
Patent
Trademark
Common Rule
45 CFR 26
HIPAA
FERPA
EU Privacy Directive
Privacy
Torts
(Invasion,
Defamation)
Rights of
Publicity
Sensitive but
Unclassified
Potentially
Harmful
(Archeological
Sites,
Endangered
Species,
Animal Testing,
…)
Classified
FOIA
CIPSEA
State
Privacy Laws
EAR
State FOI
Laws
Journal
Replication
Requirements
Funder Open
Access
Contract
License
Click-Wrap
TOU
ITAR
Export
Restrictions
Data Dissemination Policies - How
• License: Creative Commons
Version 4.0 of the Creative Commons licenses
– Legally well crafted
– Avoids attribution stacking – attribution through links
– Handles sui-generis database rights, licensee rights to publicity, etc.
– Machine actionable
See: wiki.creativecommons.org/4.0
• Confidentiality
Deidentification & public use files insufficient.
– Need multiple modes of access, including protected access to confidential data.
See: National Research Council. 2005. Expanding access to research data: Reconciling risks and
opportunities. Washington, DC: The National Academies Press.
Vadhan, S. , et al. 2010. “Re: Advance Notice of Proposed Rulemaking: Human Subjects Research
Protections”. Available from: http://dataprivacylab.org/projects/irb/Vadhan.pdf
“Not-bad” Practices for Sharing Economics Data 14
Data Dissemination Policy - When
• Timeliness [NRC Recommendations]
– Sharing data should be a regular practice.
– Investigators should share their data by the time of
publication of initial major results of analyses of the
data except in compelling circumstances.
– Data relevant to public policy should be shared as
quickly and widely as possible.
– Plans for data sharing should be an integral part of a
research plan whenever data sharing is feasible.
Fienberg, et al. (eds). 1985. Sharing Research data.
Washington, DC: The National Academies Press.
“Not-bad” Practices for Sharing Economics Data 15
Data Dissemination Policy - Where
• With journals. Follow NISO supplementary
materials:
http://www.niso.org/workrooms/supplementalre
commendations
• With sustainable well known collaboratively-
stewarded repositories
– Example: data-pass.org
Also see:
M.
Altman, Adams, M., Crabtree, J., Donakowski, D., Maynard, M., Pienta, A., &
Young, C. 2009. "Digital preservation through archival collaboration: The Data
Preservation Alliance for the Social Sciences." The American Archivist. 72(1):
169-182
“Not-bad” Practices for Sharing Economics Data 16
Data Citation Policies
• Data Citation First Principles
(Harvard Workshop, NRC Report, Co-Data Forthcoming)
– Data citations should be treated as first-class objects of publication
– At minimum, all data necessary to understand assess extend conclusions in scholarly
work should be cited.
See:
Altman, Micah. “Data Citation in The Dataverse Network.” Developing Data Attribution and Citation Practices and
Standards Report from an International Workshop. Ed. Paul F Uhlir. National Academies Press, 2012
M. Altman and G. King. 2007. “A Proposed Standard for the Scholarly Citation of Quantitative Data”, D-Lib, 13, 3/4
(March/April).
• Data-PASS recommendations
– Minimal elements: author, date, title, persistent id
– Location: must appear with other elements
– Recommended: fixity information, such as
Universal Numeric Fingerprint
See: data-pass.org/citations.html
“Not-bad” Practices for Sharing Economics Data 17
Reproducibility Policies
• Science
– “Unpublished data and personal communications. Citations to unpublished
data and personal communications cannot be used to support claims in a
published paper. Papers will be held for publication until all "in press" citations
are published.”
– “Data and materials availability All data necessary to understand, assess, and
extend the conclusions of the manuscript must be available to any reader of
Science. All computer codes involved in the creation or analysis of data must
also be available to any reader of Science. “
• Support for publishing replication
– Registered replication reports:
http://www.psychologicalscience.org/index.php/replication
– ICMJE Clinical Trials Registration:
http://www.icmje.org/publishing_10register.html
– Journals of Negative/Null Results
“Not-bad” Practices for Sharing Economics Data 18
Policies are not Self-Enforcing /Sustaining
• Technical and financial sustainability must be
planned, to ensure long term access
See: National Science Board, Long-Lived Digital Data
Collections: Enabling Research and
Education in the 21st Century. NSF.
http://www.nsf.gov/pubs/2005/nsb0540/nsb0540.pdf
• Long-term access requires initial investment in
data preparation
– Capture tacit knowledge, create metadata
– Transfer to stable formats
“Not-bad” Practices for Sharing Economics Data 19
Compliance with Data Sharing Policies is often
Low
The Lifecycle and Institutional Ecology of Data
 Compliance is low even in best
examples of journals
 Checking compliance is labor-
intensive without citation and
repository standards
[See Glandon 2011; Mucullough, et.
al 2008]
20
Technical Infrastructure Examples
• CKAN
– Open Source
– Established
– Built on drupal platform
– http://ckan.org/
• Dataverse Network
– http://thedata.org
– Open Source
– Flexible archival models
– Semantic Fixity (UNF)
[Altman 2008]
• MyExperiment
– http://www.myexperiment.org/
– Long lasting
– Archives complete workflows to produce results
“Not-bad” Practices for Sharing Economics Data 21
Technical Criteria
• Long term access
– Replication,
independence
• Verifiability and fixity
• Provenance
• Workflows/code
Final Observations
• Best practices aren’t…
– document context of practice & measure desired outcomes
• Not-bad practice starts with analysis…
– lifecycle; requirements; sustainability ; predicted costs and
benefits
• Effective data sharing requires policies:
– dissemination, citation, replication, auditing
• Effective data sharing requires infrastructure:
– For verifiability, provenance, workflows/code, & long term
access
• Policies are not self-enforcing
– combine incentives, transparency, auditing, & evaluation
“Not-bad” Practices for Sharing Economics Data 22
Additional Bibliography (Selected)
• McCullough, B.D., Kerry Anne McGeary, and Teresa D. Harrison. "Do Economics Journal Archives Promote Replicable
Research?" Canadian Journal of Economics 41, no. 4 (2008).
• Schneier, Bruce, 2012, Liars and Outliers. Wiley.
• Borgman, Christine. “The Conundrum of Research Sharing.” Journal of the American Society for Information Science and
Technology (2011):1-40.
• Glandon P. , 2011. Report on the American Economic Review Data Availability Compliance Project.
http://www.aeaweb.org/aer/2011_Data_Compliance_Report.pdf
• King, Gary. 2007. An Introduction to the Dataverse Network as an Infrastructure for Data Sharing. Sociological
Methods and Research 36: 173–199NSB
• International Council For Science (ICSU) 2004. ICSU Report of the CSPR Assessment Panel on Scientific Data and Information.
Report.
“Not-bad” Practices for Sharing
Economics Data
23
Questions?
E-mail: escience@mit.edu
Web: micahaltman.com
Twitter: @drmaltman
“Not-bad” Practices for Sharing Economics
Data
24

More Related Content

What's hot

Assessing Digital Output in New Ways
Assessing Digital Output in New WaysAssessing Digital Output in New Ways
Assessing Digital Output in New Ways
National Information Standards Organization (NISO)
 
Writing Analytics for Epistemic Features of Student Writing #icls2016 talk
Writing Analytics for Epistemic Features of Student Writing #icls2016 talkWriting Analytics for Epistemic Features of Student Writing #icls2016 talk
Writing Analytics for Epistemic Features of Student Writing #icls2016 talk
Simon Knight
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
ASIS&T
 
"Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective""Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective"
Micah Altman
 
Taylor Ghost of Altmetrics Yet to Come
Taylor Ghost of Altmetrics Yet to ComeTaylor Ghost of Altmetrics Yet to Come
Taylor Ghost of Altmetrics Yet to Come
National Information Standards Organization (NISO)
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual Archives
Micah Altman
 
From Big Data to the Big Picture
From Big Data to the Big PictureFrom Big Data to the Big Picture
From Big Data to the Big Picture
SAGE Publishing
 
Konkiel Exploring Values-Based Altmetrics
Konkiel Exploring Values-Based AltmetricsKonkiel Exploring Values-Based Altmetrics
Konkiel Exploring Values-Based Altmetrics
National Information Standards Organization (NISO)
 
Gunn Designing Metrics that Serve Academica
Gunn Designing Metrics that Serve AcademicaGunn Designing Metrics that Serve Academica
Gunn Designing Metrics that Serve Academica
National Information Standards Organization (NISO)
 
Mejias "Making it work globally"
Mejias "Making it work globally"Mejias "Making it work globally"
Mejias "Making it work globally"
National Information Standards Organization (NISO)
 
Introduction to Digital Life
Introduction to Digital LifeIntroduction to Digital Life
Introduction to Digital Life
KR_Barker
 
Trust Management: A Tutorial
Trust Management: A TutorialTrust Management: A Tutorial
Trust Management: A Tutorial
Artificial Intelligence Institute at UofSC
 
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
Micah Altman
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&D
University of Washington
 
Context Aware Harassment Detection in Social Media [Overview]
Context Aware Harassment Detection in Social Media [Overview]Context Aware Harassment Detection in Social Media [Overview]
Context Aware Harassment Detection in Social Media [Overview]
Artificial Intelligence Institute at UofSC
 
Wilbanks Can We Simultaneously Support Both Privacy & Research?
Wilbanks Can We Simultaneously Support Both Privacy & Research?Wilbanks Can We Simultaneously Support Both Privacy & Research?
Wilbanks Can We Simultaneously Support Both Privacy & Research?
National Information Standards Organization (NISO)
 
Reputation Management for Early Career Researchers
Reputation Management for Early Career ResearchersReputation Management for Early Career Researchers
Reputation Management for Early Career Researchers
Micah Altman
 
Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...
Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...
Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...
ICPSR
 
Green "Building and Launching The Commons: Because the Scholarly Record has a...
Green "Building and Launching The Commons: Because the Scholarly Record has a...Green "Building and Launching The Commons: Because the Scholarly Record has a...
Green "Building and Launching The Commons: Because the Scholarly Record has a...
National Information Standards Organization (NISO)
 
The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
Philip Bourne
 

What's hot (20)

Assessing Digital Output in New Ways
Assessing Digital Output in New WaysAssessing Digital Output in New Ways
Assessing Digital Output in New Ways
 
Writing Analytics for Epistemic Features of Student Writing #icls2016 talk
Writing Analytics for Epistemic Features of Student Writing #icls2016 talkWriting Analytics for Epistemic Features of Student Writing #icls2016 talk
Writing Analytics for Epistemic Features of Student Writing #icls2016 talk
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
 
"Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective""Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective"
 
Taylor Ghost of Altmetrics Yet to Come
Taylor Ghost of Altmetrics Yet to ComeTaylor Ghost of Altmetrics Yet to Come
Taylor Ghost of Altmetrics Yet to Come
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual Archives
 
From Big Data to the Big Picture
From Big Data to the Big PictureFrom Big Data to the Big Picture
From Big Data to the Big Picture
 
Konkiel Exploring Values-Based Altmetrics
Konkiel Exploring Values-Based AltmetricsKonkiel Exploring Values-Based Altmetrics
Konkiel Exploring Values-Based Altmetrics
 
Gunn Designing Metrics that Serve Academica
Gunn Designing Metrics that Serve AcademicaGunn Designing Metrics that Serve Academica
Gunn Designing Metrics that Serve Academica
 
Mejias "Making it work globally"
Mejias "Making it work globally"Mejias "Making it work globally"
Mejias "Making it work globally"
 
Introduction to Digital Life
Introduction to Digital LifeIntroduction to Digital Life
Introduction to Digital Life
 
Trust Management: A Tutorial
Trust Management: A TutorialTrust Management: A Tutorial
Trust Management: A Tutorial
 
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&D
 
Context Aware Harassment Detection in Social Media [Overview]
Context Aware Harassment Detection in Social Media [Overview]Context Aware Harassment Detection in Social Media [Overview]
Context Aware Harassment Detection in Social Media [Overview]
 
Wilbanks Can We Simultaneously Support Both Privacy & Research?
Wilbanks Can We Simultaneously Support Both Privacy & Research?Wilbanks Can We Simultaneously Support Both Privacy & Research?
Wilbanks Can We Simultaneously Support Both Privacy & Research?
 
Reputation Management for Early Career Researchers
Reputation Management for Early Career ResearchersReputation Management for Early Career Researchers
Reputation Management for Early Career Researchers
 
Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...
Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...
Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Edu...
 
Green "Building and Launching The Commons: Because the Scholarly Record has a...
Green "Building and Launching The Commons: Because the Scholarly Record has a...Green "Building and Launching The Commons: Because the Scholarly Record has a...
Green "Building and Launching The Commons: Because the Scholarly Record has a...
 
The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
 

Similar to Best Practices for Sharing Economics Data

Emerging Data Citation Infrastructure
Emerging Data Citation InfrastructureEmerging Data Citation Infrastructure
Emerging Data Citation Infrastructure
Micah Altman
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use Cases
Micah Altman
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
Research Data Alliance
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
Mark Parsons
 
State of the Art Informatics for Research Reproducibility, Reliability, and...
 State of the Art  Informatics for Research Reproducibility, Reliability, and... State of the Art  Informatics for Research Reproducibility, Reliability, and...
State of the Art Informatics for Research Reproducibility, Reliability, and...
Micah Altman
 
Biobanks as Knowledge Institutions
Biobanks as Knowledge InstitutionsBiobanks as Knowledge Institutions
Biobanks as Knowledge Institutions
professormadison
 
INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...
INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...
INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...
Micah Altman
 
Data Sharing & Data Citation
Data Sharing & Data CitationData Sharing & Data Citation
Data Sharing & Data Citation
Micah Altman
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
Micah Altman
 
Scientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics PerspectiveScientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics Perspective
Micah Altman
 
Ethical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data RevolutionEthical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data Revolution
Melissa Moody
 
Introduction to open-data
Introduction to open-dataIntroduction to open-data
Introduction to open-data
OpenAccessBelgium
 
Managing data responsibly to enable research interity
Managing data responsibly to enable research interityManaging data responsibly to enable research interity
Managing data responsibly to enable research interity
IUPUI
 
AAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveysAAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveys
Cliff Lampe
 
Ps rwebinar january2019final
Ps rwebinar january2019finalPs rwebinar january2019final
Ps rwebinar january2019final
Margaret Henderson
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
LizLyon
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
LizLyon
 
Data Science and Urban Science @ UW
Data Science and Urban Science @ UWData Science and Urban Science @ UW
Data Science and Urban Science @ UW
University of Washington
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 

Similar to Best Practices for Sharing Economics Data (20)

Emerging Data Citation Infrastructure
Emerging Data Citation InfrastructureEmerging Data Citation Infrastructure
Emerging Data Citation Infrastructure
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use Cases
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
State of the Art Informatics for Research Reproducibility, Reliability, and...
 State of the Art  Informatics for Research Reproducibility, Reliability, and... State of the Art  Informatics for Research Reproducibility, Reliability, and...
State of the Art Informatics for Research Reproducibility, Reliability, and...
 
Biobanks as Knowledge Institutions
Biobanks as Knowledge InstitutionsBiobanks as Knowledge Institutions
Biobanks as Knowledge Institutions
 
INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...
INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...
INFORMATION WANTS SOMEONE ELSE TO PAY FOR IT : AS SCIENCE AND SCHOLARSHIP EVO...
 
Data Sharing & Data Citation
Data Sharing & Data CitationData Sharing & Data Citation
Data Sharing & Data Citation
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
 
Scientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics PerspectiveScientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics Perspective
 
Ethical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data RevolutionEthical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data Revolution
 
Introduction to open-data
Introduction to open-dataIntroduction to open-data
Introduction to open-data
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
Managing data responsibly to enable research interity
Managing data responsibly to enable research interityManaging data responsibly to enable research interity
Managing data responsibly to enable research interity
 
AAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveysAAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveys
 
Ps rwebinar january2019final
Ps rwebinar january2019finalPs rwebinar january2019final
Ps rwebinar january2019final
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Data Science and Urban Science @ UW
Data Science and Urban Science @ UWData Science and Urban Science @ UW
Data Science and Urban Science @ UW
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 

More from Micah Altman

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategies
Micah Altman
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset Conversation
Micah Altman
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer Review
Micah Altman
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer Review
Micah Altman
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An Overview
Micah Altman
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral Districting
Micah Altman
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk
Micah Altman
 
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
Micah Altman
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Micah Altman
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:
Micah Altman
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Micah Altman
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
Micah Altman
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenary
Micah Altman
 
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Micah Altman
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental Scan
Micah Altman
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
Micah Altman
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information Science
Micah Altman
 
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Micah Altman
 
Agenda's for Preservation Research
Agenda's for Preservation ResearchAgenda's for Preservation Research
Agenda's for Preservation Research
Micah Altman
 
Software Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental ScanSoftware Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental Scan
Micah Altman
 

More from Micah Altman (20)

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategies
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset Conversation
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer Review
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer Review
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An Overview
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral Districting
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk
 
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenary
 
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental Scan
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information Science
 
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
 
Agenda's for Preservation Research
Agenda's for Preservation ResearchAgenda's for Preservation Research
Agenda's for Preservation Research
 
Software Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental ScanSoftware Repositories for Research -- An Environmental Scan
Software Repositories for Research -- An Environmental Scan
 

Recently uploaded

Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
Abida Shariff
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 

Recently uploaded (20)

Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 

Best Practices for Sharing Economics Data

  • 1. Prepared for Second Open Economics International Workshop June 2013 “Not-bad” Practices for Sharing Economics Data Dr. Micah Altman <escience@mit.edu> Director of Research, MIT Libraries Non-Resident Senior Fellow, The Brookings Institution
  • 2. DISCLAIMER These opinions are my own, they are not the opinions of MIT, Brookings, any of the project funders, nor (with the exception of co-authored previously published work) my collaborators Secondary disclaimer: “It’s tough to make predictions, especially about the future!” -- Attributed to Woody Allen, Yogi Berra, Niels Bohr, Vint Cerf, Winston Churchill, Confucius, Disreali [sic], Freeman Dyson, Cecil B. Demille, Albert Einstein, Enrico Fermi, Edgar R. Fiedler, Bob Fourer, Sam Goldwyn, Allan Lamport, Groucho Marx, Dan Quayle, George Bernard Shaw, Casey Stengel, Will Rogers, M. Taub, Mark Twain, Kerr L. White, etc. “Not-bad” Practices for Sharing Economics Data 2
  • 3. Collaborators & Co-Conspirators • Jonathan Crabtree, Merce Crosas, Gary King, Michael McDonald, Nancy McGovern, Salil Vadhan & many others • Research Support Thanks to the Library of Congress, the National Science Foundation, IMLS, the Sloan Foundation, the Joyce Foundation, the Massachusetts Institute of Technology, & Harvard University. “Not-bad” Practices for Sharing Economics Data 3
  • 4. Related Work • Altman (2013) Data Citation in The Dataverse Network ®,. In Developing Data Attribution and Citation Practices and Standards: Report from an International Workshop. • National Digital Stewardship Alliance, 2013 (Forthcoming), 2014 National Agenda for Digital Stewardship. • M. Altman, Adams, M., Crabtree, J., Donakowski, D., Maynard, M., Pienta, A., & Young, C. 2009. "Digital preservation through archival collaboration: The Data Preservation Alliance for the Social Sciences." The American Archivist. 72(1): 169-182 • M. Altman, 2008, "A Fingerprint Method for Verification of Scientific Data" in, Advances in Systems, Computing Sciences and Software Engineering, (Proceedings of the International Conference on Systems, Computing Sciences and Software Engineering 2007) , Springer-Verlag. • M. Altman and G. King. 2007. “A Proposed Standard for the Scholarly Citation of Quantitative Data”, D-Lib, 13, 3/4 (March/April). Most reprints available from: informatics.mit.edu “Not-bad” Practices for Sharing Economics Data 4
  • 5. “Not-bad” Practices for Sharing Economics Data „Not Bad‟ Practices 5
  • 6. Some Trends Shifting Evidence Base High Performance Collaboration (here comes everybody…) More Data Publish, then Filter More Learners 6 More Open The Lifecycle and Institutional Ecology of Data
  • 7. Why not ‘best’ practices? • Few models for systematic valuation of data – how much will data X be worth to community Y at time Z? See: National Digital Stewardship Alliance, 2013 (Forthcoming), 2014 National Agenda for Digital Stewardship. Library of Congress • Optimality of practices are generally strongly dependent on operational context • Context of data sharing very dynamic – change in publication models – change in evidence base – change in data management methodologies – change in policies • Paucity of evidence to establish data practices as best: – Descriptive: adoption, compliance – Predictive: association of best practices &desired outcomes – Causal: intervention with best practices linked to improvement “Not-bad” Practices for Sharing Economics Data 7 Best practices neither best nor practiced.
  • 8. Why ‘not bad’ practices? • Avoid clearly bad practices • Document operational and tacit knowledge • Elicit assumptions • Provide basis for auditing, evaluation, and improvement “Not-bad” Practices for Sharing Economics Data 8
  • 9. “Not-bad” Practices for Sharing Economics Data Probably Not Bad Practices 9
  • 10. Types of Practices • Analytic practices – Lifecycle analysis – Requirements analysis • Policy practices – Data dissemination policies – Data citation policies – Reproducibility policies • Technical practices – Sharing technologies – Reproducibility technologies “Not-bad” Practices for Sharing Economics Data 10
  • 11. Core Dimensions of Shared Information Infrastructure “Not-bad” Practices for Sharing Economics Data • Stakeholder incentives – recognition; citation; payment; compliance; services • Dissemination – access to metadata; documentation; data • Access control – authentication; authorization; rights management • Provenance – chain of control; verification of metadata, bits, semantic content • Persistence – bits; semantic content; use • Legal protection – rights management; consent; record keeping; • Usability for… – discovery; deposit; curation; administration; annotation; collaboration • Economic model – valuation models; cost models; business models • Trust model – verification; transparency; enforcement See: King 2007; ICSU 2004; NSB 2005; Schneier 2011 11
  • 13. Legal Constraints Contract Intellectual Property Access Rights Confidentiality Copyright Fair Use DMCA Database Rights Moral Rights Intellectual Attribution Trade Secret Patent Trademark Common Rule 45 CFR 26 HIPAA FERPA EU Privacy Directive Privacy Torts (Invasion, Defamation) Rights of Publicity Sensitive but Unclassified Potentially Harmful (Archeological Sites, Endangered Species, Animal Testing, …) Classified FOIA CIPSEA State Privacy Laws EAR State FOI Laws Journal Replication Requirements Funder Open Access Contract License Click-Wrap TOU ITAR Export Restrictions
  • 14. Data Dissemination Policies - How • License: Creative Commons Version 4.0 of the Creative Commons licenses – Legally well crafted – Avoids attribution stacking – attribution through links – Handles sui-generis database rights, licensee rights to publicity, etc. – Machine actionable See: wiki.creativecommons.org/4.0 • Confidentiality Deidentification & public use files insufficient. – Need multiple modes of access, including protected access to confidential data. See: National Research Council. 2005. Expanding access to research data: Reconciling risks and opportunities. Washington, DC: The National Academies Press. Vadhan, S. , et al. 2010. “Re: Advance Notice of Proposed Rulemaking: Human Subjects Research Protections”. Available from: http://dataprivacylab.org/projects/irb/Vadhan.pdf “Not-bad” Practices for Sharing Economics Data 14
  • 15. Data Dissemination Policy - When • Timeliness [NRC Recommendations] – Sharing data should be a regular practice. – Investigators should share their data by the time of publication of initial major results of analyses of the data except in compelling circumstances. – Data relevant to public policy should be shared as quickly and widely as possible. – Plans for data sharing should be an integral part of a research plan whenever data sharing is feasible. Fienberg, et al. (eds). 1985. Sharing Research data. Washington, DC: The National Academies Press. “Not-bad” Practices for Sharing Economics Data 15
  • 16. Data Dissemination Policy - Where • With journals. Follow NISO supplementary materials: http://www.niso.org/workrooms/supplementalre commendations • With sustainable well known collaboratively- stewarded repositories – Example: data-pass.org Also see: M. Altman, Adams, M., Crabtree, J., Donakowski, D., Maynard, M., Pienta, A., & Young, C. 2009. "Digital preservation through archival collaboration: The Data Preservation Alliance for the Social Sciences." The American Archivist. 72(1): 169-182 “Not-bad” Practices for Sharing Economics Data 16
  • 17. Data Citation Policies • Data Citation First Principles (Harvard Workshop, NRC Report, Co-Data Forthcoming) – Data citations should be treated as first-class objects of publication – At minimum, all data necessary to understand assess extend conclusions in scholarly work should be cited. See: Altman, Micah. “Data Citation in The Dataverse Network.” Developing Data Attribution and Citation Practices and Standards Report from an International Workshop. Ed. Paul F Uhlir. National Academies Press, 2012 M. Altman and G. King. 2007. “A Proposed Standard for the Scholarly Citation of Quantitative Data”, D-Lib, 13, 3/4 (March/April). • Data-PASS recommendations – Minimal elements: author, date, title, persistent id – Location: must appear with other elements – Recommended: fixity information, such as Universal Numeric Fingerprint See: data-pass.org/citations.html “Not-bad” Practices for Sharing Economics Data 17
  • 18. Reproducibility Policies • Science – “Unpublished data and personal communications. Citations to unpublished data and personal communications cannot be used to support claims in a published paper. Papers will be held for publication until all "in press" citations are published.” – “Data and materials availability All data necessary to understand, assess, and extend the conclusions of the manuscript must be available to any reader of Science. All computer codes involved in the creation or analysis of data must also be available to any reader of Science. “ • Support for publishing replication – Registered replication reports: http://www.psychologicalscience.org/index.php/replication – ICMJE Clinical Trials Registration: http://www.icmje.org/publishing_10register.html – Journals of Negative/Null Results “Not-bad” Practices for Sharing Economics Data 18
  • 19. Policies are not Self-Enforcing /Sustaining • Technical and financial sustainability must be planned, to ensure long term access See: National Science Board, Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century. NSF. http://www.nsf.gov/pubs/2005/nsb0540/nsb0540.pdf • Long-term access requires initial investment in data preparation – Capture tacit knowledge, create metadata – Transfer to stable formats “Not-bad” Practices for Sharing Economics Data 19
  • 20. Compliance with Data Sharing Policies is often Low The Lifecycle and Institutional Ecology of Data  Compliance is low even in best examples of journals  Checking compliance is labor- intensive without citation and repository standards [See Glandon 2011; Mucullough, et. al 2008] 20
  • 21. Technical Infrastructure Examples • CKAN – Open Source – Established – Built on drupal platform – http://ckan.org/ • Dataverse Network – http://thedata.org – Open Source – Flexible archival models – Semantic Fixity (UNF) [Altman 2008] • MyExperiment – http://www.myexperiment.org/ – Long lasting – Archives complete workflows to produce results “Not-bad” Practices for Sharing Economics Data 21 Technical Criteria • Long term access – Replication, independence • Verifiability and fixity • Provenance • Workflows/code
  • 22. Final Observations • Best practices aren’t… – document context of practice & measure desired outcomes • Not-bad practice starts with analysis… – lifecycle; requirements; sustainability ; predicted costs and benefits • Effective data sharing requires policies: – dissemination, citation, replication, auditing • Effective data sharing requires infrastructure: – For verifiability, provenance, workflows/code, & long term access • Policies are not self-enforcing – combine incentives, transparency, auditing, & evaluation “Not-bad” Practices for Sharing Economics Data 22
  • 23. Additional Bibliography (Selected) • McCullough, B.D., Kerry Anne McGeary, and Teresa D. Harrison. "Do Economics Journal Archives Promote Replicable Research?" Canadian Journal of Economics 41, no. 4 (2008). • Schneier, Bruce, 2012, Liars and Outliers. Wiley. • Borgman, Christine. “The Conundrum of Research Sharing.” Journal of the American Society for Information Science and Technology (2011):1-40. • Glandon P. , 2011. Report on the American Economic Review Data Availability Compliance Project. http://www.aeaweb.org/aer/2011_Data_Compliance_Report.pdf • King, Gary. 2007. An Introduction to the Dataverse Network as an Infrastructure for Data Sharing. Sociological Methods and Research 36: 173–199NSB • International Council For Science (ICSU) 2004. ICSU Report of the CSPR Assessment Panel on Scientific Data and Information. Report. “Not-bad” Practices for Sharing Economics Data 23
  • 24. Questions? E-mail: escience@mit.edu Web: micahaltman.com Twitter: @drmaltman “Not-bad” Practices for Sharing Economics Data 24

Editor's Notes

  1. This work. by Micah Altman (http://micahaltman.com) is licensed under the Creative Commons Attribution-Share Alike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/3.0/us/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.
  2. Best practices aren&apos;t.The core issue is that there are few models for the systematic valuation of data: We have no robust general proven ways of answering the question of how much data X be worth to community Y at time Z. Thus the &quot;bestness&quot; (optimality) of practices are generally strongly dependent on operational context.. and the context of data sharing is currently both highly complex and dynamic Until there is systematic descriptive evidence that best practices are used, predictive evidence that best practices are associated with future desired outcomes, and causal evidence that the application of best practices yields improved outcomes, we will be unsure that practices are &quot;best&quot;.Nevertheless, one should use established &quot;not-bad&quot; practices, for a number of reasons. First, to avoid practices that are clearly bad; second, because use of such practices acts to dcoument op[erational and tacit knowledge; third because selecting practices can help to elicit the underlying assumptions under which practices are applied; and finally because not-bad practcies provide a basis for auditing, evaluation, and eventual improvement.Specific not-bad practices for data sharing fall into roughly three categories :Analytic practices: lifecycle analysis &amp; requirements analysisPolicy practices for: data dissemination, licensing, privacy, availability, citation and reproducibilityTechnical practices for sharing and reproducibility, including fixity, replication, provenanceThis presentation at the Second Open Economics International Workshop (sponsored by the Sloan Foundation, MIT and OKFN) provides an overview of these and links to specific practices recommendations, standards, and tools:
  3. LHC produces a PB every 2 weeks, Sloan Galaxy zoo has hundreds of thousands of “authors”, 50K people attend a class from the University of michigan, and to understand public opinion instead of surveying 100’s of people per month we can analyze 10ooo tweets per second.
  4. Most of the different stakeholders have stronger relationships/stakes with research at different stages. But researchers and research institutions are in the middle – they have a strong stake in most stagesResearchers are more directly concerned with collection, processing, analysis, dissemination. Organizations have a higher stake in internal sharing, re-use, long-term access.