A presentation to the Illinois Association of
Astronomers and Astrophysicists (IAAA)
during the 2013 Conference in Chicago, IL.
By Katie Schmitt
WHAT IS DATA?
"A reinterpretable representation of information in a formalized
manner suitable for communication, interpretation, or processing.
Examples of data include a sequence of bits, a table of numbers,
the characters on a page, the recording of sounds made by a
person speaking, or a moon rock specimen“ – OAIS reference
Types of Data:
WHAT IS DATA CURATION?
Life Cycle Model
Source: DDI Structural Reform Group. “DDI Version 3.0 Conceptual Model." DDI
Alliance. 2004. http://www.icpsr.umich.edu/DDI/committee-info/Concept-Model-WD.pd
BEST PRACTICES - PROVENANCE
prov·e·nance [ próvvənənss ]:
the place of origin of something
the source and ownership history
Instrument characteristics, calibration data and method of discovery
Changes in location or instrumentation
Changes in ownership of the data
Where did the data come from and how did it get here?
BEST PRACTICES - METADATA
Used to enable data discovery
Metadata standards vary per data repository
In general, metadata must be:
Written for humans
In a digital format
BEST PRACTICES PRESERVATION
The best format is
Platform and Vendor-independent
Source: Week 4 Slides by Ruth Duerr,
BEST PRACTICES – ACCESS
Constant balance between preservation and access
Similar to preservation format
Master v. Access
TYPES OF REPOSITORIES
Often connected to a University
Usually provide high levels of service
Specialized by discipline
Excel in basic service
New to the data management realm.
A FEW RESOURCES…
Choudhury, G. S., Palmer, C. L., Baker, K. S., & DiLauro, T. (2013,
January). Levels of services and curation for high-functioning data.
Presented at the International Digital Curation Conference,
Miles, S., Deelman, E., Groth, P., Vahi, K., Mehta, G., & Moreau, L.
(2007). Connecting Scientific Data to Scientific Experiments with
Provenance. e-Science and Grid Computing, IEEE International
Conference, 179-186. http://dx.doi.org/ 10.1109/ESCIENCE.2007.22
Renear, A. H., Sacchi, S., & Wickett, K. M. (2010). Definitions of
dataset in the scientific and technical literature. Proceedings of the
American Society for Information Science and Technology, 47(1), 1–4