Alive and kicking!
Keeping data re-usable in the European Values Study
IASSIST Cologne, May 2013
Astrid.Recker@gesis.org, Evelyn.Brislinger@gesis.org
GESIS, Data Archive for the Social Sciences
Overview
 Data and information flow in the EVS project
 Principles and workflows for managing data and
documentation in survey projects
GESIS Data Archive
Basis
Interplay between Principal Investigators (PI) and Data Archive
Agreement on submission of data and information packages
Goals
Ease access to data for a broad user community
Provide metadata for discovery, understanding, and good use of data
Preserve data and metadata for re-use and replications
Holdings
Studies, study series, and complex survey programs such as ISSP, Eurobarometer,
ALLBUS, the European Values Study (EVS), and election studies
Data and information created in a survey project
Total stock of data and
documentation created
Data and documentation
submitted to an archive
Further information necessary
for the project(?)
Selection processes
Management solutions for structuring data and information
Example: European Values Study (EVS)
4 waves at 9-year intervals
49 countries, 125 national surveys
Cross-national, longitudinal
research program
National surveys
Waves
1981/1990/1999/2008
Longitudinal data File
1981-2008 (LdF)
Integrated Values Surveys
EVS/WVS (IVS)
Harmonization and integration process
Number of files
Size of files
Atlas of European Values
www.europeanvaluesstudy.eu/evs/evsatlas.html
Collaboration of actors involved (EVS 2008)
National team: data created, processed, documented
Central team: data standardized, harmonized, integrated
Data Archive: data checked, documented, preserved, released
Secondary users: data re-used, analyses replicated, results reported
Principal Investigators
Users: analyze and evaluate outcomes
Questions
Check trend questions and original
questions
 ZACAT-Online Study Catalogue
Data
Analyze data, report errors, monitor
error reporting
 GESIS Data Catalogue
Publications
Replicate analysis of other projects
 EVS Repository
... and detect peculiarities in questions or problems in data
Peculiarities in question text spotted?
Project Design
Questionnaire Design
Questionnaire Translation
Data Collection
Data Documentation
Data Processing
Check question and translation
Master/field questionnaire, methodological
questionnaire, report ‘Translation History’
Check source of question
Trend question from EVS and WVS,
questions borrowed from other surveys
Identify consequences for
Countries sharing/adopting affected
language, languages belonging to a family,
further languages used in a country
EVS 2008 Data lifecycle
Data error detected?
Standardization and harmonization process: check comparability of surveys,
questions, variables → cumulate data and document each step
Integrated Values Surveys EVS/WVS
Longitudinal data File 1981-2008
Wave 2008, national data, original data file
Wave 1999, …, national data, …
Retrace data processing steps across surveys: check data, syntax
files, and documentation → update data and highlight problems for next wave
Error detected
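The idea of documenting each processing step so that it can be retraced later could be sketched as a small processing log. This is only an illustrative sketch, not the EVS workflow: the `ProcessingStep` structure, the `retrace` helper, and all file names are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class ProcessingStep:
    """One documented step in a harmonization chain (hypothetical structure)."""
    source: str       # file the step read from
    target: str       # file the step produced
    syntax_file: str  # script that performed the step
    note: str         # what was changed and why


def retrace(target: str, log: list[ProcessingStep]) -> list[ProcessingStep]:
    """Walk backwards from a file towards the original data, newest step first."""
    by_target = {step.target: step for step in log}
    path = []
    seen = set()  # guards against accidental cycles in the log
    while target in by_target and target not in seen:
        seen.add(target)
        step = by_target[target]
        path.append(step)
        target = step.source
    return path


# Illustrative file names only; they are not actual EVS file names.
log = [
    ProcessingStep("national_original.sav", "national_standardized.sav",
                   "standardize.sps", "recode to master questionnaire"),
    ProcessingStep("national_standardized.sav", "wave_2008.sav",
                   "integrate_wave.sps", "append national file to wave file"),
    ProcessingStep("wave_2008.sav", "longitudinal_1981_2008.sav",
                   "cumulate.sps", "harmonize trend variables across waves"),
]

for step in retrace("longitudinal_1981_2008.sav", log):
    print(f"{step.target} <- {step.source} via {step.syntax_file}: {step.note}")
```

When an error is found in a cumulated file, such a log lists the syntax files and intermediate files that would need to be checked, in the order in which to check them.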
Data and information created
Designated communities
 Principal Investigator/Project
 Secondary user
Experiences from EVS project
Data and information packages
 Project package
 Archive package
Selection processes
 Within project
 Between project and archive
Project
Archive
Total stock
Communicating with the future: Activity on two levels
Macro level
Defining workflows, file and information paths on which
necessary information is passed on
Micro level
Organizing information so that it is
re-usable (RDM, metadata,
systematic file structures)
Begin by identifying principles for structuring and documenting files in
the project (Research Data Management)
Select which information is relevant to whom?
A tidy house, a tidy mind!
Reference, don’t duplicate files whenever possible
Identify and capture “kinship relations”
Capture process knowledge
classes
itineraries
Make changes traceable
versioning
document revisions & annotations
minutes
protocols
The magic wand
(and not the one used by the sorcerer’s apprentice)
 Follow principles of good research data management (RDM)
 Use metadata to document process and content information
 Use standards wherever possible (e.g. DDI, Dublin Core, ISO codes, file naming conventions, etc.)
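The standards mentioned on this slide (ISO codes, file naming conventions) could be combined along the following lines. This is a minimal sketch under stated assumptions: the naming pattern, the field order, and the helper names `build_name` and `parse_name` are invented for illustration and are not a convention used by EVS or GESIS. It uses ISO 3166-1 alpha-2 country codes and ISO 8601 dates.

```python
import re
from datetime import date

# Hypothetical convention:
# <project>_<wave>_<country>_<kind>_v<major>-<minor>_<YYYY-MM-DD>.<ext>
# The pattern checks only the shape of the country code (two capital letters),
# not whether the code is actually assigned in ISO 3166-1.
PATTERN = re.compile(
    r"^(?P<project>[a-z]+)_(?P<wave>\d{4})_(?P<country>[A-Z]{2})_"
    r"(?P<kind>[a-z]+)_v(?P<major>\d+)-(?P<minor>\d+)_"
    r"(?P<date>\d{4}-\d{2}-\d{2})\.(?P<ext>[a-z0-9]+)$"
)


def build_name(project, wave, country, kind, major, minor, day, ext):
    """Assemble a file name and reject it if it breaks the convention."""
    name = f"{project}_{wave}_{country}_{kind}_v{major}-{minor}_{day.isoformat()}.{ext}"
    if PATTERN.match(name) is None:
        raise ValueError(f"name does not follow the convention: {name}")
    return name


def parse_name(name):
    """Split a conforming file name back into its documented parts."""
    match = PATTERN.match(name)
    if match is None:
        raise ValueError(f"name does not follow the convention: {name}")
    parts = match.groupdict()
    parts["date"] = date.fromisoformat(parts["date"])
    return parts


name = build_name("evs", 2008, "DE", "questionnaire", 1, 0, date(2013, 5, 29), "pdf")
print(name)
print(parse_name(name)["country"])
```

Because the name can be parsed back into its parts, the file name itself carries a small amount of metadata (wave, country, version, date) that stays readable without any additional tool.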
Nodes: Document, Date created, Language, Version, Format, Resource, Rights, Date modified, English, Actor, Name, Collection
Relations: hasDate, hasModifier, creates, modifies, hasAccessRights, isA, hasVersion, hasCreator, hasLanguage, hasIdentifier, isPartOf, hasFormat, hasRole
Dublin Core terms: dc:creator, dc:created, dc:modified, dc:identifier, dc:format, dc:provenance, dc:description, dc:language, dc:accessRights, dc:collection, …
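A document-level metadata record using the Dublin Core labels from this slide might look like the sketch below. The values, the list of required fields, and the helper `missing_fields` are assumptions made for illustration. Note that in the DCMI vocabulary, created, modified, accessRights, and provenance are defined in the DCMI Metadata Terms namespace (usually abbreviated dcterms:) rather than in the original fifteen-element set; the sketch keeps the slide's dc: labels as plain dictionary keys.

```python
# Keys reuse the labels shown on the slide; all values are invented examples.
record = {
    "dc:identifier": "doc-0001",
    "dc:creator": "Central team",
    "dc:created": "2008-11-03",
    "dc:modified": "2009-02-17",
    "dc:format": "application/pdf",
    "dc:language": "en",
    "dc:accessRights": "project-internal",
    "dc:description": "Field questionnaire, annotated",
}

# Assumption: which fields count as mandatory is a project decision.
REQUIRED = ("dc:identifier", "dc:creator", "dc:created", "dc:format", "dc:language")


def missing_fields(rec, required=REQUIRED):
    """Return the required keys that are absent or empty in a record."""
    return [key for key in required if not str(rec.get(key, "")).strip()]


print(missing_fields(record))
print(missing_fields({"dc:identifier": "doc-0002", "dc:creator": " "}))
```

A check of this kind can run whenever a document is added to the project repository, so that gaps in the documentation are noticed while the people who can fill them are still available.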
Managing information flows in a collaborative, long-
term project
 Which paths does information (data, documentation, other
contextual material) take from producers to users?
 Two models helped us clarify processes and paths, as well as
identify helpful terminology and concepts
– Project life cycle
– Open Archival Information System (OAIS) reference model
(CCSDS 2012)
CCSDS (2012). Reference Model for an Open Archival Information System (OAIS). Recommended Practice.
http://public.ccsds.org/publications/archive/650x0m2.pdf
Project Repository
Ingest
Data processing
and enhancement
Data
Management
Temporary
Storage
Access
(project-internal
use, PIs)
Project Design
Data
Dissemination
Questionnaire
Design
Questionnaire
Translation
Data Collection
Data
Documentation
Data
Processing
Project life cycle: Data flow during creation of a survey
Guidelines
Data Archive
(preservation service provider)
Data
Management
Access
Archival Storage
(long-term)
Preservation Planning
Administration
Ingest
Secondary
Users
(future)
Principal
Investigators
SIP AIP AIP DIP
Project Repository
(content provider)
Ingest
Data processing
and enhancement
Data
Management
Temporary
Storage
Access
(project-internal
use, PIs)
Project and Data Archive as distributed system
PIP
PIP = Project Information Package, SIP = Submission Information Package,
AIP = Archival Information Package, DIP = Dissemination Information Package
Project Design
Data
Dissemination
Questionnaire
Design
Questionnaire
Translation
Data Collection
Data
Processing
Data
Documentation
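The selection from project package to archive package to dissemination package could be modelled roughly as follows. This is one possible reading of the slide, offered as a sketch: the classes `Item` and `InformationPackage`, the `select` helper, the audience labels, and the example items are all hypothetical, and the AIP step inside the archive is left out for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    name: str
    audiences: frozenset  # who needs this item: "project", "archive", "secondary_user"


@dataclass(frozen=True)
class InformationPackage:
    kind: str     # "PIP", "SIP", "AIP", or "DIP"
    items: tuple  # the Item objects carried by the package


def select(package, kind, audience):
    """Derive a new package holding only the items relevant to one audience."""
    kept = tuple(item for item in package.items if audience in item.audiences)
    return InformationPackage(kind, kept)


# Invented items; relevance would be agreed between project and archive.
pip = InformationPackage("PIP", (
    Item("integrated data file", frozenset({"project", "archive", "secondary_user"})),
    Item("syntax files", frozenset({"project", "archive"})),
    Item("meeting minutes", frozenset({"project"})),
))

sip = select(pip, "SIP", "archive")         # selection between project and archive
dip = select(sip, "DIP", "secondary_user")  # what a secondary user receives

print([item.name for item in sip.items])
print([item.name for item in dip.items])
```

Recording the intended audience per item makes the selection decision explicit and repeatable, instead of leaving it implicit in whoever assembles the submission.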
Staying Alive! Where we are going from here
 Developing a guideline for projects
– structuring and annotating of information on the micro level
– issues to discuss with an Archive (preservation service provider)
 Testing our model
– implementing our ideas in smaller projects with the aim of
making the results available to other projects
Thank you for your attention!
Evelyn Brislinger | Astrid Recker
GESIS – Leibniz Institute for the Social Sciences, Data Archive
evelyn.brislinger@gesis.org | astrid.recker@gesis.org
www.gesis.org

Alive and kicking! Keeping data re-usable in the European Values Study

  • 1. Alive and kicking! Keeping data re-usable in the European Values Study IASSIST Cologne, May 2013 Astrid.Recker@gesis.org, Evelyn.Brislinger@gesis.org GESIS, Data Archive for the Social Sciences
  • 2. Overview  Data and information flow in the EVS project  Principles and workflows for managing data and documentation in survey projects
  • 3. GESIS Data Archive Basis Interplay between Principal Investigators (PI) and Data Archive Agreement on submission of data and information packages Goals Ease access to data for a broad user community Provide metadata for discovery, understanding, and good use of data Preserve data and metadata for re-use and replications Holdings Studies, study series, and complex survey programs as ISSP, Eurobarometer, ALLBUS, European Values Study (EVS), or election studies
  • 4. Data and information created in a survey project Total stock of data and documentation created Data and documentation submitted to an archive Further information necessary for the project(?) Selection processes Management solutions for structuring data and information
  • 5. Example: European Values Study (EVS) 9-year-period, 4 waves 49 countries, 125 national surveys Cross-national, longitudinal research program National surveys Waves 1981/1990/1999/2008 Longitudinal data File 1981-2008 (LdF) Integrated Values Surveys EVS/WVS (IVS) Harmonization and integration process Number of files Size of files Atlas of European Values www.europeanvaluesstudy.eu/evs/evsatlas.html
  • 6. Collaboration of actors involved (EVS 2008). Actors: Principal Investigators, Data Archive, Secondary users. National team: data created, processed, documented. Central team: data standardized, harmonized, integrated. Data Archive: data checked, documented, preserved, released. Secondary users: data re-used, analyses replicated, results reported
  • 7. Users: analyze and evaluate outcomes. Questions: Check trend questions and original questions → ZACAT-Online Study Catalogue. Data: Analyze data, report errors, monitor error reporting → GESIS Data Catalogue. Publications: Replicate analysis of other projects → EVS Repository ... and detect peculiarities in questions or problems in data
  • 8. Peculiarities in question text spotted? Check question and translation: Master/field questionnaire, methodological questionnaire, report ‘Translation History’. Check source of question: Trend question from EVS and WVS, questions borrowed from other surveys. Identify consequences for: Countries sharing/adopting affected language, languages belonging to a family, further languages used in a country. EVS 2008 data lifecycle: Project Design, Questionnaire Design, Questionnaire Translation, Data Collection, Data Documentation, Data Processing
  • 9. Data error detected? Standardization and harmonization process: check comparability of surveys, questions, variables → cumulate data and document each step. Data levels: Integrated Values Surveys EVS/WVS; Longitudinal data File 1981-2008; Wave 2008 (national data, original data file); Wave 1999 ... (national data ...). Retrace data processing steps across surveys: check data, syntax files, and documentation → update data and highlight problems for next wave. Error detected
  • 10. Data and information created: Experiences from EVS project. Designated communities: Principal Investigator/Project; Secondary user. Data and information packages: Project package; Archive package. Selection processes: Within project; Between project and archive. Diagram labels: Project, Archive, Total stock
  • 11. Communicating with the future: Activity on two levels. Macro level: Defining workflows, file and information paths on which necessary information is passed on. Micro level: Organizing information so that it is re-usable (RDM, metadata, systematic file structures)
  • 12. Begin by identifying principles for structuring and documenting files in the project (Research Data Management). Select which information is relevant to whom. A tidy house, a tidy mind! Reference, don’t duplicate files whenever possible. Identify and capture “kinship relations”. Capture process knowledge (classes, itineraries). Make changes traceable (versioning, document revisions & annotations, minutes, protocols)
  • 13. The magic wand (and not the one used by the sorcerer’s apprentice): Follow principles of good research data management (RDM); Use metadata to document process and content information; Use standards wherever possible (e.g. DDI, Dublin Core, ISO codes, file naming conventions)
  • 15. Managing information flows in a collaborative, long-term project. Which paths does information (data, documentation, other contextual material) take from producers to users? Two models helped us clarify processes and paths, as well as identify helpful terminology and concepts: the project life cycle and the Open Archival Information System (OAIS) reference model (CCSDS 2012). Reference: CCSDS (2012). Reference Model for an Open Archival Information System (OAIS). Recommended Practice. http://public.ccsds.org/publications/archive/650x0m2.pdf
  • 16. Project life cycle: Data flow during creation of a survey. Life cycle stages: Project Design, Questionnaire Design, Questionnaire Translation, Data Collection, Data Documentation, Data Processing, Data Dissemination. Project Repository: Ingest, Data processing and enhancement, Data Management, Temporary Storage, Access (project-internal use, PIs). Guidelines
  • 17. Project and Data Archive as distributed system. Project Repository (content provider): Ingest, Data processing and enhancement, Data Management, Temporary Storage, Access (project-internal use, PIs). Data Archive (preservation service provider): Ingest, Data Management, Archival Storage (long-term), Preservation Planning, Administration, Access. Actors: Principal Investigators, Secondary Users (future). Information packages: PIP = Project Information Package, SIP = Submission Information Package, AIP = Archival Information Package, DIP = Dissemination Information Package. Life cycle stages: Project Design, Questionnaire Design, Questionnaire Translation, Data Collection, Data Processing, Data Documentation, Data Dissemination
  • 18. Staying Alive! Where we are going from here. Developing a guideline for projects: structuring and annotating of information on the micro level; issues to discuss with an Archive (preservation service provider). Testing our model: implementing our ideas in smaller projects with the aim of making the results available to other projects
  • 19. Thank you for your attention! Evelyn Brislinger | Astrid Recker GESIS – Leibniz Institute for the Social Sciences, Data Archive evelyn.brislinger@gesis.org | astrid.recker@gesis.org www.gesis.org
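
Slides 12 and 13 recommend file naming conventions, ISO codes, and versioning as ways to make changes traceable. The following is a minimal Python sketch of what such a convention could look like in practice. The pattern `study_wave_country_vMAJOR-MINOR-PATCH.ext`, the names `parse_name` and `latest`, and the small country list are assumptions made for illustration; they are not the convention used by GESIS or the EVS project. Only the wave years are taken from the slides.

```python
import re
from typing import NamedTuple

# Wave years as listed on slide 5.
WAVES = {1981, 1990, 1999, 2008}
# Illustrative subset of ISO 3166-1 alpha-2 codes (assumption, not the EVS country list).
COUNTRY_CODES = {"DE", "FR", "NL", "SE"}

# Hypothetical convention: study_wave_country_vMAJOR-MINOR-PATCH.ext
PATTERN = re.compile(
    r"^(?P<study>[a-z]+)_(?P<wave>\d{4})_(?P<country>[a-z]{2})"
    r"_v(?P<major>\d+)-(?P<minor>\d+)-(?P<patch>\d+)\.(?P<ext>[a-z0-9]+)$"
)


class DataFileName(NamedTuple):
    study: str
    wave: int
    country: str
    version: tuple[int, int, int]
    ext: str


def parse_name(name: str) -> DataFileName:
    """Split a file name into its parts, or raise ValueError if it breaks the convention."""
    m = PATTERN.match(name)
    if m is None:
        raise ValueError(f"name does not follow the convention: {name!r}")
    wave = int(m["wave"])
    country = m["country"].upper()
    if wave not in WAVES:
        raise ValueError(f"unknown wave year: {wave}")
    if country not in COUNTRY_CODES:
        raise ValueError(f"unknown country code: {country}")
    version = (int(m["major"]), int(m["minor"]), int(m["patch"]))
    return DataFileName(m["study"], wave, country, version, m["ext"])


def latest(names: list[str]) -> DataFileName:
    """Return the file with the highest version, comparing version parts as numbers."""
    return max((parse_name(n) for n in names), key=lambda f: f.version)
```

Because versions are compared as integer tuples rather than as strings, `v1-10-0` is correctly treated as newer than `v1-2-0`. Rejecting names that do not match the pattern is one way to keep a shared project repository tidy, in the spirit of "a tidy house, a tidy mind".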