Research data “publishing” –
models, roles and responsibilities
             Sünje Dallmeier-Tiessen

       CERN | Scientific Information Service

           Humboldt Universität, Berlin
Outline
Research data “publishing”
• Introduction: Research data
• Models
      – Text-centered occurrence
      – Data-centered occurrence
      – Object-centric principle
• Roles
      – Motivation, Competences, Responsibilities
• Conclusions

Open Access – Open Data 2010   Sunje Dallmeier-Tiessen | CERN
Research data publishing

  1884                                                          2010




Open Access – Open Data 2010   Sunje Dallmeier-Tiessen | CERN
What is research data?
Research data                 How does it look like?

                              How can this be integrated
                                into modern scholarly
                                communication?




Foto: AWI Bremerhaven, Gutt
Research data “publishing” beyond
            disciplines?
 Discipline specific characteristics of research data,
   BUT challenges are similar:
       – Non existing data publishing culture
              • Missing infrastructure
              • Non established workflows
              • Missing awareness/motivation/submission
                “we need to get this paper out” vs. “we need to publish this
                dataset”
       – Diverse research datasets
              • Complexity: From simple files to complex datasets
              • Size: From small to “huge”
              • Documentation: varying requirements

Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
Models
How does the world of research
data publishing look like today?
stand-alone, text centered, data
           centered
Occurence:
               text centered publication model

                                                                     Research
                                                                       Data
• Long tradition
• Linked research data in
  external data                                                                      Metadata
                                              Tables
  respositories, e.g. DataCite
                                                                   Text, Results,
• Attached supplemental                                           Interpretation
  material: tables, graphs,
  audio/video/data files (on
  same plattform as text
                                                        Graphs,                 Documen-
  publication)                                          Figures                   tation



  Open Access – Open Data 2010   Sunje Dallmeier-Tiessen | CERN
Supplement




                                                                 Source: Nature 2006

Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
Occurrence: data centered publication
                   model
                                                         Interpretation,
                                                             Article
                                   Services                     I
                                                                       Interpretation,
                                                                           Article
                                                                              II




                                            Research
                          Tables
                                              Data
                                                                             Metadata



                                   Graphs,
                                   Figures                   Documen-
                                                               tation


Open Access – Open Data 2010          Sunje Dallmeier-Tiessen | CERN
Data Journal
Data centered publication

• including data documentation
• data access
• quality assurance – peer review
• open access

Incentive: publication for
enhanced data provision



Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
Direct data access




Open Access – Open Data 2010       Sunje Dallmeier-Tiessen | CERN
With modern information technology,
        many opportunities
… to service the different needs in the research
  communities:
• When do I want to publish my data?
            • After production?
            • Before or after the “text” publication?
• And with whom would I like to share?
            •   With my colleagues?
            •   With my community?
            •   With the entire world?
            •   … or maybe this might change over time?
Different models and responsibilities for involved
 parties
Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
Looking into the future:
        Principle – object centered (“mash up”)
Materials and services as individually addressed objects
• Persistence of objects/services,
• Persistent identifier

                                                                      Results and
                                                                    Interpretation
                                  Table



                Figure
                                                                               Research Data

                                     Documentation


                                                                         Services


   Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
Hands-on!?

       Information
         Science/
        Publishers



      IT/        Research
Computing        Community




                             after Corral, 2008
                             Foto by katera, flickr
Prerequisites for Research Data Publishing
               Cooperation of researchers and service providers


                                          Access

              Research data
                                          Quality                        Infrastructure/
             (Documentation,
                                                                           Workflows
               Metadata…)
                                       Persistence,
                                          Trust-
                                       worthiness


     Community/Researchers:                                     Infrastructure provider
                                                                Workflows
     Provision,                                                 Offer infrastructure
     Documentation,                                             Standards
     Quality control                                            Trustworthiness
                                  Responsibilities
Open Access – Open Data 2010   Sunje Dallmeier-Tiessen | CERN
Role of researchers

… as data providers
[data production]
                                          Foto by G. F. Wicke, flickr
• Data documentation
• Data submission (preparation of publishable
  dataset)
• Data quality assurance/peer review

… as data users
• Citation of dataset, acknowledgement
                                                                Source: Nature 2006

Open Access – Open Data 2010   Sunje Dallmeier-Tiessen | CERN
Role of information scientists
Service
• Management of the research material
  remix
   – Facilitating new workflows
   – Integration of new features: e.g. data
     citation
   – Preservation, catalogue
• Interface between IT and researcher
   – Make publishing platforms usable
         • Submission
         • Search functionality

                                                                   foto: flickr
  Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
Conclusions
• Missing data publishing culture in many
  communities
• New publishing models allow tailored
  services, developed with the community
• Significant role of community in research data
  preservation and publishing
      – Implications for content recruitment


Open Access – Open Data 2010    Sunje Dallmeier-Tiessen | CERN
QUESTIONS



   sunje.dallmeier-tiessen@cern.ch



   Funded by BMBF, Gentner Scholarship




Open Access – Open Data 2010   Sunje Dallmeier-Tiessen | CERN

Sünje Dallmeier-Tiessen: Research data "publishing": models, roles and responsibilities

  • 1.
    Research data “publishing”– models, roles and responsibilities Sünje Dallmeier-Tiessen CERN | Scientific Information Service Humboldt Universität, Berlin
  • 2.
    Outline Research data “publishing” •Introduction: Research data • Models – Text-centered occurrence – Data-centered occurrence – Object-centric principle • Roles – Motivation, Competences, Responsibilities • Conclusions Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 3.
    Research data publishing 1884 2010 Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 4.
    What is researchdata? Research data How does it look like? How can this be integrated into modern scholarly communication? Foto: AWI Bremerhaven, Gutt
  • 5.
    Research data “publishing”beyond disciplines? Discipline specific characteristics of research data, BUT challenges are similar: – Non existing data publishing culture • Missing infrastructure • Non established workflows • Missing awareness/motivation/submission “we need to get this paper out” vs. “we need to publish this dataset” – Diverse research datasets • Complexity: From simple files to complex datasets • Size: From small to “huge” • Documentation: varying requirements Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 6.
    Models How does theworld of research data publishing look like today? stand-alone, text centered, data centered
  • 7.
    Occurence: text centered publication model Research Data • Long tradition • Linked research data in external data Metadata Tables respositories, e.g. DataCite Text, Results, • Attached supplemental Interpretation material: tables, graphs, audio/video/data files (on same plattform as text Graphs, Documen- publication) Figures tation Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 8.
    Supplement Source: Nature 2006 Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 9.
    Occurrence: data centeredpublication model Interpretation, Article Services I Interpretation, Article II Research Tables Data Metadata Graphs, Figures Documen- tation Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 10.
    Data Journal Data centeredpublication • including data documentation • data access • quality assurance – peer review • open access Incentive: publication for enhanced data provision Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 11.
    Direct data access OpenAccess – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 12.
    With modern informationtechnology, many opportunities … to service the different needs in the research communities: • When do I want to publish my data? • After production? • Before or after the “text” publication? • And with whom would I like to share? • With my colleagues? • With my community? • With the entire world? • … or maybe this might change over time? Different models and responsibilities for involved parties Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 13.
    Looking into thefuture: Principle – object centered (“mash up”) Materials and services as individually addressed objects • Persistence of objects/services, • Persistent identifier Results and Interpretation Table Figure Research Data Documentation Services Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 14.
    Hands-on!? Information Science/ Publishers IT/ Research Computing Community after Corral, 2008 Foto by katera, flickr
  • 15.
    Prerequisites for ResearchData Publishing Cooperation of researchers and service providers Access Research data Quality Infrastructure/ (Documentation, Workflows Metadata…) Persistence, Trust- worthiness Community/Researchers: Infrastructure provider Workflows Provision, Offer infrastructure Documentation, Standards Quality control Trustworthiness Responsibilities Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 16.
    Role of researchers …as data providers [data production] Foto by G. F. Wicke, flickr • Data documentation • Data submission (preparation of publishable dataset) • Data quality assurance/peer review … as data users • Citation of dataset, acknowledgement Source: Nature 2006 Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 17.
    Role of informationscientists Service • Management of the research material remix – Facilitating new workflows – Integration of new features: e.g. data citation – Preservation, catalogue • Interface between IT and researcher – Make publishing platforms usable • Submission • Search functionality foto: flickr Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 18.
    Conclusions • Missing datapublishing culture in many communities • New publishing models allow tailored services, developed with the community • Significant role of community in research data preservation and publishing – Implications for content recruitment Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
  • 19.
    QUESTIONS sunje.dallmeier-tiessen@cern.ch Funded by BMBF, Gentner Scholarship Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN

Editor's Notes

  • #4 Here add the files from gentner day
  • #5 Plus the advancements that Martin presented already
  • #16 Basis fuer die textorientierte und die datenorientierte Publikationsform, dadurch auch zeitliche unabhaengigkeit
  • #18 Publishers belong to infrastructure providers.