SlideShare a Scribd company logo
1 of 54
Building a Data Discovery Network
            for Sustainability Science
                                            Robert H. McDonald
                                 Deputy Director Data to Insight (D2I) Center
                                       Associate Dean – IU Libraries
                                             Indiana University
                            rhmcdona@indiana.edu | @mcdonald @SEADdatanet

                                  Presented at the VIVO 2012 Conference

                                         Miami, FL– August 24, 2012
http://slidesha.re/Q9q8VW        Available from: http://slidesha.re/Q9q8VW


                                                 © Trustees of Indiana University
                                                 Released under Creative Commons 3.0
                                                 unported license; license terms on last slide.
NSF DataNet Program
Motivation:
  “… one of the major challenges of this scientific
  generation: how to develop the new methods,
  management structures and technologies to
  manage the diversity, size, and complexity of
  current and future data sets and data streams.”
Response:
  DataNet creates “a set of exemplar national
  and global data research infrastructure
  organizations” to address this challenge.
Current NSF DataNet Projects
• SEAD
   – http://sead-data.net
• DataOne
   – http://www.dataone.org
• DataNet Federation Consortium
   – http://datafed.org
• Terra Populous
   – https://www.pop.umn.edu/terra_pop
Sustainable Environment Actionable Data
              (SEAD) - DataNet
• SEAD Strategy                     SEAD Partners - http://sead-data.net
  ― Serve scientists and
    researchers in the “long
    tail” of science
  ― Leverage social media for
    discovery of
    data, interest, and
    expertise
  ― Move data curation
    upstream in the data life
    cycle of science
  ― Take advantage of
    existing domain and
    institutional infrastructures
    (Institutional
    Repositories, ICPSR) for
    long-term preservation
SEAD TEAMS
              Margaret Hedstrom-PI, Ann Zimmerman-Co-PI, Karen
 Michigan     Woollams, George Alter (ICPSR), Bryan Beecher (ICPSR), Jude Yew

              Beth Plale-Co-PI, Katy Börner, Robert H. McDonald, Robert
  Indiana     Light, Kavitha Chandrasekar, Stacy Kowalczyk, Robert Ping

              James Myers-Co-PI, Ram Prasanna Govind Krishnan, Lindsay Todd
Rensselaear

              Praveen Kumar-Co-PI, Md Aktaruzzaman, Terry McLaren (NCSA), Rob
  Illinois    Kooper (NCSA), Luigi Marini (NCSA)
SEAD 18 month Pilot Phase
•   Domain Engagement:
     – National Center for Earth Systems Dynamics (NCESD), Illinois River
        Basin Observatory
     – Requirements, Use Cases, Prioritization of Data Types and Services
•   Active and Social Curation
     – Pilot Active Content Repository, VIVO deployments
     – Exemplar services for Data Ingest, Discovery, Re-use, Curation
        (Tupelo/Medici)
•   CI for Long-term Access (Virtual Archive)
     – Data model, protocol design/development
     – Pilot Federated Repository infrastructure
•   Education, Outreach, and Training
     – Post-doc mentoring
     – Web site, training materials, meetings, workshops, …
•   Project Oversight
     – Management, reporting, committees
     – Business model development
Sustainability
        Science

               Science


Cooperation               Technology




  Policy                  Economics

              Poverty &
               Justice




                                       7
Data challenges
•   Heterogeneity of
    all kinds
•   Multiple scales
•   Multidisciplinary
•   Many small
    datasets
The long tail of scientific research


• Small and derived data sets
• Heterogeneous data
• Multiple sources of data
• Short-lived data with long-term
  value
• Value of data grows when combined
  & integrated
SEAD notions of defined Data Phases
• Phases of data lifecycle acknowledge and accommodate
  the difference between public data and data still in work by a
  researcher.
• Research Data Phase: data set is research data
  collection, owned by individual and under their control.
   – Data need not be licensed at this time because it is not
     ready for broader release
   – Data need not have permanent IDs because still work in
     progress
   – Corresponds to first existence in Active Curation Repository
• Published Phase: Owner of research data collection
  determines that dataset is ready for publication
   – License terms set
   – Persistent ID
   – Made available as part of public profile in VIVO
   – Activated by user-controlled publish event
SEAD CI Technical Approach
       Active and Social Curation                                       OAIS Repository Federation
                                               Curation Boundary
                                                                       Automated Curation
      Data              Metadata                                         Workflow/Rule
  Acquisition, A                                                            Engine
                       Management
   nalysis and                                                               Operates on
   Simulation                  DDI3.
                                                                       Metadata, Content Objects       Scholarly
                      METS, PREMIS, MODS, DC
                        , SensorML, OGC, …                                and Trigger Events         Communication

                                           Ingest scripts:
                                                                                          Ingest, AIPs
                              Appraisal   fixity, integrity,                        Compound Objects - OAI-ORE
                                 and CI Technical Approach
                                          authentication,
   VIVO/
Linked Data     Active        Selection   transformation                     Digital Repository Federation
               Content                                                         (OAIS compliant)
                                                                                                          Preservation
              Repository                                                                                    Actions
                                                                                       Dissemination Packages

                                         Wide-Area File System

  Search, Browse                                                       Migration
          ,                                                               and                  Access Mechanisms and
  Annotation, Vis     Use, Reuse, Rep
                                                                       Emulation                E-Scholarship Services
  ualization Tools    urposing Tools              Contributor   User
                                                                         Tools
SEAD CI Technical Approach
     Active and Social Curation                                       OAIS Repository Federation
                                             Curation Boundary
                                                                     Automated Curation
    Data      Metadata                                                 Workflow/Rule
Acquisition, A                                                            Engine
A            Management
     standardized data
                    model
 nalysis and                                                               Operates on
                             DDI3.                                                                   Scholarly
and federation capability
 Simulation         METS, PREMIS, MODS, DC
                      , SensorML, OGC, …
                                                                     Metadata, Content Objects
                                                                        and Trigger Events         Communication
over OAIS-Standard
                    Ingest scripts:
Institutional Repositories                                                              Ingest, AIPs
                            Appraisal fixity, integrity, aut
                                                                                  Compound Objects - OAI-ORE
                               and CI Technical Approach
                                      hentication, transf
                            Selection       ormation
SEAD Active Data                                                           Digital Repository Federation
                                                                             (OAIS compliant)
    Systems                                                                                             Preservation
                                                                                                          Actions
                                                                                     Dissemination Packages

                                       Wide-Area File System

Search, Browse                                                       Migration
        ,                                                               and                  Access Mechanisms and
Annotation, Vis     Use, Reuse, Rep
                                                                     Emulation                E-Scholarship Services
ualization Tools    urposing Tools              Contributor   User
                                                                       Tools
SEAD CI Technical Approach
       Active and Social Curation                                         OAIS Repository Federation
                                                 Curation Boundary
                     A robust, replicated distributed file
                     system used as a large-scaleCuration
                                            Automated
                                                        backing
                                              Workflow/Rule
      Data               Metadata
  Acquisition, A     store
                        Management
                                                 Engine
   nalysis and                                                                 Operates on
   Simulation                    DDI3.
                                                                         Metadata, Content Objects       Scholarly
                        METS, PREMIS, MODS, DC
                          , SensorML, OGC, …                                and Trigger Events         Communication

                                             Ingest scripts:
                                                                                            Ingest, AIPs
                                Appraisal fixity, integrity, aut
                                                                                      Compound Objects - OAI-ORE
                                   and CI Technical Approach
                                          hentication, transf
   VIVO/
Linked Data     Active          Selection       ormation                       SEAD Trusted
                                                                                Digital Repository Federation
               Content                                                           (OAIS compliant)           Preservation
              Repository                                                                                      Actions
                                                                                         Dissemination Packages

                                           Wide-Area File System

  Search, Browse                                                         Migration
          ,                                                                 and                  Access Mechanisms and
  Annotation, Vis       Use, Reuse, Rep
                                                                         Emulation                E-Scholarship Services
  ualization Tools      urposing Tools              Contributor   User
                                                                           Tools
SEAD CI Technical Approach
       Active and Social Curation                                             OAIS Repository Federation
                                               Curation Boundary
                                                                            Automated Curation
      Data              MetadataAn Active Content               Repository    Workflow/Rule
                                                                                 Engine
  Acquisition, A
   nalysis and
                                based on standard
                       Management                               global IDs and
                                                           Operates on
   Simulation                  DDI3.
                                       semantic web technologies Scholarly
                      METS, PREMIS, MODS, DC
                                                     Metadata, Content Objects
                        , SensorML, OGC, …              and Trigger Events                             Communication
                                      - to collect and integrate
                                   data, metadata, and provenance
                                          Ingest scripts:
                                                                  Ingest, AIPs
                              Appraisal  fixity, integrity,
                                   information from multiple sources.
                                                            Compound Objects - OAI-ORE
                                 and CI Technical Approach
                                          authentication,
   VIVO/
Linked Data     Active        Selection   transformation
                                                      DC:Creator                   SEAD Trusted
                                                                   OPM:wasDerivedFrom
                                                                                    Digital Repository Federation
               Content                                             SWAN:isEvidenceFor…
                                                                                    (OAIS compliant)        Preservation
                                                                                                            Content
              Repository                                                                                      Content
                                                                                                              Actions
                                                                                                               Content
                                         Content                                          Dissemination Packages

                                         Wide-Area File System                           Lustre File
                                                                                          System
  Search, Browse                                                             Migration
          ,                                                                     and              Access Mechanisms and
  Annotation, Vis     Use, Reuse, Rep
                                                                             Emulation            E-Scholarship Services
  ualization Tools    urposing Tools              Contributor   User
                                                                               Tools
SEAD CI Technical Approach
       Active and Social Curation                                        OAIS Repository Federation
                                                Curation Boundary
                                                                        Automated Curation
      Data              SEAD will run a VIVO instance
                        Metadata                                          Workflow/Rule
  Acquisition, A                                                             Engine
   nalysis and
                        and may harvest Linked Data from
                       Management
                                                 Operates on
                                                                       Scholarly
   Simulation           other sources
                                DDI3.
                                           Metadata, Content Objects
                       METS, PREMIS, MODS, DC
                         , SensorML, OGC, …   and Trigger Events     Communication

                                            Ingest scripts:
                                                                                       Ingest, AIPs
                               Appraisal fixity, integrity, aut
                                                                                 Compound Objects - OAI-ORE
                                  and CI Technical Approach
                                         hentication, transf
   VIVO/
Linked Data     Active         Selection       ormation
                                             Digital Repository Federation
               Content VIVO Application: Open (OAIS compliant)
                                                Source                Preservation
              Repositoryfederatable Researcher                           Actions
                                                     Dissemination Packages
                         Information –
                               Wide-Area File System
                         people, papers, projects, center
                         s, fields, etc.
  Search, Browse                                                        Migration
          ,                                                                and           Access Mechanisms and
  Annotation, Vis      Use, Reuse, Rep
                                                                        Emulation         E-Scholarship Services
  ualization Tools     urposing Tools              Contributor   User
                                                                          Tools
SEAD CI Technical Approach
      Active and Social Curation                                        OAIS Repository Federation
                                               Curation Boundary
                                                                       Automated Curation
      Data              Metadata                                         Workflow/Rule
  Acquisition,                                                              Engine
                       Management
  Analysis and                                                               Operates on
   Simulation                  DDI3.
                                                                       Metadata, Content Objects       Scholarly
                      METS, PREMIS, MODS, DC
                        , SensorML, OGC, …                                and Trigger Events         Communication

                                           Ingest scripts:
                              Active and Social Curation AIPs
                              Appraisal fixity, integrity, aut
                                                                        Ingest,
                                                                  Compound Objects - OAI-ORE
   VIVO/
                              Services supporting automated and
                                 and CI Technical Approach
                                        hentication, transf
                              Selection
Linked Data     Active        interactive use use SEAD Trusted
                                              ormation
                              and interactive of SEADSEAD Federation
                                                               of Repository
                                                               Digital
               Content        - leveraging standard (OAIS compliant)
                                                    web                      Preservation
              Repository                                                       Actions
                              application/web service toolkits and
                                                           Dissemination Packages
                              virtual machine infrastructure
                              and virtual machine infrastructure
                                         Wide-Area File System

  Search, Browse                                                       Migration
          ,                                                               and                  Access Mechanisms and
  Annotation, Vis     Use, Reuse, Rep
                                                                       Emulation                E-Scholarship Services
  ualization Tools    urposing Tools              Contributor   User
                                                                         Tools
SEAD CI Technical Approach
     Active and Social Curation                                       OAIS Repository Federation
                                             Curation Boundary
                                                                     Automated Curation
    Data             Metadata                                          Workflow/Rule
Acquisition, A                                                            Engine
                    Management
 nalysis and                                                               Operates on
 Simulation                  DDI3.
                                                                     Metadata, Content Objects       Scholarly
                    METS, PREMIS, MODS, DC
                      , SensorML, OGC, …                                and Trigger Events         Communication

                                         Ingest scripts:
                                                                                        Ingest, AIPs
                            Appraisal fixity, integrity, aut
                                                                                  Compound Objects - OAI-ORE
                               and CI Technical Approach
                                      hentication, transf
                            Selection       ormation                       SEAD Trusted
  Active Content and Preservation Services Repository Federation
    Curation                           Digital also
   Repository                          (OAIS compliant)    Preservation
     leveraging standard web application/web                  Actions

     service toolkits and virtual machine Dissemination Packages

     infrastructure Wide-Area File System
Search, Browse                                                       Migration
        ,                                                               and                  Access Mechanisms and
Annotation, Vis     Use, Reuse, Rep
                                                                     Emulation                E-Scholarship Services
ualization Tools    urposing Tools              Contributor   User
                                                                       Tools
Active and Social Curation                                        OAIS Repository Federation
                                               Curation Boundary
                                                                       Automated Curation
      Data              Metadata                                         Workflow/Rule
  Acquisition,                                                              Engine
                       Management
  Analysis and                                                               Operates on
   Simulation                  DDI3.
                                                                       Metadata, Content Objects       Scholarly
                      METS, PREMIS, MODS, DC
                        , SensorML, OGC, …                                and Trigger Events         Communication

                                           Ingest scripts:
                                                                                          Ingest, AIPs
                              Appraisal fixity, integrity, aut
                                                                                    Compound Objects - OAI-ORE
                                 and    hentication, transf
   VIVO/
Linked Data     Active        Selection       ormation                       Digital Repository Federation
               Content                                                         (OAIS compliant)
                                                                                                          Preservation
              Repository                                                                                    Actions
                                                                                       Dissemination Packages

                                         Wide-Area File System

  Search, Browse                                                       Migration
          ,                                                               and                  Access Mechanisms and
  Annotation, Vis     Use, Reuse, Rep
                                                                       Emulation                E-Scholarship Services
  ualization Tools    urposing Tools              Contributor   User
                                                                         Tools
SEAD Active/Social Curation
        Repository
SEAD VIVO: RIS2N3
SEAD Virtual Archive
Faceted search(Solr-based)




                         Facets
Search Result
A dataset or file looks like this
Geospatial search(from Postgres
             index)
Geospatial search results
Login for data upload
Upload file




Files from Medici can also be added
Create collection (can have
       multiple files)
Upload complete
Data ingested to DSpace
  (Mississippi example)
SEAD Virtual Archive Architecture

                                     VIVO
                                                               IU                                          IR
                                                           DataCite                                     DSpace
                                    server
                                                           ID Server

                                                                                Store data object, its metadata object, and
                                                                                its relationship record (latter as RDF) in IR
                                                                                as a collection



                                       Register            Obtain
                                       DOI with           DOI from              register metadata to SOLR and PostgreS
                                        VIVO              DataCite              for rapid retrieval of metadata
            SIP
          (Data+                                                                                                     Core
         Metadata)                                                                                    Solr         Property+
                        Data
 SEAD                Validation
                                       Feature               SIP         AIP                         Index          Domain
Ingest                                Extraction         breakdown                                                 Metadata
                     (Fixity, Vir                                        +
                                      from Data
Client                   us)                                            Data
  /UI      Ack
                                                                                                                Geospatial
                                                                                                   PostgreS     +Temporal
                                    Preservation Metadata Generation (Events)                      QL Index     Metadata
Key Questions for SEAD
          Prototype
• What could SEAD capture when?
• How can SEAD provide direct
  value to data
  producers, users, and curators?
• How can web 2.0/3.0 and social
  computing lower barriers and
  reduce/realign costs?
Towards A Shared Data Future
                          Data                                                  User functionalities, data
                                                   Users                        capture & transfer, virtual
                        Generators                                              research environments
        Data Curation




                                                                                Data discovery &
                                                                                navigation, workflow
                          Community Support Services
Trust




                                                                                generation, annotation, interpre
                                                                                tability




                                                                                Persistent
                                                                                storage, identification, authentic
                             Common Data Services                               ity (provenance), workflow
                                                                                execution, data mining


                                          Source: EU HLEG Report on Data Deluge: Riding the Wave, pg 31, 2010
Data Interoperability
• NSF OCI: DataNet and INTEROP now
  DIBBs
• EUDAT
• Data Web Forum
• IETF Research Data Identifier BOF
• Upcoming Oct. US Meeting of
  DataNet, INTEROP, Data Web Forum
Acknowledgements
SEAD is funded by the National Science
Foundation under cooperative agreement
#OCI0940824

• For more on SEAD go to:
• http://sead-data.net

• Follow us on Twitter
  @SEADdatanet



                            http://sead-data.net
License terms
•   Please cite as: McDonald, R.H. et. al. Building a Data Discovery Network for
    Sustainability Science. 3rd International VIVO Conference, Miami, FL, 24
    August 2012. Available from: [http://slidesha.re/Q9q8VW]

•   Thanks to Margaret Hedstrom, who’s guided the team through the (really)
    lengthy review process and to Jim Myers, Beth Plale, Praveen Kumar, Terry
    McLaren, Luigi Marini, Kavitha Chandrasekar and others who provided
    content for this presentation.
•   The concepts and software being leveraged in SEAD represent the work of a
    broad range of people over multiple years – their contributions have been
    critical to launching SEAD.
•   Items indicated with a © are under copyright and used here with permission.
    Such items may not be reused without permission from the holder of copyright
    except where license terms noted on a slide permit reuse.
•   This document is released under the Creative Commons Attribution 3.0
    Unported license (http://creativecommons.org/licenses/by/3.0/). This license
    includes the following terms: You are free to share – to copy, distribute and
    transmit the work and to remix – to adapt the work under the following
    conditions: attribution – you must attribute the work in the manner specified
    by the author or licensor (but not in any way that suggests that they endorse
    you or your use of the work). For any reuse or distribution, you must make clear
    to others the license terms of this work.

More Related Content

What's hot

ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsSEAD
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationSEAD
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?Jian Qin
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013ECNOfficer
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...SEAD
 
Martin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineMartin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineFuture Perfect 2012
 
From policy to practice with DMP Online
From policy to practice with DMP OnlineFrom policy to practice with DMP Online
From policy to practice with DMP OnlineSarah Jones
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public versionjhudms
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12ASIS&T
 
Needs for Data Management & Citation Throughout the Information Lifecycle
Needs for Data Management & Citation Throughout  the Information LifecycleNeeds for Data Management & Citation Throughout  the Information Lifecycle
Needs for Data Management & Citation Throughout the Information LifecycleMicah Altman
 
Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012 Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012 Jian Qin
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Managementslabrams
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...University of California Curation Center
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreASIS&T
 

What's hot (20)

ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object Preservation
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data ServicesNISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
 
Wheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation RelayWheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation Relay
 
Martin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineMartin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP Online
 
From policy to practice with DMP Online
From policy to practice with DMP OnlineFrom policy to practice with DMP Online
From policy to practice with DMP Online
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public version
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12
 
Needs for Data Management & Citation Throughout the Information Lifecycle
Needs for Data Management & Citation Throughout  the Information LifecycleNeeds for Data Management & Citation Throughout  the Information Lifecycle
Needs for Data Management & Citation Throughout the Information Lifecycle
 
Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012 Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Management
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan moore
 
Or 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-researchOr 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-research
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 

Similar to Building a Data Discovery Network for Sustainability Science

CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...SEAD
 
Anthony J brookes
Anthony J brookesAnthony J brookes
Anthony J brookesEduserv
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...i_scienceEU
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrGrant Ingersoll
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrGrant Ingersoll
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Calpont Corporation
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsGDi Techno Solutions
 
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip finalDeborah McGuinness
 
Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesKerstin Forsberg
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
 
Metadata and Taxonomies for More Flexible Information Architecture
Metadata and Taxonomies for More Flexible Information Architecture Metadata and Taxonomies for More Flexible Information Architecture
Metadata and Taxonomies for More Flexible Information Architecture jrhowe
 
Semantic Web powering Enterprise and Web Applications
Semantic Web powering Enterprise and Web ApplicationsSemantic Web powering Enterprise and Web Applications
Semantic Web powering Enterprise and Web ApplicationsAmit Sheth
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
Mending the Gap between Library's Electronic and Print Collections in ILS and...
Mending the Gap between Library's Electronic and Print Collections in ILS and...Mending the Gap between Library's Electronic and Print Collections in ILS and...
Mending the Gap between Library's Electronic and Print Collections in ILS and...New York University
 

Similar to Building a Data Discovery Network for Sustainability Science (20)

CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
 
Data mining
Data miningData mining
Data mining
 
Saadallah vtls
Saadallah vtlsSaadallah vtls
Saadallah vtls
 
Anthony J brookes
Anthony J brookesAnthony J brookes
Anthony J brookes
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
LRMI in Context, Brandt Redd
LRMI in Context, Brandt ReddLRMI in Context, Brandt Redd
LRMI in Context, Brandt Redd
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
Prototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional RepositoryPrototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional Repository
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno Solutions
 
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
 
Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiences
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...
 
Metadata and Taxonomies for More Flexible Information Architecture
Metadata and Taxonomies for More Flexible Information Architecture Metadata and Taxonomies for More Flexible Information Architecture
Metadata and Taxonomies for More Flexible Information Architecture
 
Semantic Web powering Enterprise and Web Applications
Semantic Web powering Enterprise and Web ApplicationsSemantic Web powering Enterprise and Web Applications
Semantic Web powering Enterprise and Web Applications
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
Mending the Gap between Library's Electronic and Print Collections in ILS and...
Mending the Gap between Library's Electronic and Print Collections in ILS and...Mending the Gap between Library's Electronic and Print Collections in ILS and...
Mending the Gap between Library's Electronic and Print Collections in ILS and...
 

More from Robert H. McDonald

ER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations PanelER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations PanelRobert H. McDonald
 
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...Robert H. McDonald
 
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...Robert H. McDonald
 
JCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening SlidesJCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening SlidesRobert H. McDonald
 
TLT Discussion on "Saving My Stuff" - 06.05.15
TLT Discussion on "Saving My Stuff" - 06.05.15TLT Discussion on "Saving My Stuff" - 06.05.15
TLT Discussion on "Saving My Stuff" - 06.05.15Robert H. McDonald
 
The HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesThe HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesRobert H. McDonald
 
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterElephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterRobert H. McDonald
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Robert H. McDonald
 
ER&L 2015 Closing Keynote Slides
ER&L 2015 Closing Keynote SlidesER&L 2015 Closing Keynote Slides
ER&L 2015 Closing Keynote SlidesRobert H. McDonald
 
HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14Robert H. McDonald
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkRobert H. McDonald
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsRobert H. McDonald
 
Kuali OLE: Enabling Choices for Libraries
Kuali OLE: Enabling Choices for LibrariesKuali OLE: Enabling Choices for Libraries
Kuali OLE: Enabling Choices for LibrariesRobert H. McDonald
 
Charleston Seminar Being Earnest with our Collections - Legacy to Cloud
Charleston Seminar Being Earnest with our Collections - Legacy to CloudCharleston Seminar Being Earnest with our Collections - Legacy to Cloud
Charleston Seminar Being Earnest with our Collections - Legacy to CloudRobert H. McDonald
 
The HathiTrust Research Center (HTRC): An Overview and Demo
The HathiTrust Research Center (HTRC): An Overview and DemoThe HathiTrust Research Center (HTRC): An Overview and Demo
The HathiTrust Research Center (HTRC): An Overview and DemoRobert H. McDonald
 
New Perspectives for Business Intelligence: Library and Research Technologies...
New Perspectives for Business Intelligence: Library and Research Technologies...New Perspectives for Business Intelligence: Library and Research Technologies...
New Perspectives for Business Intelligence: Library and Research Technologies...Robert H. McDonald
 
Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...
Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...
Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...Robert H. McDonald
 
GOKb & KB+: An International Partnership to leverage Open Access and Communit...
GOKb & KB+: An International Partnership to leverage Open Access and Communit...GOKb & KB+: An International Partnership to leverage Open Access and Communit...
GOKb & KB+: An International Partnership to leverage Open Access and Communit...Robert H. McDonald
 

More from Robert H. McDonald (20)

ER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations PanelER&L The Role of Choice in the Future of Discovery Evaluations Panel
ER&L The Role of Choice in the Future of Discovery Evaluations Panel
 
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infras...
 
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
 
JCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening SlidesJCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening Slides
 
TLT Discussion on "Saving My Stuff" - 06.05.15
TLT Discussion on "Saving My Stuff" - 06.05.15TLT Discussion on "Saving My Stuff" - 06.05.15
TLT Discussion on "Saving My Stuff" - 06.05.15
 
The HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesThe HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational Services
 
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterElephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
 
ER&L 2015 Closing Keynote Slides
ER&L 2015 Closing Keynote SlidesER&L 2015 Closing Keynote Slides
ER&L 2015 Closing Keynote Slides
 
HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your Patrons
 
Kuali OLE: Enabling Choices for Libraries
Kuali OLE: Enabling Choices for LibrariesKuali OLE: Enabling Choices for Libraries
Kuali OLE: Enabling Choices for Libraries
 
Charleston Seminar Being Earnest with our Collections - Legacy to Cloud
Charleston Seminar Being Earnest with our Collections - Legacy to CloudCharleston Seminar Being Earnest with our Collections - Legacy to Cloud
Charleston Seminar Being Earnest with our Collections - Legacy to Cloud
 
The HathiTrust Research Center (HTRC): An Overview and Demo
The HathiTrust Research Center (HTRC): An Overview and DemoThe HathiTrust Research Center (HTRC): An Overview and Demo
The HathiTrust Research Center (HTRC): An Overview and Demo
 
SCONUL Kuali OLE Briefing
SCONUL Kuali OLE BriefingSCONUL Kuali OLE Briefing
SCONUL Kuali OLE Briefing
 
New Perspectives for Business Intelligence: Library and Research Technologies...
New Perspectives for Business Intelligence: Library and Research Technologies...New Perspectives for Business Intelligence: Library and Research Technologies...
New Perspectives for Business Intelligence: Library and Research Technologies...
 
Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...
Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...
Kuali OLE: Deep Library Collaboration and the Release of a Community-Sourced ...
 
GOKb & KB+: An International Partnership to leverage Open Access and Communit...
GOKb & KB+: An International Partnership to leverage Open Access and Communit...GOKb & KB+: An International Partnership to leverage Open Access and Communit...
GOKb & KB+: An International Partnership to leverage Open Access and Communit...
 
Kuali OLE @ LITA Forum 2012
Kuali OLE @ LITA Forum 2012Kuali OLE @ LITA Forum 2012
Kuali OLE @ LITA Forum 2012
 

Recently uploaded

MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxMusic 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxleah joy valeriano
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 

Recently uploaded (20)

LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxMusic 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 

Building a Data Discovery Network for Sustainability Science

  • 1. Building a Data Discovery Network for Sustainability Science Robert H. McDonald Deputy Director Data to Insight (D2I) Center Associate Dean – IU Libraries Indiana University rhmcdona@indiana.edu | @mcdonald @SEADdatanet Presented at the VIVO 2012 Conference Miami, FL– August 24, 2012 http://slidesha.re/Q9q8VW Available from: http://slidesha.re/Q9q8VW © Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide.
  • 2. NSF DataNet Program Motivation: “… one of the major challenges of this scientific generation: how to develop the new methods, management structures and technologies to manage the diversity, size, and complexity of current and future data sets and data streams.” Response: DataNet creates “a set of exemplar national and global data research infrastructure organizations” to address this challenge.
  • 3. Current NSF DataNet Projects • SEAD – http://sead-data.net • DataOne – http://www.dataone.org • DataNet Federation Consortium – http://datafed.org • Terra Populous – https://www.pop.umn.edu/terra_pop
  • 4. Sustainable Environment Actionable Data (SEAD) - DataNet • SEAD Strategy SEAD Partners - http://sead-data.net ― Serve scientists and researchers in the “long tail” of science ― Leverage social media for discovery of data, interest, and expertise ― Move data curation upstream in the data life cycle of science ― Take advantage of existing domain and institutional infrastructures (Institutional Repositories, ICPSR) for long-term preservation
  • 5. SEAD TEAMS Margaret Hedstrom-PI, Ann Zimmerman-Co-PI, Karen Michigan Woollams, George Alter (ICPSR), Bryan Beecher (ICPSR), Jude Yew Beth Plale-Co-PI, Katy Börner, Robert H. McDonald, Robert Indiana Light, Kavitha Chandrasekar, Stacy Kowalczyk, Robert Ping James Myers-Co-PI, Ram Prasanna Govind Krishnan, Lindsay Todd Rensselaear Praveen Kumar-Co-PI, Md Aktaruzzaman, Terry McLaren (NCSA), Rob Illinois Kooper (NCSA), Luigi Marini (NCSA)
  • 6. SEAD 18 month Pilot Phase • Domain Engagement: – National Center for Earth Systems Dynamics (NCESD), Illinois River Basin Observatory – Requirements, Use Cases, Prioritization of Data Types and Services • Active and Social Curation – Pilot Active Content Repository, VIVO deployments – Exemplar services for Data Ingest, Discovery, Re-use, Curation (Tupelo/Medici) • CI for Long-term Access (Virtual Archive) – Data model, protocol design/development – Pilot Federated Repository infrastructure • Education, Outreach, and Training – Post-doc mentoring – Web site, training materials, meetings, workshops, … • Project Oversight – Management, reporting, committees – Business model development
  • 7. Sustainability Science Science Cooperation Technology Policy Economics Poverty & Justice 7
  • 8. Data challenges • Heterogeneity of all kinds • Multiple scales • Multidisciplinary • Many small datasets
  • 9. The long tail of scientific research • Small and derived data sets • Heterogeneous data • Multiple sources of data • Short-lived data with long-term value • Value of data grows when combined & integrated
  • 10.
  • 11. SEAD notions of defined Data Phases • Phases of data lifecycle acknowledge and accommodate the difference between public data and data still in work by a researcher. • Research Data Phase: data set is research data collection, owned by individual and under their control. – Data need not be licensed at this time because it is not ready for broader release – Data need not have permanent IDs because still work in progress – Corresponds to first existence in Active Curation Repository • Published Phase: Owner of research data collection determines that dataset is ready for publication – License terms set – Persistent ID – Made available as part of public profile in VIVO – Activated by user-controlled publish event
  • 12. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data Metadata Workflow/Rule Acquisition, A Engine Management nalysis and Operates on Simulation DDI3. Metadata, Content Objects Scholarly METS, PREMIS, MODS, DC , SensorML, OGC, … and Trigger Events Communication Ingest scripts: Ingest, AIPs Appraisal fixity, integrity, Compound Objects - OAI-ORE and CI Technical Approach authentication, VIVO/ Linked Data Active Selection transformation Digital Repository Federation Content (OAIS compliant) Preservation Repository Actions Dissemination Packages Wide-Area File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 13. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data Metadata Workflow/Rule Acquisition, A Engine A Management standardized data model nalysis and Operates on DDI3. Scholarly and federation capability Simulation METS, PREMIS, MODS, DC , SensorML, OGC, … Metadata, Content Objects and Trigger Events Communication over OAIS-Standard Ingest scripts: Institutional Repositories Ingest, AIPs Appraisal fixity, integrity, aut Compound Objects - OAI-ORE and CI Technical Approach hentication, transf Selection ormation SEAD Active Data Digital Repository Federation (OAIS compliant) Systems Preservation Actions Dissemination Packages Wide-Area File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 14.
  • 15. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary A robust, replicated distributed file system used as a large-scaleCuration Automated backing Workflow/Rule Data Metadata Acquisition, A store Management Engine nalysis and Operates on Simulation DDI3. Metadata, Content Objects Scholarly METS, PREMIS, MODS, DC , SensorML, OGC, … and Trigger Events Communication Ingest scripts: Ingest, AIPs Appraisal fixity, integrity, aut Compound Objects - OAI-ORE and CI Technical Approach hentication, transf VIVO/ Linked Data Active Selection ormation SEAD Trusted Digital Repository Federation Content (OAIS compliant) Preservation Repository Actions Dissemination Packages Wide-Area File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 16. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data MetadataAn Active Content Repository Workflow/Rule Engine Acquisition, A nalysis and based on standard Management global IDs and Operates on Simulation DDI3. semantic web technologies Scholarly METS, PREMIS, MODS, DC Metadata, Content Objects , SensorML, OGC, … and Trigger Events Communication - to collect and integrate data, metadata, and provenance Ingest scripts: Ingest, AIPs Appraisal fixity, integrity, information from multiple sources. Compound Objects - OAI-ORE and CI Technical Approach authentication, VIVO/ Linked Data Active Selection transformation DC:Creator SEAD Trusted OPM:wasDerivedFrom Digital Repository Federation Content SWAN:isEvidenceFor… (OAIS compliant) Preservation Content Repository Content Actions Content Content Dissemination Packages Wide-Area File System Lustre File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 17.
  • 18. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data SEAD will run a VIVO instance Metadata Workflow/Rule Acquisition, A Engine nalysis and and may harvest Linked Data from Management Operates on Scholarly Simulation other sources DDI3. Metadata, Content Objects METS, PREMIS, MODS, DC , SensorML, OGC, … and Trigger Events Communication Ingest scripts: Ingest, AIPs Appraisal fixity, integrity, aut Compound Objects - OAI-ORE and CI Technical Approach hentication, transf VIVO/ Linked Data Active Selection ormation Digital Repository Federation Content VIVO Application: Open (OAIS compliant) Source Preservation Repositoryfederatable Researcher Actions Dissemination Packages Information – Wide-Area File System people, papers, projects, center s, fields, etc. Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 19.
  • 20. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data Metadata Workflow/Rule Acquisition, Engine Management Analysis and Operates on Simulation DDI3. Metadata, Content Objects Scholarly METS, PREMIS, MODS, DC , SensorML, OGC, … and Trigger Events Communication Ingest scripts: Active and Social Curation AIPs Appraisal fixity, integrity, aut Ingest, Compound Objects - OAI-ORE VIVO/ Services supporting automated and and CI Technical Approach hentication, transf Selection Linked Data Active interactive use use SEAD Trusted ormation and interactive of SEADSEAD Federation of Repository Digital Content - leveraging standard (OAIS compliant) web Preservation Repository Actions application/web service toolkits and Dissemination Packages virtual machine infrastructure and virtual machine infrastructure Wide-Area File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 21. SEAD CI Technical Approach Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data Metadata Workflow/Rule Acquisition, A Engine Management nalysis and Operates on Simulation DDI3. Metadata, Content Objects Scholarly METS, PREMIS, MODS, DC , SensorML, OGC, … and Trigger Events Communication Ingest scripts: Ingest, AIPs Appraisal fixity, integrity, aut Compound Objects - OAI-ORE and CI Technical Approach hentication, transf Selection ormation SEAD Trusted Active Content and Preservation Services Repository Federation Curation Digital also Repository (OAIS compliant) Preservation leveraging standard web application/web Actions service toolkits and virtual machine Dissemination Packages infrastructure Wide-Area File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 22. Active and Social Curation OAIS Repository Federation Curation Boundary Automated Curation Data Metadata Workflow/Rule Acquisition, Engine Management Analysis and Operates on Simulation DDI3. Metadata, Content Objects Scholarly METS, PREMIS, MODS, DC , SensorML, OGC, … and Trigger Events Communication Ingest scripts: Ingest, AIPs Appraisal fixity, integrity, aut Compound Objects - OAI-ORE and hentication, transf VIVO/ Linked Data Active Selection ormation Digital Repository Federation Content (OAIS compliant) Preservation Repository Actions Dissemination Packages Wide-Area File System Search, Browse Migration , and Access Mechanisms and Annotation, Vis Use, Reuse, Rep Emulation E-Scholarship Services ualization Tools urposing Tools Contributor User Tools
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 41. A dataset or file looks like this
  • 44. Login for data upload
  • 45. Upload file Files from Medici can also be added
  • 46. Create collection (can have multiple files)
  • 48. Data ingested to DSpace (Mississippi example)
  • 49. SEAD Virtual Archive Architecture VIVO IU IR DataCite DSpace server ID Server Store data object, its metadata object, and its relationship record (latter as RDF) in IR as a collection Register Obtain DOI with DOI from register metadata to SOLR and PostgreS VIVO DataCite for rapid retrieval of metadata SIP (Data+ Core Metadata) Solr Property+ Data SEAD Validation Feature SIP AIP Index Domain Ingest Extraction breakdown Metadata (Fixity, Vir + from Data Client us) Data /UI Ack Geospatial PostgreS +Temporal Preservation Metadata Generation (Events) QL Index Metadata
  • 50. Key Questions for SEAD Prototype • What could SEAD capture when? • How can SEAD provide direct value to data producers, users, and curators? • How can web 2.0/3.0 and social computing lower barriers and reduce/realign costs?
  • 51. Towards A Shared Data Future Data User functionalities, data Users capture & transfer, virtual Generators research environments Data Curation Data discovery & navigation, workflow Community Support Services Trust generation, annotation, interpre tability Persistent storage, identification, authentic Common Data Services ity (provenance), workflow execution, data mining Source: EU HLEG Report on Data Deluge: Riding the Wave, pg 31, 2010
  • 52. Data Interoperability • NSF OCI: DataNet and INTEROP now DIBBs • EUDAT • Data Web Forum • IETF Research Data Identifier BOF • Upcoming Oct. US Meeting of DataNet, INTEROP, Data Web Forum
  • 53. Acknowledgements SEAD is funded by the National Science Foundation under cooperative agreement #OCI0940824 • For more on SEAD go to: • http://sead-data.net • Follow us on Twitter @SEADdatanet http://sead-data.net
  • 54. License terms • Please cite as: McDonald, R.H. et. al. Building a Data Discovery Network for Sustainability Science. 3rd International VIVO Conference, Miami, FL, 24 August 2012. Available from: [http://slidesha.re/Q9q8VW] • Thanks to Margaret Hedstrom, who’s guided the team through the (really) lengthy review process and to Jim Myers, Beth Plale, Praveen Kumar, Terry McLaren, Luigi Marini, Kavitha Chandrasekar and others who provided content for this presentation. • The concepts and software being leveraged in SEAD represent the work of a broad range of people over multiple years – their contributions have been critical to launching SEAD. • Items indicated with a © are under copyright and used here with permission. Such items may not be reused without permission from the holder of copyright except where license terms noted on a slide permit reuse. • This document is released under the Creative Commons Attribution 3.0 Unported license (http://creativecommons.org/licenses/by/3.0/). This license includes the following terms: You are free to share – to copy, distribute and transmit the work and to remix – to adapt the work under the following conditions: attribution – you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.

Editor's Notes

  1. A Collection of heterogeneous files. Users can tag and add comments to the entire ‘collection’ and individually tag and comment on the objects in the collection. Note: Extraction services and previewers are all driven by the file MIME type. Extraction services are customizable and are designed to automate derived data products from the file being uploaded. Examples follow…
  2. A Collection of heterogeneous files. Users can tag and add comments to the entire ‘collection’ and individually tag and comment on the objects in the collection. Note: Extraction services and previewers are all driven by the file MIME type. Extraction services are customizable and are designed to automate derived data products from the file being uploaded. Examples follow…
  3. Lidar data saved as .png.The Image extraction service does the following:Creates the thumbnail and preview imageCreates an image pyramid of the image (zoom/pan large images w/o downloading entire image via the SeaDragon webapp )Extract all header information from image file to include: Exif, GPS, Interoperability, etc… Extracted data is view by clicking on the “Extracted Information” section.
  4. A data set saved as a simple ASCII text file.- Users can preview the first 80 lines of the text file.
  5. Preview the contents of .csv files
  6. Simple map image User defined informationImage is part of multiple collectionsImage is tagged
  7. 3 Images (3 clicks)Standard Medici InfoScroll down to show location and annotationThis image file also contained geo location data which become visible in “Location”. Geo-location can be extracted from the image Exif data or authors can add a geo-location to any file in the repository.Note the creator tag and vivo reference.
  8. Tif support - relatively large 71MB fileClicks…Click Zoom to enable SeaDragon to explore the details of the file via zoom and pan with mouse.Click the lower right icon to enable full screen. Use + or – key to zoom (or wheel on mouse), click image and drag to panClick lower right icon to return to embedded window in Medici
  9. Image file that contains GPS data which is extracted by Medici as part of the upload process.
  10. Mpeg file uploads:Extraction service creates a flash version of the file for preview.
  11. PDF files Extraction service generates an image per page of the file. In this case a slide set from a presentation. Click ‘Pages’ to enable the slide set mode and click on the left or right arrows to navigate the pages. 2 images – click to advance slide.
  12. 3D object supportPreviewer provides multiple view options of the object which are accessible from the links above the preview.
  13. .shp files The components of shape file get uploaded to Medici as a zip Medici saves the zip blob and the extraction service registers the contents of the shp file with GeoServerOpenStreetMap displays the contents of the zipLayers are on by default but can be turned by clicking the ‘show’ button.Opacity of layers can be varied using the opacity scale.(WIP) We plan to embed OpenStreetMap in Medici as a previewer for .shp and .kml
  14. All layers off except Illinois Flood Zone map. Map zoomed into the Champaign region of interest.