RDAP14: DataNet Federal Consortium Update
Upcoming SlideShare
Loading in...5
×
 

RDAP14: DataNet Federal Consortium Update

on

  • 299 views

Research Data Access and Preservation Summit, 2014

Research Data Access and Preservation Summit, 2014
San Diego, CA
March 26-28, 2014

Mary Whitton, Project Manager, Datanet Federation Consortium

Statistics

Views

Total Views
299
Views on SlideShare
292
Embed Views
7

Actions

Likes
0
Downloads
6
Comments
0

1 Embed 7

http://www.scoop.it 7

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Talk about what our goals are over this slide.
  • Who are the customers who must be satisfied?
  • Who are the customers who must be satisfied?
  • Two underlying capabilities…If DFC is about federation, then must have interoperability between iRODS grids and with othersIf DFC is to be a part of the long term national cyberinfrastructure for data management, must be sustainableLet me update you on that last one first.
  • Version 4.0 release March 31Merge of iRODS 3.3.1 (DICE/DFC developed) with “enterprise” irods RENCI developedNewer SW engineering best practicesOne click installRigorous testingServer + plug-ins modelNew features added as plug-ins
  • Version 4.0 release March 31One click installRigorous testingServer + plug-ins modelNew features added as plug-ins
  • Data discoveryData access from a workflowData manipulation (parsing of a data format)Data transformation (converting to a new coordinate system)Data transformation (creating new physical variables by combining other variables)Data transformation (converting to new physical units)Data subsetting (extracting  a sub-region)Data registration (GIS co-registration)Data visualizationCreation of derived data productsRodsWiki is a MediaWiki extension that enables MediaWiki file uploads to be stored in iRODS and to allow wiki users to download those files as well as to view and manipulate their metadata. This enables storage for large scientific datasets to leverage the benefits of being stored in iRODS while still seamlessly interacting with standard MediaWiki interfaces.
  • Puzzled when you ask about this….
  • Can iRODSAutomate distributed data sharing across different data management systems? Maintain control of different data sets across different storage systems? Allow and automate a range of data services without involving system administrators? YES.

RDAP14: DataNet Federal Consortium Update RDAP14: DataNet Federal Consortium Update Presentation Transcript

  • National Science Foundation Cooperative Agreement: OCI-0940841 Update March 2014 Mary Whitton, Project Manager Reagan Moore, PI
  • Who is DFC ?
  • What does DFC do? • Federate to enable collaboration – Federation of iRODS-based data grids – Interoperability for federation with other systems • Enable reproducible science – Workflows as first class data objects; provenance • Build on policy-based data system (iRODS) – Best practices in curation, archiving – Automated data grid administration functions
  • Production Ready
  • Production Ready Data Producers Data Users Curators, Archivists Data Center Managers
  • Production Ready Data Producers Data Users Curators, Archivists Data Center ManagersSustain- ability Interoper- ability
  • SW Quality & User Community Version 4.0 release March 31 Sustainability
  • SW Quality & User Community Sustainability iRODS User Meeting June 18-19, 2014 Boston
  • Federating across systems Interoperability DataONE member node looks like a another iRODS grid to iRODS user Capability has been demonstrated DataONE Member Node iRODS Federated Grid iRODS Data Grids Interface (via APIs) to DataONE Cloud Storage
  • Federating across systems Interoperability DataONE Member Node iRODS Grid DataONE Coordinating Node Interface (via APIs) to DataONE iRODS grid looks like a DataONE member node to a DataONE user Work is underway
  • Federating across systems Interoperability iRODS Grid DataVerse Network DataVerseDataVerse Work is underway
  • What our users want • Data discovery • Data access from a workflow • Data manipulation (parsing of a data format) • Data transformation – converting to a new coordinate system) – creating new physical variables by combining other variables – converting to new physical units • Data subsetting (extracting a sub-region) • Data registration (GIS co-registration) • Data visualization • Creation of derived data products Data Users
  • Current work: client side tools • Ingest-MediaWiki, iDropWeb – Metadata templating, bulk uploads – Database and indexing: plug-in in V. 4.0 • Access control – Access for user defined “group” (my team) • Integrated access to analysis tools • Interfaces: Jargon, message-passing IF framework Data Producers & Users
  • Standards and Policies Curators, Archivists • Community practices and policies – Unwritten, non-existent • Developing international standards • Implementation in iRODS server • Future: tools to make writing rules easier
  • Repository management tools • Best practices embodied in iRODS rules and policies – Trustworthy repository • Automatic execution – Copy, backup, checksum – Triggers: time, event Data Center Managers • Tools for grid administrators
  • Production Ready Data Producers Data Users Curators, Archivists Data Center ManagersSustain- ability Interoper- ability
  • National Science Foundation Cooperative Agreement: OCI-0940841 www.datafed.org www.irods.org