Policy Based Data Management iRODS - Reagan Moore - RDAP12
Upcoming SlideShare
Loading in...5
×
 

Policy Based Data Management iRODS - Reagan Moore - RDAP12

on

  • 820 views

Policy Based Data Management iRODS

Policy Based Data Management iRODS
Reagan Moore
Presentation at Research Data Access & Preservation Summit
22 March 2012

Statistics

Views

Total Views
820
Views on SlideShare
820
Embed Views
0

Actions

Likes
0
Downloads
10
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Policy Based Data Management iRODS - Reagan Moore - RDAP12 Policy Based Data Management iRODS - Reagan Moore - RDAP12 Presentation Transcript

  • Policy Based Data Management iRODS 1
  • Policy-based Data Environments• Purpose - reason a collection is assembled• Properties - attributes needed to ensure the purpose• Policies - controls for enforcing desired properties,• mapped to computer actionable rules• Procedures - functions that implement the policies• mapped to computer actionable workflows• Persistent state information - results of applying the procedures• mapped to system metadata• Assessment criteria - validation that state information conforms to the desired purpose• mapped to periodically executed policies 2 2
  • Life-Time Library• Student digital libraries – Enable students to build collections of • Photographs • MP3 audio files • Video • Class documents • Web site archive• Resources provided by School of Information and Library Science – Student collections range from 2 GBytes to 150 Gbytes – Number of files from 2000 to 12,000 3
  • Policies• Integrity – Replication – Checksums – Versioning – Strict access controls – Quotas – Metadata catalog replication – Installation environment archiving• Ingestion – Automated synchronization of student directory with LifeTime Library 4
  • Enforcement• Policies enforced by iRODS data grid – Some policies should be implemented before digital library creation • Versioning • Strict access controls • Quotas – Other policies • Replication, when turned on, replicated all initial files • Synchronization, student controlled 5
  • New Policies• Students want to control: – Staging area into which laptop files are synchronized – Automated extraction of metadata – iTunes – Tagging of files – Integrity validation periodicity – Arrangement of files, sorting by type or class – Descriptive metadata – Dublin core or reserved words 6
  • DataNet Federation Consortium Data Driven Science• Implement national data grid – Federate existing discipline-specific data management Project systems to enable national research collaborations Shared Collection• Enable collaborative research on shared data collections Processing Pipeline – Manage collection life cycle as the user community broadens Digital Library• Integrate “live” research data into education initiatives Reference Collection – Enable student research participation through control Federation policies Collection Life Cycle Cyber-infrastructure Partners: Science and Engineering Initiatives: Univ. of North Carolina, Chapel Hill Ocean Observatories Initiative Univ. of California, San Diego the iPlant Collaborative Arizona State University CUAHSI Drexel University CIBER-U Duke University Odum Social Science Institute University of Arizona Temporal Dynamics of Learning University of South Carolina Center Policy-basedNational Science Foundation Cooperative Agreement: OCI-0940841 data management 7
  • iRODS - Open Source Software Reagan W. Moore rwmoore@renci.org http://irods.diceresearch.orgNSF OCI-0940841 “DataNet Federation Consortium”NSF OCI-1032732 “Improvement of iRODS for Multi-Disciplinary Applications”NSF OCI-0848296 “NARA Transcontinental Persistent Archives Prototype”NSF SDCI-0721400 “Data Grids for Community Driven Applications” 8 8
  • Policy-based Data Management Client Each domain implements OOI TDLC unique grid grid policies in their data Federation grid. hub Hydro Odum Policies grid grid control interactions CIBER iPlant beteen grid grid data grids 9
  • Types of Federation• Within a data grid – Assemble distributed data into a policy- controlled collection• Between data grids – Establish policies for sharing between communities• Soft links – Register data from another system into a collaboration environment – Access data using the remote system protocol 10