SlideShare a Scribd company logo
1 of 13
Download to read offline
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
1!
Academic Workflow for
Research Repositories
Using iRODS and Object
Storage
2016 iRODS User’s Group Meeting
9 June 2016 Randall Splinter, Ph.D.
HPC Research Computing Solutions Architect
RSplinter@ddn.com
770.633.2994
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
2! Agenda
▶  Introduction to the Problem
•  HPC Workflows
•  The Problem of Long Term Archiving
▶  Object Store to the Rescue
▶  How iRODS Enables Object Storage
▶  Why iRODS with DDN WOS is a Superior
Solution for Research Repositories
▶  A Case Study
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
3!
Introduction to the Problem
▶  The Problem of Collaboration
•  NAS technologies (NFS, CIFS) are local
o  They don’t tend to scale well over WAN distances
– But collaborators are frequently widely separated in
distance
o  How to enable researchers to share data without
administrative overhead – securely
– Typically, only system administrators can control the
ACLs on NFS/CIFS mounts
•  FTP
o  Security
•  Tape
o  Yech
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
4!
HPC Workflows
▶  Ingest data from a source (Analysis of data)
•  Pre-analysis on low-end storage
•  Move cleaned up data to a PFS and compute
•  After full analysis data is moved to long term storage
o  Can this be automated? – Yes, with iRODS.
▶  No ingest (Pure simulation)
•  Compute models are run on a compute cluster with PFS
•  After full analysis data is moved to long term storage
o  Again automation is key.
▶  Data sets are exploding!
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
5!
The Problem of Long Term Archiving
▶  Data must be secure
•  From deletion (accident or deliberate)
•  Loss from theft
o  Security hacks
o  Faculty or students leave and take IP with them
▶  Must satisfy regulatory restrictions
•  HIPAA, for instance
▶  Changing hardware standards
•  In particular tape standards
▶  Hardware availability
•  Spinning media is mechanical and will not last forever
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
6!
Object Store to the Rescue
▶  All Object stores provide a way to replicate data over
large distances
•  Some more effectively than others
o  Provides a way to effectively share data over WAN scales
•  Most object stores were designed for cloud storage
o  Security has always been important
o  Ease of data sharing has been important
▶  This now enables more effective data sharing and data
security than with traditional storage solutions at price
points that traditional NAS systems cannot approach.
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
7!
How iRODS Enables Object Storage
▶  iRODS is a very effective middleware layer for
accessing multiple storage resources
▶  iRODS handles the security and database
management of the ingested data
▶  Provides a powerful metadata search capability
for ingested data
▶  Provides a rules engine for the processing of
incoming data and the manipulation of data on
the back-end
•  For instance,
o  Data can be moved to slower storage resources as they age or
another criteria is met (Essentially HSM)
o  Data can be secured from removal, editing or modification based
upon criteria using the “null” chmod – Retention policies!
o  Anything else?
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
8!
DDN WOS: Key Features
Fully-Integrated Object
Storage Appliance
WOS7000, 60 Drives in 4U,
with 1 or 2 object storage
servers per appliance
Federated, Global Namespace
Locally or across multiple geographies
with smart policies for performance and/or
disaster recovery on a per-object basis
Pure Object Storage
Formats drives with custom WOS disk file
system, no Linux file I/Os, no fragmentation,
fully contiguous object read and write
operations for maximum disk efficiency
Latency-Aware Access Manager
WOS intelligently makes decisions on
the best geographies to get from based
upon location access load and latency
User Defined Metadata and
Metadata Search
Applications can assign their own metadata via
object storage API, WOS now also supports
parallel search of WOS user metadata
Self-healing Architecture
No hard tie between physical disks and data.
Failed drives are recovered through dispersed
data placement – rebuilds happen at read, not
write, speed – rebuilding only data.
Flexible Data Protection
Select multiple policy-driven data protection
schemes to meet application, workflow and
disaster recovery requirements
Exabyte Scalability
Create virtually limitless data repositories, non-disruptively
seamlessly scale to over an exabyte of capacity
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
9!
Why iRODS with DDN WOS is a Superior
Solution for Repositories
▶  Ease of Scalability with WOS
▶  Ease of administration – Once rules are tested and in
place the system can be managed with a minimum of
administrative overhead
▶  Automating workflows to guarantee consistency and
reproducibility in the science that is produced
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
10!
Why iRODS with DDN WOS is a Superior
Solution for Repositories
▶  Ease of auditing for both usage and back charging and
for maintaining adequate data security compliance
▶  DDN WOS makes remote replication simple and
provides a straightforward way to manage DR systems
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
11!
Why iRODS with WOS is a Superior
Solution for Repositories
▶  Central to any repository is the ability to add metadata
tags and search metadata.
•  iRODS has extensive abilities to do that – Significantly
better than any competing options
# imeta add –d filename “Date” “2 Feb 2016”
# imeta ls –d filename
AVUs defined for dataObj filename:
attribute: Meta1
value: hello
units:
---
attribute: Date
value: 2 Feb 2016
units:
# imeta rm –d filename “Meta1” “hello”
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
12!
A Case Study
Hrothgar
Compute
Cluster
Lustre
Filesystem
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
13!
9351 Deering Avenue
Chatsworth, CA 91311
1.800.837.2298
1.818.700.4000
company/datadirect-networks
@ddn_limitless
sales@ddn.com
Thank You!
Keep in touch with us
Questions?
.

More Related Content

What's hot

Spinning Brown Donuts
Spinning Brown DonutsSpinning Brown Donuts
Spinning Brown DonutsDavid Pechon
 
Webinar: The Four Requirements of a Cloud-Era File System
Webinar: The Four Requirements of a Cloud-Era File SystemWebinar: The Four Requirements of a Cloud-Era File System
Webinar: The Four Requirements of a Cloud-Era File SystemStorage Switzerland
 
Long Live Posix - HPC Storage and the HPC Datacenter
Long Live Posix - HPC Storage and the HPC DatacenterLong Live Posix - HPC Storage and the HPC Datacenter
Long Live Posix - HPC Storage and the HPC Datacenterinside-BigData.com
 
White paper whitewater-datastorageinthecloud
White paper whitewater-datastorageinthecloudWhite paper whitewater-datastorageinthecloud
White paper whitewater-datastorageinthecloudAccenture
 
مشروع قواعد البيانات
مشروع قواعد البيانات مشروع قواعد البيانات
مشروع قواعد البيانات Safiya Najeh
 
Insiders Guide- Managing Storage Performance
Insiders Guide- Managing Storage PerformanceInsiders Guide- Managing Storage Performance
Insiders Guide- Managing Storage PerformanceDataCore Software
 
Scality presentation cloud Computing Expo NY 2012 v1.0
Scality presentation cloud Computing Expo NY 2012 v1.0Scality presentation cloud Computing Expo NY 2012 v1.0
Scality presentation cloud Computing Expo NY 2012 v1.0Marc Villemade
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT
 
Lecture 05 - The Data Warehouse and Technology
Lecture 05 - The Data Warehouse and TechnologyLecture 05 - The Data Warehouse and Technology
Lecture 05 - The Data Warehouse and Technologyphanleson
 
Dave Debre - Backup Options
Dave Debre - Backup OptionsDave Debre - Backup Options
Dave Debre - Backup Optionsasyma
 
IBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTION
IBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTIONIBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTION
IBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTIONIBM India Smarter Computing
 
Development_data_standards_data_integration_tools
Development_data_standards_data_integration_toolsDevelopment_data_standards_data_integration_tools
Development_data_standards_data_integration_toolsRafael Romero
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousinguncleRhyme
 
Webinar: SDS is Broken - And How to Fix it
Webinar: SDS is Broken - And How to Fix itWebinar: SDS is Broken - And How to Fix it
Webinar: SDS is Broken - And How to Fix itStorage Switzerland
 
Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012
Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012
Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012Marc Villemade
 

What's hot (20)

Tandberg Data - Data Protection Solutions Guide
Tandberg Data  - Data Protection Solutions GuideTandberg Data  - Data Protection Solutions Guide
Tandberg Data - Data Protection Solutions Guide
 
Spinning Brown Donuts
Spinning Brown DonutsSpinning Brown Donuts
Spinning Brown Donuts
 
Raid(Storage Technology)
Raid(Storage Technology)Raid(Storage Technology)
Raid(Storage Technology)
 
Webinar: The Four Requirements of a Cloud-Era File System
Webinar: The Four Requirements of a Cloud-Era File SystemWebinar: The Four Requirements of a Cloud-Era File System
Webinar: The Four Requirements of a Cloud-Era File System
 
Long Live Posix - HPC Storage and the HPC Datacenter
Long Live Posix - HPC Storage and the HPC DatacenterLong Live Posix - HPC Storage and the HPC Datacenter
Long Live Posix - HPC Storage and the HPC Datacenter
 
White paper whitewater-datastorageinthecloud
White paper whitewater-datastorageinthecloudWhite paper whitewater-datastorageinthecloud
White paper whitewater-datastorageinthecloud
 
مشروع قواعد البيانات
مشروع قواعد البيانات مشروع قواعد البيانات
مشروع قواعد البيانات
 
Insiders Guide- Managing Storage Performance
Insiders Guide- Managing Storage PerformanceInsiders Guide- Managing Storage Performance
Insiders Guide- Managing Storage Performance
 
Presentation
PresentationPresentation
Presentation
 
Scality presentation cloud Computing Expo NY 2012 v1.0
Scality presentation cloud Computing Expo NY 2012 v1.0Scality presentation cloud Computing Expo NY 2012 v1.0
Scality presentation cloud Computing Expo NY 2012 v1.0
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu |
 
Lecture 05 - The Data Warehouse and Technology
Lecture 05 - The Data Warehouse and TechnologyLecture 05 - The Data Warehouse and Technology
Lecture 05 - The Data Warehouse and Technology
 
Dave Debre - Backup Options
Dave Debre - Backup OptionsDave Debre - Backup Options
Dave Debre - Backup Options
 
IBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTION
IBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTIONIBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTION
IBM PROTECTIER AND SAP: CRITICAL DATA PROTECTION WITHOUT DATA DISRUPTION
 
Development_data_standards_data_integration_tools
Development_data_standards_data_integration_toolsDevelopment_data_standards_data_integration_tools
Development_data_standards_data_integration_tools
 
Generic RLM White Paper
Generic RLM White PaperGeneric RLM White Paper
Generic RLM White Paper
 
Raid levels
Raid levelsRaid levels
Raid levels
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
Webinar: SDS is Broken - And How to Fix it
Webinar: SDS is Broken - And How to Fix itWebinar: SDS is Broken - And How to Fix it
Webinar: SDS is Broken - And How to Fix it
 
Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012
Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012
Panzura & Scality - Cloud Storage made seamless - Cloud Expo New York City 2012
 

Viewers also liked

Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)
Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)
Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)Βάσω Αρέλη
 
ο πληθυσμός της σμύρνης και η ελληνική κοινότητα
ο πληθυσμός της σμύρνης και η ελληνική κοινότηταο πληθυσμός της σμύρνης και η ελληνική κοινότητα
ο πληθυσμός της σμύρνης και η ελληνική κοινότηταΒάσω Αρέλη
 
Sinaunang Kabihasnan sa Egypt
Sinaunang Kabihasnan sa EgyptSinaunang Kabihasnan sa Egypt
Sinaunang Kabihasnan sa Egypttwocrowns
 
Lise bourbeau asculta-ti corpul
Lise bourbeau   asculta-ti corpulLise bourbeau   asculta-ti corpul
Lise bourbeau asculta-ti corpulCristina Gioada
 
[A. v. arasu]_turbo_machines(book_fi.org)
[A. v. arasu]_turbo_machines(book_fi.org)[A. v. arasu]_turbo_machines(book_fi.org)
[A. v. arasu]_turbo_machines(book_fi.org)wondie chanie
 
Servicios AQCLab 2017
Servicios AQCLab 2017Servicios AQCLab 2017
Servicios AQCLab 2017AQCLab
 
بهینه‌سازی تجربه‌کاربری
بهینه‌سازی تجربه‌کاربریبهینه‌سازی تجربه‌کاربری
بهینه‌سازی تجربه‌کاربریWeb Standards School
 
DDN and Intel: Partnered for Exascale
DDN and Intel: Partnered for ExascaleDDN and Intel: Partnered for Exascale
DDN and Intel: Partnered for ExascaleIntel IT Center
 
SNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformSNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformJoey Jablonski
 
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalIntelHealthcare
 
DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...
DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...
DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...inside-BigData.com
 
Phan tich co phieu JVC, DNM, DDN (fintzone)
Phan tich co phieu JVC, DNM, DDN  (fintzone)Phan tich co phieu JVC, DNM, DDN  (fintzone)
Phan tich co phieu JVC, DNM, DDN (fintzone)Tony Auditor
 
DDN: Protecting Your Data, Protecting Your Hardware
DDN: Protecting Your Data, Protecting Your HardwareDDN: Protecting Your Data, Protecting Your Hardware
DDN: Protecting Your Data, Protecting Your Hardwareinside-BigData.com
 
Optimizing Lustre and GPFS with DDN
Optimizing Lustre and GPFS with DDNOptimizing Lustre and GPFS with DDN
Optimizing Lustre and GPFS with DDNinside-BigData.com
 
IBM general parallel file system - introduction
IBM general parallel file system - introductionIBM general parallel file system - introduction
IBM general parallel file system - introductionIBM Danmark
 

Viewers also liked (20)

Avni YÜKSEL-CV
Avni YÜKSEL-CVAvni YÜKSEL-CV
Avni YÜKSEL-CV
 
Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)
Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)
Ο Πληθυσμός της Σμύρνης και η Ελληνική Κοινότητα (17ος - 19ος αιώνας)
 
ο πληθυσμός της σμύρνης και η ελληνική κοινότητα
ο πληθυσμός της σμύρνης και η ελληνική κοινότηταο πληθυσμός της σμύρνης και η ελληνική κοινότητα
ο πληθυσμός της σμύρνης και η ελληνική κοινότητα
 
Horti community
Horti communityHorti community
Horti community
 
Sinaunang Kabihasnan sa Egypt
Sinaunang Kabihasnan sa EgyptSinaunang Kabihasnan sa Egypt
Sinaunang Kabihasnan sa Egypt
 
Lise bourbeau asculta-ti corpul
Lise bourbeau   asculta-ti corpulLise bourbeau   asculta-ti corpul
Lise bourbeau asculta-ti corpul
 
[A. v. arasu]_turbo_machines(book_fi.org)
[A. v. arasu]_turbo_machines(book_fi.org)[A. v. arasu]_turbo_machines(book_fi.org)
[A. v. arasu]_turbo_machines(book_fi.org)
 
Servicios AQCLab 2017
Servicios AQCLab 2017Servicios AQCLab 2017
Servicios AQCLab 2017
 
بهینه‌سازی تجربه‌کاربری
بهینه‌سازی تجربه‌کاربریبهینه‌سازی تجربه‌کاربری
بهینه‌سازی تجربه‌کاربری
 
DDN and Intel: Partnered for Exascale
DDN and Intel: Partnered for ExascaleDDN and Intel: Partnered for Exascale
DDN and Intel: Partnered for Exascale
 
SNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformSNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop Platform
 
DDN Service Strategy
DDN Service StrategyDDN Service Strategy
DDN Service Strategy
 
Ddn Vision
Ddn VisionDdn Vision
Ddn Vision
 
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
 
DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...
DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...
DDN GS7K - Easy-to-deploy, High Performance Scale-Out Parallel File System Ap...
 
Corralling Big Data at TACC
Corralling Big Data at TACCCorralling Big Data at TACC
Corralling Big Data at TACC
 
Phan tich co phieu JVC, DNM, DDN (fintzone)
Phan tich co phieu JVC, DNM, DDN  (fintzone)Phan tich co phieu JVC, DNM, DDN  (fintzone)
Phan tich co phieu JVC, DNM, DDN (fintzone)
 
DDN: Protecting Your Data, Protecting Your Hardware
DDN: Protecting Your Data, Protecting Your HardwareDDN: Protecting Your Data, Protecting Your Hardware
DDN: Protecting Your Data, Protecting Your Hardware
 
Optimizing Lustre and GPFS with DDN
Optimizing Lustre and GPFS with DDNOptimizing Lustre and GPFS with DDN
Optimizing Lustre and GPFS with DDN
 
IBM general parallel file system - introduction
IBM general parallel file system - introductionIBM general parallel file system - introduction
IBM general parallel file system - introduction
 

Similar to Academic Workflow Research Repositories iRODS Object Storage

DDN Strategic Vision Tour June 2015
DDN Strategic Vision Tour June 2015DDN Strategic Vision Tour June 2015
DDN Strategic Vision Tour June 2015inside-BigData.com
 
Eliminating the Problems of Exponential Data Growth, Forever
Eliminating the Problems of Exponential Data Growth, ForeverEliminating the Problems of Exponential Data Growth, Forever
Eliminating the Problems of Exponential Data Growth, Foreverspectralogic
 
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataWebinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataStorage Switzerland
 
Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)Denodo
 
Data core overview - haluk-final
Data core overview - haluk-finalData core overview - haluk-final
Data core overview - haluk-finalHaluk Ulubay
 
The Importance of Fast, Scalable Storage for Today’s HPC
The Importance of Fast, Scalable Storage for Today’s HPCThe Importance of Fast, Scalable Storage for Today’s HPC
The Importance of Fast, Scalable Storage for Today’s HPCIntel IT Center
 
04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik
04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik
04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, RubrikVMUG IT
 
Webinar: Overcoming the Storage Roadblock to Data Center Modernization
Webinar: Overcoming the Storage Roadblock to Data Center ModernizationWebinar: Overcoming the Storage Roadblock to Data Center Modernization
Webinar: Overcoming the Storage Roadblock to Data Center ModernizationStorage Switzerland
 
Four Reasons Why Your Backup & Recovery Hardware will Break by 2020
Four Reasons Why Your Backup & Recovery Hardware will Break by 2020Four Reasons Why Your Backup & Recovery Hardware will Break by 2020
Four Reasons Why Your Backup & Recovery Hardware will Break by 2020Storage Switzerland
 
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...Denodo
 
Asset Management and Workflow
Asset Management and WorkflowAsset Management and Workflow
Asset Management and WorkflowVirtu Institute
 
Seqrite Data Loss Prevention- Complete Protection from Data Theft and Data Loss
Seqrite Data Loss Prevention- Complete Protection from Data Theft and Data LossSeqrite Data Loss Prevention- Complete Protection from Data Theft and Data Loss
Seqrite Data Loss Prevention- Complete Protection from Data Theft and Data LossQuick Heal Technologies Ltd.
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationDenodo
 
ArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUDArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUDGaret Keller
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Denodo
 
Presentation dell™ power vault™ md3
Presentation   dell™ power vault™ md3Presentation   dell™ power vault™ md3
Presentation dell™ power vault™ md3xKinAnx
 

Similar to Academic Workflow Research Repositories iRODS Object Storage (20)

DDN Product Update from SC13
DDN Product Update from SC13DDN Product Update from SC13
DDN Product Update from SC13
 
DDN Strategic Vision Tour June 2015
DDN Strategic Vision Tour June 2015DDN Strategic Vision Tour June 2015
DDN Strategic Vision Tour June 2015
 
Eliminating the Problems of Exponential Data Growth, Forever
Eliminating the Problems of Exponential Data Growth, ForeverEliminating the Problems of Exponential Data Growth, Forever
Eliminating the Problems of Exponential Data Growth, Forever
 
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataWebinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
 
Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)
 
Data core overview - haluk-final
Data core overview - haluk-finalData core overview - haluk-final
Data core overview - haluk-final
 
The Importance of Fast, Scalable Storage for Today’s HPC
The Importance of Fast, Scalable Storage for Today’s HPCThe Importance of Fast, Scalable Storage for Today’s HPC
The Importance of Fast, Scalable Storage for Today’s HPC
 
04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik
04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik
04 - VMUGIT - Lecce 2018 - Giampiero Petrosi, Rubrik
 
Webinar: Overcoming the Storage Roadblock to Data Center Modernization
Webinar: Overcoming the Storage Roadblock to Data Center ModernizationWebinar: Overcoming the Storage Roadblock to Data Center Modernization
Webinar: Overcoming the Storage Roadblock to Data Center Modernization
 
Four Reasons Why Your Backup & Recovery Hardware will Break by 2020
Four Reasons Why Your Backup & Recovery Hardware will Break by 2020Four Reasons Why Your Backup & Recovery Hardware will Break by 2020
Four Reasons Why Your Backup & Recovery Hardware will Break by 2020
 
The storage matrix netmagic
The storage matrix   netmagicThe storage matrix   netmagic
The storage matrix netmagic
 
Netmagic the-storage-matrix
Netmagic the-storage-matrixNetmagic the-storage-matrix
Netmagic the-storage-matrix
 
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
 
EMC config Hadoop
EMC config HadoopEMC config Hadoop
EMC config Hadoop
 
Asset Management and Workflow
Asset Management and WorkflowAsset Management and Workflow
Asset Management and Workflow
 
Seqrite Data Loss Prevention- Complete Protection from Data Theft and Data Loss
Seqrite Data Loss Prevention- Complete Protection from Data Theft and Data LossSeqrite Data Loss Prevention- Complete Protection from Data Theft and Data Loss
Seqrite Data Loss Prevention- Complete Protection from Data Theft and Data Loss
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
ArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUDArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUD
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
 
Presentation dell™ power vault™ md3
Presentation   dell™ power vault™ md3Presentation   dell™ power vault™ md3
Presentation dell™ power vault™ md3
 

Academic Workflow Research Repositories iRODS Object Storage

  • 1. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 1! Academic Workflow for Research Repositories Using iRODS and Object Storage 2016 iRODS User’s Group Meeting 9 June 2016 Randall Splinter, Ph.D. HPC Research Computing Solutions Architect RSplinter@ddn.com 770.633.2994
  • 2. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 2! Agenda ▶  Introduction to the Problem •  HPC Workflows •  The Problem of Long Term Archiving ▶  Object Store to the Rescue ▶  How iRODS Enables Object Storage ▶  Why iRODS with DDN WOS is a Superior Solution for Research Repositories ▶  A Case Study
  • 3. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 3! Introduction to the Problem ▶  The Problem of Collaboration •  NAS technologies (NFS, CIFS) are local o  They don’t tend to scale well over WAN distances – But collaborators are frequently widely separated in distance o  How to enable researchers to share data without administrative overhead – securely – Typically, only system administrators can control the ACLs on NFS/CIFS mounts •  FTP o  Security •  Tape o  Yech
  • 4. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 4! HPC Workflows ▶  Ingest data from a source (Analysis of data) •  Pre-analysis on low-end storage •  Move cleaned up data to a PFS and compute •  After full analysis data is moved to long term storage o  Can this be automated? – Yes, with iRODS. ▶  No ingest (Pure simulation) •  Compute models are run on a compute cluster with PFS •  After full analysis data is moved to long term storage o  Again automation is key. ▶  Data sets are exploding!
  • 5. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 5! The Problem of Long Term Archiving ▶  Data must be secure •  From deletion (accident or deliberate) •  Loss from theft o  Security hacks o  Faculty or students leave and take IP with them ▶  Must satisfy regulatory restrictions •  HIPAA, for instance ▶  Changing hardware standards •  In particular tape standards ▶  Hardware availability •  Spinning media is mechanical and will not last forever
  • 6. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 6! Object Store to the Rescue ▶  All Object stores provide a way to replicate data over large distances •  Some more effectively than others o  Provides a way to effectively share data over WAN scales •  Most object stores were designed for cloud storage o  Security has always been important o  Ease of data sharing has been important ▶  This now enables more effective data sharing and data security than with traditional storage solutions at price points that traditional NAS systems cannot approach.
  • 7. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 7! How iRODS Enables Object Storage ▶  iRODS is a very effective middleware layer for accessing multiple storage resources ▶  iRODS handles the security and database management of the ingested data ▶  Provides a powerful metadata search capability for ingested data ▶  Provides a rules engine for the processing of incoming data and the manipulation of data on the back-end •  For instance, o  Data can be moved to slower storage resources as they age or another criteria is met (Essentially HSM) o  Data can be secured from removal, editing or modification based upon criteria using the “null” chmod – Retention policies! o  Anything else?
  • 8. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 8! DDN WOS: Key Features Fully-Integrated Object Storage Appliance WOS7000, 60 Drives in 4U, with 1 or 2 object storage servers per appliance Federated, Global Namespace Locally or across multiple geographies with smart policies for performance and/or disaster recovery on a per-object basis Pure Object Storage Formats drives with custom WOS disk file system, no Linux file I/Os, no fragmentation, fully contiguous object read and write operations for maximum disk efficiency Latency-Aware Access Manager WOS intelligently makes decisions on the best geographies to get from based upon location access load and latency User Defined Metadata and Metadata Search Applications can assign their own metadata via object storage API, WOS now also supports parallel search of WOS user metadata Self-healing Architecture No hard tie between physical disks and data. Failed drives are recovered through dispersed data placement – rebuilds happen at read, not write, speed – rebuilding only data. Flexible Data Protection Select multiple policy-driven data protection schemes to meet application, workflow and disaster recovery requirements Exabyte Scalability Create virtually limitless data repositories, non-disruptively seamlessly scale to over an exabyte of capacity
  • 9. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 9! Why iRODS with DDN WOS is a Superior Solution for Repositories ▶  Ease of Scalability with WOS ▶  Ease of administration – Once rules are tested and in place the system can be managed with a minimum of administrative overhead ▶  Automating workflows to guarantee consistency and reproducibility in the science that is produced
  • 10. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 10! Why iRODS with DDN WOS is a Superior Solution for Repositories ▶  Ease of auditing for both usage and back charging and for maintaining adequate data security compliance ▶  DDN WOS makes remote replication simple and provides a straightforward way to manage DR systems
  • 11. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 11! Why iRODS with WOS is a Superior Solution for Repositories ▶  Central to any repository is the ability to add metadata tags and search metadata. •  iRODS has extensive abilities to do that – Significantly better than any competing options # imeta add –d filename “Date” “2 Feb 2016” # imeta ls –d filename AVUs defined for dataObj filename: attribute: Meta1 value: hello units: --- attribute: Date value: 2 Feb 2016 units: # imeta rm –d filename “Meta1” “hello”
  • 12. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 12! A Case Study Hrothgar Compute Cluster Lustre Filesystem
  • 13. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 13! 9351 Deering Avenue Chatsworth, CA 91311 1.800.837.2298 1.818.700.4000 company/datadirect-networks @ddn_limitless sales@ddn.com Thank You! Keep in touch with us Questions? .