SlideShare a Scribd company logo
1May 2017 Igor Sfiligoi
Nirvana and EasyHSM
Storage Solutions
from
General Atomics
2
• The Storage Software Group at General Atomics has
a long tradition in data handling
– Going back to 1995 when we ran
the San Diego Supercomputing Center
• Nirvana is a complete object store solution
– Storage abstraction, federation and tiering
– Metadata repository, extraction and handling
– Audit trail
• EasyHSM is a POSIX solution
– Integrated with IBM Spectrum Scale (GPFS)
– Optimized for storage abstraction and tiering
What do we offer?
3
4
Nirvana can be called a virtual object store
Main characteristics:
• Global namespace,
spanning multiple physical storage pools
• Layers on top of existing storage resources
– Supports POSIX, Windows, NFS, S3 and
Google Storage APIs
• Implements replication and migration logic
– Both explicit and policy driven
Nirvana Storage Handling
5
• Nirvana does not provide its own
filesystem solution
– There are many available,
why re-invent the wheel?
• Nirvana can operate
on anything that supports
– File paths
– Basic file permissions
– Data ingest and retrieval
• Straight mapping between Nirvana’s global
namespace and underlying storage system
– No obfuscation, easy to interact with external tools
Operating on top of existing storage
At least in theory, we
provide implementation
for a fixed set
6
• Global namespace allows for decoupling
of logical location and physical storage
• Users care about the logical location
– E.g. files in my home directory
– They don’t care where the data is stored, as long as
they can get it (fast enough) when needed
• System administrators can manage physical space
– Based on SLAs and budget constraints
– Files in a single logical folder may be partitioned
across multiple storage pools (e.g. based on age),
or have different number of replicas (based on type)
Global namespace
7
• Metadata is “data about data”
– i.e. File attributes
• By adding attributes to your files
– Easier to find your files
– Easier for others to find your files
– Dynamically sort files in virtual collections
Nirvana is also Metadata-centric
Saves time
and money
8
• Nirvana stores metadata in a standard RDBMS
– Supports PostrgeSQL and Oracle
• Administrators define the attribute schemas
• Metadata can be generated
– By hand, using either cmdline or Web client tools
– Automatically extracted from the filesystem using
provided scrubbing/reconciliation tools
Metadata generation
9
• Nirvana (of course) provides per-file attribute lookup
• Nirvana also provides
attribute-based file lookup
– Both cmdline and Web client tools
• Nirvana also provides virtual collections
– Indistinguishable from a traditional read-only directory
from the user point of view
Metadata-based search
Try to do this
with AWS S3!
10
• User grouping and federation
• Fine grained file permissions
– Supports discretionary,
role based and
mandatory access controls
• Keeps a detailed audit trail
Nirvana Security features
11
• Dramatic scrubbing/reconciliation speedup
– System attributes: observed 1M files/minute (16k/s)
– JPG scrubbing: observed 60k files/minute (1k/s)
• New Web Administration interface
– Web client available since a long time
– Old-style Java-based GUI still available, too
• Nirvana now offers a S3 REST server, too
– In addition to the proprietary APIs and tools
Recent changes
12
13
• EasyHSM is a Hierarchical Storage Management (HSM)
solution for IBM Spectrum Scale (GPFS)
• GPFS provides the namespace management
– And hot data storage management
• EasyHSM provides the tiering logic
– Between GPFS storage and external storage
EasyHSM is a Transparent Tiering Solution
IBM Spectrum Scale
EasyHSM
Smallish but fast layer
Large but slower
bulk storage
14
• A mature, very reliable storage system
– With solid backing from IBM
• Highly scalable
– Originally designed for HPC applications
• Multiple access point protocols
– Traditional NFS, SMB/CIFS, S3
– High performance Native Client Drivers
The IBM Spectrum Scale Advantage
GPFS
Clients
SMB
NFSS3
15
• Not all data needs to be in the fast, GPFS layer
– But all files must be visible to the users
• EasyHSM provides an effective tiering solution
– Uses HSM file stubs to link to external storage
– Policy driven explicit movement of data
between GPFS and external storage
– Transparent high performance
retrieval of data into the GPFS layer
on user access
Tiering with EasyHSM
16
• Tools for migrating data from GFPS to external storage
• Tools for freeing disk space on GPFS (stubbing)
– Invisible to final users
• A daemon for transparent recall on access
– And handling file removal
• Tools for explicit recall from external storage to GPFS
• Support scripts for easy
integration with GPFS Policy Manager (ILM)
EasyHSM components
17
• EasyHSM is the only solution that allows tiering using
multiple external protocols (both POSIX and Object)
• EasyHSM does a straight mapping between GPFS
path and external storage path/URI
– Minimal massaging, e.g. adding protocol and IP
– No obfuscation, easy to interact with external tools
• Scalable, low overhead implementation
– Linear scaling over multiple hardware nodes
– Not in the path for hot data
• Pure software solution
– Does not need dedicated hardware
• Fixed price licensing (not capacity based)
EasyHSM vs Competition
18
• EasyHSM does support Nirvana as external storage
– But can (obviously) work without it, too
• When used together, separation of duties
– EasyHSM provides the tiering from GPFS
– Nirvana handles replication and migration
between multiple external storage systems
EasyHSM and Nirvana
IBM Spectrum Scale
EasyHSM Nirvana
19
• EasyHSM home page:
http://www.ga.com/easyhsm
• Nirvana home page:
http://www.ga.com/nirvana
If you just want to know more, feel free to
– Contact me at
Igor.Sfiligoi@ga.com
– Contact the program manager at
Robert.Murphy@ga.com
For more information

More Related Content

Similar to Storage Solutions from General Atomics

Building a Global Namespace with Nirvana
Building a Global Namespace with NirvanaBuilding a Global Namespace with Nirvana
Building a Global Namespace with Nirvana
Igor Sfiligoi
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
Sandeep Patil
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
Trishali Nayar
 
11. dfs
11. dfs11. dfs
12. dfs
12. dfs12. dfs
operating systems and File Management.ppt
operating systems and File Management.pptoperating systems and File Management.ppt
operating systems and File Management.ppt
JmmJb
 
Gpfs introandsetup
Gpfs introandsetupGpfs introandsetup
Gpfs introandsetup
asihan
 
HSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and NirvanaHSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and Nirvana
Igor Sfiligoi
 
Ch10 file system interface
Ch10   file system interfaceCh10   file system interface
Ch10 file system interface
Welly Dian Astika
 
Introduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingIntroduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud Computing
Rutuja751147
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systemsViet-Trung TRAN
 
Chapter-5-DFS.ppt
Chapter-5-DFS.pptChapter-5-DFS.ppt
Chapter-5-DFS.ppt
rameshwarchintamani
 
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Doug O'Flaherty
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
Gluster.org
 
Linux Memory Analysis with Volatility
Linux Memory Analysis with VolatilityLinux Memory Analysis with Volatility
Linux Memory Analysis with Volatility
Andrew Case
 
Augmenting Big Data Analytics with Nirvana
Augmenting Big Data Analytics with NirvanaAugmenting Big Data Analytics with Nirvana
Augmenting Big Data Analytics with Nirvana
Igor Sfiligoi
 
HDFS_architecture.ppt
HDFS_architecture.pptHDFS_architecture.ppt
HDFS_architecture.ppt
vijayapraba1
 
Big Data Architecture Workshop - Vahid Amiri
Big Data Architecture Workshop -  Vahid AmiriBig Data Architecture Workshop -  Vahid Amiri
Big Data Architecture Workshop - Vahid Amiri
datastack
 

Similar to Storage Solutions from General Atomics (20)

Building a Global Namespace with Nirvana
Building a Global Namespace with NirvanaBuilding a Global Namespace with Nirvana
Building a Global Namespace with Nirvana
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
 
11. dfs
11. dfs11. dfs
11. dfs
 
12. dfs
12. dfs12. dfs
12. dfs
 
operating systems and File Management.ppt
operating systems and File Management.pptoperating systems and File Management.ppt
operating systems and File Management.ppt
 
Dfs in iaa_s
Dfs in iaa_sDfs in iaa_s
Dfs in iaa_s
 
Gpfs introandsetup
Gpfs introandsetupGpfs introandsetup
Gpfs introandsetup
 
HSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and NirvanaHSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and Nirvana
 
Ch10 file system interface
Ch10   file system interfaceCh10   file system interface
Ch10 file system interface
 
Introduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingIntroduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud Computing
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systems
 
Chapter-5-DFS.ppt
Chapter-5-DFS.pptChapter-5-DFS.ppt
Chapter-5-DFS.ppt
 
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
 
Linux Memory Analysis with Volatility
Linux Memory Analysis with VolatilityLinux Memory Analysis with Volatility
Linux Memory Analysis with Volatility
 
file management
file managementfile management
file management
 
Augmenting Big Data Analytics with Nirvana
Augmenting Big Data Analytics with NirvanaAugmenting Big Data Analytics with Nirvana
Augmenting Big Data Analytics with Nirvana
 
HDFS_architecture.ppt
HDFS_architecture.pptHDFS_architecture.ppt
HDFS_architecture.ppt
 
Big Data Architecture Workshop - Vahid Amiri
Big Data Architecture Workshop -  Vahid AmiriBig Data Architecture Workshop -  Vahid Amiri
Big Data Architecture Workshop - Vahid Amiri
 

More from Igor Sfiligoi

Preparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROPreparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYRO
Igor Sfiligoi
 
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
Igor Sfiligoi
 
Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...
Igor Sfiligoi
 
The anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingThe anachronism of whole-GPU accounting
The anachronism of whole-GPU accounting
Igor Sfiligoi
 
Auto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesAuto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resources
Igor Sfiligoi
 
Speeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateSpeeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rate
Igor Sfiligoi
 
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsPerformance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Igor Sfiligoi
 
Comparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeComparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance compute
Igor Sfiligoi
 
Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...
Igor Sfiligoi
 
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessAccelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Igor Sfiligoi
 
Using A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputUsing A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific Output
Igor Sfiligoi
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobs
Igor Sfiligoi
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYRO
Igor Sfiligoi
 
Data-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstData-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud Burst
Igor Sfiligoi
 
Scheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyScheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with Admiralty
Igor Sfiligoi
 
Accelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCAccelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACC
Igor Sfiligoi
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Igor Sfiligoi
 
Porting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsPorting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUs
Igor Sfiligoi
 
Demonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsDemonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public Clouds
Igor Sfiligoi
 
TransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksTransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud links
Igor Sfiligoi
 

More from Igor Sfiligoi (20)

Preparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROPreparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYRO
 
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
 
Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...
 
The anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingThe anachronism of whole-GPU accounting
The anachronism of whole-GPU accounting
 
Auto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesAuto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resources
 
Speeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateSpeeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rate
 
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsPerformance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
 
Comparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeComparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance compute
 
Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...
 
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessAccelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
 
Using A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputUsing A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific Output
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobs
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYRO
 
Data-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstData-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud Burst
 
Scheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyScheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with Admiralty
 
Accelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCAccelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACC
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
 
Porting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsPorting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUs
 
Demonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsDemonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public Clouds
 
TransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksTransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud links
 

Recently uploaded

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 

Recently uploaded (20)

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 

Storage Solutions from General Atomics

  • 1. 1May 2017 Igor Sfiligoi Nirvana and EasyHSM Storage Solutions from General Atomics
  • 2. 2 • The Storage Software Group at General Atomics has a long tradition in data handling – Going back to 1995 when we ran the San Diego Supercomputing Center • Nirvana is a complete object store solution – Storage abstraction, federation and tiering – Metadata repository, extraction and handling – Audit trail • EasyHSM is a POSIX solution – Integrated with IBM Spectrum Scale (GPFS) – Optimized for storage abstraction and tiering What do we offer?
  • 3. 3
  • 4. 4 Nirvana can be called a virtual object store Main characteristics: • Global namespace, spanning multiple physical storage pools • Layers on top of existing storage resources – Supports POSIX, Windows, NFS, S3 and Google Storage APIs • Implements replication and migration logic – Both explicit and policy driven Nirvana Storage Handling
  • 5. 5 • Nirvana does not provide its own filesystem solution – There are many available, why re-invent the wheel? • Nirvana can operate on anything that supports – File paths – Basic file permissions – Data ingest and retrieval • Straight mapping between Nirvana’s global namespace and underlying storage system – No obfuscation, easy to interact with external tools Operating on top of existing storage At least in theory, we provide implementation for a fixed set
  • 6. 6 • Global namespace allows for decoupling of logical location and physical storage • Users care about the logical location – E.g. files in my home directory – They don’t care where the data is stored, as long as they can get it (fast enough) when needed • System administrators can manage physical space – Based on SLAs and budget constraints – Files in a single logical folder may be partitioned across multiple storage pools (e.g. based on age), or have different number of replicas (based on type) Global namespace
  • 7. 7 • Metadata is “data about data” – i.e. File attributes • By adding attributes to your files – Easier to find your files – Easier for others to find your files – Dynamically sort files in virtual collections Nirvana is also Metadata-centric Saves time and money
  • 8. 8 • Nirvana stores metadata in a standard RDBMS – Supports PostrgeSQL and Oracle • Administrators define the attribute schemas • Metadata can be generated – By hand, using either cmdline or Web client tools – Automatically extracted from the filesystem using provided scrubbing/reconciliation tools Metadata generation
  • 9. 9 • Nirvana (of course) provides per-file attribute lookup • Nirvana also provides attribute-based file lookup – Both cmdline and Web client tools • Nirvana also provides virtual collections – Indistinguishable from a traditional read-only directory from the user point of view Metadata-based search Try to do this with AWS S3!
  • 10. 10 • User grouping and federation • Fine grained file permissions – Supports discretionary, role based and mandatory access controls • Keeps a detailed audit trail Nirvana Security features
  • 11. 11 • Dramatic scrubbing/reconciliation speedup – System attributes: observed 1M files/minute (16k/s) – JPG scrubbing: observed 60k files/minute (1k/s) • New Web Administration interface – Web client available since a long time – Old-style Java-based GUI still available, too • Nirvana now offers a S3 REST server, too – In addition to the proprietary APIs and tools Recent changes
  • 12. 12
  • 13. 13 • EasyHSM is a Hierarchical Storage Management (HSM) solution for IBM Spectrum Scale (GPFS) • GPFS provides the namespace management – And hot data storage management • EasyHSM provides the tiering logic – Between GPFS storage and external storage EasyHSM is a Transparent Tiering Solution IBM Spectrum Scale EasyHSM Smallish but fast layer Large but slower bulk storage
  • 14. 14 • A mature, very reliable storage system – With solid backing from IBM • Highly scalable – Originally designed for HPC applications • Multiple access point protocols – Traditional NFS, SMB/CIFS, S3 – High performance Native Client Drivers The IBM Spectrum Scale Advantage GPFS Clients SMB NFSS3
  • 15. 15 • Not all data needs to be in the fast, GPFS layer – But all files must be visible to the users • EasyHSM provides an effective tiering solution – Uses HSM file stubs to link to external storage – Policy driven explicit movement of data between GPFS and external storage – Transparent high performance retrieval of data into the GPFS layer on user access Tiering with EasyHSM
  • 16. 16 • Tools for migrating data from GFPS to external storage • Tools for freeing disk space on GPFS (stubbing) – Invisible to final users • A daemon for transparent recall on access – And handling file removal • Tools for explicit recall from external storage to GPFS • Support scripts for easy integration with GPFS Policy Manager (ILM) EasyHSM components
  • 17. 17 • EasyHSM is the only solution that allows tiering using multiple external protocols (both POSIX and Object) • EasyHSM does a straight mapping between GPFS path and external storage path/URI – Minimal massaging, e.g. adding protocol and IP – No obfuscation, easy to interact with external tools • Scalable, low overhead implementation – Linear scaling over multiple hardware nodes – Not in the path for hot data • Pure software solution – Does not need dedicated hardware • Fixed price licensing (not capacity based) EasyHSM vs Competition
  • 18. 18 • EasyHSM does support Nirvana as external storage – But can (obviously) work without it, too • When used together, separation of duties – EasyHSM provides the tiering from GPFS – Nirvana handles replication and migration between multiple external storage systems EasyHSM and Nirvana IBM Spectrum Scale EasyHSM Nirvana
  • 19. 19 • EasyHSM home page: http://www.ga.com/easyhsm • Nirvana home page: http://www.ga.com/nirvana If you just want to know more, feel free to – Contact me at Igor.Sfiligoi@ga.com – Contact the program manager at Robert.Murphy@ga.com For more information