SlideShare a Scribd company logo
2
Integrating Globus into LRZ’s
Data Science Storage Service
GlobusWorld 2019 | 2019-05-01 | Stephan Peinkofer
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
3Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Bavarian Academy of Sciences and Humanities
Leibniz Supercomputing Centre
Computer Centre
for all Munich Universities250
employees
approx.
57
years of
IT support
IT Service Backbone for the Advancement of Science and Research
Regional Computer Centre
for all Bavarian Universities
National Supercomputing Centre
(GCS)
European Supercomputing Centre
(PRACE)
High Performance Computing
SuperMUC-NG, LRZ Linux Cluster
Virtual Reality and Visualisation
V2C (CAVE, Powerwall)
4
Operating Cutting-Edge IT Infrastructure
LRZ as an IT Center of Excellence
Storage
Network
Cloud Computing
Cluster
HPC
Training
Consultancy
Email
High Speed Networking
Munich Scientific Network
Big Data
Bavarian State Library Digital Archive
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
5Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Data Silos
Increasing User Demand
6
I need to share a 400TB dataset
with someone in Canada!
My experiment will generate
multiple PBs, that have to be
analyzed and backed up! How?
I want to build a WebApp that allows
users to interactively analyze my
500TB SuperMUC simulation data!
I need to share
some data
on SuperMUC
between multiple
projects!
I want to analyze a large
dataset, generated on Super-
MUC, using some special OS
image on the LRZ Cloud!
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
7Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Satisfying User Demands
So basically we need to provide …
A file system that can be
shared amongst the complete
LRZ HPC Ecosystem
Some kind of external
access mechanism
for arbitrary entities
A Dropbox like
data management
approach
LRZ Data Science Storage
8Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Interactive processing
on LRZ Compute Cloud
Remote visualisation
on LRZs visualisation
systems
External access and sharing
via Globus Online
High performance backup
and archive of data on LRZs
Backup- and Archive System
Batch and interactive processing
on dedicated, hosted HPC Cluster at LRZ
High throughput batch processing
on LRZs Linux Cluster or SuperMUC
LRZ
Data
Science
Storage
IBM
Spectrum
SCALE
IBM
Spectrum
PROTECT
LRZ Identity
Managment
System
Globus
Mission Control
9Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
DSSWeb Self Service Portal
REST
API
CES Globus
Sharing
Globus
Connect
Server
RabbitMQ
Message
Bus
REST
API
Client
Manage-
ment
Service
REST API
Operations
Center
REST API
The Big Picture
10Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Huber
LMU User: lmuuser2
LinuxCluster SuperMUC
Project: lxpr2 Project: smpr2
User: lx22bp User: sm33sx
DSS Containers
11
Maier
TUM User: tumuser1
LinuxCluster SuperMUC
Project: lxpr1 Project: smpr1
User: lx11xc User: sm11bb
DSS POSIX Group in IDM/LDAP
pr45xa-dss-0000
DSS Container à GPFS Independent Fileset
/dss/dssfs01/pr45xa-dss-0000
drwxrws--- root pr45xa-dss-0000
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Technical Integration of Globus to LRZ DSS
Goal
12
Integrate Globus Sharing to
DSSWeb Self-Service Portal.
Allow Data Curators to share
DSS Containers with
arbitrary external users.
Problem Action
Globus let’s us control.
Who can share?
What can be shared?
We need to control.
Who can share what?
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
LRZ Data Science Storage
Technical Integration of Globus to LRZ DSS
13
DSS Container X
Container Group
/dss/dssfs01/dsscontX
DSS Container Directory
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
DSSWeb
Globus Online
LRZ MyProxy
DSS Globus Endpoint
1. Enable Globus Sharing
for DSS Container X
Data Curator
RobotUser aka
RobotUser@globusid.org
2. Login
to
MyProxy to
get
Certificate
3. Enable DSS Globus Endpoint
4. Create Shared Endpoint “LRZ DSS Container X”
LRZ DSS Container X
Shared Endpoint
6. Add RobotUser to
Container Access Group
5. Globus Magic
Technical Integration of Globus to LRZ DSS
14Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
DSSWeb
1. Invite
bop@wherever.com to
access DSS Container X
via Globus
Data Curator
RobotUser aka
RobotUser@globusid.org
2. Check if identity bop@wherever.com is already
known by Globus and if not create it
3. Add Globus ACL for Shared Endpoint LRZ DSS
Container X for identity bop@wherever.com
4. Globus Magic
bop@wherever.com
5. Bop is happy
LRZ Data Science Storage
DSS Container X
Container Group
/dss/dssfs01/dsscontX
DSS Container Directory
DSS Globus Endpoint*
LRZ DSS Container X
Shared Endpoint
Globus Online
Legal Integration of Globus to LRZ DSS
Regulation
15
European Union enforced the
EU General Data Protection
Regulation (GDPR) on 2018-05-
25
Use/Integration of Cloud
Services that process PII
requires a formal Controller-
Processor Agreement.
Transfer of personal data to third
countries requires special
safeguards
HIPPA and NIST rescue BAA to the rescue
HIPPA and NIST require
roughly similar technical and
organizational security controls
that are required by GDPR to
protect PII
Globus agreed to sign a
Controller-Processor
Agreement that contains the
EU-Model Clauses
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
16Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer

More Related Content

What's hot

We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
Nane Kratzke
 
Open Tracing, to order and understand your mess. - ApiConf 2017
Open Tracing, to order and understand your mess. - ApiConf 2017Open Tracing, to order and understand your mess. - ApiConf 2017
Open Tracing, to order and understand your mess. - ApiConf 2017
Gianluca Arbezzano
 
Cloudviews eurocloud rcosta
Cloudviews eurocloud rcostaCloudviews eurocloud rcosta
Cloudviews eurocloud rcostaEuroCloud
 
New web service oriented ARC
New web service oriented ARCNew web service oriented ARC
New web service oriented ARC
Ferenc Szalai
 
Cloud Computing Research Projects
Cloud Computing Research ProjectsCloud Computing Research Projects
Cloud Computing Research Projects
Phdtopiccom
 
Approaches for duplicating Kubernetes Storage with Gluster
Approaches for duplicating Kubernetes Storage with GlusterApproaches for duplicating Kubernetes Storage with Gluster
Approaches for duplicating Kubernetes Storage with Gluster
mountpoint.io
 
KNIME Software Overview
KNIME Software OverviewKNIME Software Overview
KNIME Software Overview
KNIMESlides
 
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebula Project
 
Scalable and reproducible workflows with Pachyderm
Scalable and reproducible workflows with PachydermScalable and reproducible workflows with Pachyderm
Scalable and reproducible workflows with Pachyderm
Jon Ander Novella
 
Monitor your application and sleep
Monitor your application and sleepMonitor your application and sleep
Monitor your application and sleep
Gianluca Arbezzano
 
Caching in the Cloud. Code Camp Iași April 2016.
Caching in the Cloud. Code Camp Iași April 2016. Caching in the Cloud. Code Camp Iași April 2016.
Caching in the Cloud. Code Camp Iași April 2016.
Expert Network
 
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan BocutiuMQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
landoop
 
OpenNebula Conference 2014
OpenNebula Conference 2014OpenNebula Conference 2014
OpenNebula Conference 2014
terradue
 
What's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talkWhat's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talk
confluent
 
On-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloadsOn-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloads
GeoffroyVallee
 
Distributed Tracing
Distributed TracingDistributed Tracing
Distributed Tracing
Kevin Ingelman
 
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
Rommel Garcia
 
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink Forward
 

What's hot (18)

We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
 
Open Tracing, to order and understand your mess. - ApiConf 2017
Open Tracing, to order and understand your mess. - ApiConf 2017Open Tracing, to order and understand your mess. - ApiConf 2017
Open Tracing, to order and understand your mess. - ApiConf 2017
 
Cloudviews eurocloud rcosta
Cloudviews eurocloud rcostaCloudviews eurocloud rcosta
Cloudviews eurocloud rcosta
 
New web service oriented ARC
New web service oriented ARCNew web service oriented ARC
New web service oriented ARC
 
Cloud Computing Research Projects
Cloud Computing Research ProjectsCloud Computing Research Projects
Cloud Computing Research Projects
 
Approaches for duplicating Kubernetes Storage with Gluster
Approaches for duplicating Kubernetes Storage with GlusterApproaches for duplicating Kubernetes Storage with Gluster
Approaches for duplicating Kubernetes Storage with Gluster
 
KNIME Software Overview
KNIME Software OverviewKNIME Software Overview
KNIME Software Overview
 
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
 
Scalable and reproducible workflows with Pachyderm
Scalable and reproducible workflows with PachydermScalable and reproducible workflows with Pachyderm
Scalable and reproducible workflows with Pachyderm
 
Monitor your application and sleep
Monitor your application and sleepMonitor your application and sleep
Monitor your application and sleep
 
Caching in the Cloud. Code Camp Iași April 2016.
Caching in the Cloud. Code Camp Iași April 2016. Caching in the Cloud. Code Camp Iași April 2016.
Caching in the Cloud. Code Camp Iași April 2016.
 
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan BocutiuMQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
 
OpenNebula Conference 2014
OpenNebula Conference 2014OpenNebula Conference 2014
OpenNebula Conference 2014
 
What's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talkWhat's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talk
 
On-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloadsOn-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloads
 
Distributed Tracing
Distributed TracingDistributed Tracing
Distributed Tracing
 
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
 
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
 

Similar to Integrating Globus into LRZ's Data Science Storage Service

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
Abhinav Joshi
 
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source
Liberate Your Files with a Private Cloud Storage Solution powered by Open SourceLiberate Your Files with a Private Cloud Storage Solution powered by Open Source
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source
Isaac Christoffersen
 
OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)
ConSol Consulting & Solutions Software GmbH
 
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red HatThe Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
OpenStack
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby project
Patrick Chanezon
 
From ECM to Content Services - Analyst Webinar
From ECM to Content Services - Analyst WebinarFrom ECM to Content Services - Analyst Webinar
From ECM to Content Services - Analyst Webinar
Nuxeo
 
Orchestrating Stateful Applications with PKS and Portworx
Orchestrating Stateful Applications with PKS and PortworxOrchestrating Stateful Applications with PKS and Portworx
Orchestrating Stateful Applications with PKS and Portworx
Carlos Andrés García
 
Orchestrating stateful applications with PKS and Portworx
Orchestrating stateful applications with PKS and PortworxOrchestrating stateful applications with PKS and Portworx
Orchestrating stateful applications with PKS and Portworx
VMware Tanzu
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
IPv6 on Container Plattforms
IPv6 on Container PlattformsIPv6 on Container Plattforms
IPv6 on Container Plattforms
Aarno Aukia
 
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
Digicomp Academy AG
 
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
Theofilos Kakantousis
 
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud ComputingOSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
Mark Hinkle
 
Comparison of control plane deployment architectures in the scope of hypercon...
Comparison of control plane deployment architectures in the scope of hypercon...Comparison of control plane deployment architectures in the scope of hypercon...
Comparison of control plane deployment architectures in the scope of hypercon...
Miroslav Halas
 
Scientific Computing @ Fred Hutch
Scientific Computing @ Fred HutchScientific Computing @ Fred Hutch
Scientific Computing @ Fred Hutch
Dirk Petersen
 
Cloud Native Application @ VMUG.IT 20150529
Cloud Native Application @ VMUG.IT 20150529Cloud Native Application @ VMUG.IT 20150529
Cloud Native Application @ VMUG.IT 20150529
VMUG IT
 
High Availability - Brett Thurber - ManageIQ Design Summit 2016
High Availability - Brett Thurber - ManageIQ Design Summit 2016High Availability - Brett Thurber - ManageIQ Design Summit 2016
High Availability - Brett Thurber - ManageIQ Design Summit 2016
ManageIQ
 
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
Daniel Krook
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
John Archer
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
DataWorks Summit
 

Similar to Integrating Globus into LRZ's Data Science Storage Service (20)

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
 
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source
Liberate Your Files with a Private Cloud Storage Solution powered by Open SourceLiberate Your Files with a Private Cloud Storage Solution powered by Open Source
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source
 
OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)
 
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red HatThe Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby project
 
From ECM to Content Services - Analyst Webinar
From ECM to Content Services - Analyst WebinarFrom ECM to Content Services - Analyst Webinar
From ECM to Content Services - Analyst Webinar
 
Orchestrating Stateful Applications with PKS and Portworx
Orchestrating Stateful Applications with PKS and PortworxOrchestrating Stateful Applications with PKS and Portworx
Orchestrating Stateful Applications with PKS and Portworx
 
Orchestrating stateful applications with PKS and Portworx
Orchestrating stateful applications with PKS and PortworxOrchestrating stateful applications with PKS and Portworx
Orchestrating stateful applications with PKS and Portworx
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
IPv6 on Container Plattforms
IPv6 on Container PlattformsIPv6 on Container Plattforms
IPv6 on Container Plattforms
 
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
 
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
 
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud ComputingOSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
 
Comparison of control plane deployment architectures in the scope of hypercon...
Comparison of control plane deployment architectures in the scope of hypercon...Comparison of control plane deployment architectures in the scope of hypercon...
Comparison of control plane deployment architectures in the scope of hypercon...
 
Scientific Computing @ Fred Hutch
Scientific Computing @ Fred HutchScientific Computing @ Fred Hutch
Scientific Computing @ Fred Hutch
 
Cloud Native Application @ VMUG.IT 20150529
Cloud Native Application @ VMUG.IT 20150529Cloud Native Application @ VMUG.IT 20150529
Cloud Native Application @ VMUG.IT 20150529
 
High Availability - Brett Thurber - ManageIQ Design Summit 2016
High Availability - Brett Thurber - ManageIQ Design Summit 2016High Availability - Brett Thurber - ManageIQ Design Summit 2016
High Availability - Brett Thurber - ManageIQ Design Summit 2016
 
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 

More from Globus

Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
Globus
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
Globus
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
Globus
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)
Globus
 
GlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote sessionGlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote session
Globus
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
Globus
 
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Extending Globus into a Site-wide Automated Data Infrastructure.pdfExtending Globus into a Site-wide Automated Data Infrastructure.pdf
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Globus
 
Globus at the United States Geological Survey
Globus at the United States Geological SurveyGlobus at the United States Geological Survey
Globus at the United States Geological Survey
Globus
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Globus Compute with Integrated Research Infrastructure (IRI) workflowsGlobus Compute with Integrated Research Infrastructure (IRI) workflows
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Globus
 
Reactive Documents and Computational Pipelines - Bridging the Gap
Reactive Documents and Computational Pipelines - Bridging the GapReactive Documents and Computational Pipelines - Bridging the Gap
Reactive Documents and Computational Pipelines - Bridging the Gap
Globus
 

More from Globus (20)

Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)
 
GlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote sessionGlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote session
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
 
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Extending Globus into a Site-wide Automated Data Infrastructure.pdfExtending Globus into a Site-wide Automated Data Infrastructure.pdf
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
 
Globus at the United States Geological Survey
Globus at the United States Geological SurveyGlobus at the United States Geological Survey
Globus at the United States Geological Survey
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
 
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Globus Compute with Integrated Research Infrastructure (IRI) workflowsGlobus Compute with Integrated Research Infrastructure (IRI) workflows
Globus Compute with Integrated Research Infrastructure (IRI) workflows
 
Reactive Documents and Computational Pipelines - Bridging the Gap
Reactive Documents and Computational Pipelines - Bridging the GapReactive Documents and Computational Pipelines - Bridging the Gap
Reactive Documents and Computational Pipelines - Bridging the Gap
 

Recently uploaded

Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 

Integrating Globus into LRZ's Data Science Storage Service

  • 1.
  • 2. 2 Integrating Globus into LRZ’s Data Science Storage Service GlobusWorld 2019 | 2019-05-01 | Stephan Peinkofer Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 3. 3Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Bavarian Academy of Sciences and Humanities Leibniz Supercomputing Centre Computer Centre for all Munich Universities250 employees approx. 57 years of IT support IT Service Backbone for the Advancement of Science and Research Regional Computer Centre for all Bavarian Universities National Supercomputing Centre (GCS) European Supercomputing Centre (PRACE)
  • 4. High Performance Computing SuperMUC-NG, LRZ Linux Cluster Virtual Reality and Visualisation V2C (CAVE, Powerwall) 4 Operating Cutting-Edge IT Infrastructure LRZ as an IT Center of Excellence Storage Network Cloud Computing Cluster HPC Training Consultancy Email High Speed Networking Munich Scientific Network Big Data Bavarian State Library Digital Archive Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 5. 5Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Data Silos
  • 6. Increasing User Demand 6 I need to share a 400TB dataset with someone in Canada! My experiment will generate multiple PBs, that have to be analyzed and backed up! How? I want to build a WebApp that allows users to interactively analyze my 500TB SuperMUC simulation data! I need to share some data on SuperMUC between multiple projects! I want to analyze a large dataset, generated on Super- MUC, using some special OS image on the LRZ Cloud! Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 7. 7Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Satisfying User Demands So basically we need to provide … A file system that can be shared amongst the complete LRZ HPC Ecosystem Some kind of external access mechanism for arbitrary entities A Dropbox like data management approach
  • 8. LRZ Data Science Storage 8Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Interactive processing on LRZ Compute Cloud Remote visualisation on LRZs visualisation systems External access and sharing via Globus Online High performance backup and archive of data on LRZs Backup- and Archive System Batch and interactive processing on dedicated, hosted HPC Cluster at LRZ High throughput batch processing on LRZs Linux Cluster or SuperMUC LRZ Data Science Storage
  • 9. IBM Spectrum SCALE IBM Spectrum PROTECT LRZ Identity Managment System Globus Mission Control 9Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer DSSWeb Self Service Portal REST API CES Globus Sharing Globus Connect Server RabbitMQ Message Bus REST API Client Manage- ment Service REST API Operations Center REST API
  • 10. The Big Picture 10Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 11. Huber LMU User: lmuuser2 LinuxCluster SuperMUC Project: lxpr2 Project: smpr2 User: lx22bp User: sm33sx DSS Containers 11 Maier TUM User: tumuser1 LinuxCluster SuperMUC Project: lxpr1 Project: smpr1 User: lx11xc User: sm11bb DSS POSIX Group in IDM/LDAP pr45xa-dss-0000 DSS Container à GPFS Independent Fileset /dss/dssfs01/pr45xa-dss-0000 drwxrws--- root pr45xa-dss-0000 Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 12. Technical Integration of Globus to LRZ DSS Goal 12 Integrate Globus Sharing to DSSWeb Self-Service Portal. Allow Data Curators to share DSS Containers with arbitrary external users. Problem Action Globus let’s us control. Who can share? What can be shared? We need to control. Who can share what? Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 13. LRZ Data Science Storage Technical Integration of Globus to LRZ DSS 13 DSS Container X Container Group /dss/dssfs01/dsscontX DSS Container Directory Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer DSSWeb Globus Online LRZ MyProxy DSS Globus Endpoint 1. Enable Globus Sharing for DSS Container X Data Curator RobotUser aka RobotUser@globusid.org 2. Login to MyProxy to get Certificate 3. Enable DSS Globus Endpoint 4. Create Shared Endpoint “LRZ DSS Container X” LRZ DSS Container X Shared Endpoint 6. Add RobotUser to Container Access Group 5. Globus Magic
  • 14. Technical Integration of Globus to LRZ DSS 14Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer DSSWeb 1. Invite bop@wherever.com to access DSS Container X via Globus Data Curator RobotUser aka RobotUser@globusid.org 2. Check if identity bop@wherever.com is already known by Globus and if not create it 3. Add Globus ACL for Shared Endpoint LRZ DSS Container X for identity bop@wherever.com 4. Globus Magic bop@wherever.com 5. Bop is happy LRZ Data Science Storage DSS Container X Container Group /dss/dssfs01/dsscontX DSS Container Directory DSS Globus Endpoint* LRZ DSS Container X Shared Endpoint Globus Online
  • 15. Legal Integration of Globus to LRZ DSS Regulation 15 European Union enforced the EU General Data Protection Regulation (GDPR) on 2018-05- 25 Use/Integration of Cloud Services that process PII requires a formal Controller- Processor Agreement. Transfer of personal data to third countries requires special safeguards HIPPA and NIST rescue BAA to the rescue HIPPA and NIST require roughly similar technical and organizational security controls that are required by GDPR to protect PII Globus agreed to sign a Controller-Processor Agreement that contains the EU-Model Clauses Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 16. 16Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer