Integrating Globus into LRZ's Data Science Storage Service

Globus
Integrating Globus into LRZ's Data Science Storage Service
2
Integrating Globus into LRZ’s
Data Science Storage Service
GlobusWorld 2019 | 2019-05-01 | Stephan Peinkofer
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
3Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Bavarian Academy of Sciences and Humanities
Leibniz Supercomputing Centre
Computer Centre
for all Munich Universities250
employees
approx.
57
years of
IT support
IT Service Backbone for the Advancement of Science and Research
Regional Computer Centre
for all Bavarian Universities
National Supercomputing Centre
(GCS)
European Supercomputing Centre
(PRACE)
High Performance Computing
SuperMUC-NG, LRZ Linux Cluster
Virtual Reality and Visualisation
V2C (CAVE, Powerwall)
4
Operating Cutting-Edge IT Infrastructure
LRZ as an IT Center of Excellence
Storage
Network
Cloud Computing
Cluster
HPC
Training
Consultancy
Email
High Speed Networking
Munich Scientific Network
Big Data
Bavarian State Library Digital Archive
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
5Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Data Silos
Increasing User Demand
6
I need to share a 400TB dataset
with someone in Canada!
My experiment will generate
multiple PBs, that have to be
analyzed and backed up! How?
I want to build a WebApp that allows
users to interactively analyze my
500TB SuperMUC simulation data!
I need to share
some data
on SuperMUC
between multiple
projects!
I want to analyze a large
dataset, generated on Super-
MUC, using some special OS
image on the LRZ Cloud!
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
7Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Satisfying User Demands
So basically we need to provide …
A file system that can be
shared amongst the complete
LRZ HPC Ecosystem
Some kind of external
access mechanism
for arbitrary entities
A Dropbox like
data management
approach
LRZ Data Science Storage
8Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Interactive processing
on LRZ Compute Cloud
Remote visualisation
on LRZs visualisation
systems
External access and sharing
via Globus Online
High performance backup
and archive of data on LRZs
Backup- and Archive System
Batch and interactive processing
on dedicated, hosted HPC Cluster at LRZ
High throughput batch processing
on LRZs Linux Cluster or SuperMUC
LRZ
Data
Science
Storage
IBM
Spectrum
SCALE
IBM
Spectrum
PROTECT
LRZ Identity
Managment
System
Globus
Mission Control
9Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
DSSWeb Self Service Portal
REST
API
CES Globus
Sharing
Globus
Connect
Server
RabbitMQ
Message
Bus
REST
API
Client
Manage-
ment
Service
REST API
Operations
Center
REST API
The Big Picture
10Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Huber
LMU User: lmuuser2
LinuxCluster SuperMUC
Project: lxpr2 Project: smpr2
User: lx22bp User: sm33sx
DSS Containers
11
Maier
TUM User: tumuser1
LinuxCluster SuperMUC
Project: lxpr1 Project: smpr1
User: lx11xc User: sm11bb
DSS POSIX Group in IDM/LDAP
pr45xa-dss-0000
DSS Container à GPFS Independent Fileset
/dss/dssfs01/pr45xa-dss-0000
drwxrws--- root pr45xa-dss-0000
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
Technical Integration of Globus to LRZ DSS
Goal
12
Integrate Globus Sharing to
DSSWeb Self-Service Portal.
Allow Data Curators to share
DSS Containers with
arbitrary external users.
Problem Action
Globus let’s us control.
Who can share?
What can be shared?
We need to control.
Who can share what?
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
LRZ Data Science Storage
Technical Integration of Globus to LRZ DSS
13
DSS Container X
Container Group
/dss/dssfs01/dsscontX
DSS Container Directory
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
DSSWeb
Globus Online
LRZ MyProxy
DSS Globus Endpoint
1. Enable Globus Sharing
for DSS Container X
Data Curator
RobotUser aka
RobotUser@globusid.org
2. Login
to
MyProxy to
get
Certificate
3. Enable DSS Globus Endpoint
4. Create Shared Endpoint “LRZ DSS Container X”
LRZ DSS Container X
Shared Endpoint
6. Add RobotUser to
Container Access Group
5. Globus Magic
Technical Integration of Globus to LRZ DSS
14Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
DSSWeb
1. Invite
bop@wherever.com to
access DSS Container X
via Globus
Data Curator
RobotUser aka
RobotUser@globusid.org
2. Check if identity bop@wherever.com is already
known by Globus and if not create it
3. Add Globus ACL for Shared Endpoint LRZ DSS
Container X for identity bop@wherever.com
4. Globus Magic
bop@wherever.com
5. Bop is happy
LRZ Data Science Storage
DSS Container X
Container Group
/dss/dssfs01/dsscontX
DSS Container Directory
DSS Globus Endpoint*
LRZ DSS Container X
Shared Endpoint
Globus Online
Legal Integration of Globus to LRZ DSS
Regulation
15
European Union enforced the
EU General Data Protection
Regulation (GDPR) on 2018-05-
25
Use/Integration of Cloud
Services that process PII
requires a formal Controller-
Processor Agreement.
Transfer of personal data to third
countries requires special
safeguards
HIPPA and NIST rescue BAA to the rescue
HIPPA and NIST require
roughly similar technical and
organizational security controls
that are required by GDPR to
protect PII
Globus agreed to sign a
Controller-Processor
Agreement that contains the
EU-Model Clauses
Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
16Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
1 of 16

Recommended

Hybrid cloud for science by
Hybrid cloud for science Hybrid cloud for science
Hybrid cloud for science Helix Nebula The Science Cloud
668 views11 slides
IBM Cloud & Helix Nebula by
IBM Cloud & Helix NebulaIBM Cloud & Helix Nebula
IBM Cloud & Helix NebulaHelix Nebula The Science Cloud
649 views10 slides
SC4 Hangout - Luigi Selmi, Transport pilot architecture by
SC4 Hangout - Luigi Selmi, Transport pilot architectureSC4 Hangout - Luigi Selmi, Transport pilot architecture
SC4 Hangout - Luigi Selmi, Transport pilot architectureBigData_Europe
858 views13 slides
Accelerating Spark with Kubernetes by
Accelerating Spark with KubernetesAccelerating Spark with Kubernetes
Accelerating Spark with KubernetesAlluxio, Inc.
477 views28 slides
"What's New With Globus" Webinar: Spring 2018 by
"What's New With Globus" Webinar: Spring 2018"What's New With Globus" Webinar: Spring 2018
"What's New With Globus" Webinar: Spring 2018Globus
270 views42 slides
OSIS19_Cloud : Des objets dans le cloud, et qui y restent -- L'expérience du ... by
OSIS19_Cloud : Des objets dans le cloud, et qui y restent -- L'expérience du ...OSIS19_Cloud : Des objets dans le cloud, et qui y restent -- L'expérience du ...
OSIS19_Cloud : Des objets dans le cloud, et qui y restent -- L'expérience du ...Pôle Systematic Paris-Region
271 views30 slides

More Related Content

What's hot

We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar? by
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?Nane Kratzke
424 views11 slides
Open Tracing, to order and understand your mess. - ApiConf 2017 by
Open Tracing, to order and understand your mess. - ApiConf 2017Open Tracing, to order and understand your mess. - ApiConf 2017
Open Tracing, to order and understand your mess. - ApiConf 2017Gianluca Arbezzano
1.3K views18 slides
Cloudviews eurocloud rcosta by
Cloudviews eurocloud rcostaCloudviews eurocloud rcosta
Cloudviews eurocloud rcostaEuroCloud
427 views18 slides
New web service oriented ARC by
New web service oriented ARCNew web service oriented ARC
New web service oriented ARCFerenc Szalai
285 views17 slides
Cloud Computing Research Projects by
Cloud Computing Research ProjectsCloud Computing Research Projects
Cloud Computing Research ProjectsPhdtopiccom
47 views5 slides
Approaches for duplicating Kubernetes Storage with Gluster by
Approaches for duplicating Kubernetes Storage with GlusterApproaches for duplicating Kubernetes Storage with Gluster
Approaches for duplicating Kubernetes Storage with Glustermountpoint.io
163 views21 slides

What's hot(18)

We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar? by Nane Kratzke
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
We have the Bricks to Build Cloud-native Cathedrals - But do we have the mortar?
Nane Kratzke424 views
Open Tracing, to order and understand your mess. - ApiConf 2017 by Gianluca Arbezzano
Open Tracing, to order and understand your mess. - ApiConf 2017Open Tracing, to order and understand your mess. - ApiConf 2017
Open Tracing, to order and understand your mess. - ApiConf 2017
Gianluca Arbezzano1.3K views
Cloudviews eurocloud rcosta by EuroCloud
Cloudviews eurocloud rcostaCloudviews eurocloud rcosta
Cloudviews eurocloud rcosta
EuroCloud427 views
New web service oriented ARC by Ferenc Szalai
New web service oriented ARCNew web service oriented ARC
New web service oriented ARC
Ferenc Szalai285 views
Cloud Computing Research Projects by Phdtopiccom
Cloud Computing Research ProjectsCloud Computing Research Projects
Cloud Computing Research Projects
Phdtopiccom47 views
Approaches for duplicating Kubernetes Storage with Gluster by mountpoint.io
Approaches for duplicating Kubernetes Storage with GlusterApproaches for duplicating Kubernetes Storage with Gluster
Approaches for duplicating Kubernetes Storage with Gluster
mountpoint.io163 views
KNIME Software Overview by KNIMESlides
KNIME Software OverviewKNIME Software Overview
KNIME Software Overview
KNIMESlides5.4K views
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De... by OpenNebula Project
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebulaConf2017EU: Enabling Dev and Infra teams by Lodewijk De Schuyter,De...
OpenNebula Project241 views
Scalable and reproducible workflows with Pachyderm by Jon Ander Novella
Scalable and reproducible workflows with PachydermScalable and reproducible workflows with Pachyderm
Scalable and reproducible workflows with Pachyderm
Jon Ander Novella530 views
Caching in the Cloud. Code Camp Iași April 2016. by Expert Network
Caching in the Cloud. Code Camp Iași April 2016. Caching in the Cloud. Code Camp Iași April 2016.
Caching in the Cloud. Code Camp Iași April 2016.
Expert Network465 views
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu by landoop
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan BocutiuMQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
landoop981 views
OpenNebula Conference 2014 by terradue
OpenNebula Conference 2014OpenNebula Conference 2014
OpenNebula Conference 2014
terradue960 views
What's new in confluent platform 5.4 online talk by confluent
What's new in confluent platform 5.4 online talkWhat's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talk
confluent552 views
On-node resource manager for containerized HPC workloads by GeoffroyVallee
On-node resource manager for containerized HPC workloadsOn-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloads
GeoffroyVallee27 views
What does Netflix, NTT and Rubicon Project have in common? Apache Druid. by Rommel Garcia
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
What does Netflix, NTT and Rubicon Project have in common? Apache Druid.
Rommel Garcia252 views
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze... by Flink Forward
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink for Everyone: Self Service Data Analytics with StreamPipes - Philipp Ze...
Flink Forward506 views

Similar to Integrating Globus into LRZ's Data Science Storage Service

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da... by
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...Abhinav Joshi
99 views26 slides
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source by
Liberate Your Files with a Private Cloud Storage Solution powered by Open SourceLiberate Your Files with a Private Cloud Storage Solution powered by Open Source
Liberate Your Files with a Private Cloud Storage Solution powered by Open SourceIsaac Christoffersen
1.8K views24 slides
OpenShift Meetup - Summit 2021 (Part 1) by
OpenShift Meetup - Summit 2021 (Part 1)OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)ConSol Consulting & Solutions Software GmbH
91 views29 slides
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat by
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red HatThe Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red HatOpenStack
4K views22 slides
Oscon 2017: Build your own container-based system with the Moby project by
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectPatrick Chanezon
4.1K views70 slides
From ECM to Content Services - Analyst Webinar by
From ECM to Content Services - Analyst WebinarFrom ECM to Content Services - Analyst Webinar
From ECM to Content Services - Analyst WebinarNuxeo
1.2K views25 slides

Similar to Integrating Globus into LRZ's Data Science Storage Service(20)

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da... by Abhinav Joshi
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
Abhinav Joshi99 views
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source by Isaac Christoffersen
Liberate Your Files with a Private Cloud Storage Solution powered by Open SourceLiberate Your Files with a Private Cloud Storage Solution powered by Open Source
Liberate Your Files with a Private Cloud Storage Solution powered by Open Source
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat by OpenStack
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red HatThe Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
OpenStack4K views
Oscon 2017: Build your own container-based system with the Moby project by Patrick Chanezon
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby project
Patrick Chanezon4.1K views
From ECM to Content Services - Analyst Webinar by Nuxeo
From ECM to Content Services - Analyst WebinarFrom ECM to Content Services - Analyst Webinar
From ECM to Content Services - Analyst Webinar
Nuxeo1.2K views
Orchestrating stateful applications with PKS and Portworx by VMware Tanzu
Orchestrating stateful applications with PKS and PortworxOrchestrating stateful applications with PKS and Portworx
Orchestrating stateful applications with PKS and Portworx
VMware Tanzu986 views
Accelerate Analytics and ML in the Hybrid Cloud Era by Alluxio, Inc.
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.321 views
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat... by Digicomp Academy AG
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
Swiss IPv6 Council – Case Study - Deployment von IPv6 in einer Container Plat...
IPv6 on Container Plattforms by Aarno Aukia
IPv6 on Container PlattformsIPv6 on Container Plattforms
IPv6 on Container Plattforms
Aarno Aukia141 views
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su... by Theofilos Kakantousis
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
End to-end ml pipelines with beam, flink, tensor flow, and hopsworks (beam su...
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing by Mark Hinkle
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud ComputingOSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
Mark Hinkle4.2K views
Comparison of control plane deployment architectures in the scope of hypercon... by Miroslav Halas
Comparison of control plane deployment architectures in the scope of hypercon...Comparison of control plane deployment architectures in the scope of hypercon...
Comparison of control plane deployment architectures in the scope of hypercon...
Miroslav Halas433 views
Scientific Computing @ Fred Hutch by Dirk Petersen
Scientific Computing @ Fred HutchScientific Computing @ Fred Hutch
Scientific Computing @ Fred Hutch
Dirk Petersen566 views
Cloud Native Application @ VMUG.IT 20150529 by VMUG IT
Cloud Native Application @ VMUG.IT 20150529Cloud Native Application @ VMUG.IT 20150529
Cloud Native Application @ VMUG.IT 20150529
VMUG IT1.1K views
High Availability - Brett Thurber - ManageIQ Design Summit 2016 by ManageIQ
High Availability - Brett Thurber - ManageIQ Design Summit 2016High Availability - Brett Thurber - ManageIQ Design Summit 2016
High Availability - Brett Thurber - ManageIQ Design Summit 2016
ManageIQ1.2K views
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS by Daniel Krook
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
Daniel Krook4.8K views
Democratizing Data Science on Kubernetes by John Archer
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
John Archer3.8K views

More from Globus

Introduction to Globus for System Administrators by
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
11 views55 slides
Introduction to Data Transfer and Sharing for Researchers by
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersGlobus
4 views33 slides
Introduction to the Globus Platform for Developers by
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersGlobus
4 views28 slides
Introduction to the Command Line Interface (CLI) by
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Globus
12 views12 slides
Automating Research Data with Globus Flows and Compute by
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeGlobus
6 views60 slides
Automating Research Data Flows and Introduction to the Globus Platform by
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformGlobus
50 views41 slides

More from Globus (20)

Introduction to Globus for System Administrators by Globus
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
Globus 11 views
Introduction to Data Transfer and Sharing for Researchers by Globus
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for Researchers
Globus 4 views
Introduction to the Globus Platform for Developers by Globus
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for Developers
Globus 4 views
Introduction to the Command Line Interface (CLI) by Globus
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)
Globus 12 views
Automating Research Data with Globus Flows and Compute by Globus
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and Compute
Globus 6 views
Automating Research Data Flows and Introduction to the Globus Platform by Globus
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus Platform
Globus 50 views
Advanced Globus System Administration by Globus
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
Globus 26 views
Introduction to Globus for System Administrators by Globus
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
Globus 94 views
Introduction to Globus for New Users by Globus
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
Globus 55 views
Working with Globus Platform Services and Portals by Globus
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and Portals
Globus 28 views
Globus Automation by Globus
Globus AutomationGlobus Automation
Globus Automation
Globus 23 views
Advanced Globus System Administration by Globus
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
Globus 21 views
Introduction to Globus by Globus
Introduction to GlobusIntroduction to Globus
Introduction to Globus
Globus 43 views
Introduction to Globus for System Administrators by Globus
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
Globus 27 views
Working with Globus Platform Services by Globus
Working with Globus Platform ServicesWorking with Globus Platform Services
Working with Globus Platform Services
Globus 41 views
Advanced Globus System Administration by Globus
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
Globus 29 views
Introduction to Globus for System Administrators by Globus
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
Globus 145 views
Using Globus to Streamline Research at Scale by Globus
Using Globus to Streamline Research at ScaleUsing Globus to Streamline Research at Scale
Using Globus to Streamline Research at Scale
Globus 30 views
Introduction to Globus for Researchers by Globus
Introduction to Globus for ResearchersIntroduction to Globus for Researchers
Introduction to Globus for Researchers
Globus 89 views
Automating Research Data Flows and an Introduction to the Globus Platform by Globus
Automating Research Data Flows and an Introduction to the Globus PlatformAutomating Research Data Flows and an Introduction to the Globus Platform
Automating Research Data Flows and an Introduction to the Globus Platform
Globus 132 views

Recently uploaded

Vertical User Stories by
Vertical User StoriesVertical User Stories
Vertical User StoriesMoisés Armani Ramírez
12 views16 slides
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensorssugiuralab
19 views15 slides
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
257 views86 slides
Piloting & Scaling Successfully With Microsoft Viva by
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft VivaRichard Harbridge
12 views160 slides
Tunable Laser (1).pptx by
Tunable Laser (1).pptxTunable Laser (1).pptx
Tunable Laser (1).pptxHajira Mahmood
24 views37 slides
Kyo - Functional Scala 2023.pdf by
Kyo - Functional Scala 2023.pdfKyo - Functional Scala 2023.pdf
Kyo - Functional Scala 2023.pdfFlavio W. Brasil
298 views92 slides

Recently uploaded(20)

TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab19 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software257 views
Piloting & Scaling Successfully With Microsoft Viva by Richard Harbridge
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft Viva
DALI Basics Course 2023 by Ivory Egg
DALI Basics Course  2023DALI Basics Course  2023
DALI Basics Course 2023
Ivory Egg16 views
1st parposal presentation.pptx by i238212
1st parposal presentation.pptx1st parposal presentation.pptx
1st parposal presentation.pptx
i2382129 views
Spesifikasi Lengkap ASUS Vivobook Go 14 by Dot Semarang
Spesifikasi Lengkap ASUS Vivobook Go 14Spesifikasi Lengkap ASUS Vivobook Go 14
Spesifikasi Lengkap ASUS Vivobook Go 14
Dot Semarang37 views
Voice Logger - Telephony Integration Solution at Aegis by Nirmal Sharma
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at Aegis
Nirmal Sharma31 views
Data-centric AI and the convergence of data and model engineering: opportunit... by Paolo Missier
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier39 views
Unit 1_Lecture 2_Physical Design of IoT.pdf by StephenTec
Unit 1_Lecture 2_Physical Design of IoT.pdfUnit 1_Lecture 2_Physical Design of IoT.pdf
Unit 1_Lecture 2_Physical Design of IoT.pdf
StephenTec12 views
AMAZON PRODUCT RESEARCH.pdf by JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta19 views
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana16 views
6g - REPORT.pdf by Liveplex
6g - REPORT.pdf6g - REPORT.pdf
6g - REPORT.pdf
Liveplex10 views

Integrating Globus into LRZ's Data Science Storage Service

  • 2. 2 Integrating Globus into LRZ’s Data Science Storage Service GlobusWorld 2019 | 2019-05-01 | Stephan Peinkofer Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 3. 3Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Bavarian Academy of Sciences and Humanities Leibniz Supercomputing Centre Computer Centre for all Munich Universities250 employees approx. 57 years of IT support IT Service Backbone for the Advancement of Science and Research Regional Computer Centre for all Bavarian Universities National Supercomputing Centre (GCS) European Supercomputing Centre (PRACE)
  • 4. High Performance Computing SuperMUC-NG, LRZ Linux Cluster Virtual Reality and Visualisation V2C (CAVE, Powerwall) 4 Operating Cutting-Edge IT Infrastructure LRZ as an IT Center of Excellence Storage Network Cloud Computing Cluster HPC Training Consultancy Email High Speed Networking Munich Scientific Network Big Data Bavarian State Library Digital Archive Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 5. 5Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Data Silos
  • 6. Increasing User Demand 6 I need to share a 400TB dataset with someone in Canada! My experiment will generate multiple PBs, that have to be analyzed and backed up! How? I want to build a WebApp that allows users to interactively analyze my 500TB SuperMUC simulation data! I need to share some data on SuperMUC between multiple projects! I want to analyze a large dataset, generated on Super- MUC, using some special OS image on the LRZ Cloud! Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 7. 7Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Satisfying User Demands So basically we need to provide … A file system that can be shared amongst the complete LRZ HPC Ecosystem Some kind of external access mechanism for arbitrary entities A Dropbox like data management approach
  • 8. LRZ Data Science Storage 8Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer Interactive processing on LRZ Compute Cloud Remote visualisation on LRZs visualisation systems External access and sharing via Globus Online High performance backup and archive of data on LRZs Backup- and Archive System Batch and interactive processing on dedicated, hosted HPC Cluster at LRZ High throughput batch processing on LRZs Linux Cluster or SuperMUC LRZ Data Science Storage
  • 9. IBM Spectrum SCALE IBM Spectrum PROTECT LRZ Identity Managment System Globus Mission Control 9Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer DSSWeb Self Service Portal REST API CES Globus Sharing Globus Connect Server RabbitMQ Message Bus REST API Client Manage- ment Service REST API Operations Center REST API
  • 10. The Big Picture 10Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 11. Huber LMU User: lmuuser2 LinuxCluster SuperMUC Project: lxpr2 Project: smpr2 User: lx22bp User: sm33sx DSS Containers 11 Maier TUM User: tumuser1 LinuxCluster SuperMUC Project: lxpr1 Project: smpr1 User: lx11xc User: sm11bb DSS POSIX Group in IDM/LDAP pr45xa-dss-0000 DSS Container à GPFS Independent Fileset /dss/dssfs01/pr45xa-dss-0000 drwxrws--- root pr45xa-dss-0000 Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 12. Technical Integration of Globus to LRZ DSS Goal 12 Integrate Globus Sharing to DSSWeb Self-Service Portal. Allow Data Curators to share DSS Containers with arbitrary external users. Problem Action Globus let’s us control. Who can share? What can be shared? We need to control. Who can share what? Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 13. LRZ Data Science Storage Technical Integration of Globus to LRZ DSS 13 DSS Container X Container Group /dss/dssfs01/dsscontX DSS Container Directory Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer DSSWeb Globus Online LRZ MyProxy DSS Globus Endpoint 1. Enable Globus Sharing for DSS Container X Data Curator RobotUser aka RobotUser@globusid.org 2. Login to MyProxy to get Certificate 3. Enable DSS Globus Endpoint 4. Create Shared Endpoint “LRZ DSS Container X” LRZ DSS Container X Shared Endpoint 6. Add RobotUser to Container Access Group 5. Globus Magic
  • 14. Technical Integration of Globus to LRZ DSS 14Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer DSSWeb 1. Invite bop@wherever.com to access DSS Container X via Globus Data Curator RobotUser aka RobotUser@globusid.org 2. Check if identity bop@wherever.com is already known by Globus and if not create it 3. Add Globus ACL for Shared Endpoint LRZ DSS Container X for identity bop@wherever.com 4. Globus Magic bop@wherever.com 5. Bop is happy LRZ Data Science Storage DSS Container X Container Group /dss/dssfs01/dsscontX DSS Container Directory DSS Globus Endpoint* LRZ DSS Container X Shared Endpoint Globus Online
  • 15. Legal Integration of Globus to LRZ DSS Regulation 15 European Union enforced the EU General Data Protection Regulation (GDPR) on 2018-05- 25 Use/Integration of Cloud Services that process PII requires a formal Controller- Processor Agreement. Transfer of personal data to third countries requires special safeguards HIPPA and NIST rescue BAA to the rescue HIPPA and NIST require roughly similar technical and organizational security controls that are required by GDPR to protect PII Globus agreed to sign a Controller-Processor Agreement that contains the EU-Model Clauses Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer
  • 16. 16Integrating Globus into LRZ’s Data Science Storage Service | 2019-05-01 | Stephan Peinkofer