SlideShare a Scribd company logo
1 of 53
Download to read offline
Globus for System Administrators
Vas Vasiliadis
vas@uchicago.edu
Columbia University – April 24, 2019
Globus Connect Server
2
• Makes your storage accessible via Globus
• Multi-user server, installed and managed by sysadmin
docs.globus.org/globus-connect-server-installation-guide/
Local system users
Local Storage System
(HPC cluster, NAS, …)
Globus Connect Server
MyProxy
CA
GridFTP
Server
OAuth
Server
DTN
• Default access for
all local accounts
• Native packaging
Linux: DEB, RPM
Globus Connect Server
3
Local system users
Local Storage System
(HPC cluster, NAS, …)
Globus Connect Server
MyProxy
CA
GridFTP
Server
OAuth
Server
DTN
Non-POSIX Connectors
POSIX-compliant Connector
server
IBM Spectrum Scale
Current Planned
Storage Connectors - globus.org/connectors
ActiveScale
Creating a Globus endpoint on your server
• In this example, Server = Amazon EC2 instance
• Installation and configuration of Globus Connect
Server requires a Globus ID
• Go to globusid.org
• Click “create a Globus ID”
– Optional: associate it with your Globus account
5
What we are going to do:
Install Globus Connect Server
• Access server as user “campusadmin”
• Update repo
• Install package
• Setup Globus Connect Server
Server
(AWS EC2)
ssh
ec2-22-23-24-25
Log into Globus
Transfer a file
1
2
3
Access the newly created
endpoint (as user ‘researcher’)
4
6
Access your server
• Get the IP address for your EC2 server
• Log in as user ‘campusadmin’
ssh campusadmin@<EC2_instance_IP_address>
• Please sudo su before continuing
– User ‘campusadmin’ has passwordless sudo privileges
7
Install Globus Connect Server
$ sudo su
$ curl –LOs
http://downloads.globus.org/toolkit/globus-connect-
server/globus-connect-server-repo_latest_all.deb
$ dpkg –i globus-connect-server-repo_latest_all.deb
$ apt-get update
$ apt-get -y install globus-connect-server
$ globus-connect-server-setup
You have a working Globus endpoint!
Use your Globus ID username and
password when prompted
Access the Globus endpoint
• Go to Manage Data à Transfer Files
• Access the endpoint you just created
– Search for your EC2 host name in the Endpoint field
– Log in as “researcher”; you will see the user’s home directory
• Transfer files between a test endpoint (e.g. ESnet
read-only) and your EC2 endpoint
9
Globus accounts and endpoint access
• Globus account: Primary identity (+ Linked Identities)
• Endpoint initially accessible by creator
• Endpoint not visible?
– Primary identity is your institutional ID?
– Link your Globus ID!
Configuring Globus
Connect Server
11
Endpoint configuration
• On the Globus service: app.globus.org/endpoints
• On your DTN: /etc/globus-connect-server.conf
– Standard .ini format: [Section] Option = Value
– To enable changes run globus-connect-server-setup
– “Rinse and repeat”
12
Common configuration options
• Endpoints page
– Display Name
– Visibility
– Encryption
• DTN configuration file
– RestrictPaths
– Sharing
– IdentityMethod (CILogon, Oauth)
– SharingRestrictPaths
Exercise: Make your endpoint visible
• Edit endpoint attributes
– Change the name to something useful, e.g. <your_name> EC2
Endpoint
– For the “Visible To” attribute select “Public - Visible to all users”
• Find your neighbor’s endpoint
– Thanks to our superb security …you can access it too J
14
Path Restriction
• Default configuration:
– All paths allowed, access control handled by the OS
• Use RestrictPaths to customize
– Specifies a comma separated list of full paths that clients may access
– Each path may be prefixed by R (read) and/or W (write), or N (none) to explicitly
deny access to a path
– '~’ for authenticated user’s home directory, and * may be used for simple
wildcard matching.
• e.g. Full access to home directory, read access to /data:
– RestrictPaths = RW~,R/data
• e.g. Full access to home directory, deny hidden files:
– RestrictPaths = RW~,N~/.*
15
Exercise: Restrict access
• Set RestrictPaths=RW~,N~/archive
• Run globus-connect-server-setup
• Access your endpoint as ‘researcher’
• What’s changed?
16
• In config file, set Sharing=True
• Run globus-connect-server-setup
• Flag endpoint as “managed” (in web app or via CLI)
* Note: Creation of shared endpoints requires a
Globus subscription for the managed endpoint
Enabling sharing on an endpoint
17
Limit sharing to specific accounts
• SharingUsersAllow =
• SharingGroupsAllow =
• SharingUsersDeny =
• SharingGroupsDeny =
18
Sharing Path Restriction
• Restrict paths where users can create shared endpoints
• Use SharingRestrictPaths to customize
– Same syntax as RestrictPaths
• e.g. Full access to home directory, deny hidden files:
– SharingRestrictPaths = RW~,N~/.*
• e.g. Full access to public folder under home directory:
– SharingRestrictPaths = RW~/public
• e.g. Full access to /proj, read access to /scratch:
– SharingRestrictPaths = RW/proj,R/scratch
19
Endpoint Access
Control/Activation
20
Ports needed for Globus
• Inbound: 2811 (control channel)
• Inbound: 7512 (MyProxy), 443 (OAuth)
• Inbound: 50000-51000 (data channel)
• If restricting outbound connections, allow
connections on:
– 80, 2223 (used during install/config)
– 50000-51000 (GridFTP data channel)
21
EndpointactivationusingMyProxy
Default
configuration
(avoid if at all
possible)
EndpointactivationusingMyProxyOAuth
Best practice
configuration
Single Sign-On with InCommon/CILogon
• Your Shibboleth server must release R&S attributes to
CILogon—especially the ePPN attribute
• Local account must match institutional ID (InCommon ID)
– Test by creating a local user with same name
• In /etc/globus-connect-server.conf set:
AuthorizationMethod = CILogon
CILogonIdentityProvider =
<institution_listed_in_CILogon_IdP_list>
24
Managed endpoints and
subscriptions
25
Subscription configuration
• Subscription manager
– Create/upgrade managed endpoints
– Requires Globus ID linked to Globus account
• Management console permissions
– Independent of subscription manager
– Map managed endpoint to Globus ID
• Globus Plus group
– Subscription Manager is admin
– Can grant admin rights to other members
26
Creating managed endpoints
• Required for sharing, management console, reporting, …
• Convert existing endpoint to managed via CLI (or web):
globus endpoint update --managed <endpt_uuid>
• Must be run by subscription manager
• Important: Re-run endpoint update after deleting/re-
creating endpoint
27
Monitoring and managing
Globus endpoint activity
28
Management console
• Monitor all transfers
• Pause/resume specific transfers
• Add pause conditions with various options
• Resume specific tasks overriding pause conditions
• Cancel tasks
• View sharing ACLs
29
Endpoint Roles
• Administrator: define endpoint and roles
• Access Manager: manage permissions
• Activity Manager: perform control tasks
• Activity Monitor: view activity
30
Demonstration:
Management console
Endpoint Roles
Usage Reporting
31
…on performance
32
Balance: performance - reliability
• Network use parameters: concurrency, parallelism
• Maximum, Preferred values for each
• Transfer considers source and destination endpoint settings
min(
max(preferred src, preferred dest),
max src,
max dest
)
• Service limits, e.g. concurrent requests
33
Illustrative
performance
Disk-to-Disk Throughput: ESnet Testing
0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000
scp
scp (w/HPN)
sftp
GridFTP
(1 stream)
GridFTP
(4 streams)
Disk-to-Disk Throughput (Mbps)
35Source: ESnet (2016)
• Berkeley, CA to Argonne, IL
(RTT: 53 ms, Capacity: 10Gbps)
• scp is 24x slower than GridFTP on
this path
• >1 Gbps (125 MB/s) disk-to-disk
requires RAID array
Deployment Scenarios
36
Legacy Architecture
10GE
Border Router
WAN
Firewall
Enterprise
perfSONAR
perfSONAR
Filesystem
(data store)
10GE
Portal
Server
Browsing path
Query path
Data path
Portal server applications:
· web server
· search
· database
· authentication
· data service
Current best practice
10GE10GE
10GE
10GE
Border Router
WAN
Science DMZ
Switch/Router
Firewall
Enterprise
perfSONAR
perfSONAR
10GE
10GE
10GE
10GE
DTN
DTN
API DTNs
(data access governed
by portal)
DTN
DTN
perfSONAR
Filesystem
(data store)
10GE
Portal
Server
Browsing path
Query path
Portal server applications:
· web server
· search
· database
· authentication
Data Path
Data Transfer Path
Portal Query/Browse Path
Science DMZ configuration
39
Source
security
filters
Destination
security
filters
Destination
Science DMZ
Source
Science DMZ
Source
Border Router
Destination
Border Router
Source Router Destination Router
User
Organization
DATA
CONTROL
Physical Control Path
Logical Control Path
Physical Data Path
Logical Data Path
* Ports 443,
2811, 7512
* Ports 50000-
51000
Data Transfer
Node (DTN)
Data Transfer
Node (DTN)
* Please see TCP ports reference: https://docs.globus.org/resource-provider-guide/#open-tcp-ports_section
ext*
XFS
ZFS
~/
~/scratch
Common endpoint configuration (GCSv4)
Data Transfer Node
POSIX
Connector
ext*
XFS
ZFS
GPFS
Lustre
~/projects
Common endpoint configuration (GCSv4)
Data Transfer Node
POSIX
Connector
~/
~/scratch
ext*
XFS
ZFS
GPFS
Lustre
~/projects
Multi-endpoint configuration (GCSv4)
Data Transfer Node
POSIX
Connector
Western
Digital
ActiveScale
Connector
~/archive
~/
~/scratch
~/vault
ext*
XFS
ZFS
GPFS
Lustre
~/projects
Multi-endpoint configuration (GCSv4)
43
Data Transfer Node
POSIX
Connector
Western
Digital
ActiveScale
Connector
~/archive
Amazon
S3
Bucket
Amazon S3
Connector
~/
~/scratch
Data
Control
Object Store /
Archive
Deploying a premium connector gateway
server
Object Store/
Archive Connector
Globus endpoint
(e.g. WD ActiveScale gateway)
Other
Globus
endpoints
Globus
Share
Globus
Share
Network paths
• Separate control and data interfaces
• "DataInterface =" option in globus-connect-server-
conf
• Common scenario: route data flows over Science
DMZ link
45
Dual-homed DTN – high speed data path
Data
Transfer
Node
GridFTP
Server
Science DMZ
Control
Channel
Data
Transfer
Node
GridFTP
Server
Data Channel
if0
if1
Internet2
path
Control
Channel
Dual-homed DTN – internal data path
Data
Transfer
Node
GridFTP
Server
Science DMZ
Control
Channel
Data
Transfer
Node
GridFTP
Server
Data Channel
if0
if1
LAN/
Intranet
path
Control
Channel
Firewall
if0
if1
Other Deployment Options
48
Encryption
• Requiring encryption on an endpoint
– User cannot override
– Useful for “sensitive” data
• Globus uses OpenSSL cipher stack as currently
configured on your DTN
• FIPS 140-2 compliance: ensure use of FIPS capable
OpenSSL libraries on DTN
www.openssl.org/docs/fips/UserGuide-2.0.pdf
49
Distributing Globus Connect Server components
• Globus Connect Server components
– globus-connect-server-io, -id, -web
• Default: -io, –id and –web on single server
• Common options
– Multiple –io servers for load balancing, failover, and
performance
– No -id server, e.g. third-party IdP
– -id on separate server, e.g. non-DTN nodes
– -web on either –id server or separate server for OAuth interface
50
ext*
XFS
ZFS
Distributing Globus Connect Server components
Data
Transfer
Node
OAuth
Server
GridFTP
Server
MyProxy
CA
Science DMZ
(ACL limited)
Port 2811
accepts inbound
connections
from Globus
Firewall
Setting up multiple –io servers
• Guidelines
– Use the same .conf file on all servers
– First install on the server running the –id component, then all others
• Install Globus Connect Server on all servers
• Edit .conf file on one of the servers and set [MyProxy] Server to the hostname
of the server you want the –id component installed on
• Copy Globus Connect Server configuration file to all servers
• Run globus-connect-server-setup on the server running the –id component
• Run globus-connect-server-setup on all other servers
• Repeat steps 2-5 as necessary to update configurations
52
Example: Two-node DTN
53
-id
-io
-io
On other DTN nodes:
/etc/globus-connect-server.conf
[Endpoint] Name = globus_dtn
[MyProxy] Server = 34.20.29.57
On “primary” DTN node (34.20.29.57):
/etc/globus-connect-server.conf
[Endpoint] Name = globus_dtn
[MyProxy] Server = 34.20.29.57

More Related Content

What's hot

Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)Globus
 
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobus
 
Asynchronous Replication for PostgreSQL Slony
Asynchronous Replication for PostgreSQL SlonyAsynchronous Replication for PostgreSQL Slony
Asynchronous Replication for PostgreSQL Slonyelliando dias
 
HBaseConEast2016: Practical Kerberos with Apache HBase
HBaseConEast2016: Practical Kerberos with Apache HBaseHBaseConEast2016: Practical Kerberos with Apache HBase
HBaseConEast2016: Practical Kerberos with Apache HBaseMichael Stack
 
Linux containers and docker
Linux containers and dockerLinux containers and docker
Linux containers and dockerFabio Fumarola
 
PostgreSQL High Availability via SLONY and PG POOL II
PostgreSQL High Availability via SLONY and PG POOL IIPostgreSQL High Availability via SLONY and PG POOL II
PostgreSQL High Availability via SLONY and PG POOL IICommand Prompt., Inc
 
LuSql: (Quickly and easily) Getting your data from your DBMS into Lucene
LuSql: (Quickly and easily) Getting your data from your DBMS into LuceneLuSql: (Quickly and easily) Getting your data from your DBMS into Lucene
LuSql: (Quickly and easily) Getting your data from your DBMS into Luceneeby
 
Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)Globus
 
From Kubernetes to OpenStack in Sydney
From Kubernetes to OpenStack in SydneyFrom Kubernetes to OpenStack in Sydney
From Kubernetes to OpenStack in SydneySK Telecom
 
Cross-Site BigTable using HBase
Cross-Site BigTable using HBaseCross-Site BigTable using HBase
Cross-Site BigTable using HBaseHBaseCon
 
Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Globus
 
LibX 2.0
LibX 2.0LibX 2.0
LibX 2.0eby
 
Jsonnet, terraform & packer
Jsonnet, terraform & packerJsonnet, terraform & packer
Jsonnet, terraform & packerDavid Cunningham
 
djatoka for djummies
djatoka for djummiesdjatoka for djummies
djatoka for djummieseby
 
Kubernetes Walk Through from Technical View
Kubernetes Walk Through from Technical ViewKubernetes Walk Through from Technical View
Kubernetes Walk Through from Technical ViewLei (Harry) Zhang
 
Best Practices for Data Sharing (GlobusWorld Tour - UCSD)
Best Practices for Data Sharing (GlobusWorld Tour - UCSD)Best Practices for Data Sharing (GlobusWorld Tour - UCSD)
Best Practices for Data Sharing (GlobusWorld Tour - UCSD)Globus
 
Lisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-onLisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-onGluster.org
 

What's hot (20)

Docker volume
Docker volumeDocker volume
Docker volume
 
Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)
 
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
 
Asynchronous Replication for PostgreSQL Slony
Asynchronous Replication for PostgreSQL SlonyAsynchronous Replication for PostgreSQL Slony
Asynchronous Replication for PostgreSQL Slony
 
HBaseConEast2016: Practical Kerberos with Apache HBase
HBaseConEast2016: Practical Kerberos with Apache HBaseHBaseConEast2016: Practical Kerberos with Apache HBase
HBaseConEast2016: Practical Kerberos with Apache HBase
 
Linux containers and docker
Linux containers and dockerLinux containers and docker
Linux containers and docker
 
PostgreSQL High Availability via SLONY and PG POOL II
PostgreSQL High Availability via SLONY and PG POOL IIPostgreSQL High Availability via SLONY and PG POOL II
PostgreSQL High Availability via SLONY and PG POOL II
 
LuSql: (Quickly and easily) Getting your data from your DBMS into Lucene
LuSql: (Quickly and easily) Getting your data from your DBMS into LuceneLuSql: (Quickly and easily) Getting your data from your DBMS into Lucene
LuSql: (Quickly and easily) Getting your data from your DBMS into Lucene
 
Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)
 
From Kubernetes to OpenStack in Sydney
From Kubernetes to OpenStack in SydneyFrom Kubernetes to OpenStack in Sydney
From Kubernetes to OpenStack in Sydney
 
Cross-Site BigTable using HBase
Cross-Site BigTable using HBaseCross-Site BigTable using HBase
Cross-Site BigTable using HBase
 
Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)
 
LibX 2.0
LibX 2.0LibX 2.0
LibX 2.0
 
Jsonnet, terraform & packer
Jsonnet, terraform & packerJsonnet, terraform & packer
Jsonnet, terraform & packer
 
djatoka for djummies
djatoka for djummiesdjatoka for djummies
djatoka for djummies
 
ClickHouse Keeper
ClickHouse KeeperClickHouse Keeper
ClickHouse Keeper
 
Kubernetes Walk Through from Technical View
Kubernetes Walk Through from Technical ViewKubernetes Walk Through from Technical View
Kubernetes Walk Through from Technical View
 
Best Practices for Data Sharing (GlobusWorld Tour - UCSD)
Best Practices for Data Sharing (GlobusWorld Tour - UCSD)Best Practices for Data Sharing (GlobusWorld Tour - UCSD)
Best Practices for Data Sharing (GlobusWorld Tour - UCSD)
 
Lisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-onLisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-on
 
Kubernetes Node Deep Dive
Kubernetes Node Deep DiveKubernetes Node Deep Dive
Kubernetes Node Deep Dive
 

Similar to Globus for System Administrators (GlobusWorld Tour - Columbia University)

Globus Endpoint Administration (GlobusWorld Tour - STFC)
Globus Endpoint Administration (GlobusWorld Tour - STFC)Globus Endpoint Administration (GlobusWorld Tour - STFC)
Globus Endpoint Administration (GlobusWorld Tour - STFC)Globus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System AdministratorsGlobus
 
Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration TopicsGlobus
 
GlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 
Data Publication and Discovery with Globus
Data Publication and Discovery with GlobusData Publication and Discovery with Globus
Data Publication and Discovery with GlobusGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System AdministratorsGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 
Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)
Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)
Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)Globus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System AdministratorsGlobus
 
Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)Globus
 
Automating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)
Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)
Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)Globus
 
Automating Research Data Flows and an Introduction to the Globus Platform
Automating Research Data Flows and an Introduction to the Globus PlatformAutomating Research Data Flows and an Introduction to the Globus Platform
Automating Research Data Flows and an Introduction to the Globus PlatformGlobus
 
Jupyter + Globus: The Foundation for Interactive Data Science
Jupyter + Globus: The Foundation for Interactive Data ScienceJupyter + Globus: The Foundation for Interactive Data Science
Jupyter + Globus: The Foundation for Interactive Data ScienceGlobus
 

Similar to Globus for System Administrators (GlobusWorld Tour - Columbia University) (20)

Globus Endpoint Administration (GlobusWorld Tour - STFC)
Globus Endpoint Administration (GlobusWorld Tour - STFC)Globus Endpoint Administration (GlobusWorld Tour - STFC)
Globus Endpoint Administration (GlobusWorld Tour - STFC)
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System Administrators
 
Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration Topics
 
GlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System Administrators
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 
Data Publication and Discovery with Globus
Data Publication and Discovery with GlobusData Publication and Discovery with Globus
Data Publication and Discovery with Globus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System Administrators
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 
Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)
Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)
Leveraging the Globus Platform in Web Applications (CHPC 2019 - South Africa)
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System Administrators
 
Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)
 
Automating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus Platform
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)
Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)
Leveraging the Globus Platform (GlobusWorld Tour - Columbia University)
 
Automating Research Data Flows and an Introduction to the Globus Platform
Automating Research Data Flows and an Introduction to the Globus PlatformAutomating Research Data Flows and an Introduction to the Globus Platform
Automating Research Data Flows and an Introduction to the Globus Platform
 
Jupyter + Globus: The Foundation for Interactive Data Science
Jupyter + Globus: The Foundation for Interactive Data ScienceJupyter + Globus: The Foundation for Interactive Data Science
Jupyter + Globus: The Foundation for Interactive Data Science
 

More from Globus

Instrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowInstrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowGlobus
 
Building Research Applications with Globus PaaS
Building Research Applications with Globus PaaSBuilding Research Applications with Globus PaaS
Building Research Applications with Globus PaaSGlobus
 
Reliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesReliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesGlobus
 
Best Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusBest Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusGlobus
 
An Introduction to Globus for Researchers
An Introduction to Globus for ResearchersAn Introduction to Globus for Researchers
An Introduction to Globus for ResearchersGlobus
 
Introduction to Research Automation with Globus
Introduction to Research Automation with GlobusIntroduction to Research Automation with Globus
Introduction to Research Automation with GlobusGlobus
 
Introduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersGlobus
 
Introduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersGlobus
 
Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Globus
 
Automating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeGlobus
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New UsersGlobus
 
Working with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsGlobus
 
Globus Automation
Globus AutomationGlobus Automation
Globus AutomationGlobus
 
Introduction to Globus
Introduction to GlobusIntroduction to Globus
Introduction to GlobusGlobus
 
Working with Globus Platform Services
Working with Globus Platform ServicesWorking with Globus Platform Services
Working with Globus Platform ServicesGlobus
 
Using Globus to Streamline Research at Scale
Using Globus to Streamline Research at ScaleUsing Globus to Streamline Research at Scale
Using Globus to Streamline Research at ScaleGlobus
 
Introduction to Globus for Researchers
Introduction to Globus for ResearchersIntroduction to Globus for Researchers
Introduction to Globus for ResearchersGlobus
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New UsersGlobus
 
Globus Endpoint Migration and Advanced Administration Topics
Globus Endpoint Migration and Advanced Administration TopicsGlobus Endpoint Migration and Advanced Administration Topics
Globus Endpoint Migration and Advanced Administration TopicsGlobus
 

More from Globus (19)

Instrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowInstrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a Flow
 
Building Research Applications with Globus PaaS
Building Research Applications with Globus PaaSBuilding Research Applications with Globus PaaS
Building Research Applications with Globus PaaS
 
Reliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesReliable, Remote Computation at All Scales
Reliable, Remote Computation at All Scales
 
Best Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusBest Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using Globus
 
An Introduction to Globus for Researchers
An Introduction to Globus for ResearchersAn Introduction to Globus for Researchers
An Introduction to Globus for Researchers
 
Introduction to Research Automation with Globus
Introduction to Research Automation with GlobusIntroduction to Research Automation with Globus
Introduction to Research Automation with Globus
 
Introduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for Researchers
 
Introduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for Developers
 
Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)
 
Automating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and Compute
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
 
Working with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and Portals
 
Globus Automation
Globus AutomationGlobus Automation
Globus Automation
 
Introduction to Globus
Introduction to GlobusIntroduction to Globus
Introduction to Globus
 
Working with Globus Platform Services
Working with Globus Platform ServicesWorking with Globus Platform Services
Working with Globus Platform Services
 
Using Globus to Streamline Research at Scale
Using Globus to Streamline Research at ScaleUsing Globus to Streamline Research at Scale
Using Globus to Streamline Research at Scale
 
Introduction to Globus for Researchers
Introduction to Globus for ResearchersIntroduction to Globus for Researchers
Introduction to Globus for Researchers
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
 
Globus Endpoint Migration and Advanced Administration Topics
Globus Endpoint Migration and Advanced Administration TopicsGlobus Endpoint Migration and Advanced Administration Topics
Globus Endpoint Migration and Advanced Administration Topics
 

Recently uploaded

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in collegessuser7a7cd61
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 

Recently uploaded (20)

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 

Globus for System Administrators (GlobusWorld Tour - Columbia University)

  • 1. Globus for System Administrators Vas Vasiliadis vas@uchicago.edu Columbia University – April 24, 2019
  • 2. Globus Connect Server 2 • Makes your storage accessible via Globus • Multi-user server, installed and managed by sysadmin docs.globus.org/globus-connect-server-installation-guide/ Local system users Local Storage System (HPC cluster, NAS, …) Globus Connect Server MyProxy CA GridFTP Server OAuth Server DTN • Default access for all local accounts • Native packaging Linux: DEB, RPM
  • 3. Globus Connect Server 3 Local system users Local Storage System (HPC cluster, NAS, …) Globus Connect Server MyProxy CA GridFTP Server OAuth Server DTN Non-POSIX Connectors POSIX-compliant Connector server
  • 4. IBM Spectrum Scale Current Planned Storage Connectors - globus.org/connectors ActiveScale
  • 5. Creating a Globus endpoint on your server • In this example, Server = Amazon EC2 instance • Installation and configuration of Globus Connect Server requires a Globus ID • Go to globusid.org • Click “create a Globus ID” – Optional: associate it with your Globus account 5
  • 6. What we are going to do: Install Globus Connect Server • Access server as user “campusadmin” • Update repo • Install package • Setup Globus Connect Server Server (AWS EC2) ssh ec2-22-23-24-25 Log into Globus Transfer a file 1 2 3 Access the newly created endpoint (as user ‘researcher’) 4 6
  • 7. Access your server • Get the IP address for your EC2 server • Log in as user ‘campusadmin’ ssh campusadmin@<EC2_instance_IP_address> • Please sudo su before continuing – User ‘campusadmin’ has passwordless sudo privileges 7
  • 8. Install Globus Connect Server $ sudo su $ curl –LOs http://downloads.globus.org/toolkit/globus-connect- server/globus-connect-server-repo_latest_all.deb $ dpkg –i globus-connect-server-repo_latest_all.deb $ apt-get update $ apt-get -y install globus-connect-server $ globus-connect-server-setup You have a working Globus endpoint! Use your Globus ID username and password when prompted
  • 9. Access the Globus endpoint • Go to Manage Data à Transfer Files • Access the endpoint you just created – Search for your EC2 host name in the Endpoint field – Log in as “researcher”; you will see the user’s home directory • Transfer files between a test endpoint (e.g. ESnet read-only) and your EC2 endpoint 9
  • 10. Globus accounts and endpoint access • Globus account: Primary identity (+ Linked Identities) • Endpoint initially accessible by creator • Endpoint not visible? – Primary identity is your institutional ID? – Link your Globus ID!
  • 12. Endpoint configuration • On the Globus service: app.globus.org/endpoints • On your DTN: /etc/globus-connect-server.conf – Standard .ini format: [Section] Option = Value – To enable changes run globus-connect-server-setup – “Rinse and repeat” 12
  • 13. Common configuration options • Endpoints page – Display Name – Visibility – Encryption • DTN configuration file – RestrictPaths – Sharing – IdentityMethod (CILogon, Oauth) – SharingRestrictPaths
  • 14. Exercise: Make your endpoint visible • Edit endpoint attributes – Change the name to something useful, e.g. <your_name> EC2 Endpoint – For the “Visible To” attribute select “Public - Visible to all users” • Find your neighbor’s endpoint – Thanks to our superb security …you can access it too J 14
  • 15. Path Restriction • Default configuration: – All paths allowed, access control handled by the OS • Use RestrictPaths to customize – Specifies a comma separated list of full paths that clients may access – Each path may be prefixed by R (read) and/or W (write), or N (none) to explicitly deny access to a path – '~’ for authenticated user’s home directory, and * may be used for simple wildcard matching. • e.g. Full access to home directory, read access to /data: – RestrictPaths = RW~,R/data • e.g. Full access to home directory, deny hidden files: – RestrictPaths = RW~,N~/.* 15
  • 16. Exercise: Restrict access • Set RestrictPaths=RW~,N~/archive • Run globus-connect-server-setup • Access your endpoint as ‘researcher’ • What’s changed? 16
  • 17. • In config file, set Sharing=True • Run globus-connect-server-setup • Flag endpoint as “managed” (in web app or via CLI) * Note: Creation of shared endpoints requires a Globus subscription for the managed endpoint Enabling sharing on an endpoint 17
  • 18. Limit sharing to specific accounts • SharingUsersAllow = • SharingGroupsAllow = • SharingUsersDeny = • SharingGroupsDeny = 18
  • 19. Sharing Path Restriction • Restrict paths where users can create shared endpoints • Use SharingRestrictPaths to customize – Same syntax as RestrictPaths • e.g. Full access to home directory, deny hidden files: – SharingRestrictPaths = RW~,N~/.* • e.g. Full access to public folder under home directory: – SharingRestrictPaths = RW~/public • e.g. Full access to /proj, read access to /scratch: – SharingRestrictPaths = RW/proj,R/scratch 19
  • 21. Ports needed for Globus • Inbound: 2811 (control channel) • Inbound: 7512 (MyProxy), 443 (OAuth) • Inbound: 50000-51000 (data channel) • If restricting outbound connections, allow connections on: – 80, 2223 (used during install/config) – 50000-51000 (GridFTP data channel) 21
  • 24. Single Sign-On with InCommon/CILogon • Your Shibboleth server must release R&S attributes to CILogon—especially the ePPN attribute • Local account must match institutional ID (InCommon ID) – Test by creating a local user with same name • In /etc/globus-connect-server.conf set: AuthorizationMethod = CILogon CILogonIdentityProvider = <institution_listed_in_CILogon_IdP_list> 24
  • 26. Subscription configuration • Subscription manager – Create/upgrade managed endpoints – Requires Globus ID linked to Globus account • Management console permissions – Independent of subscription manager – Map managed endpoint to Globus ID • Globus Plus group – Subscription Manager is admin – Can grant admin rights to other members 26
  • 27. Creating managed endpoints • Required for sharing, management console, reporting, … • Convert existing endpoint to managed via CLI (or web): globus endpoint update --managed <endpt_uuid> • Must be run by subscription manager • Important: Re-run endpoint update after deleting/re- creating endpoint 27
  • 28. Monitoring and managing Globus endpoint activity 28
  • 29. Management console • Monitor all transfers • Pause/resume specific transfers • Add pause conditions with various options • Resume specific tasks overriding pause conditions • Cancel tasks • View sharing ACLs 29
  • 30. Endpoint Roles • Administrator: define endpoint and roles • Access Manager: manage permissions • Activity Manager: perform control tasks • Activity Monitor: view activity 30
  • 33. Balance: performance - reliability • Network use parameters: concurrency, parallelism • Maximum, Preferred values for each • Transfer considers source and destination endpoint settings min( max(preferred src, preferred dest), max src, max dest ) • Service limits, e.g. concurrent requests 33
  • 35. Disk-to-Disk Throughput: ESnet Testing 0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 scp scp (w/HPN) sftp GridFTP (1 stream) GridFTP (4 streams) Disk-to-Disk Throughput (Mbps) 35Source: ESnet (2016) • Berkeley, CA to Argonne, IL (RTT: 53 ms, Capacity: 10Gbps) • scp is 24x slower than GridFTP on this path • >1 Gbps (125 MB/s) disk-to-disk requires RAID array
  • 37. Legacy Architecture 10GE Border Router WAN Firewall Enterprise perfSONAR perfSONAR Filesystem (data store) 10GE Portal Server Browsing path Query path Data path Portal server applications: · web server · search · database · authentication · data service
  • 38. Current best practice 10GE10GE 10GE 10GE Border Router WAN Science DMZ Switch/Router Firewall Enterprise perfSONAR perfSONAR 10GE 10GE 10GE 10GE DTN DTN API DTNs (data access governed by portal) DTN DTN perfSONAR Filesystem (data store) 10GE Portal Server Browsing path Query path Portal server applications: · web server · search · database · authentication Data Path Data Transfer Path Portal Query/Browse Path
  • 39. Science DMZ configuration 39 Source security filters Destination security filters Destination Science DMZ Source Science DMZ Source Border Router Destination Border Router Source Router Destination Router User Organization DATA CONTROL Physical Control Path Logical Control Path Physical Data Path Logical Data Path * Ports 443, 2811, 7512 * Ports 50000- 51000 Data Transfer Node (DTN) Data Transfer Node (DTN) * Please see TCP ports reference: https://docs.globus.org/resource-provider-guide/#open-tcp-ports_section
  • 40. ext* XFS ZFS ~/ ~/scratch Common endpoint configuration (GCSv4) Data Transfer Node POSIX Connector
  • 41. ext* XFS ZFS GPFS Lustre ~/projects Common endpoint configuration (GCSv4) Data Transfer Node POSIX Connector ~/ ~/scratch
  • 42. ext* XFS ZFS GPFS Lustre ~/projects Multi-endpoint configuration (GCSv4) Data Transfer Node POSIX Connector Western Digital ActiveScale Connector ~/archive ~/ ~/scratch
  • 43. ~/vault ext* XFS ZFS GPFS Lustre ~/projects Multi-endpoint configuration (GCSv4) 43 Data Transfer Node POSIX Connector Western Digital ActiveScale Connector ~/archive Amazon S3 Bucket Amazon S3 Connector ~/ ~/scratch
  • 44. Data Control Object Store / Archive Deploying a premium connector gateway server Object Store/ Archive Connector Globus endpoint (e.g. WD ActiveScale gateway) Other Globus endpoints Globus Share Globus Share
  • 45. Network paths • Separate control and data interfaces • "DataInterface =" option in globus-connect-server- conf • Common scenario: route data flows over Science DMZ link 45
  • 46. Dual-homed DTN – high speed data path Data Transfer Node GridFTP Server Science DMZ Control Channel Data Transfer Node GridFTP Server Data Channel if0 if1 Internet2 path Control Channel
  • 47. Dual-homed DTN – internal data path Data Transfer Node GridFTP Server Science DMZ Control Channel Data Transfer Node GridFTP Server Data Channel if0 if1 LAN/ Intranet path Control Channel Firewall if0 if1
  • 49. Encryption • Requiring encryption on an endpoint – User cannot override – Useful for “sensitive” data • Globus uses OpenSSL cipher stack as currently configured on your DTN • FIPS 140-2 compliance: ensure use of FIPS capable OpenSSL libraries on DTN www.openssl.org/docs/fips/UserGuide-2.0.pdf 49
  • 50. Distributing Globus Connect Server components • Globus Connect Server components – globus-connect-server-io, -id, -web • Default: -io, –id and –web on single server • Common options – Multiple –io servers for load balancing, failover, and performance – No -id server, e.g. third-party IdP – -id on separate server, e.g. non-DTN nodes – -web on either –id server or separate server for OAuth interface 50
  • 51. ext* XFS ZFS Distributing Globus Connect Server components Data Transfer Node OAuth Server GridFTP Server MyProxy CA Science DMZ (ACL limited) Port 2811 accepts inbound connections from Globus Firewall
  • 52. Setting up multiple –io servers • Guidelines – Use the same .conf file on all servers – First install on the server running the –id component, then all others • Install Globus Connect Server on all servers • Edit .conf file on one of the servers and set [MyProxy] Server to the hostname of the server you want the –id component installed on • Copy Globus Connect Server configuration file to all servers • Run globus-connect-server-setup on the server running the –id component • Run globus-connect-server-setup on all other servers • Repeat steps 2-5 as necessary to update configurations 52
  • 53. Example: Two-node DTN 53 -id -io -io On other DTN nodes: /etc/globus-connect-server.conf [Endpoint] Name = globus_dtn [MyProxy] Server = 34.20.29.57 On “primary” DTN node (34.20.29.57): /etc/globus-connect-server.conf [Endpoint] Name = globus_dtn [MyProxy] Server = 34.20.29.57