SlideShare a Scribd company logo
1 of 18
Download to read offline
Arecibo Observatory
Data Movement - so much more than data
George B. Robb III, grobb3@es.net
EPOC - Performance Chaser
ESnet - Infrastructure Team
National Science Foundation Award #1826994
Arecibo Observatory
•Treasure of Astronomical Sciences since 1963
Inspiration for all, just a few highlights:
• Active radio telescope
• NEO, planetary, and atmospheric imaging
• Full spectrum 1Hz - 10 GHz
• Distance to Pleiades cluster
• Metric tons of scientific discoveries and publications.
https://www.naic.edu/ao/legacy-discoveries
5/12/2021 © 2021, Engagement and Performance Operations Center (EPOC) 2
Takes a hit, keeps on making science!
• 2006 - NSF 15% Budget cut across astronomical sciences
• 2007 - Arecibo budget slashed from$10.5 million to $8 million
(NASA pitches in with ~$2.6 million to help operations budget).
• 2011 - NSF implements Data Management Plan requirement.
• 2015 - Facilities director Kerr quits due to funding clashes.
• 2018 - University of Central Florida takes on stewardship.
Operations working tirelessly making science to happen.
5/12/2021 3
© 2021, Engagement and Performance Operations Center (EPOC)
Takes another hit.
• Hurricane Maria ( Sept 20, 2017 )
• Category 4
• Damage sustained
• R&E ( I2) network connection lost
and remained offline since 2012 to
UPR 10GbE !
• Too expensive to repair due to
budget cuts.
• Operations resume when the
power came back... And the
science goes on!
5/12/2021 4
© 2021, Engagement and Performance Operations Center (EPOC)
Takes another hit
5/12/2021 5
© 2021, Engagement and Performance Operations Center (EPOC)
•5.0 - 6.4 Earthquakes ( January 7-11, 2020 )
•Operations still going
• shaken not stirred ( yes 007 reference)
Takes and yet another
5/12/2021 6
© 2021, Engagement and Performance Operations Center (EPOC)
• Tropical Storm Isaias ( July 30, 2020 )
• And, SCIENCE vs operations
• still going…
• First [auxiliary] cable snap
• (Aug 10, 2020)
• Second cable [ primary ] snaps
• (Nov 6, 2020)
Thermodynamics, Economics, and Gravity win.
(Dec 1, 2020)
5/12/2021 7
© 2021, Engagement and Performance Operations Center (EPOC)
D
a
t
a
C
e
n
t
e
r
B
u
i
l
d
i
n
g
1
.
Thermodynamics, Economics, and Gravity win.
(Dec 1, 2020) - In a SECOND all changed.
5/12/2021 8
© 2021, Engagement and Performance Operations Center (EPOC)
5/12/2021 9
© 2021, Engagement and Performance Operations Center (EPOC)
MISSION: Get the DATA to a safe stable state
• 2 Petabytes of Golden Copy need to move.
• Fiber is still cut!
• Transfers in MB/s with PB to go.
• Data movement at a scale of years
• Network Attached Storage (NAS) “Appliances” being used with sneakernet
• 100+ Terabytes at a time. ( Full capacity of NAS device )
• Onsite team hand carries this priceless data to the closest 10GbE
links at University of Puerto Rico - Mayaguez (RUM) and Engine-4
Collaboration Space.
Operations still going, harder than ever, hand carrying copies
of priceless scientific data to safety.
5/12/2021 10
© 2021, Engagement and Performance Operations Center (EPOC)
Call for assistance:
• Team forms to see what can be done to get the data moving to
safe stable state ( we are in FULL DR ) .
• EPOC - Architect solutions to implement
• Globus - let's get the data on the wire
• Anyone want to saturate a link ( Also, redline a NAS )
• UPR (RUM) and Engine-4 - Provide transport 10GbE link!
• UCF - let’s facilitate the migrations
• TACC - let’s catch and secure the data
5/12/2021 11
© 2021, Engagement and Performance Operations Center (EPOC)
Call for assistance - EPOC
• Worked with UCF teams to understand the nature
of the data.
• Architect course of action:
• People are the network (let’s help make this
happen).
• The the fabulous community really knocked
this out of the park THANK YOU
5/12/2021 12
© 2021, Engagement and Performance Operations Center (EPOC)
Call for assistance - EPOC
• Understand Problem set:
• Site is not accepting new installations
• Oh, Global PANDEMIC
• We can’t just ship hardware.
• Closest 10GbE link an hour drive away.
• THANK YOU University of Puerto Rico (RUM) - UPR and Engine-4
• Network Attached Storage (movable storage
appliance) devices small but, mighty solutions
that can be tuned
5/12/2021 13
© 2021, Engagement and Performance Operations Center (EPOC)
Call for assistance - TACC
•Offered to catch and distribute the data
• Waved a magical storage wand to create a safe
and secure 2 Petabyte landing zone.
• 10GbE link analysis and monitoring
• NetSage and dashboarding
• Globus native landing zone ( woo hoo )!
5/12/2021 14
© 2021, Engagement and Performance Operations Center (EPOC)
Call for assistance - Globus
•Always part of the team and joined in person with
additional support - THANK YOU!
• rsync vs. globus
• ( spoiler: globus is an obvious winner )
• Timescale was years to move 2PB
• Data Transfer Node (DTN) tuning only helps!
>> Teams are pulling data from tape at this point!!!
5/12/2021 15
© 2021, Engagement and Performance Operations Center (EPOC)
Move that data! - Globus
5/12/2021 16
© 2021, Engagement and Performance Operations Center (EPOC)
Call for assistance -
so much more than data.
FULL Disaster Recovery - The instrument fell!
• Community engaged immediately
• Initial transfers 10s of Megabytes per second
• Tuned Globus transfers 100s of Megabytes per second
• Tools we have
• Globus - share that data!
• ESnet - https://es.net Fasterdata, DMZ, DTN, DME, Tuning, and much more.
• EPOC - https://epoc.global
• Supercharge your human network.
• We use only tools available, no such thing as silver bullets
• Hit the mission targets.
5/12/2021 17
© 2021, Engagement and Performance Operations Center (EPOC)
Words of caution ( or fear ) - FULL Disaster
Recovery - THE INSTRUMENT FELL.
( think for a moment on this )
• The network is part of the instrument!
• NSF’s 2011 Data Management Plan requirement
• https://www.nsf.gov/bfa/dias/policy/dmp.jsp
• Large facilities what is the current status of:
• Data management plan
• Replica site
• Backups
• Fiber paths
5/12/2021 18
© 2021, Engagement and Performance Operations Center (EPOC)
Words of caution ( or hope ) - FULL Disaster
Recovery - THE INSTRUMENT FELL.
• Call for assistance and engage your communities!
• EPOC - https://epoc.global epoc@iu.edu
• ESnet - https://es.net
• DTN - https://fasterdata.es.net/science-dmz/DTN/
• DME -
https://fasterdata.es.net/performance-testing/2019-2020-data-mobili
ty-workshop-and-exhibition/2019-2020-data-mobility-exhibition/
• ISI - Center of Excellence pilot, for Data Management
• https://www.nsf.gov/awardsearch/showAward?AWD_ID=1842042
National Science Foundation Award #1826994

More Related Content

What's hot

Improving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domainImproving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domainClaudia Vitolo
 
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...inside-BigData.com
 
Austin T Schaffer - Writing Sample - NASA LaRC Internship Responsibilities
Austin T Schaffer - Writing Sample - NASA LaRC Internship ResponsibilitiesAustin T Schaffer - Writing Sample - NASA LaRC Internship Responsibilities
Austin T Schaffer - Writing Sample - NASA LaRC Internship ResponsibilitiesAustin Schaffer
 
Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...Humphrey Southall
 
AusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data CubesAusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data CubesTERN Australia
 
Pic archiver stansted
Pic archiver stanstedPic archiver stansted
Pic archiver stanstedArchiver
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Laurent Lefort
 
ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13Simeon Warner
 
Big Data, Beyond the Data Center
Big Data, Beyond the Data CenterBig Data, Beyond the Data Center
Big Data, Beyond the Data CenterGilles Fedak
 
Building High Performance Computing Capability in the African Continent/Happy...
Building High Performance Computing Capability in the African Continent/Happy...Building High Performance Computing Capability in the African Continent/Happy...
Building High Performance Computing Capability in the African Continent/Happy...Academy of Science of South Africa (ASSAf)
 
Stansted slides-desy
Stansted slides-desyStansted slides-desy
Stansted slides-desyArchiver
 
Linked Sensor Data cube
Linked Sensor Data cubeLinked Sensor Data cube
Linked Sensor Data cubeLaurent Lefort
 
Big data in the research life cycle: technologies, infrastructures, policies
Big data in the research life cycle: technologies, infrastructures, policiesBig data in the research life cycle: technologies, infrastructures, policies
Big data in the research life cycle: technologies, infrastructures, policiesBigData_Europe
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRLucaCinquini
 
Exascale Computing Project (ECP) Update
Exascale Computing Project (ECP) UpdateExascale Computing Project (ECP) Update
Exascale Computing Project (ECP) Updateinside-BigData.com
 
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...Laurent Lefort
 
Food Security Use Case - ExtremeEarth Open Workshop
Food Security Use Case - ExtremeEarth Open WorkshopFood Security Use Case - ExtremeEarth Open Workshop
Food Security Use Case - ExtremeEarth Open WorkshopExtremeEarth
 
Gridforum Juergen Knobloch Grids For Science 20080402
Gridforum Juergen Knobloch Grids For Science 20080402Gridforum Juergen Knobloch Grids For Science 20080402
Gridforum Juergen Knobloch Grids For Science 20080402vrij
 

What's hot (20)

Improving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domainImproving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domain
 
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
 
Austin T Schaffer - Writing Sample - NASA LaRC Internship Responsibilities
Austin T Schaffer - Writing Sample - NASA LaRC Internship ResponsibilitiesAustin T Schaffer - Writing Sample - NASA LaRC Internship Responsibilities
Austin T Schaffer - Writing Sample - NASA LaRC Internship Responsibilities
 
Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...
 
AusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data CubesAusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data Cubes
 
Pic archiver stansted
Pic archiver stanstedPic archiver stansted
Pic archiver stansted
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
 
ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13
 
Big Data, Beyond the Data Center
Big Data, Beyond the Data CenterBig Data, Beyond the Data Center
Big Data, Beyond the Data Center
 
Building High Performance Computing Capability in the African Continent/Happy...
Building High Performance Computing Capability in the African Continent/Happy...Building High Performance Computing Capability in the African Continent/Happy...
Building High Performance Computing Capability in the African Continent/Happy...
 
Stansted slides-desy
Stansted slides-desyStansted slides-desy
Stansted slides-desy
 
Linked Sensor Data cube
Linked Sensor Data cubeLinked Sensor Data cube
Linked Sensor Data cube
 
Big data in the research life cycle: technologies, infrastructures, policies
Big data in the research life cycle: technologies, infrastructures, policiesBig data in the research life cycle: technologies, infrastructures, policies
Big data in the research life cycle: technologies, infrastructures, policies
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTR
 
Final Suli Report
Final Suli ReportFinal Suli Report
Final Suli Report
 
Exascale Computing Project (ECP) Update
Exascale Computing Project (ECP) UpdateExascale Computing Project (ECP) Update
Exascale Computing Project (ECP) Update
 
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
 
Statistical data in RDF
Statistical data in RDFStatistical data in RDF
Statistical data in RDF
 
Food Security Use Case - ExtremeEarth Open Workshop
Food Security Use Case - ExtremeEarth Open WorkshopFood Security Use Case - ExtremeEarth Open Workshop
Food Security Use Case - ExtremeEarth Open Workshop
 
Gridforum Juergen Knobloch Grids For Science 20080402
Gridforum Juergen Knobloch Grids For Science 20080402Gridforum Juergen Knobloch Grids For Science 20080402
Gridforum Juergen Knobloch Grids For Science 20080402
 

Similar to GlobusWorld 2021: Arecibo Observatory Data Movement

Data Mobility Exhibition
Data Mobility ExhibitionData Mobility Exhibition
Data Mobility ExhibitionGlobus
 
Network Engineering for High Speed Data Sharing
Network Engineering for High Speed Data SharingNetwork Engineering for High Speed Data Sharing
Network Engineering for High Speed Data SharingGlobus
 
Blue Waters and Resource Management - Now and in the Future
 Blue Waters and Resource Management - Now and in the Future Blue Waters and Resource Management - Now and in the Future
Blue Waters and Resource Management - Now and in the Futureinside-BigData.com
 
Enabling efficient movement of data into & out of a high-performance analysis...
Enabling efficient movement of data into & out of a high-performance analysis...Enabling efficient movement of data into & out of a high-performance analysis...
Enabling efficient movement of data into & out of a high-performance analysis...Jisc
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014aceas13tern
 
IPv6 deployment on GridPP & WLCG
IPv6 deployment on GridPP & WLCGIPv6 deployment on GridPP & WLCG
IPv6 deployment on GridPP & WLCGJisc
 
Burst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runBurst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runIgor Sfiligoi
 
Linac Coherent Light Source (LCLS) Data Transfer Requirements
Linac Coherent Light Source (LCLS) Data Transfer RequirementsLinac Coherent Light Source (LCLS) Data Transfer Requirements
Linac Coherent Light Source (LCLS) Data Transfer Requirementsinside-BigData.com
 
Building a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureBuilding a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureLarry Smarr
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Igor Sfiligoi
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Frank Wuerthwein
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsIgor Sfiligoi
 
GRIMES_Visualizing_Telemetry
GRIMES_Visualizing_TelemetryGRIMES_Visualizing_Telemetry
GRIMES_Visualizing_TelemetryKevin Grimes
 
DATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEI
DATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEIDATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEI
DATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEIBig Data Week
 

Similar to GlobusWorld 2021: Arecibo Observatory Data Movement (20)

Data Mobility Exhibition
Data Mobility ExhibitionData Mobility Exhibition
Data Mobility Exhibition
 
ESDIS Status (2002)
ESDIS Status (2002)ESDIS Status (2002)
ESDIS Status (2002)
 
Network Engineering for High Speed Data Sharing
Network Engineering for High Speed Data SharingNetwork Engineering for High Speed Data Sharing
Network Engineering for High Speed Data Sharing
 
Blue Waters and Resource Management - Now and in the Future
 Blue Waters and Resource Management - Now and in the Future Blue Waters and Resource Management - Now and in the Future
Blue Waters and Resource Management - Now and in the Future
 
[.ppt]
[.ppt][.ppt]
[.ppt]
 
EOSDIS Status
EOSDIS StatusEOSDIS Status
EOSDIS Status
 
Enabling efficient movement of data into & out of a high-performance analysis...
Enabling efficient movement of data into & out of a high-performance analysis...Enabling efficient movement of data into & out of a high-performance analysis...
Enabling efficient movement of data into & out of a high-performance analysis...
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014
 
IPv6 deployment on GridPP & WLCG
IPv6 deployment on GridPP & WLCGIPv6 deployment on GridPP & WLCG
IPv6 deployment on GridPP & WLCG
 
Burst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runBurst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud run
 
Linac Coherent Light Source (LCLS) Data Transfer Requirements
Linac Coherent Light Source (LCLS) Data Transfer RequirementsLinac Coherent Light Source (LCLS) Data Transfer Requirements
Linac Coherent Light Source (LCLS) Data Transfer Requirements
 
Building a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureBuilding a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration Infrastructure
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
 
BDIA Findings
BDIA FindingsBDIA Findings
BDIA Findings
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobs
 
GRIMES_Visualizing_Telemetry
GRIMES_Visualizing_TelemetryGRIMES_Visualizing_Telemetry
GRIMES_Visualizing_Telemetry
 
DATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEI
DATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEIDATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEI
DATA LAKE AND THE RISE OF THE MICROSERVICES - ALEX BORDEI
 
Earth Science Data and Information System (ESDIS) Project Update
Earth Science Data and Information System (ESDIS) Project UpdateEarth Science Data and Information System (ESDIS) Project Update
Earth Science Data and Information System (ESDIS) Project Update
 
CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...
CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...
CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...
 

More from Globus

Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration TopicsGlobus
 
Instrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowInstrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowGlobus
 
Building Research Applications with Globus PaaS
Building Research Applications with Globus PaaSBuilding Research Applications with Globus PaaS
Building Research Applications with Globus PaaSGlobus
 
Reliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesReliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesGlobus
 
Best Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusBest Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusGlobus
 
An Introduction to Globus for Researchers
An Introduction to Globus for ResearchersAn Introduction to Globus for Researchers
An Introduction to Globus for ResearchersGlobus
 
Introduction to Research Automation with Globus
Introduction to Research Automation with GlobusIntroduction to Research Automation with Globus
Introduction to Research Automation with GlobusGlobus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System AdministratorsGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Introduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersGlobus
 
Introduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersGlobus
 
Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Globus
 
Automating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeGlobus
 
Automating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New UsersGlobus
 
Working with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsGlobus
 
Globus Automation
Globus AutomationGlobus Automation
Globus AutomationGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 

More from Globus (20)

Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration Topics
 
Instrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowInstrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a Flow
 
Building Research Applications with Globus PaaS
Building Research Applications with Globus PaaSBuilding Research Applications with Globus PaaS
Building Research Applications with Globus PaaS
 
Reliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesReliable, Remote Computation at All Scales
Reliable, Remote Computation at All Scales
 
Best Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusBest Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using Globus
 
An Introduction to Globus for Researchers
An Introduction to Globus for ResearchersAn Introduction to Globus for Researchers
An Introduction to Globus for Researchers
 
Introduction to Research Automation with Globus
Introduction to Research Automation with GlobusIntroduction to Research Automation with Globus
Introduction to Research Automation with Globus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System Administrators
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Introduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for Researchers
 
Introduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for Developers
 
Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)
 
Automating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and Compute
 
Automating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus Platform
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
 
Working with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and Portals
 
Globus Automation
Globus AutomationGlobus Automation
Globus Automation
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 

Recently uploaded

Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
SoftTeco - Software Development Company Profile
SoftTeco - Software Development Company ProfileSoftTeco - Software Development Company Profile
SoftTeco - Software Development Company Profileakrivarotava
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...OnePlan Solutions
 
VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024VictoriaMetrics
 
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdfEnhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdfRTS corp
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024Anthony Dahanne
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorTier1 app
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
Keeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldKeeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldRoberto Pérez Alcolea
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLionel Briand
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingShane Coughlan
 
Best Angular 17 Classroom & Online training - Naresh IT
Best Angular 17 Classroom & Online training - Naresh ITBest Angular 17 Classroom & Online training - Naresh IT
Best Angular 17 Classroom & Online training - Naresh ITmanoharjgpsolutions
 

Recently uploaded (20)

Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
SoftTeco - Software Development Company Profile
SoftTeco - Software Development Company ProfileSoftTeco - Software Development Company Profile
SoftTeco - Software Development Company Profile
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
 
VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024
 
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdfEnhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryError
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
Keeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldKeeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository world
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
 
Best Angular 17 Classroom & Online training - Naresh IT
Best Angular 17 Classroom & Online training - Naresh ITBest Angular 17 Classroom & Online training - Naresh IT
Best Angular 17 Classroom & Online training - Naresh IT
 

GlobusWorld 2021: Arecibo Observatory Data Movement

  • 1. Arecibo Observatory Data Movement - so much more than data George B. Robb III, grobb3@es.net EPOC - Performance Chaser ESnet - Infrastructure Team National Science Foundation Award #1826994
  • 2. Arecibo Observatory •Treasure of Astronomical Sciences since 1963 Inspiration for all, just a few highlights: • Active radio telescope • NEO, planetary, and atmospheric imaging • Full spectrum 1Hz - 10 GHz • Distance to Pleiades cluster • Metric tons of scientific discoveries and publications. https://www.naic.edu/ao/legacy-discoveries 5/12/2021 © 2021, Engagement and Performance Operations Center (EPOC) 2
  • 3. Takes a hit, keeps on making science! • 2006 - NSF 15% Budget cut across astronomical sciences • 2007 - Arecibo budget slashed from$10.5 million to $8 million (NASA pitches in with ~$2.6 million to help operations budget). • 2011 - NSF implements Data Management Plan requirement. • 2015 - Facilities director Kerr quits due to funding clashes. • 2018 - University of Central Florida takes on stewardship. Operations working tirelessly making science to happen. 5/12/2021 3 © 2021, Engagement and Performance Operations Center (EPOC)
  • 4. Takes another hit. • Hurricane Maria ( Sept 20, 2017 ) • Category 4 • Damage sustained • R&E ( I2) network connection lost and remained offline since 2012 to UPR 10GbE ! • Too expensive to repair due to budget cuts. • Operations resume when the power came back... And the science goes on! 5/12/2021 4 © 2021, Engagement and Performance Operations Center (EPOC)
  • 5. Takes another hit 5/12/2021 5 © 2021, Engagement and Performance Operations Center (EPOC) •5.0 - 6.4 Earthquakes ( January 7-11, 2020 ) •Operations still going • shaken not stirred ( yes 007 reference)
  • 6. Takes and yet another 5/12/2021 6 © 2021, Engagement and Performance Operations Center (EPOC) • Tropical Storm Isaias ( July 30, 2020 ) • And, SCIENCE vs operations • still going… • First [auxiliary] cable snap • (Aug 10, 2020) • Second cable [ primary ] snaps • (Nov 6, 2020)
  • 7. Thermodynamics, Economics, and Gravity win. (Dec 1, 2020) 5/12/2021 7 © 2021, Engagement and Performance Operations Center (EPOC) D a t a C e n t e r B u i l d i n g 1 .
  • 8. Thermodynamics, Economics, and Gravity win. (Dec 1, 2020) - In a SECOND all changed. 5/12/2021 8 © 2021, Engagement and Performance Operations Center (EPOC)
  • 9. 5/12/2021 9 © 2021, Engagement and Performance Operations Center (EPOC) MISSION: Get the DATA to a safe stable state • 2 Petabytes of Golden Copy need to move. • Fiber is still cut! • Transfers in MB/s with PB to go. • Data movement at a scale of years • Network Attached Storage (NAS) “Appliances” being used with sneakernet • 100+ Terabytes at a time. ( Full capacity of NAS device ) • Onsite team hand carries this priceless data to the closest 10GbE links at University of Puerto Rico - Mayaguez (RUM) and Engine-4 Collaboration Space. Operations still going, harder than ever, hand carrying copies of priceless scientific data to safety.
  • 10. 5/12/2021 10 © 2021, Engagement and Performance Operations Center (EPOC) Call for assistance: • Team forms to see what can be done to get the data moving to safe stable state ( we are in FULL DR ) . • EPOC - Architect solutions to implement • Globus - let's get the data on the wire • Anyone want to saturate a link ( Also, redline a NAS ) • UPR (RUM) and Engine-4 - Provide transport 10GbE link! • UCF - let’s facilitate the migrations • TACC - let’s catch and secure the data
  • 11. 5/12/2021 11 © 2021, Engagement and Performance Operations Center (EPOC) Call for assistance - EPOC • Worked with UCF teams to understand the nature of the data. • Architect course of action: • People are the network (let’s help make this happen). • The the fabulous community really knocked this out of the park THANK YOU
  • 12. 5/12/2021 12 © 2021, Engagement and Performance Operations Center (EPOC) Call for assistance - EPOC • Understand Problem set: • Site is not accepting new installations • Oh, Global PANDEMIC • We can’t just ship hardware. • Closest 10GbE link an hour drive away. • THANK YOU University of Puerto Rico (RUM) - UPR and Engine-4 • Network Attached Storage (movable storage appliance) devices small but, mighty solutions that can be tuned
  • 13. 5/12/2021 13 © 2021, Engagement and Performance Operations Center (EPOC) Call for assistance - TACC •Offered to catch and distribute the data • Waved a magical storage wand to create a safe and secure 2 Petabyte landing zone. • 10GbE link analysis and monitoring • NetSage and dashboarding • Globus native landing zone ( woo hoo )!
  • 14. 5/12/2021 14 © 2021, Engagement and Performance Operations Center (EPOC) Call for assistance - Globus •Always part of the team and joined in person with additional support - THANK YOU! • rsync vs. globus • ( spoiler: globus is an obvious winner ) • Timescale was years to move 2PB • Data Transfer Node (DTN) tuning only helps! >> Teams are pulling data from tape at this point!!!
  • 15. 5/12/2021 15 © 2021, Engagement and Performance Operations Center (EPOC) Move that data! - Globus
  • 16. 5/12/2021 16 © 2021, Engagement and Performance Operations Center (EPOC) Call for assistance - so much more than data. FULL Disaster Recovery - The instrument fell! • Community engaged immediately • Initial transfers 10s of Megabytes per second • Tuned Globus transfers 100s of Megabytes per second • Tools we have • Globus - share that data! • ESnet - https://es.net Fasterdata, DMZ, DTN, DME, Tuning, and much more. • EPOC - https://epoc.global • Supercharge your human network. • We use only tools available, no such thing as silver bullets • Hit the mission targets.
  • 17. 5/12/2021 17 © 2021, Engagement and Performance Operations Center (EPOC) Words of caution ( or fear ) - FULL Disaster Recovery - THE INSTRUMENT FELL. ( think for a moment on this ) • The network is part of the instrument! • NSF’s 2011 Data Management Plan requirement • https://www.nsf.gov/bfa/dias/policy/dmp.jsp • Large facilities what is the current status of: • Data management plan • Replica site • Backups • Fiber paths
  • 18. 5/12/2021 18 © 2021, Engagement and Performance Operations Center (EPOC) Words of caution ( or hope ) - FULL Disaster Recovery - THE INSTRUMENT FELL. • Call for assistance and engage your communities! • EPOC - https://epoc.global epoc@iu.edu • ESnet - https://es.net • DTN - https://fasterdata.es.net/science-dmz/DTN/ • DME - https://fasterdata.es.net/performance-testing/2019-2020-data-mobili ty-workshop-and-exhibition/2019-2020-data-mobility-exhibition/ • ISI - Center of Excellence pilot, for Data Management • https://www.nsf.gov/awardsearch/showAward?AWD_ID=1842042 National Science Foundation Award #1826994