Globus + Dataverse:
Towards Big Data Publication
Vas Vasiliadis - vas@uchicago.edu
Globus is …
a non-profit service
developed and operated by
Our mission is to…
increase the efficiency and effectiveness of researchers engaged in
data-driven science and scholarship through sustainable software
Development is funded by...
U.S. Department of Energy
Operations are funded by subscribers
Research Computing HPC
Desktop Workstations
Archives Instruments
Personal Systems
Public Cloud Storage
National Resources
We unify data access across disparate systems…
“I need to easily, securely, & reliably move or replicate my data between systems.”
Globus Connectors support diverse systems
Public / private cloud stores
Campus stores
Project repositories, replication stores
Public repositories
…simplify secure sharing with collaborators…
Analysis store
Next-Gen Sequencer
MRI
Advanced Light Source
Personal system
Remote visualization
Light Sheet Microscope
High-durability, low-cost store
…and help researchers manage instrument data
Cryo-EM
A first attempt at data publication…
Dataverse + Globus
Enabling community data publication efforts
Repository data distribution
1. Search, request data of interest
2. Bulk data transfer
• Data portal/web app enables faceted search
• Enforces fine-grained authorization
• HTTPS download for “small” data
• Asynchronous transfer for larger data sets
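
The delivery logic in the bullets above can be sketched roughly as follows. This is a minimal illustration, not the portal's actual implementation: the size cutoff for “small” data, the `DataRequest` type, the access-list format, and the function names are all hypothetical, since the slides give no concrete values or APIs.

```python
from dataclasses import dataclass

# Hypothetical cutoff for "small" data; the slides do not state a number.
SMALL_DATA_LIMIT_BYTES = 2 * 1024**3


@dataclass
class DataRequest:
    """A user's request for one dataset (illustrative type only)."""
    user: str
    dataset_id: str
    size_bytes: int


def is_authorized(user: str, dataset_id: str, acl: dict[str, set[str]]) -> bool:
    """Return True if the user appears in the dataset's access list."""
    return user in acl.get(dataset_id, set())


def choose_delivery(request: DataRequest, acl: dict[str, set[str]]) -> str:
    """Pick a delivery path for an authorized request.

    Returns "https" for small data and "async_transfer" for larger data.
    Raises PermissionError when the user is not authorized.
    """
    if not is_authorized(request.user, request.dataset_id, acl):
        raise PermissionError(
            f"{request.user} may not access {request.dataset_id}"
        )
    if request.size_bytes <= SMALL_DATA_LIMIT_BYTES:
        return "https"
    return "async_transfer"
```

In this sketch the authorization check runs before any delivery decision, so an unauthorized request never learns which path would have been used. A real portal would hand the "async_transfer" case to a managed transfer service rather than return a string.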
2. Browser-based download
Globally accessible multi-tenant service
demonstration…
docs.globus.org
globus.org/connectors
outreach@globus.org
support@globus.org
globus.org/subscriptions
