David Minor Director, Preservation Initiatives UC San Diego LibrariesSan Diego Supercomputer Center March, 2012
A brief history …• 2008: Campus-wide needs assessment – What do campus users need today? – What do they think they need tomorrow? – What is hindering their research?
A brief history …• 70% indicated need for short-term storage – 1-3 years• 64% indicated need for long-term preservation of data sets• Also needed data management help, metadata creation, tools for sharing, etc.
A brief history …• April, 2009: Blueprint for the Digital University – Publically available – Indicates directions and goals for campus
A brief history …• April, 2010: Cyberinfrastructure Planning and Operations Committee Report issued – Operationalized the Blueprint – Actual plans, budgets and projections
A brief history …• January, 2011: RCI Oversight Committee formed – business plan accepted, oversight committee charged – Let’s go DO this
RCI elements• High-Performance Computing• Data Center Colocation• Storage• Networking and other services• Data curation
High-performance computing• Triton Resource: a cost–effective and accessible high- performance computing system primarily for UC San Diego and UC researchers• Triton Affiliates and Partners Program (TAPP): high performance cluster computing time at a reasonable cost.• New developments include “condo” computing http://www.sdsc.edu/us/tapp
Data center colocation• Standard rack provided with ISO-Base seismic protection, aisle containment, and 2x30A power distribution• 10+ Gb networking fabric connectivity both throughout SDSC aggregation fabric and into CENIC• 24/7 operations staff providing facility oversight and emergency "remote hands" hardware assistance http://rci.ucsd.edu/services/colocation.html
Networking and other services• Web Hosting• Database Hosting• 10GigE research network throughout campus http://rci.ucsd.edu/services/other-services.html
StorageStorage Type Cost per Terabyte-Year Availability Application PerformanceParallel File System • Designed for HPC users 99.5% Up to 100 GB/sProject Storage • Standard Availability, • 99.5% • Up to 1 GB/s Single-Site Durability • High Availability, Multiple- Site Durability • 99.95% • Up to 1 GB/sCloud Storage • Single-Site Durability • 99.5% • Up to 100 MB/s • Triple Copy • 99.5% • Up to 100 MB/s
Data curation• Starting with a two year pilot phase• Using existing tools whenever possible – Storage at SDSC – Digital Asset Management System at UCSD Libraries – Campus high-speed networking – Chronopolis digital preservation network http://rci.ucsd.edu/services/data-curation.html http://rci.ucsd.edu/pilots
The Brain ObservatoryPreserve and curate the digital version of thebrain of patient HM, the most studiedneuropsychological patient in modern medicine.
The Brain Observatory• Aspects of image preservation• Interaction with a commercial site• Work with combinations of physical slides, images, pyramidal structures
NSF OpenTopography FacilityOpenTopography facilitates community access to high-resolution, Earth science-oriented, topography data,and related tools and resources
NSF OpenTopography Facility• Preservation of raw data• Provide DOIs for complex datasets• Information passing between portals
Levantine Archaeology LaboratoryFocuses on archaeological investigationsconcerning the evolution of societies in thesouthern Levant from the Neolithic to Islamicperiods.
Levantine Archaeology Laboratory• Cyber-archaeology• Tools for uniting field work, objects in cold storage, and digital imagery• Develop the infrastructure needed to curate cultural heritage data that is spurred by new visualization and analysis tools.
Scripps Institution of Oceanography Geological CollectionsThe Sediment Core collection contains samplescollected from as early as 1916. The Cored SedimentCollection is a growing archive of sea-floor samples andassociated data supporting a diverse variety of scientificresearch.
Scripps Institution of Oceanography Geological Collections• Work with local data and a national community• Assist with the creation of a standards-based access, discovery and preservation system for one of the largest collections of marine geology samples in the United States.
The Laboratory for Computational AstrophysicsDedicated to advancing the state-of-the-art ofastrophysical simulation through the development anddissemination of community codes, and through large-scale simulations of astrophysical and cosmologicalsystems.
The Laboratory for Computational Astrophysics• Provide data management and curation to improve collaborations with other researchers• Support publishing simulations of astrophysical phenomenon in cosmology, star formation and turbulence• Provide metadata support
Data management plans• Resources and contacts available to UCSD researchers• Examples from submitted proposals• Guidance, tips and recommendations for DMP preparation• UCSD-centered version of DMP Tool http://rci.ucsd.edu/dmp/index.html
http://rci.ucsd.eduDavid MinorDirector, Digital Preservation InitiativesUC San Diego LibrariesSan Diego Supercomputer Centerminor@sdsc.edu