Manages cloud computing infrastructure to support scientific research: Open Science Data Cloud.
Manages cloud computing testbeds: Open Cloud Testbed.
Develop reference implementations, benchmarks and standards.
OCC Members Companies: Cisco, Citrix, Yahoo!, … Universities: University of Chicago, Northwestern Univ., Johns Hopkins, Calit2, ORNL, University of Illinois at Chicago, … Federal agencies: NASA International Partners: AIST (Japan) Other: National Lambda Rail Beginning to add international partnersin 2011. 5
Variety of analysis Scientist with laptop Wide Open Science Data Cloud Med High energy physics, astronomy Low Data Size Medium to Large Small Very Large Dedicated infrastructure No infrastructure General infrastructure
Persistent data Large data clouds Med databases HPC Small Cycles Large & spec. clusters Small to medium clusters Single workstations
Hosted, managed, distributed facility to: Manage & archive your medium and large datasets Provide computational resources to analyze it Provide networking to share it with your colleagues and the public.
Long Time Goal Build a (small) data center for science.
And preserve your data the same way that libraries preserve books & museums preserve art.
What Could You Do With 1 PB of Genomics Data? The NIH in the U.S. currently makes available for download approximately 2PB of data. Bionimbus today consists of 6 racks, 212 nodes, 1568 cores and 0.9 PB of storage. We plan to add approximately 1 PB of genomics and other data from the biological sciences to Bionimbus in 2011.
Case Study: ModENCODE Bionimbus is used to process the modENCODE data from the White lab (over 1000 experiments). BionimbusVMs were used for some of the integrative analysis. Bionimbus is used as a backup for the modENCODE DCC
Project Matsu 2: An Elastic Cloud For Disaster Response Daniel Mandl - NASA/GSFC, Lead 20
Provide Fire / Flood Data to Rescue Workers Note blue bars indicating a surge of rainfall upstream Flood Dashboard Then a flood wave appears downstream at Rundu river gauge days later Short Term Pilot for 2011
Colored areas represent catchments where rainfall collects and drains to river basins
Nominate one of your graduate students to spend a summer working with one of the OSDC PIRE Foreign Partners
Send one of your graduate students to hands-on Workshops, such as Introduction to Data Intensive Computing
Submit your most impressive dataset to the OSDC Data Challenge
Buy a container of computers and join the OSDC
Open Science Data Cloud Sustainability Model
Towards a Long Term, Sustainable Model Capital Exp about $1M/year Operating Exp about $1M/year Moore Foundation providing $1M/year for 2011 and 2012 to support the Cap Exp.
Who do you most trust to manage your data for 100 years? Companies may not be here tomorrow. Government agencies have a role, but not always easy to use. Think of a not for profit with that mission.
Buy A Container and Join the OCC Use 2/3 of the container for your own purposes. Provide 1/3 of the container to the OCC for a share replica space.
To Get Involved Join the Open Cloud Consortium: www.opencloudconsortium.org