Open Science Data Cloud (IEEE Cloud 2011)

2,083 views

Published on

These are the slides from a plenary panel that I participated in at IEEE Cloud 2011 on July 5, 2011 in Washington, D.C. I discussed the Open Science Data Cloud and concluded the talk by three research questions

Published in: Technology, Education
1 Comment
4 Likes
Statistics
Notes
No Downloads
Views
Total views
2,083
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
32
Comments
1
Likes
4
Embeds 0
No embeds

No notes for slide

Open Science Data Cloud (IEEE Cloud 2011)

  1. 1. OCC Open Science Data Cloud(www.opensciencedatacloud.org)<br />Robert Grossman<br />University of ChicagoOpen Cloud Consortium<br />Open Data Group<br />July 5, 2011<br />1<br />
  2. 2. I’ll describe a new project (the Open Science Data Cloud) and three research questions generated by the project. <br />
  3. 3. Open Science Data Cloud<br />The OCC is a not-for-profit supporting the scientific community by operating cloud infrastructure.<br />
  4. 4. The OSDC is a hosted distributed facility managed by the OCC that:<br />Manages & archives medium and large size datasets.<br />Provides computational resources to analyze them.<br />Provides networking to share the datasets with your colleagues and with the public.<br />
  5. 5. Phase 12011 - 2014<br />Proof of Concept2008 - 2010<br />Phase 2<br />2015-2020<br /><ul><li>4 locations
  6. 6. 10G networks
  7. 7. 450+ nodes
  8. 8. 3000 cores
  9. 9. 2 PB
  10. 10. Build a data center for science.
  11. 11. Drive the the 4th paradigm.
  12. 12. 6+ locations
  13. 13. 100G networks
  14. 14. $1M - $2M hardware/year
  15. 15. Sept, 2011</li></li></ul><li>Why Another Cloud Project?<br />
  16. 16. Variety of analysis<br />Scientist with laptop<br />Wide<br />Open Science Data Cloud<br />Med<br />High energy physics, astronomy<br />Low<br />Data Size<br />Medium to Large <br />Small<br />Very Large<br />Dedicated infrastructure<br />No infrastructure<br />General infrastructure<br />
  17. 17. OSDC Perspective<br /><ul><li>Take a long term point of view (think like an underfunded library not a cloud service provider).
  18. 18. Manage both the data and the analysis environment.
  19. 19. Develop open architecture that interoperates with other private and public clouds.
  20. 20. Operate vendor neutralinfrastructure at the scale of a small data center.</li></li></ul><li>Project 1. Bionimbus<br />www.bionimbus.org (biological data)<br />
  21. 21. Project Matsu 2: An Elastic Cloud For Earth Science Data<br />10<br />matsu.opencloudconsortium.org<br />
  22. 22. Research Questions<br />Develop technology to encapsulate a scientist’s data and analysis tools and to export, save and move these between clouds.<br />Develop protocols, utilities, and applications so that new racks and containers can be added to data clouds with minimal human involvement. <br />Develop technology to support the long term, low cost preservation of data in clouds.<br />

×