This document discusses challenges in data-intensive science and high-performance computing. It describes the data lifecycle of scientific research and highlights both scientists' desire to share data and the gap between that desire and current practice. It also addresses how to define "big data", how to integrate data management across storage and information systems, and how to leverage extensive citizen science networks to build knowledge.