HDFS provides a filesystem that can scale to tens of petabytes, with high IO bandwidth and data replication across machines. Locality is important for performance in Hadoop. There are ongoing issues with data loss, handling full disks, underreplication of data, and limits to scaling the namenode. Failover handling and security also need improvement before HDFS is ready for broad production use.