This document summarizes Nvidia's GPU Technology Conference 2016 (GTC16), including the announcements of the Tesla P100 GPU and the DGX-1 deep learning supercomputer. Key points include:
- The new Tesla P100 GPU delivers up to 21 teraflops of half-precision (FP16) performance for deep learning and introduces technologies such as NVLink, HBM2 memory, and the Page Migration Engine.
- The Nvidia DGX-1 is a deep learning supercomputer powered by eight Tesla P100 GPUs, delivering about 170 teraflops of half-precision (FP16) performance for training neural networks.
- CUDA 8 and unified memory improvements on the P100 enable simpler programming and larger datasets by allowing allocations beyond GPU memory size and
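The per-GPU and DGX-1 figures above are related by simple multiplication. As a rough sketch, assuming Nvidia's published peak of 21.2 FP16 teraflops per P100 (the NVLink version; the summary itself only says "up to 21") and ideal scaling with no interconnect or software overhead, the aggregate can be checked as follows. The constant and function names are illustrative only.

```python
# Assumption: 21.2 TFLOPS is the published peak FP16 rate of one Tesla P100
# (NVLink version); the summary above rounds this to "up to 21 teraflops".
P100_FP16_TFLOPS = 21.2
GPUS_PER_DGX1 = 8  # the DGX-1 contains eight Tesla P100 GPUs


def aggregate_tflops(per_gpu_tflops: float, gpu_count: int) -> float:
    """Return the ideal aggregate peak, assuming perfect linear scaling."""
    return per_gpu_tflops * gpu_count


dgx1_fp16_tflops = aggregate_tflops(P100_FP16_TFLOPS, GPUS_PER_DGX1)
print(f"DGX-1 peak FP16: {dgx1_fp16_tflops:.1f} TFLOPS")
```

This is a theoretical peak: it comes to just under 170 teraflops, and sustained training throughput on real networks would be expected to fall below it.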