One component of the Global Unified Open Data Architecture (GUODA) infrastructure hosted at iDigBio provides Jupyter Notebooks to biodiversity researchers so they can analyze large datasets with Apache Spark. This talk was delivered at the TDWG 2016 Annual Conference in Santa Clara de San Carlos, Costa Rica.