The document describes Ancestry's journey moving from a single machine DNA matching process to a scalable Hadoop and HBase solution. It details how they first parallelized the ethnicity prediction step using Hadoop as a job scheduler. This freed resources for the more challenging matching algorithm. It then explains how they developed "Jermline", storing matching data in HBase and using MapReduce to efficiently find new matches for incremental DNA samples. The new distributed solution allowed matching to scale to millions of DNA samples.