Ancestry.com is the world's largest online family history resource, with over 30,000 historical collections, 11 billion records, and 4 petabytes of data. Through AncestryDNA, users spit in a tube, pay $99, and learn about their family origins and find long-lost relatives; the AncestryDNA database holds over 120,000 samples. However, GERMLINE, the DNA matching algorithm originally used, did not scale well to large data sets. Ancestry.com therefore developed its own algorithm, called Jermline, which uses Hadoop and HBase to match DNA samples in parallel across a cluster of servers and provides a 1700% performance improvement over GERMLINE.
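
The summary above does not describe how Jermline works internally, so the following is only a toy illustration of one general idea behind scalable matching: index each sample's fixed-size windows in a shared table, so a new sample is compared against what is already stored instead of re-running every pairwise comparison. Everything here is an assumption made for illustration: the `MatchIndex` class, the `WINDOW` size, the string encoding of genotypes, and the in-memory dictionary that stands in for a distributed table such as HBase.

```python
from collections import defaultdict

WINDOW = 4  # hypothetical number of markers per hashed window


def windows(genotype, size=WINDOW):
    """Yield (start, slice) for non-overlapping fixed-size windows."""
    for start in range(0, len(genotype) - size + 1, size):
        yield start, genotype[start:start + size]


class MatchIndex:
    """Toy stand-in for a shared table keyed by (start, slice)."""

    def __init__(self):
        # (start, slice) -> ids of samples that have that slice at that start
        self.table = defaultdict(set)

    def add_sample(self, sample_id, genotype):
        """Index one new sample; return {other_id: [matching starts]}."""
        matches = defaultdict(list)
        for key in windows(genotype):
            # Look up earlier samples sharing this exact window.
            for other in self.table[key]:
                matches[other].append(key[0])
            # Only then record the new sample under this window.
            self.table[key].add(sample_id)
        return dict(matches)


index = MatchIndex()
first = index.add_sample("A", "ACGTACGTTTTT")
second = index.add_sample("B", "ACGTACGTAAAA")
third = index.add_sample("C", "GGGGACGTAAAA")
```

In this sketch, adding sample `C` touches only the three table entries for its own windows, regardless of how many samples were indexed before it. Real genotype data, segment extension, and error tolerance are far more involved than exact string windows.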