This document discusses finding allelic frequencies using MapReduce/Hadoop. It begins with an introduction of the presenter and an overview of analyzing genetic variants identified through sequencing to estimate allelic frequency differences between patient groups. It then provides definitions for key genetic terms. The document describes the source data in Variant Call Format files and converting to bioset records. It outlines performing allelic frequency analysis on two groups of biosets to find frequencies, p-values, and top differences between the groups.