The scientific community is reaching an era of inexpensive whole genome sequencing, often with analysis focusing on annotated genome variants. We anticipate researchers and physicians will want to efficiently query, visualize, and analyze 100’s - 100,000’s of genomes at a time. Thus, we present a new format, Compressed Genome Format (CGF), designed for losslessly storing phased or unphased genetic variation data that can be queried and analyzed without decompression. Our preliminary implementation of CGF represents chromosomes 13 and 17 of 174 whole genome sequences, from 41.2-52.1 MB to 7.0-7.4 MB. Loading these 174 people to a format conducive to machine learning uses 1.805 GB and takes on average 33.37s. Projection onto the first principal component separated the population according to ancestral geographical location, as documented by the 1000 Genomes Project Consortium to occur for human SNP data. CGF also supports a GA4GH Beacon, is compatible with other GA4GH API, adapts easily to changes in the reference genome, can be losslessly visualized, and can be easily annotated.