This document outlines a proposal to conceptualize and prototype a scalable genomic data analysis pipeline using Project Glow and Apache Spark on Amazon Web Services' (AWS) Databricks platform. It discusses the growing amounts of human genomic data, limitations of existing bioinformatics tools, and how distributed computing frameworks like Apache Spark and cloud platforms can enable analysis of huge genomic datasets.