The FDA Office of Regulatory Affairs (ORA) manages the process whereby all products imported into United States are screened by electronic systems and human inspections, https://www.fda.gov/ForIndustry/ImportProgram/. About 40 million products are monitored annually resulting in 6 billion data records that need to be processed every night. Booz Allen built an Apache Spark system to analyze the FDA ORA data and to predict violations. The solution uses enterprise friendly SQL framework to expand from data aggregation to Machine Learning without heavy coding. The system enables any enterprise DBA or analyst easily access, filter and transform data to apply the latest machine learning models. These analysts are able to process 6 billion records from various databases and other sources every night without any prior experience with Apache Spark. This helped to scale the Apache Spark solution enable data warehouse/RDBM experts to process powerful analytics workloads without needing to know Scala or Python.