The document discusses building data lakes and analytics on AWS. It provides an overview of challenges posed by big data including volume, velocity, variety and veracity of data. It then describes how AWS services like S3, Glue and Athena can help address these challenges by allowing quick ingestion and storage of raw data in its original format. The document also discusses best practices for preparing and analyzing data in the lake using services like EMR, Redshift and SageMaker to derive insights and drive machine learning models.