This document summarizes a talk given at the Apache Big Data Conference 2016 in Vancouver. The talk was given by Andreas Zitzelsberger and focused on a real-world example of using Apache Spark, Kafka, Parquet and HDFS to build a real-time clickstream analysis platform as a service solution. The talk explained the design decisions for this solution and presented lessons learned from the project.