Hadoop distributions contain numerous Open Source components which support data processing at large scale (Hadoop, Hive, Spark, etc). These projects are developed and maintained as Top Level Apache Software Projects which provides them the freedom to innovate and improve in a liberal way. As an example, HDP’s key advantage is the rock-solid quality and integration of the shipped software. In this talk we will provide insights about the process which enables scheduled maintenance releases and yields enterprise ready open source software distribution. Topics: Triaging: identifying the finest selection of patches from upstream Certification process: System, regression, upgrade & unit testing Stack validation using long-running workloads Apache License Compliance