In this talk, you’ll learn how we’re taking the massive, heterogeneous set of data collection systems in Comcast’s Technology and Product group and centralizing on a single platform built around Kafka. These data collection systems are used for everything from business analytics to near-real-time operations to executive reporting. We’ll go over what it takes to wrangle streaming data across an enterprise, including the need for, and our approaches to:

- Schema management, both at schema creation time and when schema evolution is required
- Data ingest and cleansing
- Multi-datacenter collection and failover
- Using the same data stream for many different purposes, across many different teams