This document discusses technologies for data ingestion, transformation, and analytics. It introduces Gobblin for scalable data ingestion from diverse sources, Cubert for converting data formats, WhereHows for data lineage tracking, and Pinot for real-time analytics. Gobblin provides a framework for extracting, converting, validating data in parallel tasks. Cubert allows converting data between formats using a domain-specific language. WhereHows tracks lineage metadata to answer questions about where data came from and how it flows. Pinot is a real-time distributed OLAP store for interactive queries on fresh data using a SQL-like interface.