6. Let’s talk about data-pipelines
a data pipeline is the software that consolidates data from
multiple sources and makes it available to be used
strategically
20. • ETL processes were written in MapReduce (Hive) -
not easy to work with and add new processes.
• UI was custom made. Every report required
planning, developer time and a deploy.
• Some reports running on production DB
Pain points