This white paper proposes a reference architecture for optimizing data warehouses using Hadoop. It combines Informatica and Cloudera technologies to offload processing and infrequently used data from data warehouses to Hadoop. This alleviates strain on warehouses and frees up storage space. The architecture provides universal data access, flexible data ingestion methods, streamlined data pipelines, scalable processing and storage using Hadoop, end-to-end data management, and real-time queries of Hadoop data. The goal is to optimize warehouse performance and costs by leveraging Hadoop for large-scale data storage and preprocessing.