This document discusses how data locality is challenged in cloud computing environments where data is distributed across remote networks. It introduces LLAP (Locality is Locality Abstraction for Pipelines), a caching technique used by Hortonworks Data Cloud that decentralizes data in columnar caches across nodes to improve query performance even when data is remote. The document explains how LLAP handles issues like distributed transactions and node failures to maintain cache consistency and affinity without losing performance. Overall, LLAP aims to overcome data locality issues in the cloud by leveraging efficient caching techniques.