This document discusses moving big data caching from the cloud to the edge using machine learning. It proposes (1) harnessing big data and machine learning to estimate content popularity for proactive caching, (2) implementing a cache-enabled architecture at the edge where devices provide cloud-like computing instead of the cloud, and (3) using a case study analyzing real data traces to show potential backhaul offloading gains.