The document proposes a Twiche framework for caching intermediate data from MapReduce jobs processing large amounts of Twitter data. Twiche would cache intermediate results on the reduce tasks to eliminate duplicate computations. It requires minimal changes to the original MapReduce model. The authors implemented Twiche in Hadoop by extending relevant components. Experiments showed Twiche could eliminate all duplicate tasks in incremental MapReduce jobs with minimal application code changes.