This document presents an MSc thesis on big data in healthcare. It discusses how the healthcare sector generates large amounts of data and how that data can be put to use. The thesis first explains why big data matters in healthcare, with examples from the history of data usage and from current applications. It then details how healthcare data can be collected, processed, and analyzed using tools such as Hadoop, Hive, Pig, and Sqoop. Finally, it envisions the future potential of big data in healthcare, including real-time uses.