This document provides cheat sheets and resources for various programming languages and tools used for data science. It defines a data scientist as someone who can write code in languages like R, Python, Java, SQL and Hadoop, understands statistics, and can derive insights from data to help businesses make decisions. Links are included for quick reference sheets on topics like Java, Linux, SQL, Hive QL, Python, R, Pig, HDFS, and Git to aid data scientists in their work.