This document summarizes a training for librarians on data scientist skills. It introduces the speaker, Tom Morris, and covers the typical data analysis lifecycle of finding, preparing, analyzing, and visualizing data. Specific topics include the importance of recording data provenance, using tools like OpenRefine for working with messy data, characterizing data fields, and scaling techniques for larger data sets. Resources for further learning are also provided.