This document provides an introduction to data cleaning and OpenRefine, an open-source tool for cleaning messy data. It discusses what constitutes messy data such as spelling errors, inconsistent formatting of dates, numbers formatted as text, missing values, and multiple variables in one column. It then introduces OpenRefine, describing it as a locally-run but browser-based tool formerly from Google that is now open source. It can be used for tasks like sorting, removing whitespace, splitting columns, converting formats, geocoding, and clustering to clean data. Finally, it provides examples of cleaning practices and questions to try on sample data.