This presentation was provided by William Mattingly of the Smithsonian Institution, for the third session of NISO's 2023 Training Series on Text and Data Mining. Session three, "Data Pre-processing for Libraries," was held on Thursday, October 26, 2023.