This document summarizes Manuel Martín Salvador's background and research interests in automated and adaptive data pre-processing for building predictive models. It discusses how data pre-processing makes up a large portion of the data mining process but is labor intensive. The document also outlines OpenML, a scientific workflow platform and repository for machine learning experiments, and highlights opportunities to increase the number and types of pre-processing methods available on the platform as well as improve flow representation and recommendation.