This document discusses data and data preprocessing in data mining. It defines what data is, including data objects and attributes. It describes different attribute types like nominal, binary, ordinal, interval-scaled and ratio-scaled numeric attributes. It also discusses measuring the central tendency of data using the mean, median and mode. Additionally, it covers measuring data distribution through variance, standard deviation and z-scores. Finally, it briefly introduces measuring data similarity and dissimilarity, as well as an overview of data preprocessing.