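Variables are either categorical (nominal, binary or ordinal) or continuous (integer, interval-scaled or ratio-scaled). Data cleaning involves detecting and then correcting or removing noisy and invalid values, such as values that were recorded incorrectly or outliers that fall outside the normal range for a variable. Missing values can arise from equipment malfunctions, from fields that were added after earlier records had been collected, or from information that was simply unavailable. They can be handled either by discarding the affected instances or by replacing each missing value with the average of the variable (for continuous variables) or its most frequent value (for categorical variables).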
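The cleaning and missing-value strategies described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the presentation's own method: the encoding of missing values as `None`, the helper names (`mask_out_of_range`, `discard_incomplete`, `impute_column`), the valid age range of 0 to 120 and the toy data are all assumptions made for the example.

```python
from statistics import mean, mode

MISSING = None  # assumption: missing values are encoded as None


def mask_out_of_range(values, low, high):
    """Treat values outside [low, high] as invalid and mark them missing."""
    return [v if v is MISSING or low <= v <= high else MISSING for v in values]


def discard_incomplete(rows):
    """Strategy 1: keep only instances that have no missing value."""
    return [row for row in rows if MISSING not in row]


def impute_column(values, categorical):
    """Strategy 2: replace missing values with the most frequent value
    (categorical variable) or the average (continuous variable)."""
    present = [v for v in values if v is not MISSING]
    fill = mode(present) if categorical else mean(present)
    return [fill if v is MISSING else v for v in values]


# Hypothetical toy data: an age (continuous) and a colour (categorical).
raw_ages = [25, None, 35, 30, 430]          # 430 is an obvious recording error
colours = ["red", "blue", None, "red", "red"]

# Data cleaning: ages outside an assumed valid range become missing.
ages = mask_out_of_range(raw_ages, low=0, high=120)

# Strategy 1: discard every instance with a missing value.
complete_rows = discard_incomplete(list(zip(ages, colours)))

# Strategy 2: fill the gaps column by column.
filled_ages = impute_column(ages, categorical=False)
filled_colours = impute_column(colours, categorical=True)

print(complete_rows)
print(filled_ages)
print(filled_colours)
```

Discarding instances is simple but throws away data, here three of the five rows, whereas replacing values keeps every instance at the cost of introducing estimated values; which trade-off is acceptable depends on how much data is missing.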




