This document discusses using data mining techniques to predict air pollution levels in a smart city. It begins with background on the health impacts of various air pollutants like particulate matter, nitrogen dioxide, sulfur dioxide, and carbon monoxide. It then describes using a multivariate, multistep time series approach with a random forest algorithm to predict future air pollution levels based on past and current pollution data as well as other attributes like temperature, wind speed, humidity, traffic levels, and the previous day's air quality. Finally, it reviews several related works that used techniques like neural networks, linear regression, extreme learning learning, and decision trees to predict pollution trends in other cities and compares their limitations and accuracies.