This document summarizes a team's analysis of a flight delay prediction problem. The team analyzed a dataset with 29 features and 484,551 rows to understand missing values, duplicates, outliers, and categorical variables. They performed data visualization, feature engineering including encoding, scaling, and selection. Models tested include linear regression, Ridge regression, SVC, random forest, and neural network. Ridge regression and random forest performed best with 98-99% accuracy. Issues with linear regression like overfitting were addressed using regularization. SVC was unsuitable due to time and accuracy. Future work may include more data and relevant features.