Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Flight Delay Prediction


Published on

ADS Team 3 Final Presentation.
Discusses different models to perform delay prediction, flight cancellation, flight recommendation and flight fare prediction.

  • who will win this game? get free picks and predictions. ★★★
    Are you sure you want to  Yes  No
    Your message goes here

Flight Delay Prediction

  2. 2. Business cases/problem statement:  Crowded airspace becoming unpredictable.  Rescheduling of critical government air space operations because of delays  Problems in liaisoning between US military and the civilian Air Traffic Control because of sudden delays.  Bad customer satisfaction for US residents.  Sudden surge/decrease in the airfare.  Solution :  Delay Prediction  Average Price Prediction  Flight cancellation Prediction  Flight Recommendation
  3. 3. Data  We have gathered the data from Statistical Computing Statistical Graphics section of American Statistical Association Website.  Data had around 5 million rows and 25 columns.  We processed our prediction on .5 million rows.  Recommendation: we ran the matchbox recommendation algorithm against 35,000 reviews who had reviewed the airline carriers.
  4. 4. Flight Cancellation classification Model Accuracy Precision Two Class Logistic Regression 0.978 0.565 Two Class Neural Network 0.980 0.756 Two Class Boosted DecisionTree 0.982 0.758 Two Class Decision Forest 0.980 0.591 Two Class Decision Jungle 0.981 0.872 • Classification done on the Cancelled Column of the dataset. 0 stands for not cancelled and 1 for cancelled. • Two Class Boosted Decision Tree gives better accuracy. • Weather data was scraped from wunderground website. • On Feature Selection, we selected flightnum, hour, temperature, visibility and sea level pressure as the variables that help in better prediction.
  5. 5. Arrival Delay Prediction  Based on feature selection, used- hour, flight number, day of the month, visibility, day of week and departure delay to train various regression models.  Used Linear regression, boosted decision tree, Neural Network, and Decision Forest.  Concluded that the prediction required even more features like like mechanical issues, airport congestion, etc. which were not present in the dataset.  Found that Boosted decision tree was the best algorithm amongst all.
  6. 6. Flight Delay Models Matrix Linear Regression Neural Networks Decision Forest Regression Boosted Decision Tree MAE 15.50 13.50 11.68 12.38 RMSE 27.70 18.83 17.23 17.32 Relative Absolute Error 0.57 0.47 0.42 0.45 Relative Squared Error 0.38 0.17 0.14 0.15 Coefficient 0.61 0.84 0.85 0.84
  7. 7. Visualization
  8. 8. Average Price Prediction  Predict the average price of flights, depending on destination address.  Predict average ticket price according to Flight Carrier.  We found Boosted Decision Tree to be the best model among all others.
  9. 9. Air Carrier Recommendation  We are using the Microsoft Azure recommendation System to get the related Airlines carriers.  The dataset is trained on UserName, Airlines carrier and their ratings.
  10. 10. Thank You !!