SlideShare a Scribd company logo
Submitted By............
SANJIB MITRA(150403074)
SANTANU SINGHA (150403076)
SHRUTI KULSHRESTHA (150403085)
SUBHAM KUMAR MAHANTY(150403101)
Bachelor of Technology
In
Electronics and Communication
Underthesupervisionof
Mr. Souvik Pal
Department of Computer Science and Engineering Engineering
CONTENTS
 Abstract
 Aim Of heE Project
 What is big data
 Tools We Have Use In Our Project
 WHAT WE HAVE DONE IN OUR PROJECT
 Some Output Of Our Project
 Discussion
 Conclusion
ABSTRACT
 To analyzing the big data of flight database to identify the various
factors which drives an airline company into loss.
 For analyzing the data we have used the major technologies such
as Big data concepts, Apache Pig, Map Reduce etc.
 We have created some queries which gives a clear view of reasons
on which an airline company should work or take some step in
order to get increases the predictability.
 We believe that our approach will be helpful to bring some growth
in business of airline companies as well as the business analyst.
AIM OF THE PROJECT
 The main aim of the project was optimization.
At first we had to analyze the data so that we can work upon the obvious
reasons which today’s people suffer while travelling in flights .
Here we generate few queries and try to optimize the time between
various destinations so that we can use it for some better purpose and
improvements,
It is noticed that many a time due to the same reasons many flights get
delayed over and over again so we accumulated data of a certain period of
time analyzed it and worked over certain areas.
What is big data
A collection of data setssolarge and
complex that it becomes difficult to
processusing on-hand database
managementtools or traditional
data processing applications.”
OR
“Big data is high-volume, high-
velocity and high-variety
information assetsthat demand cost-
effective, innovative forms of
information processing for enhanced
insight and decision making.”
Tools We Have Use In Our Project
WHAT WE HAVE DONE IN OUR PROJECT
I. Find out the top five most visited destinations.
II. Which month has seen the most number of cancellations due to bad weather?
III. Top ten origins with the highest, AVG departure_delay.
IV. Which route (origin & destination) has seen the maximum diversion?
V. Maximum no of flights cancelled in which month?
VI. Find out the top ten ORIGINS for which the reason of delay Is ” security _ delay”.
VII. Top ten destinations with the average arrival_ delay?
VIII. Top twenty five airports where minimum numbers of flight landed?
IX. Which route origin and destination has average Air System delay?
X. Top ten origins with the highest Average WEATHER_DELAY?
XI. Reason for which maximum numbers of flights were cancelled?
XII. Which airport has seen the maximum number of flights cancelled?
XIII. Find the top 10 routes with maximum distance, between origin and destination?
Which route (origin & destination) has seen the maximum
diversion?
Queries Answer
Top twenty five airports where minimum numbers of flight
landed?
Queries Answer
Which airport has seen the maximum number of flights cancelled?
Queries Answer
DISCUSSION
 Hence in the given project we analyzed a given flight data with 1Crore * 31 Rows and
Columns respectively and then going through it. There were around thirteen queries after
analyzing the data carefully.
 These queries mainly consisted of reasons for delay and no. of flights and its origins and
destinations.
 Hence after going through the problems we tried our best to minimize the loses so that we
can increase the profits of the flight companies and reduce the harassments caused the
passengers due to weather conditions, air system delay, security delay, airline delay, late
aircraft delay, weather delay.
 We along with our project mentor took forward the steps to look into the project and hence
find out in details which is kept unseen till now.
CONCLUSION
 Hadoop Mapreduce is now a popular choice for
performing large-scale data analytics. Bigdata analytics
using pig
 sheds light on significant issues faced by flight data and
we can find the numbers of flight cancelled per month.
THANK YOU

More Related Content

What's hot

Npd presentation file risk management
Npd presentation file   risk managementNpd presentation file   risk management
Npd presentation file risk managementOmid Aminzadeh Gohari
 
rules of formulating network planning model .
rules of formulating network planning model .rules of formulating network planning model .
rules of formulating network planning model .
ritambharaaatre
 
Unit 7 dynamic programming
Unit 7   dynamic programmingUnit 7   dynamic programming
Unit 7 dynamic programming
Nageswara Rao Thots
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
Julien SIMON
 
Unit 8-risk manaegement (1) -
Unit 8-risk manaegement (1) - Unit 8-risk manaegement (1) -
Unit 8-risk manaegement (1) -
Shashi Kumar
 
Productivity management
Productivity management   Productivity management
Productivity management
Nikhil Nimbarte
 
Stock market analysis using supervised machine learning
Stock market analysis using supervised machine learningStock market analysis using supervised machine learning
Stock market analysis using supervised machine learning
Priyanshu Gandhi
 
Managing contracts
Managing contractsManaging contracts
Managing contracts
tumetr1
 
Risk Management
Risk ManagementRisk Management
Risk Management
Saqib Raza
 
Enterprise Resource Planning Unit 4 post implementation on ERP
Enterprise Resource Planning Unit 4 post implementation on ERPEnterprise Resource Planning Unit 4 post implementation on ERP
Enterprise Resource Planning Unit 4 post implementation on ERP
Ganesha Pandian
 
MG6088 SOFTWARE PROJECT MANAGEMENT
MG6088 SOFTWARE PROJECT MANAGEMENTMG6088 SOFTWARE PROJECT MANAGEMENT
MG6088 SOFTWARE PROJECT MANAGEMENT
Kathirvel Ayyaswamy
 
Gr 12 Difference Between IT an Information Systems
Gr 12 Difference Between IT an Information SystemsGr 12 Difference Between IT an Information Systems
Gr 12 Difference Between IT an Information Systems
university of education,Lahore
 
Project risk management notes bagamoyo 12.10.2017 final v1
Project risk management  notes bagamoyo 12.10.2017 final v1Project risk management  notes bagamoyo 12.10.2017 final v1
Project risk management notes bagamoyo 12.10.2017 final v1
EMAC Consulting Group
 
08-Management Information System
08-Management Information System08-Management Information System
08-Management Information System
Wahyu Wijanarko
 
Software Project Management: Risk Management
Software Project Management: Risk ManagementSoftware Project Management: Risk Management
Software Project Management: Risk Management
Minhas Kamal
 

What's hot (20)

Npd presentation file risk management
Npd presentation file   risk managementNpd presentation file   risk management
Npd presentation file risk management
 
rules of formulating network planning model .
rules of formulating network planning model .rules of formulating network planning model .
rules of formulating network planning model .
 
Unit 7 dynamic programming
Unit 7   dynamic programmingUnit 7   dynamic programming
Unit 7 dynamic programming
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
 
Unit 8-risk manaegement (1) -
Unit 8-risk manaegement (1) - Unit 8-risk manaegement (1) -
Unit 8-risk manaegement (1) -
 
Productivity management
Productivity management   Productivity management
Productivity management
 
Stock market analysis using supervised machine learning
Stock market analysis using supervised machine learningStock market analysis using supervised machine learning
Stock market analysis using supervised machine learning
 
Managing contracts
Managing contractsManaging contracts
Managing contracts
 
Takt Time, Cycle Time & Line Balancing
Takt Time, Cycle Time & Line BalancingTakt Time, Cycle Time & Line Balancing
Takt Time, Cycle Time & Line Balancing
 
Risk Management
Risk ManagementRisk Management
Risk Management
 
Role of IT in Business
Role of IT in BusinessRole of IT in Business
Role of IT in Business
 
Sla and kpi
Sla and kpiSla and kpi
Sla and kpi
 
Enterprise Resource Planning Unit 4 post implementation on ERP
Enterprise Resource Planning Unit 4 post implementation on ERPEnterprise Resource Planning Unit 4 post implementation on ERP
Enterprise Resource Planning Unit 4 post implementation on ERP
 
Input and output design
Input and output designInput and output design
Input and output design
 
MG6088 SOFTWARE PROJECT MANAGEMENT
MG6088 SOFTWARE PROJECT MANAGEMENTMG6088 SOFTWARE PROJECT MANAGEMENT
MG6088 SOFTWARE PROJECT MANAGEMENT
 
Gr 12 Difference Between IT an Information Systems
Gr 12 Difference Between IT an Information SystemsGr 12 Difference Between IT an Information Systems
Gr 12 Difference Between IT an Information Systems
 
Spm unit 1
Spm unit 1Spm unit 1
Spm unit 1
 
Project risk management notes bagamoyo 12.10.2017 final v1
Project risk management  notes bagamoyo 12.10.2017 final v1Project risk management  notes bagamoyo 12.10.2017 final v1
Project risk management notes bagamoyo 12.10.2017 final v1
 
08-Management Information System
08-Management Information System08-Management Information System
08-Management Information System
 
Software Project Management: Risk Management
Software Project Management: Risk ManagementSoftware Project Management: Risk Management
Software Project Management: Risk Management
 

Similar to Flight data analysis using apache pig--------------Final Year Project

Big Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysBig Data to avoid weather related flight delays
Big Data to avoid weather related flight delays
AkshatGiri3
 
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptxbigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
eternalisone
 
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Patrick Van Renterghem
 
Big Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical IntelligenceBig Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical Intelligence
Anand Narayanan
 
How Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateHow Bluemix Helps NASA Innovate
How Bluemix Helps NASA Innovate
IBM
 
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTSHOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTSSunil Kakade
 
Flight delay detection data mining project
Flight delay detection data mining projectFlight delay detection data mining project
Flight delay detection data mining project
Akshay Kumar Bhushan
 
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdfBusiness Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
info189835
 
Air Travel Analytics in SAS
Air Travel Analytics in SASAir Travel Analytics in SAS
Air Travel Analytics in SASRohan Nanda
 
INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3David Foster
 
Validating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validatorValidating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validator
Prachi Gupta
 
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docxAirport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
jesuslightbody
 
Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining? Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining?
Wilfried Hoge
 
Airline Analysis of Data Using Hadoop
Airline Analysis of Data Using HadoopAirline Analysis of Data Using Hadoop
Airline Analysis of Data Using Hadoop
Greater Noida Institute Of Technology
 
Avi news letter 15th issue
Avi news letter 15th issueAvi news letter 15th issue
Avi news letter 15th issue
AvitrueSpares
 
AVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th IssueAVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th Issue
Avitrue Spares
 
A statistical approach to predict flight delay
A statistical approach to predict flight delayA statistical approach to predict flight delay
A statistical approach to predict flight delay
iDTechTechnologies
 
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
pmccann1984
 

Similar to Flight data analysis using apache pig--------------Final Year Project (20)

BritishAirways-CS-FINAL
BritishAirways-CS-FINALBritishAirways-CS-FINAL
BritishAirways-CS-FINAL
 
Big Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysBig Data to avoid weather related flight delays
Big Data to avoid weather related flight delays
 
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptxbigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
 
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
 
Big Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical IntelligenceBig Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical Intelligence
 
How Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateHow Bluemix Helps NASA Innovate
How Bluemix Helps NASA Innovate
 
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTSHOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
 
Flight delay detection data mining project
Flight delay detection data mining projectFlight delay detection data mining project
Flight delay detection data mining project
 
industrial
industrialindustrial
industrial
 
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdfBusiness Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
 
Air Travel Analytics in SAS
Air Travel Analytics in SASAir Travel Analytics in SAS
Air Travel Analytics in SAS
 
INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3
 
Validating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validatorValidating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validator
 
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docxAirport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
 
Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining? Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining?
 
Airline Analysis of Data Using Hadoop
Airline Analysis of Data Using HadoopAirline Analysis of Data Using Hadoop
Airline Analysis of Data Using Hadoop
 
Avi news letter 15th issue
Avi news letter 15th issueAvi news letter 15th issue
Avi news letter 15th issue
 
AVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th IssueAVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th Issue
 
A statistical approach to predict flight delay
A statistical approach to predict flight delayA statistical approach to predict flight delay
A statistical approach to predict flight delay
 
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
 

Recently uploaded

一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 

Recently uploaded (20)

一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 

Flight data analysis using apache pig--------------Final Year Project

  • 1. Submitted By............ SANJIB MITRA(150403074) SANTANU SINGHA (150403076) SHRUTI KULSHRESTHA (150403085) SUBHAM KUMAR MAHANTY(150403101) Bachelor of Technology In Electronics and Communication Underthesupervisionof Mr. Souvik Pal Department of Computer Science and Engineering Engineering
  • 2. CONTENTS  Abstract  Aim Of heE Project  What is big data  Tools We Have Use In Our Project  WHAT WE HAVE DONE IN OUR PROJECT  Some Output Of Our Project  Discussion  Conclusion
  • 3. ABSTRACT  To analyzing the big data of flight database to identify the various factors which drives an airline company into loss.  For analyzing the data we have used the major technologies such as Big data concepts, Apache Pig, Map Reduce etc.  We have created some queries which gives a clear view of reasons on which an airline company should work or take some step in order to get increases the predictability.  We believe that our approach will be helpful to bring some growth in business of airline companies as well as the business analyst.
  • 4. AIM OF THE PROJECT  The main aim of the project was optimization. At first we had to analyze the data so that we can work upon the obvious reasons which today’s people suffer while travelling in flights . Here we generate few queries and try to optimize the time between various destinations so that we can use it for some better purpose and improvements, It is noticed that many a time due to the same reasons many flights get delayed over and over again so we accumulated data of a certain period of time analyzed it and worked over certain areas.
  • 5. What is big data A collection of data setssolarge and complex that it becomes difficult to processusing on-hand database managementtools or traditional data processing applications.” OR “Big data is high-volume, high- velocity and high-variety information assetsthat demand cost- effective, innovative forms of information processing for enhanced insight and decision making.”
  • 6. Tools We Have Use In Our Project
  • 7. WHAT WE HAVE DONE IN OUR PROJECT I. Find out the top five most visited destinations. II. Which month has seen the most number of cancellations due to bad weather? III. Top ten origins with the highest, AVG departure_delay. IV. Which route (origin & destination) has seen the maximum diversion? V. Maximum no of flights cancelled in which month? VI. Find out the top ten ORIGINS for which the reason of delay Is ” security _ delay”. VII. Top ten destinations with the average arrival_ delay? VIII. Top twenty five airports where minimum numbers of flight landed? IX. Which route origin and destination has average Air System delay? X. Top ten origins with the highest Average WEATHER_DELAY? XI. Reason for which maximum numbers of flights were cancelled? XII. Which airport has seen the maximum number of flights cancelled? XIII. Find the top 10 routes with maximum distance, between origin and destination?
  • 8.
  • 9. Which route (origin & destination) has seen the maximum diversion? Queries Answer
  • 10. Top twenty five airports where minimum numbers of flight landed? Queries Answer
  • 11. Which airport has seen the maximum number of flights cancelled? Queries Answer
  • 12. DISCUSSION  Hence in the given project we analyzed a given flight data with 1Crore * 31 Rows and Columns respectively and then going through it. There were around thirteen queries after analyzing the data carefully.  These queries mainly consisted of reasons for delay and no. of flights and its origins and destinations.  Hence after going through the problems we tried our best to minimize the loses so that we can increase the profits of the flight companies and reduce the harassments caused the passengers due to weather conditions, air system delay, security delay, airline delay, late aircraft delay, weather delay.  We along with our project mentor took forward the steps to look into the project and hence find out in details which is kept unseen till now.
  • 13. CONCLUSION  Hadoop Mapreduce is now a popular choice for performing large-scale data analytics. Bigdata analytics using pig  sheds light on significant issues faced by flight data and we can find the numbers of flight cancelled per month.