SlideShare a Scribd company logo
1 of 14
Submitted By............
SANJIB MITRA(150403074)
SANTANU SINGHA (150403076)
SHRUTI KULSHRESTHA (150403085)
SUBHAM KUMAR MAHANTY(150403101)
Bachelor of Technology
In
Electronics and Communication
Underthesupervisionof
Mr. Souvik Pal
Department of Computer Science and Engineering Engineering
CONTENTS
 Abstract
 Aim Of heE Project
 What is big data
 Tools We Have Use In Our Project
 WHAT WE HAVE DONE IN OUR PROJECT
 Some Output Of Our Project
 Discussion
 Conclusion
ABSTRACT
 To analyzing the big data of flight database to identify the various
factors which drives an airline company into loss.
 For analyzing the data we have used the major technologies such
as Big data concepts, Apache Pig, Map Reduce etc.
 We have created some queries which gives a clear view of reasons
on which an airline company should work or take some step in
order to get increases the predictability.
 We believe that our approach will be helpful to bring some growth
in business of airline companies as well as the business analyst.
AIM OF THE PROJECT
 The main aim of the project was optimization.
At first we had to analyze the data so that we can work upon the obvious
reasons which today’s people suffer while travelling in flights .
Here we generate few queries and try to optimize the time between
various destinations so that we can use it for some better purpose and
improvements,
It is noticed that many a time due to the same reasons many flights get
delayed over and over again so we accumulated data of a certain period of
time analyzed it and worked over certain areas.
What is big data
A collection of data setssolarge and
complex that it becomes difficult to
processusing on-hand database
managementtools or traditional
data processing applications.”
OR
“Big data is high-volume, high-
velocity and high-variety
information assetsthat demand cost-
effective, innovative forms of
information processing for enhanced
insight and decision making.”
Tools We Have Use In Our Project
WHAT WE HAVE DONE IN OUR PROJECT
I. Find out the top five most visited destinations.
II. Which month has seen the most number of cancellations due to bad weather?
III. Top ten origins with the highest, AVG departure_delay.
IV. Which route (origin & destination) has seen the maximum diversion?
V. Maximum no of flights cancelled in which month?
VI. Find out the top ten ORIGINS for which the reason of delay Is ” security _ delay”.
VII. Top ten destinations with the average arrival_ delay?
VIII. Top twenty five airports where minimum numbers of flight landed?
IX. Which route origin and destination has average Air System delay?
X. Top ten origins with the highest Average WEATHER_DELAY?
XI. Reason for which maximum numbers of flights were cancelled?
XII. Which airport has seen the maximum number of flights cancelled?
XIII. Find the top 10 routes with maximum distance, between origin and destination?
Which route (origin & destination) has seen the maximum
diversion?
Queries Answer
Top twenty five airports where minimum numbers of flight
landed?
Queries Answer
Which airport has seen the maximum number of flights cancelled?
Queries Answer
DISCUSSION
 Hence in the given project we analyzed a given flight data with 1Crore * 31 Rows and
Columns respectively and then going through it. There were around thirteen queries after
analyzing the data carefully.
 These queries mainly consisted of reasons for delay and no. of flights and its origins and
destinations.
 Hence after going through the problems we tried our best to minimize the loses so that we
can increase the profits of the flight companies and reduce the harassments caused the
passengers due to weather conditions, air system delay, security delay, airline delay, late
aircraft delay, weather delay.
 We along with our project mentor took forward the steps to look into the project and hence
find out in details which is kept unseen till now.
CONCLUSION
 Hadoop Mapreduce is now a popular choice for
performing large-scale data analytics. Bigdata analytics
using pig
 sheds light on significant issues faced by flight data and
we can find the numbers of flight cancelled per month.
THANK YOU

More Related Content

What's hot

Heart disease prediction using machine learning algorithm
Heart disease prediction using machine learning algorithm Heart disease prediction using machine learning algorithm
Heart disease prediction using machine learning algorithm Kedar Damkondwar
 
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...AzarulIkhwan
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big dataPrashant Sharma
 
Air Ticket Price Prediction.pdf
Air Ticket Price Prediction.pdfAir Ticket Price Prediction.pdf
Air Ticket Price Prediction.pdfAdityaAryan45
 
ANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUES
ANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUESANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUES
ANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUESIRJET Journal
 
Cloud Computing Architecture
Cloud Computing Architecture Cloud Computing Architecture
Cloud Computing Architecture Vasu Jain
 
Business cases for the need of cloud computing
Business cases for the need of cloud computingBusiness cases for the need of cloud computing
Business cases for the need of cloud computingDr.Neeraj Kumar Pandey
 
Heart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means ClassifierHeart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means ClassifierIOSR Journals
 

What's hot (20)

Big Data Synopsis
Big Data SynopsisBig Data Synopsis
Big Data Synopsis
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Heart disease prediction using machine learning algorithm
Heart disease prediction using machine learning algorithm Heart disease prediction using machine learning algorithm
Heart disease prediction using machine learning algorithm
 
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
 
Big Data
Big DataBig Data
Big Data
 
Data analysis of weather forecasting
Data analysis of weather forecastingData analysis of weather forecasting
Data analysis of weather forecasting
 
Rain project
Rain project Rain project
Rain project
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
 
Air Ticket Price Prediction.pdf
Air Ticket Price Prediction.pdfAir Ticket Price Prediction.pdf
Air Ticket Price Prediction.pdf
 
Big data architecture
Big data architectureBig data architecture
Big data architecture
 
ANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUES
ANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUESANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUES
ANALYSIS AND PREDICTION OF RAINFALL USING MACHINE LEARNING TECHNIQUES
 
1. GRID COMPUTING
1. GRID COMPUTING1. GRID COMPUTING
1. GRID COMPUTING
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Cloud Computing Architecture
Cloud Computing Architecture Cloud Computing Architecture
Cloud Computing Architecture
 
Modelling and simulation
Modelling and simulationModelling and simulation
Modelling and simulation
 
Data analytics
Data analyticsData analytics
Data analytics
 
Airline Analysis of Data Using Hadoop
Airline Analysis of Data Using HadoopAirline Analysis of Data Using Hadoop
Airline Analysis of Data Using Hadoop
 
Business cases for the need of cloud computing
Business cases for the need of cloud computingBusiness cases for the need of cloud computing
Business cases for the need of cloud computing
 
Heart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means ClassifierHeart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means Classifier
 

Similar to Flight data analysis using apache pig--------------Final Year Project

bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptxbigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptxeternalisone
 
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...Patrick Van Renterghem
 
Big Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical IntelligenceBig Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical IntelligenceAnand Narayanan
 
How Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateHow Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateIBM
 
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTSHOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTSSunil Kakade
 
Flight delay detection data mining project
Flight delay detection data mining projectFlight delay detection data mining project
Flight delay detection data mining projectAkshay Kumar Bhushan
 
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdfBusiness Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdfinfo189835
 
Air Travel Analytics in SAS
Air Travel Analytics in SASAir Travel Analytics in SAS
Air Travel Analytics in SASRohan Nanda
 
INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3David Foster
 
Validating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validatorValidating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validatorPrachi Gupta
 
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docxAirport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docxjesuslightbody
 
Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining? Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining? Wilfried Hoge
 
Avi news letter 15th issue
Avi news letter 15th issueAvi news letter 15th issue
Avi news letter 15th issueAvitrueSpares
 
AVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th IssueAVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th IssueAvitrue Spares
 
A statistical approach to predict flight delay
A statistical approach to predict flight delayA statistical approach to predict flight delay
A statistical approach to predict flight delayiDTechTechnologies
 
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...pmccann1984
 
Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...
Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...
Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...Routesonline
 

Similar to Flight data analysis using apache pig--------------Final Year Project (20)

BritishAirways-CS-FINAL
BritishAirways-CS-FINALBritishAirways-CS-FINAL
BritishAirways-CS-FINAL
 
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptxbigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
bigdatatoavoidweatherrelatedflightdelays-201219091805.pptx
 
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
Improving Passenger Experience at Brussels Airport through (real-time) Analyt...
 
Big Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical IntelligenceBig Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical Intelligence
 
How Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateHow Bluemix Helps NASA Innovate
How Bluemix Helps NASA Innovate
 
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTSHOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
HOW_DATA_CAN_HELP_TO_REDUCE_AVIATION_ACCIDENTS
 
Flight delay detection data mining project
Flight delay detection data mining projectFlight delay detection data mining project
Flight delay detection data mining project
 
industrial
industrialindustrial
industrial
 
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdfBusiness Case London Heathrow Airport Launches BI and Machine Learn.pdf
Business Case London Heathrow Airport Launches BI and Machine Learn.pdf
 
Air Travel Analytics in SAS
Air Travel Analytics in SASAir Travel Analytics in SAS
Air Travel Analytics in SAS
 
INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3INFORM-Measuring and Monitoring Aircraft Turn Operations v3
INFORM-Measuring and Monitoring Aircraft Turn Operations v3
 
Validating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validatorValidating enterprise data lake using open source data validator
Validating enterprise data lake using open source data validator
 
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docxAirport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
Airport ConsultingProject BriefingsAVS 4999 – Aviation Syste.docx
 
Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining? Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining?
 
Avi news letter 15th issue
Avi news letter 15th issueAvi news letter 15th issue
Avi news letter 15th issue
 
AVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th IssueAVI-NEWS Letter 15th Issue
AVI-NEWS Letter 15th Issue
 
A statistical approach to predict flight delay
A statistical approach to predict flight delayA statistical approach to predict flight delay
A statistical approach to predict flight delay
 
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
World Routes 2014 Keynote Presentation – How Big Date Changes Aviation Effici...
 
Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...
Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...
Keynote Presentation – How Big Date Changes Aviation Efficiency (Josh Marks, ...
 
Big data
Big data Big data
Big data
 

Recently uploaded

Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 

Recently uploaded (20)

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 

Flight data analysis using apache pig--------------Final Year Project

  • 1. Submitted By............ SANJIB MITRA(150403074) SANTANU SINGHA (150403076) SHRUTI KULSHRESTHA (150403085) SUBHAM KUMAR MAHANTY(150403101) Bachelor of Technology In Electronics and Communication Underthesupervisionof Mr. Souvik Pal Department of Computer Science and Engineering Engineering
  • 2. CONTENTS  Abstract  Aim Of heE Project  What is big data  Tools We Have Use In Our Project  WHAT WE HAVE DONE IN OUR PROJECT  Some Output Of Our Project  Discussion  Conclusion
  • 3. ABSTRACT  To analyzing the big data of flight database to identify the various factors which drives an airline company into loss.  For analyzing the data we have used the major technologies such as Big data concepts, Apache Pig, Map Reduce etc.  We have created some queries which gives a clear view of reasons on which an airline company should work or take some step in order to get increases the predictability.  We believe that our approach will be helpful to bring some growth in business of airline companies as well as the business analyst.
  • 4. AIM OF THE PROJECT  The main aim of the project was optimization. At first we had to analyze the data so that we can work upon the obvious reasons which today’s people suffer while travelling in flights . Here we generate few queries and try to optimize the time between various destinations so that we can use it for some better purpose and improvements, It is noticed that many a time due to the same reasons many flights get delayed over and over again so we accumulated data of a certain period of time analyzed it and worked over certain areas.
  • 5. What is big data A collection of data setssolarge and complex that it becomes difficult to processusing on-hand database managementtools or traditional data processing applications.” OR “Big data is high-volume, high- velocity and high-variety information assetsthat demand cost- effective, innovative forms of information processing for enhanced insight and decision making.”
  • 6. Tools We Have Use In Our Project
  • 7. WHAT WE HAVE DONE IN OUR PROJECT I. Find out the top five most visited destinations. II. Which month has seen the most number of cancellations due to bad weather? III. Top ten origins with the highest, AVG departure_delay. IV. Which route (origin & destination) has seen the maximum diversion? V. Maximum no of flights cancelled in which month? VI. Find out the top ten ORIGINS for which the reason of delay Is ” security _ delay”. VII. Top ten destinations with the average arrival_ delay? VIII. Top twenty five airports where minimum numbers of flight landed? IX. Which route origin and destination has average Air System delay? X. Top ten origins with the highest Average WEATHER_DELAY? XI. Reason for which maximum numbers of flights were cancelled? XII. Which airport has seen the maximum number of flights cancelled? XIII. Find the top 10 routes with maximum distance, between origin and destination?
  • 8.
  • 9. Which route (origin & destination) has seen the maximum diversion? Queries Answer
  • 10. Top twenty five airports where minimum numbers of flight landed? Queries Answer
  • 11. Which airport has seen the maximum number of flights cancelled? Queries Answer
  • 12. DISCUSSION  Hence in the given project we analyzed a given flight data with 1Crore * 31 Rows and Columns respectively and then going through it. There were around thirteen queries after analyzing the data carefully.  These queries mainly consisted of reasons for delay and no. of flights and its origins and destinations.  Hence after going through the problems we tried our best to minimize the loses so that we can increase the profits of the flight companies and reduce the harassments caused the passengers due to weather conditions, air system delay, security delay, airline delay, late aircraft delay, weather delay.  We along with our project mentor took forward the steps to look into the project and hence find out in details which is kept unseen till now.
  • 13. CONCLUSION  Hadoop Mapreduce is now a popular choice for performing large-scale data analytics. Bigdata analytics using pig  sheds light on significant issues faced by flight data and we can find the numbers of flight cancelled per month.