SlideShare a Scribd company logo
1 of 40
Capstone Project - 1
Hotel Booking Analysis
Team members:
Minaz Uddin R.
Kiran Mamtani
1
Content
• Importing libraries and Database
• Summary of Data
• Data wrangling
• Data Analysis
• Data visualization
• Challenges
• Conclusion
2
Importing Libraries and Database
• NumPy
• Pandas
• Matplotlib
• Seaborn
3
Summary of Data
About Data - This Database consists of information
between 2 different hotels over 3 years (2015 to 2017)
Size of Data:
● Rows - 119390
● Columns - 32
Source: https://www.freepik.com/
4
Data wrangling
Data cleaning :
● This Database consists of null values ‘Nan’.
● Replaced those null values with zero.
Data preparation :
● Database consist of different types of data types.
● Data types : integer, float, object
● Converted the ‘Float’ Data type into ‘integer’ Data type
5
Data Analysis
Below are some questions came to our mind before analysis and we thought to include them…
1. Which type of hotels they have booked
2. Which type of customer are booking the hotels
3. From which medium the maximum booking are happening
4. Find the rate of cancelation of hotels
5. Find the month and year that has maximum booking ? Compared
it with hotel type and hotel canceled
6. Find the top 10 countries that has maximum bookings
Continues…
6
7. Find the relationship between room change and cancellations
8. What if you wanted to predict whether or not a hotel was likely to
receive a disproportionately high number of special request ?
9. What kind of people book the most hotel ?
10. For how long the people stay in hotel ?
11. Percentage of repeated guest in hotel ?
12. Analyze average ADR for given statement
a) Avg ADR for each year
b) Avg ADR for resort and hotels
c) Avg ADR for each country
d) Avg ADR of top 10 countries who has highest number of bookings
Data Analysis
7
Data Analysis and Visualization
8
Here we can observe that, out
of 100%, City hotels are more
booked (66.4%) compared to
resort hotels (33.6%)
1. Which type of hotels they have booked ?
9
2. Which type of customers is booking hotels?
Customer type Count Customer type %
Transient 89613 75.059050
Transient-party 25124 21.043638
Contract 4076 3.414021
Group 577 0.483290
So the maximum customer type is transient (75%). It means that
these are the customers who are living for the short period of time.
10
3. From which medium the maximum booking
had happened ?
Market segment Market segment
counts
Market segment
percentage
Online TA 56477 47.304632
Offline TA/TO 24219 20.285619
Groups 19811 16.593517
Direct 12606 10.558673
Corporate 5295 4.435045
Complementary 743 0.622330
Aviation 237 0.198509
Undefined 2 0.001675
So 47% of the customers
are booking hotels online
11
4. Find the rate of cancellation of hotels?
When compared with City and Resort
hotels. City hotel cancellation is 42%
and Resort hotel cancellation is 28%
Canceled Counts Percentage
0 (Not canceled) 75166 62.95
1 (Canceled) 44224 37.04
So 37% of total booking got canceled
12
5. Find the month and year that has
maximum booking? Compare it with
hotel type and hotel canceled.
So August has highest
number of bookings.
But this include both
resort and city hotels
13
Lets see resort and city differently
So resort and city both hotels has highest number of bookings in August month
14
To find the number of cancellations on monthly basis
So when compared to all the months ,August month has maximum
number of cancellations followed by July and may month.
15
Now lets see which year has the highest number bookings
Bookings has increased in 2016 than decreased in 2017. And max
booking was in 2016 which include both resort and city hotels
16
Now lets see resort and city hotels differently in each year
So resort and city both hotels has highest number of bookings in 2016
17
Lets see which year has maximum cancellation
We could find that maximum number of booking are been cancelled in the year 2016
18
Now lets see year wise booking of each month
• As we can see in above graph, we have data
from July 2015 till August 2017.
• Earlier we have seen August is the month
who has maximum booking but that was
cumulative result of all three years. August is
repeating in every year.
• Here in year 2015, the maximum booking was
in September. In year 2016, it is October and
in year 2017, it is May.
• In none of the year we have got August.
• So we can't say August has maximum booking
in each year, it has maximum booking but in
the given period of time in our data base.
• If our data was from January to December or
from July 2015 till June 2017 then we can get
accurate number in the outcome
19
6. Find the top 10 countries that had
maximum bookings?
Portugal has highest bookings followed by Germany and France
20
Lets compare country wise bookings with hotel types
• Portugal has max booking in resort as well as city hotel.
• The 2nd highest booking is in Germany for both hotel & city hotels.
• The 3rd highest booking for resort hotel is in Spain and for city hotel is in France.
21
Now lets see which countries has highest cancellation
Portugal has highest cancellation followed by Germany and Spain
22
Now lets see the actual people stayed and enjoyed
their holiday’s
So after looking at the actual booking data, USA has taken
place in top 10 and BRA (Brazil) got removed from top 10
23
7. Find the relationship between room
change and cancellation?
Customer who has not changed
room type has cancellation more
and 5.7% of people cancelled room
after they had assigned different
room from booking room.
24
8. What if you wanted to predict whether or not
a hotel was likely to receive a disproportionately
high number of special requests?
So the probability of 0
special request is highest
25
9. What Kind of People Book the Most Hotels?
So as it is seen max booking was done by couples
26
Now lets see which type of people has canceled
booking more
So couples has canceled maximum booking followed by single then family respectively
27
10. For how long people stay in hotel?
So most of people staying in hotel for 2 days follow by 3 days and 1 day.
28
Now lets compare resort and city hotel stays...
For city hotel most popular stay in 2,3,1 days respectively.
And for resort its 1,7,2 days respectively
29
11. Percentage of Repeated Guests in hotels.
In city hotel 53.3% people are
repeated guest while in resort
the percentage is 46.7%.
30
12. Calculate average ADR for given
statements ?
a) Avg ADR for each year
As we can see in above graph the avg
ADR is gradually increasing from 2015
to 2017. So in 2017 the avg ADR is max
31
b) Avg ADR for hotel and resort
As the ADR between city hotel and
resort are the Slight change. when
compare to city hotel is more
32
c) Avg ADR for each country
So the highest ADR is of Djibouti (DJI) . But the above 10 countries are not our top 10 countries
who has highest booking. So lets see our top 10 countries with highest booking and their ADR
33
d) Avg ADR of top 10 countries who has highest number of bookings
So as we know Portugal has highest booking and its ADR is 92. And
the highest ADR from top ten countries is of Spain i.e. 116.99 ~ 117
34
Conclusion
 In our analysis of hotel booking we have tried different methods and visualized it by graphs.
 We started with asking few question to ourself and drafted the same to get the best solution.
 The first and most important step was cleaning the data. And then our analysis of data started.
Here is the conclusion of all the solution we got from our analysis:
• People are preferring hotel booking more than the resort hotels.
• The preferred booking of mode is online booking
• People preferred to stay for shorter period of time. For city hotel the popular stay is 2,3,1 days
respectively and for resort hotel it is 1,7,2 days respectively.
• Couples books more hotel than single and family.
• 37% of people are cancelling their bookings (City & Resort). Resort hotel has less cancellation rate
(28%) in compare to city hotels (42%)
Continues… 35
Conclusion
Now our data is from July 2015 till August 2017. But we have analyzed for year and months so
it will not give us accurate results.
• 2016 has highest booking in both hotel & resort
• August has highest number of booking in both hotel & resort
• In year 2015, the maximum booking was in September. In year 2016, it was October and in year
2017, it was May
• Also, 2016 and August has highest number of cancellation
• The top three countries who has highest number of bookings are Portugal, Germany and France
• If we analyze hotel wise: Portugal has max booking in resort as well as city hotel. 2nd highest
booking is in Germany. The 3rd highest booking for resort hotel is Spain and for city hotel is France.
36
Continues…
Conclusion
•Portugal, Germany and Spain has maximum cancellation
• We analyzed relationship between room changed and cancellation: We conclude that Customer who
has not changed room type has cancellation more
• 5.7% of people cancelled room after they had assigned different room from booking room.
• The highest probability of 0 special request is highest
• The highest average ADR is for 2017 i.e. 114.63
• The average ADR for city hotel is 105.30 and for resort hotel is 94.95
• The highest average ADR is for Djibouti (DJI) country i.e. 273.
• And the highest average ADR for Portugal is 92.04
37
Summation
• Q2 & Q3 are having higher number of Bookings So hotels can avail some discount offers in Q1
and Q4 as Bookings rate is less in these quarters.
• Hotels can promote their business online as most of the Bookings are coming from online
portals.
• Special discount can be given to repeated guests.
• City Hotel can come up with special plans for long staying guests.
• Resort Hotel can come up with special plans for short staying guests
38
Challenges
• Most difficult part was to decide how to clean data. Should we remove the columns? Or should
we replace it with mean? Or should we replace it with 0.
• Second was should we remove all the canceled rows then continue with the analysis. Or we
should continue with data and compare each analysis with canceled column.
• At the starting point of the project, got confused in groupby method.
• It was also difficult to choose between what to include or what not.
39
Thank you!
40

More Related Content

Similar to Science preject.pptx

Global hotel-business-index
Global hotel-business-indexGlobal hotel-business-index
Global hotel-business-indexSarah Gasim
 
Advantages of Data mining Techniques in improving CRM in the Hospitality domain
Advantages of Data mining Techniques  in improving CRM in the Hospitality domainAdvantages of Data mining Techniques  in improving CRM in the Hospitality domain
Advantages of Data mining Techniques in improving CRM in the Hospitality domainWafa Raboudi
 
3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?
3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?
3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?Nikolaos Tsitsoulis, CRME
 
Demand for interaction
Demand for interactionDemand for interaction
Demand for interactionRingPartner
 
Hospitality industry research report
Hospitality industry research reportHospitality industry research report
Hospitality industry research reportAli Hamed
 
Implement a Sucessfull E-Distribution Strategy by Hopineo
Implement a Sucessfull E-Distribution Strategy by HopineoImplement a Sucessfull E-Distribution Strategy by Hopineo
Implement a Sucessfull E-Distribution Strategy by HopineoFlorie Thielin
 
Michael Mitroff Indocabs Report
Michael Mitroff Indocabs Report Michael Mitroff Indocabs Report
Michael Mitroff Indocabs Report MichaelMitroff
 
Sample report: Global Online Accommodation Booking Market 2018
Sample report: Global Online Accommodation Booking Market 2018Sample report: Global Online Accommodation Booking Market 2018
Sample report: Global Online Accommodation Booking Market 2018yStats.com
 
Amárach Hotel Investment Survey December 2013
Amárach Hotel Investment Survey December 2013Amárach Hotel Investment Survey December 2013
Amárach Hotel Investment Survey December 2013Amarach Research
 
Product Brochure: Global Online Accommodation Booking Market 2018
Product Brochure: Global Online Accommodation Booking Market 2018Product Brochure: Global Online Accommodation Booking Market 2018
Product Brochure: Global Online Accommodation Booking Market 2018yStats.com
 
Marketing information and research - September 2014
Marketing information and research - September 2014Marketing information and research - September 2014
Marketing information and research - September 2014Enrico Rosi
 
5 important US hotel and travel trends with the power to shape 2016
5 important US hotel and travel trends with the power to shape 20165 important US hotel and travel trends with the power to shape 2016
5 important US hotel and travel trends with the power to shape 2016SiteMinder
 
Working with Online Travel Agencies (so It Works for You)
Working with Online Travel Agencies (so It Works for You)Working with Online Travel Agencies (so It Works for You)
Working with Online Travel Agencies (so It Works for You)Arival
 
Implement a successfull e distribution strategy by hopineo
Implement a successfull e distribution strategy by hopineoImplement a successfull e distribution strategy by hopineo
Implement a successfull e distribution strategy by hopineoBen Voyage
 
Hotel Booking Analysis : This project aims to uncover the factors behind high...
Hotel Booking Analysis : This project aims to uncover the factors behind high...Hotel Booking Analysis : This project aims to uncover the factors behind high...
Hotel Booking Analysis : This project aims to uncover the factors behind high...DeepakBhatt43904
 
Beginners guide to revenue management
Beginners guide to revenue managementBeginners guide to revenue management
Beginners guide to revenue managementAriel Ampon
 

Similar to Science preject.pptx (20)

Global hotel-business-index
Global hotel-business-indexGlobal hotel-business-index
Global hotel-business-index
 
Advantages of Data mining Techniques in improving CRM in the Hospitality domain
Advantages of Data mining Techniques  in improving CRM in the Hospitality domainAdvantages of Data mining Techniques  in improving CRM in the Hospitality domain
Advantages of Data mining Techniques in improving CRM in the Hospitality domain
 
3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?
3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?
3rd Azerbaijan Tourism Forum - Rate Disparity. So Now What?
 
Demand for interaction
Demand for interactionDemand for interaction
Demand for interaction
 
Hospitality industry research report
Hospitality industry research reportHospitality industry research report
Hospitality industry research report
 
Implement a Sucessfull E-Distribution Strategy by Hopineo
Implement a Sucessfull E-Distribution Strategy by HopineoImplement a Sucessfull E-Distribution Strategy by Hopineo
Implement a Sucessfull E-Distribution Strategy by Hopineo
 
Michael Mitroff Indocabs Report
Michael Mitroff Indocabs Report Michael Mitroff Indocabs Report
Michael Mitroff Indocabs Report
 
Sample report: Global Online Accommodation Booking Market 2018
Sample report: Global Online Accommodation Booking Market 2018Sample report: Global Online Accommodation Booking Market 2018
Sample report: Global Online Accommodation Booking Market 2018
 
Amárach Hotel Investment Survey December 2013
Amárach Hotel Investment Survey December 2013Amárach Hotel Investment Survey December 2013
Amárach Hotel Investment Survey December 2013
 
Hotel Distribution Study Switzerland: Results for the Reference Year 2015
Hotel Distribution Study Switzerland: Results for the Reference Year 2015Hotel Distribution Study Switzerland: Results for the Reference Year 2015
Hotel Distribution Study Switzerland: Results for the Reference Year 2015
 
Product Brochure: Global Online Accommodation Booking Market 2018
Product Brochure: Global Online Accommodation Booking Market 2018Product Brochure: Global Online Accommodation Booking Market 2018
Product Brochure: Global Online Accommodation Booking Market 2018
 
Marketing information and research - September 2014
Marketing information and research - September 2014Marketing information and research - September 2014
Marketing information and research - September 2014
 
5 important US hotel and travel trends with the power to shape 2016
5 important US hotel and travel trends with the power to shape 20165 important US hotel and travel trends with the power to shape 2016
5 important US hotel and travel trends with the power to shape 2016
 
Distribution Survey for the Swiss Hotel Industry 2017: Strong Growth of OTAs
Distribution Survey for the Swiss Hotel Industry 2017: Strong Growth of OTAsDistribution Survey for the Swiss Hotel Industry 2017: Strong Growth of OTAs
Distribution Survey for the Swiss Hotel Industry 2017: Strong Growth of OTAs
 
Working with Online Travel Agencies (so It Works for You)
Working with Online Travel Agencies (so It Works for You)Working with Online Travel Agencies (so It Works for You)
Working with Online Travel Agencies (so It Works for You)
 
Implement a successfull e distribution strategy by hopineo
Implement a successfull e distribution strategy by hopineoImplement a successfull e distribution strategy by hopineo
Implement a successfull e distribution strategy by hopineo
 
Hotel Booking Analysis : This project aims to uncover the factors behind high...
Hotel Booking Analysis : This project aims to uncover the factors behind high...Hotel Booking Analysis : This project aims to uncover the factors behind high...
Hotel Booking Analysis : This project aims to uncover the factors behind high...
 
Company presentation
Company presentationCompany presentation
Company presentation
 
Beginners guide to revenue management
Beginners guide to revenue managementBeginners guide to revenue management
Beginners guide to revenue management
 
Rooms Blocks and Red flags
Rooms Blocks and Red flagsRooms Blocks and Red flags
Rooms Blocks and Red flags
 

Recently uploaded

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonJericReyAuditor
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 

Recently uploaded (20)

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lesson
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 

Science preject.pptx

  • 1. Capstone Project - 1 Hotel Booking Analysis Team members: Minaz Uddin R. Kiran Mamtani 1
  • 2. Content • Importing libraries and Database • Summary of Data • Data wrangling • Data Analysis • Data visualization • Challenges • Conclusion 2
  • 3. Importing Libraries and Database • NumPy • Pandas • Matplotlib • Seaborn 3
  • 4. Summary of Data About Data - This Database consists of information between 2 different hotels over 3 years (2015 to 2017) Size of Data: ● Rows - 119390 ● Columns - 32 Source: https://www.freepik.com/ 4
  • 5. Data wrangling Data cleaning : ● This Database consists of null values ‘Nan’. ● Replaced those null values with zero. Data preparation : ● Database consist of different types of data types. ● Data types : integer, float, object ● Converted the ‘Float’ Data type into ‘integer’ Data type 5
  • 6. Data Analysis Below are some questions came to our mind before analysis and we thought to include them… 1. Which type of hotels they have booked 2. Which type of customer are booking the hotels 3. From which medium the maximum booking are happening 4. Find the rate of cancelation of hotels 5. Find the month and year that has maximum booking ? Compared it with hotel type and hotel canceled 6. Find the top 10 countries that has maximum bookings Continues… 6
  • 7. 7. Find the relationship between room change and cancellations 8. What if you wanted to predict whether or not a hotel was likely to receive a disproportionately high number of special request ? 9. What kind of people book the most hotel ? 10. For how long the people stay in hotel ? 11. Percentage of repeated guest in hotel ? 12. Analyze average ADR for given statement a) Avg ADR for each year b) Avg ADR for resort and hotels c) Avg ADR for each country d) Avg ADR of top 10 countries who has highest number of bookings Data Analysis 7
  • 8. Data Analysis and Visualization 8
  • 9. Here we can observe that, out of 100%, City hotels are more booked (66.4%) compared to resort hotels (33.6%) 1. Which type of hotels they have booked ? 9
  • 10. 2. Which type of customers is booking hotels? Customer type Count Customer type % Transient 89613 75.059050 Transient-party 25124 21.043638 Contract 4076 3.414021 Group 577 0.483290 So the maximum customer type is transient (75%). It means that these are the customers who are living for the short period of time. 10
  • 11. 3. From which medium the maximum booking had happened ? Market segment Market segment counts Market segment percentage Online TA 56477 47.304632 Offline TA/TO 24219 20.285619 Groups 19811 16.593517 Direct 12606 10.558673 Corporate 5295 4.435045 Complementary 743 0.622330 Aviation 237 0.198509 Undefined 2 0.001675 So 47% of the customers are booking hotels online 11
  • 12. 4. Find the rate of cancellation of hotels? When compared with City and Resort hotels. City hotel cancellation is 42% and Resort hotel cancellation is 28% Canceled Counts Percentage 0 (Not canceled) 75166 62.95 1 (Canceled) 44224 37.04 So 37% of total booking got canceled 12
  • 13. 5. Find the month and year that has maximum booking? Compare it with hotel type and hotel canceled. So August has highest number of bookings. But this include both resort and city hotels 13
  • 14. Lets see resort and city differently So resort and city both hotels has highest number of bookings in August month 14
  • 15. To find the number of cancellations on monthly basis So when compared to all the months ,August month has maximum number of cancellations followed by July and may month. 15
  • 16. Now lets see which year has the highest number bookings Bookings has increased in 2016 than decreased in 2017. And max booking was in 2016 which include both resort and city hotels 16
  • 17. Now lets see resort and city hotels differently in each year So resort and city both hotels has highest number of bookings in 2016 17
  • 18. Lets see which year has maximum cancellation We could find that maximum number of booking are been cancelled in the year 2016 18
  • 19. Now lets see year wise booking of each month • As we can see in above graph, we have data from July 2015 till August 2017. • Earlier we have seen August is the month who has maximum booking but that was cumulative result of all three years. August is repeating in every year. • Here in year 2015, the maximum booking was in September. In year 2016, it is October and in year 2017, it is May. • In none of the year we have got August. • So we can't say August has maximum booking in each year, it has maximum booking but in the given period of time in our data base. • If our data was from January to December or from July 2015 till June 2017 then we can get accurate number in the outcome 19
  • 20. 6. Find the top 10 countries that had maximum bookings? Portugal has highest bookings followed by Germany and France 20
  • 21. Lets compare country wise bookings with hotel types • Portugal has max booking in resort as well as city hotel. • The 2nd highest booking is in Germany for both hotel & city hotels. • The 3rd highest booking for resort hotel is in Spain and for city hotel is in France. 21
  • 22. Now lets see which countries has highest cancellation Portugal has highest cancellation followed by Germany and Spain 22
  • 23. Now lets see the actual people stayed and enjoyed their holiday’s So after looking at the actual booking data, USA has taken place in top 10 and BRA (Brazil) got removed from top 10 23
  • 24. 7. Find the relationship between room change and cancellation? Customer who has not changed room type has cancellation more and 5.7% of people cancelled room after they had assigned different room from booking room. 24
  • 25. 8. What if you wanted to predict whether or not a hotel was likely to receive a disproportionately high number of special requests? So the probability of 0 special request is highest 25
  • 26. 9. What Kind of People Book the Most Hotels? So as it is seen max booking was done by couples 26
  • 27. Now lets see which type of people has canceled booking more So couples has canceled maximum booking followed by single then family respectively 27
  • 28. 10. For how long people stay in hotel? So most of people staying in hotel for 2 days follow by 3 days and 1 day. 28
  • 29. Now lets compare resort and city hotel stays... For city hotel most popular stay in 2,3,1 days respectively. And for resort its 1,7,2 days respectively 29
  • 30. 11. Percentage of Repeated Guests in hotels. In city hotel 53.3% people are repeated guest while in resort the percentage is 46.7%. 30
  • 31. 12. Calculate average ADR for given statements ? a) Avg ADR for each year As we can see in above graph the avg ADR is gradually increasing from 2015 to 2017. So in 2017 the avg ADR is max 31
  • 32. b) Avg ADR for hotel and resort As the ADR between city hotel and resort are the Slight change. when compare to city hotel is more 32
  • 33. c) Avg ADR for each country So the highest ADR is of Djibouti (DJI) . But the above 10 countries are not our top 10 countries who has highest booking. So lets see our top 10 countries with highest booking and their ADR 33
  • 34. d) Avg ADR of top 10 countries who has highest number of bookings So as we know Portugal has highest booking and its ADR is 92. And the highest ADR from top ten countries is of Spain i.e. 116.99 ~ 117 34
  • 35. Conclusion  In our analysis of hotel booking we have tried different methods and visualized it by graphs.  We started with asking few question to ourself and drafted the same to get the best solution.  The first and most important step was cleaning the data. And then our analysis of data started. Here is the conclusion of all the solution we got from our analysis: • People are preferring hotel booking more than the resort hotels. • The preferred booking of mode is online booking • People preferred to stay for shorter period of time. For city hotel the popular stay is 2,3,1 days respectively and for resort hotel it is 1,7,2 days respectively. • Couples books more hotel than single and family. • 37% of people are cancelling their bookings (City & Resort). Resort hotel has less cancellation rate (28%) in compare to city hotels (42%) Continues… 35
  • 36. Conclusion Now our data is from July 2015 till August 2017. But we have analyzed for year and months so it will not give us accurate results. • 2016 has highest booking in both hotel & resort • August has highest number of booking in both hotel & resort • In year 2015, the maximum booking was in September. In year 2016, it was October and in year 2017, it was May • Also, 2016 and August has highest number of cancellation • The top three countries who has highest number of bookings are Portugal, Germany and France • If we analyze hotel wise: Portugal has max booking in resort as well as city hotel. 2nd highest booking is in Germany. The 3rd highest booking for resort hotel is Spain and for city hotel is France. 36 Continues…
  • 37. Conclusion •Portugal, Germany and Spain has maximum cancellation • We analyzed relationship between room changed and cancellation: We conclude that Customer who has not changed room type has cancellation more • 5.7% of people cancelled room after they had assigned different room from booking room. • The highest probability of 0 special request is highest • The highest average ADR is for 2017 i.e. 114.63 • The average ADR for city hotel is 105.30 and for resort hotel is 94.95 • The highest average ADR is for Djibouti (DJI) country i.e. 273. • And the highest average ADR for Portugal is 92.04 37
  • 38. Summation • Q2 & Q3 are having higher number of Bookings So hotels can avail some discount offers in Q1 and Q4 as Bookings rate is less in these quarters. • Hotels can promote their business online as most of the Bookings are coming from online portals. • Special discount can be given to repeated guests. • City Hotel can come up with special plans for long staying guests. • Resort Hotel can come up with special plans for short staying guests 38
  • 39. Challenges • Most difficult part was to decide how to clean data. Should we remove the columns? Or should we replace it with mean? Or should we replace it with 0. • Second was should we remove all the canceled rows then continue with the analysis. Or we should continue with data and compare each analysis with canceled column. • At the starting point of the project, got confused in groupby method. • It was also difficult to choose between what to include or what not. 39