SlideShare a Scribd company logo
1 of 31
San Francisco Bay Area
Bike Share Analysis
Project By :
Shefali Dagar
Shashank Khede
Navtej Singh Chawla
Contents:
•Introduction
•About Dataset
•Data Analysis and Findings
•Challenges
•Conclusion of Analysis
•Future work
Introduction
• We are using the recorded data of the bike trips which were used in
Bay area of San Francisco From August 2013 to August 2015.
• The objective of this project is to determine the peak hours, days,
months in which bikes are mostly occupied by the
subscribers/customers.
• We have chosen this project as it provides insights into the factors
which affects the bike sharing on basis of weather conditions , days or
locations. We can use this analysis to predict future consumption of
these bikes.
About Data
• The data sets used belongs to the Kaggle and it is provided by
Ben Hamner.
• We have used 3 CSV data files Station, Trip, Weather for
anlysis.
• Station CSV has 84 rows with details of different stations
• Trip CSV has 669960 records and details of different trips
• Weather CSV has 3665 records and data details of weather between
Aug 2013 to Aug 2015.
Problem Statements
• What is the average duration of the bike trips.
• What is the busiest time of the week for the bike trips, the weekends
or the weekdays.
• Can we predict that what role the weather plays in bike share trip?
• Do the bike trip patterns vary with the time and day of the week?
Steps Performed During Data Analysis
• Problem 1) - What is the average duration of the bike trips.
Data Smoothing (Removing noisy and blank values).
Joining trip and station data for analysis.
Analyzing trips which are shorter than a day and hour respectively.
Histogram plot
Problem 1 - Plots
• Average duration of
bike trips is around ‘9
minutes’.
• Our data is skewed
right and mean is
larger than the
median.
• Most of trips are
less than 5 hours.
• 99.96% trips are
less than a day
trip.
• 96.83% trips are
less than an hour
Problem 2 - Plots
Steps Performed During Data Analysis
• Problem 2) - What is the busiest time of the week for the bike trips,
the weekends or the weekdays.
Creating new variables for trips in minutes.
Splitting weekday and weekend data.
Bar plot for days(weekend & weekday) vs minutes plot according to different
cities.
• People prefer
to rent the
bike more over
the weekdays
than weekend.
Problem 2 - Plots
Steps Performed During Data Analysis
• Problem 3) - Can we predict that what role the weather plays in bike
share trip?
We check weather conditions throughout the year in the bay area.
Customer and Subscriber plot according to weekday and weekend.
Plotting humidity and bike trips, wind speed and bike trips.
Rider sensitivity according to different weather conditions.
Ride duration in seconds according to temperature (in Fahrenheit).
Problem 3 - Plots
• Mean temperature through
Aug’13 to Aug’15 is around 60
Fahrenheit.
• Humidity is around 70 during
this time.
• Wind speed is approximately at
9mph during this time period.
Problem 3 - Plots
• Customers are those who
have access for 24hours.
• Subscribers have 3 days or
more pass.
• Most trips are recorded
when temperature is
around 68 Fahrenheit.
Problem 3 - Plots
• Customers and subscribers
both love to ride when
humidity is between 60 to
80.
• Less trips are recorded
when humidity is more.
Problem 3 - Plots
• Customers and subscribers
both love to ride when the
wind speed is around 9-
10mph.
• Least trips are recorded
when its windy.
Problem 3 - Plots
• Best time when riders
love to ride is when the
weather is clear or there
is ‘No fog or rain’.
• Interestingly, people love
to ride during
‘Thunderstorm’.
• People don’t like riding in
the ‘Rain’ according to
our analysis.
Steps Performed During Data Analysis
• Problem 4) - Do the bike trip patterns vary with the time and day of
the week?
Dividing date into each month, days of week.
Verifying peak time when bikes are rented the most.
Bar plot for each day vs minutes.
Histogram for peak timings for total number of rides.
Problem 4 - Plots
• After we analyzed that people
take more trips on weekdays,
we further see that ‘Tuesday’
are busiest days.
• Sundays are the least busiest
day in terms of bike renting.
Problem 4 - Plots
• July and August are of the most
busiest months where most
bikes are rented.
• December records the least
number of bikes rented.
Problem 4 - Plots
• This plot shows the busiest time
which is 9:00 AM in the
morning and 5:00 PM in the
evening.
• Least bike rental is done at
12:00 AM to 5:00 AM in the
morning.
Model Creation
• Verifying scatter plot
to check linear
relationship between
attributes.
Model Creation
• We created an initial
model where we get
26.75% accuracy to
get average bike trip
duration.
• All variables have
significant impact as
p-value is <0.05 at
95% confidence
interval.
Model Creation
• We plot the residuals vs
predicted plot and we see
there are many outliers having
value > 3.
• VIF value for each variable is <
5 and hence we don’t have
any multicollinearity problem.
Model Creation
• Plotting residuals vs
subscription type.
Model Creation
• Residuals vs Month plot.
Model Creation
• Residual vs year plot
Model Creation
• Plotting QQ Plot we check for
outliers having values > 3.
Model Creation
• By backward elimination
we get a new model
with accuracy 26.73%.
Model Creation
• Using step wise
elimination we get
26.75% accuracy.
Conclusion
• We addressed all our problem statements and below were the
outcomes:
Average duration of bike trips is around ‘9 minutes’.
People prefer to rent the bike more over the weekdays than weekend.
Best time when riders love to ride is when the weather is clear or there is ‘No
fog or rain’.
People love to ride when weather is like ‘Thunderstorm’.
People don’t like riding in the ‘Rain’ according to our analysis.
This plot shows the busiest time which is 9:00 AM in the morning and 5:00
PM in the evening.
Least bike rental is done at 12:00 AM to 5:00 AM in the morning.
THANK YOU !

More Related Content

What's hot

Automatic Machine Learning, AutoML
Automatic Machine Learning, AutoMLAutomatic Machine Learning, AutoML
Automatic Machine Learning, AutoMLHimadri Mishra
 
Lane detection by use of canny edge
Lane detection by use of canny edgeLane detection by use of canny edge
Lane detection by use of canny edgebanz23
 
Traffic Prediction for Intelligent Transportation System using Machine Learning
Traffic Prediction for Intelligent Transportation System using Machine LearningTraffic Prediction for Intelligent Transportation System using Machine Learning
Traffic Prediction for Intelligent Transportation System using Machine LearningOmSuryawanshi9
 
MachineLearning.ppt
MachineLearning.pptMachineLearning.ppt
MachineLearning.pptbutest
 
Linear regression
Linear regressionLinear regression
Linear regressionMartinHogg9
 
Park in : Business plan
Park in : Business planPark in : Business plan
Park in : Business plansurbhi mathur
 
Marketing Plan for an App ( Parking Finder!)
Marketing Plan for an App ( Parking Finder!)Marketing Plan for an App ( Parking Finder!)
Marketing Plan for an App ( Parking Finder!)Simarpreet Singh Chawla
 
Traffic Light Control
Traffic Light ControlTraffic Light Control
Traffic Light Controlhoadktd
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Md. Main Uddin Rony
 
The Internet of Cars - Towards the Future of the Connected Car
The Internet of Cars - Towards the Future of the Connected CarThe Internet of Cars - Towards the Future of the Connected Car
The Internet of Cars - Towards the Future of the Connected CarJorgen Thelin
 
Traffic simulation
Traffic simulationTraffic simulation
Traffic simulationPraful -
 
Carpooling Network Presentation
Carpooling Network PresentationCarpooling Network Presentation
Carpooling Network Presentationsineadc96
 
BLIND SPOT INFORMATION SYSTEM
BLIND SPOT INFORMATION SYSTEMBLIND SPOT INFORMATION SYSTEM
BLIND SPOT INFORMATION SYSTEMMUKESHBORAN
 

What's hot (20)

Autonomous vehicles
Autonomous vehiclesAutonomous vehicles
Autonomous vehicles
 
Connected and Autonomous Vehicles: The Enabling Technologies
Connected and Autonomous Vehicles: The Enabling TechnologiesConnected and Autonomous Vehicles: The Enabling Technologies
Connected and Autonomous Vehicles: The Enabling Technologies
 
Autonomous cars
Autonomous carsAutonomous cars
Autonomous cars
 
Automatic Machine Learning, AutoML
Automatic Machine Learning, AutoMLAutomatic Machine Learning, AutoML
Automatic Machine Learning, AutoML
 
Lane detection by use of canny edge
Lane detection by use of canny edgeLane detection by use of canny edge
Lane detection by use of canny edge
 
Traffic presentation
Traffic presentationTraffic presentation
Traffic presentation
 
Pedestrian Safety and Urban Infrastructure
Pedestrian Safety and Urban InfrastructurePedestrian Safety and Urban Infrastructure
Pedestrian Safety and Urban Infrastructure
 
Traffic Prediction for Intelligent Transportation System using Machine Learning
Traffic Prediction for Intelligent Transportation System using Machine LearningTraffic Prediction for Intelligent Transportation System using Machine Learning
Traffic Prediction for Intelligent Transportation System using Machine Learning
 
Autonomous car
Autonomous carAutonomous car
Autonomous car
 
MachineLearning.ppt
MachineLearning.pptMachineLearning.ppt
MachineLearning.ppt
 
Linear regression
Linear regressionLinear regression
Linear regression
 
PRESENTATION
PRESENTATIONPRESENTATION
PRESENTATION
 
Park in : Business plan
Park in : Business planPark in : Business plan
Park in : Business plan
 
Marketing Plan for an App ( Parking Finder!)
Marketing Plan for an App ( Parking Finder!)Marketing Plan for an App ( Parking Finder!)
Marketing Plan for an App ( Parking Finder!)
 
Traffic Light Control
Traffic Light ControlTraffic Light Control
Traffic Light Control
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
The Internet of Cars - Towards the Future of the Connected Car
The Internet of Cars - Towards the Future of the Connected CarThe Internet of Cars - Towards the Future of the Connected Car
The Internet of Cars - Towards the Future of the Connected Car
 
Traffic simulation
Traffic simulationTraffic simulation
Traffic simulation
 
Carpooling Network Presentation
Carpooling Network PresentationCarpooling Network Presentation
Carpooling Network Presentation
 
BLIND SPOT INFORMATION SYSTEM
BLIND SPOT INFORMATION SYSTEMBLIND SPOT INFORMATION SYSTEM
BLIND SPOT INFORMATION SYSTEM
 

Similar to Bike sharing analysis san francisco

How Does a Bike-Share Navigate Speedy Success?
How Does a Bike-Share Navigate Speedy Success? How Does a Bike-Share Navigate Speedy Success?
How Does a Bike-Share Navigate Speedy Success? Ahmed Abdelfatah
 
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithms
Predictive Analysis of Bike Sharing System Using Machine Learning AlgorithmsPredictive Analysis of Bike Sharing System Using Machine Learning Algorithms
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithmssushantparte
 
Cyclistic.Google project.pptx
Cyclistic.Google project.pptxCyclistic.Google project.pptx
Cyclistic.Google project.pptxHanyaAlwardany
 
IBM Watson Analytics - Sales Prediction
IBM Watson Analytics - Sales Prediction IBM Watson Analytics - Sales Prediction
IBM Watson Analytics - Sales Prediction Pragyan Sharma
 
Slide share countingbikes&peds6
Slide share countingbikes&peds6Slide share countingbikes&peds6
Slide share countingbikes&peds6sogoss
 
ATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh RollATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh RollBTAOregon
 
6101-Project Report
6101-Project Report6101-Project Report
6101-Project ReportLove Tyagi
 
Data Science: Can weather predict Bikeshare usage?
Data Science: Can weather predict Bikeshare usage?Data Science: Can weather predict Bikeshare usage?
Data Science: Can weather predict Bikeshare usage?Jackie Zajac
 
Dynamic demand
Dynamic demandDynamic demand
Dynamic demandJumpingJaq
 
Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454nazifa tabassum
 
April Live Pavement Management Webinar 1 by IPMA Academy
April Live Pavement Management Webinar 1 by IPMA AcademyApril Live Pavement Management Webinar 1 by IPMA Academy
April Live Pavement Management Webinar 1 by IPMA Academyblairbarnhardt
 
Dynamic Fleet Sizing Problem for an E-Scooter Valet Service
Dynamic Fleet Sizing Problem for an E-Scooter Valet ServiceDynamic Fleet Sizing Problem for an E-Scooter Valet Service
Dynamic Fleet Sizing Problem for an E-Scooter Valet ServiceJoseph Chow
 
Sustrans Scotland Raising the Standards Day 2017: Monitoring and Evaluation
Sustrans Scotland Raising the Standards Day 2017: Monitoring and EvaluationSustrans Scotland Raising the Standards Day 2017: Monitoring and Evaluation
Sustrans Scotland Raising the Standards Day 2017: Monitoring and EvaluationSustrans
 
Fowler distributing company
Fowler distributing companyFowler distributing company
Fowler distributing companyShaheen Sardar
 
Cyclistic bike share case study
Cyclistic bike share case studyCyclistic bike share case study
Cyclistic bike share case studybentley ho
 
Analysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future UseAnalysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future UseKimberly Nguyen
 
Time Series Forecasting Using TBATS Model.pptx
Time Series Forecasting Using TBATS Model.pptxTime Series Forecasting Using TBATS Model.pptx
Time Series Forecasting Using TBATS Model.pptxGowthamKumar470818
 
ATS-16: Making Data Count, Krista Nordback
ATS-16: Making Data Count, Krista NordbackATS-16: Making Data Count, Krista Nordback
ATS-16: Making Data Count, Krista NordbackBTAOregon
 
poster_B-Cycle
poster_B-Cycleposter_B-Cycle
poster_B-CycleBrett Keim
 

Similar to Bike sharing analysis san francisco (20)

How Does a Bike-Share Navigate Speedy Success?
How Does a Bike-Share Navigate Speedy Success? How Does a Bike-Share Navigate Speedy Success?
How Does a Bike-Share Navigate Speedy Success?
 
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithms
Predictive Analysis of Bike Sharing System Using Machine Learning AlgorithmsPredictive Analysis of Bike Sharing System Using Machine Learning Algorithms
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithms
 
Cyclistic.Google project.pptx
Cyclistic.Google project.pptxCyclistic.Google project.pptx
Cyclistic.Google project.pptx
 
IBM Watson Analytics - Sales Prediction
IBM Watson Analytics - Sales Prediction IBM Watson Analytics - Sales Prediction
IBM Watson Analytics - Sales Prediction
 
Slide share countingbikes&peds6
Slide share countingbikes&peds6Slide share countingbikes&peds6
Slide share countingbikes&peds6
 
ATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh RollATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh Roll
 
6101-Project Report
6101-Project Report6101-Project Report
6101-Project Report
 
Data Science: Can weather predict Bikeshare usage?
Data Science: Can weather predict Bikeshare usage?Data Science: Can weather predict Bikeshare usage?
Data Science: Can weather predict Bikeshare usage?
 
Dynamic demand
Dynamic demandDynamic demand
Dynamic demand
 
Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454
 
April Live Pavement Management Webinar 1 by IPMA Academy
April Live Pavement Management Webinar 1 by IPMA AcademyApril Live Pavement Management Webinar 1 by IPMA Academy
April Live Pavement Management Webinar 1 by IPMA Academy
 
Dynamic Fleet Sizing Problem for an E-Scooter Valet Service
Dynamic Fleet Sizing Problem for an E-Scooter Valet ServiceDynamic Fleet Sizing Problem for an E-Scooter Valet Service
Dynamic Fleet Sizing Problem for an E-Scooter Valet Service
 
Sustrans Scotland Raising the Standards Day 2017: Monitoring and Evaluation
Sustrans Scotland Raising the Standards Day 2017: Monitoring and EvaluationSustrans Scotland Raising the Standards Day 2017: Monitoring and Evaluation
Sustrans Scotland Raising the Standards Day 2017: Monitoring and Evaluation
 
Online bicycle renting
Online bicycle rentingOnline bicycle renting
Online bicycle renting
 
Fowler distributing company
Fowler distributing companyFowler distributing company
Fowler distributing company
 
Cyclistic bike share case study
Cyclistic bike share case studyCyclistic bike share case study
Cyclistic bike share case study
 
Analysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future UseAnalysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future Use
 
Time Series Forecasting Using TBATS Model.pptx
Time Series Forecasting Using TBATS Model.pptxTime Series Forecasting Using TBATS Model.pptx
Time Series Forecasting Using TBATS Model.pptx
 
ATS-16: Making Data Count, Krista Nordback
ATS-16: Making Data Count, Krista NordbackATS-16: Making Data Count, Krista Nordback
ATS-16: Making Data Count, Krista Nordback
 
poster_B-Cycle
poster_B-Cycleposter_B-Cycle
poster_B-Cycle
 

Recently uploaded

如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证acoha1
 
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethDigital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethSamantha Rae Coolbeth
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...yulianti213969
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunksgmuir1066
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjadimosmejiaslendon
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证pwgnohujw
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.pptRachmaGhifari
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...ssuserf63bd7
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfRobertoOcampo24
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"John Sobanski
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancingmohamed Elzalabany
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeBoston Institute of Analytics
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024patrickdtherriault
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样jk0tkvfv
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...Amil baba
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchersdarmandersingh4580
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证acoha1
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Klinik Aborsi
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...ThinkInnovation
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...BabaJohn3
 

Recently uploaded (20)

如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethDigital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchers
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
 

Bike sharing analysis san francisco

  • 1. San Francisco Bay Area Bike Share Analysis Project By : Shefali Dagar Shashank Khede Navtej Singh Chawla
  • 2. Contents: •Introduction •About Dataset •Data Analysis and Findings •Challenges •Conclusion of Analysis •Future work
  • 3. Introduction • We are using the recorded data of the bike trips which were used in Bay area of San Francisco From August 2013 to August 2015. • The objective of this project is to determine the peak hours, days, months in which bikes are mostly occupied by the subscribers/customers. • We have chosen this project as it provides insights into the factors which affects the bike sharing on basis of weather conditions , days or locations. We can use this analysis to predict future consumption of these bikes.
  • 4. About Data • The data sets used belongs to the Kaggle and it is provided by Ben Hamner. • We have used 3 CSV data files Station, Trip, Weather for anlysis. • Station CSV has 84 rows with details of different stations • Trip CSV has 669960 records and details of different trips • Weather CSV has 3665 records and data details of weather between Aug 2013 to Aug 2015.
  • 5. Problem Statements • What is the average duration of the bike trips. • What is the busiest time of the week for the bike trips, the weekends or the weekdays. • Can we predict that what role the weather plays in bike share trip? • Do the bike trip patterns vary with the time and day of the week?
  • 6. Steps Performed During Data Analysis • Problem 1) - What is the average duration of the bike trips. Data Smoothing (Removing noisy and blank values). Joining trip and station data for analysis. Analyzing trips which are shorter than a day and hour respectively. Histogram plot
  • 7. Problem 1 - Plots • Average duration of bike trips is around ‘9 minutes’. • Our data is skewed right and mean is larger than the median.
  • 8. • Most of trips are less than 5 hours. • 99.96% trips are less than a day trip. • 96.83% trips are less than an hour Problem 2 - Plots
  • 9. Steps Performed During Data Analysis • Problem 2) - What is the busiest time of the week for the bike trips, the weekends or the weekdays. Creating new variables for trips in minutes. Splitting weekday and weekend data. Bar plot for days(weekend & weekday) vs minutes plot according to different cities.
  • 10. • People prefer to rent the bike more over the weekdays than weekend. Problem 2 - Plots
  • 11. Steps Performed During Data Analysis • Problem 3) - Can we predict that what role the weather plays in bike share trip? We check weather conditions throughout the year in the bay area. Customer and Subscriber plot according to weekday and weekend. Plotting humidity and bike trips, wind speed and bike trips. Rider sensitivity according to different weather conditions. Ride duration in seconds according to temperature (in Fahrenheit).
  • 12. Problem 3 - Plots • Mean temperature through Aug’13 to Aug’15 is around 60 Fahrenheit. • Humidity is around 70 during this time. • Wind speed is approximately at 9mph during this time period.
  • 13. Problem 3 - Plots • Customers are those who have access for 24hours. • Subscribers have 3 days or more pass. • Most trips are recorded when temperature is around 68 Fahrenheit.
  • 14. Problem 3 - Plots • Customers and subscribers both love to ride when humidity is between 60 to 80. • Less trips are recorded when humidity is more.
  • 15. Problem 3 - Plots • Customers and subscribers both love to ride when the wind speed is around 9- 10mph. • Least trips are recorded when its windy.
  • 16. Problem 3 - Plots • Best time when riders love to ride is when the weather is clear or there is ‘No fog or rain’. • Interestingly, people love to ride during ‘Thunderstorm’. • People don’t like riding in the ‘Rain’ according to our analysis.
  • 17. Steps Performed During Data Analysis • Problem 4) - Do the bike trip patterns vary with the time and day of the week? Dividing date into each month, days of week. Verifying peak time when bikes are rented the most. Bar plot for each day vs minutes. Histogram for peak timings for total number of rides.
  • 18. Problem 4 - Plots • After we analyzed that people take more trips on weekdays, we further see that ‘Tuesday’ are busiest days. • Sundays are the least busiest day in terms of bike renting.
  • 19. Problem 4 - Plots • July and August are of the most busiest months where most bikes are rented. • December records the least number of bikes rented.
  • 20. Problem 4 - Plots • This plot shows the busiest time which is 9:00 AM in the morning and 5:00 PM in the evening. • Least bike rental is done at 12:00 AM to 5:00 AM in the morning.
  • 21. Model Creation • Verifying scatter plot to check linear relationship between attributes.
  • 22. Model Creation • We created an initial model where we get 26.75% accuracy to get average bike trip duration. • All variables have significant impact as p-value is <0.05 at 95% confidence interval.
  • 23. Model Creation • We plot the residuals vs predicted plot and we see there are many outliers having value > 3. • VIF value for each variable is < 5 and hence we don’t have any multicollinearity problem.
  • 24. Model Creation • Plotting residuals vs subscription type.
  • 27. Model Creation • Plotting QQ Plot we check for outliers having values > 3.
  • 28. Model Creation • By backward elimination we get a new model with accuracy 26.73%.
  • 29. Model Creation • Using step wise elimination we get 26.75% accuracy.
  • 30. Conclusion • We addressed all our problem statements and below were the outcomes: Average duration of bike trips is around ‘9 minutes’. People prefer to rent the bike more over the weekdays than weekend. Best time when riders love to ride is when the weather is clear or there is ‘No fog or rain’. People love to ride when weather is like ‘Thunderstorm’. People don’t like riding in the ‘Rain’ according to our analysis. This plot shows the busiest time which is 9:00 AM in the morning and 5:00 PM in the evening. Least bike rental is done at 12:00 AM to 5:00 AM in the morning.