SlideShare a Scribd company logo
Mother Nature’s Impact 
on Bike Ridership 
Jackie Zajac 
Kays Fattal 
Naumaan Nasir
Does weather have a relationship 
with bike ridership? 
Can we predict bike usage based 
on weather?
INTRODUCTION 
• Our team 
• Research questions 
• Picking datasets 
• Our audience
METHODOLOGY 
• Why linear regression? 
• How we manipulated the data 
• MySQL engine aggregated 
3M table into sum of rental 
counts and duration 
• Mashed up with 731 rows of 
weather data (2011, 2012) 
• Added a Year field 
• Tools: Excel, MySQL database, 
R (Rattle)
METHODOLOGY 
• Picking our best configuration 
• Categoric vs. numeric variables 
• Must decide how to measure bike usage 
• Must pick best variables 
• Error analysis
PHASE I 
• Began with a broad study of six regressions 
• Two target variables (rental counts, duration) 
• Three temperature measures 
• Minimum, Average, Maximum 
• Chunked the day into three time ranges to reflect 
temperature during bike rides 
• Evaluated multiple weather variables’ affect on 
regressions 
• Ignored Date field
Plots
PHASE II 
• Combining the data sets 
• Picking best variables: 
• Bike rental counts as sole target variable 
• Maximum temperature 
• Utilized date/year field 
• Switched Snow to categoric variable 
• Analyzed and refined our regression 
• Higher accuracy – R-squared = .8374 or 83.74%
MSE and R-squared 
• A measure of accuracy in one dataset 
predicting another 
• Relationship between R-squared and MSE
X X 
X
FINAL MODEL 
Weight Variable 
-4004.501 Intercept 
62.118 Maximum Temperature 
-132.741 Average Wind 
93.162 Precipitation 
416.818 Visibility 
2063.069 Year 
-161.038 Snow [0.0-1.2] inches 
-4.945 Snow [1.2-2.0] inches 
-588.349 Snow [2.0-3.1] inches 
-5.390 Snow [3.1-3.9] inches 
Y=
LESSONS LEARNED 
• Too many independent variables to incorporate 
crime dataset in addition to weather dataset 
• Means Squared Error (MSE), R-squared 
• Only two years’ worth of data was available due to 
Bikeshare’s short history (2011, 2012) 
• Final model would be even more accurate with 
additional historical data
CONCLUSION 
• Our hypotheses proved true: weather does affect 
bike ridership 
• Why is Maximum Temperature better? 
• Why does the Year improve accuracy? 
• The categorical range of snow inches
QUESTIONS? 
Thanks!

More Related Content

What's hot

Afcom2010
Afcom2010Afcom2010
Afcom2010
stacygriggs
 
Approaches for a sustainable industrial heating
Approaches for a sustainable industrial heating Approaches for a sustainable industrial heating
Approaches for a sustainable industrial heating
Aspiration Energy Pvt Ltd
 
Submission#19614_Final
Submission#19614_FinalSubmission#19614_Final
Submission#19614_Final
Amir Rezaei-Bazkiaei
 
การนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทย
การนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทยการนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทย
การนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทย
AJ. Tor วิศวกรรมแหล่งนํา้
 
Illinois ASHRAE Research Promotion
Illinois ASHRAE Research PromotionIllinois ASHRAE Research Promotion
Illinois ASHRAE Research Promotion
Illinois ASHRAE
 
Class 27 pd, pid electronic controllers
Class 27   pd, pid electronic controllersClass 27   pd, pid electronic controllers
Class 27 pd, pid electronic controllers
Manipal Institute of Technology
 
42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...
42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...
42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...
Vahid Rasouli
 
Big Data: WEATHER DATA ANALYSIS
Big Data: WEATHER DATA ANALYSISBig Data: WEATHER DATA ANALYSIS
Big Data: WEATHER DATA ANALYSIS
Hamid Shekarforoush
 

What's hot (8)

Afcom2010
Afcom2010Afcom2010
Afcom2010
 
Approaches for a sustainable industrial heating
Approaches for a sustainable industrial heating Approaches for a sustainable industrial heating
Approaches for a sustainable industrial heating
 
Submission#19614_Final
Submission#19614_FinalSubmission#19614_Final
Submission#19614_Final
 
การนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทย
การนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทยการนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทย
การนำเสนอบทความวิชาการระดับนานาชาติ Version ภาษาไทย
 
Illinois ASHRAE Research Promotion
Illinois ASHRAE Research PromotionIllinois ASHRAE Research Promotion
Illinois ASHRAE Research Promotion
 
Class 27 pd, pid electronic controllers
Class 27   pd, pid electronic controllersClass 27   pd, pid electronic controllers
Class 27 pd, pid electronic controllers
 
42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...
42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...
42 Wind Energy Potential Assessment In Order to Produce Electrical Energy for...
 
Big Data: WEATHER DATA ANALYSIS
Big Data: WEATHER DATA ANALYSISBig Data: WEATHER DATA ANALYSIS
Big Data: WEATHER DATA ANALYSIS
 

Similar to Data Science: Can weather predict Bikeshare usage?

Bike sharing analysis san francisco
Bike sharing analysis san franciscoBike sharing analysis san francisco
Bike sharing analysis san francisco
Navtej Singh Chawla
 
Predicting the Wind: Wind farm prospecting using GIS
Predicting the Wind: Wind farm prospecting using GISPredicting the Wind: Wind farm prospecting using GIS
Predicting the Wind: Wind farm prospecting using GIS
Kenex Ltd
 
Predicting the Wind - wind farm prospecting with GIS
Predicting the Wind - wind farm prospecting with GISPredicting the Wind - wind farm prospecting with GIS
Predicting the Wind - wind farm prospecting with GIS
Kenex Ltd
 
Analysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future UseAnalysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future Use
Kimberly Nguyen
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
The Statistical and Applied Mathematical Sciences Institute
 
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
Sergio A. Guerra
 
ATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh RollATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh Roll
BTAOregon
 
Energy audit & conservation studies for commercial premises
Energy audit & conservation studies for commercial premisesEnergy audit & conservation studies for commercial premises
Energy audit & conservation studies for commercial premises
ravindradatar
 
From Data to insight: Emerging Opportunities in Africa for 2018
From Data to insight: Emerging Opportunities in Africa for 2018From Data to insight: Emerging Opportunities in Africa for 2018
From Data to insight: Emerging Opportunities in Africa for 2018
mdn_dan
 
MIDIH Paufex-IOTandCI experiment
MIDIH Paufex-IOTandCI experimentMIDIH Paufex-IOTandCI experiment
MIDIH Paufex-IOTandCI experiment
MIDIH_EU
 
Portfolio MS-MBA
Portfolio MS-MBAPortfolio MS-MBA
Portfolio MS-MBA
RAHUL SINGH
 
03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final
03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final
03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final
Sandia National Laboratories: Energy & Climate: Renewables
 
Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...
Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...
Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...
EMEX
 
Project template for presenting it before the panel
Project template for presenting it before the panelProject template for presenting it before the panel
Project template for presenting it before the panel
AkshatMehrotra14
 
An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...
An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...
An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...
Wassim Derguech
 
JML_WeatherResume
JML_WeatherResumeJML_WeatherResume
JML_WeatherResume
John L'Heureux
 
FinalPresentation-GradProject
FinalPresentation-GradProjectFinalPresentation-GradProject
FinalPresentation-GradProject
Manabu Mukohyoshi
 
Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454
nazifa tabassum
 
2015 x472 class 10 - lcca
2015 x472 class 10 - lcca2015 x472 class 10 - lcca
2015 x472 class 10 - lcca
michaeljmack
 
13 helioscope pvpmc 2017v4
13 helioscope pvpmc 2017v413 helioscope pvpmc 2017v4

Similar to Data Science: Can weather predict Bikeshare usage? (20)

Bike sharing analysis san francisco
Bike sharing analysis san franciscoBike sharing analysis san francisco
Bike sharing analysis san francisco
 
Predicting the Wind: Wind farm prospecting using GIS
Predicting the Wind: Wind farm prospecting using GISPredicting the Wind: Wind farm prospecting using GIS
Predicting the Wind: Wind farm prospecting using GIS
 
Predicting the Wind - wind farm prospecting with GIS
Predicting the Wind - wind farm prospecting with GISPredicting the Wind - wind farm prospecting with GIS
Predicting the Wind - wind farm prospecting with GIS
 
Analysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future UseAnalysis on Bike Rental Data to Predict Future Use
Analysis on Bike Rental Data to Predict Future Use
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
 
ATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh RollATS-16: Making Data Count, Josh Roll
ATS-16: Making Data Count, Josh Roll
 
Energy audit & conservation studies for commercial premises
Energy audit & conservation studies for commercial premisesEnergy audit & conservation studies for commercial premises
Energy audit & conservation studies for commercial premises
 
From Data to insight: Emerging Opportunities in Africa for 2018
From Data to insight: Emerging Opportunities in Africa for 2018From Data to insight: Emerging Opportunities in Africa for 2018
From Data to insight: Emerging Opportunities in Africa for 2018
 
MIDIH Paufex-IOTandCI experiment
MIDIH Paufex-IOTandCI experimentMIDIH Paufex-IOTandCI experiment
MIDIH Paufex-IOTandCI experiment
 
Portfolio MS-MBA
Portfolio MS-MBAPortfolio MS-MBA
Portfolio MS-MBA
 
03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final
03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final
03 solargis uncertainty_albuquerque_pvp_mws_2017-05_final
 
Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...
Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...
Case Study: Operational Energy Reduction through Data Analysis & Virtual Benc...
 
Project template for presenting it before the panel
Project template for presenting it before the panelProject template for presenting it before the panel
Project template for presenting it before the panel
 
An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...
An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...
An Autonomic Approach to Real-Time Predictive Analytics using Open Data and ...
 
JML_WeatherResume
JML_WeatherResumeJML_WeatherResume
JML_WeatherResume
 
FinalPresentation-GradProject
FinalPresentation-GradProjectFinalPresentation-GradProject
FinalPresentation-GradProject
 
Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454Presentation on Spot Speed Study Analysis for the course CE 454
Presentation on Spot Speed Study Analysis for the course CE 454
 
2015 x472 class 10 - lcca
2015 x472 class 10 - lcca2015 x472 class 10 - lcca
2015 x472 class 10 - lcca
 
13 helioscope pvpmc 2017v4
13 helioscope pvpmc 2017v413 helioscope pvpmc 2017v4
13 helioscope pvpmc 2017v4
 

Recently uploaded

Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 

Recently uploaded (20)

Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 

Data Science: Can weather predict Bikeshare usage?

  • 1. Mother Nature’s Impact on Bike Ridership Jackie Zajac Kays Fattal Naumaan Nasir
  • 2. Does weather have a relationship with bike ridership? Can we predict bike usage based on weather?
  • 3. INTRODUCTION • Our team • Research questions • Picking datasets • Our audience
  • 4. METHODOLOGY • Why linear regression? • How we manipulated the data • MySQL engine aggregated 3M table into sum of rental counts and duration • Mashed up with 731 rows of weather data (2011, 2012) • Added a Year field • Tools: Excel, MySQL database, R (Rattle)
  • 5. METHODOLOGY • Picking our best configuration • Categoric vs. numeric variables • Must decide how to measure bike usage • Must pick best variables • Error analysis
  • 6. PHASE I • Began with a broad study of six regressions • Two target variables (rental counts, duration) • Three temperature measures • Minimum, Average, Maximum • Chunked the day into three time ranges to reflect temperature during bike rides • Evaluated multiple weather variables’ affect on regressions • Ignored Date field
  • 8. PHASE II • Combining the data sets • Picking best variables: • Bike rental counts as sole target variable • Maximum temperature • Utilized date/year field • Switched Snow to categoric variable • Analyzed and refined our regression • Higher accuracy – R-squared = .8374 or 83.74%
  • 9. MSE and R-squared • A measure of accuracy in one dataset predicting another • Relationship between R-squared and MSE
  • 10. X X X
  • 11. FINAL MODEL Weight Variable -4004.501 Intercept 62.118 Maximum Temperature -132.741 Average Wind 93.162 Precipitation 416.818 Visibility 2063.069 Year -161.038 Snow [0.0-1.2] inches -4.945 Snow [1.2-2.0] inches -588.349 Snow [2.0-3.1] inches -5.390 Snow [3.1-3.9] inches Y=
  • 12. LESSONS LEARNED • Too many independent variables to incorporate crime dataset in addition to weather dataset • Means Squared Error (MSE), R-squared • Only two years’ worth of data was available due to Bikeshare’s short history (2011, 2012) • Final model would be even more accurate with additional historical data
  • 13. CONCLUSION • Our hypotheses proved true: weather does affect bike ridership • Why is Maximum Temperature better? • Why does the Year improve accuracy? • The categorical range of snow inches

Editor's Notes

  1. Does weather affect Bikeshare, and how? Can we predict it? To what limit can we be accurate? Found the datasets on capital bikeshare and on farmer’s almanac Who can use this study? Discuss what this could do for Bikeshare as a company
  2. Linear regression was best suited. We were doing a comparison rather than classification. It was not a true/false research question. We used charts in Excel to study the difference between predicted values and actual values.
  3. Linear regression was best suited. We were doing a comparison rather than classification. It was not a true/false research question. Min, avg, max temperature – best variables? Error analysis – used both MSE and R-squared. Kays will discuss in further detail later.
  4. TOP LEFT: Minimum temperature TOP RIGHT: Average temperature, date is numeric LOWER LEFT: Maximum temperature, date is numeric LOWER RIGHT: Best combination: Maximum temperature, Year variable – numeric with two years only