I just finished the Coursera Data Analytics Certificate course and decided to do the optional capstone data analysis project. I had the option of choosing my own project or working with Chicago’s real bikeshare data set compiled by Divvy by Lyft. After looking into the data and the program, I saw that Lyft does a bikeshare program in my home city of San Francisco called Bay Wheels so I decided to do the bikeshare project but work with the Bay Wheels data instead.
In this project, I am the junior data analyst on a marketing analytics team for a fictional company called Cyclistic. My role is to ask, prepare, process, analyze, share, and act. While the scenario is fictional, the data and findings are real.
The director of marketing believes the company’s future success depends on maximizing the number of annual memberships and they want to know how casual riders and members use the bikes differently. My report will be shared with my analytics team, the director and the executive team.
The director has set a clear goal: Design marketing strategies aimed at converting casual riders into annual members. In order to do that, however, the marketing analyst team needs to better understand how annual members and casual riders differ, why casual riders would buy a membership, and how digital media could affect their marketing tactics. Moreno and her team are interested in analyzing the Cyclistic historical bike trip data to identify trends
My assignment is to answer: how annual members and casual riders differ.
I am responsible for producing a report with the following deliverables:
1. A clear statement of the business task
2. A description of all data sources used
3. Documentation of any cleaning or manipulation of data
4. A summary of my analysis
5. Supporting visualizations and key findings
6. Top three recommendations based on my analysis
Cyclistic bike-share analysis case study. A bike-share program that features more than 5,800 bicycles and 600 docking stations. Cyclistic sets itself apart by also offering reclining bikes, hand tricycles, and cargo bikes, making bike-share more inclusive to people with disabilities and riders who can’t use a standard two-wheeled bike. The majority of riders opt for traditional bikes; about 8% of riders use the assistive options. Cyclistic users are more likely to ride for leisure, but about 30% use them to commute to work each day. The director of marketing believes the company’s future success depends on maximizing the number of annual memberships. Therefore, we need to understand how casual riders and annual members use Cyclistic bikes differently. From these insights, the marketing team will design a new marketing strategy to convert casual riders into annual members.
A junior data analyst working in the marketing analytics team at Cyclistic -a bike-share company in Chicago- The marketing team wants to design a new marketing strategy to convert casual riders into annual members. • Data has been cleaned, organized and visualized using R. And recommendations were given based on the key findings.
Divvy Bike Use Data Analysis and RecommendationsHanbit Choi
Divvy bike sharing system use analysis with recommendations
This was for my final project in Data Analytics course at General Assembly.
• Approach: created SQL databases after cleaning raw data, used Tableau for analysis with SQL server and visualization including custom mapping
• Findings: recommendations for new station location and sales strategy after analyzing by peak season, user type, day of the week, community areas, and popular stations
"Exciting news! I've just completed my Google Data Analytics Capstone project, which focused on a Cyclistic Bike Share case study. I used R code for data wrangling and Tableau for data visualization, and am excited to share the results of my analysis.
Bringing them online: Using design research to identify online opportunitiesPatrick Kennedy
Presented at Oz-IA 2009, this presentation discusses the use of user research to inform the design of SuperRacing, a cross website horse racing content vertical.
Cyclistic bike-share analysis case study. A bike-share program that features more than 5,800 bicycles and 600 docking stations. Cyclistic sets itself apart by also offering reclining bikes, hand tricycles, and cargo bikes, making bike-share more inclusive to people with disabilities and riders who can’t use a standard two-wheeled bike. The majority of riders opt for traditional bikes; about 8% of riders use the assistive options. Cyclistic users are more likely to ride for leisure, but about 30% use them to commute to work each day. The director of marketing believes the company’s future success depends on maximizing the number of annual memberships. Therefore, we need to understand how casual riders and annual members use Cyclistic bikes differently. From these insights, the marketing team will design a new marketing strategy to convert casual riders into annual members.
A junior data analyst working in the marketing analytics team at Cyclistic -a bike-share company in Chicago- The marketing team wants to design a new marketing strategy to convert casual riders into annual members. • Data has been cleaned, organized and visualized using R. And recommendations were given based on the key findings.
Divvy Bike Use Data Analysis and RecommendationsHanbit Choi
Divvy bike sharing system use analysis with recommendations
This was for my final project in Data Analytics course at General Assembly.
• Approach: created SQL databases after cleaning raw data, used Tableau for analysis with SQL server and visualization including custom mapping
• Findings: recommendations for new station location and sales strategy after analyzing by peak season, user type, day of the week, community areas, and popular stations
"Exciting news! I've just completed my Google Data Analytics Capstone project, which focused on a Cyclistic Bike Share case study. I used R code for data wrangling and Tableau for data visualization, and am excited to share the results of my analysis.
Bringing them online: Using design research to identify online opportunitiesPatrick Kennedy
Presented at Oz-IA 2009, this presentation discusses the use of user research to inform the design of SuperRacing, a cross website horse racing content vertical.
descriptionWe used the recorded data of the bike trips which were used in Bay area of San Francisco From August 2013 to August 2015.
The objective of this project was to determine the peak hours, days, months in which bikes are mostly occupied by the subscribers/customers.
We had chosen this project as it provides insights into the factors which affects the bike sharing on basis of weather conditions , days or locations. We can use this analysis to predict future usage of these bikes.
See
Bike share stations are generally spaced in a dense grid pattern to create convenient origins and destinations for riders. Bike share is oriented to short-term, point-to-point use: most US operators record the average ride at 15 to 20 minutes and between one to three miles long. The bicycle can be returned to any number of self-serve bike sharing stations, including the original check out location. Generally, the bicycles are one style and easy to operate with simple components and adjustable seats. The rental transaction is fully automated and there is no need for on-site staff.
Capston Project: A case study to convert temporary customers into permanent c...Curtin University
I have completed this project to complete the final step of the google analytics program which is a capston project. This case study is about a fictional company called Cyclistic which has a potential to increase profitability by converting casual users to members. As a data analyst, my role was to dive more into the behaviors of the casual riders to understand if there are any potential opportunities for the company. The dataset used in this case study are licensed by Motivate International Inc (https://ride.divvybikes.com/data-license-agreement).
More than Just Lines on a Map: Best Practices for U.S Bike Routes
This session highlights best practices and lessons learned for U.S. Bike Route System designation, as well as how and why these routes should be integrated into bicycle planning at the local and regional level.
Presenters:
Presenter: Kevin Luecke Toole Design Group
Co-Presenter: Virginia Sullivan Adventure Cycling Association
Information from this report will be used to inform the final design package which we will present to community members, including residents, traders, service providers and visitors to the area for further comment.
Session 63: How the Nashville Area MPO Bike/Ped Study Changed Funding Decisio...Sharon Roerty
How does a region of 22 municipalities, 3,300 miles of major roadways, and 1.3 million people covering 2,900 square miles determine where to invest in sidewalks, bikeways, and greenways? This session will focus on key successes from middle Tennessee’s first regional bicycle and pedestrian study including a public involvement process that engaged nearly 2,100 participants and the creation of a unique formula-based non-motorized project evaluation process impacting MPO funding.
Sustrans Scotland Raising the Standards Day 2017: Monitoring and EvaluationSustrans
Our research and monitoring unit specialists explain how they can help you get the data to answer the questions of what you should invest in to achieve active mobility, by understanding the impact of infrastructure and behaviour change programmes.
2014 Walk/Ride Day Challenge Sample Impact ReportTilly Pick
Walk/Ride Days celebrate sustainability and health. One day each month, Walk/Ride Days amplify green, healthy commuting choices made by many to the broader community, inviting more people to actively reconsider how they get to work. In 2014, Walk/Ride Days grew nearly 2x compared to 2013. In over 15,000 commutes recorded in 2014, 12% showed a shift to greener commutes; 8% were healthier. Participation in Walk/Ride Day and the Walk/Ride Day Corporate Challenge is possible from anywhere in the world.
Council is preparing draft concept plans to improve the pedestrian amenity along Greville and King Streets, Prahran. Those who live, work and visit the area participated either by survey or attendance at an information session. We have developed a couple of draft concept plans to give you an idea about how we’re planning to improve Greville and King Streets. Information from this report will be used to inform the final design package which we will present to community members, including residents, traders, service providers and visitors to the area for further comment.
While at Wyncode, I worked with a team to develop two projects, most notable was Escher, the Airbnb of office spaces. It allowed people to list there personal space and office amenities so that professionals could use their space to get work done or use their equipment such as printers.
This is a brief case study showcasing my work as a software product manager at Kravin Kitchen, a culinary business management platform uniquely designed to help food trucks, meal prep businesses, and virtual restaurants streamline operations and increase sales.
In this instance, I evaluated the customers and chef's list of orders and I redesigned the UI in Figma for easy scannability.
descriptionWe used the recorded data of the bike trips which were used in Bay area of San Francisco From August 2013 to August 2015.
The objective of this project was to determine the peak hours, days, months in which bikes are mostly occupied by the subscribers/customers.
We had chosen this project as it provides insights into the factors which affects the bike sharing on basis of weather conditions , days or locations. We can use this analysis to predict future usage of these bikes.
See
Bike share stations are generally spaced in a dense grid pattern to create convenient origins and destinations for riders. Bike share is oriented to short-term, point-to-point use: most US operators record the average ride at 15 to 20 minutes and between one to three miles long. The bicycle can be returned to any number of self-serve bike sharing stations, including the original check out location. Generally, the bicycles are one style and easy to operate with simple components and adjustable seats. The rental transaction is fully automated and there is no need for on-site staff.
Capston Project: A case study to convert temporary customers into permanent c...Curtin University
I have completed this project to complete the final step of the google analytics program which is a capston project. This case study is about a fictional company called Cyclistic which has a potential to increase profitability by converting casual users to members. As a data analyst, my role was to dive more into the behaviors of the casual riders to understand if there are any potential opportunities for the company. The dataset used in this case study are licensed by Motivate International Inc (https://ride.divvybikes.com/data-license-agreement).
More than Just Lines on a Map: Best Practices for U.S Bike Routes
This session highlights best practices and lessons learned for U.S. Bike Route System designation, as well as how and why these routes should be integrated into bicycle planning at the local and regional level.
Presenters:
Presenter: Kevin Luecke Toole Design Group
Co-Presenter: Virginia Sullivan Adventure Cycling Association
Information from this report will be used to inform the final design package which we will present to community members, including residents, traders, service providers and visitors to the area for further comment.
Session 63: How the Nashville Area MPO Bike/Ped Study Changed Funding Decisio...Sharon Roerty
How does a region of 22 municipalities, 3,300 miles of major roadways, and 1.3 million people covering 2,900 square miles determine where to invest in sidewalks, bikeways, and greenways? This session will focus on key successes from middle Tennessee’s first regional bicycle and pedestrian study including a public involvement process that engaged nearly 2,100 participants and the creation of a unique formula-based non-motorized project evaluation process impacting MPO funding.
Sustrans Scotland Raising the Standards Day 2017: Monitoring and EvaluationSustrans
Our research and monitoring unit specialists explain how they can help you get the data to answer the questions of what you should invest in to achieve active mobility, by understanding the impact of infrastructure and behaviour change programmes.
2014 Walk/Ride Day Challenge Sample Impact ReportTilly Pick
Walk/Ride Days celebrate sustainability and health. One day each month, Walk/Ride Days amplify green, healthy commuting choices made by many to the broader community, inviting more people to actively reconsider how they get to work. In 2014, Walk/Ride Days grew nearly 2x compared to 2013. In over 15,000 commutes recorded in 2014, 12% showed a shift to greener commutes; 8% were healthier. Participation in Walk/Ride Day and the Walk/Ride Day Corporate Challenge is possible from anywhere in the world.
Council is preparing draft concept plans to improve the pedestrian amenity along Greville and King Streets, Prahran. Those who live, work and visit the area participated either by survey or attendance at an information session. We have developed a couple of draft concept plans to give you an idea about how we’re planning to improve Greville and King Streets. Information from this report will be used to inform the final design package which we will present to community members, including residents, traders, service providers and visitors to the area for further comment.
While at Wyncode, I worked with a team to develop two projects, most notable was Escher, the Airbnb of office spaces. It allowed people to list there personal space and office amenities so that professionals could use their space to get work done or use their equipment such as printers.
This is a brief case study showcasing my work as a software product manager at Kravin Kitchen, a culinary business management platform uniquely designed to help food trucks, meal prep businesses, and virtual restaurants streamline operations and increase sales.
In this instance, I evaluated the customers and chef's list of orders and I redesigned the UI in Figma for easy scannability.
Getting New Subscribers to Complete Their ProfilesNate B. DeWaele
I worked for Kravin Kitchen as a Product Manager; the platform was the leading business management platform uniquely designed to help food trucks, meal prep businesses, and virtual restaurants streamline operations and increase sales.
As the Product Manager, I made designs in Figma and wrote and refined user stories. I used intuition, customer feedback and qualitative and quantitative data taken from Mouseflow to shape the product.
This case study demonstrates how we attempted to get more subscribers to complete their profiles after sign up.
Capleo is a music application that makes it easy to build playlists. For this project, I selected the color scheme and did backend QA with minitest. The design is mobile friendly. Technology used includes Ruby on Rails, Bootstrap and the Spotify API. It is no longer deployed but the repo still exists.
https://github.com/nbdewaele/music-app
The Wall is simple demonstration of my wireframing and user story mapping abilities as a product owner. It is not an application in production. Click around! the links work and the view has logged in/out dependancies. I used WireFrame Sketcher and can't wait to use more vibrant tools.
https://wiresketch.com/8ejnMiAh/intro
Vitstack is a Ruby on Rails app that I am currently working on. It will find formulas containing combinations of vitamins and supplements the user is currently taking. My aim is to help people save money, cut down on packaging waste, and help people to remember to take all of their pills.
https://infinite-mountain-69399.herokuapp.com/
Vertical Gardening School is an online authority on DIY indoor gardening. It is built with the GeneratePress theme on WordPress. I did the styling as well as the integration of application functions. Most of the work I do as a freelancer is with Wordpress. I am available remote and in Fort Lauderdale.
https://verticalgardeningschool.com/
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
1. Nate DeWaele November 1st, 2023
Ridership Trends of Bay Wheels
Member and Casual Riders
Data analysis exploring bike share rider behavior
2. Contents
• Intro / Background
• Statement of Business Task
• Description of Data Sources Used
• Documentation of any cleaning or manipulation of data
• Summary of Analysis
• Supporting Visualizations
• Recommendations
• Appendix
3. Intro / Background
Coursera Data Analytics Certi
fi
cate Capstone Project
• This project demonstrates what I have learned in the course
• The data is real; taken from San Francisco Bay Wheels data provided
by Lyft
• My role: Data Analyst on a marketing analytics team for a fictional
company called Cyclistic
• Goal: Determine difference between casual and member bike riders
• Deliverable: This presentation which meets the business task
I just
fi
nished the Coursera Data Analytics Certi
fi
cate course and decided to do the optional capstone data analysis project. I had the option of choosing my own project
or working with Chicago’s real bikeshare data set compiled by Divvy by Lyft. After looking into the data and the program, I saw that Lyft does a bikeshare program in my
home city of San Francisco called Bay Wheels so I decided to do the bikeshare project but work with the Bay Wheels data instead.
In this project, I am the junior data analyst on a marketing analytics team for a
fi
ctional company called Cyclistic. My role is to ask, prepare, process, analyze, share, and
act. While the scenario is
fi
ctional, the data and
fi
ndings are real.
The director of marketing believes the company’s future success depends on maximizing the number of annual memberships and they want to know how casual riders
and members use the bikes di
ff
erently. My report will be shared with my analytics team, the director and the executive team.
The director has set a clear goal: Design marketing strategies aimed at converting casual riders into annual members. In order to do that, however, the marketing analyst
team needs to better understand how annual members and casual riders di
ff
er, why casual riders would buy a membership, and how digital media could a
ff
ect their
marketing tactics. Moreno and her team are interested in analyzing the Cyclistic historical bike trip data to identify trends
My assignment is to answer: how annual members and casual riders di
ff
er.
I am responsible for producing a report with the following deliverables:
1. A clear statement of the business task
2. A description of all data sources used
3. Documentation of any cleaning or manipulation of data
4. 4. A summary of my analysis
5. Supporting visualizations and key
fi
ndings
6. Your top three recommendations based on my analysis
5. Business Task
• Determine how annual and casual riders differ
• Report findings to:
• Marketing team
• Director
• Executive team
6. Data Source Details
Bay Wheels bike-share data is real and was collected by Lyft; it is made
publicly available by the city of San Francisco and Motivate International Inc.
• Data spans from Oct. 2022 - Sept. 2023
• Final processing of data was on November 6th, 2023
• Total data: 2,485,313 observations
• Credit Card Info was removed so as to protect user
identities and
fi
nancial Info
• Insights about who converted from casual to
member cannot be determined
• See appendix for more info
• Variables
• ride_id
• rideable_type
• started_at
• ended_at
• start_station_name
• start_station_id
• end_station_name
• end_station_id
• start_lat
• start_lng
• end_lat
• end_lng
• member_casual
7. Documentation of Data Cleaning
From 2,485,313 raw observations to a clean data set of 1,865,753
• RStudio was used due to the large volume of records
• Removed Observations:
• Duplicates
• Nulls
• Containing empty strings in member_casual, started_at, ended_at
• Trip times < 0
• Records with the word “Test” in them
• Con
fi
rmed there were no outliers
• started_at and ended_at converted from string to date time format
• duration column made by subtracting started_at from ended_at
Data cleaning and manipulation was done in Rstudio. After all records were collected they were merged into one raw data set containing 2,485,313 observations. Here
is a list of things I did to clean the data:
Remove duplicates
Remove observations with null values
Remove observations with empty strings
After con
fi
rming that those records had an even distribution and were not in large chunks of time
Con
fi
rmed there were no large outliers, no observations were removed
Remove observations where the trip time was less than zero
Remove any observations with the word “test” in them
After removing records, I had 1,865,753 records left. I then checked some basic stats about trip duration to see if there might be anything concerning. To do that I
converted ended_at and started_at from strings to datetime data types and made a duration column which is ended_at - started_at.
8. Documentation of Data Cleaning
Final cleaned data set
• Mean Duration: 22.00771
• Median Duration: 22.74417
• Standard Deviation: 4.766666
The median durations are relatively close, which suggests that the data may be
normally distributed. The standard deviation is relatively low, indicating that most
of the data points are close to the mean and will tell an accurate story.
Below are the stats:
Mean Duration: 22.00771
Median Duration: 22.74417
Standard Deviation: 4.766666
The median durations are relatively close, which suggests that the data may be normally distributed. The standard deviation is relatively low, indicating that most of the
data points are close to the mean and will tell an accurate story.
9. Summary of Analysis
Basic Stats: No signi
fi
cant di
ff
erence in trip duration
Member Riders Summary
• Mean duration: 21.91673
• Median duration: 22.68861
• Standard deviation: 4.769218
Casual Riders Summary
• Mean duration: 22.20975
• Median duration: 22.83722
• Standard deviation: 4.754778
10. Casual routes
Summary of Analysis
Top routes
Member routes
Start Station Name End Station Name Total Rides
Market St. at Stuart St Barry St. at 4th St 811
North Point St. at Polk St. Market St. at Stuart St 811
Bay Pl. at Vernon St. 19th St. BART Station 773
Market St. at 10th St. Market St. at 10th St. 738
The Embarcadero St. at Sansome Market St. at Stuart St 738
Start Station Name End Station Name Total Rides
Mason St. at Halleck Mason St. at Halleck 1763
Lincoln Blvd. at Graham St. Lincoln Blvd. at Graham St. 1176
Pier 1/2 at The Embarcadero North Point St. at Powell St. 876
Fell St. at Stanyan St. Fell St. at Stanyan St. 875
North Point St. at Polk St. North Point St. at Polk St. 784
Looking at the top 5 routes for member and casual rider types, we see that 4/5 routes casual riders take are circuitous; they end at the same station the bike was
checked out at. Alternatively, 4/5 of the most common routes member riders take end at a di
ff
erent station than the start.
This suggests that members have a place to go and that casual riders are generally taking bikes for a joy ride
11. Casual riders
Summary of Analysis
Top start stations
Member riders
Start Station Name Total Rides
Market St. at Stuart St. 18,327
Market St. at 10th St. 18,014
Powell St. BART Station (Market St. at 4th St.) 14,521
San Francisco Caltrain (Townsend St. at 4th St.) 13,991
Montgomery St. BART Station (Market St. at 2nd St.) 12,853
Start Station Name Total Rides
Market St. at Stuart St. 8,060
San Francisco Ferry Building (Harry Bridges Plaza) 7,008
3 Pier 1/2 at The Embarcadero 6,775
Powell St. BART Station (Market St. at 4th St.) 6,068
North Point St. at Polk St. 6,053
While we are looking at the di
ff
erences, I found that there was a similarity in start stations for casual and member riders. A look at the top 5 start stations for each rider
showed that Market St at Steuart St and Powell St BART Station (Market St at 4th St) stations are popular among both riders. These may be stations where more
commuters are leaving from therefore these may be better targets for advertising.
*Market St at Steuart St station* and *Powell St BART Station (Market St at 4th St)* are two stations that are most common for causal and member riders. It may be
worth targeting marketing in this area. Some users may be in town for work and biking into the site from these stations. People who live in the area may also have used
these stations casually; advertisements around these stations may help convert casual riders to member riders.
12. Visualizations
Ridership for members is highest at the beginning of the business week, especially mid week. It drops signi
fi
cantly on Friday and Saturday and then picks up again on
Sunday. This may be because members use bikes to commute during the week. Members may be less inclined to ride on Friday and Saturday because that is where
their routine transitions out of work. And the increase on Sunday may be more for leisure.
Casual riders show a more even distribution of bike checkouts from day to day with highest number of checkouts on Thursday, Friday and Saturday. Interestingly, on
Monday, Tuesday and Thursday, the di
ff
erence in amount of bike checkouts is very similar to the bike checkouts of members on those days, this indicates casual riders
on those days may be using the bikes more like members do to commute.
13. Visualizations
Ridership for members and casual riders show a similar trend which makes sense because people are less likely to check bikes out during the late hours of the night.
Member riders begin checking out more bikes before work starts with a peak between 8-9am. Following rush hour, the member ridership drops and evens out until the
post work rush hour at 4pm where a signi
fi
cantly larger amount of bikes are checked out till 6pm. This trend indicates members are largely riding as part of a commute
and between the ride to work and the ride home, they prefer to commute home via bike.
Casual riders don’t demonstrate as large an uptick of checkouts as members during the morning rush hour; however ridership does increase around that time.
Interestingly, casual ridership does spike at rush hour during the afternoon commute in the same proportion as member riders; that indicates that casual riders who
check out bikes around that time are commuting home or back to their hotel after work.
14. Visualizations
The rate of bike checkouts is proportionally similar between member and casual riders throughout the year. April through October are the warmest months in San
Francisco and the uptick in bike checkouts re
fl
ects that; these may be the best months for marketing campaigns.
15. Recommendations
• Implement a strategic marketing campaign spanning from April to October to capture the
peak ridership seasons, targeting the transition of casual riders into loyal members.
• Highlight the practical utility of bike usage, particularly during the afternoon rush hour.
Emphasize biking as a means of unwinding after work, aligning with the observed surge in
checkouts during this time.
• Focus marketing e
ff
orts on stations like Market St. at Steuart St. and Powell St. BART
Station (Market St. at 4th St.), the most frequent start stations for both casual and member
riders. These locations, often used by commuters or locals, present a prime opportunity for
targeted advertising. Encouraging riders to shift from casual use to committed membership
could signi
fi
cantly bene
fi
t from localized marketing strategies around these stations.
Roll out marketing campaigns from April to October
There are many reasons to use a bike such as leisure, but it seems even casual riders largely use the bikes for the practical purpose of commuting. Since a larger
amount of checkouts are during afternoon rush hour, the marketing message may appeal to people who want to unwind after work.
Market St at Stuart St station and Powell St BART Station (Market St at 4th St) are two stations that are most common for causal and member riders. It may be worth
targeting marketing in this area. Some users may be in town for work and biking into the site from these stations. People who live in the area may also have used these
stations casually; advertisements around these stations may help convert casual riders to member riders.
16. Appendix
Links to Published Work
• This project can be found and checked on my Kaggle pro le
• Supporting scripts must be accessed on my GitHub account
• This presentation is also on my LinkedIn Page or my slideshare page
• If you like my work please consider sharing it or connecting with me on
LinkedIn
• I am currently looking for a Software Product Manager or Data Analyst
position
17. Appendix
Kudos / Citations
• Data set: Motivate International Inc. Bay Wheels, Lyft Bikes and Scooters LLC.
2022-2023. Bay Wheels Trip Data. Retrieved from URL: https://s3.amazonaws.com/
baywheels-data/index.html.
• License: Bay Wheels, Lyft Bikes and Scooters LLC. (2023). Data License Agreement.
Retrieved from URL: https://baywheels-assets.s3.amazonaws.com/data-license-
agreement.html
• Wickham, H. (2016). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New
York. ISBN 978-3-319-24277-4. https://ggplot2.tidyverse.org
• Wickham, H., Francois, R., Henry, L., & Muller, K. (2021). Dplyr: A Grammar of Data
Manipulation. R package version 1.0.7 Retrieved from https://CRAN.R-project.org/
package=dplyr
18. Appendix
Kudos / Citations
• Xie, Y. (2021). Knitr: A General-Purpose Package for Dynamic Report
Generation in R. R package version 1.33. Retrieved from https://CRAN.R-
project.org/package/knitr
• Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., Francois,
R., … & Yutani, H. (2019). Welcome to the tidy verse. Journal of Open Source
Software, 4(43), 1686. Retrieved from https://doi.org/10.21105/joss.01686
• Grolemund, H., & Wickham, H. (2011). Dates and times made easy with
lubridate. Journal of Statistical Software, 40(3), 1-25. Retrieved from https://
doi.org/10.18637/jss.v040.i03