The document describes a project to predict occupancy trends in Barcelona's public bicycle sharing stations using open data. The project aims to help users plan trips by showing them which stations will likely have available bikes or empty slots. It uses a random forest model to make 3-day predictions based on past occupancy data, weather forecasts, calendar information such as holidays and weekdays. The model was trained on 90 days of recent observations and evaluated against data from the same months in the previous year.
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Predicting occupancy trends in Barcelona's bicycle service stations using open data
1. Gabriel Martins Dias*, Boris Bellalta, Simon Oechsner
Universitat Pompeu Fabra, Barcelona, Spain
Predicting occupancy trends in Barcelona's
bicycle service stations using open data
SAI Intelligent Systems Conference 2015
10-11 November 2015 | London UK
3. Bicing
The public bicycle system of Barcelona is called “Bicing” and it made
for local citizens. In order to use it, people have to pay an annual fee
that lets them borrow a bike for 30 minutes without any extra cost.
If the trip lasts longer than 30 minutes and less than 2 hours, a small
fee is applied. Otherwise, if it lasts longer than 2 hours the fee is much
more expensive.
Therefore, most of the trips (97%) last less than 30 minutes.
5. Barcelona
This is the map of Barcelona.
At the bottom, we can observe the sea. The terrain is not flat and the
highest altitude is over 100 meters above the sea level.
9. Problem
In general, the bicycles help people to travel around the city, to go to
work, to the school, and so on.
However, there are two problems which users face very often:
1. Not finding a bicycle when they want to go somewhere;
2. Not finding a free slot in the station when they arrived to their
destination.
19. Levels
Full
Almost full
Bikes and slots available
Almost empty
Empty
We observed that for a person that is looking for a bike, it does not
matter whether there are 5, 10 or 50 bicycles available in a station.
However, they want to avoid stations that may be nearly empty.
On the other hand, a person that is looking for a free slot will avoid
stations that are nearly or completely full.
Therefore, we defined such statuses as the critical ones.
48. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
49. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
50. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
51. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
52. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
0
10
20
30
30 days of observations - 1 year before
53. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
0
10
20
30
30 days of observations - 1 year before
54. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
0
10
20
30
30 days of observations - 1 year before
+
55. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
0
10
20
30
30 days of observations - 1 year before
0
10
20
30
Next 72 hours
+
56. 3 days of predictions
Using Random Forest
0
10
20
30
90 days of observations
0
10
20
30
30 days of observations - 1 year before
0
10
20
30
Next 72 hours
+
We considered the last 90 days and the 30 days observed 1 year
before to predict the statuses in the next 72 hours.
70. Accuracy
According to the age of the predictions
0 %
50 %
100 %
0 days old 1 day 2 days
Without open data Using open data
71. Accuracy
According to the age of the predictions
0 %
50 %
100 %
0 days old 1 day 2 days
Without open data Using open data
72. Accuracy
According to the age of the predictions
0 %
50 %
100 %
0 days old 1 day 2 days
Without open data Using open data
We observed that the average accuracy was improved when we used
open data in the predictions.
77. Sensitivity
Using open data
According to the age of the predictions
0 %
25 %
50 %
75 %
100 %
0 days old 1 day 2 days
& Specificity
The sensitivity is the percentage of critical statuses that were
correctly predicted. It was over 75%.
87. Future work
Scalability
Other data sources
Our future work involve making the predictions for all 400 stations,
considering other sources of open data.
91. Future work
Scalability
Other data sources
Smartphone application
Other applications
We expect that the city council might use the predictions to improve
their schedule to collect the bikes from the full stations.
92. Gabriel Martins Dias
gabriel.martins@upf.edu
Boris Bellalta, Simon Oechsner
Universitat Pompeu Fabra
Barcelona, Spain
Predicting occupancy trends in Barcelona's
bicycle service stations using open data
Impact of
external
factors
Critical
statuses can
be predicted
Use of
open data