This document summarizes an analysis of island hopping routes in Greece. It found that the top destinations based on searches were Santorini, Mykonos, Syros, Paros, and Sifnos. A network was modeled showing connections between islands, particularly in the Cyclades. A linear regression model was developed to predict arrivals to destinations based on search traffic from Openseas, achieving good accuracy on test examples. Future work proposed identifying missing routes that prevent daily island hopping and predicting demand for those new routes.
2. BLUE HACKATHON 2015
ISLAND HOPPING
Island hopping is the crossing of
an ocean by a series of shorter journeys
between islands, as opposed to a single
journey directly to the destination.
In Greece, getting there really is
half the adventure and island
hopping remains an essential
part of the Greek experience.
3. First of all, we analyzed our data in order to find out
which are the top destinations,
based on the performed queries at Openseas,
http://www.openseas.gr/
using the R project for Statistical Computing,
http://www.r-project.org
BLUE HACKATHON 2015
ISLAND HOPPING
5. Top 20 destinations – Some statistics
Using the R code
summary(names_df$the_values)
We obtained the following results
Min. 1st Qu. Median Mean 3rd Qu. Max.
1.0 88.5 596.0 4698.0 3290.0 139300.0
BLUE HACKATHON 2015
ISLAND HOPPING
6. 16,7
6,3
5,7 5,3
4,4 4,0
3,4 3,4 3,2
2,6 2,4 2,3 2,0 2,0 1,9 1,7 1,4 1,3 1,3 1,3
0,0
2,0
4,0
6,0
8,0
10,0
12,0
14,0
16,0
18,0
piraeus
all
santorini
mykonos
syros
paros
sifnos
naxos
aegina
rafina
lavrio
tinos
ios
milos
kea
kythnos
heraklion
rodos
folegandros
serifos
% requests via Openseas (per destination)
BLUE HACKATHON 2015
ISLAND HOPPING
Blue Hackathon 2015: FORTH crs data (2014)
Analysis: Top 20 destinations - % requests (per destination)
via www.openseas.gr
8. Network modeling and visualization
Our aim is to build and visualize a network based on the
connections of these islands (Islands = nodes, Routes = edges),
with focus on Cyclades
so that we can see which of the islands are connected
– through a route – and which are not!
For this we used the Cytoscape visualization tool
www.cytoscape.org
BLUE HACKATHON 2015
ISLAND HOPPING
11. Network modeling and visualization
BLUE HACKATHON 2015
ISLAND HOPPING
Blue Hackathon 2015: FORTH crs data (2014)
Analysis: Network of the top 136 routes - based on the requests via Openseas
12. In order to predict the demand of a destination
we found the relationship between OpenSeas searches and
actual arrivals,
using as input the searches towards 50 different
destinations, against the actual arrivals
from the data provided by OpenSeas.gr and the Greek
port Authorities (respectively).
BLUE HACKATHON 2015
ISLAND HOPPING
13. Prediction of the demand of a destination
For this, we trained a linear regression algorithm written in
R so that, given as input:
the destination
the number of OpenSeas queries towards that destinations,
the month in which these queries were conducted
it predicts the actual number of arrivals at that
destination
BLUE HACKATHON 2015
ISLAND HOPPING
14. Prediction of the demand of a destination
We defined the destinations and the months
as categorical variables, and
we discovered that the model that best fits the data is of
the following form:
BLUE HACKATHON 2015
ISLAND HOPPING
arrivals = (requests*name)+ name + month + requests^2
+ month*requests + (requests^2*month)
Where in the above equation, requests is a real number denoting the number of Openseas
requests, month is a vector (Jan,…, Dec) and names is a vector containing all the names of
the islands (paros, naxos, …).
16. Prediction of the demand of a destination
BLUE HACKATHON 2015
ISLAND HOPPING
‘EXPERIMENTS’/EXAMPLES
ISLAND MONTH SEARCHES PREDICTIONS ACTUAL
ARRIVALS
Mykonos October 4000 37545.69 37065
Aegina July 5050 106847.9 107225
Paros August 11092 124534.4 132117
Ios June 74 18303.95 13245
Naxos April 23 20238.17 18758
Antiparos July 3 55527 55527
17. Future Steps: System’s failure
We will define as failure of the system, not being able to visit one
of these islands on a daily basis (island hopping).
We will study also the islands which are connected but the existing
routes do not allow the visitor to go/return at a convenient time.
From the above, we will find out which routes are missing, so that
island hopping will become possible!
BLUE HACKATHON 2015
ISLAND HOPPING
18. Future Steps: Missing routes
We will predict the demand of the missing routes.
To achieve this, we will use the prediction of the demand of a
destination, and other parameters, such as the attractiveness of the
islands.
BLUE HACKATHON 2015
ISLAND HOPPING