SlideShare a Scribd company logo
ENTER 2017 Research Track Slide Number 1
Wolfram Höpkena, Dominic Ernestia, Matthias Fuchsb,
Kai Kronenbergb, and Maria Lexhagenb
a Business Informatics Group
University of Applied Sciences Ravensburg-Weingarten, Germany
{name.surname}@hs-weingarten.de
b European Tourism Research Institute (ETOUR)
Mid-Sweden University, Sweden
{name.surname}@miun.se
Big data as input for
predicting tourist arrivals
ENTER 2017 Research Track Slide Number 2
Content
• Introduction
• Related work
• Data collection and preparation
• Model building
• Results
• Conclusion and Outlook
ENTER 2017 Research Track Slide Number 3
Motivation
• Demand prediction in tourism
– Being a labour-intensive branch, tourism managers strongly depend on
precise demand predictions
– Besides long-term trends, tourist arrivals typically follow strong
seasonal trends
Autoregressive approaches lead to quite satisfactory results and have
been used quite widely in tourism in the past (Song & Li, 2008, p. 210)
• Limitations of autoregressive approaches
– Tourism demand is heavily influenced by external factors as well
• Economic factors: exchange rates, consumers’ income (Frechtling, 2002; Song & Witt, 2000)
• Destination-specific factors: marketing efforts (Brida & Schubert, 2008; Kronenberg et al. 2016)
– Electronic footprints of customers are promising new info sources
• Especially to encounter singular or unexpected demand fluctuations
induced by, e.g. natural disasters or mega events
ENTER 2017 Research Track Slide Number 4
Objective
• Research proposition (1): Big data information sources
increase performance of predicting tourist arrivals compared
to autoregressive approaches
– Extend autoregressive demand prediction by making use of big data
information sources
• Economic factors
• Data on web search behaviour of potential customers
• Research proposition (2): data mining technique can achieve
better prediction performance than statistical approaches
(e.g. linear regression)
– Data mining technique k-nearest neighbour
• Not restricted to a linear relationships, robust against biased and
statistically inconvenient data (e.g. collinearity of input attributes)
ENTER 2017 Research Track Slide Number 5
Content
• Introduction
• Related work
• Data collection and preparation
• Model building
• Results
• Conclusion and Outlook
ENTER 2017 Research Track Slide Number 6
Related work
• (Non-causal) Time series models
– Autoregressive moving average (ARIMA) models,
exponential smoothing models (Song & Li, 2008, p. 210)
• (Causal) Econometric approaches
– Autoregressive distributed lag models (ADLM), error correction model
(EDM), vector autoregressive (VAR) model, time varying parameter
(TVP) model to avoid spurious regression results (Peng et al. 2010)
– Linear structural equation model (SEM) (Turner & Witt 2001)
– Important determinants of tourism demand
• Consumer price index, GDP, exchange rates, interest rate, unemployment
rate, money supply, export/import rates (Cho 2001; Song & Li 2008, p. 211)
• Man-made events, advertising investments, crises (e.g. financial crisis,
terror attacks), natural disasters (SARS, foot & mouth disease, etc.)
ENTER 2017 Research Track Slide Number 7
Related work
• Using big data sources to predict tourism demand
– Google search engine traffic to increase forecasting performance using
autoregressive mixed-data sampling (AR-MIDAS) models
(Bangwayo-Skeete & Skeete, 2015)
– Google search engine traffic to improve tourism demand prediction
accuracy, compared to purely autoregressive models or exponential
smoothing time-series models (Önder & Gunter, 2016)
– Search engine data to increase forecasting accuracy, compared to
autoregressive moving average (ARMA) models (Yang et al., 2015)
– DMO Web traffic data to predict demand for hotel rooms in a tourist
destination (Yang et al., 2014)
ENTER 2017 Research Track Slide Number 8
Related work
• Artificial intelligence (AI) / data mining (DM) methods to
predict tourism demand
– Artificial neural networks (ANN) outperform decomposition models,
exponential smoothing, ARIMA and multiple regression (Kon & Turner 2005;
Song & Li 2008, p. 212; Law and Au 1999; Palmer et al. 2006; Lin et al. 2011)
– Rough set approach (decision rule induction) (Law and Au 1988; Goh et al. 2008)
– Genetic Algorithms for recognizing changes in composition of tourism
demand (Hernandez-López 2004; Hong et al. 2011)
– Support Vector Machines (SVM) superior to ARIMA models (Pai et al. 2006;
Chen & Wang 2007)
ENTER 2017 Research Track Slide Number 9
Content
• Introduction
• Related work
• Data collection and preparation
• Model building
• Results
• Conclusion and Outlook
ENTER 2017 Research Track Slide Number 10
Specification of data set
• Tourist arrival data for Swedish mountain destination Åre
– Total period: 77 months (December 2005 - April 2012)
– Sending countries: Denmark, Finland, Norway, Russia, United Kingdom
• Attributes
– Tourist arrivals: forecasting attribute
– Income: GDP per capita for each sending country
– Destination price level: CPI (Consumer Price Index) Sweden / CPI sending country
– Price level of alternative destinations
– Transportation costs: Jet fuel prices
– Advertising expenditures: Advertising investments by SkiStar Åre
– Web search traffic: Google Trends data (Åre, Åre winter, Åre ski, …)
– Mega event: dummy variable for FIS Alpine Ski World Championship
ENTER 2017 Research Track Slide Number 11
Specification of data set
Tourist arrivals and Google online traffic
ENTER 2017 Research Track Slide Number 12
Data preparation
• Time-lagged data
– Target attribute tourist arrivals is used with different time-lags as input
attributes (autoregressive approach)
– Time-lags are meaningful for other input attributes as well (e.g. web
search traffic, advertising expenditures)
– Data mining techniques (e.g. k-nearest neighbour) do not require
specific statistical characteristics to hold true for data set (e.g. non-
collinearity)
Input attributes can be included in data set for all different time-lags
within certain time window
– Time window of 24 months has been chosen to cope with seasonality
and trend over several years
Data set consists of
• Arrivals + big data attributes at time t-1 to t-24 = input attributes
• Tourist arrivals at time t = target attribute
ENTER 2017 Research Track Slide Number 13
Content
• Introduction
• Related work
• Data collection and preparation
• Model building
• Results
• Conclusion and Outlook
ENTER 2017 Research Track Slide Number 14
Model building
• Linear regression as statistical approach
– Forward selection (data-driven model building approach) to avoid
inclusion of irrelevant or highly correlated input attributes (caused by
including attributes with different time-lags)
• K-nearest neighbour as data mining approach
– Estimates target attribute of a new data entry, based on k most similar
data entries within training data set
– Two nearest neighbours (k=2) and Euclidian distance measure to
identify most similar past time windows
– Z-score standardisation to normalize attribute values
• Evaluation
– Prediction performance evaluated by sliding window validation
(moving a training and consecutive test window along data set)
ENTER 2017 Research Track Slide Number 15
Content
• Introduction
• Related work
• Data collection and preparation
• Model building
• Results
• Conclusion and Outlook
ENTER 2017 Research Track Slide Number 16
Results
ENTER 2017 Research Track Slide Number 17
Results
Autoregressive
approaches reach
good results in light
of an average of
3,223 arrivals and a
maximum of
14,199 arrivals per
month
ENTER 2017 Research Track Slide Number 18
Results
Adding big data
information
sources as
additional input
significantly
increases
prediction
performance
(supports research
proposition 1)
ENTER 2017 Research Track Slide Number 19
Results
MAE over all
sending countries
for the prediction
method k-NN is
reduced from 620
to 432, thus, by
30%
ENTER 2017 Research Track Slide Number 20
Results
Web search traffic
(i.e. Google trends),
price level of
alternative
destinations and
transportation costs
(i.e. jet fuel prices)
show strongest
influence on tourist
arrivals
ENTER 2017 Research Track Slide Number 21
Results
Predicting tourist arrivals by linear regression and big data
ENTER 2017 Research Track Slide Number 22
Results
Data mining
technique k-
nearest neighbour
(k-NN) outperforms
statistical approach
linear regression
(research
proposition 2)
ENTER 2017 Research Track Slide Number 23
Content
• Introduction
• Related work
• Data collection and preparation
• Model building
• Results
• Conclusion and Outlook
ENTER 2017 Research Track Slide Number 24
Conclusion and outlook
• Big data as input to tourist arrival prediction
– Big data information sources (e.g. web search traffic) show potential to
increase accuracy to predict tourist arrivals compared to
autoregressive approaches
– Advantage of enabling a prediction of tourist arrivals under changing
external conditions or singular events
• Data mining techniques instead of statistical approaches
– Data mining techniques (in this case k-NN) show potential to
outperform statistical approaches, like linear regression
– Can identify any kind of non-linear relationships; more robust against
biased data and violations of input attribute preconditions
• Outlook: Extend range of big data information sources
– Web navigation data, customer traffic data on social media platforms
(Twitter, Facebook, YouTube, etc.)
ENTER 2017 Research Track Slide Number 25
Wolfram Höpkena, Dominic Ernestia, Matthias Fuchsb,
Kai Kronenbergb, and Maria Lexhagenb
a Business Informatics Group
University of Applied Sciences Ravensburg-Weingarten, Germany
{name.surname}@hs-weingarten.de
b European Tourism Research Institute (ETOUR)
Mid-Sweden University, Sweden
{name.surname}@miun.se
Big data as input for
predicting tourist arrivals

More Related Content

Viewers also liked

An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...
An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...
An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Use of Bitcoin in Online Travel Product Shopping: The European Perspective
Use of Bitcoin in Online Travel Product Shopping: The European PerspectiveUse of Bitcoin in Online Travel Product Shopping: The European Perspective
Use of Bitcoin in Online Travel Product Shopping: The European Perspective
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Authenticity and tourism: What TripAdvisor reviews reveal about authentic travel
Authenticity and tourism: What TripAdvisor reviews reveal about authentic travelAuthenticity and tourism: What TripAdvisor reviews reveal about authentic travel
Authenticity and tourism: What TripAdvisor reviews reveal about authentic travel
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...
Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...
Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Technology Acceptance of Virtual Reality for Travel Planning
Technology Acceptance of Virtual Reality for Travel PlanningTechnology Acceptance of Virtual Reality for Travel Planning
Technology Acceptance of Virtual Reality for Travel Planning
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Personality impacts on the participation in the peer-to-peer (P2P) travel acc...
Personality impacts on the participation in the peer-to-peer (P2P) travel acc...Personality impacts on the participation in the peer-to-peer (P2P) travel acc...
Personality impacts on the participation in the peer-to-peer (P2P) travel acc...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
An innovative virtual method for providing eTourism education in a university...
An innovative virtual method for providing eTourism education in a university...An innovative virtual method for providing eTourism education in a university...
An innovative virtual method for providing eTourism education in a university...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Managerial Response Strategy to Online Customer Compliments: a Comparative An...
Managerial Response Strategy to Online Customer Compliments: a Comparative An...Managerial Response Strategy to Online Customer Compliments: a Comparative An...
Managerial Response Strategy to Online Customer Compliments: a Comparative An...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
A synthesis of unique product attributes for alternative accommodation types
A synthesis of unique product attributes for alternative accommodation typesA synthesis of unique product attributes for alternative accommodation types
A synthesis of unique product attributes for alternative accommodation types
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...
Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...
Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Virtual Reality and Attitudes toward Tourism Destinations
Virtual Reality and Attitudes toward Tourism DestinationsVirtual Reality and Attitudes toward Tourism Destinations
Virtual Reality and Attitudes toward Tourism Destinations
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Towards an ICT roadmap for smart tourism destinations based on prospective an...
Towards an ICT roadmap for smart tourism destinations based on prospective an...Towards an ICT roadmap for smart tourism destinations based on prospective an...
Towards an ICT roadmap for smart tourism destinations based on prospective an...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Tourism Service Portfolio
Tourism Service PortfolioTourism Service Portfolio
Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...
Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...
Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Online reputation and tourism destination competitiveness - conceptual model ...
Online reputation and tourism destination competitiveness - conceptual model ...Online reputation and tourism destination competitiveness - conceptual model ...
Online reputation and tourism destination competitiveness - conceptual model ...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Smart Glasses Adoption in Smart Tourism Destination: A Conceptual Model
Smart Glasses Adoption in Smart Tourism Destination: A Conceptual ModelSmart Glasses Adoption in Smart Tourism Destination: A Conceptual Model
Smart Glasses Adoption in Smart Tourism Destination: A Conceptual Model
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Heimatunes
HeimatunesHeimatunes
Ticino Tourism VR Experience
Ticino Tourism VR ExperienceTicino Tourism VR Experience
Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...
Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...
Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
How Instagram influences Visual Destination Image - a case study of Jordan an...
How Instagram influences Visual Destination Image - a case study of Jordan an...How Instagram influences Visual Destination Image - a case study of Jordan an...
How Instagram influences Visual Destination Image - a case study of Jordan an...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 

Viewers also liked (20)

An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...
An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...
An Integrative Model of the Pursuit of Happiness and the Role of Smart Touris...
 
Use of Bitcoin in Online Travel Product Shopping: The European Perspective
Use of Bitcoin in Online Travel Product Shopping: The European PerspectiveUse of Bitcoin in Online Travel Product Shopping: The European Perspective
Use of Bitcoin in Online Travel Product Shopping: The European Perspective
 
Authenticity and tourism: What TripAdvisor reviews reveal about authentic travel
Authenticity and tourism: What TripAdvisor reviews reveal about authentic travelAuthenticity and tourism: What TripAdvisor reviews reveal about authentic travel
Authenticity and tourism: What TripAdvisor reviews reveal about authentic travel
 
Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...
Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...
Flight Cancellation Behaviour under Mobile Travel Application: Based on the C...
 
Technology Acceptance of Virtual Reality for Travel Planning
Technology Acceptance of Virtual Reality for Travel PlanningTechnology Acceptance of Virtual Reality for Travel Planning
Technology Acceptance of Virtual Reality for Travel Planning
 
Personality impacts on the participation in the peer-to-peer (P2P) travel acc...
Personality impacts on the participation in the peer-to-peer (P2P) travel acc...Personality impacts on the participation in the peer-to-peer (P2P) travel acc...
Personality impacts on the participation in the peer-to-peer (P2P) travel acc...
 
An innovative virtual method for providing eTourism education in a university...
An innovative virtual method for providing eTourism education in a university...An innovative virtual method for providing eTourism education in a university...
An innovative virtual method for providing eTourism education in a university...
 
Managerial Response Strategy to Online Customer Compliments: a Comparative An...
Managerial Response Strategy to Online Customer Compliments: a Comparative An...Managerial Response Strategy to Online Customer Compliments: a Comparative An...
Managerial Response Strategy to Online Customer Compliments: a Comparative An...
 
A synthesis of unique product attributes for alternative accommodation types
A synthesis of unique product attributes for alternative accommodation typesA synthesis of unique product attributes for alternative accommodation types
A synthesis of unique product attributes for alternative accommodation types
 
Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...
Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...
Assessing the Performance of a Tourism MOOC Using the Kirkpatrick Model: A Su...
 
Virtual Reality and Attitudes toward Tourism Destinations
Virtual Reality and Attitudes toward Tourism DestinationsVirtual Reality and Attitudes toward Tourism Destinations
Virtual Reality and Attitudes toward Tourism Destinations
 
Towards an ICT roadmap for smart tourism destinations based on prospective an...
Towards an ICT roadmap for smart tourism destinations based on prospective an...Towards an ICT roadmap for smart tourism destinations based on prospective an...
Towards an ICT roadmap for smart tourism destinations based on prospective an...
 
Tourism Service Portfolio
Tourism Service PortfolioTourism Service Portfolio
Tourism Service Portfolio
 
Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...
Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...
Entrepeneurship in the Contemporary Tourism Ecosystem: the Case of Incoming T...
 
Online reputation and tourism destination competitiveness - conceptual model ...
Online reputation and tourism destination competitiveness - conceptual model ...Online reputation and tourism destination competitiveness - conceptual model ...
Online reputation and tourism destination competitiveness - conceptual model ...
 
Smart Glasses Adoption in Smart Tourism Destination: A Conceptual Model
Smart Glasses Adoption in Smart Tourism Destination: A Conceptual ModelSmart Glasses Adoption in Smart Tourism Destination: A Conceptual Model
Smart Glasses Adoption in Smart Tourism Destination: A Conceptual Model
 
Heimatunes
HeimatunesHeimatunes
Heimatunes
 
Ticino Tourism VR Experience
Ticino Tourism VR ExperienceTicino Tourism VR Experience
Ticino Tourism VR Experience
 
Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...
Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...
Exploring the Determinants of Strategic Revenue Management with Idiosyncratic...
 
How Instagram influences Visual Destination Image - a case study of Jordan an...
How Instagram influences Visual Destination Image - a case study of Jordan an...How Instagram influences Visual Destination Image - a case study of Jordan an...
How Instagram influences Visual Destination Image - a case study of Jordan an...
 

Similar to Big data as input for predicting tourist arrivals

Search engine traffic as input for predicting tourist arrivals
Search engine traffic as input for predicting tourist arrivalsSearch engine traffic as input for predicting tourist arrivals
Search engine traffic as input for predicting tourist arrivals
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Linked Data for Cross-Domain Decision-making in Tourism
Linked Data for Cross-Domain Decision-making in TourismLinked Data for Cross-Domain Decision-making in Tourism
Linked Data for Cross-Domain Decision-making in Tourism
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Constructing a Data Warehouse Based Decision Support Platform for China Touri...
Constructing a Data Warehouse Based Decision Support Platform for China Touri...Constructing a Data Warehouse Based Decision Support Platform for China Touri...
Constructing a Data Warehouse Based Decision Support Platform for China Touri...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Strategic Visitor Flows (SVF) analysis using mobile data
Strategic Visitor Flows (SVF) analysis using mobile dataStrategic Visitor Flows (SVF) analysis using mobile data
Strategic Visitor Flows (SVF) analysis using mobile data
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Research agenda for analysing online climate and weather information in the ...
 Research agenda for analysing online climate and weather information in the ... Research agenda for analysing online climate and weather information in the ...
Research agenda for analysing online climate and weather information in the ...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Forecasting London Museum Visitors Using Google Trends Data (Research Note)
Forecasting London Museum Visitors Using Google Trends Data (Research Note)Forecasting London Museum Visitors Using Google Trends Data (Research Note)
Forecasting London Museum Visitors Using Google Trends Data (Research Note)
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
Automated Assignment of Hotel Descriptions to Travel Behavioural PatternsAutomated Assignment of Hotel Descriptions to Travel Behavioural Patterns
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
 Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Information gathering by ubiquitous services for CRM in tourism destinations:...
Information gathering by ubiquitous services for CRM in tourism destinations:...Information gathering by ubiquitous services for CRM in tourism destinations:...
Information gathering by ubiquitous services for CRM in tourism destinations:...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Strategic e-tourism Options for Destinations. Austrian Case
Strategic e-tourism Options for Destinations. Austrian CaseStrategic e-tourism Options for Destinations. Austrian Case
Strategic e-tourism Options for Destinations. Austrian Case
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Ontology-based Matchmaking to Provide Personalized Offers
Ontology-based Matchmaking to Provide Personalized OffersOntology-based Matchmaking to Provide Personalized Offers
Ontology-based Matchmaking to Provide Personalized Offers
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
John Makridis
 
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
Experimental transformation of  ABS data into Data Cube Vocabulary (DCV) form...Experimental transformation of  ABS data into Data Cube Vocabulary (DCV) form...
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
Alistair Hamilton
 
La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...
Esri España
 
Integration of data mining results into multi-dimensional data models
Integration of data mining results into multi-dimensional data modelsIntegration of data mining results into multi-dimensional data models
Integration of data mining results into multi-dimensional data models
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Impact/legacy measurement and evaluation in mega events projects with focus o...
Impact/legacy measurement and evaluation in mega events projects with focus o...Impact/legacy measurement and evaluation in mega events projects with focus o...
Impact/legacy measurement and evaluation in mega events projects with focus o...
Marcos CAVALCANTI
 
Data-Warehouse-and-Business-Intelligence
Data-Warehouse-and-Business-IntelligenceData-Warehouse-and-Business-Intelligence
Data-Warehouse-and-Business-Intelligence
Shantanu Deshpande
 
P. Struijs, Toward the Use of Big Data for European Statistics
P. Struijs, Toward the Use of Big Data for European StatisticsP. Struijs, Toward the Use of Big Data for European Statistics
P. Struijs, Toward the Use of Big Data for European Statistics
Istituto nazionale di statistica
 
Key factors in the booking activity process: the case of self-catering in Rom...
Key factors in the booking activity process: the case of self-catering in Rom...Key factors in the booking activity process: the case of self-catering in Rom...
Key factors in the booking activity process: the case of self-catering in Rom...
International Federation for Information Technologies in Travel and Tourism (IFITT)
 
Noordini audit24112009
Noordini audit24112009Noordini audit24112009
Noordini audit24112009
Dini Che Man
 

Similar to Big data as input for predicting tourist arrivals (20)

Search engine traffic as input for predicting tourist arrivals
Search engine traffic as input for predicting tourist arrivalsSearch engine traffic as input for predicting tourist arrivals
Search engine traffic as input for predicting tourist arrivals
 
Linked Data for Cross-Domain Decision-making in Tourism
Linked Data for Cross-Domain Decision-making in TourismLinked Data for Cross-Domain Decision-making in Tourism
Linked Data for Cross-Domain Decision-making in Tourism
 
Constructing a Data Warehouse Based Decision Support Platform for China Touri...
Constructing a Data Warehouse Based Decision Support Platform for China Touri...Constructing a Data Warehouse Based Decision Support Platform for China Touri...
Constructing a Data Warehouse Based Decision Support Platform for China Touri...
 
Strategic Visitor Flows (SVF) analysis using mobile data
Strategic Visitor Flows (SVF) analysis using mobile dataStrategic Visitor Flows (SVF) analysis using mobile data
Strategic Visitor Flows (SVF) analysis using mobile data
 
Research agenda for analysing online climate and weather information in the ...
 Research agenda for analysing online climate and weather information in the ... Research agenda for analysing online climate and weather information in the ...
Research agenda for analysing online climate and weather information in the ...
 
Forecasting London Museum Visitors Using Google Trends Data (Research Note)
Forecasting London Museum Visitors Using Google Trends Data (Research Note)Forecasting London Museum Visitors Using Google Trends Data (Research Note)
Forecasting London Museum Visitors Using Google Trends Data (Research Note)
 
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
Automated Assignment of Hotel Descriptions to Travel Behavioural PatternsAutomated Assignment of Hotel Descriptions to Travel Behavioural Patterns
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
 
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
 Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
Automated Assignment of Hotel Descriptions to Travel Behavioural Patterns
 
Information gathering by ubiquitous services for CRM in tourism destinations:...
Information gathering by ubiquitous services for CRM in tourism destinations:...Information gathering by ubiquitous services for CRM in tourism destinations:...
Information gathering by ubiquitous services for CRM in tourism destinations:...
 
Strategic e-tourism Options for Destinations. Austrian Case
Strategic e-tourism Options for Destinations. Austrian CaseStrategic e-tourism Options for Destinations. Austrian Case
Strategic e-tourism Options for Destinations. Austrian Case
 
Ontology-based Matchmaking to Provide Personalized Offers
Ontology-based Matchmaking to Provide Personalized OffersOntology-based Matchmaking to Provide Personalized Offers
Ontology-based Matchmaking to Provide Personalized Offers
 
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
 
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
Experimental transformation of  ABS data into Data Cube Vocabulary (DCV) form...Experimental transformation of  ABS data into Data Cube Vocabulary (DCV) form...
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
 
La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...
 
Integration of data mining results into multi-dimensional data models
Integration of data mining results into multi-dimensional data modelsIntegration of data mining results into multi-dimensional data models
Integration of data mining results into multi-dimensional data models
 
Impact/legacy measurement and evaluation in mega events projects with focus o...
Impact/legacy measurement and evaluation in mega events projects with focus o...Impact/legacy measurement and evaluation in mega events projects with focus o...
Impact/legacy measurement and evaluation in mega events projects with focus o...
 
Data-Warehouse-and-Business-Intelligence
Data-Warehouse-and-Business-IntelligenceData-Warehouse-and-Business-Intelligence
Data-Warehouse-and-Business-Intelligence
 
P. Struijs, Toward the Use of Big Data for European Statistics
P. Struijs, Toward the Use of Big Data for European StatisticsP. Struijs, Toward the Use of Big Data for European Statistics
P. Struijs, Toward the Use of Big Data for European Statistics
 
Key factors in the booking activity process: the case of self-catering in Rom...
Key factors in the booking activity process: the case of self-catering in Rom...Key factors in the booking activity process: the case of self-catering in Rom...
Key factors in the booking activity process: the case of self-catering in Rom...
 
Noordini audit24112009
Noordini audit24112009Noordini audit24112009
Noordini audit24112009
 

Recently uploaded

Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 

Recently uploaded (20)

Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 

Big data as input for predicting tourist arrivals

  • 1. ENTER 2017 Research Track Slide Number 1 Wolfram Höpkena, Dominic Ernestia, Matthias Fuchsb, Kai Kronenbergb, and Maria Lexhagenb a Business Informatics Group University of Applied Sciences Ravensburg-Weingarten, Germany {name.surname}@hs-weingarten.de b European Tourism Research Institute (ETOUR) Mid-Sweden University, Sweden {name.surname}@miun.se Big data as input for predicting tourist arrivals
  • 2. ENTER 2017 Research Track Slide Number 2 Content • Introduction • Related work • Data collection and preparation • Model building • Results • Conclusion and Outlook
  • 3. ENTER 2017 Research Track Slide Number 3 Motivation • Demand prediction in tourism – Being a labour-intensive branch, tourism managers strongly depend on precise demand predictions – Besides long-term trends, tourist arrivals typically follow strong seasonal trends Autoregressive approaches lead to quite satisfactory results and have been used quite widely in tourism in the past (Song & Li, 2008, p. 210) • Limitations of autoregressive approaches – Tourism demand is heavily influenced by external factors as well • Economic factors: exchange rates, consumers’ income (Frechtling, 2002; Song & Witt, 2000) • Destination-specific factors: marketing efforts (Brida & Schubert, 2008; Kronenberg et al. 2016) – Electronic footprints of customers are promising new info sources • Especially to encounter singular or unexpected demand fluctuations induced by, e.g. natural disasters or mega events
  • 4. ENTER 2017 Research Track Slide Number 4 Objective • Research proposition (1): Big data information sources increase performance of predicting tourist arrivals compared to autoregressive approaches – Extend autoregressive demand prediction by making use of big data information sources • Economic factors • Data on web search behaviour of potential customers • Research proposition (2): data mining technique can achieve better prediction performance than statistical approaches (e.g. linear regression) – Data mining technique k-nearest neighbour • Not restricted to a linear relationships, robust against biased and statistically inconvenient data (e.g. collinearity of input attributes)
  • 5. ENTER 2017 Research Track Slide Number 5 Content • Introduction • Related work • Data collection and preparation • Model building • Results • Conclusion and Outlook
  • 6. ENTER 2017 Research Track Slide Number 6 Related work • (Non-causal) Time series models – Autoregressive moving average (ARIMA) models, exponential smoothing models (Song & Li, 2008, p. 210) • (Causal) Econometric approaches – Autoregressive distributed lag models (ADLM), error correction model (EDM), vector autoregressive (VAR) model, time varying parameter (TVP) model to avoid spurious regression results (Peng et al. 2010) – Linear structural equation model (SEM) (Turner & Witt 2001) – Important determinants of tourism demand • Consumer price index, GDP, exchange rates, interest rate, unemployment rate, money supply, export/import rates (Cho 2001; Song & Li 2008, p. 211) • Man-made events, advertising investments, crises (e.g. financial crisis, terror attacks), natural disasters (SARS, foot & mouth disease, etc.)
  • 7. ENTER 2017 Research Track Slide Number 7 Related work • Using big data sources to predict tourism demand – Google search engine traffic to increase forecasting performance using autoregressive mixed-data sampling (AR-MIDAS) models (Bangwayo-Skeete & Skeete, 2015) – Google search engine traffic to improve tourism demand prediction accuracy, compared to purely autoregressive models or exponential smoothing time-series models (Önder & Gunter, 2016) – Search engine data to increase forecasting accuracy, compared to autoregressive moving average (ARMA) models (Yang et al., 2015) – DMO Web traffic data to predict demand for hotel rooms in a tourist destination (Yang et al., 2014)
  • 8. ENTER 2017 Research Track Slide Number 8 Related work • Artificial intelligence (AI) / data mining (DM) methods to predict tourism demand – Artificial neural networks (ANN) outperform decomposition models, exponential smoothing, ARIMA and multiple regression (Kon & Turner 2005; Song & Li 2008, p. 212; Law and Au 1999; Palmer et al. 2006; Lin et al. 2011) – Rough set approach (decision rule induction) (Law and Au 1988; Goh et al. 2008) – Genetic Algorithms for recognizing changes in composition of tourism demand (Hernandez-López 2004; Hong et al. 2011) – Support Vector Machines (SVM) superior to ARIMA models (Pai et al. 2006; Chen & Wang 2007)
  • 9. ENTER 2017 Research Track Slide Number 9 Content • Introduction • Related work • Data collection and preparation • Model building • Results • Conclusion and Outlook
  • 10. ENTER 2017 Research Track Slide Number 10 Specification of data set • Tourist arrival data for Swedish mountain destination Åre – Total period: 77 months (December 2005 - April 2012) – Sending countries: Denmark, Finland, Norway, Russia, United Kingdom • Attributes – Tourist arrivals: forecasting attribute – Income: GDP per capita for each sending country – Destination price level: CPI (Consumer Price Index) Sweden / CPI sending country – Price level of alternative destinations – Transportation costs: Jet fuel prices – Advertising expenditures: Advertising investments by SkiStar Åre – Web search traffic: Google Trends data (Åre, Åre winter, Åre ski, …) – Mega event: dummy variable for FIS Alpine Ski World Championship
  • 11. ENTER 2017 Research Track Slide Number 11 Specification of data set Tourist arrivals and Google online traffic
  • 12. ENTER 2017 Research Track Slide Number 12 Data preparation • Time-lagged data – Target attribute tourist arrivals is used with different time-lags as input attributes (autoregressive approach) – Time-lags are meaningful for other input attributes as well (e.g. web search traffic, advertising expenditures) – Data mining techniques (e.g. k-nearest neighbour) do not require specific statistical characteristics to hold true for data set (e.g. non- collinearity) Input attributes can be included in data set for all different time-lags within certain time window – Time window of 24 months has been chosen to cope with seasonality and trend over several years Data set consists of • Arrivals + big data attributes at time t-1 to t-24 = input attributes • Tourist arrivals at time t = target attribute
  • 13. ENTER 2017 Research Track Slide Number 13 Content • Introduction • Related work • Data collection and preparation • Model building • Results • Conclusion and Outlook
  • 14. ENTER 2017 Research Track Slide Number 14 Model building • Linear regression as statistical approach – Forward selection (data-driven model building approach) to avoid inclusion of irrelevant or highly correlated input attributes (caused by including attributes with different time-lags) • K-nearest neighbour as data mining approach – Estimates target attribute of a new data entry, based on k most similar data entries within training data set – Two nearest neighbours (k=2) and Euclidian distance measure to identify most similar past time windows – Z-score standardisation to normalize attribute values • Evaluation – Prediction performance evaluated by sliding window validation (moving a training and consecutive test window along data set)
  • 15. ENTER 2017 Research Track Slide Number 15 Content • Introduction • Related work • Data collection and preparation • Model building • Results • Conclusion and Outlook
  • 16. ENTER 2017 Research Track Slide Number 16 Results
  • 17. ENTER 2017 Research Track Slide Number 17 Results Autoregressive approaches reach good results in light of an average of 3,223 arrivals and a maximum of 14,199 arrivals per month
  • 18. ENTER 2017 Research Track Slide Number 18 Results Adding big data information sources as additional input significantly increases prediction performance (supports research proposition 1)
  • 19. ENTER 2017 Research Track Slide Number 19 Results MAE over all sending countries for the prediction method k-NN is reduced from 620 to 432, thus, by 30%
  • 20. ENTER 2017 Research Track Slide Number 20 Results Web search traffic (i.e. Google trends), price level of alternative destinations and transportation costs (i.e. jet fuel prices) show strongest influence on tourist arrivals
  • 21. ENTER 2017 Research Track Slide Number 21 Results Predicting tourist arrivals by linear regression and big data
  • 22. ENTER 2017 Research Track Slide Number 22 Results Data mining technique k- nearest neighbour (k-NN) outperforms statistical approach linear regression (research proposition 2)
  • 23. ENTER 2017 Research Track Slide Number 23 Content • Introduction • Related work • Data collection and preparation • Model building • Results • Conclusion and Outlook
  • 24. ENTER 2017 Research Track Slide Number 24 Conclusion and outlook • Big data as input to tourist arrival prediction – Big data information sources (e.g. web search traffic) show potential to increase accuracy to predict tourist arrivals compared to autoregressive approaches – Advantage of enabling a prediction of tourist arrivals under changing external conditions or singular events • Data mining techniques instead of statistical approaches – Data mining techniques (in this case k-NN) show potential to outperform statistical approaches, like linear regression – Can identify any kind of non-linear relationships; more robust against biased data and violations of input attribute preconditions • Outlook: Extend range of big data information sources – Web navigation data, customer traffic data on social media platforms (Twitter, Facebook, YouTube, etc.)
  • 25. ENTER 2017 Research Track Slide Number 25 Wolfram Höpkena, Dominic Ernestia, Matthias Fuchsb, Kai Kronenbergb, and Maria Lexhagenb a Business Informatics Group University of Applied Sciences Ravensburg-Weingarten, Germany {name.surname}@hs-weingarten.de b European Tourism Research Institute (ETOUR) Mid-Sweden University, Sweden {name.surname}@miun.se Big data as input for predicting tourist arrivals

Editor's Notes

  1. Duration: 20 min (without questions)
  2. 1 min
  3. 1,5 min
  4. 1,5 min
  5. 1 min
  6. 1 min
  7. 1 min
  8. 1,5 min
  9. 1 min
  10. 1,5 min
  11. 2 min
  12. 0,5 min the mean absolute error (MAE) for all prediction methods, typically used as the most expressive performance measures when comparing different prediction methods on the same data set
  13. 0,5 min the mean absolute error (MAE) for all prediction methods, typically used as the most expressive performance measures when comparing different prediction methods on the same data set The performance deviation between different sending countries is simply caused by different arrival scales, e.g. Norway having 4,875 arrivals on average compared to UK with 1,082 average arrivals per month
  14. 1 min the mean absolute error (MAE) for all prediction methods, typically used as the most expressive performance measures when comparing different prediction methods on the same data set The performance deviation between different sending countries is simply caused by different arrival scales, e.g. Norway having 4,875 arrivals on average compared to UK with 1,082 average arrivals per month
  15. 0,5 min the mean absolute error (MAE) for all prediction methods, typically used as the most expressive performance measures when comparing different prediction methods on the same data set The performance deviation between different sending countries is simply caused by different arrival scales, e.g. Norway having 4,875 arrivals on average compared to UK with 1,082 average arrivals per month
  16. 0,5 min the mean absolute error (MAE) for all prediction methods, typically used as the most expressive performance measures when comparing different prediction methods on the same data set The performance deviation between different sending countries is simply caused by different arrival scales, e.g. Norway having 4,875 arrivals on average compared to UK with 1,082 average arrivals per month
  17. 0,5 min Norway is caused by an extreme peak of arrivals in 2010, which is better handled by the linear regression
  18. 1,5 min
  19. Duration: 20 min (without questions)