SlideShare a Scribd company logo
1 of 13
Download to read offline
1
How Big Data
revolutionizes decision
support in tourism
Prof. Dr. Wolfram Höpken
Hochschule Ravensburg-Weingarten
wolfram.hoepken@hs-weingarten.de
25th January 2018
2 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Facebook can predict whether you are about
to enter into a new relationship
www.break.com
2
3 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Credit card companies can
predict divorce with 95%
accuracy, two years out,
based on your purchasing
decisions
www.bearron.com
4 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Predictive Analytics in Tourism
Revenue management
• Explanation of booking and
cancellation behavior
• Prediction of tourism demand
• Prediction of flight prices
(DINAMO: Yield management system
developed by American Airlines 1988)
Product optimization & sales
• Explanation of tourists’
consumption behavior
• Optimization of product bundles /
market basket analysis
• Cross selling
Customer relationship
management
• Customer segmentation
• Adaptive marketing
3
5 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Integration of big data sources
Data warehouseData warehouse
ReportingReporting
OLAPOLAP
Data
mining
Data
mining
CRSCRS
ERPERP
CRMCRM
Online
platforms
Online
platforms
Web contentWeb content
• User generated content (customer feedback / opinions)
• Data on markets and competitors (e.g. changes in demand structure,
price changes)
Operative systems (OLTP) Dispositive systems (OLAP)
External data sources
6 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Integration of big data sources
Data warehouseData warehouse
ReportingReporting
OLAPOLAP
Data
mining
Data
mining
CRSCRS
ERPERP
CRMCRM
Online
platforms
Online
platforms
Web contentWeb content
• Economic data (e.g. GDP, employment data in sending
countries)
• Weather data (historic weather data and weather forecasts)
Environment
data
Environment
data
Operative systems (OLTP) Dispositive systems (OLAP)
External data sources
4
7 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Integration of big data sources
Data warehouseData warehouse
ReportingReporting
OLAPOLAP
Data
mining
Data
mining
CRSCRS
ERPERP
CRMCRM
Online
platforms
Online
platforms
Web ContentWeb Content
• Interactions with local infrastructure (light, air conditioning,
minibar, stereo equipment, TV, telephone, etc. e.g. in hotel room)
Environment
data
Environment
data
Local
infrastructure
Local
infrastructure
Operative systems (OLTP) Dispositive systems (OLAP)
External data sources Interactions with environment
8 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Integration of big data sources
Data warehouseData warehouse
ReportingReporting
OLAPOLAP
Data
mining
Data
mining
CRSCRS
ERPERP
CRMCRM
Online
platforms
Online
platforms
Web contentWeb content
• Location tracking (GPS-based)
• Reaching POIs (QR code/RFID/NFC-based)
Environment
data
Environment
data
Local
infrastructure
Local
infrastructure
Movement
profiles
Movement
profiles
Operative systems (OLTP) Dispositive systems (OLAP)
External data sources Interactions with environment
5
9 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Integration of big data sources
Data warehouseData warehouse
Operative systems (OLTP)
ReportingReporting
OLAPOLAP
Data
mining
Data
mining
Dispositive systems (OLAP)
CRSCRS
ERPERP
CRMCRM
Online
platforms
Online
platforms
External data sources
Web contentWeb content
Environment
data
Environment
data
Local
infrastructure
Local
infrastructure
Interactions with environment
Movement
profiles
Movement
profiles
Typical characteristics of big data sources
• Often unstructured (web content)
• Very large data volumes
• External
10 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Revenue Management
Prediction of demand based on google search volume
Tourist arrivals
arrivals
googlesearchvolume
6
11 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Revenue Management
Prediction of demand based on google search volume
Prediction of tourist arrivals at different forecasting horizons
Including
Google Trends
data
significantly
increases
prediction
performance
12 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Revenue Management
Identification of relevant search terms and most significant time lag
3 to 2 month before arrival -> search for activities in Sweden
One month before arrival -> more precise queries, searching specifically for Are
7
13 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Revenue Management
Prediction of demand based on big data sources
Tourist arrivals and google online traffic
Tourist arrivals and jet fuel price
Predicting tourist arrivals based on
past arrivals and big data
Used data sources: google online traffic, jet fuel
price, GDP of sending countries, price level of
destination & alternative destinations
14 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Revenue Management
Prediction of demand based on big data sources
 Including big data sources significantly increases prediction performance
MAE (mean average
error) over all sending
countries is reduced
from 620 to 432,
thus, by 30%
8
15 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Optimization of real product
Analysis of customer feedback (sentiment analysis)
 Extraction of customer feedback from review platforms
 Detection of topic
 What are customers talking about
(e.g. hotel room, reception, staff, …)
 Detection of subjectivity
 Is the statement objective or subjective
 Detection of sentiment
 Is the statement positive or negative
Method Accuracy
Topic detection
SVM (with POS tagging) 72.36%
Naïve Bayes
(with POS tagging)
49.72%
k-NN (with k = 8) 57.08%
Dictionary-based 71.28%
Subjectivity detection
SVM 65.50%
Naïve Bayes 60.67%
k-NN (with k = 5) 55.50%
Dictionary-based 82.63%
Sentiment detection
SVM (with bigrams) 76.80%
Naïve Bayes (with trigrams) 69.80%
k-NN (with k = 8) 69.60%
Dictionary-based 71.28%
16 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Optimization of real product
Analysis of customer feedback (sentiment analysis)
Detailed analysis of customer feedback
(positive/negative statements)
9
17 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Optimization of real product
Analysis customer feedback (sentiment analysis)
Benchmarking along product topics
18 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Optimization of real product
Dynamic topic detection
 Identification of (fine-grained) topics mentioned in customer feedback
(without the need to predefine topics)
Approach Accuracy
Identification of frequent words
(nouns only)
82.86%
Keyword Clustering
(nouns only, sentences-based,
k=80)
88.45%
LSI - Latent Semantic Indexing
(nouns only, sentences-based,
k=80)
85.46%
NER – Named Entity Recognition
(Naïve Bayes, 2 words +/- as
context)
75.17%
Fine-grained topics
with keywords
Predefined high-level
topics
restaurant
service
staff
center
city
halmstad
station
train
walk
hotel
parking
dinner
food
food &
beverage
staff location
breakfast
place
beach
hotel
location
10
19 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Optimization of real product
Topic detection by picture classification
Detecting topics like lunch, dinner,
reservation, service, atmosphere,
etc.
Deep learning by neural networks
20 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Marketing & sales
Movement patterns extracted from flickr
Clustering of
flickr foto uploads
(by DBSCAN)
146,958 photo
uploads from 11,289
users for the period
2005-2015
11
21 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Marketing & sales
Movement patterns extracted from flickr
Association rules
Rule Sup % Conf % Lift
1, 3 → 8 1 53.3 2.97
1
3
8
22 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Marketing & sales
Movement patterns extracted from flickr
More fine-grained
clustering for city
center
(by k-means)
12
23 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Marketing & sales
Movement patterns extracted from flickr
Association rules
Rule Sup % Conf % Lift
1,2 → 3 1.6 100 7.86
1
2
3
24 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Marketing & sales
Movement patterns extracted from flickr
Sequential patterns
Frequent Sequence Sup %
<Max-Joseph-Platz>
<Odeonsplatz>
1.6
Frequent Sequence Sup %
<Frauenkirche>
<Hofbräuhaus>
1.3
Frequent Sequence Sup %
<Frauenkirche>
<Heilig-Geist-
Kirche>
1.3
13
25 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Marketing & sales
Movement patterns extracted from flickr
# Frequent sequence Supp.
1 〈0, 142, 10〉 0.010
2 〈0, 64, 10〉 0.010
3 〈152, 0, 64〉 0.010
4 〈206, 190, 178〉 0.010
5 〈0, 142, 64, 10〉 0.008
6 〈0, 10, 126, 42〉 0.006
7 〈0, 142, 10, 126〉 0.006
8 〈118, 0, 64, 187〉 0.006
9 〈152, 0, 64, 75〉 0.006
10 〈152, 142, 64, 10〉 0.008
11 〈190, 178, 171, 169〉 0.006
12 〈75, 16, 178, 190〉 0.006
13 〈16, 0, 142, 64, 10〉 0.006
14 〈178, 0, 142, 64, 10〉 0.006
Frequent sequences based on k-Means clustering
Combining association rules and frequent sequences
26 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism
Conclusion
 Current trends
 Tourists leave traces during nearly all touristic activities
 Booking/consumption behavior, information need, preferences,
movement patterns, feedback, etc.
Big Data
 Today all this information can technically be gathered and
analysed
Improvement of decision support
Adaptation/optimization of operative processes and
personalization of customer interactions (Operational BI)
 Challenge: Evaluation of feasability
 Do the available data sources deliver the required knowledge and
can the intended decision support or customer benefit be realized?

More Related Content

Similar to SEARCH ENGINE TRAFFIC AS INPUT FOR PREDICTING TOURIST ARRIVALS

Cutting through the channel clutter: How hotels can get back in control of th...
Cutting through the channel clutter: How hotels can get back in control of th...Cutting through the channel clutter: How hotels can get back in control of th...
Cutting through the channel clutter: How hotels can get back in control of th...SiteMinder
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)Arvind Sathi
 
Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...
Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...
Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...Tinuiti
 
Ppc keywords discovery search labs sao paulo 2010 pavel dolezal
Ppc keywords discovery   search labs sao paulo 2010 pavel dolezalPpc keywords discovery   search labs sao paulo 2010 pavel dolezal
Ppc keywords discovery search labs sao paulo 2010 pavel dolezalPavel Dolezal
 
Moving Beyond Batch: Transactional Databases for Real-time Data
Moving Beyond Batch: Transactional Databases for Real-time DataMoving Beyond Batch: Transactional Databases for Real-time Data
Moving Beyond Batch: Transactional Databases for Real-time DataVoltDB
 
DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.
DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.
DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.Kajal Mukhopadhyay, PhD
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectHow to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectPAPIs.io
 
Impact of big data on analytics
Impact of big data on analyticsImpact of big data on analytics
Impact of big data on analyticsCapgemini
 
Retail lessons learned from the first data driven business and future direct...
Retail  lessons learned from the first data driven business and future direct...Retail  lessons learned from the first data driven business and future direct...
Retail lessons learned from the first data driven business and future direct...TUSHAR GARG
 
Introduction to data science and candidate data science projects
Introduction to data science and candidate data science projectsIntroduction to data science and candidate data science projects
Introduction to data science and candidate data science projectsJay (Jianqiang) Wang
 
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab WebinarBluespire Marketing
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)Guido Schmutz
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?Fabricio Quintanilla
 
Big Data: Operational Excellence
Big Data: Operational ExcellenceBig Data: Operational Excellence
Big Data: Operational ExcellenceMatthias Vallaey
 
BBDO Connect Big Data
BBDO Connect Big DataBBDO Connect Big Data
BBDO Connect Big DataBBDO Belgium
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsDatawatchCorporation
 
Big Data sharing experience - Jacques Wieczorek
Big Data sharing experience - Jacques WieczorekBig Data sharing experience - Jacques Wieczorek
Big Data sharing experience - Jacques WieczorekNRB
 

Similar to SEARCH ENGINE TRAFFIC AS INPUT FOR PREDICTING TOURIST ARRIVALS (20)

How Big Data revolutionizes decision support in tourism
How Big Data revolutionizes decision support in tourismHow Big Data revolutionizes decision support in tourism
How Big Data revolutionizes decision support in tourism
 
Randy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUE
Randy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUERandy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUE
Randy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUE
 
Cutting through the channel clutter: How hotels can get back in control of th...
Cutting through the channel clutter: How hotels can get back in control of th...Cutting through the channel clutter: How hotels can get back in control of th...
Cutting through the channel clutter: How hotels can get back in control of th...
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
 
Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...
Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...
Introducing Mobius: Technology that Closes the Loop on Your Marketing Perform...
 
Ppc keywords discovery search labs sao paulo 2010 pavel dolezal
Ppc keywords discovery   search labs sao paulo 2010 pavel dolezalPpc keywords discovery   search labs sao paulo 2010 pavel dolezal
Ppc keywords discovery search labs sao paulo 2010 pavel dolezal
 
Moving Beyond Batch: Transactional Databases for Real-time Data
Moving Beyond Batch: Transactional Databases for Real-time DataMoving Beyond Batch: Transactional Databases for Real-time Data
Moving Beyond Batch: Transactional Databases for Real-time Data
 
DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.
DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.
DMA MAC Presentation: Kajal Mukhopadhyay, Ph.D.
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectHow to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
 
Impact of big data on analytics
Impact of big data on analyticsImpact of big data on analytics
Impact of big data on analytics
 
Retail lessons learned from the first data driven business and future direct...
Retail  lessons learned from the first data driven business and future direct...Retail  lessons learned from the first data driven business and future direct...
Retail lessons learned from the first data driven business and future direct...
 
Introduction to data science and candidate data science projects
Introduction to data science and candidate data science projectsIntroduction to data science and candidate data science projects
Introduction to data science and candidate data science projects
 
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?
 
uae views on big data
  uae views on  big data  uae views on  big data
uae views on big data
 
Big Data: Operational Excellence
Big Data: Operational ExcellenceBig Data: Operational Excellence
Big Data: Operational Excellence
 
BBDO Connect Big Data
BBDO Connect Big DataBBDO Connect Big Data
BBDO Connect Big Data
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of Things
 
Big Data sharing experience - Jacques Wieczorek
Big Data sharing experience - Jacques WieczorekBig Data sharing experience - Jacques Wieczorek
Big Data sharing experience - Jacques Wieczorek
 

Recently uploaded

Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfdanishmna97
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data SciencePaolo Missier
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 

Recently uploaded (20)

Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

SEARCH ENGINE TRAFFIC AS INPUT FOR PREDICTING TOURIST ARRIVALS

  • 1. 1 How Big Data revolutionizes decision support in tourism Prof. Dr. Wolfram Höpken Hochschule Ravensburg-Weingarten wolfram.hoepken@hs-weingarten.de 25th January 2018 2 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Facebook can predict whether you are about to enter into a new relationship www.break.com
  • 2. 2 3 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Credit card companies can predict divorce with 95% accuracy, two years out, based on your purchasing decisions www.bearron.com 4 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Predictive Analytics in Tourism Revenue management • Explanation of booking and cancellation behavior • Prediction of tourism demand • Prediction of flight prices (DINAMO: Yield management system developed by American Airlines 1988) Product optimization & sales • Explanation of tourists’ consumption behavior • Optimization of product bundles / market basket analysis • Cross selling Customer relationship management • Customer segmentation • Adaptive marketing
  • 3. 3 5 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Integration of big data sources Data warehouseData warehouse ReportingReporting OLAPOLAP Data mining Data mining CRSCRS ERPERP CRMCRM Online platforms Online platforms Web contentWeb content • User generated content (customer feedback / opinions) • Data on markets and competitors (e.g. changes in demand structure, price changes) Operative systems (OLTP) Dispositive systems (OLAP) External data sources 6 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Integration of big data sources Data warehouseData warehouse ReportingReporting OLAPOLAP Data mining Data mining CRSCRS ERPERP CRMCRM Online platforms Online platforms Web contentWeb content • Economic data (e.g. GDP, employment data in sending countries) • Weather data (historic weather data and weather forecasts) Environment data Environment data Operative systems (OLTP) Dispositive systems (OLAP) External data sources
  • 4. 4 7 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Integration of big data sources Data warehouseData warehouse ReportingReporting OLAPOLAP Data mining Data mining CRSCRS ERPERP CRMCRM Online platforms Online platforms Web ContentWeb Content • Interactions with local infrastructure (light, air conditioning, minibar, stereo equipment, TV, telephone, etc. e.g. in hotel room) Environment data Environment data Local infrastructure Local infrastructure Operative systems (OLTP) Dispositive systems (OLAP) External data sources Interactions with environment 8 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Integration of big data sources Data warehouseData warehouse ReportingReporting OLAPOLAP Data mining Data mining CRSCRS ERPERP CRMCRM Online platforms Online platforms Web contentWeb content • Location tracking (GPS-based) • Reaching POIs (QR code/RFID/NFC-based) Environment data Environment data Local infrastructure Local infrastructure Movement profiles Movement profiles Operative systems (OLTP) Dispositive systems (OLAP) External data sources Interactions with environment
  • 5. 5 9 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Integration of big data sources Data warehouseData warehouse Operative systems (OLTP) ReportingReporting OLAPOLAP Data mining Data mining Dispositive systems (OLAP) CRSCRS ERPERP CRMCRM Online platforms Online platforms External data sources Web contentWeb content Environment data Environment data Local infrastructure Local infrastructure Interactions with environment Movement profiles Movement profiles Typical characteristics of big data sources • Often unstructured (web content) • Very large data volumes • External 10 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Revenue Management Prediction of demand based on google search volume Tourist arrivals arrivals googlesearchvolume
  • 6. 6 11 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Revenue Management Prediction of demand based on google search volume Prediction of tourist arrivals at different forecasting horizons Including Google Trends data significantly increases prediction performance 12 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Revenue Management Identification of relevant search terms and most significant time lag 3 to 2 month before arrival -> search for activities in Sweden One month before arrival -> more precise queries, searching specifically for Are
  • 7. 7 13 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Revenue Management Prediction of demand based on big data sources Tourist arrivals and google online traffic Tourist arrivals and jet fuel price Predicting tourist arrivals based on past arrivals and big data Used data sources: google online traffic, jet fuel price, GDP of sending countries, price level of destination & alternative destinations 14 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Revenue Management Prediction of demand based on big data sources  Including big data sources significantly increases prediction performance MAE (mean average error) over all sending countries is reduced from 620 to 432, thus, by 30%
  • 8. 8 15 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Optimization of real product Analysis of customer feedback (sentiment analysis)  Extraction of customer feedback from review platforms  Detection of topic  What are customers talking about (e.g. hotel room, reception, staff, …)  Detection of subjectivity  Is the statement objective or subjective  Detection of sentiment  Is the statement positive or negative Method Accuracy Topic detection SVM (with POS tagging) 72.36% Naïve Bayes (with POS tagging) 49.72% k-NN (with k = 8) 57.08% Dictionary-based 71.28% Subjectivity detection SVM 65.50% Naïve Bayes 60.67% k-NN (with k = 5) 55.50% Dictionary-based 82.63% Sentiment detection SVM (with bigrams) 76.80% Naïve Bayes (with trigrams) 69.80% k-NN (with k = 8) 69.60% Dictionary-based 71.28% 16 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Optimization of real product Analysis of customer feedback (sentiment analysis) Detailed analysis of customer feedback (positive/negative statements)
  • 9. 9 17 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Optimization of real product Analysis customer feedback (sentiment analysis) Benchmarking along product topics 18 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Optimization of real product Dynamic topic detection  Identification of (fine-grained) topics mentioned in customer feedback (without the need to predefine topics) Approach Accuracy Identification of frequent words (nouns only) 82.86% Keyword Clustering (nouns only, sentences-based, k=80) 88.45% LSI - Latent Semantic Indexing (nouns only, sentences-based, k=80) 85.46% NER – Named Entity Recognition (Naïve Bayes, 2 words +/- as context) 75.17% Fine-grained topics with keywords Predefined high-level topics restaurant service staff center city halmstad station train walk hotel parking dinner food food & beverage staff location breakfast place beach hotel location
  • 10. 10 19 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Optimization of real product Topic detection by picture classification Detecting topics like lunch, dinner, reservation, service, atmosphere, etc. Deep learning by neural networks 20 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Marketing & sales Movement patterns extracted from flickr Clustering of flickr foto uploads (by DBSCAN) 146,958 photo uploads from 11,289 users for the period 2005-2015
  • 11. 11 21 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Marketing & sales Movement patterns extracted from flickr Association rules Rule Sup % Conf % Lift 1, 3 → 8 1 53.3 2.97 1 3 8 22 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Marketing & sales Movement patterns extracted from flickr More fine-grained clustering for city center (by k-means)
  • 12. 12 23 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Marketing & sales Movement patterns extracted from flickr Association rules Rule Sup % Conf % Lift 1,2 → 3 1.6 100 7.86 1 2 3 24 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Marketing & sales Movement patterns extracted from flickr Sequential patterns Frequent Sequence Sup % <Max-Joseph-Platz> <Odeonsplatz> 1.6 Frequent Sequence Sup % <Frauenkirche> <Hofbräuhaus> 1.3 Frequent Sequence Sup % <Frauenkirche> <Heilig-Geist- Kirche> 1.3
  • 13. 13 25 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Marketing & sales Movement patterns extracted from flickr # Frequent sequence Supp. 1 〈0, 142, 10〉 0.010 2 〈0, 64, 10〉 0.010 3 〈152, 0, 64〉 0.010 4 〈206, 190, 178〉 0.010 5 〈0, 142, 64, 10〉 0.008 6 〈0, 10, 126, 42〉 0.006 7 〈0, 142, 10, 126〉 0.006 8 〈118, 0, 64, 187〉 0.006 9 〈152, 0, 64, 75〉 0.006 10 〈152, 142, 64, 10〉 0.008 11 〈190, 178, 171, 169〉 0.006 12 〈75, 16, 178, 190〉 0.006 13 〈16, 0, 142, 64, 10〉 0.006 14 〈178, 0, 142, 64, 10〉 0.006 Frequent sequences based on k-Means clustering Combining association rules and frequent sequences 26 Prof. Dr. Wolfram HöpkenHow Big Data revolutionizes decision support in tourism Conclusion  Current trends  Tourists leave traces during nearly all touristic activities  Booking/consumption behavior, information need, preferences, movement patterns, feedback, etc. Big Data  Today all this information can technically be gathered and analysed Improvement of decision support Adaptation/optimization of operative processes and personalization of customer interactions (Operational BI)  Challenge: Evaluation of feasability  Do the available data sources deliver the required knowledge and can the intended decision support or customer benefit be realized?