SlideShare a Scribd company logo
Big Data, Big Tourism
Tourism and Mechanics
https://www.slideshare.net/sirmmo/big-data-big-tourism
What are «Big Data»?
• Excel gets stuck working a
dataset? => «medium» data
• Stata/R suffer working a
dataset? => «big» data
Where do we get the data?
• Tourists
• Have sensors
• Are sensors
• Are actors
• Attractions
• Are sensors
• Are actors
• Hotels, restaurants
• Are sensors
• Have sensors
Can we access the data?
• Tourists
• Have sensors
• Are sensors
• Are actors
• Attractions
• Are sensors
• Are actors
• Hotels, restaurants
• Are sensors
• Have sensors
Can we access the data?
• Tourists
• Have sensors
• Are sensors
• Are actors
• Attractions
• Are sensors
• Are actors
• Hotels, restaurants
• Are sensors
• Have sensors
Can we access the data?
• Tourists
• Have sensors
• Are sensors
• Are actors
• Attractions
• Are sensors
• Are actors
• Hotels, restaurants
• Are sensors
• Have sensors
Government
Can we access the data?
• Tourists
• Have sensors
• Are sensors
• Are actors
• Attractions
• Are sensors
• Are actors
• Hotels, restaurants
• Are sensors
• Have sensors
Private Sector
Can we access the data?
• Tourists
• Have sensors
• Are sensors
• Are actors
• Attractions
• Are sensors
• Are actors
• Hotels, restaurants
• Are sensors
• Have sensors
Private SectorGovernment
Open(able/ish)
Data
Almost
always
Ok so who owns that data?
• Government
• Bureaucracy-driven data
• Incoherent
• Inconsistent
• Irregular production
• Private Sector
• Deeply integrated with user
experience
• Very «behavioral», and as such
very «real»
• Very business-oriented metrics
Ok so who owns that data?
• Government
• Bureaucracy-driven data
• Incoherent
• Inconsistent
• Irregular production
• Private Sector
• Deeply integrated with user
experience
• Very «behavioral», and as such
very «real»
• Very business-oriented metrics
Ok so who owns that data?
• Government
• Bureaucracy-driven data
• Incoherent
• Inconsistent
• Irregular production
• Private Sector
• Deeply integrated with user
experience
• Very «behavioral», and as such
very «real»
• Very business-oriented metrics
Scraping
• Time consuming
• Power consuming
• Illegal (up to a certain point)
• Unavoidable (up to a certain
point)
Scraping
• It relies on the fact that (most)
web is based on HTML
• And HTML is text
• And JavaScript is text
• And CSS is text
• Everything can be read before
the render…
Scraping
• It relies on the fact that (most)
web is based on HTML
• And HTML is text
• And JavaScript is text
• And CSS is text
• Everything can be read before
the render…
• Or after the render
Tools
• Not easy for «complex» sites
• Some cases come up
• Some tools help
• Maybe knowledge of Xml Query
Language or CSS required
• Some tools are very advanced
• Selenium browser driver
• «headless» browsers
• Chrome
• https://chrome.google.com/webstore/detai
l/scraper/mbigbapnjcgaffohmbkdlecaccepn
gjd?hl=en
• https://chrome.google.com/webstore/detai
l/web-
scraper/jnhgnonknehpejjnehehllkliplmbmh
n?hl=en
• https://chrome.google.com/webstore/detai
l/advanced-web-
scraper/gpolcofcjjiooogejfbaamdgmgfehgff
• Firefox
• https://addons.mozilla.org/en-
US/firefox/addon/datascraper/
• Web
• https://www.import.io/
• https://scrapinghub.com/portia/
Cases and issues of scraping
• Booking.com
• Amazing website
• Easy navigation for the user
• Issues
• They know!!!
• The website gets a complete
structural overhaul every 6-9
months
• They tend to hate scrapers
• The webpage is empty at the
beginning
Cases and issues of scraping
• Booking.com
• Amazing website
• Easy navigation for the user
• Issues
• They know!!!
• The website gets a complete
structural overhaul every 6-9
months
• They tend to hate scrapers
• The webpage is empty at the
beginning
Cases and issues of scraping
• AirBnB
• Nice navigation
• Full overhaul every 3 months
• Issues
• The page really tracks what kind of
user is accessing
• The visible pages are 13 (only)
• They are randomly generated
every day for the major areas
Cases and issues of scraping
• Weather
• Many sources
• Many formats
• Issues
• Normalization of vocabulary
• Bad weather == Rain == Rainy ==
Cloud Icon == ???
• Normalization of ranges
• Normalization of numbers
• Normalization of periodicity
Apps
Questionnaire
to get user to
explicitly give
data
Information
driven
application to
track user
data
Gamification
and/or
information
platform to
elaborate
and give data
back
Explicit data
• Relies on the user’s knowing
actions
• Requires real willing acceptance
for sharing information
• Stops at politically correctness
• Implies (almost always)
anonimity
• Questionnaire
• In-place review
• In-place comment
• Bureaucracy
Behavioral data
• Almost always true
• Difficult to get
• Easily contextualizable
• Interactive
• Interconnected
• Application
• Platform
• Social Media integration
• Gamification
• Social Media involvement
Cool, so what can be done?
Getting Data
• Municipalities are setting up
open wireless networks.
• Users can be tracked.
• Services can be offered (and
instrumented)
• Museums can track users within
their premises
• Social Media interactions
Using Data
• Analysis of context of specific
behaviours
• Automated storytelling for city
visits
• Pricing methodologies
• Destination brand analysis
Big and Big-ish Data Tools
• The problem is computational
power
• Lots of work on AI
• Classification
• Generation
• Machine Learning
• Correlations
• DataWarehouses
• Mondrian -
http://community.pentaho.com/projects/
mondrian/
• Big Data DBs
• Cassandra - http://cassandra.apache.org/
• Hadoop - http://hadoop.apache.org/
• Big Data Search
• BigQuery -
https://cloud.google.com/bigquery/
• GraphQL - http://graphql.org/
• Big Data AI/ML
• TensorFlow -
https://www.tensorflow.org/
• ScikitPy - https://www.scipy.org/
A few open questions
• Impact of crowdfunding on tourism-bound projects
• Impact of meta-search-engines on pricing
• Impact (or lack thereof) of destination information websites on user
decisions
• How can the user be «vetted» in order to tailor the touristic
experience around her?
• Would such vetting process impact on customer return decisions?
One more thing: Watch out!!
Thanks! Questions?
@ingmmo
marco.montanari@gmail.com
http://ingmmo.com, https://medium.com/@ingmmo
sirmmo
http://it.linkedin.com/in/montanarim/
https://www.facebook.com/marco.montanari
marco.montanari
https://www.slideshare.net/sirmmo/big-data-big-tourism

More Related Content

Similar to Big data, big tourism

Data-Driven Development Era and Its Technologies
Data-Driven Development Era and Its TechnologiesData-Driven Development Era and Its Technologies
Data-Driven Development Era and Its Technologies
SATOSHI TAGOMORI
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
Leanne Hwee
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs Matter
Eric Kavanagh
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014ALTER WAY
 
The Semantic Web: The Why? What? How?
The Semantic Web: The Why? What? How?The Semantic Web: The Why? What? How?
The Semantic Web: The Why? What? How?
iLinkoln Meetup
 
Introduction to Big Data
Introduction to Big Data Introduction to Big Data
Introduction to Big Data
Srinath Perera
 
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Anselm Hook
 
Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?
Jen Stirrup
 
Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?
Jen Stirrup
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data Exploration
Inside Analysis
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
Doug Denton
 
How to crack Big Data and Data Science roles
How to crack Big Data and Data Science rolesHow to crack Big Data and Data Science roles
How to crack Big Data and Data Science roles
UpXAcademy
 
Computer-assisted reporting seminar
Computer-assisted reporting seminarComputer-assisted reporting seminar
Computer-assisted reporting seminar
Glen McGregor
 
datamining-lect1.pptx
datamining-lect1.pptxdatamining-lect1.pptx
datamining-lect1.pptx
GautamDematti1
 
chương 1 - Tổng quan về khai phá dữ liệu.pdf
chương 1 - Tổng quan về khai phá dữ liệu.pdfchương 1 - Tổng quan về khai phá dữ liệu.pdf
chương 1 - Tổng quan về khai phá dữ liệu.pdf
phongnguyen312110237
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
Dunn Solutions Group
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Open Analytics
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenChristopher Whitaker
 
5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven Organization5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven OrganizationVivastream
 
Visualising montioring and evaluation data
Visualising montioring and evaluation dataVisualising montioring and evaluation data
Visualising montioring and evaluation data
Rob Worthington
 

Similar to Big data, big tourism (20)

Data-Driven Development Era and Its Technologies
Data-Driven Development Era and Its TechnologiesData-Driven Development Era and Its Technologies
Data-Driven Development Era and Its Technologies
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs Matter
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
The Semantic Web: The Why? What? How?
The Semantic Web: The Why? What? How?The Semantic Web: The Why? What? How?
The Semantic Web: The Why? What? How?
 
Introduction to Big Data
Introduction to Big Data Introduction to Big Data
Introduction to Big Data
 
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
 
Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?
 
Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?Business Intelligence Barista: What DataViz Tool to Use, and When?
Business Intelligence Barista: What DataViz Tool to Use, and When?
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data Exploration
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
 
How to crack Big Data and Data Science roles
How to crack Big Data and Data Science rolesHow to crack Big Data and Data Science roles
How to crack Big Data and Data Science roles
 
Computer-assisted reporting seminar
Computer-assisted reporting seminarComputer-assisted reporting seminar
Computer-assisted reporting seminar
 
datamining-lect1.pptx
datamining-lect1.pptxdatamining-lect1.pptx
datamining-lect1.pptx
 
chương 1 - Tổng quan về khai phá dữ liệu.pdf
chương 1 - Tổng quan về khai phá dữ liệu.pdfchương 1 - Tổng quan về khai phá dữ liệu.pdf
chương 1 - Tổng quan về khai phá dữ liệu.pdf
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe Olsen
 
5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven Organization5 Essential Practices of the Data Driven Organization
5 Essential Practices of the Data Driven Organization
 
Visualising montioring and evaluation data
Visualising montioring and evaluation dataVisualising montioring and evaluation data
Visualising montioring and evaluation data
 

More from Marco Montanari

OpenStreetMap_LinuxDay2023.pptx
OpenStreetMap_LinuxDay2023.pptxOpenStreetMap_LinuxDay2023.pptx
OpenStreetMap_LinuxDay2023.pptx
Marco Montanari
 
Ohm wikimania 2021
Ohm wikimania 2021Ohm wikimania 2021
Ohm wikimania 2021
Marco Montanari
 
Ohm itwikicon tech - english
Ohm itwikicon tech - englishOhm itwikicon tech - english
Ohm itwikicon tech - english
Marco Montanari
 
ITWikiCon 2020 - OpenHistoryMap
ITWikiCon 2020 - OpenHistoryMapITWikiCon 2020 - OpenHistoryMap
ITWikiCon 2020 - OpenHistoryMap
Marco Montanari
 
ITWikiCon - Edutainment e Wikipedia
ITWikiCon - Edutainment e WikipediaITWikiCon - Edutainment e Wikipedia
ITWikiCon - Edutainment e Wikipedia
Marco Montanari
 
Storia dell'informatica
Storia dell'informaticaStoria dell'informatica
Storia dell'informatica
Marco Montanari
 
Bononia 1115
Bononia 1115Bononia 1115
Bononia 1115
Marco Montanari
 
ChContext
ChContextChContext
ChContext
Marco Montanari
 
MN-MAP Poster for Foss4G2018
MN-MAP Poster for Foss4G2018MN-MAP Poster for Foss4G2018
MN-MAP Poster for Foss4G2018
Marco Montanari
 
GEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGE
GEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGEGEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGE
GEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGE
Marco Montanari
 
OHM at FOSS4G17
OHM at FOSS4G17OHM at FOSS4G17
OHM at FOSS4G17
Marco Montanari
 
Mn map poster
Mn map posterMn map poster
Mn map poster
Marco Montanari
 
Saas rad with django, django rest framework
Saas rad with django, django rest frameworkSaas rad with django, django rest framework
Saas rad with django, django rest framework
Marco Montanari
 
poster mn-auth
poster mn-authposter mn-auth
poster mn-auth
Marco Montanari
 
poster holodocker
poster holodockerposter holodocker
poster holodocker
Marco Montanari
 
Intro datajournalism - 14-15/06/2017
Intro datajournalism - 14-15/06/2017Intro datajournalism - 14-15/06/2017
Intro datajournalism - 14-15/06/2017
Marco Montanari
 
OHM at Kainua17
OHM at Kainua17OHM at Kainua17
OHM at Kainua17
Marco Montanari
 
OHM Workshop
OHM WorkshopOHM Workshop
OHM Workshop
Marco Montanari
 
Open Data e Trasparenza come punto di contatto fra cittadinanza e politica
Open Data e Trasparenza come punto di contatto fra cittadinanza e politicaOpen Data e Trasparenza come punto di contatto fra cittadinanza e politica
Open Data e Trasparenza come punto di contatto fra cittadinanza e politica
Marco Montanari
 
Intervento 20160705
Intervento 20160705Intervento 20160705
Intervento 20160705
Marco Montanari
 

More from Marco Montanari (20)

OpenStreetMap_LinuxDay2023.pptx
OpenStreetMap_LinuxDay2023.pptxOpenStreetMap_LinuxDay2023.pptx
OpenStreetMap_LinuxDay2023.pptx
 
Ohm wikimania 2021
Ohm wikimania 2021Ohm wikimania 2021
Ohm wikimania 2021
 
Ohm itwikicon tech - english
Ohm itwikicon tech - englishOhm itwikicon tech - english
Ohm itwikicon tech - english
 
ITWikiCon 2020 - OpenHistoryMap
ITWikiCon 2020 - OpenHistoryMapITWikiCon 2020 - OpenHistoryMap
ITWikiCon 2020 - OpenHistoryMap
 
ITWikiCon - Edutainment e Wikipedia
ITWikiCon - Edutainment e WikipediaITWikiCon - Edutainment e Wikipedia
ITWikiCon - Edutainment e Wikipedia
 
Storia dell'informatica
Storia dell'informaticaStoria dell'informatica
Storia dell'informatica
 
Bononia 1115
Bononia 1115Bononia 1115
Bononia 1115
 
ChContext
ChContextChContext
ChContext
 
MN-MAP Poster for Foss4G2018
MN-MAP Poster for Foss4G2018MN-MAP Poster for Foss4G2018
MN-MAP Poster for Foss4G2018
 
GEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGE
GEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGEGEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGE
GEOCONTEXT AND CHCONTEXT GEOGRAPHIC INFORMATION IN CULTURAL HERITAGE
 
OHM at FOSS4G17
OHM at FOSS4G17OHM at FOSS4G17
OHM at FOSS4G17
 
Mn map poster
Mn map posterMn map poster
Mn map poster
 
Saas rad with django, django rest framework
Saas rad with django, django rest frameworkSaas rad with django, django rest framework
Saas rad with django, django rest framework
 
poster mn-auth
poster mn-authposter mn-auth
poster mn-auth
 
poster holodocker
poster holodockerposter holodocker
poster holodocker
 
Intro datajournalism - 14-15/06/2017
Intro datajournalism - 14-15/06/2017Intro datajournalism - 14-15/06/2017
Intro datajournalism - 14-15/06/2017
 
OHM at Kainua17
OHM at Kainua17OHM at Kainua17
OHM at Kainua17
 
OHM Workshop
OHM WorkshopOHM Workshop
OHM Workshop
 
Open Data e Trasparenza come punto di contatto fra cittadinanza e politica
Open Data e Trasparenza come punto di contatto fra cittadinanza e politicaOpen Data e Trasparenza come punto di contatto fra cittadinanza e politica
Open Data e Trasparenza come punto di contatto fra cittadinanza e politica
 
Intervento 20160705
Intervento 20160705Intervento 20160705
Intervento 20160705
 

Recently uploaded

Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
deeptiverma2406
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
gb193092
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 

Recently uploaded (20)

Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 

Big data, big tourism

  • 1. Big Data, Big Tourism Tourism and Mechanics https://www.slideshare.net/sirmmo/big-data-big-tourism
  • 2.
  • 3. What are «Big Data»? • Excel gets stuck working a dataset? => «medium» data • Stata/R suffer working a dataset? => «big» data
  • 4. Where do we get the data? • Tourists • Have sensors • Are sensors • Are actors • Attractions • Are sensors • Are actors • Hotels, restaurants • Are sensors • Have sensors
  • 5. Can we access the data? • Tourists • Have sensors • Are sensors • Are actors • Attractions • Are sensors • Are actors • Hotels, restaurants • Are sensors • Have sensors
  • 6. Can we access the data? • Tourists • Have sensors • Are sensors • Are actors • Attractions • Are sensors • Are actors • Hotels, restaurants • Are sensors • Have sensors
  • 7. Can we access the data? • Tourists • Have sensors • Are sensors • Are actors • Attractions • Are sensors • Are actors • Hotels, restaurants • Are sensors • Have sensors
  • 8. Government Can we access the data? • Tourists • Have sensors • Are sensors • Are actors • Attractions • Are sensors • Are actors • Hotels, restaurants • Are sensors • Have sensors Private Sector
  • 9. Can we access the data? • Tourists • Have sensors • Are sensors • Are actors • Attractions • Are sensors • Are actors • Hotels, restaurants • Are sensors • Have sensors Private SectorGovernment Open(able/ish) Data Almost always
  • 10. Ok so who owns that data? • Government • Bureaucracy-driven data • Incoherent • Inconsistent • Irregular production • Private Sector • Deeply integrated with user experience • Very «behavioral», and as such very «real» • Very business-oriented metrics
  • 11. Ok so who owns that data? • Government • Bureaucracy-driven data • Incoherent • Inconsistent • Irregular production • Private Sector • Deeply integrated with user experience • Very «behavioral», and as such very «real» • Very business-oriented metrics
  • 12. Ok so who owns that data? • Government • Bureaucracy-driven data • Incoherent • Inconsistent • Irregular production • Private Sector • Deeply integrated with user experience • Very «behavioral», and as such very «real» • Very business-oriented metrics
  • 13. Scraping • Time consuming • Power consuming • Illegal (up to a certain point) • Unavoidable (up to a certain point)
  • 14. Scraping • It relies on the fact that (most) web is based on HTML • And HTML is text • And JavaScript is text • And CSS is text • Everything can be read before the render…
  • 15. Scraping • It relies on the fact that (most) web is based on HTML • And HTML is text • And JavaScript is text • And CSS is text • Everything can be read before the render… • Or after the render
  • 16. Tools • Not easy for «complex» sites • Some cases come up • Some tools help • Maybe knowledge of Xml Query Language or CSS required • Some tools are very advanced • Selenium browser driver • «headless» browsers • Chrome • https://chrome.google.com/webstore/detai l/scraper/mbigbapnjcgaffohmbkdlecaccepn gjd?hl=en • https://chrome.google.com/webstore/detai l/web- scraper/jnhgnonknehpejjnehehllkliplmbmh n?hl=en • https://chrome.google.com/webstore/detai l/advanced-web- scraper/gpolcofcjjiooogejfbaamdgmgfehgff • Firefox • https://addons.mozilla.org/en- US/firefox/addon/datascraper/ • Web • https://www.import.io/ • https://scrapinghub.com/portia/
  • 17. Cases and issues of scraping • Booking.com • Amazing website • Easy navigation for the user • Issues • They know!!! • The website gets a complete structural overhaul every 6-9 months • They tend to hate scrapers • The webpage is empty at the beginning
  • 18. Cases and issues of scraping • Booking.com • Amazing website • Easy navigation for the user • Issues • They know!!! • The website gets a complete structural overhaul every 6-9 months • They tend to hate scrapers • The webpage is empty at the beginning
  • 19. Cases and issues of scraping • AirBnB • Nice navigation • Full overhaul every 3 months • Issues • The page really tracks what kind of user is accessing • The visible pages are 13 (only) • They are randomly generated every day for the major areas
  • 20. Cases and issues of scraping • Weather • Many sources • Many formats • Issues • Normalization of vocabulary • Bad weather == Rain == Rainy == Cloud Icon == ??? • Normalization of ranges • Normalization of numbers • Normalization of periodicity
  • 21. Apps Questionnaire to get user to explicitly give data Information driven application to track user data Gamification and/or information platform to elaborate and give data back
  • 22. Explicit data • Relies on the user’s knowing actions • Requires real willing acceptance for sharing information • Stops at politically correctness • Implies (almost always) anonimity • Questionnaire • In-place review • In-place comment • Bureaucracy
  • 23.
  • 24.
  • 25. Behavioral data • Almost always true • Difficult to get • Easily contextualizable • Interactive • Interconnected • Application • Platform • Social Media integration • Gamification • Social Media involvement
  • 26. Cool, so what can be done? Getting Data • Municipalities are setting up open wireless networks. • Users can be tracked. • Services can be offered (and instrumented) • Museums can track users within their premises • Social Media interactions Using Data • Analysis of context of specific behaviours • Automated storytelling for city visits • Pricing methodologies • Destination brand analysis
  • 27. Big and Big-ish Data Tools • The problem is computational power • Lots of work on AI • Classification • Generation • Machine Learning • Correlations • DataWarehouses • Mondrian - http://community.pentaho.com/projects/ mondrian/ • Big Data DBs • Cassandra - http://cassandra.apache.org/ • Hadoop - http://hadoop.apache.org/ • Big Data Search • BigQuery - https://cloud.google.com/bigquery/ • GraphQL - http://graphql.org/ • Big Data AI/ML • TensorFlow - https://www.tensorflow.org/ • ScikitPy - https://www.scipy.org/
  • 28. A few open questions • Impact of crowdfunding on tourism-bound projects • Impact of meta-search-engines on pricing • Impact (or lack thereof) of destination information websites on user decisions • How can the user be «vetted» in order to tailor the touristic experience around her? • Would such vetting process impact on customer return decisions?
  • 29. One more thing: Watch out!!