SlideShare a Scribd company logo
1 of 31
Travel route recommendation
using geotagged photos
Takeshi Kurashima, Tomoharu Iwata,
Go Irie, Ko Fujimura
The system's user interface
Block diagram of the system
User location
Recommended
landmark sequence
of given travel time

User mode of
transport
User's free time
and allowed
margin

Recommendation system

Recommended
landmark sequence
of given travel time

...

User requested
number of
sequences

Recommended
landmark sequence
of given travel time

Flickr photos
User's photos
Necessary components
●

●

Identifying landmarks in the area (no given list) and
naming them
Estimation of travel time between landmarks using
given transportation method

●

Estimation of time spent visiting each landmark

●

Recommending landmark sequences for the user
Previous Work and Innovation
●

Previous Work:
●

●
●

●

●

Crandall et al - extract landmarks at various granularity levels from
Flickr photos using mean-shift, and name them
Popescu et al - popular trips within a city from photo data-sets
Choudhury et al - constructing representative travel routes linking
popular landmarks within a city using popularity of landmarks, stay
times and transit times
Popescu et al - deducing the typical visit duration of a landmark

Main innovation here:
●

●

●

Personalized recommendations based on user's location history
and implicit interests
Estimation of traveling times between landmarks using different
modes of transport
Building a complete recommender system implementing the ideas
above
Flickr
●

●

Many digital cameras and phones add a geo-location tag to images
automatically.
Flickr houses at least 221,883,830 Geo-tagged time-stamped photos
from over 51 million users. http://www.flickr.com/map
●
●

Time-stamps will be used for travel time estimation

●

Textual tags are used to name the extracted landmark

●

●

Geo-tags will be used for landmark extraction and recommendation

NOT using the actual photos at all, just the meta-data (fast)

The Flickr API allows searching for public images taken in a given geobox or geo-circle for non-commercial use.
http://www.flickr.com/services/api/flickr.photos.search.html
Assumptions
●

Taking a picture of a place and uploading it to
Flickr constitutes a recommendation.
(Not many “This museum was boring” photos)

●

Geo-locations of camera and of photographed
object are equivalent
(The lookout point is recommended, not the view)

●

NOT assuming absolute time stamps of photos
are correct, since many camera clocks aren't
set.
Image time-deltas are used.
From photos to landmarks
●

Clustering points in a two-dimensional space
Landmark Extraction Assumptions
●

●

The probability of taking a photograph of a
landmark is distributed normally (
) as a
function of the distance (>0) from the landmark
Each photo is of one landmark.
(A photo of a close object against the
background of a distant one is a photo of the
closer object)
The Mean-Shift procedure:
●

Estimates the local maximum of the probability distribution of
each cluster of photos – the location of a landmark

●

Its only parameter is the bandwidth ω

●

Iteratively compute for each photo, until it converges:
From photos to landmarks
●

●

Substitute the geo-location of each photo with the
landmark it captures.
Group successive user photos of the same
landmark as one photo.
●

●

●

(Taking many pictures of a place isn't considered a
stronger recommendation)

The time-stamp of grouped photos is the average
between the time-stamps of the first and last
successive photos of the landmark.
The textual representation of each discovered
landmark is the most common tag of all the
photos of the landmark
Photographer behavior model
●

We want to estimate P( lt | <lt−1lt−2...l1>, hu ), the
probability that:
–

user u

–

with location history hu

–

at landmark lt−1 at time t−1,
lt−2 at time t−2, etc.

–
●

visits lt at time t

We assume the photographer's decision on the next
landmark to visit is a function of:
–

The photographer's current location (sequence)

–

The photographer's topics of interest
Location-based Model
●

Using a Markov model:
–

For simplicity, a first-order Markov model is used:

–

Maximum likelihood estimation:
Topic Model (PLSA)
●

Each user is a distribution over topics Z

●

Each topic is a distribution over the landmarks
User

●

Landmark
distributions

Using the law of total probability:
P(lt) =

●

Topic
distribution

P(z)

Assuming P(hu) and P(lt) are independently
conditioned on p(z) we get
P(lt|z,hu)P(z|hu) =
Expectation Maximization (EM)
●

Computes P(lt|z), P(z|hu) for the topic formula
iteratively until convergence using :

●

E step:

●

M step:
visible
Markov-Topic Model
●

Assuming P(hu) and P(lt-1) are independently
conditioned on P(lt) we get, after derivation:

Topic
Markov

“Normalizing Factor” P(lt-1|hu)
Generating travel routes
●

Naive method:
–

–
●

Compute the probability of all possible routes of
given time based on user's location and history
Choosing the most probable ones

A best-first-search is used on the probability
tree:
P(l1|l0,hu)
l1

P(l2|<l1l0>,hu)

l0

P(l2|l0,hu)
l2

P(l3|l0,hu)
l3
Travel time estimation
●

The time-delta between consecutive landmarks
in a sequence represents travel time between
them, using a specific mode of transport
–

(and sometimes includes some of the visit times of
both locations)
K-means
●

K-means is used on each two landmarks.
–

●

Identifies K typical travel times between them using
different transportation methods

K=3 was chosen
–
–

●

Three peaks visible here:
Google Maps gives estimates for walking, using
public transportation and using a car

Walking is assumed to be slowest, followed by
public transport, then private car
Experiments
●

696,394 photographs

●

71,718 users

●

Photos taken within 20 km from the center of:
–

Washington D.C., New York City, Philadelphia and
Boston on the East Coast

–

Los Angeles, San Francisco and Las Vegas on the
West Coast
Choosing the number of topics
●

Rating by precision of prediction of last landmark of
each sequence, over 5-fold cross-validation
Results
Last-step prediction accuracy
Sequence under time-constraint
prediction accuracy
Comparison of estimation
of travel time against Google Maps

●

Does this reflect on the system or on Google
Maps?
Routes per time period
Routes per transportation mode
Routes chosen by topic
Routes suggested by the
Markov model alone:

Routes suggested by the
Markov-Topic model:
Future Work
●

●

●

Using photographer's social network profile and
friends list
Consideration of opening hours, congestion
and fee
Evaluation in the field
Take-away points
●
●

●

●

●

Creatively looking for data
Building a complete system is a teaching
experience.
To build a system, it's frequently necessary to
use a variety of (AI) methods – it's good to have
a diverse mental toolbox, or a diverse team.
Testing is important - quantitative experiments
on a large-scale dataset.
Statistically significantly better than the
competitors.
Thanks!
●

Questions?

More Related Content

Similar to Travel route reconmmendations using geotagged photos

Implementation and evaluation of Space Time Alarm Clock
Implementation and evaluation of Space Time Alarm ClockImplementation and evaluation of Space Time Alarm Clock
Implementation and evaluation of Space Time Alarm ClockAdrian C. Prelipcean
 
Prediction of taxi rides ETA
Prediction of taxi rides ETAPrediction of taxi rides ETA
Prediction of taxi rides ETADaniel Marcous
 
Servitization Federica Santuccio
Servitization Federica SantuccioServitization Federica Santuccio
Servitization Federica SantuccioOrkestra
 
Network analysis for shortest optimum path
Network analysis for shortest optimum pathNetwork analysis for shortest optimum path
Network analysis for shortest optimum pathSourabh Jain
 
Big data Analytics for Tourism Destination management by Professor G MIchael ...
Big data Analytics for Tourism Destination management by Professor G MIchael ...Big data Analytics for Tourism Destination management by Professor G MIchael ...
Big data Analytics for Tourism Destination management by Professor G MIchael ...Leisure Solutions®
 
(Slides) A Personal Navigation System with a Schedule Planning Facility Based...
(Slides) A Personal Navigation System with a Schedule Planning Facility Based...(Slides) A Personal Navigation System with a Schedule Planning Facility Based...
(Slides) A Personal Navigation System with a Schedule Planning Facility Based...Naoki Shibata
 
01 example of literature presentation
01 example of literature presentation01 example of literature presentation
01 example of literature presentationJason Yang
 
Learning And Inferring Transportation Routines
Learning And Inferring Transportation RoutinesLearning And Inferring Transportation Routines
Learning And Inferring Transportation Routinesmstjsw
 
Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...jins0618
 
Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...
Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...
Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...Vasileios Komianos
 
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...Dongmin Lee
 
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...Dr. Cornelius Ludmann
 
Research on the characteristics and evaluation of nightscape along the LRT line
Research on the characteristics and evaluation of nightscape along the LRT lineResearch on the characteristics and evaluation of nightscape along the LRT line
Research on the characteristics and evaluation of nightscape along the LRT lineIJERA Editor
 
Sensing City Potential through Social Data @ ICMU2014 Panel
Sensing City Potential through Social Data @ ICMU2014 PanelSensing City Potential through Social Data @ ICMU2014 Panel
Sensing City Potential through Social Data @ ICMU2014 PanelYutaka Arakawa
 
Driving Behavior for ADAS and Autonomous Driving VIII
Driving Behavior for ADAS and Autonomous Driving VIIIDriving Behavior for ADAS and Autonomous Driving VIII
Driving Behavior for ADAS and Autonomous Driving VIIIYu Huang
 
Gunjan insight student conference v2
Gunjan insight student conference v2Gunjan insight student conference v2
Gunjan insight student conference v2Gunjan Kumar
 

Similar to Travel route reconmmendations using geotagged photos (20)

Implementation and evaluation of Space Time Alarm Clock
Implementation and evaluation of Space Time Alarm ClockImplementation and evaluation of Space Time Alarm Clock
Implementation and evaluation of Space Time Alarm Clock
 
South Tyrol Suggests - STS
South Tyrol Suggests - STSSouth Tyrol Suggests - STS
South Tyrol Suggests - STS
 
Prediction of taxi rides ETA
Prediction of taxi rides ETAPrediction of taxi rides ETA
Prediction of taxi rides ETA
 
Motion analyser using image processing
Motion analyser using image processingMotion analyser using image processing
Motion analyser using image processing
 
Servitization Federica Santuccio
Servitization Federica SantuccioServitization Federica Santuccio
Servitization Federica Santuccio
 
Network analysis for shortest optimum path
Network analysis for shortest optimum pathNetwork analysis for shortest optimum path
Network analysis for shortest optimum path
 
Big data Analytics for Tourism Destination management by Professor G MIchael ...
Big data Analytics for Tourism Destination management by Professor G MIchael ...Big data Analytics for Tourism Destination management by Professor G MIchael ...
Big data Analytics for Tourism Destination management by Professor G MIchael ...
 
(Slides) A Personal Navigation System with a Schedule Planning Facility Based...
(Slides) A Personal Navigation System with a Schedule Planning Facility Based...(Slides) A Personal Navigation System with a Schedule Planning Facility Based...
(Slides) A Personal Navigation System with a Schedule Planning Facility Based...
 
Generating paths through discovered places-of-interests for city trip planning
Generating paths through discovered places-of-interests for city trip planningGenerating paths through discovered places-of-interests for city trip planning
Generating paths through discovered places-of-interests for city trip planning
 
01 example of literature presentation
01 example of literature presentation01 example of literature presentation
01 example of literature presentation
 
Learning And Inferring Transportation Routines
Learning And Inferring Transportation RoutinesLearning And Inferring Transportation Routines
Learning And Inferring Transportation Routines
 
Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...Gao cong geospatial social media data management and context-aware recommenda...
Gao cong geospatial social media data management and context-aware recommenda...
 
Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...
Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...
Constrained Interest-base Tour Recommendations in Large Scale Cultural Herita...
 
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learn...
 
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
 
IEEE Projects 2014-2015
IEEE Projects 2014-2015IEEE Projects 2014-2015
IEEE Projects 2014-2015
 
Research on the characteristics and evaluation of nightscape along the LRT line
Research on the characteristics and evaluation of nightscape along the LRT lineResearch on the characteristics and evaluation of nightscape along the LRT line
Research on the characteristics and evaluation of nightscape along the LRT line
 
Sensing City Potential through Social Data @ ICMU2014 Panel
Sensing City Potential through Social Data @ ICMU2014 PanelSensing City Potential through Social Data @ ICMU2014 Panel
Sensing City Potential through Social Data @ ICMU2014 Panel
 
Driving Behavior for ADAS and Autonomous Driving VIII
Driving Behavior for ADAS and Autonomous Driving VIIIDriving Behavior for ADAS and Autonomous Driving VIII
Driving Behavior for ADAS and Autonomous Driving VIII
 
Gunjan insight student conference v2
Gunjan insight student conference v2Gunjan insight student conference v2
Gunjan insight student conference v2
 

Recently uploaded

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Travel route reconmmendations using geotagged photos

  • 1. Travel route recommendation using geotagged photos Takeshi Kurashima, Tomoharu Iwata, Go Irie, Ko Fujimura
  • 2. The system's user interface
  • 3. Block diagram of the system User location Recommended landmark sequence of given travel time User mode of transport User's free time and allowed margin Recommendation system Recommended landmark sequence of given travel time ... User requested number of sequences Recommended landmark sequence of given travel time Flickr photos User's photos
  • 4. Necessary components ● ● Identifying landmarks in the area (no given list) and naming them Estimation of travel time between landmarks using given transportation method ● Estimation of time spent visiting each landmark ● Recommending landmark sequences for the user
  • 5. Previous Work and Innovation ● Previous Work: ● ● ● ● ● Crandall et al - extract landmarks at various granularity levels from Flickr photos using mean-shift, and name them Popescu et al - popular trips within a city from photo data-sets Choudhury et al - constructing representative travel routes linking popular landmarks within a city using popularity of landmarks, stay times and transit times Popescu et al - deducing the typical visit duration of a landmark Main innovation here: ● ● ● Personalized recommendations based on user's location history and implicit interests Estimation of traveling times between landmarks using different modes of transport Building a complete recommender system implementing the ideas above
  • 6. Flickr ● ● Many digital cameras and phones add a geo-location tag to images automatically. Flickr houses at least 221,883,830 Geo-tagged time-stamped photos from over 51 million users. http://www.flickr.com/map ● ● Time-stamps will be used for travel time estimation ● Textual tags are used to name the extracted landmark ● ● Geo-tags will be used for landmark extraction and recommendation NOT using the actual photos at all, just the meta-data (fast) The Flickr API allows searching for public images taken in a given geobox or geo-circle for non-commercial use. http://www.flickr.com/services/api/flickr.photos.search.html
  • 7. Assumptions ● Taking a picture of a place and uploading it to Flickr constitutes a recommendation. (Not many “This museum was boring” photos) ● Geo-locations of camera and of photographed object are equivalent (The lookout point is recommended, not the view) ● NOT assuming absolute time stamps of photos are correct, since many camera clocks aren't set. Image time-deltas are used.
  • 8. From photos to landmarks ● Clustering points in a two-dimensional space
  • 9. Landmark Extraction Assumptions ● ● The probability of taking a photograph of a landmark is distributed normally ( ) as a function of the distance (>0) from the landmark Each photo is of one landmark. (A photo of a close object against the background of a distant one is a photo of the closer object)
  • 10. The Mean-Shift procedure: ● Estimates the local maximum of the probability distribution of each cluster of photos – the location of a landmark ● Its only parameter is the bandwidth ω ● Iteratively compute for each photo, until it converges:
  • 11. From photos to landmarks ● ● Substitute the geo-location of each photo with the landmark it captures. Group successive user photos of the same landmark as one photo. ● ● ● (Taking many pictures of a place isn't considered a stronger recommendation) The time-stamp of grouped photos is the average between the time-stamps of the first and last successive photos of the landmark. The textual representation of each discovered landmark is the most common tag of all the photos of the landmark
  • 12. Photographer behavior model ● We want to estimate P( lt | <lt−1lt−2...l1>, hu ), the probability that: – user u – with location history hu – at landmark lt−1 at time t−1, lt−2 at time t−2, etc. – ● visits lt at time t We assume the photographer's decision on the next landmark to visit is a function of: – The photographer's current location (sequence) – The photographer's topics of interest
  • 13. Location-based Model ● Using a Markov model: – For simplicity, a first-order Markov model is used: – Maximum likelihood estimation:
  • 14. Topic Model (PLSA) ● Each user is a distribution over topics Z ● Each topic is a distribution over the landmarks User ● Landmark distributions Using the law of total probability: P(lt) = ● Topic distribution P(z) Assuming P(hu) and P(lt) are independently conditioned on p(z) we get P(lt|z,hu)P(z|hu) =
  • 15. Expectation Maximization (EM) ● Computes P(lt|z), P(z|hu) for the topic formula iteratively until convergence using : ● E step: ● M step: visible
  • 16. Markov-Topic Model ● Assuming P(hu) and P(lt-1) are independently conditioned on P(lt) we get, after derivation: Topic Markov “Normalizing Factor” P(lt-1|hu)
  • 17. Generating travel routes ● Naive method: – – ● Compute the probability of all possible routes of given time based on user's location and history Choosing the most probable ones A best-first-search is used on the probability tree: P(l1|l0,hu) l1 P(l2|<l1l0>,hu) l0 P(l2|l0,hu) l2 P(l3|l0,hu) l3
  • 18.
  • 19. Travel time estimation ● The time-delta between consecutive landmarks in a sequence represents travel time between them, using a specific mode of transport – (and sometimes includes some of the visit times of both locations)
  • 20. K-means ● K-means is used on each two landmarks. – ● Identifies K typical travel times between them using different transportation methods K=3 was chosen – – ● Three peaks visible here: Google Maps gives estimates for walking, using public transportation and using a car Walking is assumed to be slowest, followed by public transport, then private car
  • 21. Experiments ● 696,394 photographs ● 71,718 users ● Photos taken within 20 km from the center of: – Washington D.C., New York City, Philadelphia and Boston on the East Coast – Los Angeles, San Francisco and Las Vegas on the West Coast
  • 22. Choosing the number of topics ● Rating by precision of prediction of last landmark of each sequence, over 5-fold cross-validation
  • 25. Comparison of estimation of travel time against Google Maps ● Does this reflect on the system or on Google Maps?
  • 26. Routes per time period
  • 28. Routes chosen by topic Routes suggested by the Markov model alone: Routes suggested by the Markov-Topic model:
  • 29. Future Work ● ● ● Using photographer's social network profile and friends list Consideration of opening hours, congestion and fee Evaluation in the field
  • 30. Take-away points ● ● ● ● ● Creatively looking for data Building a complete system is a teaching experience. To build a system, it's frequently necessary to use a variety of (AI) methods – it's good to have a diverse mental toolbox, or a diverse team. Testing is important - quantitative experiments on a large-scale dataset. Statistically significantly better than the competitors.

Editor's Notes

  1. Mainly suited for heavily photographed areas frequented by Flickr users
  2. David Crandall, Lars Backstrom, Daniel Huttenlocher, and Jon Kleinberg. Mapping the world’s photos. In Proc. WWW’2009, pages 761–770, April 2009. Popescu A, Grefenstette G, Moëllic (2009) Mining tourist information from user-supplied collections. In: Proceedings of 18th ACM conference on information and knowledge management, pp 1713–1716 Choudhury MD, Feldman M, Amer-Yahia S et al (2010) Constructing travel itineraries from tagged geo-temporal breadcrumbs. In: Proceedings of 19th international conference on world wide web, Pp 1083–1084 Popescu A, Grefenstette G (2009) Deducing trip related information from Flickr. In: Proceedings of 18th international conference on world wide web, pp 1183–1184
  3. The number of Flickr users was taken from a 2011 advertising pitch: http://advertising.yahoo.com/article/flickr.html.
  4. Visiting time of all landmarks is ZERO. (“Japanese tourism”) Not exactly correct
  5. The larger the bandwidth, the more images are grouped by the procedure since they didn&apos;t fall off the ends of the probability distribution function (u users) Video from http://www.youtube.com/watch?v=nuLYrSZ3fRo
  6. Note the models are NOT independent. The current location is possibly affected by interests.
  7. Probabilistic latent semantic analysis A,B are independently conditioned on z if they&apos;re independent after z is known. &lt;Z,B&gt; gives no more information for A than just Z. Knowing the topic of a landmark is sufficient, no need to know the topic and the user No need to go back two steps in the above graph
  8. Proving the algorithm takes about six hours. It&apos;s an impressive proof, I recommend it
  9. assume conditional independence: p(hu,lt-1|lt) = p(hu|lt)p(lt-1|lt) p(lt|lt-1,hu) = p(lt-1, hu|lt)p(lt)/p(lt-1,hu) using assumption: = p(hu|lt)p(lt-1|lt)p(lt)/p(lt-1,hu) using bayes on p(lt-1|lt): = p(hu|lt)p(lt|lt-1)p(lt-1)/p(lt)*p(lt)/p(lt-1,hu) = p(hu|lt)p(lt|lt-1)/p(lt-1,hu) using bayes on p(hu|lt): p(lt|hu)p(hu)/p(lt)*p(lt|lt-1)/p(lt-1,hu) = p(lt|hu)p(hu)p(lt|lt-1)/p(lt-1,hu)p(lt) = p(lt|hu)p(lt|lt-1)/p(lt) * p(hu)/p(lt-1,hu) define C(lt-1,hu) = p(lt-1,hu)/p(hu) = p(lt-1|hu) = p(lt|hu)p(lt|lt-1)/p(lt)C(lt-1,hu) = topic*markov/(easy*C(lt-1, hu))
  10. Sometimes = If consecutive photos were merged If two location are very close, everybody would walk between them and not take a car or a train, so the estimates of the 3 transportation methods would be very similar
  11. 5-fold cross-validation to avoid over-fitting
  12. Compared strategies: Multinomial model – simply recommend the most popular landmark Markov model – considers user&apos;s location only Topic model – considers user&apos;s interests only Markov-Topic model Trained with all sequences, removing the last landmark Tested with all sequences
  13. Tested by slicing off the given time period from all the sequences. A bit less data since some sequences were shorter than the time period and were dismissed
  14. (Assuming Google&apos;s estimation is good) Estimated travel times are a little longer than Google&apos;s since they include some of the visit time and since are affected by weather, traffic etc. The system&apos;s estimation is actually based on real data from these specific routes more than Google&apos;s estimation