STATISTICAL ANALYSIS OF TAXI
MOBILITY IN SAN FRANCISCO




                               Oleguer Sagarra Pascual
          Master in Computational Physics. UB-UPC 2011
                  PhD Supervisor: Dr. Albert Diaz Guilera
Motivation

 Human Mobility Research
   GPS data applications
 Complex Network Science
   Directed Weighted network metric
   Big dataset visualisation
Structure


 The data
 Statistical review
 Building the net
 Net structure
 Open questions and some answers
The Data
  GPS high frequency* mobility traces from CRAWDAD
  UNIX time, GPS latitude & longitude, occupancy
  537 Taxis**
  10^6 Trips




* ⟨t_{i+1} − t_i⟩ ∼ 90 s

** Considered independent
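
A minimal sketch of how a single taxi trace could be cut into trips, assuming a trip is a maximal run of consecutive records with the same occupancy flag. The record layout `(unix_time, latitude, longitude, occupancy)` and the helper names are illustrative assumptions, not the procedure used for these slides.

```python
from itertools import groupby

# Assumed record layout: (unix_time, latitude, longitude, occupancy),
# with occupancy 1 = full (customer on board) and 0 = empty.

def split_into_trips(records):
    """Group the time-ordered records of one taxi into runs of constant occupancy."""
    ordered = sorted(records, key=lambda r: r[0])
    trips = []
    for occupancy, run in groupby(ordered, key=lambda r: r[3]):
        points = list(run)
        trips.append({
            "occupancy": occupancy,
            "start": points[0],
            "end": points[-1],
            "duration_s": points[-1][0] - points[0][0],
        })
    return trips

def mean_sampling_interval(records):
    """Average time gap between consecutive records, in seconds."""
    times = sorted(r[0] for r in records)
    gaps = [b - a for a, b in zip(times, times[1:])]
    return sum(gaps) / len(gaps)

# Toy trace (made-up values, not taken from the dataset).
trace = [
    (1000, 37.75, -122.39, 0),
    (1090, 37.76, -122.40, 0),
    (1180, 37.77, -122.41, 1),
    (1270, 37.78, -122.42, 1),
    (1360, 37.79, -122.43, 0),
]
trips = split_into_trips(trace)
```

Each taxi would be processed separately, which matches the assumption that taxis are independent.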
Statistical Analysis: Overview

Constant regime
ending with fat tails
Initial Noise
Similar shapes for Δr
Bump on the tails



   Typical Δt, Δr
Efficiency & Location
Do taxis perform differently?
  Efficiency parameters
Do taxis move differently?
  Centre of mass
  Radius of Gyration
Spread

 Radius of Gyration: Mean
 spread over taxi reference
 frame.
 Taxis wander when empty,
 not when full.
 Taxis mainly operate
 inside the city: Short
 Trips*.

* But the tails are fat... what about long trips?
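
A minimal sketch of the centre of mass and radius of gyration of one taxi, assuming positions are already given in planar coordinates (metres) and that every recorded point has equal weight. The function names are illustrative.

```python
import math

def centre_of_mass(points):
    """Mean position of a list of (x, y) points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)

def radius_of_gyration(points):
    """Root mean square distance of the points from their centre of mass."""
    cx, cy = centre_of_mass(points)
    mean_sq = sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in points) / len(points)
    return math.sqrt(mean_sq)

# Toy example: the four corners of a 2 m x 2 m square.
points = [(0.0, 0.0), (2.0, 0.0), (2.0, 2.0), (0.0, 2.0)]
centre = centre_of_mass(points)
r_g = radius_of_gyration(points)
```

Computing `r_g` separately for the empty and the full points of each taxi would give the two spreads compared on this slide.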
Long Trips: Ranges & Correlation

  Overlap in Δr not present in Δt

  Length correlation for consecutive (full-empty) trip pairs in two groups:

     Short (Δr<5 km)

     Long (Δr>20 km)




The similarity between the tails is explained by legal issues

Movement habits of customers determine the tails of both statistics
Building the net


  Discretized grid of San Francisco area (100 x 100 m)*
  Nodes: Locations (∼17000)
  Edges: Trips between locations (weighted and directed)
  Excluded self-loops (∼5000) and isolated nodes (∼100)


* Using the Cartesian UTM coordinate system
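
A minimal sketch of the construction described above, assuming trip endpoints have already been converted to UTM easting/northing in metres. The names `cell_of` and `build_network` are hypothetical, and the toy coordinates are made up.

```python
from collections import Counter

CELL_SIZE_M = 100.0  # grid spacing from the slide: 100 x 100 m cells

def cell_of(easting, northing, cell_size=CELL_SIZE_M):
    """Map a planar position in metres to an integer grid cell (a node)."""
    return (int(easting // cell_size), int(northing // cell_size))

def build_network(trips, cell_size=CELL_SIZE_M):
    """trips: iterable of ((e0, n0), (e1, n1)) start and end points.

    Returns a Counter mapping (origin_cell, destination_cell) to the
    number of trips on that directed edge. Self-loops are dropped.
    """
    weights = Counter()
    for start, end in trips:
        origin = cell_of(*start, cell_size)
        destination = cell_of(*end, cell_size)
        if origin != destination:
            weights[(origin, destination)] += 1
    return weights

trips = [
    ((550010.0, 4180020.0), (550350.0, 4180020.0)),
    ((550040.0, 4180090.0), (550399.0, 4180001.0)),  # same pair of cells as above
    ((550350.0, 4180020.0), (550010.0, 4180020.0)),  # reverse direction
    ((550010.0, 4180020.0), (550090.0, 4180080.0)),  # starts and ends in one cell
]
edges = build_network(trips)
```

Because only edges are stored, a cell that appears solely in self-loops never enters the network, so isolated nodes are excluded as a side effect.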
Net structure
  Truncated power law
  k,s,ω distribution*


  Full net less dense, faster
  decay in trips**
*Locations in San Francisco are
limited, amount of trips is not.
**Behavioural differences
Weighted net: Correlations
 Strength and degree provide similar information
 The more connected a node is, the more often it is
 visited; it becomes more important and better
 connected with important nodes.
Assortativity: r values
 We want to study behavioral patterns: Linking tendency

 Pearson r values on strengths (quantitative measurement)

   Directed net : 4 possible pairs

   Full: All disassortative, Empty: Mixed behavior
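
A minimal sketch of the four strength correlations on a directed weighted net, assuming each edge counts once and that r is the plain Pearson coefficient between a strength of the source node and a strength of the target node. The exact estimator behind the slide values is not stated, so this is one possible reading; the toy edge list is made up.

```python
import math
from collections import defaultdict

def strengths(edges):
    """Return (out-strength, in-strength) dictionaries from {(u, v): weight}."""
    s_out, s_in = defaultdict(float), defaultdict(float)
    for (u, v), w in edges.items():
        s_out[u] += w
        s_in[v] += w
    return s_out, s_in

def pearson(xs, ys):
    """Pearson correlation of two equally long sequences (nan if undefined)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return float("nan")
    return cov / math.sqrt(vx * vy)

def strength_assortativity(edges):
    """r for the 4 (source strength, target strength) pairs, taken over all edges."""
    s_out, s_in = strengths(edges)
    kinds = {"out": s_out, "in": s_in}
    result = {}
    for a, source_strength in kinds.items():
        for b, target_strength in kinds.items():
            xs = [source_strength[u] for u, _ in edges]
            ys = [target_strength[v] for _, v in edges]
            result[(a, b)] = pearson(xs, ys)
    return result

edges = {("hub", "a"): 3, ("hub", "b"): 1, ("a", "hub"): 2,
         ("b", "hub"): 1, ("a", "b"): 1}
r = strength_assortativity(edges)
```

A negative entry of `r` indicates a disassortative tendency for that pair and a positive one an assortative tendency.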
Top connected nodes greatly influence the statistics

Net is robust

  Assortative tendency for middle ranged nodes

  Disassortative tendency for small/top nodes
Assortativity: neighbor degree
    We compute the mean (weighted) neighbor degree: Qualitative
    measure for linking tendency of equivalent* nodes P(k|k’).
        If the network is uncorrelated, ⟨k_nn⟩ = const.




*Taking a mean field approach, i.e. nodes are equivalent if they are equally connected/preferred (degree/strength).
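
A minimal sketch of the mean weighted neighbor degree, assuming the common definition k^w_nn,i = (1/s_i) Σ_j w_ij k_j and, for a directed net, using outgoing edges and out-degrees only. That directed variant and the function names are illustrative choices.

```python
from collections import defaultdict

def weighted_neighbor_degree(edges):
    """edges: {(u, v): weight}. Returns (k^w_nn per node, out-degree per node)."""
    neighbours = defaultdict(dict)
    for (u, v), w in edges.items():
        neighbours[u][v] = w
    out_degree = {u: len(nbrs) for u, nbrs in neighbours.items()}
    knn_w = {}
    for u, nbrs in neighbours.items():
        strength = sum(nbrs.values())
        # Neighbours without outgoing edges contribute degree 0.
        knn_w[u] = sum(w * out_degree.get(v, 0) for v, w in nbrs.items()) / strength
    return knn_w, out_degree

def average_by_degree(knn_w, degree):
    """Mean-field step: average k^w_nn over all nodes sharing the same degree."""
    buckets = defaultdict(list)
    for node, value in knn_w.items():
        buckets[degree[node]].append(value)
    return {k: sum(vals) / len(vals) for k, vals in sorted(buckets.items())}

edges = {("hub", "a"): 3, ("hub", "b"): 1, ("a", "hub"): 2,
         ("b", "hub"): 1, ("a", "b"): 1}
knn_w, out_degree = weighted_neighbor_degree(edges)
knn_of_k = average_by_degree(knn_w, out_degree)
```

A flat `knn_of_k` would correspond to the uncorrelated case; an increasing (decreasing) trend suggests assortative (disassortative) linking.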
Net representations



 Empty Net (video)
Some answers
     So, how do customers move?

          Limited minimum range of usage of taxi (but non-negligible long trips)

          Heterogeneous destinations (mostly inside the city) and heterogeneous lengths

          Move between hot spots, move from/to hubs and scattered locations

     So, how do drivers move?

          They are strongly influenced by customers (long trips)

          They do not seem to perform very differently*

          Searching strategy after each run: Nearest local hot spot (clustering) or global
          hub (assortativity)**

     Obvious? Maybe, but allows for comparison with other data sets and is a proof that
     the methodology is correct.

* Although no scaling relation was found...

** Reminiscent of Lévy Flight...
Open questions and further research

     Mobility:

          GPS mobility traces can be studied through a consistent* complex metric

          Set the basis for a simple mechanistic (agent based) model to be simulated and
          optimized

          Compare with other datasets (other transportation means, other cities...)

     Complex Networks:

          Refine assortativity measures (r/neighbor degree)

          Further statistical characterization of the network (correlations, clustering, disparity...)

          Introduction of time measures: Net growth, dynamical weighted net, coupling
          transport/social networks...



* Fat tailed degree distributions, similar features shared with other studied systems.
Thanks for your attention...
STATISTICAL ANALYSIS OF TAXI
MOBILITY IN SAN FRANCISCO:
Complementary Material




                               Oleguer Sagarra Pascual
          Master in Computational Physics. UB-UPC 2011
Trimming and fitting the data
Distributions for Δr,Δt
Building the net: Isolated and Selfloops
Complex Networks: Definitions

 E : Number of edges, N: Number of nodes

 Density

 Degree k (in-out): Number of edges entering (leaving) a node.

 Degree (strength) assortativity: r, ⟨k^w_nn⟩, ⟨s^w_nn⟩

 Betweenness centrality: Rel. number of (weighted) shortest paths
 passing through a node.

 Strongly (Weakly) connected component: Maximal subgraph
 containing the nodes connected among themselves via
 directed (undirected) paths.
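
A minimal sketch of the simplest quantities in this list, assuming the density of a directed net without self-loops is E / (N (N − 1)). The slide does not show the formula, so that expression is an assumption; the toy edge list is made up.

```python
from collections import Counter

def basic_metrics(edges):
    """edges: {(u, v): weight} for a directed net without self-loops."""
    nodes = {node for edge in edges for node in edge}
    n_nodes, n_edges = len(nodes), len(edges)
    k_out = Counter(u for u, _ in edges)  # number of edges leaving each node
    k_in = Counter(v for _, v in edges)   # number of edges entering each node
    density = n_edges / (n_nodes * (n_nodes - 1))
    return {"N": n_nodes, "E": n_edges, "density": density,
            "k_in": k_in, "k_out": k_out}

edges = {("hub", "a"): 3, ("hub", "b"): 1, ("a", "hub"): 2,
         ("b", "hub"): 1, ("a", "b"): 1}
metrics = basic_metrics(edges)
```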
Betweenness/Strength and Assortativity
