SlideShare a Scribd company logo
1 of 34
Doing in One Go: Delivery Time
Inference Based on Couriers’ Trajectories
Hongkyu Lim
2021. 03. 19
KDD ’20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge
Discovery & Data Mining August 2020
Contents
•Overview of the Paper
•Experiments
•Case Study
•System Deployment
2
Overview of the Paper
• Basic Ideas and Motivation
 “Last mile” problem - It would be of great value for both the couriers and the logistics
company if we can infer the delivery time.
 A courier would stay at a location for a while when he/she is delivering a parcel, thus
generating a stay point. A straightforward solution would be to extract the delivery
time based on the stay points of the trajectories.
• Problems
 It is not possible to infer the delivery time directly from stay points due to the.
following two main challenges:
• Inaccurate delivery locations - Most of the Geocoded waybill locations have some distance shifts
to the delivery caused stay points
• Various stay scenarios -Even if we find the closest stay point to the true delivery location, we still
cannot say that the parcel is delivered at that stay point. The reason is that a courier might stay.
at a location for various reasons.
3
Overview of the Paper
4
Overview of the Paper
• Solution
 Delivery Time Inference (DTInf)
• DTInf can automatically infer the delivery time of each completed waybill
based on couriers’ trajectories.
The proposed system contains three main components:
5
Overview of the Paper
• Solution
 Delivery Time Inference (DTInf)
The proposed system contains three
main components:
1) Data Pre-processing: cleans trajectorie
-s from couriers, detects stay points, and
separates stay points and waybills by deli
-very trips;
2) Delivery Location Correction:
correcting the Geocoded waybill location
-s based on their historical deliveries to
them
3) Delivery Event-based Matching:
forming several delivery events by groupi
-ng waybills according to their delivery
locations, and matches each delivery
event with the most likely stay point in
its neighborhood.
6
Overview of the Paper
• Terminology
 Waybill
• A waybill is a parcel delivery task assigned to a courier, denoted as a 4-tuple
• 𝑙𝑎 is the Geocoded waybill location of the shipping address
• F𝑝 are features of the parcel, e.g., the weight and the volume
• 𝑡𝑟 is the timestamp, at which a courier receives the parcel
• 𝑡𝑑 is the delivery time
7
Overview of the Paper
• Terminology
 Delivery Location
• A delivery location is a spatial point, denoted as
• a location (𝑥,𝑦) : longitude and latitude
8
Overview of the Paper
• Terminology
 Trajectory
• A trajectory is a sequence of spatio-temporal points, denoted as
• A location (𝑥,𝑦) : longitude and latitude
• Each point 𝑝 = (𝑥,𝑦,𝑡) indicates the physical presence at a location (𝑥,𝑦) (e.g., longitude and l
atitude) at time 𝑡. Points in a trajectory are organized chronologically.
9
Overview of the Paper
• Terminology
 Stay Point
• A stay point is a subsequence of the trajectory, which semantically means that a moving
object stays in a geographic region for a while.
• Formally, given a distance threshold 𝐷𝑚𝑎𝑥 and a time threshold 𝑇𝑚𝑖𝑛
• A stay point 𝑠𝑝 if 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑝𝑖,𝑝𝑘) ≤ 𝐷𝑚𝑎𝑥 (∀𝑘 ∈ [𝑖 + 1, 𝑗]), 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑝𝑖,𝑝𝑗+1)>𝐷𝑚𝑎𝑥 (if𝑗<𝑛)
,and|𝑝𝑗.𝑡−𝑝𝑖.𝑡|≥𝑇𝑚𝑖𝑛.The time interval of a 𝑠𝑝 is[𝑝𝑖.𝑡,𝑝𝑗.𝑡].
10
Overview of the Paper
• Terminology
 Delivery Trip
• A delivery trip is a process that a courier delivers a batch of parcels to customers.
 Stay Point
• The location of a 𝑠𝑝 is estimated using its spatial centroid:
• The time of a 𝑠𝑝 is defined as the middle point of its time interval:
• Particularly, if a stay point is caused by a delivery, we call it a delivery caused stay point. In
historical data, it can be identified by checking whether there is a parcel delivered during the
time interval of the stay point based on the delivery time of the waybill.
11
Overview of the Paper
• Problem Formulation
 Stay point -> Delivery caused stay point
• Reasons for this strategy
• Short delivery stay: measurability (80% waybills, the delivery duration does not last for longer than 20
minutes )
• Anonymized shipping address: One or more parcels can be delivered at the same stay point. However,
Geocoding anonymizes the detailed floor information of shipping addresses
12
Overview of the Paper
• Problem Formulation
 Stay point -> Delivery caused stay point
• Given courier’s stay points 𝑆𝑃 = {𝑠𝑝𝑗 |𝑗 ∈ 1, ..,𝑚} detected from the trajectory of a delivery trip,
and the waybills 𝑊 = {𝑤𝑖|𝑖 ∈ 1,...,𝑝} he/she completed in the trip, the objective is to match each
waybill 𝑤𝑖 with its delivery caused stay point 𝑠𝑝 𝑗 .
13
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Data Pre-processing
• DP takes couriers’ trajectories
• 3 main tasks
1) Noise Filtering, which removes the outlier
GPS points
2) Stay Point Detection, which detects all the stay
points from the trajectories
3) Delivery Trip Identification, which separates
waybills and stay points by the identified delivery
trips
14
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Data Pre-processing
• DP takes couriers’ trajectories
• 3 main tasks
1) Noise Filtering, which removes the outlier
GPS points
• The error of 𝑝4 and 𝑝7 might be several hundred
meters away from its true location. Such noise points
would affect the quality of stay point detection.
(Trajectory data mining by Yu Zheng)
15
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Data Pre-processing
• DP takes couriers’ trajectories
• 3 main tasks
2) Stay Point Detection
• Based on the cleaned trajectories, we extract all stay
points from them. We use stay points not only to infer
the delivery time, but also to find the real delivery.
locations.
• 𝐷𝑚𝑎𝑥 =20𝑚 and 𝑇𝑚𝑖𝑛 =30𝑠.
16
Overview of the Paper
• System Framework (The system framework of DTInf )
 Data Pre-processing
• 3 main tasks
3) Delivery Trip Identification
• A trip begins when the number of receiving parcels stops increasing, and the number of delivering
parcels begins to increase
• A trip ends if the opposite condition holds.
17
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Location Correction
• DLC takes historical waybills and stay points, and
generates the location mapping from the Geocoded
waybill location to the delivery location
• 3 main steps
1) Inverted Indexing, which finds all historical delivery
caused stay points for each Geocoded waybill
location
2) Location Inference, which infers the raw delivery
location based on the inverted index
3) Location Refinement, which refines the raw deliver
-y location by merging it with its nearby delivery
locations discovered by other Geocoded waybill
locations
18
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Location Correction
• It is difficult to set a global consistent judgment
about which stay point might be the delivery caused
stay point of a certain waybill if we treat the
Geocoded waybill location as the delivery location.
• Fortunately, because a customer might place orders
multiple times using the same shipping address, a Ge
ocoded waybill location might appear several times.
19
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Location Correction
• Inspired by 2 insights
1) Multiple delivery caused stay points
• It is noticeable that although those stay points are qui
te close, there are still minor differences. If all stay poi
nts are leveraged, the delivery location correction can
be more accurate.
2) Redundant Geocoded waybill locations
• It can be noticed that their delivery caused stay points
have considerably large overlaps, which indicates they
potentially correspond to the same delivery location.
20
Overview of the Paper
• System Framework (The system framework of DTInf )
 Delivery Location Correction
• (a) shows the distribution of the number of delivery trips at a Geocoded waybill location duri-
-ng a period of 15 months. It is noticeable that for 72% Geocoded waybill locations, there
exist multiple deliveries. Besides, those locations can also appear in the future.
• the Geocoded waybill locations of waybills in the previous 4 months cover more than 80%
Geocoded waybill locations in the last month.
21
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Event-based Matching
• Delivery Event Construction
• We group waybills into several delivery events accordi
-ng to their corrected delivery locations.
• Then, in the later step, we can select the most probabl
-e stay point for each delivery event based on its
delivery location and assign that stay point to all
waybills in that event.
22
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Event-based Matching
• The reasons to perform the delivery event-level matc
-hing for waybills in a trip are two folds:
• Location by location delivery
• main stay point : For a delivery location in each.
trip, we can find a stay point that is the most
frequently matched by waybills at that location.
• If we perform the delivery event-level matching
and correctly infer the main stay point for each
delivery event, the time inference errors for
waybills are acceptable.
23
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Event-based Matching
• The reasons to perform the delivery event-level matc
hing for waybills in a trip are two folds:
• Correlations between delivery events and stay points
• The parcels delivered at the same delivery
location will affect the characteristics of the stay
point.
• Such characteristics cannot be captured if we
perform the inference task for each waybill.
individually.
24
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Delivery Event-based Matching
• Stay Point Selection
• Use a model to capture the correlation between
delivery events and the main delivery caused stay
points
• Ultimately improve the inference accuracy
• Binary classification problem
• For a delivery event and one stay point in its neighbor-
-hood, we extract features of them and predict wheth
-er the stay point is the main delivery caused stay
point of the delivery event. (MLP)
25
Overview of the Paper
• System Framework (The system framework
of DTInf )
 Four types of features are extracted:
• Location features: We obtain the POI category of the
Geocoded waybill location via the reverse Geocoding
service, which is encoded by the one-hot vector.
• Delivery event features: We extract four aggregated
information from the delivery events of waybills,
namely, the number of waybills, number of customer
s, total weight, and total volume.
• Stay point features: We extract the duration and the
area of the stay point.
• Matching feature: The geographical distance
between the centroid of the stay point and the
delivery location is extracted.
26
Experiments
• Datasets
 The datasets contain trajectories and waybills of 5 couriers at a delivery station in Ton
-gzhou District, Beijing over a period of about 15 months (from Apr. 12nd, 2018 to Jul.
7th, 2019).
 Couriers’ trajectories
• They are raw GPS logs generated by couriers’ PDAs, where each record contains a courier ID, a loc-
-ation, and a timestamp. The average sampling time interval is 7.4 seconds. The datasets contain 5
.93 million GPS points.
 Waybills
• Each record contains a customer ID, a courier ID, parcel information (e.g., weight and volume), the
time when the parcel is received, the time when the parcel is delivered, and a Geocoded waybill l
ocation. The datasets contain 274 thousand waybills. Besides, there are 16 POI categories we obta
ined via the reverse Geocoding service.
27
Experiments
• Datasets
 Baselines
• There is no existing solution that can exactly tackle our problem. Therefore, 3 baselines are design
-ed for comparison.
• Random Inference (RDInf): We randomly select a stay point from each waybill’s neighborhood as its deli
-very caused stay point.
• Spatial Nearest Inference (SNInf): SNInf matches each waybill with its closest stay point in its neighborh-
-ood.
• Temporal Longest Inference (TLInf): TLInf selects the stay point in waybill’s neighborhood with the longe
-st duration.
 Evaluation Metrics
• We use the accuracy, which is defined as the proportion of waybills whose corresponding delivery
caused stay points are correctly classified (i.e., their inferred delivery times are accurate).
• We also report the RMSE and the MAE based on the inferred delivery time and the time of deliver
-y caused stay points.
28
Experiments
• Datasets
 Variants
• We also compare DTInf with its three variants:
• DTInf-nC: This variant does not correct the delivery locations. The model is trained based on the Geocod
-ed waybill locations.
• DTInf-nM: This variant corrects the locations, but it selects the stay point that is the closest to the correc
-ted location.
• DTInf-nE: This variant also corrects the location, but it does not construct delivery events. Instead, it
infers delivery caused stay point for each waybill based on the same model, but the delivery event
features are replaced with individual waybill features.
29
Experiments
 Effectiveness Evaluation
• Merging Distance Selection
• The reason is that when 𝐷 becomes larger, re
dundant Geocoded waybill locations are cor-
rected to the same delivery locations, which
makes the delivery event modeling more accu
-rate. However, when 𝐷 is larger than 3m, the
performance is degraded, because we might
merge adjacent delivery locations by mistake.
We also report the ratio between the number
of detected delivery locations and Geocoded
waybill locations (denoted as the compression
rate) in the same figure.
30
Experiments
• Output
31
Case Study
• Red triangle – Geocoded location
• Blue points – Centroids of stay points
detected from the courier’s trajector
y of the corresponding delivery trip.
• The parcel is delivered in sp3.
• Geocoded waybill location has been
delivered multiple times in history->
It was possible to correct it.
32
System Deployment
• Grey line : Courier’s trajectory
• Blue circle – stay point centroids
• Red triangle – Geocoded waybill
locations
• Green circle – After clicking
Correct, the corrected delivery.
locations are displayed with.
green circles
• Big red circles – querying neighb-
-orhood.
• Once the infer button’s clicked, a
link is generated between the
corrected location and the
inferred delivery caused stay
point.
Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories

More Related Content

Similar to Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories

Logistic route rationalization and optimization
Logistic route rationalization and optimizationLogistic route rationalization and optimization
Logistic route rationalization and optimizationPuneet Mishra
 
Module 3 Part B - computer networks module 2 ppt
Module 3 Part B - computer networks module 2 pptModule 3 Part B - computer networks module 2 ppt
Module 3 Part B - computer networks module 2 pptanushaj46
 
NGSI: Geoqueries & Carto integration
NGSI: Geoqueries & Carto integrationNGSI: Geoqueries & Carto integration
NGSI: Geoqueries & Carto integrationFIWARE
 
NetworkAlgorithms.ppt
NetworkAlgorithms.pptNetworkAlgorithms.ppt
NetworkAlgorithms.ppt21121A0594
 
Location SUPPLY CHAIN ANALYSIS 234 .pptx
Location SUPPLY CHAIN ANALYSIS  234 .pptxLocation SUPPLY CHAIN ANALYSIS  234 .pptx
Location SUPPLY CHAIN ANALYSIS 234 .pptxEyliaAbbas1
 
Logistic Management in India
Logistic Management in IndiaLogistic Management in India
Logistic Management in IndiaRobin Patni
 
Logistics - Operational Planning - for XLRI PGCLSM
Logistics - Operational Planning - for XLRI PGCLSMLogistics - Operational Planning - for XLRI PGCLSM
Logistics - Operational Planning - for XLRI PGCLSMVinodh Soundarajan
 
How to determine demand Centers-of-Gravity
How to determine demand Centers-of-Gravity How to determine demand Centers-of-Gravity
How to determine demand Centers-of-Gravity StellingConsulting
 
ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...
ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...
ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...MineWare Pty Ltd
 
Global positioning system by abhiram
Global positioning system by abhiramGlobal positioning system by abhiram
Global positioning system by abhiramAbhiram Reddy Kamsani
 
Gps and remote sensing
Gps and remote sensingGps and remote sensing
Gps and remote sensingSanila Khalid
 
Receiving and shipping operations
Receiving and shipping operationsReceiving and shipping operations
Receiving and shipping operationsAyman Aljahdali
 
Towards characterizing international routing detours
Towards characterizing international routing detoursTowards characterizing international routing detours
Towards characterizing international routing detoursAPNIC
 
LogiCloud Presentation
LogiCloud PresentationLogiCloud Presentation
LogiCloud PresentationLogiCloud
 

Similar to Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories (20)

Knowledge logistics
Knowledge logisticsKnowledge logistics
Knowledge logistics
 
Dell project client presentation
Dell project client presentationDell project client presentation
Dell project client presentation
 
Logistic route rationalization and optimization
Logistic route rationalization and optimizationLogistic route rationalization and optimization
Logistic route rationalization and optimization
 
Ballou13
Ballou13Ballou13
Ballou13
 
Module 3 Part B - computer networks module 2 ppt
Module 3 Part B - computer networks module 2 pptModule 3 Part B - computer networks module 2 ppt
Module 3 Part B - computer networks module 2 ppt
 
NGSI: Geoqueries & Carto integration
NGSI: Geoqueries & Carto integrationNGSI: Geoqueries & Carto integration
NGSI: Geoqueries & Carto integration
 
NetworkAlgorithms.ppt
NetworkAlgorithms.pptNetworkAlgorithms.ppt
NetworkAlgorithms.ppt
 
Core Location in iOS
Core Location in iOSCore Location in iOS
Core Location in iOS
 
Navigation OD.pptx
Navigation OD.pptxNavigation OD.pptx
Navigation OD.pptx
 
Location SUPPLY CHAIN ANALYSIS 234 .pptx
Location SUPPLY CHAIN ANALYSIS  234 .pptxLocation SUPPLY CHAIN ANALYSIS  234 .pptx
Location SUPPLY CHAIN ANALYSIS 234 .pptx
 
Logistic Management in India
Logistic Management in IndiaLogistic Management in India
Logistic Management in India
 
Logistics - Operational Planning - for XLRI PGCLSM
Logistics - Operational Planning - for XLRI PGCLSMLogistics - Operational Planning - for XLRI PGCLSM
Logistics - Operational Planning - for XLRI PGCLSM
 
Robotics Localization
Robotics LocalizationRobotics Localization
Robotics Localization
 
How to determine demand Centers-of-Gravity
How to determine demand Centers-of-Gravity How to determine demand Centers-of-Gravity
How to determine demand Centers-of-Gravity
 
ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...
ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...
ISM 2016 A Mining Imperative: Enhanced accuracy of real-time spatial data and...
 
Global positioning system by abhiram
Global positioning system by abhiramGlobal positioning system by abhiram
Global positioning system by abhiram
 
Gps and remote sensing
Gps and remote sensingGps and remote sensing
Gps and remote sensing
 
Receiving and shipping operations
Receiving and shipping operationsReceiving and shipping operations
Receiving and shipping operations
 
Towards characterizing international routing detours
Towards characterizing international routing detoursTowards characterizing international routing detours
Towards characterizing international routing detours
 
LogiCloud Presentation
LogiCloud PresentationLogiCloud Presentation
LogiCloud Presentation
 

More from ivaderivader

DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernelsivaderivader
 
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality ivaderivader
 
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...ivaderivader
 
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...ivaderivader
 
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...ivaderivader
 
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial NetworksA Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networksivaderivader
 
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...ivaderivader
 
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for VisualizationPerception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualizationivaderivader
 
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...ivaderivader
 
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-PoolingNeural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-Poolingivaderivader
 
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...ivaderivader
 
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTubeBad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTubeivaderivader
 
Invertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise RemovalInvertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise Removalivaderivader
 
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural NetworkTraffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Networkivaderivader
 
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training  MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training ivaderivader
 
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI ComponentsScreen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI Componentsivaderivader
 
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...ivaderivader
 
Natural Language to Visualization by Neural Machine Translation
Natural Language to Visualization by Neural Machine TranslationNatural Language to Visualization by Neural Machine Translation
Natural Language to Visualization by Neural Machine Translationivaderivader
 

More from ivaderivader (20)

Argument Mining
Argument MiningArgument Mining
Argument Mining
 
Papers at CHI23
Papers at CHI23Papers at CHI23
Papers at CHI23
 
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
 
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
 
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
 
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
 
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
 
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial NetworksA Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
 
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
 
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for VisualizationPerception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
 
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
 
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-PoolingNeural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
 
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
 
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTubeBad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
 
Invertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise RemovalInvertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise Removal
 
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural NetworkTraffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
 
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training  MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
 
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI ComponentsScreen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
 
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
 
Natural Language to Visualization by Neural Machine Translation
Natural Language to Visualization by Neural Machine TranslationNatural Language to Visualization by Neural Machine Translation
Natural Language to Visualization by Neural Machine Translation
 

Recently uploaded

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 

Recently uploaded (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 

Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories

  • 1. Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories Hongkyu Lim 2021. 03. 19 KDD ’20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining August 2020
  • 2. Contents •Overview of the Paper •Experiments •Case Study •System Deployment
  • 3. 2 Overview of the Paper • Basic Ideas and Motivation  “Last mile” problem - It would be of great value for both the couriers and the logistics company if we can infer the delivery time.  A courier would stay at a location for a while when he/she is delivering a parcel, thus generating a stay point. A straightforward solution would be to extract the delivery time based on the stay points of the trajectories. • Problems  It is not possible to infer the delivery time directly from stay points due to the. following two main challenges: • Inaccurate delivery locations - Most of the Geocoded waybill locations have some distance shifts to the delivery caused stay points • Various stay scenarios -Even if we find the closest stay point to the true delivery location, we still cannot say that the parcel is delivered at that stay point. The reason is that a courier might stay. at a location for various reasons.
  • 5. 4 Overview of the Paper • Solution  Delivery Time Inference (DTInf) • DTInf can automatically infer the delivery time of each completed waybill based on couriers’ trajectories. The proposed system contains three main components:
  • 6. 5 Overview of the Paper • Solution  Delivery Time Inference (DTInf) The proposed system contains three main components: 1) Data Pre-processing: cleans trajectorie -s from couriers, detects stay points, and separates stay points and waybills by deli -very trips; 2) Delivery Location Correction: correcting the Geocoded waybill location -s based on their historical deliveries to them 3) Delivery Event-based Matching: forming several delivery events by groupi -ng waybills according to their delivery locations, and matches each delivery event with the most likely stay point in its neighborhood.
  • 7. 6 Overview of the Paper • Terminology  Waybill • A waybill is a parcel delivery task assigned to a courier, denoted as a 4-tuple • 𝑙𝑎 is the Geocoded waybill location of the shipping address • F𝑝 are features of the parcel, e.g., the weight and the volume • 𝑡𝑟 is the timestamp, at which a courier receives the parcel • 𝑡𝑑 is the delivery time
  • 8. 7 Overview of the Paper • Terminology  Delivery Location • A delivery location is a spatial point, denoted as • a location (𝑥,𝑦) : longitude and latitude
  • 9. 8 Overview of the Paper • Terminology  Trajectory • A trajectory is a sequence of spatio-temporal points, denoted as • A location (𝑥,𝑦) : longitude and latitude • Each point 𝑝 = (𝑥,𝑦,𝑡) indicates the physical presence at a location (𝑥,𝑦) (e.g., longitude and l atitude) at time 𝑡. Points in a trajectory are organized chronologically.
  • 10. 9 Overview of the Paper • Terminology  Stay Point • A stay point is a subsequence of the trajectory, which semantically means that a moving object stays in a geographic region for a while. • Formally, given a distance threshold 𝐷𝑚𝑎𝑥 and a time threshold 𝑇𝑚𝑖𝑛 • A stay point 𝑠𝑝 if 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑝𝑖,𝑝𝑘) ≤ 𝐷𝑚𝑎𝑥 (∀𝑘 ∈ [𝑖 + 1, 𝑗]), 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑝𝑖,𝑝𝑗+1)>𝐷𝑚𝑎𝑥 (if𝑗<𝑛) ,and|𝑝𝑗.𝑡−𝑝𝑖.𝑡|≥𝑇𝑚𝑖𝑛.The time interval of a 𝑠𝑝 is[𝑝𝑖.𝑡,𝑝𝑗.𝑡].
  • 11. 10 Overview of the Paper • Terminology  Delivery Trip • A delivery trip is a process that a courier delivers a batch of parcels to customers.  Stay Point • The location of a 𝑠𝑝 is estimated using its spatial centroid: • The time of a 𝑠𝑝 is defined as the middle point of its time interval: • Particularly, if a stay point is caused by a delivery, we call it a delivery caused stay point. In historical data, it can be identified by checking whether there is a parcel delivered during the time interval of the stay point based on the delivery time of the waybill.
  • 12. 11 Overview of the Paper • Problem Formulation  Stay point -> Delivery caused stay point • Reasons for this strategy • Short delivery stay: measurability (80% waybills, the delivery duration does not last for longer than 20 minutes ) • Anonymized shipping address: One or more parcels can be delivered at the same stay point. However, Geocoding anonymizes the detailed floor information of shipping addresses
  • 13. 12 Overview of the Paper • Problem Formulation  Stay point -> Delivery caused stay point • Given courier’s stay points 𝑆𝑃 = {𝑠𝑝𝑗 |𝑗 ∈ 1, ..,𝑚} detected from the trajectory of a delivery trip, and the waybills 𝑊 = {𝑤𝑖|𝑖 ∈ 1,...,𝑝} he/she completed in the trip, the objective is to match each waybill 𝑤𝑖 with its delivery caused stay point 𝑠𝑝 𝑗 .
  • 14. 13 Overview of the Paper • System Framework (The system framework of DTInf )  Data Pre-processing • DP takes couriers’ trajectories • 3 main tasks 1) Noise Filtering, which removes the outlier GPS points 2) Stay Point Detection, which detects all the stay points from the trajectories 3) Delivery Trip Identification, which separates waybills and stay points by the identified delivery trips
  • 15. 14 Overview of the Paper • System Framework (The system framework of DTInf )  Data Pre-processing • DP takes couriers’ trajectories • 3 main tasks 1) Noise Filtering, which removes the outlier GPS points • The error of 𝑝4 and 𝑝7 might be several hundred meters away from its true location. Such noise points would affect the quality of stay point detection. (Trajectory data mining by Yu Zheng)
  • 16. 15 Overview of the Paper • System Framework (The system framework of DTInf )  Data Pre-processing • DP takes couriers’ trajectories • 3 main tasks 2) Stay Point Detection • Based on the cleaned trajectories, we extract all stay points from them. We use stay points not only to infer the delivery time, but also to find the real delivery. locations. • 𝐷𝑚𝑎𝑥 =20𝑚 and 𝑇𝑚𝑖𝑛 =30𝑠.
  • 17. 16 Overview of the Paper • System Framework (The system framework of DTInf )  Data Pre-processing • 3 main tasks 3) Delivery Trip Identification • A trip begins when the number of receiving parcels stops increasing, and the number of delivering parcels begins to increase • A trip ends if the opposite condition holds.
  • 18. 17 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Location Correction • DLC takes historical waybills and stay points, and generates the location mapping from the Geocoded waybill location to the delivery location • 3 main steps 1) Inverted Indexing, which finds all historical delivery caused stay points for each Geocoded waybill location 2) Location Inference, which infers the raw delivery location based on the inverted index 3) Location Refinement, which refines the raw deliver -y location by merging it with its nearby delivery locations discovered by other Geocoded waybill locations
  • 19. 18 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Location Correction • It is difficult to set a global consistent judgment about which stay point might be the delivery caused stay point of a certain waybill if we treat the Geocoded waybill location as the delivery location. • Fortunately, because a customer might place orders multiple times using the same shipping address, a Ge ocoded waybill location might appear several times.
  • 20. 19 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Location Correction • Inspired by 2 insights 1) Multiple delivery caused stay points • It is noticeable that although those stay points are qui te close, there are still minor differences. If all stay poi nts are leveraged, the delivery location correction can be more accurate. 2) Redundant Geocoded waybill locations • It can be noticed that their delivery caused stay points have considerably large overlaps, which indicates they potentially correspond to the same delivery location.
  • 21. 20 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Location Correction • (a) shows the distribution of the number of delivery trips at a Geocoded waybill location duri- -ng a period of 15 months. It is noticeable that for 72% Geocoded waybill locations, there exist multiple deliveries. Besides, those locations can also appear in the future. • the Geocoded waybill locations of waybills in the previous 4 months cover more than 80% Geocoded waybill locations in the last month.
  • 22. 21 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Event-based Matching • Delivery Event Construction • We group waybills into several delivery events accordi -ng to their corrected delivery locations. • Then, in the later step, we can select the most probabl -e stay point for each delivery event based on its delivery location and assign that stay point to all waybills in that event.
  • 23. 22 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Event-based Matching • The reasons to perform the delivery event-level matc -hing for waybills in a trip are two folds: • Location by location delivery • main stay point : For a delivery location in each. trip, we can find a stay point that is the most frequently matched by waybills at that location. • If we perform the delivery event-level matching and correctly infer the main stay point for each delivery event, the time inference errors for waybills are acceptable.
  • 24. 23 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Event-based Matching • The reasons to perform the delivery event-level matc hing for waybills in a trip are two folds: • Correlations between delivery events and stay points • The parcels delivered at the same delivery location will affect the characteristics of the stay point. • Such characteristics cannot be captured if we perform the inference task for each waybill. individually.
  • 25. 24 Overview of the Paper • System Framework (The system framework of DTInf )  Delivery Event-based Matching • Stay Point Selection • Use a model to capture the correlation between delivery events and the main delivery caused stay points • Ultimately improve the inference accuracy • Binary classification problem • For a delivery event and one stay point in its neighbor- -hood, we extract features of them and predict wheth -er the stay point is the main delivery caused stay point of the delivery event. (MLP)
  • 26. 25 Overview of the Paper • System Framework (The system framework of DTInf )  Four types of features are extracted: • Location features: We obtain the POI category of the Geocoded waybill location via the reverse Geocoding service, which is encoded by the one-hot vector. • Delivery event features: We extract four aggregated information from the delivery events of waybills, namely, the number of waybills, number of customer s, total weight, and total volume. • Stay point features: We extract the duration and the area of the stay point. • Matching feature: The geographical distance between the centroid of the stay point and the delivery location is extracted.
  • 27. 26 Experiments • Datasets  The datasets contain trajectories and waybills of 5 couriers at a delivery station in Ton -gzhou District, Beijing over a period of about 15 months (from Apr. 12nd, 2018 to Jul. 7th, 2019).  Couriers’ trajectories • They are raw GPS logs generated by couriers’ PDAs, where each record contains a courier ID, a loc- -ation, and a timestamp. The average sampling time interval is 7.4 seconds. The datasets contain 5 .93 million GPS points.  Waybills • Each record contains a customer ID, a courier ID, parcel information (e.g., weight and volume), the time when the parcel is received, the time when the parcel is delivered, and a Geocoded waybill l ocation. The datasets contain 274 thousand waybills. Besides, there are 16 POI categories we obta ined via the reverse Geocoding service.
  • 28. 27 Experiments • Datasets  Baselines • There is no existing solution that can exactly tackle our problem. Therefore, 3 baselines are design -ed for comparison. • Random Inference (RDInf): We randomly select a stay point from each waybill’s neighborhood as its deli -very caused stay point. • Spatial Nearest Inference (SNInf): SNInf matches each waybill with its closest stay point in its neighborh- -ood. • Temporal Longest Inference (TLInf): TLInf selects the stay point in waybill’s neighborhood with the longe -st duration.  Evaluation Metrics • We use the accuracy, which is defined as the proportion of waybills whose corresponding delivery caused stay points are correctly classified (i.e., their inferred delivery times are accurate). • We also report the RMSE and the MAE based on the inferred delivery time and the time of deliver -y caused stay points.
  • 29. 28 Experiments • Datasets  Variants • We also compare DTInf with its three variants: • DTInf-nC: This variant does not correct the delivery locations. The model is trained based on the Geocod -ed waybill locations. • DTInf-nM: This variant corrects the locations, but it selects the stay point that is the closest to the correc -ted location. • DTInf-nE: This variant also corrects the location, but it does not construct delivery events. Instead, it infers delivery caused stay point for each waybill based on the same model, but the delivery event features are replaced with individual waybill features.
  • 30. 29 Experiments  Effectiveness Evaluation • Merging Distance Selection • The reason is that when 𝐷 becomes larger, re dundant Geocoded waybill locations are cor- rected to the same delivery locations, which makes the delivery event modeling more accu -rate. However, when 𝐷 is larger than 3m, the performance is degraded, because we might merge adjacent delivery locations by mistake. We also report the ratio between the number of detected delivery locations and Geocoded waybill locations (denoted as the compression rate) in the same figure.
  • 32. 31 Case Study • Red triangle – Geocoded location • Blue points – Centroids of stay points detected from the courier’s trajector y of the corresponding delivery trip. • The parcel is delivered in sp3. • Geocoded waybill location has been delivered multiple times in history-> It was possible to correct it.
  • 33. 32 System Deployment • Grey line : Courier’s trajectory • Blue circle – stay point centroids • Red triangle – Geocoded waybill locations • Green circle – After clicking Correct, the corrected delivery. locations are displayed with. green circles • Big red circles – querying neighb- -orhood. • Once the infer button’s clicked, a link is generated between the corrected location and the inferred delivery caused stay point.