SlideShare a Scribd company logo
Columbus
Map Creation Using GPS Locations
Introduction
Creating maps and depicting the real world entities on the map with accuracy has been a challenging
problem for years.

In Go-Jek (a ride-hailing company), we are using Google map APIs to calculate the shortest distance to be
travelled by the driver and to allocate the nearest available driver to the customer booking the ride. 

To create our own map and find the shortest paths, we have come up with “Columbus”.
Problem Statement
Creation of Map using driver GPS location pings, representing it as a directional graph and finding
the shortest path.
Data Collection and Data Visualisation
In Go-Jek, we receive driver location pings every 10 seconds. This enormous collection of pings is
injected into Kafka by the Driver Location Service. We have collected and processed the GPS
locations of drivers using Flink.

This collection of driver GPS locations is visualised as a layer on top of OSM (Open Street Map), this
shows the GPS points clustered along the roads.
Approach
A. Identify the clusters of the driver GPS
locations collection.

B. Calculate the centroid of these
clusters.

C. These centroids form the nodes of the
graph representation of map.

D. Detect the direction of the road by
analysing the history of driver location
pings.

E. Add the directional edges representing
the roads between the graph nodes.

F. Calculate the shortest path between
any two given nodes of the directional
graph which represents the map.
Data Challenges
• The GPS locations collection is humongous which takes significant amount of time to process.

• The accuracy of the GPS locations waver, hence the driver locations can be away from the actual road line.

• Identifying the clusters of these GPS locations after filtering the outlier locations.

• Calculating the centroid of the clusters of GPS locations and forming a graph of these centroids as nodes gives a large
interconnected mesh of nodes.

• The biggest challenge was in identifying the connections between the nodes which represent the roads and giving a
directional sense to these connections.

More Related Content

What's hot

Continuing Our Look At Primary And Secondary Data
Continuing Our Look At Primary And Secondary DataContinuing Our Look At Primary And Secondary Data
Continuing Our Look At Primary And Secondary Dataguest2137aa
 
The importance of map
The importance of mapThe importance of map
The importance of maparceyes
 
Web Based Gis For Land Administration 20090629
Web Based Gis For Land Administration 20090629Web Based Gis For Land Administration 20090629
Web Based Gis For Land Administration 20090629John Reitsma
 
Introduction to Nespak_2016
Introduction to Nespak_2016Introduction to Nespak_2016
Introduction to Nespak_2016Atiqa khan
 
Cartography lec #4
Cartography lec #4Cartography lec #4
Cartography lec #4Nazir Ahmed
 
Cartographic symbology. By proj jasper
Cartographic symbology. By proj jasperCartographic symbology. By proj jasper
Cartographic symbology. By proj jasperAgbajiJacob
 
10 using the web gis
10 using the web gis10 using the web gis
10 using the web gisRCREEE
 
The Mapping Network Lake Mapping
The Mapping Network Lake MappingThe Mapping Network Lake Mapping
The Mapping Network Lake MappingThe Mapping Network
 
Grounds 03 2008 1
Grounds 03 2008 1Grounds 03 2008 1
Grounds 03 2008 1bboliver
 

What's hot (20)

Introduction to Digital Maps
Introduction to Digital MapsIntroduction to Digital Maps
Introduction to Digital Maps
 
Continuing Our Look At Primary And Secondary Data
Continuing Our Look At Primary And Secondary DataContinuing Our Look At Primary And Secondary Data
Continuing Our Look At Primary And Secondary Data
 
The importance of map
The importance of mapThe importance of map
The importance of map
 
Web Based Gis For Land Administration 20090629
Web Based Gis For Land Administration 20090629Web Based Gis For Land Administration 20090629
Web Based Gis For Land Administration 20090629
 
Introduction to Nespak_2016
Introduction to Nespak_2016Introduction to Nespak_2016
Introduction to Nespak_2016
 
DSM Extraction from Pleiades Images Using RSP
DSM Extraction from Pleiades Images Using RSPDSM Extraction from Pleiades Images Using RSP
DSM Extraction from Pleiades Images Using RSP
 
UC2011
UC2011UC2011
UC2011
 
Cartography
CartographyCartography
Cartography
 
Google map
Google mapGoogle map
Google map
 
Maps and GIS
Maps and GISMaps and GIS
Maps and GIS
 
Cartography lec #4
Cartography lec #4Cartography lec #4
Cartography lec #4
 
Cartographic symbology. By proj jasper
Cartographic symbology. By proj jasperCartographic symbology. By proj jasper
Cartographic symbology. By proj jasper
 
Basics in Cartography
Basics in Cartography Basics in Cartography
Basics in Cartography
 
TOWN PLANNING
TOWN PLANNINGTOWN PLANNING
TOWN PLANNING
 
Digitalcartography
DigitalcartographyDigitalcartography
Digitalcartography
 
10 using the web gis
10 using the web gis10 using the web gis
10 using the web gis
 
Maps
MapsMaps
Maps
 
Cartography intro
Cartography introCartography intro
Cartography intro
 
The Mapping Network Lake Mapping
The Mapping Network Lake MappingThe Mapping Network Lake Mapping
The Mapping Network Lake Mapping
 
Grounds 03 2008 1
Grounds 03 2008 1Grounds 03 2008 1
Grounds 03 2008 1
 

Similar to Columbus - map creation using gps locations

the bike map - a look into a practical application of graph theory
the bike map - a look into a practical application of graph theorythe bike map - a look into a practical application of graph theory
the bike map - a look into a practical application of graph theoryCharlie Hsu
 
Madcom osp design & engineering capabilities
Madcom osp design  & engineering capabilitiesMadcom osp design  & engineering capabilities
Madcom osp design & engineering capabilitiesRich Frank
 
Fiber design efficiencies cost savings
Fiber design efficiencies cost savingsFiber design efficiencies cost savings
Fiber design efficiencies cost savingsRich Frank
 
2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)
2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)
2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)DriverExperience
 
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...thanhdowork
 
Optimizing GIS based Systems
Optimizing GIS based SystemsOptimizing GIS based Systems
Optimizing GIS based SystemsAjinkya Deshpande
 
SoTM US Routing
SoTM US RoutingSoTM US Routing
SoTM US RoutingMapQuest
 
Optimal City Routes (Postman Problem
Optimal City Routes (Postman ProblemOptimal City Routes (Postman Problem
Optimal City Routes (Postman ProblemStorm Meadows
 
Brochure_ESurveyCADD_2021-22.pdf
Brochure_ESurveyCADD_2021-22.pdfBrochure_ESurveyCADD_2021-22.pdf
Brochure_ESurveyCADD_2021-22.pdfMangeshKale47
 
CycleStreets main presentation to OSM State of the Map 2009
CycleStreets main presentation to OSM State of the Map 2009CycleStreets main presentation to OSM State of the Map 2009
CycleStreets main presentation to OSM State of the Map 2009CycleStreets
 
Making an Orienteering Map in Seven Basic Steps
Making an Orienteering Map in Seven Basic StepsMaking an Orienteering Map in Seven Basic Steps
Making an Orienteering Map in Seven Basic StepsGord Hunter
 
GISG 110 Final Report
GISG 110 Final ReportGISG 110 Final Report
GISG 110 Final Reportstevestr
 
Geo search introduction
Geo search introductionGeo search introduction
Geo search introductionkenshin03
 
The Impact of Always-on Connectivity for Geospatial Applications and Analysis
The Impact of Always-on Connectivity for Geospatial Applications and AnalysisThe Impact of Always-on Connectivity for Geospatial Applications and Analysis
The Impact of Always-on Connectivity for Geospatial Applications and AnalysisSingleStore
 

Similar to Columbus - map creation using gps locations (20)

the bike map - a look into a practical application of graph theory
the bike map - a look into a practical application of graph theorythe bike map - a look into a practical application of graph theory
the bike map - a look into a practical application of graph theory
 
Madcom osp design & engineering capabilities
Madcom osp design  & engineering capabilitiesMadcom osp design  & engineering capabilities
Madcom osp design & engineering capabilities
 
Fiber design efficiencies cost savings
Fiber design efficiencies cost savingsFiber design efficiencies cost savings
Fiber design efficiencies cost savings
 
2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)
2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)
2012 SNU Driver Experience: 특강_내비게이션의 이해(서동권이사님)
 
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
 
Optimizing GIS based Systems
Optimizing GIS based SystemsOptimizing GIS based Systems
Optimizing GIS based Systems
 
Citi Bike Finder
Citi Bike FinderCiti Bike Finder
Citi Bike Finder
 
SoTM US Routing
SoTM US RoutingSoTM US Routing
SoTM US Routing
 
ESRI Thursday
ESRI ThursdayESRI Thursday
ESRI Thursday
 
Presentation
PresentationPresentation
Presentation
 
Mahendran
MahendranMahendran
Mahendran
 
Optimal City Routes (Postman Problem
Optimal City Routes (Postman ProblemOptimal City Routes (Postman Problem
Optimal City Routes (Postman Problem
 
Analysing road traffic
Analysing road trafficAnalysing road traffic
Analysing road traffic
 
Brochure_ESurveyCADD_2021-22.pdf
Brochure_ESurveyCADD_2021-22.pdfBrochure_ESurveyCADD_2021-22.pdf
Brochure_ESurveyCADD_2021-22.pdf
 
CycleStreets main presentation to OSM State of the Map 2009
CycleStreets main presentation to OSM State of the Map 2009CycleStreets main presentation to OSM State of the Map 2009
CycleStreets main presentation to OSM State of the Map 2009
 
Making an Orienteering Map in Seven Basic Steps
Making an Orienteering Map in Seven Basic StepsMaking an Orienteering Map in Seven Basic Steps
Making an Orienteering Map in Seven Basic Steps
 
GISG 110 Final Report
GISG 110 Final ReportGISG 110 Final Report
GISG 110 Final Report
 
Geo search introduction
Geo search introductionGeo search introduction
Geo search introduction
 
The Impact of Always-on Connectivity for Geospatial Applications and Analysis
The Impact of Always-on Connectivity for Geospatial Applications and AnalysisThe Impact of Always-on Connectivity for Geospatial Applications and Analysis
The Impact of Always-on Connectivity for Geospatial Applications and Analysis
 
Presentation1
Presentation1Presentation1
Presentation1
 

Recently uploaded

Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesStarCompliance.io
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxDilipVasan
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单yhkoc
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sMAQIB18
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxStephen266013
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单ewymefz
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsCEPTES Software Inc
 

Recently uploaded (20)

Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 

Columbus - map creation using gps locations

  • 2. Introduction Creating maps and depicting the real world entities on the map with accuracy has been a challenging problem for years. In Go-Jek (a ride-hailing company), we are using Google map APIs to calculate the shortest distance to be travelled by the driver and to allocate the nearest available driver to the customer booking the ride. To create our own map and find the shortest paths, we have come up with “Columbus”.
  • 3. Problem Statement Creation of Map using driver GPS location pings, representing it as a directional graph and finding the shortest path.
  • 4. Data Collection and Data Visualisation In Go-Jek, we receive driver location pings every 10 seconds. This enormous collection of pings is injected into Kafka by the Driver Location Service. We have collected and processed the GPS locations of drivers using Flink. This collection of driver GPS locations is visualised as a layer on top of OSM (Open Street Map), this shows the GPS points clustered along the roads.
  • 5. Approach A. Identify the clusters of the driver GPS locations collection. B. Calculate the centroid of these clusters. C. These centroids form the nodes of the graph representation of map. D. Detect the direction of the road by analysing the history of driver location pings. E. Add the directional edges representing the roads between the graph nodes. F. Calculate the shortest path between any two given nodes of the directional graph which represents the map.
  • 6. Data Challenges • The GPS locations collection is humongous which takes significant amount of time to process. • The accuracy of the GPS locations waver, hence the driver locations can be away from the actual road line. • Identifying the clusters of these GPS locations after filtering the outlier locations. • Calculating the centroid of the clusters of GPS locations and forming a graph of these centroids as nodes gives a large interconnected mesh of nodes. • The biggest challenge was in identifying the connections between the nodes which represent the roads and giving a directional sense to these connections.