OpenStreetMap address base:
ready for prime time?
Maxim Dubinin
sim@gis-lab.info
SotM Baltics 2013
2 из 24
3 из 24
Can OpenStreetMap address database be
used to create complete geographic
datasets?
The Question
4 из 24
● Creation of data layers for different
features
● Large areas (Russia)
● Thousands objects
● Practical applicatio...
5 из 24
1. How good is OSM address database and
fully automatic geocoding?
2. How much does postprocessing help?
3. How is...
6 из 24
● What are the mistakes of geocoding and
how it can be improved?
● What is the right scheme for addressing?
● When...
7 из 24
Result — correct lat/long for an address
Result ~ data preparation + geocoding +
postprocessing
● Data prep — make...
8 из 24
● OpenPolice — where are the local cops in
Moscow
● Elections — where are the voting stations in
Moscow
● Orphanag...
9 из 24
1.How good is OSM address database
and fully automatic geocoding?
2.How much does postprocessing help?
3.How is qu...
10 из 24
OpenPolice
● Extract all addresses from 112.ru
● Geocode them
● Relate them to buildings in Moscow to get
areas o...
11 из 24
Results
● Total: ~41000 addresses in Moscow
12 из 24
1.How good is OSM address database and
fully automatic geocoding?
2.How much does postprocessing
help?
3.How is q...
13 из 24
Voting comissions
● Extract all addresses from public database
● Geocode them
● Crowdsource post-processing
http:...
14 из 24
Results
● Total: ~3500 addresses in Moscow
● Before post-processing VS after
post-processing
15 из 24
1.How good is OSM address database and
fully automatic geocoding?
2.How much does postprocessing help?
3.How is c...
16 из 24
Orphanages
● Extract all addresses from public database
● Geocode and post-process them
● All regions of Russia, ...
17 из 24
Orphanages
● Buildings before and after post-proc, % total
18 из 24
Orphanages
● Buildings and streets before and after post-proc,
% total
19 из 24
1.How good is OSM address database
and fully automatic geocoding?
2.How much does postprocessing
help?
3.How is q...
20 из 24
No project, just comparison
● Take few hundreds of addresses in different
parts of Russia
● Geocode them with OSM...
21 из 24
OSM vs Yandex
● Summed scores for geocoding accuracy
22 из 24
Yandex
● Yandex People's map contribution to total score
23 из 24
● Map more ;)
● Improve automatic geocoding
● Create positive feedback loop with
geocoding projects
How to get be...
24 из 24
https://github.com/simgislab/osmaddress-sotmbaltics13
Sources for this presentation
Upcoming SlideShare
Loading in...5
×

OpenStreetMap address base: ready for prime time?

2,697

Published on

Presentation by Maxim Dubinin at SotM Baltics, 2013. Tartu, Estonia.

Published in: Technology, Travel
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
2,697
On Slideshare
0
From Embeds
0
Number of Embeds
14
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

OpenStreetMap address base: ready for prime time?

  1. 1. OpenStreetMap address base: ready for prime time? Maxim Dubinin sim@gis-lab.info SotM Baltics 2013
  2. 2. 2 из 24
  3. 3. 3 из 24 Can OpenStreetMap address database be used to create complete geographic datasets? The Question
  4. 4. 4 из 24 ● Creation of data layers for different features ● Large areas (Russia) ● Thousands objects ● Practical applications Setup
  5. 5. 5 из 24 1. How good is OSM address database and fully automatic geocoding? 2. How much does postprocessing help? 3. How is completeness distributed across Russia? 4. How good is the quality compared to other geocoders? You will find answers here for...
  6. 6. 6 из 24 ● What are the mistakes of geocoding and how it can be improved? ● What is the right scheme for addressing? ● When will OSM take over the world? ...but, nothing about...
  7. 7. 7 из 24 Result — correct lat/long for an address Result ~ data preparation + geocoding + postprocessing ● Data prep — make well structured address ● Geocoding — find lat/long for it with osm.org.ru ● Postprocessing — fix it manually if wrong Some definitions
  8. 8. 8 из 24 ● OpenPolice — where are the local cops in Moscow ● Elections — where are the voting stations in Moscow ● Orphanages — where are the children orphanages in Russia Examples
  9. 9. 9 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is quality distributed across Russia? 4.How good is the quality compared to other geocoders? Question 1
  10. 10. 10 из 24 OpenPolice ● Extract all addresses from 112.ru ● Geocode them ● Relate them to buildings in Moscow to get areas of responsibility http://gis-lab.info/qa/openpolice.html
  11. 11. 11 из 24 Results ● Total: ~41000 addresses in Moscow
  12. 12. 12 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is quality distributed across Russia? 4.How good is the quality compared to other geocoders? Question 2
  13. 13. 13 из 24 Voting comissions ● Extract all addresses from public database ● Geocode them ● Crowdsource post-processing http://uikgeo.gis-lab.info
  14. 14. 14 из 24 Results ● Total: ~3500 addresses in Moscow ● Before post-processing VS after post-processing
  15. 15. 15 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is completeness distributed across Russia? 4.How good is quality compared to other geocoders? Question 3
  16. 16. 16 из 24 Orphanages ● Extract all addresses from public database ● Geocode and post-process them ● All regions of Russia, ~5000 orphanages total, mean 50 per region http://gis-lab.info/qa/detdom.html
  17. 17. 17 из 24 Orphanages ● Buildings before and after post-proc, % total
  18. 18. 18 из 24 Orphanages ● Buildings and streets before and after post-proc, % total
  19. 19. 19 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is quality distributed across Russia? 4.How good is the quality compared to other geocoders? Question 4
  20. 20. 20 из 24 No project, just comparison ● Take few hundreds of addresses in different parts of Russia ● Geocode them with OSM and Yandex ● For each point, assign score: Building = 3, street = 2, settlement = 1 ● Sum the scores up ● Compare
  21. 21. 21 из 24 OSM vs Yandex ● Summed scores for geocoding accuracy
  22. 22. 22 из 24 Yandex ● Yandex People's map contribution to total score
  23. 23. 23 из 24 ● Map more ;) ● Improve automatic geocoding ● Create positive feedback loop with geocoding projects How to get better?
  24. 24. 24 из 24 https://github.com/simgislab/osmaddress-sotmbaltics13 Sources for this presentation
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×