0
Managing Data Quality in         OpenStreetMapTOOLS FOR AN ACTIVEMAPPING COMMUNITYNC GIS CONFERENCE 2013    This document ...
Overview                            2 The Short History of the OpenStreetMap   Revolution Assessing Open Source Data Qua...
Overview: Key Questions                                    3 How can crowd-sourced projects manage data   quality effecti...
OpenStreetMap is…                                 4 A freely-editable map of the world   unconstrained by proprietary own...
The Origins of OpenStreetMap                              5 OpenStreetMap.org domain registered by Steve  Coast in 2004 ...
OpenStreetMap is…                                 6 A freely-editable map of the world   unconstrained by proprietary own...
Looks like…a wiki                                 7NC GIS Conference 2013                       23 February 2013
Wiki-based Documentation!                         8NC GIS Conference 2013                                         23 Febru...
Milestones in OpenStreetMap History                             9 2004 - OpenStreetMap.org registered by Steve Coast 200...
OpenStreetMap User Growth                                          10One million registered users worldwide! NC GIS Confer...
OpenStreetMap Growth in User Edits                         11NC GIS Conference 2013                 23 February 2013
OpenStreetMap Database Growth                           12NC GIS Conference 2013                  23 February 2013
Data Quality in Crowd-sourced Projects                                                            13 Goodchild & Li: Iden...
Crowd-sourced Approach to Data Quality                                                        14 Based on Surowiecki’s “W...
Social Approach to Data Quality                             15 Through practices, users acquire reputations Users with g...
Geographic Tools for Data Quality                                   16 Geographic approach draws on formal geographic   t...
A Quick Survey of Data Quality Tools                               17 Two types of tools are in widespread use:      Err...
Error Detection Tools: Keep Right                             18NC GIS Conference 2013                      23 February 2013
Error Detection Tools: Map Dust                             19NC GIS Conference 2013                     23 February 2013
Error Detection Tools: OpenStreetBugsNC GIS Conference 2013                 23 February 2013
Error Detection Tools: No Name                             21NC GIS Conference 2013                     23 February 2013
Error Detection Tools: MapRoulette                           22NC GIS Conference 2013                    23 February 2013
Monitoring Tools                                23NC GIS Conference 2013                      23 February 2013
Monitoring Tools: OpenStreetMap Watch List                  (OWL)                         24NC GIS Conference 2013        ...
Monitoring Tools: GeoFabrik Map Compare                         25NC GIS Conference 2013           23 February 2013
Monitoring Tools: Who Did It                               26NC GIS Conference 2013                           23 February ...
Monitoring Tools: ITO TIGER Reviewed                         27NC GIS Conference 2013              23 February 2013
Monitoring Tools: ITO TIGER Reviewed                         28NC GIS Conference 2013              23 February 2013
Monitoring Tools: Green Means Go                          29NC GIS Conference 2013                  23 February 2013
Monitoring Tools: Who’s Around Me                          30NC GIS Conference 2013                  23 February 2013
Social Controls                                31 OpenStreetMap - Data Working Group (DWG)   Resolving disputes between ...
How do Social Methods Treat Vandalism?                                32 OpenStreetMap is not immune from malicious inten...
Summary Review                                 33 Three methods for data quality control   Crowd-sourced   Social   Ge...
Lessons Learned about OSM Data Quality                                                       34 Successive editing by mul...
Thank You                                                                   35 Questions? Steven Johnson   (e) stevejoh...
Upcoming SlideShare
Loading in...5
×

OpenStreetMap Data Quality

985

Published on

A survey of data quality tools for OpenStreetMap

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
985
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
29
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "OpenStreetMap Data Quality"

  1. 1. Managing Data Quality in OpenStreetMapTOOLS FOR AN ACTIVEMAPPING COMMUNITYNC GIS CONFERENCE 2013 This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/
  2. 2. Overview 2 The Short History of the OpenStreetMap Revolution Assessing Open Source Data Quality Overview of Tools Creating Tools that MatterNC GIS Conference 2013 23 February 2013
  3. 3. Overview: Key Questions 3 How can crowd-sourced projects manage data quality effectively? What tools exist for monitoring data quality in OpenStreetMap? What conclusions can be drawn about existing tools? What is the future of data quality in crowd-sourced projects?NC GIS Conference 2013 23 February 2013
  4. 4. OpenStreetMap is… 4 A freely-editable map of the world unconstrained by proprietary ownership “Wikipedia for maps”NC GIS Conference 2013 23 February 2013
  5. 5. The Origins of OpenStreetMap 5 OpenStreetMap.org domain registered by Steve Coast in 2004 Project originated in the United Kingdom, where…  Crown copyright on geospatial data  Little, or no public domain data Simple goal to create a free, publicly-available database of street centerlinesNC GIS Conference 2013 23 February 2013
  6. 6. OpenStreetMap is… 6 A freely-editable map of the world unconstrained by proprietary ownership “Wikipedia for maps”NC GIS Conference 2013 23 February 2013
  7. 7. Looks like…a wiki 7NC GIS Conference 2013 23 February 2013
  8. 8. Wiki-based Documentation! 8NC GIS Conference 2013 23 February 2013
  9. 9. Milestones in OpenStreetMap History 9 2004 - OpenStreetMap.org registered by Steve Coast 2005 – Map Limehouse, 1st OpenStreetMap mapping party 2005 – 1000 registered OpenStreetMap users 2006 – OpenStreetMap Foundation established 2007 – 5 million ways in OSM database 2007 – 10,000 registered OpenStreetMap users 2008 - TIGER data import for the US completed 2009 - 100,000 registered OpenStreetMap users 2010 - 200,000 registered OpenStreetMap users 2012 – ~670,000 registered OpenStreetMap usersNC GIS Conference 2013 23 February 2013
  10. 10. OpenStreetMap User Growth 10One million registered users worldwide! NC GIS Conference 2013 23 February 2013
  11. 11. OpenStreetMap Growth in User Edits 11NC GIS Conference 2013 23 February 2013
  12. 12. OpenStreetMap Database Growth 12NC GIS Conference 2013 23 February 2013
  13. 13. Data Quality in Crowd-sourced Projects 13 Goodchild & Li: Identified three mechanisms for Quality Assurance  Crowd-sourcing  Social  GeographicGoodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information."Spatial Statistics 1 (2012): 110-120.NC GIS Conference 2013 23 February 2013
  14. 14. Crowd-sourced Approach to Data Quality 14 Based on Surowiecki’s “Wisdom of the Crowd”  Multiple users converge around consensus solutions that might escape an individual  Many independent observations reinforce the validity of a single observation  Concurrence on observed features (e.g. “It’s a bridge.”)  Convergence on the truth  The group validates observations & corrects errors Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York.NC GIS Conference 2013 23 February 2013
  15. 15. Social Approach to Data Quality 15 Through practices, users acquire reputations Users with good reputations are trusted Trust and reputation are indicators of stewardship As the project evolves, social leadership becomes more formalized. The Data Working Group of OpenStreetMap fullfills this function Email lists supplement social stewardshipNC GIS Conference 2013 23 February 2013
  16. 16. Geographic Tools for Data Quality 16 Geographic approach draws on formal geographic theory:  Spatial neighbors & auto-correlation (Moran statistics)  Christaller’s Central Place Theory  Descriptive Statistics  Inferential Statistics & Analysis of Variance (ANOVA)  Richardson plots of linear measurements  Cluster analysis, e.g. k-means These approaches have not been widely adopted for use in the OpenStreetMap project…yetNC GIS Conference 2013 23 February 2013
  17. 17. A Quick Survey of Data Quality Tools 17 Two types of tools are in widespread use:  Error Detection Tools  Monitoring ToolsNC GIS Conference 2013 23 February 2013
  18. 18. Error Detection Tools: Keep Right 18NC GIS Conference 2013 23 February 2013
  19. 19. Error Detection Tools: Map Dust 19NC GIS Conference 2013 23 February 2013
  20. 20. Error Detection Tools: OpenStreetBugsNC GIS Conference 2013 23 February 2013
  21. 21. Error Detection Tools: No Name 21NC GIS Conference 2013 23 February 2013
  22. 22. Error Detection Tools: MapRoulette 22NC GIS Conference 2013 23 February 2013
  23. 23. Monitoring Tools 23NC GIS Conference 2013 23 February 2013
  24. 24. Monitoring Tools: OpenStreetMap Watch List (OWL) 24NC GIS Conference 2013 23 February 2013
  25. 25. Monitoring Tools: GeoFabrik Map Compare 25NC GIS Conference 2013 23 February 2013
  26. 26. Monitoring Tools: Who Did It 26NC GIS Conference 2013 23 February 2013
  27. 27. Monitoring Tools: ITO TIGER Reviewed 27NC GIS Conference 2013 23 February 2013
  28. 28. Monitoring Tools: ITO TIGER Reviewed 28NC GIS Conference 2013 23 February 2013
  29. 29. Monitoring Tools: Green Means Go 29NC GIS Conference 2013 23 February 2013
  30. 30. Monitoring Tools: Who’s Around Me 30NC GIS Conference 2013 23 February 2013
  31. 31. Social Controls 31 OpenStreetMap - Data Working Group (DWG)  Resolving disputes between users  Processes & protocols for data imports  Investigates copyright infringement  Deals with issues of vandalism and fraud  Suspends or closes user accounts (in case of abuse)  IP blocking (in case of abuse)NC GIS Conference 2013 23 February 2013
  32. 32. How do Social Methods Treat Vandalism? 32 OpenStreetMap is not immune from malicious intent  Copyright infringement (e.g. copying from Google Maps)  Graffiti  Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)  Spam Tools for Managing Vandalism  Detect using daily diffs  UserActivity – batch comparison of two versions of the database  Revert – undo changeset to previous version  Virtual BanNC GIS Conference 2013 23 February 2013
  33. 33. Summary Review 33 Three methods for data quality control  Crowd-sourced  Social  Geographic OpenStreetMap has crowd-sourced and social tools for managing data quality  Error & Monitoring tools  Data Working Group - Social Geographic methods are experimental at this time Increasingly complete geographic features will lead to better toolsNC GIS Conference 2013 23 February 2013
  34. 34. Lessons Learned about OSM Data Quality 34 Successive editing by multiple users can improve accuracy…up to a point  Haklay suggests that few improvements are made beyond the 13th edit  Semantic differences are not easy to resolve – “Tag wars”  Obscure edits do not always get corrected if there are no local mappers that take ownership Social approaches will acquire more authority  Are part-time, volunteer staffers enough to guarantee data quality?  What are appropriate metrics for trust and reputation? Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703gNC GIS Conference 2013 23 February 2013
  35. 35. Thank You 35 Questions? Steven Johnson  (e) stevejohnson@deloitte.com  (t) @geomantic This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/NC GIS Conference 2013 23 February 2013
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×