The rise of crowd-sourcing - how valuable data can we get out of VGI

Presentation by Grega Milčinski of Sinergise at CAPIGI 2011 in Amsterdam on 5 April 2011.

Published in: Technology

  1. The rise of crowd-sourcing - how valuable data can we get out of VGI
     Grega Milčinski, grega.milcinski@sinergise.com
  2. Contents
     • About Sinergise
     • VGI trends
     • Practical experiences
       - OpenStreetMap
       - Geopedia
     The rise of crowd-sourcing - how valuable data can we get out of VGI, CAPIGI 2011
  3. About Sinergise
  4. About Sinergise
  5. About Sinergise
     • 8 years in GIS
     • 20 people, mostly programmers
     • Governmental solutions
       - Agriculture: IACS
       - Real-estate management
     • We run GIS in Slovenia and are expanding abroad
       - 2008: IACS in Croatia
       - 2009: Real estate in Africa
       - 2010: IACS in Macedonia
     • VGI project Geopedia.si running since 2007
  6. Rise of Volunteered Geographic Information (VGI)
     Michael Jones, Google: The chief Internet evangelist at Google Inc., and one of the founding fathers of the Internet, says he’d like to see a geographic equivalent of Wikipedia - “Geopedia,” he dubs it - where anyone could add to the world’s geographic know-how.
     Jack Dangermond, ESRI: He worries that even the best-intentioned amateur could provide inaccurate data that could lead to a disaster. “Who wants to dig a hole and run into a pipe?” Dangermond asks. (GeoWeb, summer 2007)
  7. Short history of Wikipedia
     • January 2001: start
     • October 2005: criticism
       - “Broadly speaking, it's inaccurate and unclear.”
       - “I wouldn't have thought of using Wikipedia as a serious reference source”
     • December 2005: “Wikipedia is about as good a source of accurate information as Britannica” (Nature)
     • June 2008: “Encyclopaedia Britannica To Follow Modified Wikipedia Model”
     • June 2009: “Wikipedia, with a 97% share of the online encyclopedia market, has forced Microsoft to shut down Encarta.”
  8. Creating your own VGI system
     • idea
     • infrastructure
     • technology
     • basic content
     • rules
     • community
     • users
  9. Idea
     • What would we like to achieve?
     • Which data are we collecting?
     • The more focused the idea, the better the chances of success.
  10. Infrastructure
      • Requirements
        - Data storage (raster and vector data)
        - Application servers
        - Network
      • Possibilities
        - Set of own servers
        - Cloud (Google Maps, SimpleGeo, ...) + own servers
        - Cloud (Google MyMaps, Amazon, Geopedia.si, ...)
  11. Technology
      • Open source
        - GeoServer, OpenLayers, Drupal
        - PostGIS, MySQL
      • Minimum development
        - ESRI
        - Oracle, MS SQL
      • Custom software
  12. Basic content
      • Orientation
        - Topographic maps, satellite/aerial imagery
        - Sources: Google Maps (restrictions!) and similar, open data (NASA), government
      • Geo-location
        - Municipalities, street numbers
        - Sources: Google Maps (restrictions!), open data (Natural Earth), government
      • Topic data
        - Depends on the idea
  13. Rules
      • Lesson from Wikipedia
        - “Nature said its reviewers found that Wikipedia entries were often poorly structured” (December 2005)
      • Maps vs GIS
  14. Community
      • Wikipedia
        - 365 million visitors (2010)
        - 500,000 contributors
        - 2,000 power users (monthly activity)
      • Geopedia.si
        - 620,000 visitors (2010)
        - 15,000 contributors
        - 55 power users
  15. Users
      • Who will make use of the collected data?
      • How?
        - on-line viewer
        - web services
        - data export
      • Where will the data be maintained?
  16. Real-life experiences
      • OpenStreetMap
      • Geopedia.si
  17. OpenStreetMap
      • Idea: Creation of a free, editable map of the whole world (roads, buildings, etc.)
      • Infrastructure: Own servers
      • Technology: OpenLayers, PostgreSQL, other software (Potlatch, Maplink...)
      • Basic content: Yahoo! Aerial Imagery, some government data (US Gov, AND Holland, Ordnance Survey, ...)
      • Rules: Well-defined feature list
      • Community: >200,000 (end of 2009), 10% contributing the majority of the data
      • Users: Haiti earthquake, large number of websites (web services, tiles)
  18. OpenStreetMap – Haiti earthquake
  19. OpenStreetMap – Kibera slum
  20. Geopedia.si
      • Idea: Infrastructure for VGI systems, collection of all spatial data in Slovenia
      • Infrastructure: Own servers
      • Technology: Own software (Giselle), MySQL / PostgreSQL
      • Basic content: Topographic maps, aerial imagery, DMR (Slovenia), street numbers, other government data
      • Rules:
        - single-theme applications with strict business rules
        - editable spatial layers with pre-defined structure
        - creation of own layers
      • Community: 20,000, 5% contributing the majority of the data
      • Users: Illegal dump-sites, cyclists, mountaineering organization, encyclopedia of natural and cultural heritage, energy agency, ...
  21. Geopedia.si - SDI
      [Diagram: a common access point service, based on industry standard OGC protocols, connecting Home Office, MOD, Inland Revenue, Secret Service, Cabinet Office, DEFRA, Emergency Services, Private Sector, Customs & Excise, Police, Coast Guard, Defence Estates, DSTL, Immigration Service and Civil Aviation Authority]
  22. Geopedia.si – cycling
  23. Geopedia.si – cycling
      • 800 paths
      • 25,000 km
  24. Geopedia.si – illegal dump-site registry
  25. Geopedia.si - floods
  26. Geopedia.si – stats
      • 150,000 monthly visits (3 million page views)
      • average time on site: 8 minutes
      • 6,000 spatial layers (1,000 marked as “good quality”)
      • 10 million spatial entries
      • (only in Slovenia)
  27. Comparison to non-VGI
      Michael Jones, Google: The chief Internet evangelist at Google Inc., and one of the founding fathers of the Internet, says he’d like to see a geographic equivalent of Wikipedia - “Geopedia,” he dubs it - where anyone could add to the world’s geographic know-how.
      Jack Dangermond, ESRI: He worries that even the best-intentioned amateur could provide inaccurate data that could lead to a disaster. “Who wants to dig a hole and run into a pipe?” Dangermond asks. (GeoWeb, summer 2007)
  28. Comparison to non-VGI
      • LPIS
      • Land Cadastre
  29. LPIS
      • Government operated, based on farmers’ input
      • Used for distribution of agriculture subsidies
  30. LPIS – strict control
  31. LPIS - Quality
      • Theoretically not possible to meet EU requirements in some cases (max 3% of area difference).
  32. Land Cadastre
      • Government operated, based on field survey
      • Used for land administration, taxation
  33. Land Cadastre – strict control
  34. Land Cadastre – quality
      • Not good enough as a base for LPIS (in Slovenia and some other cases)
  35. Conclusion
      • VGI does not produce perfect results
        - Neither do “professional” systems
      • Good enough for practical purposes
      • Improving in the future (mobile GIS, etc.)
  36. Thank you
      Questions?
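
The 3% area tolerance on the LPIS quality slide can be made concrete with a little arithmetic. The Python sketch below is an illustration only and not part of the presentation: it assumes the tolerance is read as the relative difference between a recorded parcel area and a reference measurement, and the function names, the square parcels and the 0.5 m boundary offset are all hypothetical. Under those assumptions it hints at why small parcels may miss the limit even when digitised carefully.

```python
def relative_area_difference(recorded_m2: float, reference_m2: float) -> float:
    """Relative difference between a recorded area and a reference area.

    Assumption: the tolerance is expressed as a fraction of the reference area.
    """
    if reference_m2 <= 0:
        raise ValueError("reference area must be positive")
    return abs(recorded_m2 - reference_m2) / reference_m2


def within_tolerance(recorded_m2: float, reference_m2: float,
                     max_fraction: float = 0.03) -> bool:
    """True if the recorded area is within max_fraction (3% by default)."""
    return relative_area_difference(recorded_m2, reference_m2) <= max_fraction


def square_area_with_offset(side_m: float, offset_m: float) -> float:
    """Area of a square parcel whose every edge is drawn offset_m too far out.

    Hypothetical error model, used only to illustrate the effect of parcel size.
    """
    return (side_m + 2 * offset_m) ** 2


if __name__ == "__main__":
    offset = 0.5  # hypothetical boundary error in metres
    for side in (20.0, 200.0):
        reference = side ** 2
        recorded = square_area_with_offset(side, offset)
        diff = relative_area_difference(recorded, reference)
        print(f"{side:.0f} m parcel: {diff:.2%} difference, "
              f"within 3%: {within_tolerance(recorded, reference)}")
```

With the same 0.5 m offset on every edge, the hypothetical 20 m parcel is off by about 10% while the 200 m parcel stays near 1%, so a fixed percentage limit is much harder to meet for small parcels.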
