Points of Knowledge - Crowdsourcing Solutions to Improve Data Accuracy and Re-use in Kenya

Qiyang Xu
The Open Data 'Bazaar'
Crowdsourcing Solutions to Improve Data Accuracy and Re-use in Kenya

Qiyang Xu
qxu1@worldbank.org

Kenya Open Data Initiative

Three Questions about Open Data
Government (data collector) launches Open Data for the ‘crowds’: citizens, companies, civil society, etc.
  • Accuracy?
  • Inaccuracy for the efficacy of open data initiatives?
  • Leverage crowdsourcing to improve the validity?

Datasets
 Location Datasets
   • Kenya Primary School 2007
   • Health Facilities Kenya
   Both available on Kenya Open Data site (opendata.go.ke, 2012)
 Geospatial Information Datasets
   • Global Administrative Unit Layers (GAUL)
   • DigitalGlobe Global Basemap
   • Google Earth

Results
 Health Facilities
   • Total: 8,232
   • For validation: 4,867
   • Actual valid: 4,644
       • 4,203 in DigitalGlobe
       • 441 in Google Earth
 Primary Schools
   • Total: 31,229
   • For validation: 110 (random sampling)
   • Actual valid: 108
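
A quick arithmetic check of the figures on the Results slide, as a minimal sketch. It assumes that "Actual valid" counts the records confirmed out of those selected for validation; the slide does not define the term. The names `valid_share` and `draw_sample` are hypothetical, and the sampling helper only illustrates simple random sampling in general, not the procedure actually used for the 110 schools.

```python
import random

# Counts copied from the Results slide.
health = {"total": 8232, "for_validation": 4867, "valid": 4644}
schools = {"total": 31229, "for_validation": 110, "valid": 108}


def valid_share(counts):
    """Share of records selected for validation that were confirmed valid."""
    return counts["valid"] / counts["for_validation"]


def draw_sample(n_records, k, seed=0):
    """Simple random sample of k record indices without replacement.

    Hypothetical helper: the slide only says "random sampling".
    """
    rng = random.Random(seed)
    return rng.sample(range(n_records), k)


# The two basemap counts on the slide add up to the "Actual valid" figure.
basemap_total = 4203 + 441

health_share = valid_share(health)
school_share = valid_share(schools)
sample_ids = draw_sample(schools["total"], schools["for_validation"])

print(f"Health facilities confirmed: {health_share:.1%}")  # 95.4%
print(f"Primary schools confirmed: {school_share:.1%}")    # 98.2%
print(f"Sampled school records: {len(sample_ids)}")        # 110
```

Under that reading, roughly 95% of the health facilities put forward for validation and 98% of the sampled schools were confirmed.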
[Screenshot with callouts: Basemaps selection, Track changes, School information, School locations]

  • 7. Track changes: connect available data to ‘crowds’ for more powerful feedback
       Color of locator   Description
       Green              Original location given in the dataset
       Purple             New location added to the dataset
       Red                Proposed correction to the original location
       Yellow             Current location selected to be moved
  • 9. Let’s play…
       • Honor Board with highlighted contributors: last registered, most active, best contributor…
       • Measure global progress over time
       • Clear goals, easy to achieve
       • Editing locations and user interactions
  • 10. Future…
       • Multi-language interface
       • Community built on Points of Knowledge
       • User communication
       • Allow picture uploading and GPS position collection using mobile devices
       • …
  • 11. Principles
       • All datasets are fully open
       • Open source solution preferred
       • Collaborative development process
  • 12. Unleashing the ‘wisdom of crowds’. Innovation in Governance, World Bank Institute
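
The locator color legend on slide 7 could be modeled roughly as follows. This is a sketch under stated assumptions, not the Points of Knowledge implementation: the `Locator` fields, the `propose_correction` helper, the sample record, and the rule that corrections are only proposed against green (original) markers are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class LocatorColor(Enum):
    """Marker colors and meanings as listed in the slide 7 legend."""
    GREEN = "Original location given in the dataset"
    PURPLE = "New location added to the dataset"
    RED = "Proposed correction to the original location"
    YELLOW = "Current location selected to be moved"


@dataclass
class Locator:
    # Field names are hypothetical, chosen only for illustration.
    record_id: str
    lat: float
    lon: float
    color: LocatorColor


def propose_correction(original: Locator, lat: float, lon: float) -> Locator:
    """Return a red marker for a suggested fix, leaving the original untouched.

    Assumption: corrections are only proposed against original (green) markers.
    """
    if original.color is not LocatorColor.GREEN:
        raise ValueError("corrections are proposed against original locations")
    return Locator(original.record_id, lat, lon, LocatorColor.RED)


# Made-up record and coordinates, not taken from the Kenya datasets.
school = Locator("school-001", -1.2921, 36.8219, LocatorColor.GREEN)
fix = propose_correction(school, -1.2930, 36.8225)
print(fix.color.name, "-", fix.color.value)
```

Keeping the original marker alongside the proposed one mirrors the "track changes" idea: a correction is recorded as a separate suggestion rather than overwriting the published location.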