Data collection

1,056 views

Published on

Published in: Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,056
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Data collection

  1. 1. BY :LISSY VERMASHRADDHA GUPTA
  2. 2.  Data Collection  ODK : Open Data Kit  Demo Usher : Improving Data Quality  Purpose  Implementation  Results
  3. 3.  Data collection in developing areas is difficult. None of existing tools suffice. Based on need, new features are needed.
  4. 4.  ODK is a tool suite for collection and management of data on mobile phones. The main objective is to provide open source tools.
  5. 5.  ODK COLLECT  Collects Data ODK AGGREGATE  Store Data, view and export. ODK MANAGE  Remote Device Management
  6. 6.  AMPATH deployed the ODK for data collection for medical purpose. Deployment was found to be successful minimizing delays and improving lives of healthcare workers and other people.
  7. 7.  Expertise in form design Double Entry : Costly Data Cleaning
  8. 8. Constraints Combo-boxes.Reduce Time Automatically filled Leave-forms.
  9. 9. ESCORTER : Guide towards correct entries. Question Ordering in form.  Greedy Information Gain Dynamically Reorder Questions Predict Errors to Re-ask.  Contextualized Error Likelihood Principle.
  10. 10.  Concept : An unscrupulous door-to-door surveyor Shirks Work, ask only important questions.  Greedy Information Gain Uniform Prior : Equal likely inputs  Training Set Context – specific Model Required Bayesian Learning
  11. 11.  The patient dataset collected at a rural HIV/AIDS clinic at Tanzania. Survey dataset, responses from 1986 poll about race and politics
  12. 12. Bayesian Network for the patient dataset
  13. 13. Question layout generated by the algorithm
  14. 14. Approximates Double Entry Uncertainty : High Entropy Outliers
  15. 15.  Due to digital divide between the developing and developed areas, it is very difficult to collect and use data in the developing regions. The main problems being : Lack of reliable infrastructure, Proper connectivity, and, Inadequate expertise. Currently available tools for data collection like Pedragon Forms, Nokia Data Gathering, Java-Rosa, RapidSMS etc. are difficult to deploy, hard to use, complicated to scale and rarely customizable.
  16. 16.  The Open Data Kit or simply ODK is a suite of tools for data collection that uses Google’s Android platform. The main objectives of the technology are : Modularising and customising tools Use of open interfaces and standards Long time survival of tools. The three components of ODK are: 1. ODK Collect : collects data using Forms. 2. ODK Aggregate : ready to deploy online repository to store, view and export collected data. 3. ODK Build : enables users to generate forms. 4. ODK Voice : maps Forms to sound snippets. 5. ODK Clinic : mobile medical record system. 6. ODK Manage : maintains database of all phones for remote device management 7. ODK Validate : validates Form. Other tools being ODK Dropbox, ODK Rangefinder, ODK Tasks, ODK Listen and ODK Visualise.

×