Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

StatMine (New Technologies and Techniques for Statistics)

564 views

Published on

Published in: Technology
  • Be the first to comment

  • Be the first to like this

StatMine (New Technologies and Techniques for Statistics)

  1. 1. StatMine – prototype 0.2 Edwin de Jonge, Jan van der Laan & Jessica Solcer Statistics Netherlands (CBS) NTTS 2013, March 6 2013
  2. 2. StatMine Goal: Improve use figures Statistics Netherlands How: Add Analysis layer to OutputDB (StatLine) Working approach: • • • • Formulate improvement Develop software prototype Test prototype on (real) users Evaluate But why? StatMine 2
  3. 3. Mission SN “The mission of Statistics Netherlands is to publish reliable and coherent statistical information that meets the needs of society” (source: www.cbs.nl) StatMine 0.2 3
  4. 4. Mission SN “The mission of Statistics Netherlands is to publish reliable and coherent statistical information that meets the needs of society” (source: www.cbs.nl) StatMine 0.2 4
  5. 5. Evidence-based policy 5
  6. 6. What is the state of the Netherlands? StatLine contains over 1.000.000.000 figures! StatMine 6
  7. 7. Problem 1 Figures ≠ Information StatMine 7
  8. 8. 1. Figures ≠ Information We know (from user study): • Some important user don’t get the most out of StatLine: • Data journalists • Policy makers • They don’t find and see interesting information, because of tabular presention (data = table) StatMine 0.2 8
  9. 9. Solution 1 Visualize data! StatMine 9
  10. 10. Problem 2. Fragmented information StatMine 10
  11. 11. 2. Fragmented information For policy makers and journalist most information in OutputDB is fragmented: • Users need to combine fragments from different statistics • Diabetes (insuline usage, hospital admissions, mortality, visits to doctor, obesity) • Energy consumption vs economic growth • Income vs economic growth • (Perceived) public safety vs registered crimes StatMine 0.2 11
  12. 12. 2. Solution: Let users combine tables (even if we wouldn’t …) StatMine 12
  13. 13. Prototype StatMine 0.2 Implements: • Visual interactive data browsing • Combining fragments of different tables Tested on: • 40 SN employees (++) • 40 policy makers (++) StatMine 0.2 13
  14. 14. Line chart Bar chart - Show development - Compare Bubble/scatter chart Mosaic chart - Show correlation - Show structure StatMine 0.2 14
  15. 15. Small multiples StatMine 0.2 15
  16. 16. StatMine 16
  17. 17. Technical HTML5 JSON R JavaScript CSS SVG • Runs on desktop • makkelijk over te zetten naar webserver StatMine 0.2 17
  18. 18. Currently (2013) • All Official Statistics have confidence interval. • StatMine 0.3 will test if showing uncertainty improves/changes understanding of (quality of) figures. • May lead to publishing interval estimates (in stead of point estimates). StatMine 18
  19. 19. Conclusion • Visual data browsing is promising for • Our own statisticians (quality control) • External policy makers and journalists • Using real end users for testing is very helpful: • Lots of suggestions for improvement from users • Users feel involved in innovation process of NSI StatMine 19

×