Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Data for Good August 2017 - Peter Bull of DrivenData on Crowdsourcing Data for Good


Published on

Presentation given by Peter Bull of DrivenData at the August 2017 Data for Good meetup in San Francisco.

Crowdsourcing Data for Good: Lessons from the social sector and how to get involved

Just like every major corporation today, nonprofits and governments have more data than ever before. And just like those corporations, they are eager to tap into the power of their data. But the social sector doesn’t have the same resources to attract talent. Jeff Hammerbacher put it best: “The best minds of my generation are thinking about how to make people click ads. That sucks.” At DrivenData our goal is to make the world suck a little less by empowering impact organizations to get the most from their data. Peter Bull, co-founder at DrivenData, will speak on the ways in which statistics, computer science, and machine learning can be applied to the challenges in the social sector. The talk has two parts: the first is the big-picture context of the data for good movement, how to get involved. The second is an in-depth case study of the methods which won DrivenData’s recent machine learning competitions.

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

Data for Good August 2017 - Peter Bull of DrivenData on Crowdsourcing Data for Good

  1. 1. Crowdsourcing Data for Good Data for Good Meetup Domino Data Labs – 30 August 2017
  2. 2. About me Philosopher then Software Engineer then Data Scientist Co-founder of DrivenData @pjbull | @drivendataorg
  3. 3. Agenda Data and the social sector The crowdsourcing approach Case studies How to get involved
  4. 4. What is data for good?
  5. 5. Hackers (1995) 6.2/10 stars on IMDB
  6. 6. That’s a beautiful vision, but more often it feels like…
  7. 7. Transformers: Age of Extinction (2014) 5.8/10 stars on IMDB Simple coding…ALGORITHMS!MATH!Why can’t we make what we want to make the way we want to make it?
  8. 8. But, for social sector organizations, the reality is…
  9. 9. Why?
  10. 10. THE DATA CAPACITY GAP $135,000 average salary executive director US nonprofit 140k – 180k shortage of data scientists $118,700 average salary of data scientist
  11. 11. “Finding ways to make big data useful to humanitarian decision makers is one of the great challenges and opportunities of the network age.” UN Office for the Coordination of Humanitarian Affairs
  12. 12. “The best minds of my generation are thinking about how to make people click ads… That sucks.” Jeff Hammerbacher, Former Data Manager, Facebook
  13. 13. How can we help?
  14. 14. ?
  15. 15. Benefits of competitions
  16. 16. Two-way Visibility
  17. 17. Community Building
  18. 18. Accuracy Matters
  19. 19. Fun
  20. 20. What can non-profits do with data?
  21. 21. Measure Impact!
  22. 22. DATA FOR ACTION Collection Plan Data Collection Data Storage Data Analysis Results DATA FOR IMPACT data scientist domain expert data scientist domain expert
  23. 23. There’s more to data for good than impact measurement.
  24. 24. What can nonprofits do with data today?
  25. 25. Education Resource Strategies Budget We are budget- ing our things $ Expenditures Monies for making students good Beware there be dollars here
  26. 26. PETRO-VEND FUEL AND FLUIDS MAINT MATERIALS SATELLITE COOK UPPER EARLY INTERVENTION PROGRAM 4-5 Regional Playoff Hosts Supp.- Materials ITEMGH EXTENDED DAY FURNITURE AND FIXTURES NON-CAPITALIZED AV Water and Sewage * Instructional Materials Food Services - Other Costs Capital Assets - Locally Defined Groupings
  27. 27. Regular * TEACHE R Special Instruction TCHR, SCNDY MATH Certificated Employees Salaries And Wages IN OTH CERTIFICAT ED PERSON 1 Disadvantage d Youth * $104,928.19 Title I - Disadvantaged Children/Target ed Assistance BUILDING ALLOCATION S LABEL COLUMN Function Sharing Object_Type Student_Type Position_Type MOST LIKELY Teacher Compensation School Reported Base Salary/Compensatio n Poverty Teacher 2ND LIKELY Food Services School on Central Budgets Benefit Unspecified Coordinator/Manager 3RD LIKELY NO_LABEL NO_LABEL Supplies/Materials NO_LABEL NO_LABEL New Data Predictions!
  28. 28. Education Resource Strategies Saving about 300 staff-hours per year
  29. 29. Data for Action Use 1: Automation
  30. 30. Data for Action Use 2: Smarter Strategy
  31. 31. 4 years of history 8,000 locations 315,000 violations
  32. 32. • RATINGS • REVIEWS • Date of the review • Free form text • CATEGORIES • Cuisine • Format • NEGHBORHOODS • CHECKINS
  33. 33. "The muffins are great...esp the blueberry! I have never had that good a blueberry muffin...its not super sickey sweet like most...."
  34. 34. "Food was soggy and cold by the time I got it and they messed up my order. Better food and better service from other places nearby."
  35. 35. NEXT NEXT NEXT
  36. 36. Data for Action Use 3: Better Targeting
  37. 37. DATA FOR ACTION Data Analysis Results > automation > smarter strategy > better targeting
  38. 38. What can I do?
  39. 39.
  40. 40. competitions
  42. 42.
  43. 43. What else can I do?
  44. 44. 1. Join data for good groups
  45. 45.
  46. 46.
  47. 47. 2. Do data for good in your spare time
  48. 48.
  49. 49.
  50. 50.
  51. 51. 3. Participate in a fellowship
  52. 52.
  53. 53.
  54. 54.
  55. 55.
  56. 56.
  57. 57. 4. Attend data for good events
  58. 58.
  59. 59.
  60. 60.
  61. 61. 5. Get involved professionally
  62. 62. Get your company involved!
  63. 63. Look for local non-profits!
  64. 64. Donate your skills and your knowledge in the way that works for you. Data for Good Meetup Domino Data Labs – 30 August 2017 Data has the power to change the world.