Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Iterative Design for Data Science Projects

338 views

Published on

Video and slides synchronized, mp3 and slide download available at URL http://bit.ly/2jEruds.

Bo Peng goes over how Datascope iterated on the major pieces of the Expert Finder application project to produce actionable insights and recommendations on methodologies not only for the user interfaces, but also for our “expert finding” algorithms and data sources. Filmed at qconsf.com.

Bo Peng is a partner and a data scientist at Datascope, a leading data science consultancy in Chicago, where she combines human centered design with analytics to derive actionable business insights for clients like P&G, Motorola, Thomson, Reuters. She is an active member of the technology community, co-organizing data science meetups and organizing the Women in Machine Learning & Data Science.

Published in: Technology
  • DOWNLOAD FULL BOOKS, INTO AVAILABLE Format, ......................................................................................................................... ......................................................................................................................... 1.DOWNLOAD FULL. PDF EBOOK here { https://tinyurl.com/y6a5rkg5 } ......................................................................................................................... 1.DOWNLOAD FULL. EPUB Ebook here { https://tinyurl.com/y6a5rkg5 } ......................................................................................................................... 1.DOWNLOAD FULL. doc Ebook here { https://tinyurl.com/y6a5rkg5 } ......................................................................................................................... 1.DOWNLOAD FULL. PDF EBOOK here { https://tinyurl.com/y6a5rkg5 } ......................................................................................................................... 1.DOWNLOAD FULL. EPUB Ebook here { https://tinyurl.com/y6a5rkg5 } ......................................................................................................................... 1.DOWNLOAD FULL. doc Ebook here { https://tinyurl.com/y6a5rkg5 } ......................................................................................................................... ......................................................................................................................... ......................................................................................................................... .............. Browse by Genre Available eBooks ......................................................................................................................... Art, Biography, Business, Chick Lit, Children's, Christian, Classics, Comics, Contemporary, Cookbooks, Crime, Ebooks, Fantasy, Fiction, Graphic Novels, Historical Fiction, History, Horror, Humor And Comedy, Manga, Memoir, Music, Mystery, Non Fiction, Paranormal, Philosophy, Poetry, Psychology, Religion, Romance, Science, Science Fiction, Self Help, Suspense, Spirituality, Sports, Thriller, Travel, Young Adult,
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Be the first to like this

Iterative Design for Data Science Projects

  1. 1. Bo Peng • @bo_p Iterative design for data science projects for QCon San Francisco • Nov 7, 2016
  2. 2. InfoQ.com: News & Community Site • 750,000 unique visitors/month • Published in 4 languages (English, Chinese, Japanese and Brazilian Portuguese) • Post content from our QCon conferences • News 15-20 / week • Articles 3-4 / week • Presentations (videos) 12-15 / week • Interviews 2-3 / week • Books 1 / month Watch the video with slide synchronization on InfoQ.com! https://www.infoq.com/presentations/ iterative-design-data-science
  3. 3. Purpose of QCon - to empower software development by facilitating the spread of knowledge and innovation Strategy - practitioner-driven conference designed for YOU: influencers of change and innovation in your teams - speakers and topics driving the evolution and innovation - connecting and catalyzing the influencers and innovators Highlights - attended by more than 12,000 delegates since 2007 - held in 9 cities worldwide Presented at QCon San Francisco www.qconsf.com
  4. 4. http://heritagehealthprize.com Goal: Create an algorithm that predicts how many days a patient will spend in a hospital in the next year. case study: heritage health prize approach
  5. 5. 2 1,363 25,316 years teams entries http://heritagehealthprize.com case study: heritage health prize approach
  6. 6. score time (in months) constant value all zeros goal http://heritagehealthprize.com case study: heritage health prize approach
  7. 7. score time (in months) constant value all zeros goal http://heritagehealthprize.com case study: heritage health prize approach
  8. 8. score time (in months) constant value all zeros goal http://heritagehealthprize.com case study: heritage health prize approach
  9. 9. score time (in months) constant value all zeros goal http://heritagehealthprize.com case study: heritage health prize approach
  10. 10. score time (in months) constant value all zeros goal http://heritagehealthprize.com case study: heritage health prize approach
  11. 11. score time (in months) constant value all zeros goal http://heritagehealthprize.com case study: heritage health prize approach
  12. 12. score constant value all zeros goal What can we learn from this? Solving business problems can rarely be reduced to minimizing a model’s RMSE.
  13. 13. score constant value all zeros goal Contests are fun. Solving business problems can rarely be reduced to minimizing a model’s RMSE.
  14. 14. score constant value all zeros goal Contests are fun. Solving business problems can rarely be reduced to minimizing a model’s RMSE.
  15. 15. agenda - A common approach to data science - The design approach: - a simple model goes along way (eDiscovery) - finding & recommending experts within P&G
  16. 16. How simple models + design go a long way Data driven e-discovery for Daegis
  17. 17. data-driven e-discovery daegis
  18. 18. aboutpatent not aboutpatent data-driven e-discovery daegis
  19. 19. aboutpatent not aboutpatent turn over to plaintiff don’t turn over to plaintiff adverse inference data-driven e-discovery daegis
  20. 20. aboutpatent not aboutpatent turn over to plaintiff don’t turn over to plaintiff adverse inference give away trade secrets data-driven e-discovery daegis
  21. 21. aboutpatent not aboutpatent turn over to plaintiff don’t turn over to plaintiff adverse inference give away trade secrets data-driven e-discovery daegis
  22. 22. turn over to plaintiff don’t turn over to plaintiff data-driven e-discovery daegis
  23. 23. data-driven e-discovery daegis
  24. 24. create a “document map” algorithm design patents marketing finances fantasy football lunch coffee data-driven e-discovery daegis
  25. 25. create a “document map” fantasy football algorithm design patents lunch marketing finances coffee review away shades of grey reduce reviews by 90-99% data-driven e-discovery daegis
  26. 26. care about design. simple, powerful interfaces relay analytics better.
  27. 27. iterative problem solving generate ideas build prototypeevaluate rapid iterations plan, build, test, and iterate as quickly as possible
  28. 28. Procter & Gamble Data driven expertise exploration
  29. 29. data-driven expertise exploration procter & gamble
  30. 30. data-driven expertise exploration procter & gamble
  31. 31. High level goals: - reveal areas of expertise - evaluate connectivity within experts
  32. 32. data-driven expertise exploration procter & gamble
  33. 33. Lorem Ipsum: a narrative about blankets. Author: Charlie Brown Date: 31 Jan 2012 Lorem Ipsum is a dummy text used when typesetting or marking up documents. It has a long history starting from the 1500s and is still used in digital millennium for typesetting electronic documents, page designs, etc. In itself, the original text of Lorem Ipsum might have been taken from an ancient Latin book that was written about 50 BC. Nevertheless, Lorem Ipsum’s words have been changed so they don’t read as a proper text. Naturally, page designs that are made for text documents must contain some text rather than placeholder dots or something else. However, should they contain proper English words and sentences almost every reader will deliberately try to interpret it eventually, missing the design itself. However, a placeholder text must have a natural distribution of letters and punctuation or otherwise the markup will look strange and unnatural. That’s what Lorem Ipsum helps to achieve. I would like to thank Peppermint Pattyfor her support on studying Lorem Ipsum as well as the infinite wisdom of Linus van Peltand his willingness to use his blanket in my experiments. data-driven expertise exploration procter & gamble
  34. 34. vs.
  35. 35. vs.
  36. 36. iterative problem solving generate ideas build prototypeevaluate rapid iterations plan, build, test, and iterate as quickly as possible
  37. 37. High level goals: - reveal areas of expertise - evaluate connectivity within experts
  38. 38. High level goals: - reveal areas of expertise - evaluate connectivity within experts
  39. 39. let’s compare countries.
  40. 40. + 1
  41. 41. 10 5 5 20 8 25 2 5 12 3 30 10 1 20 25 50
  42. 42. 10 5 5 20 8 25 2 5 12 3 30 10 1 20 25 50
  43. 43. 10 5 5 20 8 25 2 5 12 3 30 10 1 20 25 50
  44. 44. 10 5 5 20 8 25 2 5 12 3 30 10 1 20 25 50
  45. 45. design influences data science.
  46. 46. care about design.
  47. 47. Iterative design for data science projects Bo Peng • @bo_p for QCon San Francisco • Thanks!
  48. 48. Watch the video with slide synchronization on InfoQ.com! https://www.infoq.com/presentations/ iterative-design-data-science

×