Advertisement
Advertisement

More Related Content

Slideshows for you(20)

Similar to Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtworks (20)

Advertisement
Advertisement

Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtworks

  1. BIG DATA AGILE ANALYTICS Ken Collier, PhD Director, Agile Analytics @theagilist #thoughtworks 1
  2. Value Complexity What happened? Descriptive Analytics Why did it happen? Diagnostic Analytics What will happen? Predictive Analytics How can we make it happen? Prescriptive Analytics
  3. Value Complexity What happened? Descriptive Analytics Why did it happen? Diagnostic Analytics What will happen? Predictive Analytics How can we make it happen? Prescriptive Analytics 3 Traditional Business Intelligence Advanced Analytics
  4. Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Advanced Analytics
  5. Agile Analytics Big Data Solutions Thinking Ethics Agile Delivery Lean Learning Impact Advanced Analytics Volume Velocity Variety NoSQL Complexity Polyglot Persistence
  6. Big Data Analytics Pipeline Modeling Data Operational Data External Data Data Integration Reporting Engine Dimension Mapping Clean Data Report Report Report Dimensional Data Data Sampling Feature Selection Data Partitioning Test Data Training Data Analytical Modeling Candidate Model Model Validation Accepted Model
  7. Agile Analytics Big Data Solutions Thinking Ethics Agile Delivery Lean Learning Impact Advanced Analytics Volume Velocity Variety NoSQL Complexity Polyglot Persistence
  8. Advanced Analytics Agile Analytics Big Data Solutions Thinking Ethics Agile Delivery Lean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence
  9. Discover & Explore Analyze & Act Data Convergence Analytical Divergence Discover Harvest Filter Integrate Augment Analyze Act Analytical Opportunities How Advanced Analytics Works If we knew X, we could do Y
  10. Typical Timeline 3-6 months 2 months 2-4 months 10 Data Convergence Analytical Divergence Discover Harvest Filter Integrate Augment Analyze Act Analytical Opportunities Traditional Analytics If we knew X, we could do Y
  11. Advanced Analytics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery
  12. Advanced Analytics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure
  13. Analytical Divergence Analytical Opportunities If we knew X, we could do Y Data Convergence Discover Harvest Filter Integrate Augment Analyze Act Repeat this cycle solving small problems every few days LEARN MEASURE BUILD Agility in Analytics
  14. Retain high value customers High value business goal Like this example…
  15. What’s the smallest, simplest thing we can do? Retain high value customers Like this example… Common features of defectors?
  16. Is it useful & actionable? Retain high value customers Like this example… Common features of defectors?
  17. Repeat!Retain high value customers Like this example… Common features of defectors? Shopping behaviors of defectors?
  18. Retain high value customers Like this example… Common features of defectors? What leads to customers leaving? Shopping behaviors of defectors? What do defectors say about us? Customers’ sentiment before defecting? What encourages customers to stay? Do incentives reduce defection rates?
  19. Problem solved or continue? What leads to customers leaving? Like this example… Common features of defectors? Shopping behaviors of defectors? What do defectors say about us? Customers’ sentiment before defecting? What encourages customers to stay? Do incentives reduce defection rates?
  20. Advanced Analytics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure Data Science Machine Learning Statistics
  21. THE “DATA SCIENTIST” Machine Learning Statistical Modeling Artificial Neural Networks Decision Tree Learning Support Vector Machines Clustering …and many more… Bayesian Classification Monte Carlo Simulation Logistic Regression K-Nearest Neighbor …and many more… Domain Knowledge Data Semantics Business Understanding Business Communication Programming Skills Functional Programming Data “Wrangling” Map/Reduce, SQL, & NoSQL
  22. Advanced Analytics Data Science Visual Storytelling Machine Learning Statistics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure
  23. drones.pitchinteractive.com Data Visualization
  24. Advanced Analytics Data Science Visual Storytelling Machine Learning Statistics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure Data Reduction
  25. Objective Truth Discoverable Truth Uninterpretable Irrelevant Noise Not Actionable Impactful New Insights “Little Data”
  26. Advanced Analytics Data Science Visual Storytelling Machine Learning Statistics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure Data Reduction Insight Knowledge Action Disruption
  27. Advanced Analytics Data Science Visual Storytelling Machine Learning Statistics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure Data Reduction Insight Knowledge Action Disruption Business vs. IT Focus vs. Platform Monitor & Measure
  28. Advanced Analytics Data Science Visual Storytelling Machine Learning Statistics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure Data Reduction Insight Knowledge Action Disruption Business vs. IT Focus vs. Platform Monitor & Measure Privacy Controls Radical Transparency Data Democracy Open Data
  29. Advanced Analytics Data Science Visual Storytelling Machine Learning Statistics Agile Analytics Big Data Solutions Thinking Ethics Agile DeliveryLean Learning Impact Volume Velocity Variety NoSQL Complexity Polyglot Persistence Continuous Integration Collaboration Evolve Continuous Delivery Hypothesis Build Learn Measure Data Reduction Insight Knowledge Action Disruption Business vs. IT Focus vs. Platform Monitor & Measure Privacy Controls Radical Transparency Data Democracy Open Data
  30. Ken Collier, Director, Agile Analytics kcollier@thoughtworks.com Value Creation Cool New Technologies + Sophisticated Analytics + Lean Learning Principals + Fast Agile Delivery =
Advertisement