Engineering a Platform
Scaling DataScience
Who is DataScience?
2
● Do you need…
● …insights about your data?
● …to hire or train data scientists to provide
actionable insights?
● …a platform for consuming cutting-edge
data science and publishing within your org,
without having data scientists doing ops
work?
● …to focus your data scientists on your core
business, while we provide models for LTV,
pricing, retention and more?
● Visit www.datascience.com!
Two Primary Challenges
3
● Rapid Team Growth
● Team member onboarding
requires a low-risk, standardized
toolchain. Time to first
contribution is measured in
hours, not days or weeks. 
● Dynamic Tooling Landscape
● Best-of-breed data tools are
always changing. Our culture and
platform encourage
experimentation and evaluation
of new tools and techniques.
Rapid Onboarding
4
● A packaged virtual
development environment
● No wrestling with complex
system dependencies and
version compatibilities
● A clean starting point to
quickly retreat at any time
● Monitoring and diagnostics
● Scripted automation for
customizing per user
Rapid Onboarding
5
● Upgrades are vetted in
advance prior to wide release
● Continuous integration
provides automated feedback
● Group chat pushes
institutional knowledge out
into searchable, company-
wide record.
● A culture of sharing and
demonstration
Dynamic Landscape
● Configuration management to
quickly compose systems based
on requirements
● Software tools are constantly
evolving. A robust virtual
environment promotes
experimentation and iteration.
● Think in terms of categories of
tools not specific techs.
6
Dynamic Landscape
● Version control: track changes
to analysis over time. promote
reproducibility
● Automated testing: An
engineering approach to analysis
quality
● Integrated publishing: A
publishing workflow that closely
follows the underlying analysis
7
8
Thank you.
Questions?
We’re hiring.
Appendix: Rapid Onboarding
9
Appendix: Dynamic Landscape
10
Thank you.

Scaling Data Science: Engineering a Platform

  • 1.
  • 2.
    Who is DataScience? 2 ●Do you need… ● …insights about your data? ● …to hire or train data scientists to provide actionable insights? ● …a platform for consuming cutting-edge data science and publishing within your org, without having data scientists doing ops work? ● …to focus your data scientists on your core business, while we provide models for LTV, pricing, retention and more? ● Visit www.datascience.com!
  • 3.
    Two Primary Challenges 3 ●Rapid Team Growth ● Team member onboarding requires a low-risk, standardized toolchain. Time to first contribution is measured in hours, not days or weeks.  ● Dynamic Tooling Landscape ● Best-of-breed data tools are always changing. Our culture and platform encourage experimentation and evaluation of new tools and techniques.
  • 4.
    Rapid Onboarding 4 ● Apackaged virtual development environment ● No wrestling with complex system dependencies and version compatibilities ● A clean starting point to quickly retreat at any time ● Monitoring and diagnostics ● Scripted automation for customizing per user
  • 5.
    Rapid Onboarding 5 ● Upgradesare vetted in advance prior to wide release ● Continuous integration provides automated feedback ● Group chat pushes institutional knowledge out into searchable, company- wide record. ● A culture of sharing and demonstration
  • 6.
    Dynamic Landscape ● Configurationmanagement to quickly compose systems based on requirements ● Software tools are constantly evolving. A robust virtual environment promotes experimentation and iteration. ● Think in terms of categories of tools not specific techs. 6
  • 7.
    Dynamic Landscape ● Versioncontrol: track changes to analysis over time. promote reproducibility ● Automated testing: An engineering approach to analysis quality ● Integrated publishing: A publishing workflow that closely follows the underlying analysis 7
  • 8.
  • 9.
  • 10.
  • 11.