Dances with unicorns
Agile datascience
from exploration to adoption
Un grand merci à nos sponsors et partenaires
Oman 11.30 AM
Paris 9.45 AM
<24h
From user’s feedback to production
One month later...
This project has been
abandoned
That’s me!
Wassel Alazhar
Consultant, developer, problem solver
@wasselovski
https://github.com/jcraftsman
Agenda
● The full story
● What went wrong?
● What did we learn?
○ How to bring value from datascience
○ Explore and build
○ Efficient collaboration
○ Product quality
● Why this talk?
● Takeaways
The full story
A global energy leader
A global energy leader
Produce Deliver SELL
A global energy leader
Produce Deliver SELL
Sensors everywhere!
All along the value chain
The problem to solve
Produce Deliver SELL
Sensors everywhere!
All along the value chain
The problem to solve
Two new generation power plants
They are exactly the same
but...
Twin A Twin B
The problem to solve
Twin B is way more performant (i.e., makes money)
Twin A Twin B
The solution
Datascience can
help identifying
better operational
models for the
power plants
The solution
BIG DATA
+
DATA SCIENCE
=
MAGIC
The partner
The partner
A unicorn is a privately held startup company valued at over
$1 billion
The bill
But wait…
How much is that?
Nevermind.
It’s all on me!
It’s called innovation.
Great!
The team
Data engineers Data scientists App developers
To the Silicon valley
Data engineers Data scientists App developers
Week after week… Demo after demo
It couldn’t be any better
SURPRISE
Now, it’s all yours!
All you have to pay
for is the run.
Oh!
No, thanks.
I’m out of it.
Deception
What went wrong?
Building a software!
What was the Problem to solve?
Do you remember the twin power plants?
Twin A Twin B
Not what we’ve expected...
Problem solved explained quickly
No actionable findings
Instead we have delivered features!
Degradation analysis
Anomaly detection
The software can detect dust in the steam turbine!
PCA???
Feature
≠
VALUE
What did we learn?
Happy ending stories...
Predictive maintenance
Smart buildings
Ice detection
Heating and cooling efficiency
Business use case discovery
Don’t start with a software!
Explore
Observe
Confirm hypothesis or not!
Discover
Business use case discovery
Don’t explore in a dark lab!
Get feedback!
Business use case discovery
A python notebook is not a software!
It’s a tool for a study!
From study to product delivery
Business use case located?
Build!
VALUE
Product delivery
Not like this!
Wait, what does datascience look like in 2018?
How would you write a program for puppy recognition?
Wait, what does datascience look like in 2018?
You can:
● Try to define what a puppy face is
● Code all these rules!
Or, use Machine learning:
● Show a lot of puppy faces examples!
You don’t need to tell the algorithm what to do.
All you need is to show it a lot of examples!
Wait, what does datascience look like in 2018?
Take care of your examples (data pipeline)
Verify the results (predictions)
Putting it all together
Discovery:
Given a real world pictures sample, would it be possible to
recognize a puppy face?
The answer is 86% yes, 13% muffins, 1% unknown.
Product:
Play a dog kibble comercial whenever a puppy picture is
displayed!
Explore and build
Explore:
● Gathering data
● Cleaning data
● Feature engineering
● Defining model
● Training
● Predicting the output
=> Discover what you are able to do
with your data
Build:
● Data acquisition
● Data filtering
● Use model configuration
● Use model
● Training (or use a train set)
● Predicting the output
=> Steadily bring value from your
data
Explore and build iteratively
Explore:
● Gathering data
● Cleaning data
● Feature engineering
● Defining model
● Training
● Predicting the output
=> Discover what you are able to do
with your data
Build:
● Data acquisition
● Data filtering
● Use model configuration
● Use model
● Training (or use a train set)
● Predicting the output
=> Steadily bring value from your
data
Explore and build iteratively
Explore Build
Business
use case
discovery
Product delivery
Product delivery
You’re not done with datascience!
They should build together!
Building together
Code review
When? All the time!
Who? Everyone!
Why? Quality, collective
ownership and joy!
Building together
Pair programming
When? All the time!
Who? Everyone!
Why? Quality, collective
ownership and joy!
Building together
Mob programming
When? Whenever you start
something new or complex.
Who? Everyone!
Why? Collective
intelligence, collective
ownership, quality and joy!
Building together
TDD
Let’s be serious!
When? Whenever you change the
product’s behaviour.
Who? Everyone working on the product!
Why? Collective intelligence,
collective ownership, quality and joy!
Building together
TDD
Have you ever met a data
scientist who write unit tests
and refactor? I did! :)
It’s hard to imagine doing TDD
during an exploratory work though!
(i.e., when the target observable
behaviour is not yet defined)
Product delivery
Spikes and user stories
Product delivery essentials
Don’t lose time repeating boring stuff!
Automate!
Make data available for everyone!
Don’t treat your infra like pets!
Destroy and rebuild!
Don’t over-engineer though!
Product adoption
Stay close to the users!
Don’t plan too many features!
Incorporate feedback!
What is agile anyway?
Can datascience be agile?
It’s still true!
Even for:
● Big data
● AI
● Datascience
Why this talk?
Myths about datascience
Well… Things have slightly changed since then… But not that much!
Myths about datascience
Myths about datascience
Unicorns
New unicorns - Same old stories
You should draw your entire
model before you start coding!
Open a ticket!
You need to hire a machine
learning engineer!
Takeaways!
Takeaways!
Make people together!
Business value discovery => product delivery
Explore and build iteratively
Agile is still:
● Short feedback
● Small increments
● Take engineering seriously
work
learn
OCTO © 2018 - Reproduction interdite sans autorisation écrite préalable 67
OCTO Provence recrute !
C’EST AVANT TOUT
UN ÉTAT D’ESPRIT START-UP
APPUYÉ PAR DES EXPERTISES
TECH, AGILE & CHANGE
POUR ACCOMPAGNER
DIGITALE
TRANSFORMATION
NOS CLIENTS DANS LEUR
Contactez-nous sur
PROVENCE@OCTO.COM

Dances with unicorns