Failure Rate Prediction with Deep Learning

PRODUCT FAILURE RATEPREDICTION
7NOVEMBER2018 AIM^2OSLOMET
ALINAASTRAKOVA
DATASCIENTIST@ELKJØPNORDIC

BUSINESSCASE(-S)
• Question A: how big is a probability many products of a given type fail (help to
estimate the product cost including service cost)
2

BUSINESSCASE(-S)
• Question B: how to compare products/groups/brands (help to provide the right
assortment)
3

BUSINESSCASE(-S)
assortment)
• Question C: how many service cases we get every month/week/day (stuffing
for our service center)
4

BUSINESSCASE(-S)
assortment)
5
Observations previously
reported:
orders and services
within a given period

BUSINESSCASE(-S)
WE CONCENTRATED ON:
assortment)
6

BUSINESSCASE(-S)
WE CONCENTRATED ON:
assortment)
7
Might need a
product/group
characteristics

BUSINESSCASE(-S)
WE CONCENTRATED ON:
assortment)
8
Might need a
product/group
characteristics

BUSINESSCASE(-S)
WE CONCENTRATED ON:
assortment)
9
Might need a
product/group
characteristics

OBSERVATIONBYPRODUCTAGE
DIFFERENT NUMBER OF PRODUCTS OF DIFFERENT AGE OUT THERE
10
«0» means sold to a customer
product age

11
product age

12
product age

13

14

PRODUCTFAILURERATE(ANDITS CUMMULATIVE)
16

17

18

• Failure rate by period
• Cumulativa failure rate
• Other transformations (creating failure rate categories, .log-transform, other)
• Important milestones: service ratio after 2, 3, 5 years
• Level of granularity (ask stackholders more)
19
WHAT TOPREDICT

PRODUCTDATA
21
Categorical data
- Encode
- Embed
Continuous data
- Use as is
- Regroup to create categories
- Create extra features
(e.g., min/max/avg of
guarantee per chain, to overcome
bad data quality)
Text data:
- Use as is in CNNs
- Can create new text data
(example: ‘AenergiklasseVa+++’)
- Can create continuous variables: tf-idf + svd/nmf; ...
- Can create categorical variables: topics; ...

TIMESERIESPREDICTION
OUR PRACTICE
Something that worked nicely for us:
- Floating ‘time’ between the observations and what we predict
-> grows sample size
22

TIMESERIESPREDICTION
OUR PRACTICE
Something that worked nicely for us:
- Floating ‘time’ between the observations and what we predict
-> grows sample size
- Use possible positional and date/time features
-> differentiate between different samples
-> allows to use temporal CNNs if wanted
-> if sequence observations have a logical ‘start’,
like zero-age for Failure Rate, more positional
features can be created
23

SIMPLEMESSAGES
ESTABLISH THE BASELINE – TIMESERIES WITHOUT PRODUCT DATA IS STILL USEFUL
Example after half a year of observations:
await a significant drop of services after the first half-a year
24

SIMPLEMESSAGES
PREDICT WITH CONFIDENCE 😎
- Quantile estimations (quantile estimates also have a mean error 😱)
- BNNs (Monte Carlo dropout, VAE, inheretent noise) - Uber recommends 👍
- Ensembeling by retraining – the simplest way
- ...
25

SIMPLEMESSAGES
PREDICT WITH CONFIDENCE
- Aim for cold start ⛄️
26

SIMPLEMESSAGES
KNOW YOUR OUTLIERS
27

SIMPLEMESSAGES
CHOOSE ROBUST MODELS
Try simple versions of everything
- RNN, CNN, Temporal CNN with/without Gating
- Choose good validation loss
- Plan for future ensemble predictions
28

SIMPLEMESSAGES
- Plan for future ensemble predictions if possible
29

SIMPLEMESSAGES
- Plan for future ensemble predictions if possible
30

SIMPLEMESSAGES
What simplifies our life (IN ANY PROJECT) :
- Clear priorities from the departments, their support
- Snowflake as DW + connector to PowerBI + connector to Databricks
- Common practice (in DEV)
31

32
Thankyou!
https://giphy.com/stefanieshank

Failure Rate Prediction with Deep Learning

Recommended

Recommended

More Related Content

Similar to Failure Rate Prediction with Deep Learning

Similar to Failure Rate Prediction with Deep Learning (20)

Recently uploaded

Recently uploaded (20)

Failure Rate Prediction with Deep Learning