Ocado Technology is providing a full solution to put the world’s retailers online using the cloud, robotics, AI and IoT. Processing tens of thousands of orders every day, we generate millions of events every minute, leading to huge amount of data to be managed. We will present how this Big Data is handled in Google Cloud Platform to build a end-to-end machine learning pipeline: how data is stored and processed in BigQuery, post-processed and copied with Dataflow, then used to train Deep Neural Network models with TensorFlow, how all this is orchestrated using our in-house scheduling software called Query Manager, and how predictions are finally run in real-time using Cloud ML Engine and Datastore.
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Real-Time Decisions Using ML on the Google Cloud Platform
1. Real-Time Decisions
Using ML on the
Google Cloud Platform
Przemysław Pastuszka & Carlos Garcia
QCon London
7th March 2018
2. How many of you are
interested in machine
learning?
3. but… how many of you are
running real-time machine
learning in production?
4. Who is Ocado?
Ocado is the world’s
largest dedicated
online grocery retailer
We have 645,000
active shoppers
And 49,000 SKUs in
our webshop
Three highly-automated
fulfilment centres
263,000 orders a
week ‘picked’
3 million routing
calculations per second
5.
6. What Ocado Technology does
(1) Cloud and AI
(2) Automation and robotics
(3) Big Data
(4) Web and app development
(5) IoT
48. train(C1
, … CN
, O1
, … ON
, Y) = model
C1
, … CN
, O1
, … ON
- customer and order features
C1
- Average basket size for the customer
O1
- % of alcoholic items in current order
...
Y - Fraud or not fraud
52. train(C1
, … CN
, O1
, … ON
, Y) = model
model(C1
, … CN
, O1
, … ON
) = prediction
C1
, … CN
, O1
, … ON
- customer and order features
C1
- Average basket size for the customer
O1
- % of alcoholic items in current order
...
Y - Fraud or not fraud
prediction - Probability of current order being fraudulent
58. Know your target
Keep It Simple
Choose your model wisely
Google Cloud ML Engine for Neural Nets
Have data and tools ready
BigQuery is king
Unified architecture for training and serving predictions