SlideShare a Scribd company logo
Delivery Excellence:
Love is in the Air –
an Architecture for Analytics
in Production
Leverkusen, July 31, 2019, Daniel Schulz
DSC_RendezvousArchitecture.pptx
Love is in the Air
is an Architectural Battle Story
DSC_RendezvousArchitecture.pptx
Time Schedule
Topics Covered
1. Brief Motivation
2. Recap: Lambda Architecture
3. Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 3
DSC_RendezvousArchitecture.pptx
Challenge
Deliver many Ever-Improving
Predictive Models in Reproducible &
Robust Manner to Production
DSC_RendezvousArchitecture.pptx
The Challenge Put Forward
• Greenfield project
• Modern tech stack: Kubernetes, Docker, Cloud-native, DSE Cassandra
• Reproducibility
• Resilient & robust models
• Deliver multiple models w/ increasing quality
• DevOps delivery
© 2019 Daniel Schulz. All rights reserved. 5
Requirements for both QA & Production
DSC_RendezvousArchitecture.pptx
Time Schedule
Topics Covered
1. Brief Motivation
2. Recap: Lambda Architecture
3. Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 6
DSC_RendezvousArchitecture.pptx
Competing, Famous
λ-Architecture
is a Different Approach – both
Might be Combined w/ One Another
DSC_RendezvousArchitecture.pptx
The Lambda-Architecture Splits Data into Two Distinct Sets –
a Large Batch Set as a 360° View & Just Recent, Speedy Updates
© 2019 Daniel Schulz. All rights reserved. 8
Just Like the Data Set is Split, so are the Data Stores & Hence their Respective Serving Architectures
Speed Layer
Batch Layer
Data Sources
Queries
Queries
GET
GET
PUT/POST
DSC_RendezvousArchitecture.pptx
In a more Unified Version, One Central Serving Architecture
Combines Both Batch & Speed Layer Data to Answer Queries
© 2019 Daniel Schulz. All rights reserved. 9
There is a Trade-Off to be Made between Completeness/Perfection in Batch & Recency in Speed Layer Data
Speed Layer
Batch Layer
Data Sources Serving Layer QueriesPUT/POST GET
GET
DSC_RendezvousArchitecture.pptx
Lambda-Architecture is a Trade-Off Between Completeness &
Speediness of Prediction
• Commonalities
• Use Cases in Predictive Analytics
• Deliver predictions very fast
• Differences
• Lambda Architecture
• is criticized for its complexity –
different data stores, software code base and serving technologies might be needed
• Only applicable in Append-only Architectures like e.g. Hadoop –
data change would yield a completely new batch computation
• Focused on data age
▪ Rendezvous Architecture
• Technically more complex to implement
• Message Queue needed which might duplicate messages and increase latency
• Focused on Predictive Models
© 2019 Daniel Schulz. All rights reserved. 10
The Batch Layer Provides an almost Perfect 360° on Everything – the Speed Layer Intends to Predict Fast
DSC_RendezvousArchitecture.pptx
Time Schedule
Topics Covered
1. Brief Motivation
2. Recap: Lambda Architecture
3. Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 11
DSC_RendezvousArchitecture.pptx
λ-Architecture ⊥
Rendezvous Architecture
Are Different Approaches – both
Might be Combined w/ One Another
DSC_RendezvousArchitecture.pptx
Objectives on the Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 13
Managing a Myriad of Predictive Analytics Models in Production
▪ Manage multiple models in production and alike environments
▪ Test-drive incumbent & challenging models against one another
→ enable Rapid App/Model Development (RAD) for DevOps projects
▪ Reproducibility & transparency of model’s predictions
DSC_RendezvousArchitecture.pptx
Client/Server
Architecture
Simple & Traditional
DSC_RendezvousArchitecture.pptx
Starting with Simplest, Traditional Client/Server Architecture –
Discrete, Direct & Stateless Response
© 2019 Daniel Schulz. All rights reserved. 15
One Endpoint Answers Predictive Queries w/ all Information to the Model Encapsulated in Request
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
Starting with Simplest, Traditional Client/Server Architecture –
Discrete, Direct & Stateless Response
• Shortcomings:
• Only one Predictive Model at a time
• Incumbent-only deployment – no challenging models for reference
• No comparing many model’s accuracies against one another
• Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback
• Redeployment could result in downtimes iff no Green/Blue Deployments are used
• Selected implementation ideas:
• Tensorflow Serving
• Flask
• Various Application Servers
© 2019 Daniel Schulz. All rights reserved. 16
This is the Simplest Architecture for AI Deployments
DSC_RendezvousArchitecture.pptx
Annotations for All Future Architecture Iterations
• Annotation on HA:
independent of the High-Availability aspect, the following is agnostic to whether there might be one or many replicas to
take over in case of technical failure
• Annotation on Multi-Threading:
any Model might be one singular process, a Multi-Threading, Multi-Process program or even an Endpoint in a Microservice
Architecture (with more complex calls behind it); the only relevant information is the Client/Server nature of it, where all
information is enclosed in this stateless request and the model responds to it; external dependencies might apply or not
• Annotation on Ensembles, etc.:
this model might be an Ensemble or not – the sophistication and complexity underneath is secondary
© 2019 Daniel Schulz. All rights reserved. 17
Applies to this Base Model & all Supporting, more Complex Versions of such
DSC_RendezvousArchitecture.pptx
Load Balanced
Client/Server Architecture
Enable Fast Model Exchange
DSC_RendezvousArchitecture.pptx
Adding Technical High Availability Helps Little –
a Load Balancer Forwards any One Request to Exactly One Model
© 2019 Daniel Schulz. All rights reserved. 19
The Reverse Proxy Enables Green/Blue Deployments Only – Swapping Models Just Happens Faster Now
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
Adding Technical High Availability Helps Little –
a Load Balancer Forwards any One Request to Exactly One Model
• Shortcomings:
• Only one Predictive Model per Request at a time
• Hence, still no challenging models for reference
• No comparing many model’s accuracies against one another
• Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback
• Selected implementation ideas for Reverse Proxy:
• Kubernetes or Kubernetes w/ Istio (Microservice Architectures)
• Docker Swarm (Microservice Architectures)
• Nginx
• Flask
• Various Application Servers
© 2019 Daniel Schulz. All rights reserved. 20
This Architecture is Pretty Simple & a Standard Deployment Practice from Custom Solution Development
DSC_RendezvousArchitecture.pptx
Load Balanced
Parallelized Models
Enable Multiple Models at Once
DSC_RendezvousArchitecture.pptx
Multiple Models in Parallel –
Adding a Message Queue Enables Concurrency of Models
© 2019 Daniel Schulz. All rights reserved. 22
But which Prediction to Choose in the End?
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
Multiple Models in Parallel –
Adding a Message Queue Enables Concurrency of Models
• Shortcomings:
• Somewhat challenging models for reference
• No comparing many model’s accuracies against one another
• Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback
• Need to add return address (as an URI) and boolean flag to whether to return anything at all in the queue –
for the consumer to respond to the original (HTTP) request, which might be completed then
• Persistent Message Queues tend to be slower than pure in-memory ones;
Spark Streaming working in micro-batches might have higher latencies as well
• Selected implementation ideas for (persistent) Message Queues:
• Apache Kafka
• Apache Flume
• MapR Streams
• RabbitMQ, ActiveMQ, ZeroMQ, etc.
• Apache Spark’s Streaming resp. Apache Flink’s Streaming as Producers resp. Consumers of such Message Queues
© 2019 Daniel Schulz. All rights reserved. 23
But which Prediction to Choose in the End?
DSC_RendezvousArchitecture.pptx
Rendezvous
Architecture
Stop-back Request Guarantees,
Parallelized, Comparable,
Rapid Model Updates,
QoS Guarantees,
etc.
DSC_RendezvousArchitecture.pptx
The Model’s Rendezvous – a Scoring Stream Collects Various
Predictions & Returns them to the Original Request’s Client
© 2019 Daniel Schulz. All rights reserved. 25
Many Models – One Final Prediction
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
Stateless Models are Easier to Replicate –
but Data Augmentation Might be Helpful or Necessary
© 2019 Daniel Schulz. All rights reserved. 26
Any Additional Piece of or State Information Helpful to our Models Shall be Added in One Central Place
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
The Decoy Model –
Collect Production Data for Debugging, Optimizations and Reproducibility
© 2019 Daniel Schulz. All rights reserved. 27
The “Unit Tests” of AI Models in Production is Real-World Data
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
Discussion when to Add External Information
© 2019 Daniel Schulz. All rights reserved. 28
No Augmentation
▪ Models only receive request information
▪ More reliable for technical failure
▪ More overhead when many models fetch the same data –
use caches here
DSC_RendezvousArchitecture.pptx
Discussion when to Add External Information
© 2019 Daniel Schulz. All rights reserved. 29
No Augmentation
▪ Models only receive request information
▪ More reliable for technical failure
▪ More overhead when many models fetch the same data –
use caches here
All Augmentation in One Central Place
▪ Models receive all complete information
▪ Ideal case for reproducibility –
as the “Decoy Model” stores all information for
debugging and to explain predictions later-on
▪ Usually faster due to smaller overhead
→ Best Practice by Ellen Friedman & Ted Dunning,
Chapter 3, sub-section “Stateful Models” in [MLL]
DSC_RendezvousArchitecture.pptx
Add Metrics & the Canary Model for Monitoring & Optimizations
© 2019 Daniel Schulz. All rights reserved. 30
Metrics are Crucial to Compare Many Models Against One Another & Hence Supports Accuracy Improvements
Source & image courtesy: [MLL]
DSC_RendezvousArchitecture.pptx
Add Metrics & the Canary Model for Monitoring & Optimizations
© 2019 Daniel Schulz. All rights reserved. 31
Metrics
▪ Metrics monitor all model’s predictions and compares
their performances
▪ Hence, we are able to judge challenging model’s
performances compared to the incumbent model’s ones
▪ Metrics are helpful to detect outliers in predictions –
• e.g. Adversarial Images, where models might predict
obscure classifications
• e.g. detect swayed models, shift in input data’s
distributions, etc.
▪ Both technological SLA, timing & AI metrics
• SLA metrics:
latency, throughput, etc.
• Timing metrics:
computation time in threads, time for requests by
source/endpoint, etc.
• AI metrics:
accuracy, error metrics, AUC, F-statistics, etc.
The Canary Model
▪ Also a Best Practice by [MLL] for finding anomalies
▪ Is a rather dated model that keeps predicting to compare
the newer models’ predictions with it – to help detect
how production-ready they really are
▪ The difference in Canary Model and later ones is proof of
progress – or lack thereof for the Data Scientists
DSC_RendezvousArchitecture.pptx
Rendezvous Architecture – a Mixture of Models in Harmony
• Advantages:
• Model “warm-up” in production-like environments
• Switch models in an instant – un-deploy & deploy AI models swiftly
• Introduce time guarantees: all models work in parallel like Cassandra queries
• Mix simple, technically robust (not failing) models along w/ more sophisticated ones, which might break suddenly
• Incumbent vs challenging: collect raw data and performance metrics for various models (e.g. XGBoost vs Random
Forests; e.g. Linear Model vs SVM; e.g. PCA vs T-SNE) and differing versions in model streams (version 0.1, 0.2, …)
• Backstop:
when taking too long, a simpler, less robust model may answer as a backstop for more complex, more sophisticated
models; the same applies when longer-term performance metrics might indicate another model would perform better
© 2019 Daniel Schulz. All rights reserved. 32
My Suggestion for Reliable AI Systems Due to…
DSC_RendezvousArchitecture.pptx
Source & Image Courtesy from Book “Machine Learning Logistics”
© 2019 Daniel Schulz. All rights reserved. 33
“Machine Learning Logistics” by Ellen Friedman & Ted Dunning
▪ Authors: Ellen Friedman & Ted Dunning
▪ Publisher: O'Reilly Media, Inc.
▪ Release Date: October 2017
▪ ISBN: 9 7814 9199 7611
▪ Picture source ID: MLL
DSC_RendezvousArchitecture.pptx
Source from Book “Hands-on Machine Learning with Scikit-Learn,
Keras, and TensorFlow, 2nd Edition”
© 2019 Daniel Schulz. All rights reserved. 34
“Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow” by Aurélien Géron
▪ Author: Aurélien Géron
▪ Publisher: O'Reilly Media, Inc.
▪ Release Date: September 2019
▪ ISBN: 9 7814 9203 2649
DSC_RendezvousArchitecture.pptx
Résumé
Resilient, Extendable &
Production-ready Architecture for
Predictive Analytics
DSC_RendezvousArchitecture.pptx
Resumé on Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 36
Limitations
▪ Is no silver bullet – does not solve all logistical obstacles in AI projects
▪ Focusses on production-side architecture – development, test, QA and LTE environments might benefit from it
▪ Focus on technological, software architecture – does not cover ML Metrics, Hyperparameter Tuning, etc.
▪ Latencies increased a bit by Message-Queue-dependency
DSC_RendezvousArchitecture.pptx
Resumé on Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 37
Major Advantages
▪ Manage multiple models in production and alike
environments
▪ Test-drive incumbent & challenging models against one
another
▪ Reproducibility & transparency of model’s predictions
DSC_RendezvousArchitecture.pptx
Resumé on Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 38
Major Advantages
▪ Manage multiple models in production and alike
environments
▪ Test-drive incumbent & challenging models against one
another
▪ Reproducibility & transparency of model’s predictions
Minor Benefits
▪ Collect real-world data for future development
▪ Establish baseline performance values for Predictive
Analytics
▪ Rapid development due to default, fallback models
▪ Latency guarantees for model predictions
resilient, robust Predictive models
in modern Agile & DevOps projects
DSC_RendezvousArchitecture.pptx
Rendezvous Architecture
is the Modern Bedrock
of Robust Predictive models for
Today’s Agile & DevOps Projects
DSC_RendezvousArchitecture.pptx
Thank You for Your Attention
© 2019 Daniel Schulz. All rights reserved. 40
Please Feel Free to Ask any Open Questions, Suggestions or Voice Your Opinion…
A global leader in consulting, technology services and digital transformation,
Capgemini is at the forefront of innovation to address the entire breadth of clients’
opportunities in the evolving world of cloud, digital and platforms. Building on its
strong 50-year heritage and deep industry-specific expertise, Capgemini enables
organizations to realize their business ambitions through an array of services from
strategy to operations. Capgemini is driven by the conviction that the business
value of technology comes from and through people. It is a multicultural company
of over 200,000 team members in more than 40 countries. The Group reported
2018 global revenues of EUR 13.2 billion.
About Capgemini
Learn more about us at
www.capgemini.com
This presentation contains information that may be privileged or confidential and
is the property of the Capgemini Group.
Copyright © 2019 Daniel Schulz. All rights reserved.
People matter, results count.

More Related Content

What's hot

Why to Cloud Native
Why to Cloud NativeWhy to Cloud Native
Why to Cloud Native
Karthik Gaekwad
 
From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...
From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...
From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...
VMware Tanzu
 
[Sirius Day Eindhoven 2018] ASML's MDE Going Sirius
[Sirius Day Eindhoven 2018]  ASML's MDE Going Sirius[Sirius Day Eindhoven 2018]  ASML's MDE Going Sirius
[Sirius Day Eindhoven 2018] ASML's MDE Going Sirius
Obeo
 
Continuous Delivery with CloudBees Core
Continuous Delivery with CloudBees CoreContinuous Delivery with CloudBees Core
Continuous Delivery with CloudBees Core
Bhavani Rao
 
How to Guarantee High Performance for Application Data in the Cloud
How to Guarantee High Performance for Application Data in the CloudHow to Guarantee High Performance for Application Data in the Cloud
How to Guarantee High Performance for Application Data in the Cloud
Buurst
 
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Odinot Stanislas
 
Pulling Back the Curtain – CloudStack in Private and Community Clouds
Pulling Back the Curtain –CloudStack in Private and Community CloudsPulling Back the Curtain –CloudStack in Private and Community Clouds
Pulling Back the Curtain – CloudStack in Private and Community Clouds
Chip Childers
 
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
LaurenWendler
 
The Next Generation Datacenter
The Next Generation DatacenterThe Next Generation Datacenter
The Next Generation Datacenter
Red Hat Events
 
Bring cloud on premises with a kubernetes-native infrastructure
Bring cloud on premises with a kubernetes-native infrastructureBring cloud on premises with a kubernetes-native infrastructure
Bring cloud on premises with a kubernetes-native infrastructure
Abhinav Joshi
 
Capella Days 2021 | How much time does modeling take? Experiences from modeli...
Capella Days 2021 | How much time does modeling take? Experiences from modeli...Capella Days 2021 | How much time does modeling take? Experiences from modeli...
Capella Days 2021 | How much time does modeling take? Experiences from modeli...
Obeo
 
It summit 150604 cb_wcl_ld_kmh_v6_to_publish
It summit 150604 cb_wcl_ld_kmh_v6_to_publishIt summit 150604 cb_wcl_ld_kmh_v6_to_publish
It summit 150604 cb_wcl_ld_kmh_v6_to_publish
kevin_donovan
 
Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...
Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...
Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...
Obeo
 
Cloud Migration Strategy - IT Transformation with Cloud
Cloud Migration Strategy - IT Transformation with CloudCloud Migration Strategy - IT Transformation with Cloud
Cloud Migration Strategy - IT Transformation with Cloud
Blazeclan Technologies Private Limited
 
HP Helion OpenStack and Professional Services
HP Helion OpenStack and Professional ServicesHP Helion OpenStack and Professional Services
HP Helion OpenStack and Professional Services
Matthew Farina
 
Migration to the hybrid cloud in 4 steps
Migration to the hybrid cloud in 4 steps Migration to the hybrid cloud in 4 steps
Migration to the hybrid cloud in 4 steps
Sylvain FRANCESCHI
 
Ibm test & development cloud + rational service delivery services platform
Ibm test & development cloud + rational service delivery services platformIbm test & development cloud + rational service delivery services platform
Ibm test & development cloud + rational service delivery services platform
Babak Hosseinzadeh
 
HP Helion OpenStack step by step
HP Helion OpenStack step by stepHP Helion OpenStack step by step
HP Helion OpenStack step by step
Bruno Cornec
 
Cloud Migration: Moving to the Cloud
Cloud Migration: Moving to the CloudCloud Migration: Moving to the Cloud
Cloud Migration: Moving to the Cloud
Dr.-Ing. Michael Menzel
 
A walkthrough of HP Helion OpenStack
A walkthrough of HP Helion OpenStackA walkthrough of HP Helion OpenStack
A walkthrough of HP Helion OpenStack
HP Enterprise Italia
 

What's hot (20)

Why to Cloud Native
Why to Cloud NativeWhy to Cloud Native
Why to Cloud Native
 
From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...
From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...
From Mainframe to Microservices with Pivotal Platform and Kafka: Bridging the...
 
[Sirius Day Eindhoven 2018] ASML's MDE Going Sirius
[Sirius Day Eindhoven 2018]  ASML's MDE Going Sirius[Sirius Day Eindhoven 2018]  ASML's MDE Going Sirius
[Sirius Day Eindhoven 2018] ASML's MDE Going Sirius
 
Continuous Delivery with CloudBees Core
Continuous Delivery with CloudBees CoreContinuous Delivery with CloudBees Core
Continuous Delivery with CloudBees Core
 
How to Guarantee High Performance for Application Data in the Cloud
How to Guarantee High Performance for Application Data in the CloudHow to Guarantee High Performance for Application Data in the Cloud
How to Guarantee High Performance for Application Data in the Cloud
 
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
 
Pulling Back the Curtain – CloudStack in Private and Community Clouds
Pulling Back the Curtain –CloudStack in Private and Community CloudsPulling Back the Curtain –CloudStack in Private and Community Clouds
Pulling Back the Curtain – CloudStack in Private and Community Clouds
 
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
 
The Next Generation Datacenter
The Next Generation DatacenterThe Next Generation Datacenter
The Next Generation Datacenter
 
Bring cloud on premises with a kubernetes-native infrastructure
Bring cloud on premises with a kubernetes-native infrastructureBring cloud on premises with a kubernetes-native infrastructure
Bring cloud on premises with a kubernetes-native infrastructure
 
Capella Days 2021 | How much time does modeling take? Experiences from modeli...
Capella Days 2021 | How much time does modeling take? Experiences from modeli...Capella Days 2021 | How much time does modeling take? Experiences from modeli...
Capella Days 2021 | How much time does modeling take? Experiences from modeli...
 
It summit 150604 cb_wcl_ld_kmh_v6_to_publish
It summit 150604 cb_wcl_ld_kmh_v6_to_publishIt summit 150604 cb_wcl_ld_kmh_v6_to_publish
It summit 150604 cb_wcl_ld_kmh_v6_to_publish
 
Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...
Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...
Capella Days 2021 | Introduction to CAPELLA/ARCADIA and NASA Systems Engineer...
 
Cloud Migration Strategy - IT Transformation with Cloud
Cloud Migration Strategy - IT Transformation with CloudCloud Migration Strategy - IT Transformation with Cloud
Cloud Migration Strategy - IT Transformation with Cloud
 
HP Helion OpenStack and Professional Services
HP Helion OpenStack and Professional ServicesHP Helion OpenStack and Professional Services
HP Helion OpenStack and Professional Services
 
Migration to the hybrid cloud in 4 steps
Migration to the hybrid cloud in 4 steps Migration to the hybrid cloud in 4 steps
Migration to the hybrid cloud in 4 steps
 
Ibm test & development cloud + rational service delivery services platform
Ibm test & development cloud + rational service delivery services platformIbm test & development cloud + rational service delivery services platform
Ibm test & development cloud + rational service delivery services platform
 
HP Helion OpenStack step by step
HP Helion OpenStack step by stepHP Helion OpenStack step by step
HP Helion OpenStack step by step
 
Cloud Migration: Moving to the Cloud
Cloud Migration: Moving to the CloudCloud Migration: Moving to the Cloud
Cloud Migration: Moving to the Cloud
 
A walkthrough of HP Helion OpenStack
A walkthrough of HP Helion OpenStackA walkthrough of HP Helion OpenStack
A walkthrough of HP Helion OpenStack
 

Similar to Productionizing Predictive Analytics using the Rendezvous Architecture - for Data Scientists

Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
danielschulz2005
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step JourneyWebinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
DataStax
 
Best practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentationBest practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentation
esebeus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Multicloud - Understanding Benefits. Obstacles, and Best Approaches
Multicloud - Understanding Benefits. Obstacles, and Best ApproachesMulticloud - Understanding Benefits. Obstacles, and Best Approaches
Multicloud - Understanding Benefits. Obstacles, and Best Approaches
Kenneth Hui
 
Continuous Delivery for cloud - scenarios and scope
Continuous Delivery for cloud  - scenarios and scopeContinuous Delivery for cloud  - scenarios and scope
Continuous Delivery for cloud - scenarios and scope
Sanjeev Sharma
 
[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...
[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...
[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...
Srijan Technologies
 
Cloud computing & Batch processing: potentiels & perspectives
Cloud computing & Batch processing:  potentiels & perspectives  Cloud computing & Batch processing:  potentiels & perspectives
Cloud computing & Batch processing: potentiels & perspectives
Claude Riousset
 
How to Transform Corporate IT into the Driver for Digital Transformation
How to Transform Corporate IT into the Driver for Digital TransformationHow to Transform Corporate IT into the Driver for Digital Transformation
How to Transform Corporate IT into the Driver for Digital Transformation
Enterprise Management Associates
 
cloud-migrations.pptx
cloud-migrations.pptxcloud-migrations.pptx
cloud-migrations.pptx
John Mulhall
 
Enabling Fast IT using Containers, Microservices and DAVROS models: an overview
Enabling Fast IT using Containers, Microservices and DAVROS models: an overviewEnabling Fast IT using Containers, Microservices and DAVROS models: an overview
Enabling Fast IT using Containers, Microservices and DAVROS models: an overview
Cisco DevNet
 
Confluent Partner Tech Talk with QLIK
Confluent Partner Tech Talk with QLIKConfluent Partner Tech Talk with QLIK
Confluent Partner Tech Talk with QLIK
confluent
 
Cloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native appsCloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native apps
VMware Tanzu
 
Azure App Modernization
Azure App ModernizationAzure App Modernization
Azure App Modernization
Phi Huynh
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018
Krishna-Kumar
 
Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...
Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...
Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...
DevOps Enterprise Summit
 
Matias Creimerman - Cloud migration and modernization effort
Matias Creimerman - Cloud migration and modernization effortMatias Creimerman - Cloud migration and modernization effort
Matias Creimerman - Cloud migration and modernization effort
Matias Creimerman
 
Accelerating the Path to Digital with a Cloud Data Strategy
Accelerating the Path to Digital with a Cloud Data StrategyAccelerating the Path to Digital with a Cloud Data Strategy
Accelerating the Path to Digital with a Cloud Data Strategy
MongoDB
 
Developing Hybrid Cloud Applications
Developing Hybrid Cloud ApplicationsDeveloping Hybrid Cloud Applications
Developing Hybrid Cloud Applications
Daniel Berg
 
Continuous Delivery to the cloud - Innovate 2014
Continuous Delivery to the cloud - Innovate 2014Continuous Delivery to the cloud - Innovate 2014
Continuous Delivery to the cloud - Innovate 2014
Sanjeev Sharma
 

Similar to Productionizing Predictive Analytics using the Rendezvous Architecture - for Data Scientists (20)

Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step JourneyWebinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
 
Best practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentationBest practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentation
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Multicloud - Understanding Benefits. Obstacles, and Best Approaches
Multicloud - Understanding Benefits. Obstacles, and Best ApproachesMulticloud - Understanding Benefits. Obstacles, and Best Approaches
Multicloud - Understanding Benefits. Obstacles, and Best Approaches
 
Continuous Delivery for cloud - scenarios and scope
Continuous Delivery for cloud  - scenarios and scopeContinuous Delivery for cloud  - scenarios and scope
Continuous Delivery for cloud - scenarios and scope
 
[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...
[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...
[Srijan Wednesday Webinars] How to Build a Cloud Native Platform for Enterpri...
 
Cloud computing & Batch processing: potentiels & perspectives
Cloud computing & Batch processing:  potentiels & perspectives  Cloud computing & Batch processing:  potentiels & perspectives
Cloud computing & Batch processing: potentiels & perspectives
 
How to Transform Corporate IT into the Driver for Digital Transformation
How to Transform Corporate IT into the Driver for Digital TransformationHow to Transform Corporate IT into the Driver for Digital Transformation
How to Transform Corporate IT into the Driver for Digital Transformation
 
cloud-migrations.pptx
cloud-migrations.pptxcloud-migrations.pptx
cloud-migrations.pptx
 
Enabling Fast IT using Containers, Microservices and DAVROS models: an overview
Enabling Fast IT using Containers, Microservices and DAVROS models: an overviewEnabling Fast IT using Containers, Microservices and DAVROS models: an overview
Enabling Fast IT using Containers, Microservices and DAVROS models: an overview
 
Confluent Partner Tech Talk with QLIK
Confluent Partner Tech Talk with QLIKConfluent Partner Tech Talk with QLIK
Confluent Partner Tech Talk with QLIK
 
Cloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native appsCloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native apps
 
Azure App Modernization
Azure App ModernizationAzure App Modernization
Azure App Modernization
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018
 
Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...
Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...
Greg Maxey - Electric Cloud - Process as Code: An Introduction to the Electri...
 
Matias Creimerman - Cloud migration and modernization effort
Matias Creimerman - Cloud migration and modernization effortMatias Creimerman - Cloud migration and modernization effort
Matias Creimerman - Cloud migration and modernization effort
 
Accelerating the Path to Digital with a Cloud Data Strategy
Accelerating the Path to Digital with a Cloud Data StrategyAccelerating the Path to Digital with a Cloud Data Strategy
Accelerating the Path to Digital with a Cloud Data Strategy
 
Developing Hybrid Cloud Applications
Developing Hybrid Cloud ApplicationsDeveloping Hybrid Cloud Applications
Developing Hybrid Cloud Applications
 
Continuous Delivery to the cloud - Innovate 2014
Continuous Delivery to the cloud - Innovate 2014Continuous Delivery to the cloud - Innovate 2014
Continuous Delivery to the cloud - Innovate 2014
 

Recently uploaded

一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 

Recently uploaded (20)

一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 

Productionizing Predictive Analytics using the Rendezvous Architecture - for Data Scientists

  • 1. Delivery Excellence: Love is in the Air – an Architecture for Analytics in Production Leverkusen, July 31, 2019, Daniel Schulz
  • 2. DSC_RendezvousArchitecture.pptx Love is in the Air is an Architectural Battle Story
  • 3. DSC_RendezvousArchitecture.pptx Time Schedule Topics Covered 1. Brief Motivation 2. Recap: Lambda Architecture 3. Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 3
  • 4. DSC_RendezvousArchitecture.pptx Challenge Deliver many Ever-Improving Predictive Models in Reproducible & Robust Manner to Production
  • 5. DSC_RendezvousArchitecture.pptx The Challenge Put Forward • Greenfield project • Modern tech stack: Kubernetes, Docker, Cloud-native, DSE Cassandra • Reproducibility • Resilient & robust models • Deliver multiple models w/ increasing quality • DevOps delivery © 2019 Daniel Schulz. All rights reserved. 5 Requirements for both QA & Production
  • 6. DSC_RendezvousArchitecture.pptx Time Schedule Topics Covered 1. Brief Motivation 2. Recap: Lambda Architecture 3. Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 6
  • 7. DSC_RendezvousArchitecture.pptx Competing, Famous λ-Architecture is a Different Approach – both Might be Combined w/ One Another
  • 8. DSC_RendezvousArchitecture.pptx The Lambda-Architecture Splits Data into Two Distinct Sets – a Large Batch Set as a 360° View & Just Recent, Speedy Updates © 2019 Daniel Schulz. All rights reserved. 8 Just Like the Data Set is Split, so are the Data Stores & Hence their Respective Serving Architectures Speed Layer Batch Layer Data Sources Queries Queries GET GET PUT/POST
  • 9. DSC_RendezvousArchitecture.pptx In a more Unified Version, One Central Serving Architecture Combines Both Batch & Speed Layer Data to Answer Queries © 2019 Daniel Schulz. All rights reserved. 9 There is a Trade-Off to be Made between Completeness/Perfection in Batch & Recency in Speed Layer Data Speed Layer Batch Layer Data Sources Serving Layer QueriesPUT/POST GET GET
  • 10. DSC_RendezvousArchitecture.pptx Lambda-Architecture is a Trade-Off Between Completeness & Speediness of Prediction • Commonalities • Use Cases in Predictive Analytics • Deliver predictions very fast • Differences • Lambda Architecture • is criticized for its complexity – different data stores, software code base and serving technologies might be needed • Only applicable in Append-only Architectures like e.g. Hadoop – data change would yield a completely new batch computation • Focused on data age ▪ Rendezvous Architecture • Technically more complex to implement • Message Queue needed which might duplicate messages and increase latency • Focused on Predictive Models © 2019 Daniel Schulz. All rights reserved. 10 The Batch Layer Provides an almost Perfect 360° on Everything – the Speed Layer Intends to Predict Fast
  • 11. DSC_RendezvousArchitecture.pptx Time Schedule Topics Covered 1. Brief Motivation 2. Recap: Lambda Architecture 3. Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 11
  • 12. DSC_RendezvousArchitecture.pptx λ-Architecture ⊥ Rendezvous Architecture Are Different Approaches – both Might be Combined w/ One Another
  • 13. DSC_RendezvousArchitecture.pptx Objectives on the Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 13 Managing a Myriad of Predictive Analytics Models in Production ▪ Manage multiple models in production and alike environments ▪ Test-drive incumbent & challenging models against one another → enable Rapid App/Model Development (RAD) for DevOps projects ▪ Reproducibility & transparency of model’s predictions
  • 15. DSC_RendezvousArchitecture.pptx Starting with Simplest, Traditional Client/Server Architecture – Discrete, Direct & Stateless Response © 2019 Daniel Schulz. All rights reserved. 15 One Endpoint Answers Predictive Queries w/ all Information to the Model Encapsulated in Request Source & image courtesy: [MLL]
  • 16. DSC_RendezvousArchitecture.pptx Starting with Simplest, Traditional Client/Server Architecture – Discrete, Direct & Stateless Response • Shortcomings: • Only one Predictive Model at a time • Incumbent-only deployment – no challenging models for reference • No comparing many model’s accuracies against one another • Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback • Redeployment could result in downtimes iff no Green/Blue Deployments are used • Selected implementation ideas: • Tensorflow Serving • Flask • Various Application Servers © 2019 Daniel Schulz. All rights reserved. 16 This is the Simplest Architecture for AI Deployments
  • 17. DSC_RendezvousArchitecture.pptx Annotations for All Future Architecture Iterations • Annotation on HA: independent of the High-Availability aspect, the following is agnostic to whether there might be one or many replicas to take over in case of technical failure • Annotation on Multi-Threading: any Model might be one singular process, a Multi-Threading, Multi-Process program or even an Endpoint in a Microservice Architecture (with more complex calls behind it); the only relevant information is the Client/Server nature of it, where all information is enclosed in this stateless request and the model responds to it; external dependencies might apply or not • Annotation on Ensembles, etc.: this model might be an Ensemble or not – the sophistication and complexity underneath is secondary © 2019 Daniel Schulz. All rights reserved. 17 Applies to this Base Model & all Supporting, more Complex Versions of such
  • 19. DSC_RendezvousArchitecture.pptx Adding Technical High Availability Helps Little – a Load Balancer Forwards any One Request to Exactly One Model © 2019 Daniel Schulz. All rights reserved. 19 The Reverse Proxy Enables Green/Blue Deployments Only – Swapping Models Just Happens Faster Now Source & image courtesy: [MLL]
  • 20. DSC_RendezvousArchitecture.pptx Adding Technical High Availability Helps Little – a Load Balancer Forwards any One Request to Exactly One Model • Shortcomings: • Only one Predictive Model per Request at a time • Hence, still no challenging models for reference • No comparing many model’s accuracies against one another • Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback • Selected implementation ideas for Reverse Proxy: • Kubernetes or Kubernetes w/ Istio (Microservice Architectures) • Docker Swarm (Microservice Architectures) • Nginx • Flask • Various Application Servers © 2019 Daniel Schulz. All rights reserved. 20 This Architecture is Pretty Simple & a Standard Deployment Practice from Custom Solution Development
  • 22. DSC_RendezvousArchitecture.pptx Multiple Models in Parallel – Adding a Message Queue Enables Concurrency of Models © 2019 Daniel Schulz. All rights reserved. 22 But which Prediction to Choose in the End? Source & image courtesy: [MLL]
  • 23. DSC_RendezvousArchitecture.pptx Multiple Models in Parallel – Adding a Message Queue Enables Concurrency of Models • Shortcomings: • Somewhat challenging models for reference • No comparing many model’s accuracies against one another • Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback • Need to add return address (as an URI) and boolean flag to whether to return anything at all in the queue – for the consumer to respond to the original (HTTP) request, which might be completed then • Persistent Message Queues tend to be slower than pure in-memory ones; Spark Streaming working in micro-batches might have higher latencies as well • Selected implementation ideas for (persistent) Message Queues: • Apache Kafka • Apache Flume • MapR Streams • RabbitMQ, ActiveMQ, ZeroMQ, etc. • Apache Spark’s Streaming resp. Apache Flink’s Streaming as Producers resp. Consumers of such Message Queues © 2019 Daniel Schulz. All rights reserved. 23 But which Prediction to Choose in the End?
  • 25. DSC_RendezvousArchitecture.pptx The Model’s Rendezvous – a Scoring Stream Collects Various Predictions & Returns them to the Original Request’s Client © 2019 Daniel Schulz. All rights reserved. 25 Many Models – One Final Prediction Source & image courtesy: [MLL]
  • 26. DSC_RendezvousArchitecture.pptx Stateless Models are Easier to Replicate – but Data Augmentation Might be Helpful or Necessary © 2019 Daniel Schulz. All rights reserved. 26 Any Additional Piece of or State Information Helpful to our Models Shall be Added in One Central Place Source & image courtesy: [MLL]
  • 27. DSC_RendezvousArchitecture.pptx The Decoy Model – Collect Production Data for Debugging, Optimizations and Reproducibility © 2019 Daniel Schulz. All rights reserved. 27 The “Unit Tests” of AI Models in Production is Real-World Data Source & image courtesy: [MLL]
  • 28. DSC_RendezvousArchitecture.pptx Discussion when to Add External Information © 2019 Daniel Schulz. All rights reserved. 28 No Augmentation ▪ Models only receive request information ▪ More reliable for technical failure ▪ More overhead when many models fetch the same data – use caches here
  • 29. DSC_RendezvousArchitecture.pptx Discussion when to Add External Information © 2019 Daniel Schulz. All rights reserved. 29 No Augmentation ▪ Models only receive request information ▪ More reliable for technical failure ▪ More overhead when many models fetch the same data – use caches here All Augmentation in One Central Place ▪ Models receive all complete information ▪ Ideal case for reproducibility – as the “Decoy Model” stores all information for debugging and to explain predictions later-on ▪ Usually faster due to smaller overhead → Best Practice by Ellen Friedman & Ted Dunning, Chapter 3, sub-section “Stateful Models” in [MLL]
  • 30. DSC_RendezvousArchitecture.pptx Add Metrics & the Canary Model for Monitoring & Optimizations © 2019 Daniel Schulz. All rights reserved. 30 Metrics are Crucial to Compare Many Models Against One Another & Hence Supports Accuracy Improvements Source & image courtesy: [MLL]
  • 31. DSC_RendezvousArchitecture.pptx Add Metrics & the Canary Model for Monitoring & Optimizations © 2019 Daniel Schulz. All rights reserved. 31 Metrics ▪ Metrics monitor all model’s predictions and compares their performances ▪ Hence, we are able to judge challenging model’s performances compared to the incumbent model’s ones ▪ Metrics are helpful to detect outliers in predictions – • e.g. Adversarial Images, where models might predict obscure classifications • e.g. detect swayed models, shift in input data’s distributions, etc. ▪ Both technological SLA, timing & AI metrics • SLA metrics: latency, throughput, etc. • Timing metrics: computation time in threads, time for requests by source/endpoint, etc. • AI metrics: accuracy, error metrics, AUC, F-statistics, etc. The Canary Model ▪ Also a Best Practice by [MLL] for finding anomalies ▪ Is a rather dated model that keeps predicting to compare the newer models’ predictions with it – to help detect how production-ready they really are ▪ The difference in Canary Model and later ones is proof of progress – or lack thereof for the Data Scientists
  • 32. DSC_RendezvousArchitecture.pptx Rendezvous Architecture – a Mixture of Models in Harmony • Advantages: • Model “warm-up” in production-like environments • Switch models in an instant – un-deploy & deploy AI models swiftly • Introduce time guarantees: all models work in parallel like Cassandra queries • Mix simple, technically robust (not failing) models along w/ more sophisticated ones, which might break suddenly • Incumbent vs challenging: collect raw data and performance metrics for various models (e.g. XGBoost vs Random Forests; e.g. Linear Model vs SVM; e.g. PCA vs T-SNE) and differing versions in model streams (version 0.1, 0.2, …) • Backstop: when taking too long, a simpler, less robust model may answer as a backstop for more complex, more sophisticated models; the same applies when longer-term performance metrics might indicate another model would perform better © 2019 Daniel Schulz. All rights reserved. 32 My Suggestion for Reliable AI Systems Due to…
  • 33. DSC_RendezvousArchitecture.pptx Source & Image Courtesy from Book “Machine Learning Logistics” © 2019 Daniel Schulz. All rights reserved. 33 “Machine Learning Logistics” by Ellen Friedman & Ted Dunning ▪ Authors: Ellen Friedman & Ted Dunning ▪ Publisher: O'Reilly Media, Inc. ▪ Release Date: October 2017 ▪ ISBN: 9 7814 9199 7611 ▪ Picture source ID: MLL
  • 34. DSC_RendezvousArchitecture.pptx Source from Book “Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition” © 2019 Daniel Schulz. All rights reserved. 34 “Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow” by Aurélien Géron ▪ Author: Aurélien Géron ▪ Publisher: O'Reilly Media, Inc. ▪ Release Date: September 2019 ▪ ISBN: 9 7814 9203 2649
  • 36. DSC_RendezvousArchitecture.pptx Resumé on Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 36 Limitations ▪ Is no silver bullet – does not solve all logistical obstacles in AI projects ▪ Focusses on production-side architecture – development, test, QA and LTE environments might benefit from it ▪ Focus on technological, software architecture – does not cover ML Metrics, Hyperparameter Tuning, etc. ▪ Latencies increased a bit by Message-Queue-dependency
  • 37. DSC_RendezvousArchitecture.pptx Resumé on Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 37 Major Advantages ▪ Manage multiple models in production and alike environments ▪ Test-drive incumbent & challenging models against one another ▪ Reproducibility & transparency of model’s predictions
  • 38. DSC_RendezvousArchitecture.pptx Resumé on Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 38 Major Advantages ▪ Manage multiple models in production and alike environments ▪ Test-drive incumbent & challenging models against one another ▪ Reproducibility & transparency of model’s predictions Minor Benefits ▪ Collect real-world data for future development ▪ Establish baseline performance values for Predictive Analytics ▪ Rapid development due to default, fallback models ▪ Latency guarantees for model predictions resilient, robust Predictive models in modern Agile & DevOps projects
  • 39. DSC_RendezvousArchitecture.pptx Rendezvous Architecture is the Modern Bedrock of Robust Predictive models for Today’s Agile & DevOps Projects
  • 40. DSC_RendezvousArchitecture.pptx Thank You for Your Attention © 2019 Daniel Schulz. All rights reserved. 40 Please Feel Free to Ask any Open Questions, Suggestions or Voice Your Opinion…
  • 41. A global leader in consulting, technology services and digital transformation, Capgemini is at the forefront of innovation to address the entire breadth of clients’ opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50-year heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. It is a multicultural company of over 200,000 team members in more than 40 countries. The Group reported 2018 global revenues of EUR 13.2 billion. About Capgemini Learn more about us at www.capgemini.com This presentation contains information that may be privileged or confidential and is the property of the Capgemini Group. Copyright © 2019 Daniel Schulz. All rights reserved. People matter, results count.