Where many business segments quickly succumb to consolidation, the technologies that comprise the Cloud are instead organizing to interoperate.
In this session we’re going to look at ways to orchestrate complex collaborative environments, focusing on operating multi-server / multi-Cloud infrastructures.
Agenda:
Innovation and consolidation
Innovation in the Cloud industry
Microservices
The flipside of microservices
Orchestrating for the microservices ecosystem
Orchestrating for reliability
Disclaimer: I do not own the rights to images/graphs used in this presentation. Graphs on slides 11&12 from @berndruecker in https://www.slideshare.net/BerndRuecker/wjax-2017-microservice-collaboration.
7. The tech industry has a long history of market concentration.
► IBM in mainframe computers
► Microsoft in PC operating systems
► SAP & Oracle in enterprise applications
Is the Cloud Industry already consolidated?
8. Cloud vendors seem to be dedicated to playing together nicely – for now
► Collaboration makes sense to keep R&D costs low.
► Cloud Platforms are becoming powerful integrated systems.
► All major providers support nearly all dev environments (AWS has a growing Microsoft business, 40% of
Azure runs on Linux etc.).
11. Basic idea behind microservices:
Microservices break down software into functional components that interoperate / communicate to create an overall
application.
12. The complexity lies in orchestrating microservices:
Microservices do not live in isolation, their
complexity lies in the large-scale environment or
ecosystem they live in.
Building, standardizing, and maintaining this
infrastructure in a stable, scalable, fault-tolerant,
and reliable way is essential for successful
microservice operation.
14. Microservices ecosystem (four layers, top to bottom):
► Microservices
► Application Platform – DevOps teams, self-service dev tools etc.
► Communication – networks, DNS, RPCs, API endpoints, service discovery / registry, load-balancing etc.
► Hardware – actual machines, servers, physical computers: Amazon EC2, Google Cloud Platform, Microsoft Azure etc., or a private DC.
15. Other risk factors in the Microservice Ecosystem:
► Network failures (EC2 outage etc)
► Security breaches – the fewer the providers, the higher the risk
► Vendor lock-in:
- Vendor lock-in at the service layer (AWS Lambda, IBM Watson)
- Cloud vendors will stop competing on price once they’ve reached critical mass
► Cutbacks on innovation
17. 1/ AVAILABILITY
► Services need to be available locally
► Services need to be available globally / externally
On a public internet that is not smart enough to find the best available path.
On a public internet full of bots (consuming real user traffic) and DDoS attacks.
19. live.cedexis.com
► 15+ billion measurements per day
► 1+ billion daily end-user sessions
► From 50,000+ networks around the world
► Throughput varying by a factor of 10 on a single provider over the course of the day
► ~1,000 outages per CDN per day
Sample measurement feed:
10:21 PM – CDN1 167ms
10:22 PM – CDN2 94ms
10:22 PM – CLOUD 1 128ms
10:23 PM – CLOUD 2 230ms
10:26 PM – DC 1 153ms
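The measurement feed above is exactly the kind of data a traffic-steering decision can be made from. As a minimal sketch (provider names and latencies copied from the sample feed; real steering would of course use a rolling window of samples, not a single reading), picking the currently fastest provider is just a minimum over the latest measurements:

```python
# Latest RUM latency sample per provider, in milliseconds
# (values mirror the sample feed above).
samples = {
    "CDN1": 167,
    "CDN2": 94,
    "CLOUD1": 128,
    "CLOUD2": 230,
    "DC1": 153,
}

# Steer traffic to the provider with the lowest measured latency.
fastest = min(samples, key=samples.get)
# fastest == "CDN2" (94 ms)
```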
21. 2/ SPEED
► Services need to be fast locally
► Services need to be fast globally / externally
On a public internet that is not built to find the fastest path.
22. 3/ WORK ALL THE TIME, UNDER ANY CONDITION
► Services need to work all the time, everywhere – and every service has to be designed to work always
► Services need to work under heavy pressure
When a lot of traffic starts flowing in (need to scale up)
Under attack (DDoS etc)
When services depend on each other (none of them should be a SPOF) etc
24. 1/ LOCAL RELIABILITY
► Provide local fallback / alternative when the main endpoint is slow / unavailable / a source of errors.
Multiple endpoints for each critical microservice
Local Load-Balancing
Local health check monitoring
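The fallback logic above can be sketched in a few lines of Python: probe each endpoint’s health and route to the first healthy one. The endpoint addresses and the simulated health map are illustrative assumptions, not part of the talk.

```python
def pick_endpoint(endpoints, is_healthy):
    """Return the first endpoint whose health probe succeeds."""
    for endpoint in endpoints:
        if is_healthy(endpoint):
            return endpoint
    raise RuntimeError("no healthy endpoint available")

# Simulated health-check results: the primary is down, so traffic
# falls over to the first healthy alternative.
health = {"10.0.0.1": False, "10.0.0.2": True, "10.0.0.3": True}
chosen = pick_endpoint(["10.0.0.1", "10.0.0.2", "10.0.0.3"], health.get)
# chosen == "10.0.0.2"
```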
25. 2/ GLOBAL RELIABILITY
► Orchestrate a multi-homed infrastructure at the global level too
► Use a Global Load-Balancer in order to route traffic away from bottlenecks and outages based on:
Global (external) monitoring (health-checks)
Real end-user monitoring
Load/Error feedback (directly from the server / PoP / region)
Automate your traffic management with a software-defined solution
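One way to sketch such a software-defined routing decision (the scoring formula, region names and metric values are all invented for illustration): filter out regions failing external health checks, then rank the rest by RUM latency weighted by load feedback.

```python
def best_region(regions):
    """Pick a routing target from health, RUM and load signals."""
    healthy = [r for r in regions if r["healthy"]]
    # Lower RUM latency is better; heavy load inflates the score,
    # steering traffic away from near-saturated regions.
    return min(healthy, key=lambda r: r["rum_latency_ms"] * (1 + r["load"]))

regions = [
    {"name": "eu-west",  "healthy": True,  "rum_latency_ms": 94,  "load": 0.80},
    {"name": "us-east",  "healthy": True,  "rum_latency_ms": 128, "load": 0.20},
    {"name": "ap-south", "healthy": False, "rum_latency_ms": 60,  "load": 0.10},
]
target = best_region(regions)["name"]
# eu-west scores 94 * 1.8 = 169.2, us-east scores 128 * 1.2 = 153.6,
# and ap-south is excluded by its failed health check, so traffic
# goes to us-east despite its higher raw latency.
```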
26. 2/ GLOBAL RELIABILITY
Multi-server / Multi-Region / Multi-Cloud / Multi-CDN or hybrid architectures:
• Local Load-Balancing to select the optimum server / instance
• Continuously updated RUM & APM monitoring
• Global health-checks / monitoring
• Software-defined, automated Global Load-balancing
[Diagram: a continuous self-correcting loop combining Real User Monitoring (RUM cloud scoring: availability & latency), App Performance Monitoring (CPU & I/O), local health-checks (data center & application health), local load-balancing (select optimum server) and global, real-time load-balancing (select optimal cloud / cloud region).]
27. Make sure your services are multi-homed & orchestrated so that
they can collaborate together to provide a fast, reliable service - all
the time.
Good morning or good afternoon, depending on where you are in the world! I’m Aude, Cloud Evangelist at Cedexis. In this webinar today we’re going to look at ways to orchestrate complex collaborative environments, focusing on operating multi-server / multi-Cloud infrastructures.
Historically, industries have tended towards consolidation: because innovation was heavily technological, because entering an established market and expanding beyond a few percentage points of market share was too costly, because economies of scale made more sense… Just look at the airline or car manufacturing industries – there are only a handful of actors left!
But if we look beyond technology-heavy industries, there are even worse examples. Have you looked at how many companies are behind your favorite morning cereals or ice cream? I’ll give you a hint – I’d bet they are owned by one of these 10 corporations.
However, in our technology-obsessed era, the usual innovation cycle has been disrupted (we’ve come full circle – technology disrupting itself!). Apple disrupted the music industry with the iPod, and so many other industries afterwards. Tesla might upend car manufacturers and even battery makers. Uber, Airbnb… there are many examples of innovators disrupting established industries today – and by disruption we really mean they’re killing off the established actors. They grow fast, they raise a ton of money, and they become too big to buy. What has really changed is the current investment race in Silicon Valley – there’s so much money flowing in that innovators become too big to acquire, and consolidation becomes nearly impossible. Did you see how much Salesforce just spent on MuleSoft? $6.5B!
So where does that leave us in the Cloud industry?
If you look at the market share of the top three vendors, it looks like the Cloud industry is already consolidated. AWS, Google and Azure collectively own more than 75% of the Cloud Platform market. The same goes for Microsoft Dynamics, Oracle and Salesforce in the customer service and sales automation market. Cloud vendors should be competing heavily against each other, given there’s little room to gain market share other than by taking it from the other top-3 actors.
But that’s not what we see happening. On the contrary, cloud vendors seem dedicated to playing together nicely – at least for now. Just look at the video streaming ecosystem – you’ll practically use a different vendor for encoding, packaging, CRM, player, analytics, traffic steering… Collaboration makes sense: each actor focuses on one core technology. If one vendor had to develop each brick separately, it would be much too costly in R&D.
Another good example is marketplaces. If you look at AWS’s or Azure’s marketplaces you’ll see how much collaboration there is – there are even products that compete with the cloud platforms’ own offerings. These major vendors have become integrated systems.
Cloud platforms are simply adapting to the way users are consuming IT resources. It’s not that they don’t want to compete against each other, it’s that applications are now developed as microservices, a collection of technologies each developed and maintained by a multitude of actors.
Microservices have gained traction because they allow developers to make use of code and technologies that have been perfected externally. They aren’t plagued by the same scalability challenges posed by monolithic apps - they are optimized for scalability, efficiency and for developer velocity.
I’m sure every one of you here knows that, but I’ll say it nonetheless - the basic idea behind microservices is to break down software into functional components that can be scaled up or down to accommodate user needs. This of course fits nicely with the Cloud industry capabilities – it would be much more difficult to do on-prem.
Complexity doesn’t reside in moving monolithic apps to microservices, nor even in building these microservices. I’m not saying it’s easy, of course! But the real complexity lies in building a successful collaborative environment and infrastructure to run these microservices on.
The infrastructure has to sustain the microservice ecosystem. The goal of all infrastructure engineers and architects must be to remove the low-level operational concerns from microservice development and build a stable infrastructure that can scale, one that developers can easily build and run microservices on top of. And of course that’s easier said than done!
We can look at the microservice ecosystem as four different layers, where the lower 3 are the infrastructure: the hardware layer, the communication layer and the application platform. The top layer is where individual microservices live. A microservice will send some data in a standardized format over the network to another service (or perhaps to a message broker or another microservice’s API endpoint). The interoperability of these various layers and actors composing each layer is where most difficulties happen.
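As a toy illustration of what the communication layer does for the microservices layer (the service name, registry, and envelope format below are invented, not a real protocol): one microservice resolves a peer through service discovery and packages a standardized JSON request for its API endpoint; DNS, load-balancing and transport all live in the layers underneath that call.

```python
import json

def build_call(target_service, registry, payload):
    """Resolve a service name via a (toy) service registry and
    package a JSON request for its API endpoint."""
    endpoint = registry[target_service]  # service discovery lookup
    return {
        "url": f"http://{endpoint}/api/v1/handle",
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(payload),  # standardized wire format
    }

# Toy registry, as the communication layer would provide it.
registry = {"inventory": "inventory.internal:8080"}
call = build_call("inventory", registry, {"sku": "W-7", "qty": 2})
```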
Even if you manage to solve interoperability between the different layers, your infrastructure is still at risk. Networks fail, vendors get hit by DDoS attacks. And if you decide to pick one big vendor to run your services on, you’ll be vendor-locked. What happens when that vendor stops competing on price? Or stops investing in innovation in favor of opening new PoPs in regions of the world you have no interest in?
So, what should you look out for when orchestrating your microservices?
First and foremost, you need vendors that have good network availability. Your services need to be available locally, as well as globally. The internet is not built to help your content find the fastest path, bots and DDoS consume traffic, outages happen all the time.
You may have heard about AWS’ S3 outage last year, or EC2’s major failure a few months ago. Worldwide outages are now making headlines because so many companies rely on cloud services to operate. But what newspapers don’t report on are the ‘regular’ outages, occurring at the infrastructure or network level, that bring down access to an instance or to some regions.
At Cedexis we have a real-user monitoring tool called Radar. We basically have JS tags deployed on thousands of websites, testing Cloud services and network performance directly from end-users. We make on average 15 Billion measurements per day – allowing us to see the micro-outages that are happening all over the world, in real time.
And I can tell you there are many outages happening everyday!
As an example, a couple of weeks ago we saw that one of Azure’s US west regions went down. Did you hear about it? What that meant for end-users was at best a degraded user experience and at worst a complete service interruption.
But even under “normal” conditions, response times to access cloud providers fluctuate all the time.
Once you’ve looked at your vendors’ availability, you also want to make sure they are fast.
Fast locally: when your microservices are deployed within a controlled local environment.
Fast globally: when multiple microservices are delivered over clouds or SaaS solutions. There are huge differences in performance between the different Cloud platforms depending on where you’re connecting from. Even within one cloud platform, the very same AWS EU West region performs very differently depending on whether you’re connecting from BT, Sky or TalkTalk.
In the DevOps world, the concept of “site under maintenance” is long gone. Your services simply cannot be unavailable anymore. They need to work all the time, everywhere – and load in under 3 seconds anywhere in the world, if possible. They also need to work under pressure, whether it be DDoS, heavy traffic etc.
So how do you actually orchestrate for high availability, speed, and resiliency?
It’s necessary to get visibility on the conditions of your infrastructure in order to make sure that you are sending users to an endpoint that is available – and to an endpoint that can handle the load. Make sure you have multiple endpoints – different servers and/or different physical locations and/or different ISPs / internet connectivity – for the microservices to rely on.
Sometimes apps or microservices can look available from the outside but are down or close to overloaded on the inside. Local health monitoring will provide critical information through high-frequency checks, load feedback from the servers, circuit breakers, and local retries.
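As one concrete illustration of the circuit-breaker idea, here is a minimal in-process sketch (the thresholds and design are illustrative assumptions; production setups typically rely on a library or on the load-balancer itself): after a run of consecutive failures the breaker opens and fails fast instead of hammering an unhealthy backend, then allows a trial call after a cool-down.

```python
import time

class CircuitBreaker:
    """Open after `max_failures` consecutive errors; fail fast while
    open; allow a trial call again after `reset_after` seconds."""

    def __init__(self, max_failures=3, reset_after=30.0):
        self.max_failures = max_failures
        self.reset_after = reset_after
        self.failures = 0
        self.opened_at = None

    def call(self, fn):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_after:
                raise RuntimeError("circuit open: failing fast")
            # Cool-down elapsed: half-open, permit one trial call.
            self.opened_at = None
            self.failures = 0
        try:
            result = fn()
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = time.monotonic()
            raise
        self.failures = 0  # any success closes the circuit again
        return result
```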
In order to orchestrate these multiple endpoints we advocate using a local load-balancer (such as NetScaler, NGINX, HAProxy, Varnish etc.) to route traffic effectively across these multiple servers or local instances. This load-balancer should take into account the data flowing from your monitoring tools in order to make intelligent traffic management decisions.
Similarly, real-user monitoring and network information is key to global reliability.
We are strong advocates of multi-homed infrastructures – not just at the server / instance level, but using multiple datacenters, multiple cloud regions, multiple clouds or CDNs in order to help make your service 100% available for end-users.
We also recommend external health checks (up to the second for critical services requiring high availability) as well as real-user monitoring.
Why RUM? Because it provides real network information and lets you keep an eye on the previously mentioned outages and peering issues. It is particularly useful for fully dynamic, synchronous transactions such as recommendation tools or booking engines, and for cached / CDN-based content in multiple countries and locations.
Load/error feedback will allow your global load-balancer to automatically remove a PoP when it’s overused, close to unavailability, or a source of too many errors.
The advantage of combining an external (real-user based) and internal (also real-user based) vision is that you can pretty much let your infrastructure manage itself – you get to sleep at night again!
First you of course check that your Datacenter or instances are up and running, then you automatically add network data on how fast / available they are from the outside. Here we have three cloud regions that seem green from an external network perspective.
Now, this is when internal data comes in – load metrics will tell you which region is over-utilized. In our example, region C, using NetScaler, looks to be the best choice. So the global load-balancer, taking all of this information into account in real time, should send traffic over to that region – reducing in turn the load on the other regions. Over time, this enables a continuous self-correcting cycle across your different endpoints.
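The two-stage decision just described can be sketched as follows (region names and metric values are invented; “region-c” plays the role of region C in the example):

```python
def route(regions):
    """Stage 1: keep regions that look green from external network
    monitoring. Stage 2: pick the least loaded via internal metrics."""
    externally_green = [r for r in regions if r["network_ok"]]
    return min(externally_green, key=lambda r: r["load"])["name"]

regions = [
    {"name": "region-a", "network_ok": True, "load": 0.9},
    {"name": "region-b", "network_ok": True, "load": 0.7},
    {"name": "region-c", "network_ok": True, "load": 0.3},
]
target = route(regions)
# target == "region-c"; as traffic shifts there, the load on the other
# regions drops, feeding the continuous self-correcting cycle.
```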
To conclude, make sure your services are multi-homed, in order to be able to select the server, region, cloud that is the most available and has the best performance. Use internal and external monitoring data to feed network health information to your local and global load-balancers. And sleep again at night knowing that your infrastructure is a self-healing, reliable machine.
If you’d like more information on multi-Cloud or hybrid-Cloud architectures, please reach out at aude@cedexis.com or sales@cedexis.com.