Deploy and Destroy: Testing Environments - Michael Arenzon - DevOpsDays Tel Aviv 2018

Deploy and Destroy:
Testing Environments
Michael Arenzon, Platform Engineering Group

About
“AppsFlyer is the world's leading mobile attribution & marketing analytics platform, helping app marketers around the world make better decisions.”
120+ R&D
100s of (Micro-)Services
1000s of Servers
10s of Deployments / Day
80B+ Events / Day
85K+ Apps (Using SDK)
1000s of Partners
$17B Media Spend Measured

Agenda
[Chart: Velocity (y-axis) vs. # Devs (x-axis: 1-25, 25-100, 100+)]
Tooling?
Tests?
Documentation?
CI / Pipelines
* Based on my spectacular drawing skills

Agenda
[Chart: # Devs (25-100, 100+)]
Tooling?
Tests?
Documentation?
CI / Pipelines
[Dev / Testing] Environments!!

Test Pyramid
Reference: https://martinfowler.com/articles/practical-test-pyramid.html
Source: https://www.360logica.com/blog/sneak-peek-test-framework-test-pyramid-testing-pyramid/

Where It All Began
Source: https://github.com/facebook/create-react-app/issues/4071

Table?
Source: https://en.wikipedia.org/wiki/Dude,_Where%27s_My_Car%3F

Shared [Dev | Test] Environments
● Easy - When it’s too big for the laptop
● “Low” maintenance overhead
● Every developer maintains only their team’s stack
● Similarity to production
● Fuzzy Version Control - is it my version?
● Stability - who owns it? Who deploys new versions?
● State is mutable - shared database, shared state.
● Mutability!!!!!!!!!!!!
Pets vs. Cattle
Source: https://medium.com/@Joachim8675309/devops-concepts-pets-vs-cattle-2380b5aab313

Key Principles To Env. Creation
● Definition: Environment as a human-readable schema (JSON / YAML)
● Interaction: API Driven (UI/CLI Enhanced), Self-serve (!)
● Safety: Isolation (!!)
● Usability: Composable

Namespaces (to the rescue!)
● Environment == JSON file
● Self-serve, based on API
● Isolation between environments
● A building block
It’s a platform for creating testing environments easily, without dealing with infrastructure.
Based on two main concepts:
● Services (e.g., any micro-service)
● Resources (e.g., Kafka, MySQL)

A Namespace Definition (JSON)
{
  "namespace": "<UNIQUE NAME>",
  "envType": "[DEV, TEST, STG]",
  "team": "<TEAM NAME>",
  "services": [
    {
      "id": "<UNIQUE ID>",
      "image": {
        "service": "<SERVICE NAME>",
        "version": "<VERSION>"
      },
      … Common Details
    }
  ],
  "resources": [
    {
      "id": "<UNIQUE ID>",
      "name": "<RESOURCE NAME>",
      … Common Details
    }
  ],
  "globalEnvVariables": {
    "<KEY>": "<VALUE>"
  }
}
Sections: Basic Information (namespace, envType, team), Services Details (services), Resources Details (resources), Env. Variables (globalEnvVariables)
Common Details (Embedded)
...
"requirements": {
  "ports": [<PORT NUMBER>],
  "cpus": <CPUS AMOUNT>,
  "mem": <MEM AMOUNT>,
  "instances": <INS. AMOUNT>
},
"checks": [{
  "path": "/<HEALTH PATH>",
  … Timeout & retry settings
}],
"envVariables": {
  "<KEY>": "<VALUE>"
}
...
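As a concrete illustration of the namespace definition template, here is a minimal Python sketch that builds one definition and serializes it to JSON. Only the key names come from the slide. Every value (names, version, port, sizes, the `/health` path) is hypothetical, and embedding the common details directly inside the service entry is an assumption based on the "Embedded" label. The `missing_keys` helper is also illustrative, not part of the platform.

```python
import json

# Common details embedded in a service entry.
# Key names are from the slide; all values are made-up examples.
common_details = {
    "requirements": {"ports": [8080], "cpus": 0.5, "mem": 512, "instances": 1},
    "checks": [{"path": "/health"}],
    "envVariables": {"LOG_LEVEL": "debug"},
}

namespace_definition = {
    "namespace": "devopsdays-ns",
    "envType": "TEST",
    "team": "platform",  # hypothetical team name
    "services": [
        {
            "id": "helloworld",
            "image": {"service": "helloworld", "version": "1.0.0"},
            **common_details,
        }
    ],
    "resources": [{"id": "cache", "name": "memcached"}],
    "globalEnvVariables": {"REGION": "local"},
}


def missing_keys(definition):
    """Return the top-level keys from the slide that are absent (illustrative check)."""
    expected = ["namespace", "envType", "team", "services", "resources"]
    return [key for key in expected if key not in definition]


# The JSON document that would be stored as env.json.
env_json = json.dumps(namespace_definition, indent=2)
```

Keeping the definition as plain data means it can be reviewed, diffed, and versioned like any other file.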

Environment as a human-readable schema

Namespaces API* (REST)
● -X POST @env.json /namespace (CREATE)
● -X GET /namespace/{name}/schema (READ)
● -X PUT @env.json /namespace (UPDATE)
● -X DELETE /namespace/{name} (DELETE)
Fully self-serve: developers / QAs invoke the API to manage their environments.
* And more (/status, /refresh, /logs, etc.)
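The four calls above map naturally onto a small client. The following is a minimal sketch using only the Python standard library. The paths and HTTP verbs are the ones on the slide, while `BASE_URL`, the JSON content type, and all function names are assumptions made for illustration. Requests are built separately from being sent so that they can be inspected without a running API.

```python
import json
import urllib.request

BASE_URL = "http://namespaces.example.internal"  # hypothetical host, not from the talk


def build_request(method, path, body=None):
    """Build (but do not send) an HTTP request for the namespaces API."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Content-Type": "application/json"} if data is not None else {}
    return urllib.request.Request(BASE_URL + path, data=data, headers=headers, method=method)


def create_namespace(definition):
    return build_request("POST", "/namespace", definition)        # CREATE


def read_schema(name):
    return build_request("GET", f"/namespace/{name}/schema")      # READ


def update_namespace(definition):
    return build_request("PUT", "/namespace", definition)         # UPDATE


def delete_namespace(name):
    return build_request("DELETE", f"/namespace/{name}")          # DELETE


def send(request):
    """Perform the call and return the decoded response body."""
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

Usage would look like `send(create_namespace(definition))`, with `definition` loaded from `env.json`.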

Communication & Service Discovery
http://helloworld.devopsdays-ns.msp.af.com
For every namespace, we deploy another container called “mspproxy” (Traefik + custom) that handles routing of service communication.

Isolation (by DNS)
How can we make sure services / resources communicate inside their own environment?
resolv.conf FTW!
Source: https://en.wikipedia.org/wiki/Resolv.conf

Isolation (by DNS)
Services / resources communicate via “short name”:
$ curl http://helloworld/ => http://helloworld.devopsdays-ns.msp.af.com
$ telnet memcached 11211 => memcached.devopsdays-ns.msp.af.com
Reminiscent of the K8s Service model
Source: https://kubernetes.io/docs/concepts/services-networking/service/
What happens if you want to talk to a different namespace? Just add the environment name as a suffix:
$ curl http://helloworld.scaladays-ns => http://helloworld.scaladays-ns…
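One way short names like these can resolve is through a `search` list in resolv.conf. The sketch below simulates that expansion in Python. It is an assumption that the platform uses a search list in exactly this way, and the domain `env.example` and the namespaces `ns-a` / `ns-b` are hypothetical stand-ins. The ordering follows the documented `ndots` rule: names with at least `ndots` dots are tried as-is first, shorter names go through the search list first.

```python
def candidates(name, search_domains, ndots=1):
    """List the fully qualified names a resolver would try, in order."""
    if name.endswith("."):
        return [name.rstrip(".")]          # already absolute: search list is skipped
    expanded = [f"{name}.{domain}" for domain in search_domains]
    if name.count(".") >= ndots:
        return [name] + expanded           # enough dots: try the name as-is first
    return expanded + [name]               # short name: search list first


def resolve(name, search_domains, known_hosts):
    """Return the first candidate present in known_hosts, else None."""
    for fqdn in candidates(name, search_domains):
        if fqdn in known_hosts:
            return fqdn
    return None


# Hypothetical setup: containers in namespace "ns-a" get this search list.
SEARCH = ["ns-a.env.example", "env.example"]
HOSTS = {"helloworld.ns-a.env.example", "helloworld.ns-b.env.example"}
```

With this search list, `resolve("helloworld", SEARCH, HOSTS)` stays inside `ns-a`, while `resolve("helloworld.ns-b", SEARCH, HOSTS)` reaches the other namespace through the second search domain.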

Composability (Building Block)
API-driven by design, which means it can be invoked from anywhere you want:
● CI / CD Pipeline
● Selenium / UI Builds
● Scheduled Jobs
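For a pipeline step, the "deploy and destroy" pattern can be sketched as a context manager that guarantees teardown. This is an illustration, not the talk's implementation: `create`, `destroy`, and `run_tests` are hypothetical callables that a pipeline would supply (for example, thin wrappers over the POST and DELETE calls of the namespaces API).

```python
from contextlib import contextmanager


@contextmanager
def temporary_namespace(definition, create, destroy):
    """Create a namespace, hand its name to the caller, and always destroy it."""
    create(definition)
    try:
        yield definition["namespace"]
    finally:
        # Runs even if the tests raise, so no environment is left behind.
        destroy(definition["namespace"])


def run_pipeline_step(definition, create, destroy, run_tests):
    """Deploy the environment, run tests against it, then tear it down."""
    with temporary_namespace(definition, create, destroy) as name:
        return run_tests(name)
```

The same helper works for Selenium / UI builds and scheduled jobs, since it only depends on the three callables.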

But how does it look in production?

Infrastructure Utilization
● Most of the nodes run on spot instances.
● Containers are exposed on a random port; the proxy handles communication.
● For resources (e.g. Memcached), we expose their original port (e.g. memcached:11211).
● Can’t bind the same port twice on a single machine (hmm, proxy on port 80?)
- It is possible to use an overlay network with Mesos.

Service Discovery
● Marathon provides an event bus for all deployment events.
- Delayed messages
- Lost events
● Solution - scheduled sync + event subscription.
- “mspproxy”
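The "scheduled sync + event subscription" idea can be sketched as a routing table with two update paths: a fast one driven by events, and a slow one that replaces the whole state so that delayed or lost events are corrected on the next sync. The event shape (`type`, `service`, `address`) and the `task_up` / `task_down` names are hypothetical and do not reflect Marathon's actual event schema or the internals of “mspproxy”.

```python
class RoutingTable:
    """Maps a service name to its set of "host:port" backends for one namespace."""

    def __init__(self):
        self.backends = {}

    def apply_event(self, event):
        """Fast path: update routes from a single deployment event."""
        targets = self.backends.setdefault(event["service"], set())
        if event["type"] == "task_up":
            targets.add(event["address"])
        elif event["type"] == "task_down":
            targets.discard(event["address"])

    def full_sync(self, snapshot):
        """Slow path: replace everything with the scheduler's current state."""
        self.backends = {service: set(addrs) for service, addrs in snapshot.items()}
```

A scheduler loop would call `full_sync` on a timer and `apply_event` for each message received in between.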

Data Replication
● Multiple databases.
● Owners created a job that takes a snapshot from production.
- Without private / sensitive data.
- For large data stores, 1/X of the data (X >= 10) can be enough.
● They can inject the snapshot revision as part of the JSON.
- Wrapping the database container with a custom entrypoint script.
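A snapshot job along these lines could combine the two rules above: keep roughly 1/X of the rows and strip sensitive fields. The sketch below is one possible approach, not the owners' actual job. The field names in `SENSITIVE_FIELDS`, the `id` key, and hash-based sampling are all assumptions. Hashing the key keeps the sample deterministic, so repeated runs select the same rows.

```python
import hashlib

SENSITIVE_FIELDS = {"email", "device_id"}   # hypothetical column names


def keep_row(key, one_in):
    """Deterministically keep roughly 1 row in `one_in`, based on the row key."""
    digest = hashlib.sha256(str(key).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % one_in == 0


def make_snapshot(rows, key_field="id", one_in=10):
    """Sample the rows and drop sensitive fields from the ones that remain."""
    return [
        {k: v for k, v in row.items() if k not in SENSITIVE_FIELDS}
        for row in rows
        if keep_row(row[key_field], one_in)
    ]
```

Here `one_in` plays the role of X from the slide.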

Debuggability
● Dynamic environments mean resource (mem/cpu) constraints
- Out of memory.
● Env. is not stable - whose problem is it? DevOps?
- Out of memory.
● Logs? Metrics?
- Yes, please.

Another Deployment Tool?
● Yes and no.
- Ideally, we can leverage our existing deployment tool.
- The interface should be different.
● Creating a [test/dev] environment != production deployment.
- No need for canary / blue-green deployments.
- Fewer validations, more to the point.

2cents:
Treat it like any other product.
It’s a product.

Summary
● Static environments are OK. Dynamic environments are great.
● Simplify as much as possible. Self-serve & API-driven by design.
● The KPI is developer / QA happiness. They aren’t happy? Not good enough.
● Keep it layered. Building blocks are better than one over-engineered solution (:cough: K8s :cough:)
● Nobody cares about your Docker problems. Make it work. Observability and clear error messages are key.

Thank you!
* P.s., We’re Hiring!
michael.arenzon@appsflyer.com
