Talkbits service architecture and deployment


Published on

Published in: Technology
1 Like
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Talkbits service architecture and deployment

  1. 1. Talkbits service architecture and deployment. by Aleksei Kornev
  2. 2. Get stuff done...
  3. 3. Typical application
  4. 4. Architecture of talkbits service One way to configure service, logs, metrics. One way to package and deploy service. One way to lunch service. Bundled in one-jar.
  5. 5. One delivery unit. Contains: Java service In a single executable fat-jar. Installation script [Re]installs service on the machine, registers it in /etc/init.d Init.d script Contains instructions to start, stop, restart JVM and get quick status. Delivery
  6. 6. Logging Confuguration • SLF4J as an API, all other libraries redirected • Logback as a logging implementation • Each service logs to /var/log/talkbits/... (application logs, GC logs) • Daily rotation policy applied • Also sent to for aggregation, grouping etc. Aggregation • • sshfs for analyzing logs by means of linux tools such as grep, tail, less, etc. Aggregation alternatives, Flume, Scribe, etc...
  7. 7. Metrics Application metrics and health checks are implemented with CodaHale lib ( Codahale reports metrics via JMX. Jolokia JVM agent ( exposes JMX beans via REST (JSON / HTTP), using JVMs internal HTTP server. Monitoring agent use jolokia REST interface to fetch metrics and send them to monitoring system. All metrics are divided into common metrics (HW, JVM, etc) and service-specific metrics.
  8. 8. Deployment Fabric ( used for environments provisioning and services deployment. Process • Fabric script provisions new env (or uses existing) by cluster scheme • Amazon instances are automatically tagged with services list (i.e., instance roles) • Fabric script reads instance roles and deploys (redeploys) appropriate components.
  9. 9. Monitoring As monitoring platform we chose Datadog is a SaaS which is easy to integrate into your infrastucture. Datadog agent is opensourced and implemented in Python. There are many predefined checksets (plugins, or integrations) for popular products out of the box - including JVM, Cassandra, Zookeeper and ElasticSearch. Datadog provides REST API. Alternatives • Nagios, Zabbix - need to have bearded admin in team. We wanted to go SaaS and outsource infrastructure as far as possible. • Amazon CloudWatch, LogicMonitor, ManageEngine, etc. Process Each service has own monitoring agent instance on a single machine. If node has 'monitoring-agent' role in the roles tag of EC2 instance, monitoring agent will be installed for each service on this node.
  10. 10. Talkbits cluster structure
  11. 11. QA Aleksei Kornev Max Alexejev