SlideShare a Scribd company logo
1 of 35
Download to read offline
Distributed Tracing in the enterprise,
Nov. 09-11, 2021 / OSMC
Open Source Application Performance Monitoring
OSS APM
Manifest
1. Who am I
2. Who is “VHV Versicherung”
3. What is OSS APM / Distributed Tracing
• Generic instructions
• Challenges at VHV
• Concept of the solution
• Solution
/ /
Hi, I am Sascha Brechmann.
I have been working for the “VHV Versicherung” since more than 7 years.
My main tasks at Team Monitoring is the “Application-Performance-Monitoring”
• My Main Skills are:
• Linux-System-Administration
• Monitoring-Administration
• Availability-Monitoring (CheckMK)
• Performance-Monitoring (inspect-IT)
• Consulting everything around Monitoring
• Event-/Log-Monitoring (ELK-Stack)
OSS APM / OSMC2021 19.11.2021 3
OSS APM
Who am I
/ /
The VHV group, located in Hannover, is a growing corporation specialist in insurance and
foresight.
The business-brands “VHV Versicherung” and “Hannoversche” and “VHV Versicherung (Austria)”
are the “VHV Gruppe” a professional and forward-looking affiliate for insurance.
The business areas are property and casualty insurance, motor insurance and life insurance.
For its success, the VHV Group relies on the strengths of around 3,000 employees, modern
structures, efficient cost management and customer-oriented products.
OSS APM / OSMC2021 19.11.2021 4
OSS APM
Who is the VHV insurance (VHV Versicherung)
Manifest
1. Who am I
2. Who is “VHV Versicherung”
3. What is OSS APM / Distributed Tracing
• Generic instructions
• Challenges at VHV
• Concept of the solution
• Solution
/ /
Distributed tracing in context of APM means the observation of distributed transactions (“traces”)
across different applications / functions (“span”).
Metrics, TAGs and "logs / messages" can be attached to the individual spans and evaluated for
(problem) analysis.
With distributed tracing, the "spans" from all the applications involved must be forwarded to a
central backend for correlation and analysis.
OSS APM / OSMC2021 19.11.2021 6
General introduction
What is Distributed-Tracing
M
o
ni
t
o
ri
/ /
OSS APM / OSMC2021 19.11.2021 7
A picture says more than thousand words.
©
Ub
er
/ /
OSS APM / OSMC2021 19.11.2021 8
And that is what "distributed tracing" makes
of this image for a transaction.
©
Ub
er
/ /
OSS APM / OSMC2021 19.11.2021 9
How does distributed tracing work?
Simplified display without EUM / browser monitoring
©
Ja
eg
er-
Tr
aci
/ /
OSS APM / OSMC2021 19.11.2021 10
Different views of a single transaction
Dependencies vs. Timelines
©
Ja
eg
er-
Tr
aci
/
/
The benefits of (OSS) APM / Distributed Tracing
Part 1
• Cross-application measurement of (business-) processes
• Requests, error rates and duration of transactions (R.E.D. metrics)
• Enriching transactions with additional (meta) data for error analysis (HTTP status code; order code; ...)
• Correlation of transactions with logs
• Comparing transactions:
• Releases / periods
• End-User interactions / actions
19.11.2021
OSS APM / OSMC2021 11
/
/
The benefits of (OSS) APM / Distributed Tracing
Part 2
• Early identification of performance problems
• Avoidance of support tickets
• Earlier integration of performance measurement into the development process
• Runtime behaviour / analysis of alternative software solutions
• Easily drill down into the "software code"
• Striking display of complex metrics in dashboards
• Overview of dependencies / connections between applications
• Reveal hidden dependencies (why is application A talking to B?)
19.11.2021
OSS APM / OSMC2021 12
…
3. What is OSS APM / Distributed Tracing
• Generic instructions
• Challenges at the VHV
• Concept of the solution
• Solution
/ /
(License) cost reduction
• Commercial solutions can only be used to a limited extent
(quantities) of instances
• Not all stages / levels
• Not all applications
• No vendor lock-in
• “Hardly / no influence” on the development
• (Micro) service oriented
• The simplest possible exchange of individual components
• Commercial solutions are usually a monolith / black box and
scaling is only possible with the help of the manufacturer
• Cooperation with other (insurance) market participants
Organizational improvements
• Earliest possible inclusion of performance measurements in the
development process
• Better overview through a service dependency graph (flow map)
of the application landscape
• Easily create striking dashboards
• Self-service for the IT operations groups
• Less (know-how) dependence on “power users” thanks to a
comprehensive training concept
• Easy adaptation to/in the VHV processes
• Simple correlation between traces and (application) logs
• Acceptance of previous "blackboxes" (SAP / Mainframe / None
Java)
OSS APM / OSMC2021 19.11.2021 14
OSS APM / Distributed-Tracing
Problem statement at the VHV
Commercial vs. OSS Solution
…
3. What is OSS APM / Distributed Tracing
• Generic instructions
• Challenges at VHV
• Concept of the solution
• Solution
/ /
1. The desired (open source) solution should consist of several (micro) services.
2. The solution must rely on open standards and (mostly) existing open-source software.
3. The individual components should be easy to replace.
4. The solution should be as simple as possible to scale (especially horizontally, by adding nodes)
5. The solution should simply fit into the VHV staging concept (development, test, training, production stage)
6. The solution should, if possible, rely on technologies that are already in use at VHV
a) Linux (SLES)
b) Java
c) Elastic Search
d) PostgreSQL/SQLite
OSS APM / OSMC2021 19.11.2021 16
OSS APM / Distributed Tracing
Concept of solution
…
3. What is OSS APM / Distributed Tracing
• Generic instructions
• Challenges at VHV
• Concept of the solution
• Solution
/ /
1. Micro-Services + Standards:
a) OpenCensus-Collector;Jaeger-Collector;Elasticsearch;InfluxDB;Grafana;inspect-IT configsrv
b) Standards: OpenCensus + OpenMetrics + (coming soon OpenTelemetry)
2. VHV staging concept: A separate “inspect IT stack” for each VHV main stage
3. Simply scale:
a) Collectors, Elasticsearch, Grafana + inspect-IT services (EUM,config,baseline) scale horizontally.
b) InfluxDB is still a "problem child" here
4. Technologies:
a) Linux (SLES) for Client + Server / Windows Client
b) Java (OpenJDK; Adopt-JDK; Oracle-JDK; IBM-JDK) Version: >= 1.8
c) PostgreSQL/SQLite (currently more SQLite, but migration to PGSQL is planned)
d) Elasticsearch: It is used as storage for traces
e) InfluxDB: New Technology. Alternative metric storages in consideration
f) GO-Lang: Several components are implemented in GO-Lang, but are used as BINs and are not
compiled by themselves.
OSS APM / OSMC2021 19.11.2021 18
OSS APM / Distributed Tracing
Solution
/ /
OSS APM / OSMC2021 19.11.2021 19
„inspect-IT stack“ architecture sketch
See https://openapm.io for alternatives
B
r
o
w
s
e
r
J
a
v
a
-
A
p
p
li
c
a
ti
o
n
J
a
v
a
-
T
r
a
c
i
n
g
-
F
r
a
m
e
w
D
a
t
e
n
-
T
r
a
n
s
p
o
rt
D
a
t
a
-
C
o
rr
e
l
a
ti
o
n
D
a
t
e
n
-
S
t
o
r
a
g
e
D
a
t
a
-
A
n
a
ly
s
e
s
+
A
l
e
rt
i
/
/
Core functions of inspect-IT
- Distributed tracing (from application to application / SDG - flow map)
- Recording of transaction metrics and traces
- Detection of business transactions
- Display of HTTP / SQL Query / MQ / SMTP / LDAP / SFTP transactions
- Automatic injection of the TraceID into the (Application-)Log output
- Measuring of the "end-user experience" when using the (observed) application
19.11.2021
OSS APM / OSMC2021 20
/
/
„inspect-IT“: Who/How/What
Under the name “inspect-IT Stack” we summarize a collection of open-source tools, that together provide the
functions of “Distributed Tracing” in our solution.
• Inspect-IT Ocelot Agent => this is used to instrument the individual Java applications.
• Open Census-Collector => Collecting and routing the trace information
• Jaeger => Central tracing instance. Takes care of the preparation
(drill down) of tracing information.
• Elasticsearch => Storage backend for Jaeger / Traces -> Correlation Traces
• Telegraf => machine agent. Collects all (server) metrics and stores this in InfluxDB
• InfluxDB => TimeSeriesDB. Stores all metrics
• Grafana => Web UI for InfluxDB / Elasticsearch
19.11.2021
OSS APM / OSMC2021 21
/
A few figures for an overview of the use of inspect-IT at VHV
• “Inspect-IT Stack” Stages: 4 with currently 5 VMs per level (without tracing storage)
• VHV-Stages (Application-Stages): >20
• Number of inspect-IT agents: > 50 per VHV stage
• Number of different applications: >100
• Data volume metrics: ~ 60GB / inspect-IT stage (7 days of raw data, then aggregated)
• Amount of data traces:> 250 GB / inspect-IT stage and day (up to 10 days retention time)
• User: >100 (only IT)
19.11.2021
OSS APM / OSMC2021 22
/
Some Screenshots from our Environment
Service Dependency Graph
19.11.2021
OSS APM / OSMC2021 23
/
Some Screenshots from our Environment
Service Dependency Graph / Single application
19.11.2021
OSS APM / OSMC2021 24
/
Some Screenshots from our Environment
JVM Metric of one CRM Application
19.11.2021
OSS APM / OSMC2021 25
/
Some Screenshots from our Environment
HTTP Metrics of one CRM Application
19.11.2021
OSS APM / OSMC2021 26
/
Some Screenshots from our Environment
BT Metrics of one CRM Application
19.11.2021
OSS APM / OSMC2021 27
/
Some Screenshots from our Environment
EUM/Browser Metrics of one CRM Application
19.11.2021
OSS APM / OSMC2021 28
/ /
OSS APM / OSMC2021 19.11.2021 29
Backup
/ /
OSS APM / OSMC2021 19.11.2021 30
V
H
V
-
D
e
v
el
o
p
er
IT
-
O
E
n
d-
U
s
er
B
u
si
n
e
s
s-
U
s
er
In
te
D
a
s
h
b
o
ar
d-
U
s
er
St
or
a
g
e
St
or
a
g
e
S
t
o
r
a
g
e
D
a
s
h
b
o
ar
d-
U
s
er
/
OpenCensus/OpenTracing/OpenTelemetry:
Who/How/What
• OpenCensus is an open-source standard and describes the functions required to implement "distributed tracing“
(APIs/Libs – Protocols – Receiver/Exporter).
• OpenCensus was initially designed by Google and Microsoft and is largely based on Google's own "Distributed Tracing" implementation.
-> The commercial version is / was “Google Stacktrace” + “Azure AppInside”
• OpenCensus describes how "distributed tracing" information can be exchanged between the individual systems involved
• OpenCensus also provides a reference implementation (Libs) ready for a very wide range of programming languages
• OpenTelemetry is the successor project or the continuation of OpenCensus (v2) and OpenTracing (v2).
• OpenCensus + OpenTracing have now been merged into the "OpenTelemetry" project
(no two standards / implementations for the same goal)
• OpenTelemetry has an expanded focus that includes the following information levels:
• Traces, from OpenTelemetry, OpenCensus and OpenTracing. (Interoperability)
• Metrics: Acquisition of application metrics, output of metrics, transport of metrics, storage of metrics
• Logs: Collection and transport of log messages. Transfer to a log collector
19.11.2021
OSS APM / OSMC2021 31
/ /
International:
• Uber
• Redhat
• Ticketmaster
• Grafana
Deutsch:
• Hermes Logistik (Start 2018; Change from Monolith to Micro Service)
• Zalando
19.11.2021 32
Companies that already work on distributed tracing with
OpenTelemetry / Jaeger
OSS APM / OSMC2021
/
Why distributed tracing?
19.11.2021 33
OSS APM / OSMC2021
© Jaeger-Tracing
/
What is a Trace / Span?
- A „span“ is a unic Unit (Function) of a distributed System
- A “trace” is a collection of one or more Spans, represention a single transaction
19.11.2021 34
OSS APM / OSMC2021
©
Ja
eg
er-
Tr
aci
ng
/ /
OSS APM / OSMC2021 19.11.2021 35
DEMO
htt
ps:
//c
ont
eli
o.d
e/
me
hr/
pra
es
ent
ati
on-
co
nte
lio-

More Related Content

What's hot

Oracle SOA suite and Coherence dehydration
Oracle SOA suite and  Coherence dehydrationOracle SOA suite and  Coherence dehydration
Oracle SOA suite and Coherence dehydrationMichel Schildmeijer
 
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise EditionMarcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise EditionNagios
 
5 steps to take setting up a streamlined container pipeline
5 steps to take setting up a streamlined container pipeline5 steps to take setting up a streamlined container pipeline
5 steps to take setting up a streamlined container pipelineMichel Schildmeijer
 
OSMC 2021 | Handling 250K flows per second with OpenNMS: a case study
OSMC 2021 | Handling 250K flows per second with OpenNMS: a case studyOSMC 2021 | Handling 250K flows per second with OpenNMS: a case study
OSMC 2021 | Handling 250K flows per second with OpenNMS: a case studyNETWAYS
 
Nagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson OpeningNagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson OpeningNagios
 
Mike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service ChecksMike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service ChecksNagios
 
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.Nagios
 
Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?GetInData
 
Accelerating Software Development with NetApp's P4flex
Accelerating Software Development with NetApp's P4flexAccelerating Software Development with NetApp's P4flex
Accelerating Software Development with NetApp's P4flexPerforce
 
OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...
OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...
OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...Vietnam Open Infrastructure User Group
 
Enhancing OpenStack FWaaS for real world application
Enhancing OpenStack FWaaS for real world applicationEnhancing OpenStack FWaaS for real world application
Enhancing OpenStack FWaaS for real world applicationopenstackindia
 
OpenStack Tempest and REST API testing
OpenStack Tempest and REST API testingOpenStack Tempest and REST API testing
OpenStack Tempest and REST API testingopenstackindia
 
FreeSWITCH on Docker
FreeSWITCH on DockerFreeSWITCH on Docker
FreeSWITCH on Docker建澄 吳
 
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017Luciano Resende
 
OpenStack Enabling DevOps
OpenStack Enabling DevOpsOpenStack Enabling DevOps
OpenStack Enabling DevOpsCisco DevNet
 
Oracle Enterprise Manager - EM12c R5 Hybrid Cloud Management
Oracle Enterprise Manager - EM12c R5 Hybrid Cloud ManagementOracle Enterprise Manager - EM12c R5 Hybrid Cloud Management
Oracle Enterprise Manager - EM12c R5 Hybrid Cloud ManagementMarketingArrowECS_CZ
 
OpenStack for devops environment
OpenStack for devops environment OpenStack for devops environment
OpenStack for devops environment Orgad Kimchi
 
NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...
NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...
NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...Cisco DevNet
 
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...Daniel Krook
 

What's hot (20)

Oracle SOA suite and Coherence dehydration
Oracle SOA suite and  Coherence dehydrationOracle SOA suite and  Coherence dehydration
Oracle SOA suite and Coherence dehydration
 
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise EditionMarcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
 
5 steps to take setting up a streamlined container pipeline
5 steps to take setting up a streamlined container pipeline5 steps to take setting up a streamlined container pipeline
5 steps to take setting up a streamlined container pipeline
 
OSMC 2021 | Handling 250K flows per second with OpenNMS: a case study
OSMC 2021 | Handling 250K flows per second with OpenNMS: a case studyOSMC 2021 | Handling 250K flows per second with OpenNMS: a case study
OSMC 2021 | Handling 250K flows per second with OpenNMS: a case study
 
Nagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson OpeningNagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson Opening
 
Mike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service ChecksMike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service Checks
 
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
 
Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?
 
Accelerating Software Development with NetApp's P4flex
Accelerating Software Development with NetApp's P4flexAccelerating Software Development with NetApp's P4flex
Accelerating Software Development with NetApp's P4flex
 
OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...
OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...
OpenStack QA Tooling & How to use it for Production Cloud Testing | Ghanshyam...
 
Enhancing OpenStack FWaaS for real world application
Enhancing OpenStack FWaaS for real world applicationEnhancing OpenStack FWaaS for real world application
Enhancing OpenStack FWaaS for real world application
 
OpenStack Tempest and REST API testing
OpenStack Tempest and REST API testingOpenStack Tempest and REST API testing
OpenStack Tempest and REST API testing
 
FreeSWITCH on Docker
FreeSWITCH on DockerFreeSWITCH on Docker
FreeSWITCH on Docker
 
WebLogic and GraalVM
WebLogic and GraalVMWebLogic and GraalVM
WebLogic and GraalVM
 
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
 
OpenStack Enabling DevOps
OpenStack Enabling DevOpsOpenStack Enabling DevOps
OpenStack Enabling DevOps
 
Oracle Enterprise Manager - EM12c R5 Hybrid Cloud Management
Oracle Enterprise Manager - EM12c R5 Hybrid Cloud ManagementOracle Enterprise Manager - EM12c R5 Hybrid Cloud Management
Oracle Enterprise Manager - EM12c R5 Hybrid Cloud Management
 
OpenStack for devops environment
OpenStack for devops environment OpenStack for devops environment
OpenStack for devops environment
 
NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...
NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...
NetDevOps for the Network Dude: How to get started with API's, Ansible and Py...
 
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
 

Similar to OSMC 2021 || Open Source Application Performance Monitoring in the Enterprise

Introducing ONAP for OpenStack St Louis Meetup
Introducing ONAP for OpenStack St Louis MeetupIntroducing ONAP for OpenStack St Louis Meetup
Introducing ONAP for OpenStack St Louis Meetupdjzook
 
127801976 mobile-shop-management-system-documentation
127801976 mobile-shop-management-system-documentation127801976 mobile-shop-management-system-documentation
127801976 mobile-shop-management-system-documentationNitesh Kumar
 
DevSecOps - Security in DevOps
DevSecOps - Security in DevOpsDevSecOps - Security in DevOps
DevSecOps - Security in DevOpsAarno Aukia
 
DevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipelineDevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipelineAarno Aukia
 
WebLogic Event Server - Alexandre Alves, BEA
WebLogic Event Server - Alexandre Alves, BEAWebLogic Event Server - Alexandre Alves, BEA
WebLogic Event Server - Alexandre Alves, BEAmfrancis
 
Reduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based ApplicationsReduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based ApplicationsSL Corporation
 
muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"
muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"
muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"Daniel Bryant
 
Sybase unwired platform 1.2 developing secure blackberry applications
Sybase unwired platform 1.2   developing secure blackberry applicationsSybase unwired platform 1.2   developing secure blackberry applications
Sybase unwired platform 1.2 developing secure blackberry applicationsEdwin Ramos
 
Would Mr. Spok choose Open Source
Would Mr. Spok choose Open SourceWould Mr. Spok choose Open Source
Would Mr. Spok choose Open Sourcevlcinsky
 
Dubbo and Weidian's practice on micro-service architecture
Dubbo and Weidian's practice on micro-service architectureDubbo and Weidian's practice on micro-service architecture
Dubbo and Weidian's practice on micro-service architectureHuxing Zhang
 
Agentless System Crawler - InterConnect 2016
Agentless System Crawler - InterConnect 2016Agentless System Crawler - InterConnect 2016
Agentless System Crawler - InterConnect 2016Canturk Isci
 
Agile and continuous delivery – How IBM Watson Workspace is built
Agile and continuous delivery – How IBM Watson Workspace is builtAgile and continuous delivery – How IBM Watson Workspace is built
Agile and continuous delivery – How IBM Watson Workspace is builtVincent Burckhardt
 
Vivek kumar ray 5 year Java-Webservices-Bigdata
Vivek kumar ray 5 year Java-Webservices-BigdataVivek kumar ray 5 year Java-Webservices-Bigdata
Vivek kumar ray 5 year Java-Webservices-BigdataVivek Rai
 
2017-05-10 Gate4SPICE: "Legacy Software"
2017-05-10 Gate4SPICE: "Legacy Software"2017-05-10 Gate4SPICE: "Legacy Software"
2017-05-10 Gate4SPICE: "Legacy Software"Alexander Much
 
WLCG Grid Infrastructure Monitoring
WLCG Grid Infrastructure MonitoringWLCG Grid Infrastructure Monitoring
WLCG Grid Infrastructure MonitoringJames Casey
 
IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...
IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...
IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...OpenWhisk
 
DevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipelineDevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipelineAarno Aukia
 
Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...
Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...
Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...OVHcloud
 
Yohanes Widi Sono - Modern Development for Business Agility
Yohanes Widi Sono - Modern Development for Business AgilityYohanes Widi Sono - Modern Development for Business Agility
Yohanes Widi Sono - Modern Development for Business AgilityAgile Impact Conference
 

Similar to OSMC 2021 || Open Source Application Performance Monitoring in the Enterprise (20)

Introducing ONAP for OpenStack St Louis Meetup
Introducing ONAP for OpenStack St Louis MeetupIntroducing ONAP for OpenStack St Louis Meetup
Introducing ONAP for OpenStack St Louis Meetup
 
127801976 mobile-shop-management-system-documentation
127801976 mobile-shop-management-system-documentation127801976 mobile-shop-management-system-documentation
127801976 mobile-shop-management-system-documentation
 
DevSecOps - Security in DevOps
DevSecOps - Security in DevOpsDevSecOps - Security in DevOps
DevSecOps - Security in DevOps
 
DevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipelineDevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipeline
 
WebLogic Event Server - Alexandre Alves, BEA
WebLogic Event Server - Alexandre Alves, BEAWebLogic Event Server - Alexandre Alves, BEA
WebLogic Event Server - Alexandre Alves, BEA
 
Reduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based ApplicationsReduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based Applications
 
muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"
muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"
muCon 2019: "Creating an Effective Developer Experience for Cloud-Native Apps"
 
Sybase unwired platform 1.2 developing secure blackberry applications
Sybase unwired platform 1.2   developing secure blackberry applicationsSybase unwired platform 1.2   developing secure blackberry applications
Sybase unwired platform 1.2 developing secure blackberry applications
 
Would Mr. Spok choose Open Source
Would Mr. Spok choose Open SourceWould Mr. Spok choose Open Source
Would Mr. Spok choose Open Source
 
Dubbo and Weidian's practice on micro-service architecture
Dubbo and Weidian's practice on micro-service architectureDubbo and Weidian's practice on micro-service architecture
Dubbo and Weidian's practice on micro-service architecture
 
Agentless System Crawler - InterConnect 2016
Agentless System Crawler - InterConnect 2016Agentless System Crawler - InterConnect 2016
Agentless System Crawler - InterConnect 2016
 
Agile and continuous delivery – How IBM Watson Workspace is built
Agile and continuous delivery – How IBM Watson Workspace is builtAgile and continuous delivery – How IBM Watson Workspace is built
Agile and continuous delivery – How IBM Watson Workspace is built
 
Vivek kumar ray 5 year Java-Webservices-Bigdata
Vivek kumar ray 5 year Java-Webservices-BigdataVivek kumar ray 5 year Java-Webservices-Bigdata
Vivek kumar ray 5 year Java-Webservices-Bigdata
 
1 App,
1 App, 1 App,
1 App,
 
2017-05-10 Gate4SPICE: "Legacy Software"
2017-05-10 Gate4SPICE: "Legacy Software"2017-05-10 Gate4SPICE: "Legacy Software"
2017-05-10 Gate4SPICE: "Legacy Software"
 
WLCG Grid Infrastructure Monitoring
WLCG Grid Infrastructure MonitoringWLCG Grid Infrastructure Monitoring
WLCG Grid Infrastructure Monitoring
 
IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...
IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...
IBM Bluemix OpenWhisk: Serverless Conference 2016, London, UK: The Future of ...
 
DevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipelineDevSecOps: Bringing security to the DevOps pipeline
DevSecOps: Bringing security to the DevOps pipeline
 
Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...
Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...
Case study: How Cozy Cloud monitors every layer of its activity using OVH Met...
 
Yohanes Widi Sono - Modern Development for Business Agility
Yohanes Widi Sono - Modern Development for Business AgilityYohanes Widi Sono - Modern Development for Business Agility
Yohanes Widi Sono - Modern Development for Business Agility
 

Recently uploaded

ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Intelisync
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 

Recently uploaded (20)

ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 

OSMC 2021 || Open Source Application Performance Monitoring in the Enterprise

  • 1. Distributed Tracing in the enterprise, Nov. 09-11, 2021 / OSMC Open Source Application Performance Monitoring OSS APM
  • 2. Manifest 1. Who am I 2. Who is “VHV Versicherung” 3. What is OSS APM / Distributed Tracing • Generic instructions • Challenges at VHV • Concept of the solution • Solution
  • 3. / / Hi, I am Sascha Brechmann. I have been working for the “VHV Versicherung” since more than 7 years. My main tasks at Team Monitoring is the “Application-Performance-Monitoring” • My Main Skills are: • Linux-System-Administration • Monitoring-Administration • Availability-Monitoring (CheckMK) • Performance-Monitoring (inspect-IT) • Consulting everything around Monitoring • Event-/Log-Monitoring (ELK-Stack) OSS APM / OSMC2021 19.11.2021 3 OSS APM Who am I
  • 4. / / The VHV group, located in Hannover, is a growing corporation specialist in insurance and foresight. The business-brands “VHV Versicherung” and “Hannoversche” and “VHV Versicherung (Austria)” are the “VHV Gruppe” a professional and forward-looking affiliate for insurance. The business areas are property and casualty insurance, motor insurance and life insurance. For its success, the VHV Group relies on the strengths of around 3,000 employees, modern structures, efficient cost management and customer-oriented products. OSS APM / OSMC2021 19.11.2021 4 OSS APM Who is the VHV insurance (VHV Versicherung)
  • 5. Manifest 1. Who am I 2. Who is “VHV Versicherung” 3. What is OSS APM / Distributed Tracing • Generic instructions • Challenges at VHV • Concept of the solution • Solution
  • 6. / / Distributed tracing in context of APM means the observation of distributed transactions (“traces”) across different applications / functions (“span”). Metrics, TAGs and "logs / messages" can be attached to the individual spans and evaluated for (problem) analysis. With distributed tracing, the "spans" from all the applications involved must be forwarded to a central backend for correlation and analysis. OSS APM / OSMC2021 19.11.2021 6 General introduction What is Distributed-Tracing M o ni t o ri
  • 7. / / OSS APM / OSMC2021 19.11.2021 7 A picture says more than thousand words. © Ub er
  • 8. / / OSS APM / OSMC2021 19.11.2021 8 And that is what "distributed tracing" makes of this image for a transaction. © Ub er
  • 9. / / OSS APM / OSMC2021 19.11.2021 9 How does distributed tracing work? Simplified display without EUM / browser monitoring © Ja eg er- Tr aci
  • 10. / / OSS APM / OSMC2021 19.11.2021 10 Different views of a single transaction Dependencies vs. Timelines © Ja eg er- Tr aci
  • 11. / / The benefits of (OSS) APM / Distributed Tracing Part 1 • Cross-application measurement of (business-) processes • Requests, error rates and duration of transactions (R.E.D. metrics) • Enriching transactions with additional (meta) data for error analysis (HTTP status code; order code; ...) • Correlation of transactions with logs • Comparing transactions: • Releases / periods • End-User interactions / actions 19.11.2021 OSS APM / OSMC2021 11
  • 12. / / The benefits of (OSS) APM / Distributed Tracing Part 2 • Early identification of performance problems • Avoidance of support tickets • Earlier integration of performance measurement into the development process • Runtime behaviour / analysis of alternative software solutions • Easily drill down into the "software code" • Striking display of complex metrics in dashboards • Overview of dependencies / connections between applications • Reveal hidden dependencies (why is application A talking to B?) 19.11.2021 OSS APM / OSMC2021 12
  • 13. … 3. What is OSS APM / Distributed Tracing • Generic instructions • Challenges at the VHV • Concept of the solution • Solution
  • 14. / / (License) cost reduction • Commercial solutions can only be used to a limited extent (quantities) of instances • Not all stages / levels • Not all applications • No vendor lock-in • “Hardly / no influence” on the development • (Micro) service oriented • The simplest possible exchange of individual components • Commercial solutions are usually a monolith / black box and scaling is only possible with the help of the manufacturer • Cooperation with other (insurance) market participants Organizational improvements • Earliest possible inclusion of performance measurements in the development process • Better overview through a service dependency graph (flow map) of the application landscape • Easily create striking dashboards • Self-service for the IT operations groups • Less (know-how) dependence on “power users” thanks to a comprehensive training concept • Easy adaptation to/in the VHV processes • Simple correlation between traces and (application) logs • Acceptance of previous "blackboxes" (SAP / Mainframe / None Java) OSS APM / OSMC2021 19.11.2021 14 OSS APM / Distributed-Tracing Problem statement at the VHV Commercial vs. OSS Solution
  • 15. … 3. What is OSS APM / Distributed Tracing • Generic instructions • Challenges at VHV • Concept of the solution • Solution
  • 16. / / 1. The desired (open source) solution should consist of several (micro) services. 2. The solution must rely on open standards and (mostly) existing open-source software. 3. The individual components should be easy to replace. 4. The solution should be as simple as possible to scale (especially horizontally, by adding nodes) 5. The solution should simply fit into the VHV staging concept (development, test, training, production stage) 6. The solution should, if possible, rely on technologies that are already in use at VHV a) Linux (SLES) b) Java c) Elastic Search d) PostgreSQL/SQLite OSS APM / OSMC2021 19.11.2021 16 OSS APM / Distributed Tracing Concept of solution
  • 17. … 3. What is OSS APM / Distributed Tracing • Generic instructions • Challenges at VHV • Concept of the solution • Solution
  • 18. / / 1. Micro-Services + Standards: a) OpenCensus-Collector;Jaeger-Collector;Elasticsearch;InfluxDB;Grafana;inspect-IT configsrv b) Standards: OpenCensus + OpenMetrics + (coming soon OpenTelemetry) 2. VHV staging concept: A separate “inspect IT stack” for each VHV main stage 3. Simply scale: a) Collectors, Elasticsearch, Grafana + inspect-IT services (EUM,config,baseline) scale horizontally. b) InfluxDB is still a "problem child" here 4. Technologies: a) Linux (SLES) for Client + Server / Windows Client b) Java (OpenJDK; Adopt-JDK; Oracle-JDK; IBM-JDK) Version: >= 1.8 c) PostgreSQL/SQLite (currently more SQLite, but migration to PGSQL is planned) d) Elasticsearch: It is used as storage for traces e) InfluxDB: New Technology. Alternative metric storages in consideration f) GO-Lang: Several components are implemented in GO-Lang, but are used as BINs and are not compiled by themselves. OSS APM / OSMC2021 19.11.2021 18 OSS APM / Distributed Tracing Solution
  • 19. / / OSS APM / OSMC2021 19.11.2021 19 „inspect-IT stack“ architecture sketch See https://openapm.io for alternatives B r o w s e r J a v a - A p p li c a ti o n J a v a - T r a c i n g - F r a m e w D a t e n - T r a n s p o rt D a t a - C o rr e l a ti o n D a t e n - S t o r a g e D a t a - A n a ly s e s + A l e rt i
  • 20. / / Core functions of inspect-IT - Distributed tracing (from application to application / SDG - flow map) - Recording of transaction metrics and traces - Detection of business transactions - Display of HTTP / SQL Query / MQ / SMTP / LDAP / SFTP transactions - Automatic injection of the TraceID into the (Application-)Log output - Measuring of the "end-user experience" when using the (observed) application 19.11.2021 OSS APM / OSMC2021 20
  • 21. / / „inspect-IT“: Who/How/What Under the name “inspect-IT Stack” we summarize a collection of open-source tools, that together provide the functions of “Distributed Tracing” in our solution. • Inspect-IT Ocelot Agent => this is used to instrument the individual Java applications. • Open Census-Collector => Collecting and routing the trace information • Jaeger => Central tracing instance. Takes care of the preparation (drill down) of tracing information. • Elasticsearch => Storage backend for Jaeger / Traces -> Correlation Traces • Telegraf => machine agent. Collects all (server) metrics and stores this in InfluxDB • InfluxDB => TimeSeriesDB. Stores all metrics • Grafana => Web UI for InfluxDB / Elasticsearch 19.11.2021 OSS APM / OSMC2021 21
  • 22. / A few figures for an overview of the use of inspect-IT at VHV • “Inspect-IT Stack” Stages: 4 with currently 5 VMs per level (without tracing storage) • VHV-Stages (Application-Stages): >20 • Number of inspect-IT agents: > 50 per VHV stage • Number of different applications: >100 • Data volume metrics: ~ 60GB / inspect-IT stage (7 days of raw data, then aggregated) • Amount of data traces:> 250 GB / inspect-IT stage and day (up to 10 days retention time) • User: >100 (only IT) 19.11.2021 OSS APM / OSMC2021 22
  • 23. / Some Screenshots from our Environment Service Dependency Graph 19.11.2021 OSS APM / OSMC2021 23
  • 24. / Some Screenshots from our Environment Service Dependency Graph / Single application 19.11.2021 OSS APM / OSMC2021 24
  • 25. / Some Screenshots from our Environment JVM Metric of one CRM Application 19.11.2021 OSS APM / OSMC2021 25
  • 26. / Some Screenshots from our Environment HTTP Metrics of one CRM Application 19.11.2021 OSS APM / OSMC2021 26
  • 27. / Some Screenshots from our Environment BT Metrics of one CRM Application 19.11.2021 OSS APM / OSMC2021 27
  • 28. / Some Screenshots from our Environment EUM/Browser Metrics of one CRM Application 19.11.2021 OSS APM / OSMC2021 28
  • 29. / / OSS APM / OSMC2021 19.11.2021 29 Backup
  • 30. / / OSS APM / OSMC2021 19.11.2021 30 V H V - D e v el o p er IT - O E n d- U s er B u si n e s s- U s er In te D a s h b o ar d- U s er St or a g e St or a g e S t o r a g e D a s h b o ar d- U s er
  • 31. / OpenCensus/OpenTracing/OpenTelemetry: Who/How/What • OpenCensus is an open-source standard and describes the functions required to implement "distributed tracing“ (APIs/Libs – Protocols – Receiver/Exporter). • OpenCensus was initially designed by Google and Microsoft and is largely based on Google's own "Distributed Tracing" implementation. -> The commercial version is / was “Google Stacktrace” + “Azure AppInside” • OpenCensus describes how "distributed tracing" information can be exchanged between the individual systems involved • OpenCensus also provides a reference implementation (Libs) ready for a very wide range of programming languages • OpenTelemetry is the successor project or the continuation of OpenCensus (v2) and OpenTracing (v2). • OpenCensus + OpenTracing have now been merged into the "OpenTelemetry" project (no two standards / implementations for the same goal) • OpenTelemetry has an expanded focus that includes the following information levels: • Traces, from OpenTelemetry, OpenCensus and OpenTracing. (Interoperability) • Metrics: Acquisition of application metrics, output of metrics, transport of metrics, storage of metrics • Logs: Collection and transport of log messages. Transfer to a log collector 19.11.2021 OSS APM / OSMC2021 31
  • 32. / / International: • Uber • Redhat • Ticketmaster • Grafana Deutsch: • Hermes Logistik (Start 2018; Change from Monolith to Micro Service) • Zalando 19.11.2021 32 Companies that already work on distributed tracing with OpenTelemetry / Jaeger OSS APM / OSMC2021
  • 33. / Why distributed tracing? 19.11.2021 33 OSS APM / OSMC2021 © Jaeger-Tracing
  • 34. / What is a Trace / Span? - A „span“ is a unic Unit (Function) of a distributed System - A “trace” is a collection of one or more Spans, represention a single transaction 19.11.2021 34 OSS APM / OSMC2021 © Ja eg er- Tr aci ng
  • 35. / / OSS APM / OSMC2021 19.11.2021 35 DEMO htt ps: //c ont eli o.d e/ me hr/ pra es ent ati on- co nte lio-