If you use open source Graphite software to monitor mission critical applications, you know well the challenges in running, managing and scaling Graphite. Graphite may be ok to get started, but it creates lots of cost and complexity and total-cost-of-ownership headaches as your environment scales.
Sumo Logic provides the industry’s first machine data analytics platform to natively ingest, index and analyze metrics and log data together in real-time.
In this webinar, we will show a live demo of how to:
Ingest graphite compatible metrics into the Sumo Logic service
Analyze and dashboard the metrics to get real-time real-time insights
Correlate Graphite metrics and logs to troubleshoot issues faster
See how easy it is to migrate from graphite to Sumo Logic.
Bring your Graphite-compatible metrics into Sumo Logic
1. Sumo Logic Confidential
March 2017
Sumo Logic ”How To” Webinar
Welcome.
To give everyone a
chance to successfully
connect, we’ll start at
10:05 AM Pacific.
Bring your Graphite-
compatible Metrics into
Sumo Logic
Mario Sanchez, Training
2. Sumo Logic Confidential
At the completion of this webinar, you will be able to…
Ingest Graphite-compatible metrics
Analyze and chart metrics to get real-time
analysis
Correlate Graphite metrics and logs to
troubleshoot issues faster
Join Beta Program for Metrics Rules
4. Sumo Logic Confidential
What are Metrics?
Numeric samples of data collected over time.
Can measure infrastructure:
OS performance, disk activity, application performance
Effective tool for monitoring, troubleshooting and identifying root
cause
Gain end-to-end visibility into app performance
Track KPIs over time
Determine if outage has occurred and restore service
Determine why an event occurred and how to prevent in the future
Documentation: https://help.sumologic.com/Metrics
7. Sumo Logic Confidential
Unified Logs and Metrics – Troubleshooting Demo
ALERT notifies of a
critical event
METRICS to identify
what is going on
LOGS to identify why
it’s happening
16. Sumo Logic Confidential
Metric Rules – BETA Feature
Continue to use your existing metrics structure
The rule is an instruction to parse an existing metric name so you can easily apply the
Sumo Logic syntax
Interested in joining the BETA program?
Reach us at
rules-beta-group@sumologic.com
17. Sumo Logic Confidential
In Summary…
Ingest Graphite-compatible metrics
Analyze and chart metrics to get real-time
analysis
Correlate Graphite metrics and logs
Faster Troubleshooting
Improved DEvOps Collaboration
Reduced TCO
18. Sumo Logic Confidential
Questions?
Search Documentation and Training
Library of Past Webinars
Search/Post @ Sumo Dojo
Search, Post, Respond
Submit Feature Requests
Vote for Existing Ones
Open a Support Case
Contact Customer Success
customer-success@sumologic.com
Prepping items:
1. For demo, to generate errors, Go here, http://csteam.travellogic.info, Press ctrl + g. You should get a popup that says ok on the screen.
2. Open SE Demo and open Dashboards TravelLogic Operation Dashboard (Use as starting point in Demo)
3. Open Slack to #travel_app channel (to talk about alerts)
Hi Everyone, I am …. and I … at Sumo Logic.
A few housekeeping items:
Everyone is on mute to avoid distractions
If you want to ask a question, please do so using the GT Webinar question panel
We’ll have a Q&A session at the end
This webinar will be recorded and shared with all of you, along with the slides
Most of you are very familiar logs, collecting them and analyzing them, therefore today, I’m going to focus on Metrics, and specifically on Graphite-compatible metrics.
At the completion …. (read slides)
Read slide
From a metrics perspective, Sumo Logic can ingest metrics from 3 main sources:
Host metrics from any of your host servers, regardless of OS. These give you insights into your CPU, Memory, Network, or Disk usage.
Your AWS metrics, which we collect via CloudWatch. These give you insights into your AWS environment: your ELB, RDS, ECS metrics.
Any Graphite-compatible metrics.
Today we will be focusing on the Graphite-compatible metrics. For these other 2 metric sources, you can watch the previous webinars.
But before we go any further on how to set it up, let me walk you thru a user demo.
Through this demo, I’ll show you how a UNIFIED Logs and Metrics solution can reduce your troubleshooting time.
1. First, I’ll show you how we can use Sumo’s Alerting capabilities to notify a user of a critical event
2. The Alert will direct us to a Dashboard where Metrics will help us identify WHAT is going on
3. Lastly, we’ll dive into the relevant logs to identify WHY this is happening
Let’s jump right into it!
When diving into metrics explain:
You might notice here that metric names do not seem to follow the Graphite format we’ve come to expect. This is because I’m using a feature called Metrics Rules that parses a complex Graphite name into an simple-to-read name-value pairs.
We will talk more about the Metrics Rules, but for now, let me show you how I could also create the same query using the Graphite name:
1 .Open the bookings tab which already has this query: Namespace=csteam type=bookings metric=success.count
2. Create a new query by searching for the metric in question: success.count
3. Select the one that reads: travel.csteam.*.*.bookings.success.count
Show how you can build it and the auto-complete understands Graphite format.
Let’s review how to Graphite metrics are ingested into Sumo Logic from tools like StatsD, CollectD, jmxStats, DropWizard or the likes…
This diagram can represent a simplified view of your current state:
You have a Server, a device, a container - and you’re looking to get metrics from your Operating System, or your Container metrics, or your Custom App code Metrics into an analytics tool so you can start getting visibility into your environment.
On the right side you would have a Graphite server collecting your metrics which can be sending to a Grafana or some similar tool for analysis.
Notice that StatsD requires a StatsD server that acts like a metrics aggregation Daemon before sending to the Graphite server.
Here’s how the landscape will look when you start sending Graphite-compatible metrics to Sumo.
CollectD, DropWizard, Metrics Libraries can all send their metrics straight to an Installed Collector which can forward to Sumo Logic. Additionally, this Collector do double-duty and can also ingest Host Metrics from your host server.
In the case of StatsD, the StatsD server can send to a Sumo Logic Collector.
Let me walk you through how to create these Metric Sources using our Setup Wizard.
As you can see on these screenshots, the setup is very similar for the various sources, what really changes is how you allow permission for Sumo to collect the metrics:
In CollectD you would edit the collectd.conf file to enable the Write Graphite plugin
In StatsD, you edit the localConfig.js to adda Graphite backend that will send data to Sumo’s installed collector
In DropWizard, you would set paramenters of the GraphiteReporter object to send metrics to Sumo
A query for these graphite-compatible metrics is made of 2 main parts:
The What – the metric itself
Any Operators we want to use to analyze the data
This one here is an interesting example, because it leads me to the next point:
For those of you who are familiar with Graphite, you are aware that it allows you to reference the metadata fields by position, in this case, by using _6, we are referring to the filed in the 6th position.
We’ve replicated this same functionality in Sumo Logic by default, but… we’ve taken it a step further.
Let’s take a step back:
The simple format for sending data to graphite is:
<metric path> <metric value> <metric timestamp>
For metric Path, the unofficial, but widely adopted naming scheme is:
a.b.c.d.metric,
where a, b, c, d are metadata
So taking advantage of this naming scheme, we can parse the values to provide key-value pair metadata
Open the tab named CPU and show how you can use a parse statement as the one above to parse and name the fields for a more meaningful statement.
Metric Rules allow you to create a permanent parsing rule for your metric names so you can easily reference their metadata with a key-value pair format.
Show and tell demo
If you are interested in joining this beta program for Metric rules, please email us at….
In Summary, we’ve learned to:
* Ingest Graphite-compatible metrics
Analyze and chart metrics to get real-time analysis
But more importantly to note, we’ve seen how we can correlate Graphite metrics and logs.
This helps us
1. Reduce the mean time to identification and resolution by providing contextual analysis
2. Improve your team’s collaboration by using a single source of truth that can be securely accessed and analyzed.
3. And lastly, a Unified Logs and Metrics solution eliminates the need for multiple tools, lowering your total cost of ownership.