Performance Tuning: You’re doing it wrong! by Kirk Pepperdine


Baruch Sadogursky

Devoxx Ignite 2014 session

Performance Tuning
You’re doing it wrong!

Number 10:
No performance
requirements
• No requirements == no problem

Number 9:
Tuning by folklore
• Performance happens in a context
• admin manuals are devoid of
context
• blogs don’t speak to your context
• Measure, don’t guess®
• use measurements to guide all
decisions
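
As a rough illustration of "measure, don't guess", the sketch below times a task repeatedly and reports latency percentiles instead of a single guess or an average. The class name `MeasureDontGuess`, the warm-up and run counts, and the string-building workload are all hypothetical stand-ins, not something from the talk. For serious microbenchmarks a dedicated harness such as JMH is the safer choice, since a hand-rolled loop like this is exposed to JIT and dead-code-elimination effects.

```java
import java.util.Arrays;

/** Minimal timing sketch: run a task repeatedly and report latency percentiles. */
final class MeasureDontGuess {

    /** Times each of `runs` invocations of `task`; returns latencies in nanoseconds, sorted ascending. */
    static long[] sample(Runnable task, int warmups, int runs) {
        for (int i = 0; i < warmups; i++) {
            task.run(); // give the JIT a chance to compile the hot path before measuring
        }
        long[] nanos = new long[runs];
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.run();
            nanos[i] = System.nanoTime() - start;
        }
        Arrays.sort(nanos);
        return nanos;
    }

    /** Nearest-rank percentile (0 < p <= 100) of an ascending-sorted array. */
    static long percentile(long[] sorted, double p) {
        int rank = (int) Math.ceil(p / 100.0 * sorted.length);
        return sorted[Math.max(0, rank - 1)];
    }

    public static void main(String[] args) {
        // Hypothetical workload standing in for the operation under investigation.
        Runnable work = () -> {
            StringBuilder sb = new StringBuilder();
            for (int i = 0; i < 10_000; i++) {
                sb.append(i);
            }
            if (sb.length() == 0) {
                throw new IllegalStateException("unreachable; keeps the result observable");
            }
        };
        long[] latencies = sample(work, 1_000, 10_000);
        System.out.printf("p50=%d ns, p99=%d ns, max=%d ns%n",
                percentile(latencies, 50),
                percentile(latencies, 99),
                latencies[latencies.length - 1]);
    }
}
```

Reporting the median alongside the tail (p99, max) matters because a tuning decision that improves the average can still leave the slowest requests untouched.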

Number 8:
Tuning by Google
-XX:InitialHeapSize=1610612736 -XX:MaxHeapSize=2147483648
-XX:MaxNewSize=697933824 -XX:NewSize=697933824 -XX:OldSize=1395867648
-XX:OldPLABSize=16
-XX:+PrintGC -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -XX:+PrintTenuringDistribution
-XX:+PrintGCApplicationStoppedTime
-XX:SurvivorRatio=1 -XX:TargetSurvivorRatio=90
-XX:-UseAdaptiveSizePolicy -XX:+PrintAdaptiveSizePolicy
-XX:+UseCompressedClassPointers -XX:+UseCompressedOops
-XX:-UseLargePagesIndividualAllocation
-XX:+UseConcMarkSweepGC -XX:+UseParNewGC -XX:+CMSParallelRemarkEnabled
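
Before copying a flag set like the one above, it can help to look at what the running JVM was actually started with and how much time it spends in garbage collection. The following is a minimal sketch using the standard `java.lang.management` API; the class name `JvmSnapshot` and the choice of what to print are assumptions for illustration, not part of the talk.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.List;

/** Prints the JVM arguments in effect and the GC activity observed so far. */
final class JvmSnapshot {

    /** JVM arguments actually passed to this process (not what a blog post suggests). */
    static List<String> inputArguments() {
        return ManagementFactory.getRuntimeMXBean().getInputArguments();
    }

    /** Accumulated GC time in milliseconds, summed over all collectors that report it. */
    static long totalGcMillis() {
        long total = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long millis = gc.getCollectionTime(); // -1 when the collector does not report a time
            if (millis > 0) {
                total += millis;
            }
        }
        return total;
    }

    public static void main(String[] args) {
        System.out.println("JVM arguments: " + inputArguments());
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            System.out.printf("%s: %d collections, %d ms%n",
                    gc.getName(), gc.getCollectionCount(), gc.getCollectionTime());
        }
        long uptimeMillis = ManagementFactory.getRuntimeMXBean().getUptime();
        System.out.printf("GC time so far: %d ms of %d ms uptime%n", totalGcMillis(), uptimeMillis);
    }
}
```

If the share of uptime spent in GC is already small, heap and collector flags are unlikely to be where the bottleneck is.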

Number 7:
That is how my system
works
• Often the diagnosis takes 15
minutes
• convincing the client that the
implementation is a problem can
take hours!!!

Number 6:
Not listening to the
system
• Presented groups of developers
with a performance problem
• not one group has solved it in 8
years!!!
• once they learn to read the
signals the problem can be
diagnosed in minutes!

Number 4:
Diving into the code
• “Know” what the problem is based on local
knowledge
• even if you tell them what the problem is
they won’t change course!!!
• In one exercise >97% of developers fail to
identify and fix the primary bottleneck
• even when they’ve been emphatically told
where they will fail

Number 3:
Poor system visibility
• Monitoring pukes a ton of data into your
lap
• often leaves people scratching their
heads trying to understand where to
start!!!
• Often the app’s only instrumentation is logging
• logs are optimized for development, not
ops

Number 2:
Give it more hardware
• If you don’t understand your bottleneck…
• Won’t work if you can’t use the hardware
you already have
• Seemingly perfectly parallelizable batch job
needed to run in 1 hour
• single server took 4 hours
• 4 servers took more than 8 hours to
complete!!!
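
The arithmetic behind the batch-job example can be sketched as follows. Ideal scaling predicts 4 hours / 4 servers = 1 hour, which is what the requirement assumed. To show how adding servers can make things slower instead, the sketch swaps in Gunther's Universal Scalability Law, which the slides do not mention; the contention and coherency coefficients used in `main` are made-up values chosen only to illustrate the shape of the effect, not measurements from the system described.

```java
/** Compares ideal scaling with a Universal Scalability Law (USL) estimate. */
final class ScalingModel {

    /** Runtime if the work divides perfectly across n servers. */
    static double idealHours(double singleServerHours, int n) {
        return singleServerHours / n;
    }

    /**
     * Relative capacity under the USL: C(N) = N / (1 + a(N-1) + bN(N-1)),
     * where a models contention for shared resources and b models coordination cost.
     */
    static double uslCapacity(int n, double contention, double coherency) {
        return n / (1.0 + contention * (n - 1) + coherency * n * (n - 1));
    }

    /** Estimated runtime on n servers under the USL. */
    static double modelHours(double singleServerHours, int n, double contention, double coherency) {
        return singleServerHours / uslCapacity(n, contention, coherency);
    }

    public static void main(String[] args) {
        double singleServerHours = 4.0; // from the slide: a single server took 4 hours
        int servers = 4;

        System.out.printf("ideal: %.1f h%n", idealHours(singleServerHours, servers));

        // Hypothetical coefficients: heavy contention on a shared resource plus coordination cost.
        double contention = 0.2;
        double coherency = 0.5;
        System.out.printf("with contention and coordination cost: %.1f h%n",
                modelHours(singleServerHours, servers, contention, coherency));
    }
}
```

With both coefficients at zero the model reduces to ideal scaling; once the coordination term dominates, capacity falls below that of a single server, which matches the pattern of four servers finishing later than one.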

Number 1:
Setting up a tiger team
• Seriously!!! resort to diagnosis by
committee?
• Clear indication that your team is
using technology that they either don’t
understand or can’t manage
• Nice way to trigger “tribal” behavior.

jClarity Illuminate
Performance Diagnostic
Engine
• Picks up where monitoring stops
• Minimize ambiguity
• uses heuristics to identify performance
bottlenecks in live system
• points out causal execution path!
• Profiles with minimal impact on (already
poor) performance
