Monitoring and Managing Java Applications

1,780 views
1,656 views

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,780
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
15
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Cars.com … prep for superbowl
  • Monitoring and Managing Java Applications

    1. 1. Managing Java Applications@AloisReitbauer
    2. 2. 3
    3. 3. DevOpsto the rescue?
    4. 4. Don’t just buildsoftware that runs
    5. 5. Build software thatsomebody else can run
    6. 6. What is performance and how do we measure it?
    7. 7. How we define Performance• Response Time The time it takes for an application to respond to a user request.• Throughput The amount of concurrently processed request in a given time period. e.g.: requests/minute.• Availability Status of our systems. Are they up and running for us and our users?• Accuracy Is the response what the user actually expected? Are there any errors
    8. 8. Performance Pyramid Business Application Container System Performance
    9. 9. Types of Measurement • Cyclic Measurements – Are collected ar regular time intervals – Are time based – JMX, CPU, Memory • Event-based measurements – Are collected as a request occurs – Are transactional – Response Times, CPU consumption
    10. 10. Blind Men andElephants again
    11. 11. How can this happen?A: Our response time is 2.3 secondsB: Our response time is 1.5 secondsC: Our response time is 6 seconds
    12. 12. Measurement Aggregation• Minimum and Maximum The best and the worst request. Beware of outliers.• Average Sum/Count. Uses in many cases. Quality depends on actual values.• Median What 50 percent of our users see.• Percentile What n percent of our users see.
    13. 13. Use percentiles for event-based measures
    14. 14. Use averages forcyclical measures
    15. 15. Agree on Measurement Approach Page Load Time HTTP Request Time 95 % Servlet Time Network Browser Firewall Sniffer Web Server Application Server Request Time (max)
    16. 16. System and Container Metrics
    17. 17. Time-Based vs. Event-Based Measurement
    18. 18. Transactional Measurement
    19. 19. Transaction Flow Visualization
    20. 20. Focus on a specific transaction
    21. 21. Visual Problem Identification
    22. 22. How do we manage performance and solve problems?
    23. 23. Types of Problems • Data-Driven Problems Problems which occur for specific users, scenarios and which depend on (input) parameters e.g.: a search query • Load Driven Problems Problems which depend on the current system load. Usually occur at higher load. In most cases resource dependent. e.g.: increased response time with more users. • Environment Driven Problems Problems caused by factors outside the application. e.g.: hardware, network connectivity, etc.
    24. 24. Operations Main Tasks• Monitoring Collect all relevant KPIs and check against SLAs and baselines?• Alerting/Incident Management Inform ops about problems• Impact Analysis Analyze the impact of issues. Who is affected?• Isolation What is the cause of a problem and who must I talk to?• Diagnosis Why is there a problem and how can we fix it?
    25. 25. Key Performance Indicators/Metrics• Visitors and Requests How many people are using our site?• Response Times How fast do we service requests.?• Errors and Failed Transactions How many problems do we see and how often do they affect functionality?• Availability Can our systems currently be reached?• Utilization Metrics Do we have enough resources and are we using them efficiently?
    26. 26. … and in the real world?
    27. 27. Large-scale shopping platform
    28. 28. Performance Pyramid Business Application Container System Performance
    29. 29. System/Container Performance
    30. 30. Technical Response Time View
    31. 31. Page Category Performance
    32. 32. Performance of Key Transactions
    33. 33. Business View wit Performance Data
    34. 34. You can do evenmore (aka. Goodies)
    35. 35. Page Error Tracking
    36. 36. Third Party Management
    37. 37. Amazon EC2 Cost Monitoring
    38. 38. Alois Reitbauer @AloisReitbauerhttp://book.dynatrace.com http://blog.dynatrace.com
    39. 39. 41 © 2011 Compuware Corporation — All Rights Reserved

    ×