SlideShare a Scribd company logo
1 of 11
Download to read offline
AWS CloudWatch
Cloud & Network Monitoring Services
DATA POINT - encapsulates the statistical data that Amazon CloudWatch computes from metric data.
METRICS – represent time-ordered set of data points. Shows CPU usage, memory status, disk usage etc. and
custom data. Defined by name, namespace and dimensions. Stored for 2 weeks.
NAMESPACES – containers for metrics. String defining during metric creation. Within namespaces, metrics are
isolated from each other. I.e: AWS/EC2, AWS/AutoScaling, AWS/SQS.
DIMENSIONS – pair of name and value to uniquely identify metric (like categories, filters). Using dimensions to
filter result sets that CloudWatch queries. Relate to existing metric.
TIME RANGE - defined as point-to-point time range
Statistics
Statistics allow to cumulate many data points and present them as human readable visualization.
Metric data aggregation over specified period of time, based on data points. The starting and ending points can be
as close together as 60 seconds, and as far apart as two weeks.
● Minimum – the lowest value during the specified period.
● Maximum – the highest value during the specified period.
● Sum – all values submitted for the matching metrics added together.
● SampleCount – number of data points for calculation.
● Average – Sum/SampleCount. Helps to increase/decrease resources as needed.
PERIOD – length of time associated with a specific statistic. Basic unit: second. Minimal value: 60. I.e, for statistics
aggregated into ten-minute blocks, set Period to 600. Important for alarms. (aka kubełek)
What metrics do we use:
● EC2 (external) instance metrics
Basic Monitoring – 7 pre-selected metric at 5-minute frequency for free
Detailed Monitoring – all metrics with 1-minute frequency for charge
● EC2 (internal) instance metrics
Extra monitoring (i.e. EBS)
Additional metrics:
● ELB
● RDS
CASE STUDY: Internal vs External CPU usage
CPU Steal (Noisy Neighbour)
CPU usage measurement from CloudWatch perspective differs the measurement from inside the EC2 instance. The
difference between these two metrics is what’s known as “CPU Steal”
Agent-based reporting (internal) shows how much you are using the instance.
Amazon reports total usage of instance (including other users exploitation) - external.
CloudWatch Logs
● System and customized logs measurement
● Pattern searching (pseudo, case-sensitive, PHP-Apache support)
● Logs groups
● Graph based on logs filters
● Alarm setting when metric crosses specific threshold
LOG STREAM - data exchange channel between the Logs Agent and AWS
LOG GROUP - represents grouped logs streams
METRIC FILTER - assigned to group text pattern dynamically creating single metric
CloudWatch Logs Agent (awslogs) – system service monitoring and synchronizing logs with AWS. Config in:
/etc/awslogs/awslogs.conf
Alarms
Automatically initiates planned actions when defined threshold occurs. One alarm watch single metric over
specified period. The actions the alarm performs is: SNS or Auto Scaling policy.
You are receiving this email because your Amazon CloudWatch Alarm "Ingestor_ErrorsCount" in the EU - Ireland
region has entered the ALARM state, because "Threshold Crossed: 1 datapoint (2.0) was greater than or equal to the
threshold (1.0)." at "Wednesday 07 October, 2015 19:29:29 UTC".
View this alarm in the AWS Management Console:
https://console.aws.amazon.com/cloudwatch/home?region=eu-west-1#s=Alarms&alarm=Ingestor_ErrorsCount
Alarm Details:
- Name: Ingestor_ErrorsCount
- Description:
- State Change: INSUFFICIENT_DATA -> ALARM
- Reason for State Change: Threshold Crossed: 1 datapoint (2.0) was greater than or equal to the threshold (1.0).
- Timestamp: Wednesday 07 October, 2015 19:29:29 UTC
Thank You!

More Related Content

Similar to cloud watch

Real Time Data Processing Using AWS Lambda - DevDay Austin 2017
Real Time Data Processing Using AWS Lambda - DevDay Austin 2017Real Time Data Processing Using AWS Lambda - DevDay Austin 2017
Real Time Data Processing Using AWS Lambda - DevDay Austin 2017
Amazon Web Services
 
Second review presentation
Second review presentationSecond review presentation
Second review presentation
Arvind Krishnaa
 

Similar to cloud watch (20)

Best Practices for Protecting Cloud Workloads - November 2016 Webinar Series
Best Practices for Protecting Cloud Workloads - November 2016 Webinar SeriesBest Practices for Protecting Cloud Workloads - November 2016 Webinar Series
Best Practices for Protecting Cloud Workloads - November 2016 Webinar Series
 
Optimizing Costs and Efficiency of AWS Services
Optimizing Costs and Efficiency of AWS Services Optimizing Costs and Efficiency of AWS Services
Optimizing Costs and Efficiency of AWS Services
 
Data Analytics on AWS
Data Analytics on AWSData Analytics on AWS
Data Analytics on AWS
 
Monitoring on Amazon AWS Cloud
Monitoring on Amazon AWS Cloud Monitoring on Amazon AWS Cloud
Monitoring on Amazon AWS Cloud
 
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
 
2014 IEEE JAVA CLOUD COMPUTING PROJECT Performance and cost evaluation of an ...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Performance and cost evaluation of an ...2014 IEEE JAVA CLOUD COMPUTING PROJECT Performance and cost evaluation of an ...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Performance and cost evaluation of an ...
 
Jamcracker Cloud Management Platform Updates: Devops Framework, Migration Pla...
Jamcracker Cloud Management Platform Updates: Devops Framework, Migration Pla...Jamcracker Cloud Management Platform Updates: Devops Framework, Migration Pla...
Jamcracker Cloud Management Platform Updates: Devops Framework, Migration Pla...
 
AWS re:Invent 2016: IoT Visualizations and Analytics (IOT306)
AWS re:Invent 2016: IoT Visualizations and Analytics (IOT306)AWS re:Invent 2016: IoT Visualizations and Analytics (IOT306)
AWS re:Invent 2016: IoT Visualizations and Analytics (IOT306)
 
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel AvivFinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
 
Aws coi7
Aws coi7Aws coi7
Aws coi7
 
Monitoring with Prometheus
Monitoring with PrometheusMonitoring with Prometheus
Monitoring with Prometheus
 
Introduction on Amazon EC2
 Introduction on Amazon EC2 Introduction on Amazon EC2
Introduction on Amazon EC2
 
Real Time Data Processing Using AWS Lambda - DevDay Austin 2017
Real Time Data Processing Using AWS Lambda - DevDay Austin 2017Real Time Data Processing Using AWS Lambda - DevDay Austin 2017
Real Time Data Processing Using AWS Lambda - DevDay Austin 2017
 
Training AWS: Module 9 - CloudWatch
Training AWS: Module 9 - CloudWatchTraining AWS: Module 9 - CloudWatch
Training AWS: Module 9 - CloudWatch
 
Second review presentation
Second review presentationSecond review presentation
Second review presentation
 
AWS April 2016 Webinar Series - Getting Started with Real-Time Data Analytics...
AWS April 2016 Webinar Series - Getting Started with Real-Time Data Analytics...AWS April 2016 Webinar Series - Getting Started with Real-Time Data Analytics...
AWS April 2016 Webinar Series - Getting Started with Real-Time Data Analytics...
 
Aws cloud infrastructure and cost estimation for angular site
Aws cloud infrastructure and cost estimation for angular siteAws cloud infrastructure and cost estimation for angular site
Aws cloud infrastructure and cost estimation for angular site
 
Full stack monitoring across apps & infrastructure with Azure Monitor
Full stack monitoring across apps & infrastructure with Azure MonitorFull stack monitoring across apps & infrastructure with Azure Monitor
Full stack monitoring across apps & infrastructure with Azure Monitor
 
Cassandra data modelling best practices
Cassandra data modelling best practicesCassandra data modelling best practices
Cassandra data modelling best practices
 
Introduction to Amazon EC2
Introduction to Amazon EC2Introduction to Amazon EC2
Introduction to Amazon EC2
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

cloud watch

  • 1.
  • 2. AWS CloudWatch Cloud & Network Monitoring Services
  • 3.
  • 4. DATA POINT - encapsulates the statistical data that Amazon CloudWatch computes from metric data. METRICS – represent time-ordered set of data points. Shows CPU usage, memory status, disk usage etc. and custom data. Defined by name, namespace and dimensions. Stored for 2 weeks. NAMESPACES – containers for metrics. String defining during metric creation. Within namespaces, metrics are isolated from each other. I.e: AWS/EC2, AWS/AutoScaling, AWS/SQS. DIMENSIONS – pair of name and value to uniquely identify metric (like categories, filters). Using dimensions to filter result sets that CloudWatch queries. Relate to existing metric. TIME RANGE - defined as point-to-point time range
  • 5. Statistics Statistics allow to cumulate many data points and present them as human readable visualization. Metric data aggregation over specified period of time, based on data points. The starting and ending points can be as close together as 60 seconds, and as far apart as two weeks. ● Minimum – the lowest value during the specified period. ● Maximum – the highest value during the specified period. ● Sum – all values submitted for the matching metrics added together. ● SampleCount – number of data points for calculation. ● Average – Sum/SampleCount. Helps to increase/decrease resources as needed. PERIOD – length of time associated with a specific statistic. Basic unit: second. Minimal value: 60. I.e, for statistics aggregated into ten-minute blocks, set Period to 600. Important for alarms. (aka kubełek)
  • 6.
  • 7. What metrics do we use: ● EC2 (external) instance metrics Basic Monitoring – 7 pre-selected metric at 5-minute frequency for free Detailed Monitoring – all metrics with 1-minute frequency for charge ● EC2 (internal) instance metrics Extra monitoring (i.e. EBS) Additional metrics: ● ELB ● RDS
  • 8. CASE STUDY: Internal vs External CPU usage CPU Steal (Noisy Neighbour) CPU usage measurement from CloudWatch perspective differs the measurement from inside the EC2 instance. The difference between these two metrics is what’s known as “CPU Steal” Agent-based reporting (internal) shows how much you are using the instance. Amazon reports total usage of instance (including other users exploitation) - external.
  • 9. CloudWatch Logs ● System and customized logs measurement ● Pattern searching (pseudo, case-sensitive, PHP-Apache support) ● Logs groups ● Graph based on logs filters ● Alarm setting when metric crosses specific threshold LOG STREAM - data exchange channel between the Logs Agent and AWS LOG GROUP - represents grouped logs streams METRIC FILTER - assigned to group text pattern dynamically creating single metric CloudWatch Logs Agent (awslogs) – system service monitoring and synchronizing logs with AWS. Config in: /etc/awslogs/awslogs.conf
  • 10. Alarms Automatically initiates planned actions when defined threshold occurs. One alarm watch single metric over specified period. The actions the alarm performs is: SNS or Auto Scaling policy. You are receiving this email because your Amazon CloudWatch Alarm "Ingestor_ErrorsCount" in the EU - Ireland region has entered the ALARM state, because "Threshold Crossed: 1 datapoint (2.0) was greater than or equal to the threshold (1.0)." at "Wednesday 07 October, 2015 19:29:29 UTC". View this alarm in the AWS Management Console: https://console.aws.amazon.com/cloudwatch/home?region=eu-west-1#s=Alarms&alarm=Ingestor_ErrorsCount Alarm Details: - Name: Ingestor_ErrorsCount - Description: - State Change: INSUFFICIENT_DATA -> ALARM - Reason for State Change: Threshold Crossed: 1 datapoint (2.0) was greater than or equal to the threshold (1.0). - Timestamp: Wednesday 07 October, 2015 19:29:29 UTC