QAing INFRASTRUCTURE
A QA's Role in DevOps World
by
Aroj P George
Harshad Wankhede
What is Infrastructure Testing?
Typical Development Environment - Single Server
[Diagram: Developer/QA Laptop hosting Web Server, App, LI, Service, Mocked Third-Party Services, DB]
Production Infrastructure – Multi-Tier
[Diagram: CDN; Load Balancer; App x3; Load Balancer; Service x3; Third Party Services; DB x3 (Replica Set)]
Application Ecosystem

Application

Libraries

Operational Processes
Network Services
System Services
Authentication and Authorization Rules
Virtualization

Storage Configuration
Network Configuration
Firewalls and VPN and Secure Access Rules
What if we don’t test the infrastructure?
What is DevOps?
It’s a mindset
Bridges the gap between Developers and Operations
What to Test?
Database Failure Scenarios
Do we have proper error handling, logging and alerting in place for the
scenarios below?
If the application is unable to read data:
- Network issues
- DB Primary is down
Does the DB Primary fail over if it goes down?
Does the application keep running?
Do we have alerts for the following?
Replication has failed
DB backups are not happening
DB high resource utilization
DB queries are very slow
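
The alert questions on this slide can be sketched as a small rule check. This is a minimal sketch, assuming a monitoring agent has already collected the metrics; the field names (`replication_lag_s`, `hours_since_backup`, and so on) and every threshold are hypothetical and not taken from the talk.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DbMetrics:
    """Snapshot of database health; all field names are hypothetical."""
    primary_reachable: bool
    replication_lag_s: float    # seconds the replica is behind the primary
    hours_since_backup: float   # age of the most recent successful backup
    cpu_percent: float          # resource utilization on the DB host
    slowest_query_ms: float     # slowest query seen in the sampling window


def db_alerts(m: DbMetrics,
              max_lag_s: float = 30.0,
              max_backup_age_h: float = 24.0,
              max_cpu_percent: float = 80.0,
              max_query_ms: float = 1000.0) -> List[str]:
    """Return one alert message per violated rule (thresholds are assumed)."""
    alerts = []
    if not m.primary_reachable:
        alerts.append("DB primary is down")
    if m.replication_lag_s > max_lag_s:
        alerts.append("Replication is lagging or has failed")
    if m.hours_since_backup > max_backup_age_h:
        alerts.append("DB backups are not happening")
    if m.cpu_percent > max_cpu_percent:
        alerts.append("DB high resource utilization")
    if m.slowest_query_ms > max_query_ms:
        alerts.append("DB queries are very slow")
    return alerts


if __name__ == "__main__":
    healthy = DbMetrics(True, 2.0, 6.0, 35.0, 120.0)
    degraded = DbMetrics(True, 95.0, 30.0, 91.0, 4200.0)
    print(db_alerts(healthy))
    print(db_alerts(degraded))
```

An infrastructure test would then break each condition deliberately (stop replication, skip a backup) and confirm that the matching alert actually fires.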
Application Performance Issues
Do we have monitoring for high resource utilization?
Are we tracking the response times for the application and dependent
web services?
Are we tracking errors generated in the logs?
Do we get alerts if a cron job fails to run or if it errors?
Ex: New Relic, Nagios, Ganglia metrics collection, running
performance testing scripts regularly
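
Tracking response times usually means watching a high percentile rather than the average. The sketch below is one way to do that, assuming response times have already been collected per endpoint; the endpoint names and the 500 ms budget are hypothetical, and the tools named on the slide expose their own, different interfaces.

```python
import math
from typing import Dict, List, Sequence


def percentile(samples: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile of the samples (0 < pct <= 100)."""
    if not samples:
        raise ValueError("at least one sample is required")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


def slow_endpoints(timings_ms: Dict[str, Sequence[float]],
                   budget_ms: float,
                   pct: float = 95.0) -> List[str]:
    """Names of endpoints whose pct-th percentile exceeds the budget."""
    return sorted(name for name, samples in timings_ms.items()
                  if percentile(samples, pct) > budget_ms)


if __name__ == "__main__":
    timings = {
        "/checkout": [120, 80, 100, 900, 110],
        "/search": [40, 55, 60, 48, 52],
    }
    print(slow_endpoints(timings, budget_ms=500))
```

The percentile makes the single 900 ms outlier on `/checkout` visible even though most of its requests are fast.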
Server or Services are Down
How does the system react to database failure?
How does the system react to application failure?
How does the system react to third party service failure?
What happens if the load balancer is down?
Ex: Chaos Monkey
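
The idea behind these questions, deliberately breaking a dependency and observing the reaction, can be shown in-process. This is a toy sketch and not Chaos Monkey's interface: `with_injected_failures`, `ServiceDown` and `recommendations` are hypothetical names, and a real experiment would terminate actual servers or services.

```python
import random
from typing import Callable


class ServiceDown(Exception):
    """Raised when the injected fault 'takes down' a dependency."""


def with_injected_failures(call: Callable[[], str],
                           failure_rate: float,
                           rng: random.Random) -> Callable[[], str]:
    """Wrap a dependency call so that it fails with the given probability."""
    def wrapper() -> str:
        if rng.random() < failure_rate:
            raise ServiceDown("injected failure")
        return call()
    return wrapper


def recommendations(third_party: Callable[[], str]) -> str:
    """Code under test: must degrade gracefully when the dependency is down."""
    try:
        return third_party()
    except ServiceDown:
        return "recommendations unavailable"


if __name__ == "__main__":
    rng = random.Random(42)
    healthy = lambda: "3 items"
    always_down = with_injected_failures(healthy, failure_rate=1.0, rng=rng)
    print(recommendations(always_down))
```

Passing a seeded `random.Random` keeps the experiment repeatable, which matters when the failure has to be reproduced later.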
Logging and Archiving
Have you configured appropriate logging?
Are the logs being properly indexed and rotated?
Do you maintain log history using tools like Splunk?
Are the DB backups being archived?
Application Errors
Do we show appropriate error pages for various errors?
500, 503, 404 pages
What happens if there is a sudden increase in the rate of errors?
Do we get alerted about it?
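
A sudden increase in the error rate can be detected by comparing a recent window of responses against a baseline window. The sketch below assumes HTTP status codes are available as plain integers; the spike factor of 3 and the 5% floor are hypothetical values chosen only for illustration.

```python
from typing import Sequence


def error_rate(statuses: Sequence[int]) -> float:
    """Fraction of responses with a 5xx status (0.0 for an empty window)."""
    if not statuses:
        return 0.0
    errors = sum(1 for status in statuses if 500 <= status <= 599)
    return errors / len(statuses)


def is_error_spike(baseline: Sequence[int],
                   recent: Sequence[int],
                   factor: float = 3.0,
                   min_rate: float = 0.05) -> bool:
    """True when the recent 5xx rate is both non-trivial and well above baseline."""
    recent_rate = error_rate(recent)
    return recent_rate >= min_rate and recent_rate > factor * error_rate(baseline)


if __name__ == "__main__":
    baseline = [200] * 98 + [500] * 2    # 2% errors during normal operation
    recent = [200] * 80 + [503] * 20     # 20% errors in the latest window
    print(is_error_spike(baseline, recent))
```

The `min_rate` floor keeps a quiet system from alerting when one error follows a long error-free baseline.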
Resource Consumption Issues
Have you configured alerting for CPU usage?
Have you configured alerting for Memory usage?
Have you configured alerts for Disk Utilization?
Warning if utilization reaches 80%
Critical if utilization reaches 95%
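
The 80% and 95% thresholds from this slide translate directly into a classification function. This is a minimal sketch using only the Python standard library; the function names are hypothetical, and monitoring tools such as Nagios express the same thresholds in their own configuration rather than in code like this.

```python
import shutil

WARNING_PERCENT = 80.0    # "Warning if utilization reaches 80%"
CRITICAL_PERCENT = 95.0   # "Critical if utilization reaches 95%"


def classify(percent_used: float) -> str:
    """Map a utilization percentage to an alert level."""
    if percent_used >= CRITICAL_PERCENT:
        return "CRITICAL"
    if percent_used >= WARNING_PERCENT:
        return "WARNING"
    return "OK"


def disk_percent_used(path: str = "/") -> float:
    """Percentage of the filesystem containing `path` that is in use."""
    usage = shutil.disk_usage(path)
    return usage.used / usage.total * 100


if __name__ == "__main__":
    used = disk_percent_used("/")
    print(f"disk {used:.1f}% used: {classify(used)}")
```

The same `classify` function applies unchanged to CPU and memory percentages once those are sampled.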
CPU Usage
Memory Usage
Network Usage
Third Party Service Failure
Have you configured alerting when third party services are down?
Security
Have you blocked access to admin configuration?
Have you configured firewall rules properly?
Penetration Testing
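
Firewall rules can be checked from the outside by trying to connect to ports that should be closed. The sketch below is a rough illustration under stated assumptions: the host, the port list and the allowed set are hypothetical inputs, a refused or timed-out connection is treated as "closed", and it is no substitute for a proper penetration test.

```python
import socket
from typing import Iterable, List, Set


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def unexpected_open_ports(host: str,
                          ports: Iterable[int],
                          allowed: Set[int]) -> List[int]:
    """Ports that accept connections although the firewall rules should block them."""
    return [port for port in ports
            if port not in allowed and port_is_open(host, port)]


if __name__ == "__main__":
    # Hypothetical policy: only HTTP and HTTPS may be reachable.
    print(unexpected_open_ports("127.0.0.1", [22, 80, 443, 27017], allowed={80, 443}))
```

Run such a check from a machine outside the firewall, since a check from inside the network only proves that the service is listening.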
Caching
Have you validated application caching?
Ex: CacheWarp
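
Validating a cache comes down to checking that repeated reads are served from the cache and that stale entries are reloaded. The sketch below is a generic illustration and does not use or imitate CacheWarp; `TtlCache` and its injectable clock are hypothetical constructs that make expiry testable without waiting.

```python
import time
from typing import Callable, Dict, Tuple


class TtlCache:
    """Tiny time-to-live cache with an injectable clock (for testing)."""

    def __init__(self, loader: Callable[[str], str], ttl_s: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._loader = loader
        self._ttl_s = ttl_s
        self._clock = clock
        self._entries: Dict[str, Tuple[str, float]] = {}  # key -> (value, expiry)

    def get(self, key: str) -> str:
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and now < entry[1]:
            return entry[0]                      # fresh entry: cache hit
        value = self._loader(key)                # missing or expired: reload
        self._entries[key] = (value, now + self._ttl_s)
        return value


if __name__ == "__main__":
    calls = []
    fake_now = [0.0]

    def load(key: str) -> str:
        calls.append(key)
        return key.upper()

    cache = TtlCache(load, ttl_s=60, clock=lambda: fake_now[0])
    cache.get("home")        # miss: loader is called
    cache.get("home")        # hit: loader is not called
    fake_now[0] = 61.0       # move the fake clock past the TTL
    cache.get("home")        # expired: loader is called again
    print(len(calls))
```

Counting loader calls is the actual assertion here: correct values alone do not show whether the cache was used.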
Cross functional/Non Functional Requirements
Testing infrastructure amounts to testing the cross-functional
requirements
Chef Recipe
Chef - Templatize Configuration
Chef Demo
Chef is a systems and cloud infrastructure automation framework
that makes it easy to deploy servers and applications
Infrastructure as Code
Knife is a Chef utility
Using the Knife SSH command to test the infrastructure
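
A test script can assemble `knife ssh` invocations of the kind shown in the speakers' notes and hand them to a process runner. This sketch only builds the argument list and does not execute anything; the `-i`, `-x` and `--attribute` options and the search query format are the ones in the notes, while the function name and the `certificates/example.pem` path are hypothetical.

```python
import shlex
from typing import List


def knife_ssh_argv(environment: str,
                   role: str,
                   command: str,
                   identity_file: str,
                   user: str = "root",
                   attribute: str = "automatic.ec2.public_hostname") -> List[str]:
    """Build the argument list for a `knife ssh` call against one role."""
    query = f"chef_environment:{environment} AND roles:{role}"
    return ["knife", "ssh", query, command,
            "-i", identity_file, "-x", user, "--attribute", attribute]


if __name__ == "__main__":
    argv = knife_ssh_argv("sit1perf-preview", "db", "service mongod status",
                          identity_file="certificates/example.pem")
    # Print a shell-safe rendering; a real test could pass argv to subprocess.run.
    print(" ".join(shlex.quote(arg) for arg in argv))
```

Keeping the command as a list avoids shell-quoting mistakes in the search query, which contains spaces.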
RSpec Tests
Demo
CacheWarp
Demo

https://rubygems.org/gems/cachewarp
QA Skills Required:
Good Understanding of the Infrastructure
Ability to foresee Issues
Command-line proficiency, Vim
Ability to debug and troubleshoot issues
Ability to write automated tests (RSpec)
Basic shell scripting knowledge
Myths about Infrastructure Testing
No need to test Infrastructure. If it works locally, it will work in any
environment
Application and Infrastructure Testing strategies are similar
Application Testing is more important than Infrastructure Testing
There aren’t too many different things to test in infrastructure
There is no business value in testing Infrastructure
Tools
Chef + Knife
Ganglia
Splunk
GO
Nagios
Mingle
CacheWarp
Vagrant
Vim
Tmux
PuTTY
Questions

Editor's Notes

  1. Load balancers, Replica set DB (Failover), CDN, Network Connectivity
  2. Application is just one part of the whole bigger stack. As you can see there are so many other components and concerns.
  3. Bridge gap.
  4. Ganglia 100% CPU metrics.
  5-7. knife environment list
       knife node list -E sit1perf-preview
       knife node show sit1perf-preview-db-l-a-01
       knife ssh "chef_environment:sit1perf-preview AND roles:db" "hostname" -i '/Users/aroj/code/sony/gwt/devops/certificates/gwtaws_TW_sit1a.pem' -x root --attribute automatic.ec2.public_hostname
       knife ssh "chef_environment:sit1perf-preview AND roles:db" "service mongod status" -i 'certificates/gwtaws_TW_sit1a.pem' -x root --attribute automatic.ec2.public_hostname
       knife ssh "chef_environment:sit1perf-preview AND roles:pdp" "service nginx status" -i 'certificates/gwtaws_TW_sit1a.pem' -x root --attribute automatic.ec2.public_hostname
  8. Integrated into the deployment pipeline