Site reliability engineering (SRE) is a set of principles that applies software engineering practices to infrastructure and operations. SRE teams use automation and software development skills to manage systems and solve problems in order to create highly reliable and scalable software systems. SRE teams are responsible for availability, performance, monitoring, change management, emergency response, and capacity planning within an engineering organization. SRE focuses on automation, system design, and improvements to system resilience.