To provide reliable, safe and secure Data Centre and mission critical facilities, certain practices must be instituted and enforced. This document establishes basic standards and procedures for the centrally managed Data Centres of any private organization, both owned and leased, along with other mission critical support facilities. These procedures apply to all authorized employees as well as authorized business partners.
2. Contents
• Introduction
• Elements of DC Engineering Operation
• Team Structure
• Operation Procedures
• Maintenance Activities
• Trainings
• Audits
• Documentation
• Certifications
• Best Practices
• Common Mistakes
• Brief Summary
25-Mar-21 Data Centre Engineering Operations 2
3. Introduction
A Data Centre is a repository that houses computing facilities like
servers, routers, switches and firewalls, as well as supporting
components like backup equipment, fire suppression facilities and air
conditioning.
Data Centre operations comprise the systems and workflows within a
Data Centre that keep the Data Centre running. Data Centre operations
include installing and maintaining network resources and non-IT
equipment, ensuring Data Centre security and monitoring systems that
take care of power and cooling.
4. Elements of DC Engineering Operation
• Environmental health and safety
• Personnel Management
• Emergency preparedness and response
• Maintenance Management
• Change Management
• Documentation Management
• Trainings
• Infrastructure Management
• Quality Management
• Energy Management
• Financial Management
• Performance monitoring & Review
5. Team Structure
• Data Centre Facility Manager
• Shift Engineer
• HVAC Technician
• Electrical Technician
• DCIM/BMS Operator
• Security Personnel
6. Operation Procedures
• Safety requirements: Work Permit, PPE, MSDS Training etc.
• Conducting Maintenance Activities
• Authorization & Access Requirements
• Modification and Changes
• Decommissioning of equipment
• Electrical Power and HVAC requirement
• Equipment and cables
• Environmental cleanliness
7. Maintenance Activities
Preventive Maintenance: planned measures performed on equipment to keep failures from
occurring and to lessen the consequences of breakdowns.
Predictive Maintenance: monitors the performance and condition of equipment during
normal operation to reduce the likelihood of failures. It predicts when equipment failures
might occur so that maintenance can be performed before the failure happens. Using
real-time data, you can analyze the health of the equipment over a set period.
Corrective Maintenance: equipment is repaired or replaced after wear,
malfunction or breakdown.
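As a rough illustration of the predictive idea, the sketch below flags equipment for maintenance when the average of its most recent readings drifts above a limit. The function name, the window size, the 75 °C limit and the sample temperatures are all assumptions made for this example; a real site would take its thresholds from equipment data sheets and its readings from the DCIM/BMS.

```python
from statistics import mean

def needs_maintenance(readings, window=3, limit=75.0):
    """Return True when the mean of the last `window` readings exceeds `limit`.

    `window` and `limit` are illustrative values, not vendor thresholds.
    """
    if len(readings) < window:
        return False  # not enough data to judge a trend
    return mean(readings[-window:]) > limit

# Hypothetical bearing temperatures (deg C) sampled from a cooling unit fan
temps = [68.0, 69.5, 71.0, 74.0, 76.5, 78.0]
print(needs_maintenance(temps))
```

Averaging over a short window rather than reacting to a single reading keeps one noisy sample from raising a maintenance request.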
8. Trainings
Maximizing availability and minimizing human error in the critical systems environment
depend, in large part, on well-trained staff:
• Certification/Recertification
• Task/Certification alignment
• Emergency response drills
The training should be administered in a manner that allows new technicians to be
quickly brought to a minimum level of competency and achieve steady progress until they
are fully qualified in all facets of site operation.
A training program conducted in this way helps prevent errors, increases worker
confidence and satisfaction, and increases the amount of maintenance that can be
done in-house, thereby reducing maintenance costs.
9. Audits
Assessment of Data Centre operation is important, and it must cover the
following significant parameters:
• Data Centre Provider Information
• Network Details
• Building information
• Physical Space
• Physical Security
• Power Information
• Cooling information
• Fire detection/Suppression
• Services
10. Documentation
• Statutory Compliance
• Legal Compliance
• As-built drawings
• Asset database
• Preventive maintenance scope of work
• Maintenance schedule
• Critical facility work rules
• Safety programs
• Facility reports
• Walkthrough checklist
11. Certifications
A Data Centre's certifications reflect the quality
of its security, operations, engineering excellence
and energy efficiency.
• ISO 9001 (Quality Management System)
• ISO 27001 (Information Security Management System)
• ISO 20000 (IT Service Management)
• ISO 14001 (Environmental Management System)
12. Best Practices
To improve energy efficiency and economical
operation, there are a few DC operation best practices
that need to be adopted.
• Availability
• Cost-effectiveness
• Flexibility
• Manageability
• Security
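Energy efficiency, mentioned above, is commonly tracked with Power Usage Effectiveness (PUE): total facility energy divided by the energy delivered to IT equipment. The slides do not name this metric, so treat the following as an assumed illustration; the function name and the meter readings are hypothetical.

```python
def pue(total_facility_kwh, it_equipment_kwh):
    """Power Usage Effectiveness: total facility energy / IT equipment energy."""
    if it_equipment_kwh <= 0:
        raise ValueError("IT equipment energy must be positive")
    return total_facility_kwh / it_equipment_kwh

# Hypothetical monthly meter readings in kWh
print(pue(150_000, 100_000))  # 1.5
```

A value closer to 1.0 means less energy is spent on cooling, power distribution losses and other overhead for each unit of IT load.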
13. Common Mistakes
Common mistakes in Data Centre operation
management can lead to increased
downtime and hence business loss:
• Maintenance program is not driven by metrics
• Poor training
• Ineffective change management
• Failure to consistently test and evaluate
manpower skills
• Poor Documentation
• Failure to develop and implement a quality
control system
• Failure to automate, relying on manual operation instead
• Overconfidence
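One way to make a maintenance program metric-driven is to track mean time between failures (MTBF), mean time to repair (MTTR) and the availability they imply. The sketch below uses the standard relation availability = MTBF / (MTBF + MTTR); the function names and the sample figures are assumptions for illustration, and the helpers assume at least one recorded failure.

```python
def mtbf(uptime_hours, failures):
    """Mean time between failures: total uptime divided by number of failures."""
    return uptime_hours / failures

def mttr(repair_hours, failures):
    """Mean time to repair: total repair time divided by number of failures."""
    return repair_hours / failures

def availability(mtbf_hours, mttr_hours):
    """Steady-state availability as a fraction between 0 and 1."""
    return mtbf_hours / (mtbf_hours + mttr_hours)

# Hypothetical year for one UPS: 8750 h running, 10 h under repair, 2 failures
uptime, repair, failures = 8750.0, 10.0, 2
a = availability(mtbf(uptime, failures), mttr(repair, failures))
print(f"Availability: {a:.4%}")
```

Reviewing these figures per asset over time shows whether preventive and predictive work is actually reducing failures or shortening repairs.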
14. Brief Summary
After studying all the guidelines, processes and procedures to operate and maintain the Data
Centre, we must adopt the detailed guidelines listed below, which are the pillars
of any DC Operations & Maintenance.
• Environmental health and safety
• Emergency preparedness and response
• Maintenance management
• Infrastructure Management
• Energy Management
• Financial management
• Audits
• Compliance Management