The Maintenance Function - The Basics for Managing Maintenance


Published on

Want to know the secret to managing maintenance? Read this paper and take one item you learned and apply in your organization, just one. Your comments would be appreciated and could help others who are struggling with their maintenance organization. Help a fellow Maintenance and Reliability Professional today.

The Maintenance Function - The Basics for Managing Maintenance

  1. 1. Rules of Thumb in Reliability and Maintenance Engineering The Maintenance FunctionMaintenance to many organizations is a word which provides little value to an organization when infact it provides great value to any company when developed, managed, and discipline are applied.I think what is most misunderstood about maintenance is the true objective of the function. Theobjective of maintenance is to maintain the assets of a company so that they meet the reliabilityneeds at an optimal cost. In Webster’s dictionary it defines maintenance as:Maintenance To maintain  Keep in existing condition  Preserve, protect  Keep from failure or declineThe ultimate goal of maintenance is to provide optimal reliability which meets the business needsof the company. Many people do not know the definition of reliability and it is: “The probability or duration of failure-free performance under stated conditions”Now that we understand maintenance maintains reliability let’s see how a proactive maintenanceprocess works. John Day, one the most well-known proactive maintenance management advocateswho was formerly the maintenance and engineering manager for Alumax Mt Holly was my mentorand manager for a number of years. John has been spoken all over the world as to his model ofproactive maintenance. I wanted to provide a valuable insight into what a successful plantconsidered the definition of the word “Maintenance” to be and in which I align with very much as itrelates to the maintenance process.John Day titled his vision and process to maintenance as:STRATEGY TO ACHIEVE WORLD-CLASS PRODUCTION THROUGH RELIABILITYINTRODUCTIONAlumax of South Carolina is an aluminum smelter that produces in excess of 180,000 MT of primaryaluminum each year. It began operation in 1980 after a 2-year construction phase. The plant is thelast Greenfield aluminum smelter constructed in the U.S. Alumax of SC is a part of Alumax, Inc.,which has headquarters in Norcross, Georgia; a suburb of Atlanta, Georgia. Alumax, Inc. is the thirdlargest producer of primary aluminum in the U.S. and the fourth largest in North America.The vision of general management was that the new smelter located on the Mt. Holly Plantation nearCharleston, SC, would begin operations with a planned maintenance system that could be developedinto a total proactive system. At the time in 1978-79, there were no maintenance computer systemsavailable on the market with the capability to support and accomplish the desired objectives. ThusTSW of Atlanta, Georgia was brought on site to take not only the Alumax of S.C. maintenanceconcepts and develop a computer system, but they were to integrate all the plant business functions1
  2. 2. Rules of Thumb in Reliability and Maintenance Engineering one on-line common data base system available to all employees in their normal performance ofduties.Since the development and initial operation of the Alumax of SC maintenance management system,it has matured and rendered impressive results. These results have received extensive recognitionon a national and international level. The first major recognition came in 1984 when PlantEngineering magazine published a feature article about the system. Then in 1987 A.T. Kearney, aninternational management consultant headquartered in Chicago, performed a study to find the bestmaintenance operations in North America. Alumax of S.C. was selected as one of the seven "Best ofthe Best". And in 1989, Maintenance Technology magazine recognized Alumax of SC as the bestmaintenance operation in the U.S. within its category and also as the best overall maintenanceoperation in any category. Mt Holly’s proactive model is shown below in figure 1.MAINTENANCE APPROACHESFrom a basic point of view there are two maintenance approaches. One approach is reactive and theother is proactive. In practice there are many combinations of the basic approaches.The reactive system (see Figure 1.1.1) responds to a work request or identified need, usuallyproduction identified, and depends on rapid response measures if effective. The goals of thisapproach are to reduce response time to a minimum (the computer helps) and to reduce equipmentdown time to an acceptable level. This is the approach used by most operations today. It may wellincorporate what is termed as a preventative maintenance program and may use proactivetechnologies. Figure 1.1.1. Reactive Maintenance Model2
  3. 3. Rules of Thumb in Reliability and Maintenance Engineering proactive approach (see figure 1.1.2) responds primarily to equipment assessment andpredictive procedures. The overwhelming majority of corrective, preventative, and modification workis generated internally in the maintenance function as a result of inspections and predictiveprocedures. The goals of this method are continuous equipment performance to establishedspecifications, maintenance of productive capacity, and continuous improvement. Alumax of SCpractices the proactive method. The comments which follow are based upon the experience andresults of pursuing this vision of maintenance. Figure 1.1.2. Mt Holly’s Proactive Maintenance ModelMAINTENANCE MANAGEMENT PHILOSOPHYAlumax of SC began development of the maintenance management concept with the idea thatmaintenance would be planned and managed in a way that provides an efficient continuous operatingfacility at all times. Add to this that maintenance would also be treated as an investment rather than acost, and you have the comprehensive philosophy on which the maintenance management systemwas built. An investment is expected to show a positive return, and so should maintenance beexpected to improve the profitability of an operation. The management philosophy for maintenance isjust as important as the philosophy established for any business operation. For most industry,maintenance is a supervised function at best, with little real cost control. But it must be a managedfunction employing the best methods and systems available to produce profitable results that have apositive effect on profitability.The development of a philosophy to support the concept of proactive planned maintenance isimportant. It is believed that many maintenance management deficiencies or failures have resulted3
  4. 4. Rules of Thumb in Reliability and Maintenance Engineering having poorly constructed philosophies or the reliance upon procedures, systems, or popularprograms that have no real philosophical basis.THE FUNCTION AND CONTROL SYSTEMToday there is little disagreement that the function and control system of a good maintenancemanagement program must be computer based.Using the philosophy that maintenance management is to be considered in the same way that allother business functions are considered. It is difficult to justify any other approach other thancomplete integration of maintenance management functions with total organizational managementfunctions. The computer is the tool to use to accomplish this difficult and complex task.The computer, in an integrated operation, must be available for use by every member of themaintenance organization as well as all other plant employees who have a need. it is an essentialpart of the maintenance employees resources for accomplishing his work. It is just as important to amechanic or electrician as the tools in his toolbox or the analysis and measurement instruments thathe uses daily.The computer must supply meaningful and useful information to the user as opposed to normalcomputer data.A successful integration of data systems will tie together maintenance, warehouse, purchasing,accounting, engineering, and production in such a way that all parties must work together and havethe use of each others information. This is part of the answer to the question being asked almostuniversally, how do you break down the barriers between departments and get them to work as partof the whole or as a team. The computer system must be on line, available, and time responsive. Abatch system or semi-batch system will not provide the support needed for a dynamic, integrated,maintenance management system.In the integrated system with a common data base, data is entered only once and immediatelyupdates all other files so that its use is immediately available to all functional areas. This means thatanyone in any functional area can use or look at data in any other area, unless it is restricted. Somehave referred to this effect as the "fish bowl effect" since everything is visible to all. This stimulatescooperation, in fact, it dictates cooperation.WHAT IS MAINTENANCE?Everyone knows what maintenance is; or at least they have their own customized definition ofmaintenance. If the question is asked, words like fix, restore, replace, recondition, patch, rebuild, andrejuvenate will be repeated. And to some extent there is a place for these words or functions indefining maintenance. However, to key the definition of maintenance to these words or functions is tomiss the mark in understanding maintenance, especially if you wish to explore the philosophicalnature of the subject. Maintenance is the act of maintaining. The basis for maintaining is to keep,preserve, and protect. That is to keep in an existing state or preserve from failure or decline. Thereis a lot of difference between the thoughts contained in this definition and the words and functions4
  5. 5. Rules of Thumb in Reliability and Maintenance Engineering recalled by most people who are "knowledgeable" of the maintenance function; i.e., fixrestore, replace, recondition, etc.SPECIFICATIONIf we shift our defining thoughts to maintenance in the pure sense, we force ourselves to deal withkeeping, preserving, and protecting. But what are we to keep, protect, or preserve? You may thinkthat it is the machine, equipment, or plant, and that is true. But how are you to define the level towhich the machine, equipment, or plant is to be kept. One way would be to say - "keep it like new".At face value the concept sounds good, but it is more subjective than objective. The answer tomaintenance levels must be defined by a specification.A specification is a detailed precise presentation of that which is required. We must have aspecification for the maintenance of equipment and plant. In actual usage today the specification, if itexists, is not detailed or precise. A specification usually does exist informally in the mind of themechanic or management member even though they may be unable to recite it. This means that atbest, it is a variable, general -type specification. This kind of specification is defined in terms of and isdependent upon time available, personnel training level, pressure to produce a current order now,money allocated or available, or management opinion. Obviously, a specification like this will notqualify as a true specification, nor will it qualify as a supporting component of the act of maintaining.The true maintenance specification may be a vendor specification, a design specification, or aninternally developed specification. The specification must be precise and objective in itsrequirements. The maintenance system and organization must be designed to support a conceptbased on rational specifications. Detailed work plans and schedules may be constructed to providethe specification requirement at the maintenance level. In the maintaining context, the specification isnot a goal. It is a requirement that must be met. The maintenance system must be designed to meetthis requirement. The specification must be accepted as the "floor" or minimum acceptablemaintenance level. Variation that does occur should be above the specification level or floor. Thespecifications will probably be stated in terms of attributes and capacity.In reference to maintenance specifications, included are individual equipment specifications, processspecifications, and plant performance specifications.THE MAINTENANCE FUNCTIONThe maintenance department is responsible and accountable for maintenance. It is responsible forthe way equipment runs and looks and for the costs to achieve the required level of performance.This is not to say that the operator has no responsibility for the use of equipment when in his hands -he does. The point is that responsibility and accountability must be assigned to a single function orperson whether it be a mechanic or operator. To split responsibility between maintenance or anyother department where overlapping responsibility occurs is to establish an operation where no one isaccountable. Alumax of SC considers this a fundamental principle for effective operation ofmaintenance.The maintenance function is responsible for the frequency and level of maintenance. They areresponsible for the costs to maintain, which requires development of detailed budgets and control ofcosts to these budgets.5
  6. 6. Rules of Thumb in Reliability and Maintenance Engineering as the quality function in an organization should report to the top manager, so does themaintenance function for the same obvious reasons. This allows maintenance problems to be dealtwith in the best interest of the plant or company as a whole. Maintenance efforts and costs must notbe manipulated as a means for another department to achieve its desired costs results.Where the maintenance department or group is held responsible and accountable for maintenance,the relationship with other departments takes on new meaning. The maintenance department cantafford to have adversary relationships with others. They must have credibility and trust as the basisof interdepartmental relationships. This is an essential element for the successful operation of amaintenance management system. Figure 1.1.3. THE MAINTENANCE ORGANIZATIONThe organizational chart or better yet the organizational graphic (Figure 1.1.3) is constructed on thebasis that the central functional element for core maintenance is the Technical team. The relational(syntax) aspects of the organization are shown with concentric bands of teams. The nearer band ofteams represents the tighter relationship to the core teams. Radial connecting lines show a directrelationship to a team or band of teams. Concentric connecting lines show a more indirectrelationship between teams. The outer band of teams requires a RELATIONAL ORGANIZATIONALCHART similar to the maintenance team’s chart to define their close relationships and full relationshipto other plant teams. This particular chart is predicated on the relationship of all teams to central coremaintenance teams.6
  7. 7. Rules of Thumb in Reliability and Maintenance Engineering Teams - Core Maintenance - These teams perform core maintenance for the plant. Theyare composed of qualified electricians, mechanics, and technicians. The teams are assigned basedon a functional requirement plant wide or on the basis of a geographic area of responsibility. Thefocus, direction of the team, and individual team member needs are provided by an assignedmember of the facilitator and directional control team.Facilitator and Directional Control Team - Members of this team have been trained and qualifiedto provide team organizational dynamics and traditional supervisory functions as required. With thefacilitator, the team must address work performance by categories, administrating,training/safety/housekeeping, budgeting and cost control and information reporting as well as thetechnical requirements of the team. These members perform the necessary traditional supervisoryfunctions, especially related to personnel functions, for the technical teams.Work Distribution and Project Coordination Team - This team works with the Facilitator, Planning andEngineering teams to staff technical teams to meet work load requests, inventory requirements,contractor support, and field superintendence of engineering projects.Job Planning Team - This team works closely with the Technical teams and the Facilitator team toplan and schedule maintenance, overhaul, and contractor work. Where operators are doingmaintenance functions, the same applies.In addition, information and reports are prepared by this team for all other teams as required orrequested. Quality control of the data input is a responsibility of this team. Coordination ofproduction requirements must also be performed.Technical Assistance Team - This team is a resource to the Technical teams and Facilitator team forcontinuous improvements, modifications, trouble shooting, and corrective action.Materials Support Team - This team works with the Planning team, Facilitator team, and theTechnical teams to meet planned job requirements and emergency material requirements.Maintenance Management Team - This team provides overall coordination of maintenance andmaterial functions to meet the plant capacity requirement. Overview of budget and cost control isalso provided.User/Operator Maintenance Team - This is a team of designated operators who perform assignedand scheduled maintenance work. They must be selected, trained and qualified prior to beingassigned to this team.Plant Engineering Team - This team provides projected management for the Plant capital budgetprogram. They provide consulting and troubleshooting to the Technical Teams on an as requestedbasis.Other teams in the outer band of the organizational chart must be specifically defined by individualrelational organization charts.7
  8. 8. Rules of Thumb in Reliability and Maintenance Engineering each of the above teams, a detailed performance requirement document must be developed.Individual team members are guided by a specific job performance document. These documentsdetail the vision, mission, processes used, and strategies employed.Does the maintenance function provide a service or produce a product? Again, definition is importantin the development of this part of the philosophy. Service is defined as a useful labor that does notproduce a tangible commodity. A product is something that is produced, usually tangible, butdefinitely measurable. In the case of the maintenance function and the development of thisphilosophy, both a service and a product are considered as an output of maintenance. The currentthinking which is related to traditional maintenance (reactive maintenance) suggests that themaintenance function is for the most part a service function. But the philosophy being developedhere considers the maintenance function as the provider of a product with a small but limited servicecomponent. Consider the product produced by maintenance to be capacity (Production/Plantcapacity). Writers on the subject of maintenance have suggested this concept in the past, but littlehas been made of developing the idea to date. A predominate service approach to maintenance, asis currently practiced, is a reactive mode of operation, and is typical of most t maintenance operationstoday. React means response to stimulus. Most maintenance operations today are designed torespond to the stimulus of breakdown and the work order request, except for small efforts related topreventative maintenance and predictive maintenance, usually less than 25% of labor hours worked.This simply means that the maintenance function must be notified (stimulated) of a problem orservice requirement by some means, usually by someone outside of the maintenance organization,then maintenance reacts. Rapid response is the "score card" of this system.It is being suggested by this proactive philosophy that the maintenance function be addressed as theproducer of the product “capacity”. Capacity is measured in units of production or output (or uptime). A total proactive system must specifically be designed to produce capacity (product). If themaintenance function is to be classified as proactive, it cannot stand by and wait for someone to callor make a request. In a total proactive approach, maintenance must be responsible and accountablefor the capacity and capability of all equipment and facilities. The function must provide a facility andequipment that performs to specification and produces the product (capacity). Stated again, themaintenance function is a process that produces capacity which is the product. See Table #2 for amore detailed analysis of service vs capacity.For more information contact Ricky Smith at - This information camefrom Chapter 1 in the book “Rules of Thumb for Maintenance and Reliability Engineers”.8
  9. 9. Rules of Thumb in Reliability and Maintenance Engineering Figure 1.1.4.Companies who have adopted John Day’s philosophy and strategy have achieved results beyondwhat was known within a company. One company of many companies who were successful was alarge manufacturing company. Once senior management understood and adopted John’sphilosophy and approach it resulted in:1. Increase plant capacity by $12 million dollars in the first year2. Deferred a large capital project because the capacity it was to provide was found as part of what is called the “hidden factory”3. Eliminated the need to hire a projected 12 additional maintenance staff members4. The plant maintenance staff was reduced by 20% over the following three years because of attritionThe approach to proactive maintenance is not magic however implementing the process is verydifficult but the results are worth the effort. In order to develop a true proactive maintenanceprocess a company must have commitment from senior management to floor level personnel andthe discipline to follow a known “best practices” which have been proven and work. What is Reliability?Most maintenance professionals are intimidated by the word “reliability”. Why? Most people simplyassociate reliability with RCM (Reliability Centered Maintenance) and are unclear on what itactually means. The definition is simple:9
  10. 10. Rules of Thumb in Reliability and Maintenance Engineering • Reliability is the ability of an item to perform a required function under a stated set of conditions for a stated period of time.The definition of reliability is not at all intimidating, is it? Many companies focus on fixing equipmentwhen it has already failed – not on ensuring reliability, and avoiding failure.A common reason for this finding is that there is no time available to investigate is the truerequirements to ensure the reliability of equipment. Yet, there is a growing awareness amongthese reactive maintenance organizations of the consequences of poor equipment performance.These consequences include: • Higher maintenance costs • Increasing equipment failures • Asset availability problems, and • Safety and environmental impactsCompanies who operate in a reactive maintenance mode need to face the facts - there is NO’silver bullet’ to solve the complex problem of poor equipment performance. Upper levelmanagement has traditionally viewed Lean Manufacturing or World Class Manufacturing as theanswer. Yet these strategies don’t directly address the true target of optimal asset reliability. Forgetthe “silver bullet” and focus on asset reliability. The results will follow.Companies Who Get ItLet’s call this company XYZ Corporation. This corporation was fighting an up hill battle to survivewith foreign competition, aging workshop, and many other issues. Their CEO (Chief ExecutiveOfficer) decided reliability would be their focus because maintenance is the largest controllablecost in an organization and without sound reliability of their assets losses multiply becausereliability affects cost in so many areas. This corporation established a dedicated team of over 50key employees, and over a two year period, they researched the world’s best maintenanceorganizations. This team assimilated all “best practices” they found in the world and implementedthem in a disciplined, structured environment. They found that focusing on reliability had thebiggest return with the longest lasting results. Today this corporation is one of the top producers intheir industry world wide.Corporations like XYZ Corporation who truly understand reliability typically have the bestperforming plants. What are some of the common characteristics of a “reliability-focused”organization? They take a holistic approach to asset management, focusing on people and culture.Common characteristics found are: • Their goal is optimal asset health at an optimal cost. • They focus on processes – what people are doing to achieve results. • They measure the effectiveness of each step in the process, in addition to measuring results. • Preventive maintenance programs focus mainly on monitoring and managing asset health. • Preventive maintenance programs are technically sound, with each task linked to a specific failure mode - formal practices and tools are used to identify the work required to ensure reliability.10
  11. 11. Rules of Thumb in Reliability and Maintenance Engineering toward proactive workMany companies focus their entire maintenance efforts on a PM program that has little to do withmeeting the actual reliability needs of the equipment. When these companies are asked why aparticular PM task is done, you will typically hear: “This is the way we’ve always done it” .Many companies attempt to use statistical analysis to improve reliability. Statistical analysistechniques such as Weibull Analysis are just used help to identify assets where reliability is aproblem. Do you really need to spend your valuable engineering resources to figure out that yourMTBF is too small? Statistical analysis is really most helpful in setting PM frequencies for time-based PM’s, which should account for a very small portion of your PM’s. Besides, it’s usually veryeasy to point to the pieces of equipment that are bad actors. Forget the statistical analysis – ratherthan measuring failure frequency, figure out how to improve reliability.The factsHere are some sobering facts that will make you think twice about the effectiveness of a time-based PM program: 2. Less than 20% of asset failures are age related. So for the 20% of failures that are age related how can you identify the frequency of their Preventive Maintenance activities? Ask yourself this question: Do I have good data to determine this frequency? If your answer is yes, then this would mean most asset failures have been correctly documented and coded in the CMMS/EAM. My findings are 98% of companies do not have good failure history data. 2. Most reliability studies show that over 80% of asset failures are random. So we ask ourselves: How do you prevent random failure? In many cases, it is possible to detect early signs of random failure by monitoring the right health indicators for the asset to determine where the asset is on the degradation curve. In simple terms, how much has it degraded and how long will it be before I lose the intended function of the asset? This approach allows time to take the corrective action, in a scheduled and proactive manner – before the functional failure occurs!Let’s take this statement a step further. Preventive Maintenance for random failures must usuallyfocus on the health of the asset (through monitoring indicators such as temperature, tolerance ,vibration, etc.) in order to determine where an asset is on the degradation or PF Curve (figure1.1.5). Point “P” is the first point at which we can detect degradation. Point “F”, the new definitionof failure, is the point at which the asset fails to perform at the required functional level. In the past,we defined “Failure” as the point at which the equipment broke down. You can see points P and Fand the two different definitions of failure in the graphic below.11
  12. 12. Rules of Thumb in Reliability and Maintenance Engineering Today’s definition Failure starts here of failure Potential Equipment not Failure performing intended function Functionally failed P-F Interval P Conditional F Old definition probability of failure of failure Equipment broken Time Figure 1.1.5. PF CurveThe amount of time that elapses between the detection of a potential failure (P) and itsdeterioration to functional failure (F) is known as the P-F interval. A maintenance organizationneeds to know the PF Curve on critical equipment in order to maintain reliability at the levelrequired to meet the needs of the plant. Without this knowledge how can one truly understand howto manage the reliability of the asset?The BarriersLet’s first look at a few barriers first which prevent a plant from obtaining a higher level of reliabilityof their assets 1. Most maintenance departments and production only understand that a failure means the equipment is broken. A true failure of an asset is when it is no one longer meets the function required of it at some known rate of standard.  Example: Conveyor is supposed to operate at 200 meters per minute so when the conveyor’s speed is no longer meeting this requirement it has functionally failed thus causing an immediate loss of revenue for the company. 2. Maintenance does not get involved when quality or production rate issues arise in the plant. In most cases when an asset has functionally failed in a plant no one in maintenance seems to understand the equipment has failed. 3. Most maintenance departments do not know the performance targets of the plant equipment and do not understand why it is important that they understand them. This not a failure of the maintenance department but a breakdown of how a total is not aligned to meet the goals of it.Overcoming all three of these barriers is essential to rapid performance in reliability. If anunderstanding and focus on functional failure is applied by all plant personnel rapid results willfollow resulting in higher asset reliability. The focus must be on the alignment of the total plant onmeeting performance targets of each asset. These performance targets and current performancerates need to be posted so everyone is aware if a gap occurs in asset performance. Productionand maintenance know that when an asset has functionally failed (no longer meeting theperformance target) and is probably resulting in lost revenue. We must understand this is a12
  13. 13. Rules of Thumb in Reliability and Maintenance Engineering and management problem and both organizations must accept responsibilities foractions to mitigate the performance losses.One requirement a company must meet in order to have a rapid breakthrough in performance isthey must define what a failure is truly:Old definition of failure (typically used in reactive companies): The equipment is broke orstopped. A good example is the conveyor stopped because of mechanical problemNew definition of failure (typically used in proactive companies): The equipment is no longerperforming the function required of its user. Examples would be: Partial Functional Failure Example: A conveyor is supposed to operate at 200 metersper minute however because of a problem it can only run 160 meters per minute. Total Functional Failure Example: A conveyor has stopped based on a mechanicalproblem. The function of the example conveyor is: 1. To transfer a product from point A to point B 2. To transport product at a speed of 200 meters per minute from point A to Point BExample of a new way to view failureAfter all we have reviewed thus far my question to you is, “In your plant is there any equipmentoperating below defined performance targets and when it does is maintenance engagedimmediately?” You could have what is called “the hidden plant” and thus by focusing on theequipment performance targets reliability could be increased rapidly of your assets.Have you ever heard the saying “it is what you don’t know that kills you”? This statement is true inreliability. Follow my advice and see rapid breakthrough in plant performance and you must knowthis is just the beginning of a long journey. Do sit back and be satisfied as the reliability results yougained by following my advice. A plant must now apply RCM (Reliability Centered Maintenance)methodology to meet the goal of “optimal reliability at optimal cost”. I did not say use RCM. I statedRCM Methodology which could be RCM II, Streamlined RCM, FMEA or MTA. Be careful whichmethodology you use if you want rapid performance.The Bottom LineA company must take a step back and review the way it manages equipment performance. Ifequipment continues to fail after performing preventive maintenance or overhauls, then clearly achange is needed. The focus must be on ensuring the reliability of plant assets. As a starting point,everyone in a plant should understand the definition of reliability and what it means to the successof the company. Make “Reliability” your plant’s collective buzzword.13