SlideShare a Scribd company logo
1 of 44
Download to read offline
Reliability Growth 
                Planning: Its Concept,
                  Applications, and 
                      Challenges
                                   Tongdan Jin
                           Assistant Prof. of Industrial Engineering
                                Ingram School of Engineering
                             Texas State University‐San Marcos
                           ©2011 ASQ & Presentation Tongdan Jin
                              Presented live on Nov 11th, 2010

http://reliabilitycalendar.org/The_Reli
ability_Calendar/Webinars_‐
_English/Webinars_‐_English.html
ASQ Reliability Division 
                  English Webinar Series
                   One of the monthly webinars 
                     on topics of interest to 
                       reliability engineers.
                     To view recorded webinar (available to ASQ Reliability 
                         Division members only) visit asq.org/reliability

                      To sign up for the free and available to anyone live 
                     webinars visit reliabilitycalendar.org and select English 
                     Webinars to find links to register for upcoming events


http://reliabilitycalendar.org/The_Reli
ability_Calendar/Webinars_‐
_English/Webinars_‐_English.html
1




Reliability Growth Planning: Its Concept,
      Applications, and Challenges


                     Tongdan Jin

       Assistant Prof. of Industrial Engineering
            Ingram School of Engineering
         Texas State University-San Marcos

                 November 11, 2010
2


                 Contents

•  RGT vs. RGP

•  Design for Reliability

•  New Reliability Monitoring Metrics

•  Reliability Growth under Budget Constraints

•  Conclusion
3

                        RGT vs. RGP
                          Product Life Cycle

Design and        Prototype and        Volume Production, Field Use
Development        Pilot Phase               and End of Life


Reliability Growth Testing (RGT)



                 Reliability Growth Planning (RGP)
4

               Why Need RGP?

  •  Design Cycle Shrinks

  •  Cut-off of Testing Budget

  •  Different Design/Development Schedule

                                                                   Adv. subsys 6

                                                   Adv. subsys 5

                                          Adv. subsys 4
                                  Basic subsys 3
                                  Basic subsys 2
                             t0   Basic subsys 1 t1         t2     t3              t4   time


                                  Basic design        Volume manufacturing and shipping

Automatic Test Equipment           Figure 3 Compressed System Design Cycle
5

              System Reliability vs. Shipment

              Target MTBF




                                                Field System Populations
System MTBF




                                       MTBF
                System Installs




                        Chronological Time
6
Reliability Growth Planning Across Lifecycle Time

                  hardware                         CA effectiveness
     design
                                                                      optimization
software          Design                              Driving
                    for                              Reliability
   mfg           Reliability                          Growth
                                                                        budget
       NFF
                                                      failure mode
                   process
                                                          pareto




Note: mfg=manufacturing, NFF=no fault found, CA=corrective action
7




    Topic One:
Design for Reliability
•  Component/Hardware Failures

•  Non-Component Failures
  ! Design weakness
  ! Software failures
  ! Manufacturing defects
  ! Process/handling issues
  ! No-fault-found (NFF)
8
      System Failure Mode Categories

       Failures Breakdown by Root-Cause Catagory
50%
       A
40%            C
30%                  D
           B
20%

10%

0%
                   (components)
        Hardware




                                                                      NFF
                                  Design




                                                           Software
                                           Mfg


                                                 Process
9

Different MTBF Scenarios




   Target MTBF




                 Time
10

   Modeling Hardware Failure Rate

             λ = λ0π T π Eπ Qπ FT π R
             λ0   = base failure rate.
             πT   = temperature factor.
             πE   = electrical stress factor.
             πQ   = quality factor.
             π FT = fault tolerance factor.
             π R = redundancy factor.


For a given design, π T , π E play essential roles in
the actual component reliability.
11
  Aggregate Failure Rate for Hardware
                 k                  k
         λhw = ∑ ni λi = ∑ ni λ0iπ Tiπ Ei
                i =1               i =1                                ASIC Temperature Distribution
                                                          14                                                      0.08
                                                          12                                        histogram     0.07
                                                          10                                        pdf           0.06
                        k                                                                                         0.05

        E[λhw ] = ∑ ni λ0i E[π Ti ]E[π Ei ]




                                               Quantity
                                                          8
                                                                                                                  0.04




                                                                                                                         pdf
                                                          6
                                                                                                                  0.03
                                                          4
                       i =1                               2
                                                                                                                  0.02
                                                                                                                  0.01
                                                          0                                                       0
                                                               <65 [65, 70)[70, 75)[75, 80)[80, 85)[85, 90) >90
                             k
        var(λhw ) = ∑ ni2λ2i var(π Ti π Ei )
                                                                             Degree in Celsius

                          0
                            i =1

Where
    k = number of types of devices used in the product.
    ni = quantity of ith type of device used in the product.
    !0i = base failure rate for ith type of device.
12
Challenges in Modeling Non-Hardware Failures


 1.  Quite often data is not well recorded

 2.  Varies from one product line to another

 3.  Process related

 4.  Design experience

 5.  Other random factors
13
   Triangle Models for Non-Hardware Failures

                                                    ⎧       2
 h
                                                    ⎪ (c − a)(b − a) (λ − a) a ≤ λ ≤ c
                                                    ⎪
                                                    ⎪       2
                                           g (λ ) = ⎨                (λ − b ) c < λ ≤ b
g(λ)




                                                    ⎪ (c − b)(b − a)
                                                    ⎪       0           otherwise
                                       λ            ⎪
        a                c     b                    ⎩


       Where:

            a = the smallest possible value of the failure rate
            b = the largest possible value of the failure rate
            c = the most likely value, and c=3λ -b-a
            λ = is the sample mean for the dataset
14
Example for Non-Hardware Failure Estimate

 Example:
 Based on historical data of predecessor products, it
 shows failure rates pertaining to manufacturing issues
 are (faults/hour):

 1.2!10-6, 1.4!10-6 and 2.4 !10-6.

 Then :

 λ = (1.2!10-6+1.4!10-6 +2.3 !10-6)/3=1.6!10-6
 a = 1.2!10-6
 b = 2.4 !10-6
 c = 1.3!10-6
15
Combining HW and Non-HW Failure Rate

                                          k
    λsys = λd + λs + λm + λ p + λo + ∑ ni λi
                                         i =1


   Where:
   !d = failure rate of design weakness
   !s = failure rate of software
   !m = failure rate of manufacturing
   !p = failure rate of process
   !o = failure rate of other issues (e.g. NFF)
   k= total number of HW component types
   !i = failure rates for component type i
Confidence Intervals for Failure Rate
                                                             16




                                              k
     λsys = λd + λs + λm + λ p + λo + ∑ ni λi
                                             i =1
                                                    k
    σ λ = σ λ + σ λ + σ λ + σ λ + σ λ + ∑ ni2σ λ
      2
      sys
                2
                 d
                        2
                        s
                            2
                            m
                                   2
                                   p
                                        2
                                         o
                                               2
                                                         i
                                                  i =1




            − 2σ λsys       λsys       2σ λsys
17
Application to Reliability Design (cnt d)

     µsys = E[λHW ] + E[λnon− HW ] = 1.13 ×10 −5

     σ sys = var(λHW ) + var(λnon−HW ) = 2.23 ×10 −11
       2




                                                 0.3%


                                                    !
                           µsys    2.43 ×10 −5
18
                    MTBF with 99.7% Confidence
                                            The mean of PCB failure rate
                Pr{MTBF ≥ t} ≥ 99.7%        is 1.13!10-5 faults/hours
          1
λSYS =
         MTBF
                            1
                 Pr{λsys   ≤ } ≥ 99.7%
                            t
                                                MTBF=1/(1.13!10-5 )
                                                    =88,100 hours
                MTBF(99.7%) =41,115 hours



    MTBF Estimate with Confidence           Neutral MTBF Estimate
19




   Topic Two:

Failure Mode Rate
        &
 Failure-In-Time
20

                Pareto Chart for Failure Modes
           Pareto by Failure Mode From January to March
      14                                                                      100%
      12
      10
       8
                                                      No C/A
                                                      C/A In Process
                                                                              80%

                                                                              60%
                                                                                     Difficulties:
                                                      C/A Complete
                                                                                     •    Static View
Qty




       6                                              Percentage              40%
       4
                                                                              20%
       2
       0                                                                      0%     •    No Trend of Each
                                 No Fault
            Relays




                                            Solder
                     Resistors




                                                        Software



                                                                    Op-Amp
                                                                                          Failure Mode
                                  Found

                                             Cold



                                                          Bug


           Pareto Chart by Failure Mode From April to June
                                                                              100%
                                                                                     •    Fail to Reflect
      28
      24                                              No C/A                  80%
                                                                                          Product MTBF
      20                                              C/A In Process
                                                                              60%
      16                                              C/A Complete
Qty




      12                                              Percentage              40%    Note: C/A= corrective action
       8
                                                                              20%
       4
       0                                                                      0%
                                                                   No Fault
                                             Relays
                                 Solder
                     Resistors




                                                        software
            Op-Amp




                                  Cold




                                                                    Found
                                                          bug
21

    Failure Mode Rate (FMR)



      failures for a type of FM
FMR =
      field product installations
22

        FMR Estimation: Example

      failure quantity for a type of FM
FMR =
           field product installation

For example:
Assuming 120 PCBs were shipped and installed
in the field in the first quarter, 5 failures returned
due poor solder joints, then the FMR for poor
solder joints in the first quarter is

        5
 FMR =     = 0.042        faults / board / quarter
       120
23
                                       FMR Run Chart

                               Failure Mode Rate (FMR) by Quarter
                     0.06                                              400
                            relays            Op-amp        Product
                                                                       350
                     0.05                                   shipment




                                                                             Cumulative PCB Shipment
                                                                       300
Failures Per Board




                     0.04
                                                                       250
                     0.03                                              200
                            resistor
                                                                       150
                     0.02
                                                                       100
                     0.01
                                                                       50
                     0.00                                              0
                            1st Qtr     2nd Qtr   3rd Qtr    4th Qtr
24
                   Estimate MTBF using FMR Chart
                     Quarters                  1st        2nd Qtr     3rd         4th
               Cumulative Shipment             120         200        220        264
                 Cum Run Hours              262,080      436,800     480,480 576,576
                  Cum FM rate                 0.117       0.150       0.057      0.051
                Defective Boards               14           30         12         13
                 MTBF (hours)                18720        14560      38541       42856
                                          13 Weeks Rolling MTBF
                 45,000                                                            100

                 40,000              Defective Boards                              90

                 35,000              MTBF                                          80
                                                                                   70
MTBF (hours)




                 30,000




                                                                                         Failures
                                                                                   60
                 25,000
                                                                                   50
                 20,000
                                                                                   40
                 15,000
                                                                                   30
                 10,000                                                            20
                  5,000                                                            10
                     0                                                             0
                           1st Qtr           2nd Qtr       3rd Qtr     4th Qtr
25
Estimate for PCB Failure Rate
                                       k
  λsys = λd + λs + λm + λ p + λo + ∑ ni λi
                                      i =1

  Notice FIT = λ ×10 9
                                                        k
  FITsys = FITd + FITs + FITm + FITp + FITo + ∑ FITi
                                                       i =1


Where
!d = failure rate of design errors
!s = failure rate of software bugs
!m = failure rate of manufacturing
!p = failure rate of process
!o = failure rate of other issues
!i = failure rates for component type i
k= total number of new component types
ni= quantity of component type i used in the product
26
FIT-Based Reliability Driven: Example (1)

    FM Category          Target MTBF (hrs)   Target FIT
   Overall Product            50,000          20,000
Components (hardware)         117,647          8,500
    Others (NFF)              250,000          4,000
       Design                 333,333          3,000
    Manufacturing             500,000          2,000
       Process                666,667          1,500
      Software               1,000,000         1,000
                                   109
                     Notice FIT =
                                  MTBF
27
FIT-Based Reliability Driven: Example (2)
Product Target    Categorical FM FIT        Failure Mode      Target FIT   Current FIT   Ownership
     FIT
                                                Relay           2,000         2,491        Tom
                                              Op-Amp            3,000         4,097        Jones
                  Component (8,500)           Resistor          1,500         2,786       Carlos
                                          DC-DC converter        800          1,393       Jesson
                                               ASICs            1,200         1,716        Jim
                                          Eng Change Order      1,300         2,383        David
                                         FPGA Rev Upgrade        900          1,643        Kim
                     Design (3,000)       Change relay type      800          1,498        John
                                             cold Solder        1,600         3,092        Tony
 PCB (20,000)
                                         backward component      250           355          Joe
                 Manufacturing (2,000)    Faked component        150           255         Paul
                    Process (1,500)          broken part         700           942          Jen
                                            Missing part         300           447         Chris
                                                OES              500           515        Andrew
                   Software (1,000)          Sever bugs          200           398        Eileen
                                            Medium bugs          400           665          Ed
                                             Trivial bugs        400           497         Eric
                    Others (4,000)              NFF             3,000          457         Mark
                                               PCFD             1,000         1,669        Jeff
28




       Topic Three:

Reliability Growth Prediction
             &
  Corrective Action under
  Budget/Cost Constraints
29

        Crow/AMSAA Growth Model

Failure Intensity: λ = αβ
                       ˆ ˆt βˆ −1
                                                                                 Various Failure Intensity Models

      ˆ
Where β =         N                 N                               6
                             α=
                             ˆ
                  ⎛ ts   ⎞             ˆ
                                    t sβ
                                                                    5            beta 1           !=1 for all
              N
             ∑ ln⎜       ⎟




                                                Failure Intensity
                                                                    4            beta 0.5
                  ⎜t     ⎟                                                       beta 1.5
             i =1
                  ⎝ i    ⎠                                          3

                                                                    2

Hypothesis Testing:                                                 1

                                                                    0
              H0: β=1, HPP                                              0        1          2          3        4   5
                                                                                                Time
              H1: β!1, NHPP

              2N                                                            2N
Reject H0        < χ 2 N ,1−θ / 2
                     2
                                           or                                  > χ 2 N ,θ / 2
                                                                                   2

               ˆ
               β                                                             ˆ
                                                                             β

ts=termination time, ti=ith failure arrival time
30

                         An Example

Cumulative Failure Arrival Interarrival
                                          ln(ts/ti)
 Failures   Time (hours) Time (hours)
                                                      N=10
    1            67             67         3.23
    2           150             83         2.43                 N
    3           234             84         1.98       ˆ
                                                      β=               = 0.797
    4           360            126         1.55
                                                            N    ⎛t ⎞
                                                               ln⎜ s ⎟
                                                           ∑ ⎜ ⎟
    5           533            173         1.16            i =1 ⎝ ti ⎠
    6           720            187         0.86
    7           912            192         0.62
    8           1102           190         0.43            N
    9           1345           243         0.23
                                                      α=
                                                      ˆ     ˆ
                                                            β
                                                                = 0.0266
    10          1632           287         0.04
                                                           ts
    ts          1700           sum         12.55
31

      Failure Modes (FM) Pareto Chart

                                           Given $10 budget for
                                           corrective actions.
                        Which FM should
                        be fixed? Given    Option one: Fix relays
                        limited budget.    MTBF=4800/(14-2.5)
                                                  =417 hours


                                           Option two: fix all others
                                           MTBF=4800/(14-9)
                                                  =960 hours
Cumulative operating time is 4800 hours,
total failures is 14.
Current MTBF=4800/14=343 hours.
32

New Reliability Growth Model

1.  Failure mode based growth prediction

2.  Reliability growth subject to CA budget
    constraints

3.  No assumption of parametric models

4.  CA effectiveness function
33
Why Need the CA Effectiveness Function?
             Limit Recourses ($)
             Spent on CA due to
                1.  Retrofit
                  2.  ECO


             CA Effectiveness
                Function



                 Maximize
                 Reliability
                  Growth
34
          An Example: ECO or Retrofit

A type of relays used on a PCB module fails constantly due to
a known failure mechanism. Two options available for
corrective actions

1.  Replace all on-board relays upon the failure return of the
    module
2.  Pro-actively recall all modules and replace with new types
    of relays having much higher reliability

     CA Option         Cost ($)       CA Effectiveness
        ECO              Low                 Low
       Retrofit         High                High
35
                              Modeling CA Effectiveness

 h(x)
                                                      Effectiveness Model
                1
effectiveness




                                                                           b
                        b<1                                         ⎛ x⎞
                              b=1                          h( x ) = ⎜ ⎟
                                    b>1                             ⎝c⎠
                                            x
                                                       b and c to be determined
                    0     CA budget ($) c




                                  Failure rate before CA – Failures rate after CA
                    Effectiveness=
                                                 Failure rate before CA
36
                      An Example

The current failure rate a type of relay is 2!10-8 faults per
hour. Upon the implementation of CA, the rate is reduced to
5!10-9.

The CA effectiveness can be expressed as 0.75, that is


                       −8         −9
                2 ×10 − 5 ×10
                                       = 0.75
                    2 ×10 −8
37
Incorporate h(x) into System Failure Rate

         HW                 Non-HW


            k                   m                                   b
λs (t ) = ∑ ni λi (t ) + ∑ λi (t )                           ⎛ x⎞
                                                    h( x ) = ⎜ ⎟
           i =1               i =k +1                        ⎝c⎠



                   k                           m
   λs ,CA (t ) = ∑ ni (1 − hi ( xi ))λi (t ) + ∑ (1 − hi ( xi ))λi (t )
                  i =1                       i = k +1
38

       Making The Prediction via MS Excel (I)

                            Week	
  No.            1    2     3     4     5     6     7     8
Cum	
  Failures	
  
   by	
  FM            Cum	
  Opting	
  Hours     1680 3360 5040 6720 8400 10080 11760 13440
        7                       Replay             2    0    1    0    3     0     1     0
        6                   resistors              1    1    0    1    0     2     0     1
        4                     op-­‐amp             0    0    0    1    0     1     1     1
        5                   capacitor              1    0    0    1    1     0     0     2
        2                 design	
  e rror         0    0    1    0    0     0     1     0
        4                software	
  bugs          1    0    0    0    1     1     1     0
        6                  cold	
  solder          0    2    0    1    0     0     1     2
        2                 bad	
  process           0    0    1    0    0     0     0     1
        4                        NFF               1    2    0    0    0     1     0     0
                      Latent	
  Failure	
  Mode
                      weekly	
  cum	
  failures    6     5     3     4     5     5     5     7
                          Actual	
  MTBF          280   305   360   373   365   360   356   336
39

             Making The Prediction via MS Excel (II)

                                                                                   Week	
  No.            1    8     9     10    11    12    13    14    15    16
 Required	
   Cost	
  for	
  fix	
                     Cum	
  Failures	
  
Budget	
  ($)   FM	
  ($)            Target	
  FM	
  %    by	
  FM            Cum	
  Opting	
  Hours     1680 13440 15120 16800 18480 20160 21840 23520 25200 26880
   150           300                     50%                   7                       Replay             2     0    1.0    0    0.5    0    1.5    0    0.5    0
   500           500                      0%                   6                   resistors              1     1     0     0     0     0     0     0     0     0
   100           200                     50%                   4                     op-­‐amp             0     1     0     0     0    0.5    0    0.5   0.5   0.5
   350           350                      0%                   5                   capacitor              1     2     0     0     0     0     0     0     0     0
   700           700                      0%                   2                 design	
  e rror         0     0     0     0     0     0     0     0     0     0
   125           250                     50%                   4                software	
  bugs          1     0    0.5    0     0     0    0.5   0.5   0.5    0
   100           100                      0%                   6                  cold	
  solder          0     2     0    0.0    0    0.0    0     0    0.0   0.0
    0             50                    100%                   2                 bad	
  process           0     1     0     0    1.0    0     0     0     0    1.0
   225           450                     50%                   4                        NFF               1     0    0.5   1.0    0     0     0    0.5    0     0
                                                                             Latent	
  Failure	
  Mode               0.3   0.2   0.2   0.1   0.3   0.2   0.2   0.2
                                                                             weekly	
  cum	
  failures    6     7     2     1     2     1     2     2     2     2
                                                                                 Actual	
  MTBF          280 336
    2250                                                                       Predicted	
  MTBF               336 318 348 372 404 420 439 457 473
40

       Reliability Growth Planning Process

                               Retrofit
                                Team
                                          Retrofit Loop
                 FRACA
   System                      Repair                In-service
 Manufacturer                  Center                 Systems

       1. Failure analysis                ECO Loop
       2. CA decisions
       3. Reliability prediction           Stocks

ECO=Engineering Change Order
CA=Corrective Actions
41
                     Conclusions
1.  Design for reliability (DFR) should incorporate hardware and
    non-hardware issues along with the variation of the failure
    rates.

2.  Trade-off should be made between the reliability growth and
    the associated availability of CA resources.

3.  The CA effectiveness function links the CA budget with the
    expected failure mode reduction rate.

4.  A reliability database system such as FRACAS is essential for
    performing RGP.
42




       Thanks !
         &
Questions/Comments ?

More Related Content

What's hot

Weibull Distribution
Weibull DistributionWeibull Distribution
Weibull Distribution
Ciarán Nolan
 
Basic reliability models
Basic reliability modelsBasic reliability models
Basic reliability models
Ana Zuliastuti
 

What's hot (20)

Managing system reliability and maintenance under performance based contract ...
Managing system reliability and maintenance under performance based contract ...Managing system reliability and maintenance under performance based contract ...
Managing system reliability and maintenance under performance based contract ...
 
Accelerated life testing
Accelerated life testingAccelerated life testing
Accelerated life testing
 
Weibull Distribution
Weibull DistributionWeibull Distribution
Weibull Distribution
 
2017 Accelerated Testing: ALT, HALT and MEOST
2017   Accelerated Testing: ALT, HALT and MEOST2017   Accelerated Testing: ALT, HALT and MEOST
2017 Accelerated Testing: ALT, HALT and MEOST
 
Predicting product life using reliability analysis methods
Predicting product life using reliability analysis methodsPredicting product life using reliability analysis methods
Predicting product life using reliability analysis methods
 
Basic reliability models
Basic reliability modelsBasic reliability models
Basic reliability models
 
Reliability Training Lesson 1 Basics
Reliability Training Lesson 1   BasicsReliability Training Lesson 1   Basics
Reliability Training Lesson 1 Basics
 
SPICE MODEL of TLP521-1 SAMPLE A in SPICE PARK
SPICE MODEL of TLP521-1 SAMPLE A in SPICE PARKSPICE MODEL of TLP521-1 SAMPLE A in SPICE PARK
SPICE MODEL of TLP521-1 SAMPLE A in SPICE PARK
 
A Proposal for an Alternative to MTBF/MTTF
A Proposal for an Alternative to MTBF/MTTFA Proposal for an Alternative to MTBF/MTTF
A Proposal for an Alternative to MTBF/MTTF
 
Mechanical Reliability Prediction: A Different Approach
Mechanical Reliability Prediction: A Different ApproachMechanical Reliability Prediction: A Different Approach
Mechanical Reliability Prediction: A Different Approach
 
Design for Reliability (DfR) Seminar
Design for Reliability (DfR) SeminarDesign for Reliability (DfR) Seminar
Design for Reliability (DfR) Seminar
 
Seminar presentation on reliability
Seminar presentation on reliabilitySeminar presentation on reliability
Seminar presentation on reliability
 
Weibull analysis introduction
Weibull analysis   introductionWeibull analysis   introduction
Weibull analysis introduction
 
51 check list preposti
51   check list preposti51   check list preposti
51 check list preposti
 
Logistik Dozentenfolien Vorschau
Logistik Dozentenfolien VorschauLogistik Dozentenfolien Vorschau
Logistik Dozentenfolien Vorschau
 
Mapping Automotive SPICE: Achieving Higher Maturity &amp; Capability Levels
Mapping Automotive SPICE: Achieving Higher Maturity &amp; Capability LevelsMapping Automotive SPICE: Achieving Higher Maturity &amp; Capability Levels
Mapping Automotive SPICE: Achieving Higher Maturity &amp; Capability Levels
 
Plan-for-Every-Part (PFEP) - Introduction November 2016
Plan-for-Every-Part (PFEP) - Introduction November 2016Plan-for-Every-Part (PFEP) - Introduction November 2016
Plan-for-Every-Part (PFEP) - Introduction November 2016
 
Effective reliability testing to drive design improvement
Effective reliability testing to drive design improvementEffective reliability testing to drive design improvement
Effective reliability testing to drive design improvement
 
Reliability Tools and Integration Seminar
Reliability Tools and Integration SeminarReliability Tools and Integration Seminar
Reliability Tools and Integration Seminar
 
18 palier cor
18 palier cor18 palier cor
18 palier cor
 

Viewers also liked

5 M Web Ex Run Chart Analysis Slides 04.02.08
5 M Web Ex Run Chart Analysis Slides 04.02.085 M Web Ex Run Chart Analysis Slides 04.02.08
5 M Web Ex Run Chart Analysis Slides 04.02.08
David Velasquez
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering Methods
Wilde Analysis Ltd.
 
care and maintenance of soft contact lenses
 care and maintenance of soft contact lenses care and maintenance of soft contact lenses
care and maintenance of soft contact lenses
Vishakh Nair
 
Care and maintenance of contact lenses
Care and maintenance of contact lensesCare and maintenance of contact lenses
Care and maintenance of contact lenses
Manoj Aryal
 
Soft Contact Lens Fitting
Soft Contact Lens FittingSoft Contact Lens Fitting
Soft Contact Lens Fitting
Vishakh Nair
 

Viewers also liked (15)

Successful Career in Reliability Engineering
Successful Career in Reliability EngineeringSuccessful Career in Reliability Engineering
Successful Career in Reliability Engineering
 
Reliability Maintenance Engineering 1 - 2 Max Benefits
Reliability Maintenance Engineering 1 - 2 Max BenefitsReliability Maintenance Engineering 1 - 2 Max Benefits
Reliability Maintenance Engineering 1 - 2 Max Benefits
 
5 M Web Ex Run Chart Analysis Slides 04.02.08
5 M Web Ex Run Chart Analysis Slides 04.02.085 M Web Ex Run Chart Analysis Slides 04.02.08
5 M Web Ex Run Chart Analysis Slides 04.02.08
 
RGP lens care and maintenance
RGP  lens care and maintenanceRGP  lens care and maintenance
RGP lens care and maintenance
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering Methods
 
care and maintenance of soft contact lenses
 care and maintenance of soft contact lenses care and maintenance of soft contact lenses
care and maintenance of soft contact lenses
 
Care and maintenance of contact lenses
Care and maintenance of contact lensesCare and maintenance of contact lenses
Care and maintenance of contact lenses
 
Contact lenses
Contact lensesContact lenses
Contact lenses
 
Care and maintenance of soft contact lenses
Care and maintenance of soft contact lensesCare and maintenance of soft contact lenses
Care and maintenance of soft contact lenses
 
Spherical RGP contact lens fitting and prescribing
Spherical RGP contact lens fitting and prescribingSpherical RGP contact lens fitting and prescribing
Spherical RGP contact lens fitting and prescribing
 
Fundamentals of reliability engineering and applications part1of3
Fundamentals of reliability engineering and applications part1of3Fundamentals of reliability engineering and applications part1of3
Fundamentals of reliability engineering and applications part1of3
 
Reliability centered maintenance
Reliability centered maintenanceReliability centered maintenance
Reliability centered maintenance
 
fitting RGP lenses
fitting RGP lensesfitting RGP lenses
fitting RGP lenses
 
Soft Contact Lens Fitting
Soft Contact Lens FittingSoft Contact Lens Fitting
Soft Contact Lens Fitting
 
Contact Lenses
Contact LensesContact Lenses
Contact Lenses
 

Similar to Reliability Growth Planning: Its Concept, Applications, and Challenges

Software Architecture: Test Case Writing
Software Architecture: Test Case WritingSoftware Architecture: Test Case Writing
Software Architecture: Test Case Writing
Sitdhibong Laokok
 
Pulse Design & Delivery Panel
Pulse Design & Delivery PanelPulse Design & Delivery Panel
Pulse Design & Delivery Panel
Mauricio Godoy
 
Walley.tina
Walley.tinaWalley.tina
Walley.tina
NASAPMC
 
Robust design and reliability engineering synergy webinar 2013 04 10
Robust design and reliability engineering synergy webinar   2013 04 10Robust design and reliability engineering synergy webinar   2013 04 10
Robust design and reliability engineering synergy webinar 2013 04 10
ASQ Reliability Division
 
06 operations and feedback dap-kabel
06   operations and feedback dap-kabel06   operations and feedback dap-kabel
06 operations and feedback dap-kabel
David Alvarez Palomo
 
BA conf presentation 2010
BA conf presentation 2010BA conf presentation 2010
BA conf presentation 2010
Julen Mohanty
 
Web App Testing - A Practical Approach
Web App Testing - A Practical ApproachWeb App Testing - A Practical Approach
Web App Testing - A Practical Approach
Walter Mamed
 
Care presentatie oktober 2011
Care presentatie oktober 2011Care presentatie oktober 2011
Care presentatie oktober 2011
meijerandre
 
Care Presentatie Oktober 2011
Care Presentatie Oktober 2011Care Presentatie Oktober 2011
Care Presentatie Oktober 2011
meijerandre
 
Bilbro james
Bilbro jamesBilbro james
Bilbro james
NASAPMC
 

Similar to Reliability Growth Planning: Its Concept, Applications, and Challenges (20)

Software Architecture: Test Case Writing
Software Architecture: Test Case WritingSoftware Architecture: Test Case Writing
Software Architecture: Test Case Writing
 
PLM - ERP integration
PLM - ERP integrationPLM - ERP integration
PLM - ERP integration
 
Building products - A Nifty Approach
Building products - A Nifty ApproachBuilding products - A Nifty Approach
Building products - A Nifty Approach
 
Feasible
FeasibleFeasible
Feasible
 
Pulse Design & Delivery Panel
Pulse Design & Delivery PanelPulse Design & Delivery Panel
Pulse Design & Delivery Panel
 
Walley.tina
Walley.tinaWalley.tina
Walley.tina
 
Robust design and reliability engineering synergy webinar 2013 04 10
Robust design and reliability engineering synergy webinar   2013 04 10Robust design and reliability engineering synergy webinar   2013 04 10
Robust design and reliability engineering synergy webinar 2013 04 10
 
Software test automation_overview
Software test automation_overviewSoftware test automation_overview
Software test automation_overview
 
06 operations and feedback dap-kabel
06   operations and feedback dap-kabel06   operations and feedback dap-kabel
06 operations and feedback dap-kabel
 
Bibhas automation testing
Bibhas automation testingBibhas automation testing
Bibhas automation testing
 
BA conf presentation 2010
BA conf presentation 2010BA conf presentation 2010
BA conf presentation 2010
 
Software Measurement for Lean Application Management
Software Measurement for Lean Application ManagementSoftware Measurement for Lean Application Management
Software Measurement for Lean Application Management
 
Web App Testing - A Practical Approach
Web App Testing - A Practical ApproachWeb App Testing - A Practical Approach
Web App Testing - A Practical Approach
 
Care presentatie oktober 2011
Care presentatie oktober 2011Care presentatie oktober 2011
Care presentatie oktober 2011
 
Care Presentatie Oktober 2011
Care Presentatie Oktober 2011Care Presentatie Oktober 2011
Care Presentatie Oktober 2011
 
Qualifying a high performance memory subsysten for Functional Safety
Qualifying a high performance memory subsysten for Functional SafetyQualifying a high performance memory subsysten for Functional Safety
Qualifying a high performance memory subsysten for Functional Safety
 
Postdoc Symposium - Abram Hindle
Postdoc Symposium - Abram HindlePostdoc Symposium - Abram Hindle
Postdoc Symposium - Abram Hindle
 
Software enginnering unit 01 by manoj kumar soni
Software enginnering unit 01 by manoj kumar soniSoftware enginnering unit 01 by manoj kumar soni
Software enginnering unit 01 by manoj kumar soni
 
Shanghai Automotive - Application of Process Automation and Optimisation
Shanghai Automotive - Application of Process Automation and OptimisationShanghai Automotive - Application of Process Automation and Optimisation
Shanghai Automotive - Application of Process Automation and Optimisation
 
Bilbro james
Bilbro jamesBilbro james
Bilbro james
 

More from ASQ Reliability Division

Root Cause Analysis: Think Again! - by Kevin Stewart
Root Cause Analysis: Think Again! - by Kevin StewartRoot Cause Analysis: Think Again! - by Kevin Stewart
Root Cause Analysis: Think Again! - by Kevin Stewart
ASQ Reliability Division
 
Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...
Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...
Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...
ASQ Reliability Division
 
Efficient Reliability Demonstration Tests - by Guangbin Yang
Efficient Reliability Demonstration Tests - by Guangbin YangEfficient Reliability Demonstration Tests - by Guangbin Yang
Efficient Reliability Demonstration Tests - by Guangbin Yang
ASQ Reliability Division
 
Reliability Modeling Using Degradation Data - by Harry Guo
Reliability Modeling Using Degradation Data - by Harry GuoReliability Modeling Using Degradation Data - by Harry Guo
Reliability Modeling Using Degradation Data - by Harry Guo
ASQ Reliability Division
 
Reliability Division Webinar Series - Innovation: Quality for Tomorrow
Reliability Division Webinar Series -  Innovation: Quality for TomorrowReliability Division Webinar Series -  Innovation: Quality for Tomorrow
Reliability Division Webinar Series - Innovation: Quality for Tomorrow
ASQ Reliability Division
 

More from ASQ Reliability Division (20)

On Duty Cycle Concept in Reliability
On Duty Cycle Concept in ReliabilityOn Duty Cycle Concept in Reliability
On Duty Cycle Concept in Reliability
 
Thermodynamic Reliability
Thermodynamic  ReliabilityThermodynamic  Reliability
Thermodynamic Reliability
 
Root Cause Analysis: Think Again! - by Kevin Stewart
Root Cause Analysis: Think Again! - by Kevin StewartRoot Cause Analysis: Think Again! - by Kevin Stewart
Root Cause Analysis: Think Again! - by Kevin Stewart
 
Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...
Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...
Dynamic vs. Traditional Probabilistic Risk Assessment Methodologies - by Huai...
 
Efficient Reliability Demonstration Tests - by Guangbin Yang
Efficient Reliability Demonstration Tests - by Guangbin YangEfficient Reliability Demonstration Tests - by Guangbin Yang
Efficient Reliability Demonstration Tests - by Guangbin Yang
 
Reliability Modeling Using Degradation Data - by Harry Guo
Reliability Modeling Using Degradation Data - by Harry GuoReliability Modeling Using Degradation Data - by Harry Guo
Reliability Modeling Using Degradation Data - by Harry Guo
 
Reliability Division Webinar Series - Innovation: Quality for Tomorrow
Reliability Division Webinar Series -  Innovation: Quality for TomorrowReliability Division Webinar Series -  Innovation: Quality for Tomorrow
Reliability Division Webinar Series - Innovation: Quality for Tomorrow
 
Impact of censored data on reliability analysis
Impact of censored data on reliability analysisImpact of censored data on reliability analysis
Impact of censored data on reliability analysis
 
An introduction to weibull analysis
An introduction to weibull analysisAn introduction to weibull analysis
An introduction to weibull analysis
 
A multi phase decision on reliability growth with latent failure modes
A multi phase decision on reliability growth with latent failure modesA multi phase decision on reliability growth with latent failure modes
A multi phase decision on reliability growth with latent failure modes
 
Reliably Solving Intractable Problems
Reliably Solving Intractable ProblemsReliably Solving Intractable Problems
Reliably Solving Intractable Problems
 
Reliably producing breakthroughs
Reliably producing breakthroughsReliably producing breakthroughs
Reliably producing breakthroughs
 
ASQ RD Webinar: Design for reliability a roadmap for design robustness
ASQ RD Webinar: Design for reliability   a roadmap for design robustnessASQ RD Webinar: Design for reliability   a roadmap for design robustness
ASQ RD Webinar: Design for reliability a roadmap for design robustness
 
ASQ RD Webinar: Improved QFN Reliability Process
ASQ RD Webinar: Improved QFN Reliability Process ASQ RD Webinar: Improved QFN Reliability Process
ASQ RD Webinar: Improved QFN Reliability Process
 
Data Acquisition: A Key Challenge for Quality and Reliability Improvement
Data Acquisition: A Key Challenge for Quality and Reliability ImprovementData Acquisition: A Key Challenge for Quality and Reliability Improvement
Data Acquisition: A Key Challenge for Quality and Reliability Improvement
 
A Novel View of Applying FMECA to Software Engineering
A Novel View of Applying FMECA to Software EngineeringA Novel View of Applying FMECA to Software Engineering
A Novel View of Applying FMECA to Software Engineering
 
Astr2013 tutorial by mike silverman of ops a la carte 40 years of halt, wha...
Astr2013 tutorial by mike silverman of ops a la carte   40 years of halt, wha...Astr2013 tutorial by mike silverman of ops a la carte   40 years of halt, wha...
Astr2013 tutorial by mike silverman of ops a la carte 40 years of halt, wha...
 
Comparing Individual Reliability to Population Reliability for Aging Systems
Comparing Individual Reliability to Population Reliability for Aging SystemsComparing Individual Reliability to Population Reliability for Aging Systems
Comparing Individual Reliability to Population Reliability for Aging Systems
 
2013 asq field data analysis & statistical warranty forecasting
2013 asq field data analysis & statistical warranty forecasting2013 asq field data analysis & statistical warranty forecasting
2013 asq field data analysis & statistical warranty forecasting
 
Cost optimized reliability test planning rev 7
Cost optimized reliability test planning rev 7Cost optimized reliability test planning rev 7
Cost optimized reliability test planning rev 7
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Reliability Growth Planning: Its Concept, Applications, and Challenges

  • 1. Reliability Growth  Planning: Its Concept, Applications, and  Challenges Tongdan Jin Assistant Prof. of Industrial Engineering Ingram School of Engineering Texas State University‐San Marcos ©2011 ASQ & Presentation Tongdan Jin Presented live on Nov 11th, 2010 http://reliabilitycalendar.org/The_Reli ability_Calendar/Webinars_‐ _English/Webinars_‐_English.html
  • 2. ASQ Reliability Division  English Webinar Series One of the monthly webinars  on topics of interest to  reliability engineers. To view recorded webinar (available to ASQ Reliability  Division members only) visit asq.org/reliability To sign up for the free and available to anyone live  webinars visit reliabilitycalendar.org and select English  Webinars to find links to register for upcoming events http://reliabilitycalendar.org/The_Reli ability_Calendar/Webinars_‐ _English/Webinars_‐_English.html
  • 3. 1 Reliability Growth Planning: Its Concept, Applications, and Challenges Tongdan Jin Assistant Prof. of Industrial Engineering Ingram School of Engineering Texas State University-San Marcos November 11, 2010
  • 4. 2 Contents •  RGT vs. RGP •  Design for Reliability •  New Reliability Monitoring Metrics •  Reliability Growth under Budget Constraints •  Conclusion
  • 5. 3 RGT vs. RGP Product Life Cycle Design and Prototype and Volume Production, Field Use Development Pilot Phase and End of Life Reliability Growth Testing (RGT) Reliability Growth Planning (RGP)
  • 6. 4 Why Need RGP? •  Design Cycle Shrinks •  Cut-off of Testing Budget •  Different Design/Development Schedule Adv. subsys 6 Adv. subsys 5 Adv. subsys 4 Basic subsys 3 Basic subsys 2 t0 Basic subsys 1 t1 t2 t3 t4 time Basic design Volume manufacturing and shipping Automatic Test Equipment Figure 3 Compressed System Design Cycle
  • 7. 5 System Reliability vs. Shipment Target MTBF Field System Populations System MTBF MTBF System Installs Chronological Time
  • 8. 6 Reliability Growth Planning Across Lifecycle Time hardware CA effectiveness design optimization software Design Driving for Reliability mfg Reliability Growth budget NFF failure mode process pareto Note: mfg=manufacturing, NFF=no fault found, CA=corrective action
  • 9. 7 Topic One: Design for Reliability •  Component/Hardware Failures •  Non-Component Failures ! Design weakness ! Software failures ! Manufacturing defects ! Process/handling issues ! No-fault-found (NFF)
  • 10. 8 System Failure Mode Categories Failures Breakdown by Root-Cause Catagory 50% A 40% C 30% D B 20% 10% 0% (components) Hardware NFF Design Software Mfg Process
  • 11. 9 Different MTBF Scenarios Target MTBF Time
  • 12. 10 Modeling Hardware Failure Rate λ = λ0π T π Eπ Qπ FT π R λ0 = base failure rate. πT = temperature factor. πE = electrical stress factor. πQ = quality factor. π FT = fault tolerance factor. π R = redundancy factor. For a given design, π T , π E play essential roles in the actual component reliability.
  • 13. 11 Aggregate Failure Rate for Hardware k k λhw = ∑ ni λi = ∑ ni λ0iπ Tiπ Ei i =1 i =1 ASIC Temperature Distribution 14 0.08 12 histogram 0.07 10 pdf 0.06 k 0.05 E[λhw ] = ∑ ni λ0i E[π Ti ]E[π Ei ] Quantity 8 0.04 pdf 6 0.03 4 i =1 2 0.02 0.01 0 0 <65 [65, 70)[70, 75)[75, 80)[80, 85)[85, 90) >90 k var(λhw ) = ∑ ni2λ2i var(π Ti π Ei ) Degree in Celsius 0 i =1 Where k = number of types of devices used in the product. ni = quantity of ith type of device used in the product. !0i = base failure rate for ith type of device.
  • 14. 12 Challenges in Modeling Non-Hardware Failures 1.  Quite often data is not well recorded 2.  Varies from one product line to another 3.  Process related 4.  Design experience 5.  Other random factors
  • 15. 13 Triangle Models for Non-Hardware Failures ⎧ 2 h ⎪ (c − a)(b − a) (λ − a) a ≤ λ ≤ c ⎪ ⎪ 2 g (λ ) = ⎨ (λ − b ) c < λ ≤ b g(λ) ⎪ (c − b)(b − a) ⎪ 0 otherwise λ ⎪ a c b ⎩ Where: a = the smallest possible value of the failure rate b = the largest possible value of the failure rate c = the most likely value, and c=3λ -b-a λ = is the sample mean for the dataset
  • 16. 14 Example for Non-Hardware Failure Estimate Example: Based on historical data of predecessor products, it shows failure rates pertaining to manufacturing issues are (faults/hour): 1.2!10-6, 1.4!10-6 and 2.4 !10-6. Then : λ = (1.2!10-6+1.4!10-6 +2.3 !10-6)/3=1.6!10-6 a = 1.2!10-6 b = 2.4 !10-6 c = 1.3!10-6
  • 17. 15 Combining HW and Non-HW Failure Rate k λsys = λd + λs + λm + λ p + λo + ∑ ni λi i =1 Where: !d = failure rate of design weakness !s = failure rate of software !m = failure rate of manufacturing !p = failure rate of process !o = failure rate of other issues (e.g. NFF) k= total number of HW component types !i = failure rates for component type i
  • 18. Confidence Intervals for Failure Rate 16 k λsys = λd + λs + λm + λ p + λo + ∑ ni λi i =1 k σ λ = σ λ + σ λ + σ λ + σ λ + σ λ + ∑ ni2σ λ 2 sys 2 d 2 s 2 m 2 p 2 o 2 i i =1 − 2σ λsys λsys 2σ λsys
  • 19. 17 Application to Reliability Design (cnt d) µsys = E[λHW ] + E[λnon− HW ] = 1.13 ×10 −5 σ sys = var(λHW ) + var(λnon−HW ) = 2.23 ×10 −11 2 0.3% ! µsys 2.43 ×10 −5
  • 20. 18 MTBF with 99.7% Confidence The mean of PCB failure rate Pr{MTBF ≥ t} ≥ 99.7% is 1.13!10-5 faults/hours 1 λSYS = MTBF 1 Pr{λsys ≤ } ≥ 99.7% t MTBF=1/(1.13!10-5 ) =88,100 hours MTBF(99.7%) =41,115 hours MTBF Estimate with Confidence Neutral MTBF Estimate
  • 21. 19 Topic Two: Failure Mode Rate & Failure-In-Time
  • 22. 20 Pareto Chart for Failure Modes Pareto by Failure Mode From January to March 14 100% 12 10 8 No C/A C/A In Process 80% 60% Difficulties: C/A Complete •  Static View Qty 6 Percentage 40% 4 20% 2 0 0% •  No Trend of Each No Fault Relays Solder Resistors Software Op-Amp Failure Mode Found Cold Bug Pareto Chart by Failure Mode From April to June 100% •  Fail to Reflect 28 24 No C/A 80% Product MTBF 20 C/A In Process 60% 16 C/A Complete Qty 12 Percentage 40% Note: C/A= corrective action 8 20% 4 0 0% No Fault Relays Solder Resistors software Op-Amp Cold Found bug
  • 23. 21 Failure Mode Rate (FMR) failures for a type of FM FMR = field product installations
  • 24. 22 FMR Estimation: Example failure quantity for a type of FM FMR = field product installation For example: Assuming 120 PCBs were shipped and installed in the field in the first quarter, 5 failures returned due poor solder joints, then the FMR for poor solder joints in the first quarter is 5 FMR = = 0.042 faults / board / quarter 120
  • 25. 23 FMR Run Chart Failure Mode Rate (FMR) by Quarter 0.06 400 relays Op-amp Product 350 0.05 shipment Cumulative PCB Shipment 300 Failures Per Board 0.04 250 0.03 200 resistor 150 0.02 100 0.01 50 0.00 0 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr
  • 26. 24 Estimate MTBF using FMR Chart Quarters 1st 2nd Qtr 3rd 4th Cumulative Shipment 120 200 220 264 Cum Run Hours 262,080 436,800 480,480 576,576 Cum FM rate 0.117 0.150 0.057 0.051 Defective Boards 14 30 12 13 MTBF (hours) 18720 14560 38541 42856 13 Weeks Rolling MTBF 45,000 100 40,000 Defective Boards 90 35,000 MTBF 80 70 MTBF (hours) 30,000 Failures 60 25,000 50 20,000 40 15,000 30 10,000 20 5,000 10 0 0 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr
  • 27. 25 Estimate for PCB Failure Rate k λsys = λd + λs + λm + λ p + λo + ∑ ni λi i =1 Notice FIT = λ ×10 9 k FITsys = FITd + FITs + FITm + FITp + FITo + ∑ FITi i =1 Where !d = failure rate of design errors !s = failure rate of software bugs !m = failure rate of manufacturing !p = failure rate of process !o = failure rate of other issues !i = failure rates for component type i k= total number of new component types ni= quantity of component type i used in the product
  • 28. 26 FIT-Based Reliability Driven: Example (1) FM Category Target MTBF (hrs) Target FIT Overall Product 50,000 20,000 Components (hardware) 117,647 8,500 Others (NFF) 250,000 4,000 Design 333,333 3,000 Manufacturing 500,000 2,000 Process 666,667 1,500 Software 1,000,000 1,000 109 Notice FIT = MTBF
  • 29. 27 FIT-Based Reliability Driven: Example (2) Product Target Categorical FM FIT Failure Mode Target FIT Current FIT Ownership FIT Relay 2,000 2,491 Tom Op-Amp 3,000 4,097 Jones Component (8,500) Resistor 1,500 2,786 Carlos DC-DC converter 800 1,393 Jesson ASICs 1,200 1,716 Jim Eng Change Order 1,300 2,383 David FPGA Rev Upgrade 900 1,643 Kim Design (3,000) Change relay type 800 1,498 John cold Solder 1,600 3,092 Tony PCB (20,000) backward component 250 355 Joe Manufacturing (2,000) Faked component 150 255 Paul Process (1,500) broken part 700 942 Jen Missing part 300 447 Chris OES 500 515 Andrew Software (1,000) Sever bugs 200 398 Eileen Medium bugs 400 665 Ed Trivial bugs 400 497 Eric Others (4,000) NFF 3,000 457 Mark PCFD 1,000 1,669 Jeff
  • 30. 28 Topic Three: Reliability Growth Prediction & Corrective Action under Budget/Cost Constraints
  • 31. 29 Crow/AMSAA Growth Model Failure Intensity: λ = αβ ˆ ˆt βˆ −1 Various Failure Intensity Models ˆ Where β = N N 6 α= ˆ ⎛ ts ⎞ ˆ t sβ 5 beta 1 !=1 for all N ∑ ln⎜ ⎟ Failure Intensity 4 beta 0.5 ⎜t ⎟ beta 1.5 i =1 ⎝ i ⎠ 3 2 Hypothesis Testing: 1 0 H0: β=1, HPP 0 1 2 3 4 5 Time H1: β!1, NHPP 2N 2N Reject H0 < χ 2 N ,1−θ / 2 2 or > χ 2 N ,θ / 2 2 ˆ β ˆ β ts=termination time, ti=ith failure arrival time
  • 32. 30 An Example Cumulative Failure Arrival Interarrival ln(ts/ti) Failures Time (hours) Time (hours) N=10 1 67 67 3.23 2 150 83 2.43 N 3 234 84 1.98 ˆ β= = 0.797 4 360 126 1.55 N ⎛t ⎞ ln⎜ s ⎟ ∑ ⎜ ⎟ 5 533 173 1.16 i =1 ⎝ ti ⎠ 6 720 187 0.86 7 912 192 0.62 8 1102 190 0.43 N 9 1345 243 0.23 α= ˆ ˆ β = 0.0266 10 1632 287 0.04 ts ts 1700 sum 12.55
  • 33. 31 Failure Modes (FM) Pareto Chart Given $10 budget for corrective actions. Which FM should be fixed? Given Option one: Fix relays limited budget. MTBF=4800/(14-2.5) =417 hours Option two: fix all others MTBF=4800/(14-9) =960 hours Cumulative operating time is 4800 hours, total failures is 14. Current MTBF=4800/14=343 hours.
  • 34. 32 New Reliability Growth Model 1.  Failure mode based growth prediction 2.  Reliability growth subject to CA budget constraints 3.  No assumption of parametric models 4.  CA effectiveness function
  • 35. 33 Why Need the CA Effectiveness Function? Limit Recourses ($) Spent on CA due to 1.  Retrofit 2.  ECO CA Effectiveness Function Maximize Reliability Growth
  • 36. 34 An Example: ECO or Retrofit A type of relays used on a PCB module fails constantly due to a known failure mechanism. Two options available for corrective actions 1.  Replace all on-board relays upon the failure return of the module 2.  Pro-actively recall all modules and replace with new types of relays having much higher reliability CA Option Cost ($) CA Effectiveness ECO Low Low Retrofit High High
  • 37. 35 Modeling CA Effectiveness h(x) Effectiveness Model 1 effectiveness b b<1 ⎛ x⎞ b=1 h( x ) = ⎜ ⎟ b>1 ⎝c⎠ x b and c to be determined 0 CA budget ($) c Failure rate before CA – Failures rate after CA Effectiveness= Failure rate before CA
  • 38. 36 An Example The current failure rate a type of relay is 2!10-8 faults per hour. Upon the implementation of CA, the rate is reduced to 5!10-9. The CA effectiveness can be expressed as 0.75, that is −8 −9 2 ×10 − 5 ×10 = 0.75 2 ×10 −8
  • 39. 37 Incorporate h(x) into System Failure Rate HW Non-HW k m b λs (t ) = ∑ ni λi (t ) + ∑ λi (t ) ⎛ x⎞ h( x ) = ⎜ ⎟ i =1 i =k +1 ⎝c⎠ k m λs ,CA (t ) = ∑ ni (1 − hi ( xi ))λi (t ) + ∑ (1 − hi ( xi ))λi (t ) i =1 i = k +1
  • 40. 38 Making The Prediction via MS Excel (I) Week  No. 1 2 3 4 5 6 7 8 Cum  Failures   by  FM Cum  Opting  Hours 1680 3360 5040 6720 8400 10080 11760 13440 7 Replay 2 0 1 0 3 0 1 0 6 resistors 1 1 0 1 0 2 0 1 4 op-­‐amp 0 0 0 1 0 1 1 1 5 capacitor 1 0 0 1 1 0 0 2 2 design  e rror 0 0 1 0 0 0 1 0 4 software  bugs 1 0 0 0 1 1 1 0 6 cold  solder 0 2 0 1 0 0 1 2 2 bad  process 0 0 1 0 0 0 0 1 4 NFF 1 2 0 0 0 1 0 0 Latent  Failure  Mode weekly  cum  failures 6 5 3 4 5 5 5 7 Actual  MTBF 280 305 360 373 365 360 356 336
  • 41. 39 Making The Prediction via MS Excel (II) Week  No. 1 8 9 10 11 12 13 14 15 16 Required   Cost  for  fix   Cum  Failures   Budget  ($) FM  ($) Target  FM  % by  FM Cum  Opting  Hours 1680 13440 15120 16800 18480 20160 21840 23520 25200 26880 150 300 50% 7 Replay 2 0 1.0 0 0.5 0 1.5 0 0.5 0 500 500 0% 6 resistors 1 1 0 0 0 0 0 0 0 0 100 200 50% 4 op-­‐amp 0 1 0 0 0 0.5 0 0.5 0.5 0.5 350 350 0% 5 capacitor 1 2 0 0 0 0 0 0 0 0 700 700 0% 2 design  e rror 0 0 0 0 0 0 0 0 0 0 125 250 50% 4 software  bugs 1 0 0.5 0 0 0 0.5 0.5 0.5 0 100 100 0% 6 cold  solder 0 2 0 0.0 0 0.0 0 0 0.0 0.0 0 50 100% 2 bad  process 0 1 0 0 1.0 0 0 0 0 1.0 225 450 50% 4 NFF 1 0 0.5 1.0 0 0 0 0.5 0 0 Latent  Failure  Mode 0.3 0.2 0.2 0.1 0.3 0.2 0.2 0.2 weekly  cum  failures 6 7 2 1 2 1 2 2 2 2 Actual  MTBF 280 336 2250 Predicted  MTBF 336 318 348 372 404 420 439 457 473
  • 42. 40 Reliability Growth Planning Process Retrofit Team Retrofit Loop FRACA System Repair In-service Manufacturer Center Systems 1. Failure analysis ECO Loop 2. CA decisions 3. Reliability prediction Stocks ECO=Engineering Change Order CA=Corrective Actions
  • 43. 41 Conclusions 1.  Design for reliability (DFR) should incorporate hardware and non-hardware issues along with the variation of the failure rates. 2.  Trade-off should be made between the reliability growth and the associated availability of CA resources. 3.  The CA effectiveness function links the CA budget with the expected failure mode reduction rate. 4.  A reliability database system such as FRACAS is essential for performing RGP.
  • 44. 42 Thanks ! & Questions/Comments ?