SlideShare a Scribd company logo
1 of 75
Download to read offline
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com© 2004 – 2010
Root-Cause Analysis of
HALT Failures
Cheryl Tulkoff and Greg Caswell
ctulkoff@dfrsolutions.com
gcaswell@dfrsolutions.com
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
“the process of seamlessly
cohesively integrating reliability
tools together to maximize
reliability and at the lowest
possible cost”
Reliability Integration
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Reliability and CostCOST
RELIABILITY
TOTAL
COST
CURVE
RELIABILITY
PROGRAM
COSTS
WARRANTY
COSTS
OPTIMUM
COST
POINT
Use of the proper tools during the proper life cycle phase will
help to minimize total Life Cycle Cost (LCC).
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o To minimize total Life Cycle Costs (LCC), a Reliability
Engineer must do two things:
o choose the best tools from all of the tools available and must
apply these tools at the proper phases of a product life cycle.
o properly integrate these tools together to assure that the
proper information is fed forward and backwards at the
proper times.
o Highly Accelerated Life Testing (HALT) is one reliability
tool that can assist with this process
Reliability vs. Cost, continued
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Highly Accelerated Life Testing (HALT)
o In HALT, a product is introduced to progressively higher stress
levels in order to quickly uncover design weaknesses, thereby
increasing the operating margins of the product, translating to
higher reliability.
Introduction to HALT
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT Testing Overview
o Typically exposes the product to simultaneous
vibration and thermal cycling
o Product is tested in operational mode while the
vibration stress is increased with each thermal
cycle
o Objective of the test is to cause failure of the
product
o Identifies the weakest link which can be then be
improved
o Test duration is typically less than a few weeks
o On its own, this test is not able to predict the life of a
product (acceleration factor is not known)
o However, it is very useful when a product can be
compared side-by-side with a previous generation of
product with known reliability
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Example HALT Test
o Functional testing is performed while the vibration is taking place
o This is important since intermittent opens can be found at this condition
o Vibration starts at 0 Grms and step up by 5 Grms each cycle until failure is
detected
o Failing units shall be analyzed carefully to find root cause failure
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Uncovers flaws typically not found before product
introduction
o Discovers and improves design margins
o Reduces overall development time and cost
o Provides information for developing accelerated
manufacturing screens (HASS)
HALT - Advantages over Traditional Testing
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT versus Traditional Reliability Testing: Comparison
HALT
o Gathers info on product
limitations
o Focus on Design Weaknesses
o 6 DoF (degrees of freedom)
Vibration
o High Thermal Rate of Change
o Loosely Defined - Modified
“On the Fly”
o Not a “Pass/Fail” Test
o Results used as basis for HASS
or ESS (environmental stress
screening)
Traditional
o Simulates Lifetime of use
o Focus on Finding Failures
o Single Axis Vibration
o Moderate Thermal Rate of
Change
o Narrowly Defined - Rigidly
Followed
o “Pass/Fail” Test
o Results typically not used in
ESS
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT
Highly Accelerated
Life Testing
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Value of HALT
o Rapidly discover design issues.
o Evaluate & improve design margins.
o Release mature product at market introduction.
o Reduce development time & cost.
o Evaluate cost reductions made to product.
o Developmental HALT is not really a test you pass or
fail, it is a process tool for the design engineers.
o There are no pre-established limits.
HALT - Highly Accelerated Life Test
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
Start low and step up the
stress, testing the product
during the stressing
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
Gradually increase
stress level until a
failure occurs
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
Analyze
the failure
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
Make
temporary
improvements
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
Increase
stress and
start
process
over
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
HALT, How It Works
Fundamental
Technological
Limit
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Product
Operational
Specs
Stress
Upper
Oper.
Limit
Upper
Destruct
Limit
Lower
Destruct
Limit
Lower
Oper.
Limit
Margin Improvement Process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Product
Operational
Specs
Stress
Upper
Oper.
Limit
Upper
Destruct
Limit
Lower
Destruct
Limit
Lower
Oper.
Limit
Margin Improvement Process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Product
Operational
Specs
Stress
Upper
Oper.
Limit
Upper
Destruct
Limit
Lower
Destruct
Limit
Lower
Oper.
Limit
Margin Improvement Process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Product
Operational
Specs
Stress
Upper
Oper.
Limit
Upper
Destruct
Limit
Lower
Destruct
Limit
Lower
Oper.
Limit
Margin Improvement Process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Product
Operational
Specs
Stress
Upper
Oper.
Limit
Upper
Destruct
Limit
Lower
Destruct
Limit
Lower
Oper.
Limit
Margin Improvement Process
Operating
Margin
Destruct
Margin
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Combined stresses to technology limits
o Step stressing (individual and combined)
o Powered product with monitored tests
o Root cause failure analysis and appropriate corrective
action
HALT Implementation Requirements
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Commonly Used HALT Equipment
o Combined
Temperature/Vibration
Equipment
o Pneumatic Vibration (to
provide the random
vibration) with Wide
Frequency Spectrum
o Fast Thermal Rates of Change
and Wide Thermal Range
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Planning a HALT
o Setting up for a HALT
o Executing a HALT
o Post Testing
Developing a HALT Process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Meet with design engineers to discuss product.
o Determine stresses to apply.
o Determine number of samples available.
o Determine functional tests to run during Dev. HALT.
o Determine what parameters to monitor.
o Determine what constitutes a failure.
o Develop Test Plan
Planning a HALT
It is essential that the product being tested be fully exercised
and monitored throughout HALT for problem detection.
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
For each stress, we use the
Step Stress Approach
Continue until operating & destruct limits
of UUT (unit under test) are found or until test equipment
limits are reached.
Stimuli
Time (hour:minute)
D
C
B
A
0:00 0:10 0:20 0:30
Planning a HALT (cont.)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
OTHER STIMULI:
• Voltage/frequency margining
• Power cycling
• Combined environment (Temp/Vib)
• Rapid transitions up to 60oC/min on the product
VIBRATION HIGH TEMP LOW TEMP
START
INCREMENT
DWELL TIME
END
3-5 G’s
3-5 G’s
10 min* 10 min* 10 min*
5 to 10° C
+20° C +20° C
Destruct Limit or Test Equipment Limitation
STIMULI
5 to 10° C
* In addition to functional test time
Planning a HALT (cont.)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Examples of Failure Inducing Loads
• Temperature Cycling
– Tmax, Tmin, dwell, ramp times
• Sustained Temperature
– T and exposure time
• Humidity
– Controlled, condensation
• Corrosion
– Salt, corrosive gases (Cl2, etc.)
• Power cycling
– Duty cycles, power dissipation
• Electrical Loads
– Voltage, current, current density
– Static and transient
• Electrical Noise
• Mechanical Bending (Static and Cyclic)
– Board-level strain
• Random Vibration
– PSD, exposure time, kurtosis
• Harmonic Vibration
– G and frequency
• Mechanical shock
– G, wave form, # of events
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Design vibration fixture to ensure energy transmission to the
product (different from electrodynamic vibration fixtures).
o Design air ducting to ensure maximum thermal transitions on
the product.
o Tune chamber for product to be tested.
o Apply thermocouples to product to be tested.
o Setup all functional test equipment and cabling.
Setting up for HALT
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Begin with cold step stress and then hot step stress.
o Step in 10 °C increments, as approach “limits” reduce to 5 °C.
o Dwell time minimum of 10 minutes + time to run functional tests to
ensure product is still functional. Start dwell once product reaches
temperature setpoint, begin functional tests after 10 minute dwell.
o Continue until fundamental limit of technology is reached.
o If circuits have thermal safeties, ensure operation & then defeat to
determine actual operating & destruct limits
o Apply additional product stresses during process:
o Power Supplies: Power cycling during cold step stress.
o Input voltage variation. Load variations.
o Frequency variation of clocks.
Executing a HALT (Thermal Step Stress)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Transition temperature as fast as chamber will allow.
o Select temperature range within 5° of the operating limits
found during thermal step stress.
o If product cannot withstand maximum thermal transitions,
decrease transition rate by 10 °C per minute until
operating limit is found.
o Continue series of transitions for a minimum of 10 minutes
(or time it takes to run set of functional tests).
o Apply additional product stresses during process
Executing a HALT (Fast Thermal Transitions)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Understand vibration response of product (i.e. how does
product respond to increases in vibration input).
o Determine Grms increments (usually 3-5 Grms on product).
o Dwell time minimum of 10 minutes + time to run functional
tests to ensure product is still functional. Start dwell once
product reaches vibration setpoint.
o Continue until reach fundamental limit of technology.
o Apply additional product stresses during process.
Executing a HALT (Vibration Step Stress)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Power Spectral Density (measured on product on OVS-2.5HP)
Marker Fcn
[Band Pwr]
Trace: D Strt: 0 Hz
Stop: 3 kHz
Band:5.468 grms
32Hz 12.8kHzAVG: 20
X:288 Hz Y:2.82893 m*
Band:6.548 grms
Band:9.842 grms
X:288 Hz Y:176.027 u*
32Hz 12.8kHzAVG: 20
X:288 Hz Y:3.44536 m*
32Hz 12.8kHzAVG: 20
Band:4.488 grms
A: ROBOT BD
B: ROBOT ARM
X:288 Hz Y:1.19064 m*
Y* = grms^2/Hz100
m*
10
u*
LogMag
Y* = grms^2/Hz100
m*
1
u*
LogMag
C: R MOTOR
D: TABLE
Y* = grms^2/Hz1
*
10
u*
LogMag
Y* = grms^2/Hz1
*
100
n*
LogMag
32Hz 12.8kHzAVG: 20
Date: 03-14-97 Time: 09:46:00 AM
Y-AXIS
Z-AXIS
X-AXIS
Z-AXIS
Vibration Step Stress (cont.)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Power Spectral Density (measured on product on QRS410T)
Vibration Step Stress (cont.)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Develop thermal profile using thermal operating limits,
dwell times and transitions rates used during thermal step
stress & fast thermal transitions.
o Incorporate additional product stresses into profile such as
power cycling.
o The first run through the profile, run a constant vibration
level of approx. 5 Grms. Step in same increments
determined during vibration step stress.
o When reach higher Grms levels (approx. 20 Grms) add
tickle vibration (approx. 5 Grms) to determine if failures
were precipitated at high G level but only detectable at
lower G level.
Executing a HALT (Combined Environment)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Determine root cause of all failures that occurred.
o Meet with design engineers to discuss results of
Developmental HALT and root cause analysis.
o Determine and implement corrective action.
o Perform Verification HALT to ensure problems fixed and
new problems not introduced.
o Periodically evaluate product as it is subjected to
engineering changes.
Post Testing
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Highly Accelerated Life Testing (HALT) is designed to
induce product failure
o Requires effective root-cause analysis
o Determination of failure site, failure mechanism, and root-
cause
o Absence prevents improvement in design margins; greatly
reduces value of HALT process
o DfR Solutions takes a systematic approach to root-
cause analysis
o Proceeding from the least destructive to most destructive until
root-cause(s) are conclusively identified
o Process steps based upon failure information (failure mode,
failure site, failure mechanism)
Failure Analysis (Motivation)
41
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 42
No Analysis? No Improvement!
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com43
Information Gathering
o Crucial first step in any failure analysis effort
o Use of standard nomenclature
o Failure history
o When in HALT did the failure occur?
o Step stress testing, temperature cycling, vibration, combined
o What were the stresses being experienced by the product
before failure?
o Temperature, vibration, combined
o Also electrical stresses
o Is there a specific change in design/bill of
materials/manufacturing that is being assessed?
o Failure mode – the failure behavior experienced by the
observer
o Failure site – the supposed location of the failure
o Failure mechanism – the mechanism initiating the failure (not
root cause)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com44
Failure History
o The stresses experienced before failure provides
strong indication of potential failure site
o Temperature step stress testing tends to drive
component issues
o High temperatures
o Driven by increase in leakage current (diode turn on) or increase
in resistance (time delay)
o Semiconductors, power components
o Cold temperatures
o Primarily liquid electrolytic capacitors
o Cyclic stresses (temperature cycling, vibration) tend to
drive interconnect issues
o Separable (sockets, press-fit connections, etc.)
o Permanent (solder joints)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com45
Non-Destructive Evaluation (NDE)
o Designed to provide maximum information with minimal
risk of damaging or destroying physical evidence
o Evaluation methods
o Electrical Characterization
o Visual Inspection
o Xray Microscopy
o Thermal Imaging
o SQUID Microscopy
o Acoustic Microscopy
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Most critical step in failure analysis (FA) process for
product subjected to HALT
o Especially for failures induced during temperature step stress
testing
o Recoverable failures (operational limits)
o Failure site can only determined through electrical
characterization and interpretation of changes in functional
behavior
o Permanent failures (destruct limits)
o Proceeds with a broader range of possible electrical
characterization techniques
Electrical Characterization
46
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o JTAG (joint task action group) Boundary Scan
o Very accurate identification of failure site; rarely performed on failed units
o Oscilloscope
o Useful in probing operational circuitry
o Resistance measurements (manual)
o Binary approach (best for an electrical open)
o Isolation of attached components
o Attempt to perform electrical characterization without component removal
o Parametric characterization
o Comparison of performance to datasheet specifications
o Curve tracer
o Valuable in characterizing diode, transistor, and resistance behavior
o Time domain reflectrometry (TDR)
o Measurement of phase shift of return signal can indicate location of electrical open
o Other characterization equipment
o LCR Meter, High resistance meter, low resistance meter, etc.
o Sometimes requires re-introduction of environmental stresses
o Especially elevated temperatures
Electrical Characterization Techniques
47
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Latest innovations
o Digital detector, laminography, nanofocus
resolution, oblique viewing
o Greatest potential
o Solder joint failures under area array
devices
o Problem
o Difficult to identify microcracks that occur
during fatigue/fracture of the solder
o Can detect voiding or poor wetting
o Provides evidence of root-cause of failures
during the application of cyclic stresses
Xray and HALT
48
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Best for component failures
o Especially increases in elevated leakage
currents
o Rarely used
o Electrical characterization often provides
more rapid identification of failure site and
more definitive information on failure
mode
o However, can provide evidence of
excessive temperature rise due to power
dissipation
o Can identify root-cause of failures at
temperatures below expectations
Thermal Imaging and HALT
49
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 50
SQUID Microscopy
 Current flow in devices produce a magnetic
field
 SQUID uses a highly sensitive magnetic detector
(superconductor) to resolve these fields
 Magnetic field image is converted to a current
density image, allowing for fault location
 Resolution
 500 nA, 300 nm
 Dependent on working distance (requires a flat
sample)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 51
SQUID Microscopy
 Critical technology for
detecting package level
electrical shorts
 Much more rapid failure
site resolution
 Absolute confirmation of
shorting path
 Thermal imaging induces
damage
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Failure Analysis: Temperature Tools
o Cold Spray and hot
plates to simulate fails
at temperature
extremes
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Failure Analysis Dremel Tool – Induce Vibrations
o A Dremel tool can be
used to induce local
vibration during
debugging
o http://www.dremel.com
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com54
Sherlock Analysis
o Sherlock Automated Design Analysis™ can be used to assess the validity of
failure mechanism
o Can provide prediction of modes shapes and stresses during vibration
o Compare prediction to observation
o Were vibration loads sufficient to fail the interconnect?
o Were defects identified during failure analysis?
o If answers to one and two are no, maybe issue with test procedure
o Important: Can not be used for combined environment (temperature cycling
+ vibration)
o Too complex
340 Hz (369 Hz) 806 Hz, (715 Hz) 945 Hz, (910 Hz)
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com55
NDE and HALT
o Failure site identification is best performed through
electrical characterization, visual inspection, and an
understanding of failure mechanisms
o NDE techniques are most useful in the second phase of
failure analysis, when root-cause must be determined
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com56
Root-Cause Evaluation
o Appropriate point to evaluate information obtained
o Requires understanding of failure mechanisms and root-cause
o Inappropriate technique could destroy evidence
o Use of failure analysis tools to direct destructive phase
of process
o Fault tree analysis (FTA)
o Fishbone diagram
o Stress vs. Strength
o Failure modes and effects analysis (FMEA)?
o Useful for design review; rarely used in failure analysis
process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Used once approximate failure site has been
determined
FTA (cont.)
57
Ceramic
Capacitor
Failure
Insufficient
Capacitance
Dielectric
Formulation
Crack
Propagation
Solder Joint
Open
Elevated
Leakage
Current
Crack
Propagation
Surface
Insulation
Resistance
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com58
Fishbone Diagram (Ishikawa)
o Useful in identifying all potential influences that could
drive product failure
o Can result in overcommitment of resources
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
Stress vs. Strength
59
 Fiber/resin separation
 Copper/resin separation
 Hollow fibers
 Misregistration of PTHs/Vias
 Copper wicking
 Drilling damage
 Separation of PTH wall from epoxy resin
 Higher voltage
 Higher moisture concentration
 Tighter conductor spacing
 Higher ion concentration in epoxy resin
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com60
Destructive Evaluation
o Used when information obtained during NDE is
insufficient to conclusively identify root-cause of failure
o Primarily consists of two techniques
o Cross-sectioning
o Decapsulation/delidding
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Critical for interconnect failures (cyclic stresses)
Cross-Sectioning and HALT
61
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com62
Cross-Sectioning
o Identify the area of interest (failure site)
o Remove the area of interest
o Diamond-encrusted blades are preferred (minimizes surface damage)
o Band saw, high and low speed diamond saw
o Plunge cuts preferred (minimizes damage to surrounding PCBA)
o Mount sample
o Prevents relative movement / improves handling
o Use room-temperature cured epoxy
o Plane of interest should be near the surface of sample
o Excessive removal of material increases likelihood of non-planarity,
overheating, and loss of root-cause information
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 63
Grinding/Polishing
o Rough grinding
o Used to approach the area of interest
o Involves application of 60 to 400 grit
of SiC paper
o Rough polishing
o Used to remove damage introduced
during grinding
o Involves 600 to 1200 grit of SiC paper
o Fine polishing/etching
o Used to reveal microstructural elements
o Involves use of polishing media
one micron and finer
Grit
(Silicon Carbide)
Particle Size
(microns)
240 50
320 29
400 17
600 9
800 7
1200 3
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Use of acid to remove encapsulant and
expose internal workings of the
package
o Used to analyze on-die failures
o Primarily driven by hot step stress test
o Interconnect issues, primarily wirebonds,
that can be discovered during
temperature cycling, are best initially
investigated using cross-sectioning
o Further characterization, using mechanical
testing, will require decapsulation
Decapsulation and HALT
64
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com65
Case Study -- Cold Step Stress Test
o Mass flow meter
o Recoverable failure at -30C
o Failure mode
o Loss of communication
o No permanent failures observed
o Results of electrical characterization / functional testing
o Insufficient filtering of electrolytic capacitors (rated at 105C)
o Parametric testing identified drop in capacitance at -35C
o Freezing of the electrolyte
o Corrective actions that were considered
o Switch from liquid electrolytic capacitor to tantalum capacitor
o Switch from 105C rated to 85C rated (reduced lifetime)
o Increase capacitance from 3.3 uF to 47 uF
o Extends range as well as improves filtering
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com66
Cold Step Stress Test (cont.)
o Subsequent testing on production units identified an additional
failure mode
o Initiated at approximately -5C
o Out of specification on line voltages, corrupting micros in the unit
o Incorrect resistor values
o Resistors were removed and replaced with different values
o Units passed remaining cold step stress test
o Of interest, the unit used for the initial HALT did not demonstrate
these cold temperature problems, showing impact of component
variability on product performance
o Identifying design margins!
o Subject sufficient number of samples to HALT process
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com67
Case Study -- Hot Step Stress Test
o Mass flow meter
o Permanent failure at 140C
o Failure site
o Catch diode for a switching power supply
o Failure mechanisms
o Electrical short (< 1 ohm). Operating junction temp for that
part is -65C to 125C.
o Diode was replaced and the unit was functional after
exposure at 140C. Temperature was stepped up to
150C, where nonfunctional failure reoccurred
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com68
Hot Step Stress Test (Example)
Hello Craig,
We found that an Optical-Isolator started getting noisy at higher temperatures.
The communication link stopped updating at 50C because digital data (1's and
0's) were getting through, but at a reduced amplitude. Thus, some of the 1's
were being misinterpreted as 0's by the foundation fieldbus communication
card.
We solved the problem by getting a better Opto-isolator with a higher current
gain.
Best regards,
Jim
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Application of vibration, starting at 5g and increasing
in 5g increments
o First failure noted at 30g
o Failure site identified as connector solder joints
o Insufficient flow through
o After touch up unit survived up to 40g
o Incorrect approach to failure analysis
o Unit was fixed as soon as a problem was detected
o Root-cause unable to be identified
o Design for reliability? Design for manufacturing? Processing
defect?
Case Study -- Vibration
69
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Failure after exposure to vibration
o Electrical characterization indicated electrical open under
area array device
o Confirmed through cross-sectioning
Case Study – Desktop Computer
70
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com
o Validity of failure mechanism?
o Additional observation of
separated leads on electrolytic
capacitors
o Occurrence of failure mechanism
was strongly dependent upon the
orientation of the leads
o Lead planes oriented along the
board length most likely to fail
o Vibration test may not have
applied random loads
o Potential issues with HALT table or
fixture
Issues with HALT
71
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com72
When FA and HALT?
o Failure analysis can be a time intensive process
o Hold up in product release while awaiting results
o The use of FA should be selective and should provide
maximum value
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com73
FA and HALT (cont.)
o Product
o When product design or functionality is revolutionary, perform FA on
all failures
o When product design or functionality is evolutionary, perform FA
selectively
o Temperature Step Stress Test
o When recoverable failure occurs between the operational and
storage specifications
o Specified to operate between 0 to 70C
o Specified for storage between -40 to 100C
o E.g., recoverable failure occurs at 90C
o When permanent failure occurs within 10C of cold temperature
storage specification or within 20C of hot temperature storage
specification
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com74
FA and HALT (Cyclic Stresses)
o Delineation between when to perform FA less definitive
o General rule
o Temperature cycling should not induce any failures, unless
using custom designed interconnect
o Use prior behavior to guide failure analysis in vibration or
combined
o Failure on previous designs is always the electrolytic capacitor at
20g’s
o Identification of processing defect can be a design
issue!
o Design for manufacturing
© 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com75
Conclusion
o HALT can be an important step in best practice
reliability activities
o Use can be extremely limited without root-cause
analysis
o Value-added root-cause analysis requires
understanding of failure mechanisms and the stresses
that drive them
o Sufficient knowledge base allows for optimization of
resources and rapid feedback

More Related Content

What's hot

Selecting a Printed Circuit Board Surface Fnish
Selecting a Printed Circuit Board Surface FnishSelecting a Printed Circuit Board Surface Fnish
Selecting a Printed Circuit Board Surface FnishCheryl Tulkoff
 
Reliability of power modules using sherlock
Reliability of power modules using sherlockReliability of power modules using sherlock
Reliability of power modules using sherlockGreg Caswell
 
James davies resume
James davies resumeJames davies resume
James davies resumeJames Davies
 
Selecting a Printed Circuit Board Surface Finish
Selecting a Printed Circuit Board Surface FinishSelecting a Printed Circuit Board Surface Finish
Selecting a Printed Circuit Board Surface FinishCheryl Tulkoff
 
Steven Sleeper Resume.mike's copypdf
Steven Sleeper Resume.mike's copypdfSteven Sleeper Resume.mike's copypdf
Steven Sleeper Resume.mike's copypdfsteven sleeper
 

What's hot (7)

Selecting a Printed Circuit Board Surface Fnish
Selecting a Printed Circuit Board Surface FnishSelecting a Printed Circuit Board Surface Fnish
Selecting a Printed Circuit Board Surface Fnish
 
Reference list
Reference listReference list
Reference list
 
Reliability of power modules using sherlock
Reliability of power modules using sherlockReliability of power modules using sherlock
Reliability of power modules using sherlock
 
Jason M (2)
Jason M (2)Jason M (2)
Jason M (2)
 
James davies resume
James davies resumeJames davies resume
James davies resume
 
Selecting a Printed Circuit Board Surface Finish
Selecting a Printed Circuit Board Surface FinishSelecting a Printed Circuit Board Surface Finish
Selecting a Printed Circuit Board Surface Finish
 
Steven Sleeper Resume.mike's copypdf
Steven Sleeper Resume.mike's copypdfSteven Sleeper Resume.mike's copypdf
Steven Sleeper Resume.mike's copypdf
 

Similar to Root Cause Analysis of HALT Failures

Alternatives to MTBF
Alternatives to MTBF Alternatives to MTBF
Alternatives to MTBF Craig Hillman
 
Wearable Electronics and Reliability
Wearable Electronics and ReliabilityWearable Electronics and Reliability
Wearable Electronics and ReliabilityCraig Hillman
 
Reliability of-wearable-electronics
Reliability of-wearable-electronicsReliability of-wearable-electronics
Reliability of-wearable-electronicsGreg Caswell
 
Using Physics of Failure to Predict System Level Reliability for Avionic Systems
Using Physics of Failure to Predict System Level Reliability for Avionic SystemsUsing Physics of Failure to Predict System Level Reliability for Avionic Systems
Using Physics of Failure to Predict System Level Reliability for Avionic SystemsGreg Caswell
 
Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...
Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...
Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...Cheryl Tulkoff
 
DFR Conference, March 2019
DFR  Conference, March 2019DFR  Conference, March 2019
DFR Conference, March 2019OptimalPlus
 
Test Tech 2011 Overview
Test Tech 2011 OverviewTest Tech 2011 Overview
Test Tech 2011 Overviewcasandel
 
What's Better in a Smart Building World?
What's Better in a Smart Building World?What's Better in a Smart Building World?
What's Better in a Smart Building World?Memoori
 
Alternative Methods for Qualifying EEE Parts for CubeSats
Alternative Methods for Qualifying EEE Parts for CubeSatsAlternative Methods for Qualifying EEE Parts for CubeSats
Alternative Methods for Qualifying EEE Parts for CubeSatsCraig Hillman
 
Industry Services Portfolio Feb 2016
Industry Services Portfolio Feb 2016Industry Services Portfolio Feb 2016
Industry Services Portfolio Feb 2016Wade Striemer
 
Check-All Valve Product Catalog
Check-All Valve Product CatalogCheck-All Valve Product Catalog
Check-All Valve Product CatalogCTi Controltech
 
Design for Manufacturing: Challenges & Opportunities
Design for Manufacturing: Challenges & OpportunitiesDesign for Manufacturing: Challenges & Opportunities
Design for Manufacturing: Challenges & OpportunitiesCheryl Tulkoff
 
Wireline Operator
Wireline OperatorWireline Operator
Wireline OperatorShane Fuqua
 

Similar to Root Cause Analysis of HALT Failures (20)

Alternatives to MTBF
Alternatives to MTBF Alternatives to MTBF
Alternatives to MTBF
 
Wearable Electronics and Reliability
Wearable Electronics and ReliabilityWearable Electronics and Reliability
Wearable Electronics and Reliability
 
Reliability of-wearable-electronics
Reliability of-wearable-electronicsReliability of-wearable-electronics
Reliability of-wearable-electronics
 
Using Physics of Failure to Predict System Level Reliability for Avionic Systems
Using Physics of Failure to Predict System Level Reliability for Avionic SystemsUsing Physics of Failure to Predict System Level Reliability for Avionic Systems
Using Physics of Failure to Predict System Level Reliability for Avionic Systems
 
Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...
Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...
Using Immersion Cooling Technology for Enhanced Efficiency & Reliability for ...
 
DFR Conference, March 2019
DFR  Conference, March 2019DFR  Conference, March 2019
DFR Conference, March 2019
 
Test Tech 2011 Overview
Test Tech 2011 OverviewTest Tech 2011 Overview
Test Tech 2011 Overview
 
What's Better in a Smart Building World?
What's Better in a Smart Building World?What's Better in a Smart Building World?
What's Better in a Smart Building World?
 
2014 Mechanical Catalog HTA194
2014 Mechanical Catalog HTA1942014 Mechanical Catalog HTA194
2014 Mechanical Catalog HTA194
 
Alternative Methods for Qualifying EEE Parts for CubeSats
Alternative Methods for Qualifying EEE Parts for CubeSatsAlternative Methods for Qualifying EEE Parts for CubeSats
Alternative Methods for Qualifying EEE Parts for CubeSats
 
JAMES AJIBOYE RESUME-2016
JAMES AJIBOYE  RESUME-2016JAMES AJIBOYE  RESUME-2016
JAMES AJIBOYE RESUME-2016
 
Industry Services Portfolio Feb 2016
Industry Services Portfolio Feb 2016Industry Services Portfolio Feb 2016
Industry Services Portfolio Feb 2016
 
Check-All Valve Product Catalog
Check-All Valve Product CatalogCheck-All Valve Product Catalog
Check-All Valve Product Catalog
 
JWBRESUME
JWBRESUMEJWBRESUME
JWBRESUME
 
Design for Manufacturing: Challenges & Opportunities
Design for Manufacturing: Challenges & OpportunitiesDesign for Manufacturing: Challenges & Opportunities
Design for Manufacturing: Challenges & Opportunities
 
Wireline Operator
Wireline OperatorWireline Operator
Wireline Operator
 
Leo St Louis - Resume
Leo St Louis - ResumeLeo St Louis - Resume
Leo St Louis - Resume
 
Leo St Louis - Resume
Leo St Louis - ResumeLeo St Louis - Resume
Leo St Louis - Resume
 
New resume 2015
New resume 2015New resume 2015
New resume 2015
 
SWO&G
SWO&GSWO&G
SWO&G
 

Recently uploaded

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 

Recently uploaded (20)

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 

Root Cause Analysis of HALT Failures

  • 1. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com© 2004 – 2010 Root-Cause Analysis of HALT Failures Cheryl Tulkoff and Greg Caswell ctulkoff@dfrsolutions.com gcaswell@dfrsolutions.com
  • 2. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com “the process of seamlessly cohesively integrating reliability tools together to maximize reliability and at the lowest possible cost” Reliability Integration
  • 3. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Reliability and CostCOST RELIABILITY TOTAL COST CURVE RELIABILITY PROGRAM COSTS WARRANTY COSTS OPTIMUM COST POINT Use of the proper tools during the proper life cycle phase will help to minimize total Life Cycle Cost (LCC).
  • 4. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o To minimize total Life Cycle Costs (LCC), a Reliability Engineer must do two things: o choose the best tools from all of the tools available and must apply these tools at the proper phases of a product life cycle. o properly integrate these tools together to assure that the proper information is fed forward and backwards at the proper times. o Highly Accelerated Life Testing (HALT) is one reliability tool that can assist with this process Reliability vs. Cost, continued
  • 5. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Highly Accelerated Life Testing (HALT) o In HALT, a product is introduced to progressively higher stress levels in order to quickly uncover design weaknesses, thereby increasing the operating margins of the product, translating to higher reliability. Introduction to HALT
  • 6. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT Testing Overview o Typically exposes the product to simultaneous vibration and thermal cycling o Product is tested in operational mode while the vibration stress is increased with each thermal cycle o Objective of the test is to cause failure of the product o Identifies the weakest link which can be then be improved o Test duration is typically less than a few weeks o On its own, this test is not able to predict the life of a product (acceleration factor is not known) o However, it is very useful when a product can be compared side-by-side with a previous generation of product with known reliability
  • 7. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Example HALT Test o Functional testing is performed while the vibration is taking place o This is important since intermittent opens can be found at this condition o Vibration starts at 0 Grms and step up by 5 Grms each cycle until failure is detected o Failing units shall be analyzed carefully to find root cause failure
  • 8. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Uncovers flaws typically not found before product introduction o Discovers and improves design margins o Reduces overall development time and cost o Provides information for developing accelerated manufacturing screens (HASS) HALT - Advantages over Traditional Testing
  • 9. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT versus Traditional Reliability Testing: Comparison HALT o Gathers info on product limitations o Focus on Design Weaknesses o 6 DoF (degrees of freedom) Vibration o High Thermal Rate of Change o Loosely Defined - Modified “On the Fly” o Not a “Pass/Fail” Test o Results used as basis for HASS or ESS (environmental stress screening) Traditional o Simulates Lifetime of use o Focus on Finding Failures o Single Axis Vibration o Moderate Thermal Rate of Change o Narrowly Defined - Rigidly Followed o “Pass/Fail” Test o Results typically not used in ESS
  • 10. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT Highly Accelerated Life Testing
  • 11. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Value of HALT o Rapidly discover design issues. o Evaluate & improve design margins. o Release mature product at market introduction. o Reduce development time & cost. o Evaluate cost reductions made to product. o Developmental HALT is not really a test you pass or fail, it is a process tool for the design engineers. o There are no pre-established limits. HALT - Highly Accelerated Life Test
  • 12. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works Start low and step up the stress, testing the product during the stressing
  • 13. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works Gradually increase stress level until a failure occurs
  • 14. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works Analyze the failure
  • 15. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works Make temporary improvements
  • 16. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works Increase stress and start process over
  • 17. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works
  • 18. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works
  • 19. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works
  • 20. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com HALT, How It Works Fundamental Technological Limit
  • 21. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Product Operational Specs Stress Upper Oper. Limit Upper Destruct Limit Lower Destruct Limit Lower Oper. Limit Margin Improvement Process
  • 22. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Product Operational Specs Stress Upper Oper. Limit Upper Destruct Limit Lower Destruct Limit Lower Oper. Limit Margin Improvement Process
  • 23. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Product Operational Specs Stress Upper Oper. Limit Upper Destruct Limit Lower Destruct Limit Lower Oper. Limit Margin Improvement Process
  • 24. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Product Operational Specs Stress Upper Oper. Limit Upper Destruct Limit Lower Destruct Limit Lower Oper. Limit Margin Improvement Process
  • 25. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Product Operational Specs Stress Upper Oper. Limit Upper Destruct Limit Lower Destruct Limit Lower Oper. Limit Margin Improvement Process Operating Margin Destruct Margin
  • 26. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Combined stresses to technology limits o Step stressing (individual and combined) o Powered product with monitored tests o Root cause failure analysis and appropriate corrective action HALT Implementation Requirements
  • 27. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Commonly Used HALT Equipment o Combined Temperature/Vibration Equipment o Pneumatic Vibration (to provide the random vibration) with Wide Frequency Spectrum o Fast Thermal Rates of Change and Wide Thermal Range
  • 28. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Planning a HALT o Setting up for a HALT o Executing a HALT o Post Testing Developing a HALT Process
  • 29. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Meet with design engineers to discuss product. o Determine stresses to apply. o Determine number of samples available. o Determine functional tests to run during Dev. HALT. o Determine what parameters to monitor. o Determine what constitutes a failure. o Develop Test Plan Planning a HALT It is essential that the product being tested be fully exercised and monitored throughout HALT for problem detection.
  • 30. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com For each stress, we use the Step Stress Approach Continue until operating & destruct limits of UUT (unit under test) are found or until test equipment limits are reached. Stimuli Time (hour:minute) D C B A 0:00 0:10 0:20 0:30 Planning a HALT (cont.)
  • 31. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com OTHER STIMULI: • Voltage/frequency margining • Power cycling • Combined environment (Temp/Vib) • Rapid transitions up to 60oC/min on the product VIBRATION HIGH TEMP LOW TEMP START INCREMENT DWELL TIME END 3-5 G’s 3-5 G’s 10 min* 10 min* 10 min* 5 to 10° C +20° C +20° C Destruct Limit or Test Equipment Limitation STIMULI 5 to 10° C * In addition to functional test time Planning a HALT (cont.)
  • 32. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Examples of Failure Inducing Loads • Temperature Cycling – Tmax, Tmin, dwell, ramp times • Sustained Temperature – T and exposure time • Humidity – Controlled, condensation • Corrosion – Salt, corrosive gases (Cl2, etc.) • Power cycling – Duty cycles, power dissipation • Electrical Loads – Voltage, current, current density – Static and transient • Electrical Noise • Mechanical Bending (Static and Cyclic) – Board-level strain • Random Vibration – PSD, exposure time, kurtosis • Harmonic Vibration – G and frequency • Mechanical shock – G, wave form, # of events
  • 33. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Design vibration fixture to ensure energy transmission to the product (different from electrodynamic vibration fixtures). o Design air ducting to ensure maximum thermal transitions on the product. o Tune chamber for product to be tested. o Apply thermocouples to product to be tested. o Setup all functional test equipment and cabling. Setting up for HALT
  • 34. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Begin with cold step stress and then hot step stress. o Step in 10 °C increments, as approach “limits” reduce to 5 °C. o Dwell time minimum of 10 minutes + time to run functional tests to ensure product is still functional. Start dwell once product reaches temperature setpoint, begin functional tests after 10 minute dwell. o Continue until fundamental limit of technology is reached. o If circuits have thermal safeties, ensure operation & then defeat to determine actual operating & destruct limits o Apply additional product stresses during process: o Power Supplies: Power cycling during cold step stress. o Input voltage variation. Load variations. o Frequency variation of clocks. Executing a HALT (Thermal Step Stress)
  • 35. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Transition temperature as fast as chamber will allow. o Select temperature range within 5° of the operating limits found during thermal step stress. o If product cannot withstand maximum thermal transitions, decrease transition rate by 10 °C per minute until operating limit is found. o Continue series of transitions for a minimum of 10 minutes (or time it takes to run set of functional tests). o Apply additional product stresses during process Executing a HALT (Fast Thermal Transitions)
  • 36. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Understand vibration response of product (i.e. how does product respond to increases in vibration input). o Determine Grms increments (usually 3-5 Grms on product). o Dwell time minimum of 10 minutes + time to run functional tests to ensure product is still functional. Start dwell once product reaches vibration setpoint. o Continue until reach fundamental limit of technology. o Apply additional product stresses during process. Executing a HALT (Vibration Step Stress)
  • 37. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Power Spectral Density (measured on product on OVS-2.5HP) Marker Fcn [Band Pwr] Trace: D Strt: 0 Hz Stop: 3 kHz Band:5.468 grms 32Hz 12.8kHzAVG: 20 X:288 Hz Y:2.82893 m* Band:6.548 grms Band:9.842 grms X:288 Hz Y:176.027 u* 32Hz 12.8kHzAVG: 20 X:288 Hz Y:3.44536 m* 32Hz 12.8kHzAVG: 20 Band:4.488 grms A: ROBOT BD B: ROBOT ARM X:288 Hz Y:1.19064 m* Y* = grms^2/Hz100 m* 10 u* LogMag Y* = grms^2/Hz100 m* 1 u* LogMag C: R MOTOR D: TABLE Y* = grms^2/Hz1 * 10 u* LogMag Y* = grms^2/Hz1 * 100 n* LogMag 32Hz 12.8kHzAVG: 20 Date: 03-14-97 Time: 09:46:00 AM Y-AXIS Z-AXIS X-AXIS Z-AXIS Vibration Step Stress (cont.)
  • 38. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Power Spectral Density (measured on product on QRS410T) Vibration Step Stress (cont.)
  • 39. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Develop thermal profile using thermal operating limits, dwell times and transitions rates used during thermal step stress & fast thermal transitions. o Incorporate additional product stresses into profile such as power cycling. o The first run through the profile, run a constant vibration level of approx. 5 Grms. Step in same increments determined during vibration step stress. o When reach higher Grms levels (approx. 20 Grms) add tickle vibration (approx. 5 Grms) to determine if failures were precipitated at high G level but only detectable at lower G level. Executing a HALT (Combined Environment)
  • 40. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Determine root cause of all failures that occurred. o Meet with design engineers to discuss results of Developmental HALT and root cause analysis. o Determine and implement corrective action. o Perform Verification HALT to ensure problems fixed and new problems not introduced. o Periodically evaluate product as it is subjected to engineering changes. Post Testing
  • 41. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Highly Accelerated Life Testing (HALT) is designed to induce product failure o Requires effective root-cause analysis o Determination of failure site, failure mechanism, and root- cause o Absence prevents improvement in design margins; greatly reduces value of HALT process o DfR Solutions takes a systematic approach to root- cause analysis o Proceeding from the least destructive to most destructive until root-cause(s) are conclusively identified o Process steps based upon failure information (failure mode, failure site, failure mechanism) Failure Analysis (Motivation) 41
  • 42. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 42 No Analysis? No Improvement!
  • 43. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com43 Information Gathering o Crucial first step in any failure analysis effort o Use of standard nomenclature o Failure history o When in HALT did the failure occur? o Step stress testing, temperature cycling, vibration, combined o What were the stresses being experienced by the product before failure? o Temperature, vibration, combined o Also electrical stresses o Is there a specific change in design/bill of materials/manufacturing that is being assessed? o Failure mode – the failure behavior experienced by the observer o Failure site – the supposed location of the failure o Failure mechanism – the mechanism initiating the failure (not root cause)
  • 44. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com44 Failure History o The stresses experienced before failure provides strong indication of potential failure site o Temperature step stress testing tends to drive component issues o High temperatures o Driven by increase in leakage current (diode turn on) or increase in resistance (time delay) o Semiconductors, power components o Cold temperatures o Primarily liquid electrolytic capacitors o Cyclic stresses (temperature cycling, vibration) tend to drive interconnect issues o Separable (sockets, press-fit connections, etc.) o Permanent (solder joints)
  • 45. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com45 Non-Destructive Evaluation (NDE) o Designed to provide maximum information with minimal risk of damaging or destroying physical evidence o Evaluation methods o Electrical Characterization o Visual Inspection o Xray Microscopy o Thermal Imaging o SQUID Microscopy o Acoustic Microscopy
  • 46. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Most critical step in failure analysis (FA) process for product subjected to HALT o Especially for failures induced during temperature step stress testing o Recoverable failures (operational limits) o Failure site can only determined through electrical characterization and interpretation of changes in functional behavior o Permanent failures (destruct limits) o Proceeds with a broader range of possible electrical characterization techniques Electrical Characterization 46
  • 47. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o JTAG (joint task action group) Boundary Scan o Very accurate identification of failure site; rarely performed on failed units o Oscilloscope o Useful in probing operational circuitry o Resistance measurements (manual) o Binary approach (best for an electrical open) o Isolation of attached components o Attempt to perform electrical characterization without component removal o Parametric characterization o Comparison of performance to datasheet specifications o Curve tracer o Valuable in characterizing diode, transistor, and resistance behavior o Time domain reflectrometry (TDR) o Measurement of phase shift of return signal can indicate location of electrical open o Other characterization equipment o LCR Meter, High resistance meter, low resistance meter, etc. o Sometimes requires re-introduction of environmental stresses o Especially elevated temperatures Electrical Characterization Techniques 47
  • 48. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Latest innovations o Digital detector, laminography, nanofocus resolution, oblique viewing o Greatest potential o Solder joint failures under area array devices o Problem o Difficult to identify microcracks that occur during fatigue/fracture of the solder o Can detect voiding or poor wetting o Provides evidence of root-cause of failures during the application of cyclic stresses Xray and HALT 48
  • 49. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Best for component failures o Especially increases in elevated leakage currents o Rarely used o Electrical characterization often provides more rapid identification of failure site and more definitive information on failure mode o However, can provide evidence of excessive temperature rise due to power dissipation o Can identify root-cause of failures at temperatures below expectations Thermal Imaging and HALT 49
  • 50. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 50 SQUID Microscopy  Current flow in devices produce a magnetic field  SQUID uses a highly sensitive magnetic detector (superconductor) to resolve these fields  Magnetic field image is converted to a current density image, allowing for fault location  Resolution  500 nA, 300 nm  Dependent on working distance (requires a flat sample)
  • 51. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 51 SQUID Microscopy  Critical technology for detecting package level electrical shorts  Much more rapid failure site resolution  Absolute confirmation of shorting path  Thermal imaging induces damage
  • 52. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Failure Analysis: Temperature Tools o Cold Spray and hot plates to simulate fails at temperature extremes
  • 53. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Failure Analysis Dremel Tool – Induce Vibrations o A Dremel tool can be used to induce local vibration during debugging o http://www.dremel.com
  • 54. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com54 Sherlock Analysis o Sherlock Automated Design Analysis™ can be used to assess the validity of failure mechanism o Can provide prediction of modes shapes and stresses during vibration o Compare prediction to observation o Were vibration loads sufficient to fail the interconnect? o Were defects identified during failure analysis? o If answers to one and two are no, maybe issue with test procedure o Important: Can not be used for combined environment (temperature cycling + vibration) o Too complex 340 Hz (369 Hz) 806 Hz, (715 Hz) 945 Hz, (910 Hz)
  • 55. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com55 NDE and HALT o Failure site identification is best performed through electrical characterization, visual inspection, and an understanding of failure mechanisms o NDE techniques are most useful in the second phase of failure analysis, when root-cause must be determined
  • 56. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com56 Root-Cause Evaluation o Appropriate point to evaluate information obtained o Requires understanding of failure mechanisms and root-cause o Inappropriate technique could destroy evidence o Use of failure analysis tools to direct destructive phase of process o Fault tree analysis (FTA) o Fishbone diagram o Stress vs. Strength o Failure modes and effects analysis (FMEA)? o Useful for design review; rarely used in failure analysis process
  • 57. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Used once approximate failure site has been determined FTA (cont.) 57 Ceramic Capacitor Failure Insufficient Capacitance Dielectric Formulation Crack Propagation Solder Joint Open Elevated Leakage Current Crack Propagation Surface Insulation Resistance
  • 58. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com58 Fishbone Diagram (Ishikawa) o Useful in identifying all potential influences that could drive product failure o Can result in overcommitment of resources
  • 59. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com Stress vs. Strength 59  Fiber/resin separation  Copper/resin separation  Hollow fibers  Misregistration of PTHs/Vias  Copper wicking  Drilling damage  Separation of PTH wall from epoxy resin  Higher voltage  Higher moisture concentration  Tighter conductor spacing  Higher ion concentration in epoxy resin
  • 60. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com60 Destructive Evaluation o Used when information obtained during NDE is insufficient to conclusively identify root-cause of failure o Primarily consists of two techniques o Cross-sectioning o Decapsulation/delidding
  • 61. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Critical for interconnect failures (cyclic stresses) Cross-Sectioning and HALT 61
  • 62. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com62 Cross-Sectioning o Identify the area of interest (failure site) o Remove the area of interest o Diamond-encrusted blades are preferred (minimizes surface damage) o Band saw, high and low speed diamond saw o Plunge cuts preferred (minimizes damage to surrounding PCBA) o Mount sample o Prevents relative movement / improves handling o Use room-temperature cured epoxy o Plane of interest should be near the surface of sample o Excessive removal of material increases likelihood of non-planarity, overheating, and loss of root-cause information
  • 63. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com 63 Grinding/Polishing o Rough grinding o Used to approach the area of interest o Involves application of 60 to 400 grit of SiC paper o Rough polishing o Used to remove damage introduced during grinding o Involves 600 to 1200 grit of SiC paper o Fine polishing/etching o Used to reveal microstructural elements o Involves use of polishing media one micron and finer Grit (Silicon Carbide) Particle Size (microns) 240 50 320 29 400 17 600 9 800 7 1200 3
  • 64. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Use of acid to remove encapsulant and expose internal workings of the package o Used to analyze on-die failures o Primarily driven by hot step stress test o Interconnect issues, primarily wirebonds, that can be discovered during temperature cycling, are best initially investigated using cross-sectioning o Further characterization, using mechanical testing, will require decapsulation Decapsulation and HALT 64
  • 65. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com65 Case Study -- Cold Step Stress Test o Mass flow meter o Recoverable failure at -30C o Failure mode o Loss of communication o No permanent failures observed o Results of electrical characterization / functional testing o Insufficient filtering of electrolytic capacitors (rated at 105C) o Parametric testing identified drop in capacitance at -35C o Freezing of the electrolyte o Corrective actions that were considered o Switch from liquid electrolytic capacitor to tantalum capacitor o Switch from 105C rated to 85C rated (reduced lifetime) o Increase capacitance from 3.3 uF to 47 uF o Extends range as well as improves filtering
  • 66. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com66 Cold Step Stress Test (cont.) o Subsequent testing on production units identified an additional failure mode o Initiated at approximately -5C o Out of specification on line voltages, corrupting micros in the unit o Incorrect resistor values o Resistors were removed and replaced with different values o Units passed remaining cold step stress test o Of interest, the unit used for the initial HALT did not demonstrate these cold temperature problems, showing impact of component variability on product performance o Identifying design margins! o Subject sufficient number of samples to HALT process
  • 67. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com67 Case Study -- Hot Step Stress Test o Mass flow meter o Permanent failure at 140C o Failure site o Catch diode for a switching power supply o Failure mechanisms o Electrical short (< 1 ohm). Operating junction temp for that part is -65C to 125C. o Diode was replaced and the unit was functional after exposure at 140C. Temperature was stepped up to 150C, where nonfunctional failure reoccurred
  • 68. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com68 Hot Step Stress Test (Example) Hello Craig, We found that an Optical-Isolator started getting noisy at higher temperatures. The communication link stopped updating at 50C because digital data (1's and 0's) were getting through, but at a reduced amplitude. Thus, some of the 1's were being misinterpreted as 0's by the foundation fieldbus communication card. We solved the problem by getting a better Opto-isolator with a higher current gain. Best regards, Jim
  • 69. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Application of vibration, starting at 5g and increasing in 5g increments o First failure noted at 30g o Failure site identified as connector solder joints o Insufficient flow through o After touch up unit survived up to 40g o Incorrect approach to failure analysis o Unit was fixed as soon as a problem was detected o Root-cause unable to be identified o Design for reliability? Design for manufacturing? Processing defect? Case Study -- Vibration 69
  • 70. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Failure after exposure to vibration o Electrical characterization indicated electrical open under area array device o Confirmed through cross-sectioning Case Study – Desktop Computer 70
  • 71. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com o Validity of failure mechanism? o Additional observation of separated leads on electrolytic capacitors o Occurrence of failure mechanism was strongly dependent upon the orientation of the leads o Lead planes oriented along the board length most likely to fail o Vibration test may not have applied random loads o Potential issues with HALT table or fixture Issues with HALT 71
  • 72. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com72 When FA and HALT? o Failure analysis can be a time intensive process o Hold up in product release while awaiting results o The use of FA should be selective and should provide maximum value
  • 73. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com73 FA and HALT (cont.) o Product o When product design or functionality is revolutionary, perform FA on all failures o When product design or functionality is evolutionary, perform FA selectively o Temperature Step Stress Test o When recoverable failure occurs between the operational and storage specifications o Specified to operate between 0 to 70C o Specified for storage between -40 to 100C o E.g., recoverable failure occurs at 90C o When permanent failure occurs within 10C of cold temperature storage specification or within 20C of hot temperature storage specification
  • 74. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com74 FA and HALT (Cyclic Stresses) o Delineation between when to perform FA less definitive o General rule o Temperature cycling should not induce any failures, unless using custom designed interconnect o Use prior behavior to guide failure analysis in vibration or combined o Failure on previous designs is always the electrolytic capacitor at 20g’s o Identification of processing defect can be a design issue! o Design for manufacturing
  • 75. © 2004 - 2007© 2004 - 20109000 Virginia Manor Rd Ste 290, Beltsville MD 20705 | 301-474-0607 | www.dfrsolutions.com75 Conclusion o HALT can be an important step in best practice reliability activities o Use can be extremely limited without root-cause analysis o Value-added root-cause analysis requires understanding of failure mechanisms and the stresses that drive them o Sufficient knowledge base allows for optimization of resources and rapid feedback