SlideShare a Scribd company logo
11
Sean Carter, NASA JSC
Daniel Deans, ManTech SRS Technologies
Constellation Reliability
Engineering Process –
Optimizing CxP Risk
Used with Permission
2
DFRAM Overview
 Why does reliability engineering exist?
 How does it fit within the life cycle?
 Success space vs. failure space
 Partnership on system engineering team
 The value of “designing-out” failure modes
 Where does it fit in the lifecycle?
 What are some of the tools?
 How are they applied?
 Real examples
2
3
 Failure is not an option…
 A design engineer does not
know what he does not know
 An extra set of eyes and ears
is always good
 You have to spend money to
make money
 Mr. Murphy tends to rear
his ugly head when you are
not expecting it…
 What all this means is: You
have to work at it – nothing
worth accomplishing
comes easy
 Reliability engineering is a
discipline that adds value
to the systems engineering
process!
3
Reliability Engineering Value - Clichés
4
Typical System Engineering Lifecycle
5
Reliability Engineering Throughout Project Life
66
The Life Cycle Approach
 Reliability is best designed-in;
it is, for the most part, not:
 Analyzed in
 Tested in
 Operated in
 Successful reliability performance
begins with a diligent, intentional
approach at the very beginning of a project
 Pre-phase A: requirements
 Phase A: allocation; plan; resources
 Phase B: analysis, design input, preliminary design review
 Phase C: detailed design inputs; more analysis; trade studies;
design verification; critical design review
 Phase D: test planning, test readiness, manufacturing, final
validation; flight readiness review
 Phase E/F: ops, growth, disposal and lessons learned
System EngineeringSystem Engineering Test and AssessmentTest and Assessment
Element
Integration & Test
System
Integration Test
System Element
Data Reduction and
Assessment
System Concept
Exploration
Preliminary
Design
Design Synthesis
Component Fabrication, Assembly,
Integrate, & Test
Requirements
Compliance
Configuration
Management
Project Direction,
Control, & Planning
Risk
Management
System
Analysis
Project
Direction
and
Control
Project
Direction
and
Control
• System, Element,
Subsystem Models
• System Performance
Analyses
• Specifications
• Verification
• Management Plan
• Budget Development & Control
• Project Plan Development
• Schedule Development & Control
• Design Data Base
• Problem/Failure
Reports (PFR)
• Engineering Change
Orders
• Risk Planning
• Risk Assessment
• Risk Handling/Mitigation
• Risk Monitoring
77
Success Space vs. Failure Space
 A design engineer thinks in success space (typically)
 How will the widget work?
 When it is designed, what function will it perform?
 What are the performance requirements?
 Reliability engineer paid to think in failure space
 How will the widget fail?
 What about the operating environment will cause issues?
 What materials, processes, and tools will accentuate failure modes?
 Is redundancy required
 Are there operational work-arounds?
 How will faults propagate through the system?
 What are the effects of a failure mode on the mission
 Superimpose the two processes, you get success!
88
Credibility: Partnership on
System Engineering Team
 Safety and Mission Assurance organization provides
discipline experts to support design teams
 Our job is to serve; not to inhibit
 We help the system engineering teams identify
hazards and failure modes and design them out
 Our sole reason for existing is to ensure
project/program success and to reduce/eliminate
operational risk
 We are partners for success
 The aim in partnership is to duplicate our knowledge
in the collective heads of our design-team partners
9
The Value of “Designing-Out” Failure Modes
 A failure mode is an obstacle to mission success
 Not all may cause mission failure, but, any failure of a
component has potential
 In the commercial world, a failure in the field costs 10 times
what it costs to mitigate in the design process
 In the space business, a failure can and will cost the
mission and quite possibly endanger people
 Identifying and designing-out failure modes is important!
9Company Confidential
1010
How Do We Design Out Failure Modes?
 Methodical process; starts in pre-phase A, follows the lifecycle.
 DMEDI – Define, Measure, Explore, Develop, Implement
(12 steps)
 Define requirements
 Allocate requirements
 Plan activities and analysis, including test and verification
 Collect data and develop data sources
 Use RAM simulation, FMEA, FTA, worst case analysis, derating,
proven design practices to drive the design
 Support design reviews and require improvement
 Verify and ensure that design will meet requirements
 Plan and implement thorough testing
 Finalize verification, ascertain flight readiness
 Identify reliability growth opportunities once design is complete
 Investigate and eliminate root causes to anomalies
 Develop lessons learned, provide feedback to future engineering teams
11
Pre-Phase A Concept
Development
 Very important part of process –
DFRAM starts here
 Develop requirements that will
optimize RAM for program/project
 Requirements include availability,
mean time to failure, fault tolerance,
mean time to repair, time to replace
 Import lessons learned from similar
programs/systems
 Collect similar system failure history
data
 Begin development of system model
 Begin development of RAM Plan
12
Phase A: Preliminary Analysis
 Refine requirements, negotiate
allocations with design elements
 Finalize RAM Plan and educate design
team on process; what role reliability
engineering team will fill
 Continue to develop preliminary model;
begin FMEAs, FTAs, Probabilistic
assessments
 Allocate requirements to lowest
design-to level
 Negotiate failure definitions, failure
budgets with design teams
 Identify initial critical items, compare with
lessons learned from previous systems
 Continue to identify data sources
 Identify critical suppliers; begin to form
partnerships
13
Phase B – Preliminary Design
 Continue to build simulation (model) and
add more details
 Identify most effective analyses tools to use
to drive design
 Complete preliminary FMEA, FTA, PRA
 Continue to develop supplier partnerships
 Prepare for preliminary design review
 Perform maintenance task analysis
 Identify design improvement initiatives and
optimize using simulation
 Perform other sensitivity studies based on
fault tolerance requirements
 Begin developing and finalizing FRACAS,
test plans, reliability growth strategy
 Partner with designers to identify failure
modes, design them out
 Support concept of operations optimization
14
Phase C – Detailed Design
 Perform detailed design analysis – PDR recovery
 Focus on pareto items identified from analyses (Top 10)
 Continue to develop and use RAM simulation, FMEA,
FTA, etc. to design out failure modes
 Use Con-Ops to develop operational work-arounds as
failure mode mitigation
 Finalize test plans –review for reliability success criteria
 Audit suppliers, provide support for reliability
improvement
 Mitigate schedule risks
 Finalize critical items, document for testing
 Begin life testing of components and subsystems as
feasible
 Perform specialized analysis (sneaks, fault propagation)
 Prepare for and support CDR
15
Phase D –Development
 Finalize design - CDR recovery, cut into
manufacturing
 Finalize FMEAs, FTAs, Simulations, CILs
 Support testing, root cause
investigations and corrective action
 Begin collection of failure and
operational history data (upon first
application of power)
 Finalize reliability growth strategy
 Develop and begin implementation of
reliability-centered maintenance
approach
 Make “last minute” improvements based
on test results
 Identify lessons learned and document
 Update Con-Ops with operational work-
arounds for critical items
16
Phase E/F – Ops and Disposal
 Continue to gather data, monitor
operations for anomalies
 Support failure analyses, root cause
investigations
 Implement reliability growth process,
identify areas for growth, design
solutions
 Document lessons learned
 Use simulation to validate reliability
growth strategy, sensitivities
 Update RAM Plan with lessons
learned
 Support system disposal via
identification of reliability challenges
to shutdown
17
What are the Tools?
 Some of the tools that we use are:
 Requirements allocation
 RAM simulation/probabilistic risk assessment
 FMEA/FMECA
 Fault tree analysis (FTA)/event tree assessment
 Parts stress analysis/derating
 Detailed design analysis
 Worst case analysis
 Redundancy screens
 Extensive testing and verification analysis
 Reliability growth planning and implementation
 Others….
18
Reliability and Maintainability Simulation
 A very powerful process
 Can help design out failure modes without cutting metal
 Provides for the Pareto Principle (20/80)
 Gives design team a tool for sensitivity analysis
 Allows for trying many different scenarios
 Helps to optimize the return on investment based on cost to
improve curve
$ Cost
Reliability
High rate of return
KITC
Area of diminishing return
KITC = Point on Curve where rise
becomes less than run (reliability
improvement = rise, cost to
improve = run)
19
Simulation Basics
 Simulations are built based on the system architecture
 Model provides for “RAM” characteristics of system
 Input data includes failure rates, repair times, sparing
information, logistics information, operational work-
arounds
 Simulation is run based on mission profiles
 “Monte Carlo” methodology is used
 Typically data is input using statistical distributions
 Outputs are system availability and cutsets (and other
failure “illuminators”)
 Cutsets lead to sensitivity analyses which in turn can
drive improvements (failure mode elimination)
20
RAM Simulation Example
 Simulation is dynamic, not static analysis
 Can provide much information about overall availability
of system under many different sets of conditions
 Today’s tools can include operational concepts and
rules, optimization of spares (some automatic)
 Requires specific input data
21
How Results are Used
 Outputs of baseline simulations are verified and
validated using expert elicitation
 Once all agree that the simulation is in the “ballpark,” (do
not get wrapped around the axle on the numbers; it is the
gap elimination that provides the most value) – begin the
sensitivity analyses
 Identify opportunities for improvement, plug those back
into the sim, ascertain value of improvements
 Continue this process until gaps are eliminated or at
least reduced.
 This can include block improvement of overall
component failure rates – get the suppliers in on the act
(supplier partnerships)
 Ensure data from simulation is used in the design
process
22
Success Stories: NASA Instrument Design
 Validation of proper installation of sample cup retaining springs
on Sample Manipulation System to preclude workmanship
failures. (single ring failure would result in loss of solid sample
science)
 Use of physics of failure methods to identify and eliminate,
where possible, failure modes of Pyrolysis Oven.
 Implementation of HiPot test for Wide Range Pump motor to
eliminate workmanship related failures.
 Identification of Hall Effect Device on actuators as possible
Radiation Sensitive device. Subsequent testing validated
suitability of device.
 Identification of thermal switch on Gas Trap as Reliability
Issue. Redesign produced higher Reliability solution.
 FMEA of Gas Processing System provided justification for
addition of limited redundancy.
 Improved reliability of instrument by approximately 25% based in
initial predictions.
23
Complex Space Systems Application
 Predicated on effective
requirements
implementation
 Detailed RAM Plan
developed and
implemented at Program
Level
 RAM requirements, RAM
Plan flowed down to
systems, elements of
systems
 System owners
responsible for DFRAM,
but program will facilitate
and audit
 Program level analyses
including simulation, FMEA,
PRA being performed
 Verification and validation
will be program level
functions
 PRA will be part of flight
readiness decision
 Software included in DFRAM
activities (no longer black
box)
 System Engineering
organization partnering with
S&MA organization for RAM
implementation
23
24
SUMMARY
 Success of a system
predicated on intentional
implementation of DFRAM
 It will not happen
spontaneously
 Must be married with the
system engineering
process
 Program management
must be disciples – will
not work otherwise
 It is always easier and
more cost effective to do
it right the first time
 Implementation requires
people skills and a
service mentality
24

More Related Content

What's hot

Stefanini.trinh
Stefanini.trinhStefanini.trinh
Stefanini.trinh
NASAPMC
 
Reed simpson
Reed simpsonReed simpson
Reed simpson
NASAPMC
 
Costello kenneth
Costello kennethCostello kenneth
Costello kenneth
NASAPMC
 
Dawn.schaible
Dawn.schaibleDawn.schaible
Dawn.schaible
NASAPMC
 
Snow lee
Snow leeSnow lee
Snow lee
NASAPMC
 
Hughitt brian
Hughitt brianHughitt brian
Hughitt brian
NASAPMC
 
Lou wheatcraft vv
Lou wheatcraft vvLou wheatcraft vv
Lou wheatcraft vv
NASAPMC
 
Saltzman.john
Saltzman.johnSaltzman.john
Saltzman.john
NASAPMC
 
Ray.ronald
Ray.ronaldRay.ronald
Ray.ronald
NASAPMC
 
Dittemore.gary
Dittemore.garyDittemore.gary
Dittemore.gary
NASAPMC
 
Armstrong
ArmstrongArmstrong
Armstrong
NASAPMC
 
D brown gbezos_shirshorn
D brown gbezos_shirshornD brown gbezos_shirshorn
D brown gbezos_shirshorn
NASAPMC
 
Bauer.frank
Bauer.frankBauer.frank
Bauer.frank
NASAPMC
 
Thomas.coonce
Thomas.coonceThomas.coonce
Thomas.coonce
NASAPMC
 
Housch
HouschHousch
Housch
NASAPMC
 
Dan galorath
Dan galorathDan galorath
Dan galorath
NASAPMC
 
Dezfuli youngblood
Dezfuli youngbloodDezfuli youngblood
Dezfuli youngblood
NASAPMC
 
K ingoldsby
K ingoldsbyK ingoldsby
K ingoldsby
NASAPMC
 
Jenks.ken
Jenks.kenJenks.ken
Jenks.ken
NASAPMC
 
Mullane stanley-hamilton-wise
Mullane stanley-hamilton-wiseMullane stanley-hamilton-wise
Mullane stanley-hamilton-wise
NASAPMC
 

What's hot (20)

Stefanini.trinh
Stefanini.trinhStefanini.trinh
Stefanini.trinh
 
Reed simpson
Reed simpsonReed simpson
Reed simpson
 
Costello kenneth
Costello kennethCostello kenneth
Costello kenneth
 
Dawn.schaible
Dawn.schaibleDawn.schaible
Dawn.schaible
 
Snow lee
Snow leeSnow lee
Snow lee
 
Hughitt brian
Hughitt brianHughitt brian
Hughitt brian
 
Lou wheatcraft vv
Lou wheatcraft vvLou wheatcraft vv
Lou wheatcraft vv
 
Saltzman.john
Saltzman.johnSaltzman.john
Saltzman.john
 
Ray.ronald
Ray.ronaldRay.ronald
Ray.ronald
 
Dittemore.gary
Dittemore.garyDittemore.gary
Dittemore.gary
 
Armstrong
ArmstrongArmstrong
Armstrong
 
D brown gbezos_shirshorn
D brown gbezos_shirshornD brown gbezos_shirshorn
D brown gbezos_shirshorn
 
Bauer.frank
Bauer.frankBauer.frank
Bauer.frank
 
Thomas.coonce
Thomas.coonceThomas.coonce
Thomas.coonce
 
Housch
HouschHousch
Housch
 
Dan galorath
Dan galorathDan galorath
Dan galorath
 
Dezfuli youngblood
Dezfuli youngbloodDezfuli youngblood
Dezfuli youngblood
 
K ingoldsby
K ingoldsbyK ingoldsby
K ingoldsby
 
Jenks.ken
Jenks.kenJenks.ken
Jenks.ken
 
Mullane stanley-hamilton-wise
Mullane stanley-hamilton-wiseMullane stanley-hamilton-wise
Mullane stanley-hamilton-wise
 

Similar to Sean carter dan_deans

Risk Driven Testing
Risk Driven TestingRisk Driven Testing
Risk Driven Testing
Jorge Boria
 
AFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber Security
AFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber SecurityAFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber Security
AFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber Security
Djindo Lee
 
The Automation Firehose: Be Strategic & Tactical With Your Mobile & Web Testing
The Automation Firehose: Be Strategic & Tactical With Your Mobile & Web TestingThe Automation Firehose: Be Strategic & Tactical With Your Mobile & Web Testing
The Automation Firehose: Be Strategic & Tactical With Your Mobile & Web Testing
Perfecto by Perforce
 
On the nature of FMECA... An introduction
On the nature of FMECA... An introductionOn the nature of FMECA... An introduction
On the nature of FMECA... An introduction
MartGerrand
 
2010-36-0387I_Paper
2010-36-0387I_Paper2010-36-0387I_Paper
Failure Mode Effect Analysis in Engineering Failures
Failure Mode Effect Analysis in Engineering FailuresFailure Mode Effect Analysis in Engineering Failures
Failure Mode Effect Analysis in Engineering Failures
Padmanabhan Krishnan
 
Value of software testing
Value of software testingValue of software testing
Value of software testing
Transpose Solutions Inc
 
The Automation Firehose: Be Strategic and Tactical by Thomas Haver
The Automation Firehose: Be Strategic and Tactical by Thomas HaverThe Automation Firehose: Be Strategic and Tactical by Thomas Haver
The Automation Firehose: Be Strategic and Tactical by Thomas Haver
QA or the Highway
 
Downloads abc 2006 presentation downloads-ramesh_babu
Downloads abc 2006   presentation downloads-ramesh_babuDownloads abc 2006   presentation downloads-ramesh_babu
Downloads abc 2006 presentation downloads-ramesh_babu
Hem Rana
 
Root Cause and Corrective Action (RCCA) Workshop
Root Cause and Corrective Action (RCCA) WorkshopRoot Cause and Corrective Action (RCCA) Workshop
Root Cause and Corrective Action (RCCA) Workshop
Accendo Reliability
 
Fmea
FmeaFmea
Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012
Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012
Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012
TEST Huddle
 
MCVisionWP1A_2003
MCVisionWP1A_2003MCVisionWP1A_2003
MCVisionWP1A_2003
Jason Reid
 
Feasible
FeasibleFeasible
Feasible
anasamirah
 
Critical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management SolutionCritical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management Solution
DevOps.com
 
Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"
David Pedreno
 
Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"
David Pedreno
 
Software reliability engineering
Software reliability engineeringSoftware reliability engineering
Software reliability engineering
Mark Turner CRP
 
Designing for Testability: Differentiator in a Competitive Market
Designing for Testability: Differentiator in a Competitive MarketDesigning for Testability: Differentiator in a Competitive Market
Designing for Testability: Differentiator in a Competitive Market
TechWell
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
Marcus Vechiato
 

Similar to Sean carter dan_deans (20)

Risk Driven Testing
Risk Driven TestingRisk Driven Testing
Risk Driven Testing
 
AFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber Security
AFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber SecurityAFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber Security
AFITC 2018 - Using Process Maturity and Agile to Strengthen Cyber Security
 
The Automation Firehose: Be Strategic & Tactical With Your Mobile & Web Testing
The Automation Firehose: Be Strategic & Tactical With Your Mobile & Web TestingThe Automation Firehose: Be Strategic & Tactical With Your Mobile & Web Testing
The Automation Firehose: Be Strategic & Tactical With Your Mobile & Web Testing
 
On the nature of FMECA... An introduction
On the nature of FMECA... An introductionOn the nature of FMECA... An introduction
On the nature of FMECA... An introduction
 
2010-36-0387I_Paper
2010-36-0387I_Paper2010-36-0387I_Paper
2010-36-0387I_Paper
 
Failure Mode Effect Analysis in Engineering Failures
Failure Mode Effect Analysis in Engineering FailuresFailure Mode Effect Analysis in Engineering Failures
Failure Mode Effect Analysis in Engineering Failures
 
Value of software testing
Value of software testingValue of software testing
Value of software testing
 
The Automation Firehose: Be Strategic and Tactical by Thomas Haver
The Automation Firehose: Be Strategic and Tactical by Thomas HaverThe Automation Firehose: Be Strategic and Tactical by Thomas Haver
The Automation Firehose: Be Strategic and Tactical by Thomas Haver
 
Downloads abc 2006 presentation downloads-ramesh_babu
Downloads abc 2006   presentation downloads-ramesh_babuDownloads abc 2006   presentation downloads-ramesh_babu
Downloads abc 2006 presentation downloads-ramesh_babu
 
Root Cause and Corrective Action (RCCA) Workshop
Root Cause and Corrective Action (RCCA) WorkshopRoot Cause and Corrective Action (RCCA) Workshop
Root Cause and Corrective Action (RCCA) Workshop
 
Fmea
FmeaFmea
Fmea
 
Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012
Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012
Peter Zimmerer - Evolve Design For Testability To The Next Level - EuroSTAR 2012
 
MCVisionWP1A_2003
MCVisionWP1A_2003MCVisionWP1A_2003
MCVisionWP1A_2003
 
Feasible
FeasibleFeasible
Feasible
 
Critical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management SolutionCritical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management Solution
 
Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"
 
Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"Asset Finance Systems: Project Initiation "101"
Asset Finance Systems: Project Initiation "101"
 
Software reliability engineering
Software reliability engineeringSoftware reliability engineering
Software reliability engineering
 
Designing for Testability: Differentiator in a Competitive Market
Designing for Testability: Differentiator in a Competitive MarketDesigning for Testability: Differentiator in a Competitive Market
Designing for Testability: Differentiator in a Competitive Market
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 

More from NASAPMC

Bejmuk bo
Bejmuk boBejmuk bo
Bejmuk bo
NASAPMC
 
Baniszewski john
Baniszewski johnBaniszewski john
Baniszewski john
NASAPMC
 
Yew manson
Yew mansonYew manson
Yew manson
NASAPMC
 
Wood frank
Wood frankWood frank
Wood frank
NASAPMC
 
Wood frank
Wood frankWood frank
Wood frank
NASAPMC
 
Wessen randi (cd)
Wessen randi (cd)Wessen randi (cd)
Wessen randi (cd)
NASAPMC
 
Vellinga joe
Vellinga joeVellinga joe
Vellinga joe
NASAPMC
 
Trahan stuart
Trahan stuartTrahan stuart
Trahan stuart
NASAPMC
 
Stock gahm
Stock gahmStock gahm
Stock gahm
NASAPMC
 
Smalley sandra
Smalley sandraSmalley sandra
Smalley sandra
NASAPMC
 
Seftas krage
Seftas krageSeftas krage
Seftas krage
NASAPMC
 
Sampietro marco
Sampietro marcoSampietro marco
Sampietro marco
NASAPMC
 
Rudolphi mike
Rudolphi mikeRudolphi mike
Rudolphi mike
NASAPMC
 
Roberts karlene
Roberts karleneRoberts karlene
Roberts karlene
NASAPMC
 
Rackley mike
Rackley mikeRackley mike
Rackley mike
NASAPMC
 
Paradis william
Paradis williamParadis william
Paradis william
NASAPMC
 
Osterkamp jeff
Osterkamp jeffOsterkamp jeff
Osterkamp jeff
NASAPMC
 
O'keefe william
O'keefe williamO'keefe william
O'keefe william
NASAPMC
 
Muller ralf
Muller ralfMuller ralf
Muller ralf
NASAPMC
 
Mulenburg jerry
Mulenburg jerryMulenburg jerry
Mulenburg jerry
NASAPMC
 

More from NASAPMC (20)

Bejmuk bo
Bejmuk boBejmuk bo
Bejmuk bo
 
Baniszewski john
Baniszewski johnBaniszewski john
Baniszewski john
 
Yew manson
Yew mansonYew manson
Yew manson
 
Wood frank
Wood frankWood frank
Wood frank
 
Wood frank
Wood frankWood frank
Wood frank
 
Wessen randi (cd)
Wessen randi (cd)Wessen randi (cd)
Wessen randi (cd)
 
Vellinga joe
Vellinga joeVellinga joe
Vellinga joe
 
Trahan stuart
Trahan stuartTrahan stuart
Trahan stuart
 
Stock gahm
Stock gahmStock gahm
Stock gahm
 
Smalley sandra
Smalley sandraSmalley sandra
Smalley sandra
 
Seftas krage
Seftas krageSeftas krage
Seftas krage
 
Sampietro marco
Sampietro marcoSampietro marco
Sampietro marco
 
Rudolphi mike
Rudolphi mikeRudolphi mike
Rudolphi mike
 
Roberts karlene
Roberts karleneRoberts karlene
Roberts karlene
 
Rackley mike
Rackley mikeRackley mike
Rackley mike
 
Paradis william
Paradis williamParadis william
Paradis william
 
Osterkamp jeff
Osterkamp jeffOsterkamp jeff
Osterkamp jeff
 
O'keefe william
O'keefe williamO'keefe william
O'keefe william
 
Muller ralf
Muller ralfMuller ralf
Muller ralf
 
Mulenburg jerry
Mulenburg jerryMulenburg jerry
Mulenburg jerry
 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 

Sean carter dan_deans

  • 1. 11 Sean Carter, NASA JSC Daniel Deans, ManTech SRS Technologies Constellation Reliability Engineering Process – Optimizing CxP Risk Used with Permission
  • 2. 2 DFRAM Overview  Why does reliability engineering exist?  How does it fit within the life cycle?  Success space vs. failure space  Partnership on system engineering team  The value of “designing-out” failure modes  Where does it fit in the lifecycle?  What are some of the tools?  How are they applied?  Real examples 2
  • 3. 3  Failure is not an option…  A design engineer does not know what he does not know  An extra set of eyes and ears is always good  You have to spend money to make money  Mr. Murphy tends to rear his ugly head when you are not expecting it…  What all this means is: You have to work at it – nothing worth accomplishing comes easy  Reliability engineering is a discipline that adds value to the systems engineering process! 3 Reliability Engineering Value - Clichés
  • 6. 66 The Life Cycle Approach  Reliability is best designed-in; it is, for the most part, not:  Analyzed in  Tested in  Operated in  Successful reliability performance begins with a diligent, intentional approach at the very beginning of a project  Pre-phase A: requirements  Phase A: allocation; plan; resources  Phase B: analysis, design input, preliminary design review  Phase C: detailed design inputs; more analysis; trade studies; design verification; critical design review  Phase D: test planning, test readiness, manufacturing, final validation; flight readiness review  Phase E/F: ops, growth, disposal and lessons learned System EngineeringSystem Engineering Test and AssessmentTest and Assessment Element Integration & Test System Integration Test System Element Data Reduction and Assessment System Concept Exploration Preliminary Design Design Synthesis Component Fabrication, Assembly, Integrate, & Test Requirements Compliance Configuration Management Project Direction, Control, & Planning Risk Management System Analysis Project Direction and Control Project Direction and Control • System, Element, Subsystem Models • System Performance Analyses • Specifications • Verification • Management Plan • Budget Development & Control • Project Plan Development • Schedule Development & Control • Design Data Base • Problem/Failure Reports (PFR) • Engineering Change Orders • Risk Planning • Risk Assessment • Risk Handling/Mitigation • Risk Monitoring
  • 7. 77 Success Space vs. Failure Space  A design engineer thinks in success space (typically)  How will the widget work?  When it is designed, what function will it perform?  What are the performance requirements?  Reliability engineer paid to think in failure space  How will the widget fail?  What about the operating environment will cause issues?  What materials, processes, and tools will accentuate failure modes?  Is redundancy required  Are there operational work-arounds?  How will faults propagate through the system?  What are the effects of a failure mode on the mission  Superimpose the two processes, you get success!
  • 8. 88 Credibility: Partnership on System Engineering Team  Safety and Mission Assurance organization provides discipline experts to support design teams  Our job is to serve; not to inhibit  We help the system engineering teams identify hazards and failure modes and design them out  Our sole reason for existing is to ensure project/program success and to reduce/eliminate operational risk  We are partners for success  The aim in partnership is to duplicate our knowledge in the collective heads of our design-team partners
  • 9. 9 The Value of “Designing-Out” Failure Modes  A failure mode is an obstacle to mission success  Not all may cause mission failure, but, any failure of a component has potential  In the commercial world, a failure in the field costs 10 times what it costs to mitigate in the design process  In the space business, a failure can and will cost the mission and quite possibly endanger people  Identifying and designing-out failure modes is important! 9Company Confidential
  • 10. 1010 How Do We Design Out Failure Modes?  Methodical process; starts in pre-phase A, follows the lifecycle.  DMEDI – Define, Measure, Explore, Develop, Implement (12 steps)  Define requirements  Allocate requirements  Plan activities and analysis, including test and verification  Collect data and develop data sources  Use RAM simulation, FMEA, FTA, worst case analysis, derating, proven design practices to drive the design  Support design reviews and require improvement  Verify and ensure that design will meet requirements  Plan and implement thorough testing  Finalize verification, ascertain flight readiness  Identify reliability growth opportunities once design is complete  Investigate and eliminate root causes to anomalies  Develop lessons learned, provide feedback to future engineering teams
  • 11. 11 Pre-Phase A Concept Development  Very important part of process – DFRAM starts here  Develop requirements that will optimize RAM for program/project  Requirements include availability, mean time to failure, fault tolerance, mean time to repair, time to replace  Import lessons learned from similar programs/systems  Collect similar system failure history data  Begin development of system model  Begin development of RAM Plan
  • 12. 12 Phase A: Preliminary Analysis  Refine requirements, negotiate allocations with design elements  Finalize RAM Plan and educate design team on process; what role reliability engineering team will fill  Continue to develop preliminary model; begin FMEAs, FTAs, Probabilistic assessments  Allocate requirements to lowest design-to level  Negotiate failure definitions, failure budgets with design teams  Identify initial critical items, compare with lessons learned from previous systems  Continue to identify data sources  Identify critical suppliers; begin to form partnerships
  • 13. 13 Phase B – Preliminary Design  Continue to build simulation (model) and add more details  Identify most effective analyses tools to use to drive design  Complete preliminary FMEA, FTA, PRA  Continue to develop supplier partnerships  Prepare for preliminary design review  Perform maintenance task analysis  Identify design improvement initiatives and optimize using simulation  Perform other sensitivity studies based on fault tolerance requirements  Begin developing and finalizing FRACAS, test plans, reliability growth strategy  Partner with designers to identify failure modes, design them out  Support concept of operations optimization
  • 14. 14 Phase C – Detailed Design  Perform detailed design analysis – PDR recovery  Focus on pareto items identified from analyses (Top 10)  Continue to develop and use RAM simulation, FMEA, FTA, etc. to design out failure modes  Use Con-Ops to develop operational work-arounds as failure mode mitigation  Finalize test plans –review for reliability success criteria  Audit suppliers, provide support for reliability improvement  Mitigate schedule risks  Finalize critical items, document for testing  Begin life testing of components and subsystems as feasible  Perform specialized analysis (sneaks, fault propagation)  Prepare for and support CDR
  • 15. 15 Phase D –Development  Finalize design - CDR recovery, cut into manufacturing  Finalize FMEAs, FTAs, Simulations, CILs  Support testing, root cause investigations and corrective action  Begin collection of failure and operational history data (upon first application of power)  Finalize reliability growth strategy  Develop and begin implementation of reliability-centered maintenance approach  Make “last minute” improvements based on test results  Identify lessons learned and document  Update Con-Ops with operational work- arounds for critical items
  • 16. 16 Phase E/F – Ops and Disposal  Continue to gather data, monitor operations for anomalies  Support failure analyses, root cause investigations  Implement reliability growth process, identify areas for growth, design solutions  Document lessons learned  Use simulation to validate reliability growth strategy, sensitivities  Update RAM Plan with lessons learned  Support system disposal via identification of reliability challenges to shutdown
  • 17. 17 What are the Tools?  Some of the tools that we use are:  Requirements allocation  RAM simulation/probabilistic risk assessment  FMEA/FMECA  Fault tree analysis (FTA)/event tree assessment  Parts stress analysis/derating  Detailed design analysis  Worst case analysis  Redundancy screens  Extensive testing and verification analysis  Reliability growth planning and implementation  Others….
  • 18. 18 Reliability and Maintainability Simulation  A very powerful process  Can help design out failure modes without cutting metal  Provides for the Pareto Principle (20/80)  Gives design team a tool for sensitivity analysis  Allows for trying many different scenarios  Helps to optimize the return on investment based on cost to improve curve $ Cost Reliability High rate of return KITC Area of diminishing return KITC = Point on Curve where rise becomes less than run (reliability improvement = rise, cost to improve = run)
  • 19. 19 Simulation Basics  Simulations are built based on the system architecture  Model provides for “RAM” characteristics of system  Input data includes failure rates, repair times, sparing information, logistics information, operational work- arounds  Simulation is run based on mission profiles  “Monte Carlo” methodology is used  Typically data is input using statistical distributions  Outputs are system availability and cutsets (and other failure “illuminators”)  Cutsets lead to sensitivity analyses which in turn can drive improvements (failure mode elimination)
  • 20. 20 RAM Simulation Example  Simulation is dynamic, not static analysis  Can provide much information about overall availability of system under many different sets of conditions  Today’s tools can include operational concepts and rules, optimization of spares (some automatic)  Requires specific input data
  • 21. 21 How Results are Used  Outputs of baseline simulations are verified and validated using expert elicitation  Once all agree that the simulation is in the “ballpark,” (do not get wrapped around the axle on the numbers; it is the gap elimination that provides the most value) – begin the sensitivity analyses  Identify opportunities for improvement, plug those back into the sim, ascertain value of improvements  Continue this process until gaps are eliminated or at least reduced.  This can include block improvement of overall component failure rates – get the suppliers in on the act (supplier partnerships)  Ensure data from simulation is used in the design process
  • 22. 22 Success Stories: NASA Instrument Design  Validation of proper installation of sample cup retaining springs on Sample Manipulation System to preclude workmanship failures. (single ring failure would result in loss of solid sample science)  Use of physics of failure methods to identify and eliminate, where possible, failure modes of Pyrolysis Oven.  Implementation of HiPot test for Wide Range Pump motor to eliminate workmanship related failures.  Identification of Hall Effect Device on actuators as possible Radiation Sensitive device. Subsequent testing validated suitability of device.  Identification of thermal switch on Gas Trap as Reliability Issue. Redesign produced higher Reliability solution.  FMEA of Gas Processing System provided justification for addition of limited redundancy.  Improved reliability of instrument by approximately 25% based in initial predictions.
  • 23. 23 Complex Space Systems Application  Predicated on effective requirements implementation  Detailed RAM Plan developed and implemented at Program Level  RAM requirements, RAM Plan flowed down to systems, elements of systems  System owners responsible for DFRAM, but program will facilitate and audit  Program level analyses including simulation, FMEA, PRA being performed  Verification and validation will be program level functions  PRA will be part of flight readiness decision  Software included in DFRAM activities (no longer black box)  System Engineering organization partnering with S&MA organization for RAM implementation 23
  • 24. 24 SUMMARY  Success of a system predicated on intentional implementation of DFRAM  It will not happen spontaneously  Must be married with the system engineering process  Program management must be disciples – will not work otherwise  It is always easier and more cost effective to do it right the first time  Implementation requires people skills and a service mentality 24