Fault Tree Analysis
Fault Tree Analysis
Table of Contents
Introduction ............................................................................................................... 3
Fault Tree Analysis ................................................................................................... 3
Basic Events ............................................................................................................ 4
Advantages ............................................................................................................. 4
Limitations .............................................................................................................. 4
Notation................................................................................................................... 5
General Procedure for Fault Tree Analysis........................................................ 6
Rules of Fault Tree Construction ......................................................................... 7
Considerations...................................................................................................... 11
Fault Tree Evaluation ............................................................................................. 12
Boolean Algebra ................................................................................................... 12
The OR Gate ....................................................................................................... 12
The AND Gate.................................................................................................... 12
Qualitative Analysis ............................................................................................ 14
Minimal Cut Sets ............................................................................................... 14
Criticality............................................................................................................ 14
Quantitative Analysis.......................................................................................... 16
Common-Cause Failures..................................................................................... 16
References................................................................................................................. 18
Introduction
there is a need to analyze all the possible failure mechanisms in complex
systems (e.g. nuclear power plants)
also perform probabilistic analyses for the expected rate of failures
estimate probabilities of events that are modelled as logical combinations
or logical outcomes of other random events
two main methods:
z fault tree analysis
z event tree analysis
decision trees also exist and are used in risk analysis (combines all
feasible alternatives, possible outcomes and their probabilities, monetary
consequences and utility evaluations)
other graphical methods include
z reliability block diagrams
z functional logic diagrams
z Failure Modes and Effects Analysis (FMEA)
the occurrence of a top event may or may not lead to a serious or adverse
consequence
the relative likelihood of a number of potential consequences will depend
on the conditions or subsequent events that follow
potential consequences can be systematically identified using an event
tree
Basic Events
Advantages
allow the use of reliable information on component failure and other basic
events to estimate the overall risk associated with new system designs for
which no historical data exists
simple to understand and easy to implement
qualitative descriptions of potential problems and combinations of events
causing specific problems of interest
quantitative estimates of failure frequencies and likelihoods, and relative
importances of various failure sequences and contributing events
lists of recommendations for reducing risks
quantitative evaluations of recommendation effectiveness
Limitations
difficult to conceive all possible scenarios leading to the top event
construction of fault trees for large systems can be tedious
correlations between basic events (e.g. failure of components belonging to
the same batch) are difficult to model and exact solutions to correlated
events do not exist
subjective decisions regarding the level of detail and completeness are
often necessary
Notation
Symbol Name Description
Primary Event Symbols
Circle Basic Event – a basic initiating fault requiring
no further development
Gate Symbols
OR Gate The union operation of events, i.e. the output
event occurs if (at least) one or more of the
inputs occur
AND Gate The intersection operation of events, i.e. the
output event occurs if and only if all the inputs
occur
INHIBIT The output event occurs if the (single) input
Gate event occurs in the presence of an enabling
condition (i.e. Conditioning Event (oval) drawn
to the right of the gate)
Transfer Symbols
Triangle-in Indicates that the tree is developed further
someplace else (e.g. another page)
Step 5. Solve the fault tree for the combinations of events contributing to
the top event.
Examine the fault tree model to identify all the possible
combinations of events and conditions that can cause the top event
of interest. A combination of events and conditions sufficient and
necessary to cause the top event is called a minimal cut set.
Rule 1. State the fault event as a fault, including the description and timing of
a fault condition at some particular time.
Include
(a) what the fault state of that system or component is,
(b) when that system or component is in the fault state.
Test the fault event by asking
(c) is it a fault?
(d) is the what-and-when portion included in the fault statement?
Rule 2. There are two basic types of fault statements, state-of-system and
state-of-component.
To continue the tree,
(a) if state-of-system fault statement, use Rule 3
(b) if state-of-component fault statement, use Rule 4
Example: (McCormick, 1981) Construct a fault tree for the simple electric
motor circuit shown below.
Switch Fuse
Power
Motor
Supply
Wire
Solution:
Step 1. Define the system of interest.
Need to identify
z Intended Functions
z Physical Boundaries (to avoid overlooking key elements of a system at
interfaces and penalizing a system by associating other equipment
with the subject of the study)
z Analytical Boundaries (to limit the level of analysis resolution, to
explicitly exclude certain types of events and conditions, such as
sabotage, from the analysis)
z Initial Conditions, (including equipment that is assumed to be out of
service initially, which affect the combinations of additional events
necessary to produce a specific system problem)
For this particular problem we have,
Intended Function – the motor is used for some (unknown)
purpose
Physical Boundaries – power supply
Analytical Boundaries – include all contributors in the above diagram
Initial Conditions – switch closed, motor on
Motor fails
to operate
Step 3. Construct the fault tree, starting from the top, i.e., define the treetop
structure. Identify the main contributing events, including all events and
scenarios that may cause the top event.
Step 4. Explore each branch in successive levels of detail, following the rules
of fault tree construction.
Motor fails
to operate
Defect No current
in motor to motor
3 4
Fuse
Switch Switch failure under
Fuse failure
fails opened normal
due to overload
open erroneously conditions
(open)
Fuse fails
5
open
Overload
in circuit
Wire Power
failure failure
(shorted) (surge)
Gate 1. One primary failure event is the failure of the motor itself (for
example, due to a wiring failure within the motor or loss of lubrication
to the bearings). This event is a basic event because no details of the
motor are given, therefore, the event cannot be developed further. The
other possibility is the event that no current is supplied to the motor.
Gate 2. The event “No current in motor” is the result of other events and
is therefore developed further. The lack of current to the motor can
result from a broken connection in any of the other four components in
the circuit, including the failure of the wire or power supply (basic
events), the switch being open, or failure of the fuse.
Gate 3. The open switch may be due to a basic failure of the switch, or the
event that the switch was opened erroneously. The erroneous opening
of the switch is due to human error, which could be developed further
into more basic events (i.e. operator is inexperienced, under stress,
etc.). However, due to insufficient information, the event is not
explored further. This purposely undeveloped event is therefore
denoted with the diamond symbol.
Gate 4. The fuse failure event may be caused by fuse failure under normal
conditions (primary failure) or due to overload from the circuit.
Gate 5. The secondary fuse failure can occur if the fuse does not open
every time an overload is present in the circuit (because all conditions
of an overload do not necessarily result in sufficient overcurrent to
open the fuse). This is why a conditional gate, denoted by the
hexagon, is used. The condition, i.e. “Fuse fails open” is placed in the
connecting oval, and the conditional gate is treated similarly to an
AND gate in subsequent tree analysis.
Considerations
construction of a fault tree is subjective
need to take into account
z Level of Detail – the number of basic events should be defined such
that the size of the tree is reasonable with respect to the scope of the
analysis
z Probability Assignment – need to stop development at the level
where probability or failure data is available
z Meaningfulness – the level of detail should be such that the basic and
undeveloped events correspond to the design aspects being analyzed
Boolean Algebra
fault trees describe the relationships between events using Boolean logic
a fault tree can be translated into an equivalent set of Boolean equations
The OR Gate
Represents the union of events at the gate. For event Q with two input events
A and B attached to the OR gate, the probability is obtained as
P (Q ) = P (A) + P (B ) − P (A ∩ B ) (1)
or
P (Q ) = P (A) + P (B ) − P (A)P (B | A) (2)
X ∩ (Y ∩ Z ) = (X ∩ Y ) ∩ Z X ⋅ (Y ⋅ Z ) = (X ⋅Y ) ⋅ Z Associative Law
X (YZ ) = (XY )Z
X ∪ (Y ∪ Z ) = (X ∪ Y ) ∪ Z X + (Y + Z ) = (X +Y ) + Z
X ∩ (Y ∪ Z ) = (X ∩ Y ) ∪ (X ∩ Z ) X ⋅ (Y + Z ) = X ⋅Y + X ⋅ Z Distributive Law
X (Y + Z ) = XY + XZ
X ∪ (Y ∩ Z ) = (X ∪Y ) ∩ (X ∪ Z ) X + (Y ⋅ Z ) = (X +Y ) ⋅ (X + Z )
X ∩X = X X ⋅X = X Idempotent Law
X ∪X = X X +X = X
X ∩ (X ∪ Y ) = X X ⋅ (X + Y ) = X Law of Absorption
X ∪ (X ∩ Y ) = X X + (X ⋅Y ) = X
X ∩X = φ X ⋅X = φ Complementation
X ∪X =Ω= I X +X = Ω= I
X ∩Y = X ∪Y X ⋅Y = X + Y de Morgan’s Rule
X ∪Y = X ∩Y X + Y = X ⋅Y
X ∪ (X ∩ Y ) = X ∪ Y X + (X ⋅Y ) = X + Y Other relationships
X ∩ (X ∪ Y ) = X ∩ Y X ⋅ (X + Y ) = X ⋅Y
Qualitative Analysis
used for identifying
critical events
z
potential system weaknesses
z
z best ways to reduce the risk associated with the top event
conducted using minimal cut sets
Criticality
the importance of each minimal cut set can be based on its relative
contribution to the total probability of the top event
the minimal cut sets for a fault tree can be used to identify the most critical
basic events using the following criteria
z minimal cut sets that include a small number of events represent less
redundancy and so have higher criticality
z events that appear on a large number of minimal cut sets are more
important because they can contribute to the top event in several
scenarios
must also consider the relative probabilities of each of the events
z the importance of each event can be based on the total probability of
the cut sets to which the basic event contributes (i.e. represents the
total probability of failure involving the event)
Example: Determine all the minimal cut sets for the small motor problem.
Solution:
Let T denote the top event
Let P denote primary events (circles)
Let G denote intermediate events (rectangles)
Let S denote undeveloped events (diamonds)
Let C denote conditioning events (ovals)
T
Therefore +
T = motor fails to operate
P1 = defect in motor G1
P1
P2 = wire failure (open)
P3 = power supply failure +
The top event, therefore, contains 6 single component minimum cut sets and
2 double component minimum cut sets.
Quantitative Analysis
calculate the probability of occurrence of the top event, given the fault
tree and the probability of occurrence of the basic events
common approaches include
zassume all basic events are independent
zassume the probabilities of basic events are small (rare-event
approximation)
if each basic event on the fault tree occurs only once, then the tree
branches emanating from any given gate in the tree are independent
the probability of the top event is given by the probability of the union of
the minimum cut sets
results in a conservative estimate of the probability of failure
Common-Cause Failures
Multiple failures originating from a common cause that fails the system.
Example: Calculate the probability of occurrence of the top event for the
simple motor example. Assume the probability of occurrence of each basic
event is equal to 0.01 and the probability of the event S1 “Switch opened
erroneously” is equal to 0.001. Also assume that the condition C1 “Fuse
fails open” has a probability of occurrence of 0.50.
Solution:
Using the same notation as before we are given
Event Description Probability
P1 Defect in motor 0.01
P2 Wire failure (open) 0.01
P3 Power supply failure 0.01
P4 Switch fails open 0.01
P5 Fuse failure under normal conditions (open) 0.01
P6 Wire failure (shorted) 0.01
P7 Power failure (surge) 0.01
S1 Switch opened erroneously 0.001
C1 Fuse fails open 0.50
The probability of intermediate events can be evaluated using the fault tree.
The probability of the top event is given by the union of the minimum cut sets
determined before as
0.061
0.01 0.051
+ +
x 0.50
0.02
0.01 0.01
References
McCormick, N.J. 1981. Reliability and Risk Analysis: Methods and Nuclear Power
Applications. Academic Press, New York.
United States Coast Guard. 2004. Risk-based Decision-making (RBDM)
Guidelines (3rd Ed.), Vol. 3 – Risk Assessment Tools Reference, Chapter 9 –
Fault Tree Analysis.
(http://www.uscg.mil/hq/g-m/risk/E-Guidelines/RBDMGuide.htm)
United States Nuclear Regulatory Commission (NRC). 1981. Fault Tree
Handbook. NUREG-0492. Systems and Reliability Research Office of
Nuclear Regulatory Research, Washington, D.C.
(http://www.nrc.gov/reading-rm/doc-collections/nuregs/staff/sr0492/sr0492.pdf)