Honeywell Webinar QA Alarm Management Workflow
Honeywell Webinar QA Alarm Management Workflow
Webinar (Apr.24/2013)
Can you define HAZOP and LOPA? HAZOP stands for Hazard and Operability Study.
LOPA stands for Layers of Protection. The HAZOP is
used to identify major process hazard or operational
issues due to the process design. The LOPA is to
review the safety protection layers or safeguards to
mitigate against hazards in the process.
What does APD stand for? APD stands for Alarm Philosophy Document.
Do you do the alarm cleanup before introducing This is really dependant on the support for alarm
a workflow process? management in your facility. If support is low then I
suggest you start with alarm cleanup documenting all
changes, costs associated with these changes and the
improvement. I would also document feedback from
operators and if possible demonstrating how this
improvement potentially would have mitigated an
alarm flood from the last 1 or 2 incidents. If there is
support at your facility then introducing workflow
process right away will have an immediate impact. The
maintenance and operational meeting is to discuss 3 –
5 bad actors per week and assign responsibility to fix.
The next week you can report on the improvement as
listed above which reinforces that the process is
working and having a significant improvement.
Do you recommend tracking if employees are The purpose of a workflow is to minimize the effort of
following their assigned workflow process? maintaining the alarm management system and
maximize the benefit. Initially, employees may not see
the value and require tracking to enforce the
importance of the workflow process. The best
approach is to educate the employees on the benefit
of the workflow process.
Is there a different process to shelve an alarm This really depends on your computer control system.
or suppress it? The system will use a form of suppression to shelve
the alarm. The difference between a shelved and
suppressed alarm is that the shelved alarm is time
Honeywell
based for when it is reactivated while a suppressed
alarm will need operator intervention to un-suppress
the alarm. There could be a different process for
shelving versus suppressing but you could also use
the same process just indicate that the alarm is being
shelved or suppressed.
When do we use suppressed alarms? There are numerous reasons to suppress an alarm:
State or mode base change
Nuisance alarm
Equipment outage
Equipment out of service
Production change
Production trials
Process changes
The reason you suppress an alarm is to remove a non-
actionable alarm from the alarm graphic. These alarms
can create complacency or hide important alarms due
to the noise they create. If we expect an operator to
take an action for every alarm as defined in the APD
then we need to remove alarms that due to the
operating conditions do not require an action. The
easiest way to do this is by suppressing it. Make sure
that the alarm is un-suppressed when conditions are
back to normal.
I have a shutdown unit. How do I shelve bad This is called state based or modal alarm suppression.
value alarms at shutdown and normalize them My preference for these applications is to automate
when my unit starts? the suppression based on the operation such as:
How do you define a safety-related alarm? A safety-related alarm is an alarm that could lead to an
employee being injured if the issue is not resolved
quickly. Examples of these are: gas leaks (H2S), liquid
overflow (toxic), equipment damage, etc.
Honeywell
How do you identify nuisance alarms by Nuisance alarms are defined as alarms that
automatic reports? continuously activate when there is no actionable
response. The reports that I use to identify nuisance
alarms are:
Most Frequent Alarms
Chattering Alarms
Symptomatic Alarms
You said that a stale alarm is not required The reason I said that a stale alarm (one standing
because it has been standing for 24 hours, yet greater than 24 hours) is not required because the
our rationalization determined we needed it. issue the alarm identified for the operator to resolve
How do I handle that situation? did not occur after the time identified in the
rationalization. The way I approach these alarms is to
review the rationalization documentation verifying that
it is still correct then review with operations why this
issue was not resolved yet there was no incident. After
this review I will have determined the solution which
may include removing the alarm, changing the alarm
setpoint, creating a new alarm or incorporating a state
based alarm.
You indicated there is a difference between The first step is to define stale and standing alarms in
Stale and Standing alarms – how do we your APD. Then use your workflow process to deal
incorporate that difference in my plant? with these alarms.
Example:
Stale alarms are reviewed daily by the shift supervisor
and recommendations are defined to resolve them.
Standing alarms are reviewed by the operator and
shift supervisor at shift change, and reviewed by the
operator every hour.
Was the electronic shift log book part of The electronic shift log book is standalone but can also
Honeywell's Alarm software or stand alone? be offered as a suite (OM Pro) with the alarm
management software.
We have stale alarms mainly due to process If these alarms are only required during an operational
conditions, for example, low level alarms in state i.e. shutdown, startup, they can be suppressed
knock out drums which are always in alarm when not required. If these alarms are required to
state. Operations need the alarm, so how do indicate to the operator the level is low in the knock out
we handle these types of alarms? drum then I would modify the graphic instead to show
this and remove the alarm. The key to solving these
types of alarms (required but always in alarm state) is
to understand the purpose and consider the options to
resolve other than an alarm i.e. graphic changes, user
alerts. If the alarm is always in an activate state then it
is of no value because the operator is not responding
to it so there is no layer of protection.
Honeywell
Can you elaborate on how to differentiate As I indicated in the webinar any alarm that rings in is
between standing and stale alarms with live considered standing whereas a stale alarm is an alarm
examples? that is active for a period of time (i.e. greater than 24
hours). The point I was trying to make during the
Which alarm out of this has the highest priority webinar is that a standing alarm becomes reportable
to be taken care of? when it exceeds a reasonable amount of time and in
my opinion 24 hours it too long. The reason we want to
report and review standing alarms is because they
have not returned to normal and there is still the
potential that there could be an incident. If an alarm
can be active for greater than 24 hours without an
incident then an alarm was not required, it is labeled
stale and should be reviewed and possibly removed.
Example:
A tank has an overflow onto the flow at the 97% of the
height of the tank. The alarm is set to annunciate when
the level reaches 85%. The priority on this alarm is low
which means there is an expectation for the alarm to
return to normal within 30 minutes based on the risk
metrics. When the alarm annunciates the operator
chooses to reduce the rate of the flow into the tank,
however this action only results in slowing the rate of
increase in the tank. Eventually the tank overflows
onto the floor and there is an incident. The standing
alarm report would have indicated that this alarm had
exceeded the 30 minutes and needed further
investigation.
Honeywell
How do you calculate alarm KPI and how long The alarm KPIs need to be defined in your APD and
will it be optimum after the commissioning based on ISA 18.2, EEMUA 191, API 1167 or a
phase of project? corporate standard. The calculation is defined within
these documentations. The purpose of KPIs is to
indicate where the problem areas are and if we are
having a positive impact on improving them. I would be
monitoring them throughout commissioning and into
operation. During the commissioning phase the alarms
will be tested and these need to be identified and
removed from your KPI calculations.
What determines the frequency of alarm Besides new equipment, you need to consider having
rationalization, assuming no new equipment is a rationalization when the control strategy or process
added to the process? is changed.
How can we rationalize alarms for new The challenge with rationalizing new equipment is that
equipment? Is there any guideline? the operators have no operating experience on this
process. That being said it is important to rationalize
this equipment before it goes into service because the
operators will not be familiar with it and will need these
alarms to indicate issues. The process for rationalizing
new equipment is the same except you will require
other personnel to participate:
Vendor of supplied equipment
2 operators instead of one, with experience on
similar equipment and process
Process design engineer
We have experience rationalizing new equipment and
Greenfield and are happy to discuss our process with
you.
Can alarm rationalization be carried out if the Yes a rationalization can be held without an APD,
facility does not have an alarm philosophy however I would not recommend it. If you do not have
document? an APD then you will need to define the following:
Alarm definition
Risk matrices
Alarm types
Alarm design
Graphic design, as it relates to alarm
Honeywell
If you do not have these parameters defined prior to
the rationalization then the rationalization will take
longer and require re-doing parts of it as you define
these parameters. These parameters will need to be
incorporated into your APD once it is written.
We are starting the rationalization process in The ISA 18.2 or EEMUA 191 list starting values for
our refinery. Is there any standard guideline for deadband for measurements (i.e. flow, pressure, etc).
Deadband, Filtering and Time Delay for typical My preference is not to apply a global deadband,
loops? filtering or time delay unless it has been identified that
the measurements for the process are typically noisy. I
identify the alarms that require these techniques
through the chattering alarm report, or if the alarm
setpoint is close to the operating setpoint. The
measurement and results desired will determine
whether to use a deadband, filter or time delay.
If there are currently no deadbands or time-off No I would not apply deadbands, filtering or time
delays implemented, would you do that as a delays as a standard. I would only use them for the
standard with all alarms in parallel with the bad bad actors.
actor program?
Trying to resolve Bad PV alarms we are I am not sure what you mean by extended range. If
thinking to increase the extended range of you are suggesting increasing the engineering range
analogue inputs. Is there any standard on the DCS beyond the design on the instrument
concerning this parameter as a percentage of calibration then this will not solve your problem and will
the instrument range? instead create inaccuracies in your measurement.
Honeywell
How do you incorporate the Time delays/Dead In Experion PKS R410, we introduced the ability to
bands for a Bad PV Alarm in Honeywell’s configure on-delay timers and off-delay timers (along
Experion and TPS? Without using logics? with a deadband value for chattering alarms) for
controller-based alarms. See the example screenshot
of a DACA block (bottom of page 9).
Can you explain the relationship between The Alarm Configuration Management (ACM) is the
Master Boundary Data (MBD) and Alarm master alarm database. The software application
Configuration Management (ACM)? connects to the computer control system and stores
the alarm settings along with the rationalization data.
You showed a Operations Logbook screenshot, Operations logbook can gather data from different
how do we get the alarm data in OL? sources including alarm database for shift summary.
How can we target alarms on oplimit crossing? I am not sure what you are asking. Is this output limit
or operation limit? Are you looking to report on this
issue or what you need to do to resolve this issue?
UOP CCR alarm appears for every cycle of lock Firstly define “alarm” in the APD.
hopper filling and emptying. As per ISA18.2 this
Alarm: is a notification to the operator indicating a
cannot be categorized as an alarm but it is
condition change requiring operator intervention.
difficult to convince operations as it is UOP.
If there is no operator action then based on the
definition there is no alarm. When we have an alarm
system that contains alarms that do not require an
operator action then we are dependent on the operator
deciding if this alarm requires an action. These alarms
also build complacency and distract the operator from
focusing on the process.
Do plants with dedicated control room Success with alarm management is not based on
personnel have more success with alarm dedicated control room personnel but a dedicated
management? operating staff that are continuously reviewing and
improving the alarm system. I have seen success with
and without dedicated control room personnel.
Honeywell
I have a facility with thousands of alarms If you are getting thousands of alarms per operator
appearing every day. I did the AOA (Alarm then yes you need to continue with the AOA. PHA and
Objective Analysis). SOA are excellent for identifying risks but they do not
take into account the alarms as part of a process. The
On the other hand, I have PHA (Process AOA will identify duplicate and redundant alarms
Hazard Analysis) and SOA (System Outage which can be removed, reducing your alarm count.
Analysis) telling which alarm should be set. The process will result in proper alarm setpoints and
priorities. During the AOA you identify an alarm is not
Are the PHA and SOA enough to consider as required due to no operator action, redundant or
the alarm reference, or should I continue the duplicate, but if it was identified in the PHA or SOA
alarm rationalization using AOA? then before the alarm is removed this result will need
to be reviewed again by the PHA and SOA team. This
can create extra work and that’s why I suggest
including AOA functionality in other processes i.e.
HAZOP, LOPA, PHA etc.
In our last PHA (Process Hazard Analysis) The purpose of having procedures is to confirm
revalidation study, we found a bunch of controls everyone is following the requirements. The new
had been compromised. In a plant where new resources need to be educated on the APD and the
resources are a problem, the team decided on rationalization process prior to making any changes to
putting some alarm-based controls that were the alarms.
prioritized. How often should we redo the alarm
The prioritization is determined during rationalization
prioritization study?
and consider re-doing a rationalization based on:
Alarm rate is increasing
New equipment
New control strategy
Incidents are due to alarm issues
If your process is not changing, the bad actors are
being dealt with properly and the alarm rates are within
standards then you may never require a
rationalization.
Operations think the alarm problem at our This is a common issue that I have seen at numerous
facility is a maintenance issue which the control facilities. My experience is that when operations own
team must fix. You had several slides that the alarm issue then the facility is successful with their
involve their participation. Could you suggest alarm management program. To solve this problem we
how I get operations more involved? first must understand the reason why they believe the
alarm problem is a maintenance issue.
Honeywell
the day you show up at the garage to find your motor
sitting on the work bench in pieces and the mechanic
tells you that your car won’t be ready until next week
and the bill will be $4500. Would you accept this?
Then why would you give maintenance the
responsibility to fix the alarm problem when the
purpose of an alarm is to indicate to the operator that
the process or equipment operation has changed and
requires an action to correct.
Honeywell