EPMS: Process Analysis and Simulation
EPMS: Process Analysis and Simulation
There are three general approaches to process analysis and design: an analytical approach,
a computational approach, and a qualitative or knowledge-based approach. First, queue-
ing theory provides a framework to analyze business processes. As an analysis tool, it
has a benefit in that an analysis team can quickly build a model and obtain results. The
drawbacks are that queueing models can be mathematically complex, are approximate
for more complicated systems, and the results are only valid if the actual system being
studied matches the underlying assumptions of the queueing model. Second, within com-
putational approaches, there are three main simulation models: continuous simulation,
discrete-event simulation, and agent-based simulation. The benefits of simulation are that
a system of almost any complexity could, in theory, be modeled accurately. A drawback is
that simulation modeling often requires significant expertise and time to both develop a
model and to analyze it. Also, to obtain greater accuracy of the results requires the secure-
ment of more accurate data than what might be called for in analytical approaches. Third,
a knowledge-based approach uses rules or heuristics based on best practices to guide the
analysis. There are many heuristics that can be applied; it is through the knowledgeable
application of these rules that the process can be improved.
Since a business process is a set of activities, business processes can be modeled with
queuing theory. In business processes, a flow or job unit is routed from activity to activity
and, at each activity, some transformation is done to the job until the job finally departs
the process. Each activity of the process is performed by a resource, either of a human or
machine nature. If the resource is busy when the job arrives, then the job must wait in a
queue until the resource becomes available. The benefits of applying queueing theory to
analyze business processes is that, first, they provide the analyst with insight into the per-
formance of business processes, and, second, the performance analysis can be conducted
rapidly, allowing for fast generation of alternative process designs.
Analytical queuing models offer powerful means for understanding and evaluating
queuing processes. However, the use of these analytical models is somewhat restricted
by their underlying assumptions. The limitations pertain to the structure of the queuing
system, the way variability can be incorporated into the models, and the focus on steady-
state analysis. Because many business processes are cross-functional and characterized
by complex structures and variability patterns, a more flexible modeling tool is needed.
Simulation, discussed in the latter half of this chapter, offers this flexibility and represents
a powerful approach for analysis and quantitative evaluation of business processes.
Simulation is a technique that enables us to define and launch an imitation of the behav-
ior of a certain real system in order to analyze its functionality and performance in detail.
For this purpose, real-life input data is required and collected for use in running, observing
the system’s behavior over time, and conducting different experiments without disturb-
ing the functioning of the original system. One of the most important properties of the
simulation technique is to enable experts to carry out experiments on the behavior of a sys-
tem by generating various options of “what if” questions. This characteristic of simulation
gives a possibility for exploring ideas and creating different scenarios that are based on an
329
330 Enterprise Process Management Systems
understanding of the system’s operation and deep analysis of the simulation output results.
This actually represents simulation’s main advantage, which consequently has led to the
widespread use of the technique in various fields for both academic and practical purposes.
Process simulation can be the primary resource for copious amount of process data
under differing experimental conditions and parameters which can then be mined
and analyzed for its characteristics and patterns. Process mining and process analy-
sis are covered in several publications like W. M. P. van der Aalst (2002), M. Dumas,
M. La Rosa, J. Mendling, and H. Reijers (2013) and M. Laguna and J. Marklund (2013).
FIGURE 14.1
Total cost of queue operations versus process capacity.
service so that the total cost of the operations (waiting cost + service cost) can be optimized.
Figure 14.1 shows the schematic of the total cost of queue operations versus process capacity.
• The person decides to leave and not to join the queue because of the large num-
ber of people already in the queue.
• Renege, the person joins the queue, but, after some time, decides to leave the
queue.
• The person decides to wait in the queue regardless of the time spent waiting.
3. Queue configuration: This indicates the type of queue that the flow unit joins,
which determines the requirements to join a queue and the behavior of a flow unit
or customer who joins the queue. There are two types of queues, as follows:
• Single-line queue configuration requires that a flow unit joins the queue at the
end of a single line and the flow unit is served after all the flow units before it
are served.
• The multiple-line queue configuration enables the flow unit to choose one of
several queue lines.
4. Queue discipline: Queue discipline represents the rule or discipline used to choose
the next flow unit for serving; there are different disciplines used depending on the
purpose of the queuing system. The most commonly used rule is known as first-in-
first-out. Some queue disciplines also use priority of the flow unit to select the next to
be served. This rule is, for example, used in medical institutions, where patients with
life-threatening conditions or children have the highest priority to be served first.
5. Service mechanism: This consists of a number of services that perform a set of tasks
on the flow unit within a process. The flow unit enters the service facility when the
resource is available to provide the flow unit with the service it needs. The time spent
by the service in performing the work on the flow unit is called the service time.
• First-come-first-serve
• Service in random order
• Last-come-first-serve
EPMS for Business Process Analysis 333
If there are more queues, then the customers often join the queue where the length is
small. This is known as jockeying. Sometimes, the customers tend to move away from the
queue place upon seeing its length. This is known as balking. If the customers wait for a
long time in the queue, but have not been serviced, then they may move away. This is
known as reneging.
The field of queueing theory has developed a taxonomy to describe systems
based on their arrival process, service process, and number of servers, written
as arrival/service/number servers. The basic notation, widely used in queue-
ing theory, is composed of three symbols separated by forward slashes. The
values for the symbols are:
There are varieties of queuing models that arise from the elements of a queue that are
described next.
(λt)n e − λt
Pn (t) = , n = 0, 1, 2…
n!
This is a Poisson distribution with mean E{n/t} = λt arrivals during a period of time t.
(µt )N −n e −µt
Pn (t) = , n = 0, 1, 2… N
(N − n)!
334 Enterprise Process Management Systems
and
N
P0 (t) = 1 − ∑ P (t)
n =1
0
λ n is the arrival rate where there are n customers in the system already
µ n is the service rate where there are n customers in the system already
Pn is the steady-state probability of n customers in the system
All of the earlier steady-state probabilities help in determining the different parameters for
the model such as average queue length, waiting time in the system, and various measures
of system’s performance.
Then, with
λ λ …λ 0
Pn = n −1 n −2 P0 , n = 1, 2,…
µ nµ n −1 µ1
∑
∞
we can determine P0 from the equation P0 = 1.
n =0
For n = 0,
λ
P1 = 0 P0
µ1
For n = 1,
λ 0P0 + µ 2P2 = ( λ1 + µ1 ) P1
λ λ
P2 = 1 0 P0
µ 2µ 1
EPMS for Business Process Analysis 335
Assuming,
The formula list for the model (M/ M/ 1) : (GD/∞/∞) is given in the following:
1. P0 = 1 − (λ / µ ) = 1 − ρ
λ λ
2. Pn = 1 − = (ρ) n × (1 − ρ)
µ µ
3. Probability of queue size greater than n(Q ≥ n) = (ρ)n
λ
4. L s =
µ−λ
λ λ λ2
5. L q = × =
µ − λ µ µ(µ − λ )
λ λ 1 1
6. Ws = × Ls × =
µ−λ µ µ µ−λ
λ
7. Wq =
µ(µ − λ )
λ
8. L / L > 0 = L n =
µ−λ
1
9. Average waiting time in the nonempty queue =
µ−λ
10. Probability of an arrival waiting for t mins or more = ρe −(µ−λ ) t
λ
1− N
µ λ λ
1. ( a ) PN = × , ≠ 1, N = 0, 1, 2,…N
λ µ µ
1−
µ
(1 − ρ)
= × (ρ)N , ρ ≠ 1, N = 0, 1, 2,…N
(1 − ρ)N +1
EPMS for Business Process Analysis 337
1
Pn =
N +1
λ e = λ ( 1 − PN ) = µ ( L s − L q )
λ
1 − (N + 1)(ρ) + N(ρ)
N N +1
µ
3. (a) L s = , if ρ ≠ 1
(1 − ρ)(1 − ρ)N +1
(b) If ρ = 1, L s = N/2
λe λ((1 − PN )
4. Lq = Ls − = Ls =
µ µ
Ls
5. Ws =
λ(1 − PN )
Lq Lq 1
6. Wq = = = Ws −
λ e λ(1 − PN ) µ
ρn
Pn l = for 0 ≤ n ≤ c
n!
1.
ρn
Pn = P0 for n > c
Cn − c C !
−1
C −1 ρn C
∑
ρ
2. P0 = +
n = 0 n ! C ! 1 − ρ
c
C⋅ρ
3. L q = Pc
(C − ρ)2
4. L s = L q + ρ
Lq
5. Wq =
λ
1
6. Ws = Wq +
µ
/∞/∞)
Morse (1998) shows that for (M/M/C): (GD/
ρ ρ
Lq = as → 1
C−ρ C
nµ for 0 ≤ n ≤ N
µ n = Cµ for C ≤ n ≤ N
0 for n ≥ N
EPMS for Business Process Analysis 339
N Cnρn ⋅ P0 for 0 ≤ n ≤ C
1. Pn = n ! ρn
N Cn ⋅ P0 for C ≤ n ≤ N
C ! Cn −C
−1
C N
n ! ρn
2. P0 = ∑
n =0
n
N Cnρ + ∑
n = C +1
C ! Cn −C
N
3. L q = ∑ (n − C)P
n = c +1
n
λe
4. L s = L q +
µ
C
5. λ e = µ ( C − C1 ) , C1 = ∑(C − n)P = λ ( N − L )
n =0
n s
Ls
6. Ws =
λe
Lq
7. w q =
λe
14.3 Simulation
For queuing systems such as the M/M/C queue, the analytical models are well-developed
and we can predict their steady-state performance without too much difficulty. We made
many assumptions (i.e., Poisson arrival process) about the analytical queuing models in
the previous sections so as to simplify the problem so that a mathematical model could be
formulated. However, in the real world, the situation becomes more dynamic and compli-
cated than those mathematical models can handle; even though queuing network models
are available, many situations are simply beyond the capabilities of the analytical math-
ematical model.
When modeling these systems, information such as shift patterns, lunch breaks, machine
breakdowns, arrival rates, and so forth cannot be ignored, as they will have significant
impacts on system performance; also, some systems might never arrive at a steady state
and do not operate on a 24/7 basis. So, it is nearly impossible to study these types of sys-
tems using queuing theory and models; a theoretical solution for those queuing systems
would be difficult to obtain. An alternative to the mathematical model is to use the simula-
tion model instead.
Simulation is the imitation or representation of the behavior of some real thing, state
of affairs, or process. The act of simulating something generally entails representing cer-
tain key characteristics or behaviors of a selected physical or abstract system. To simulate
340 Enterprise Process Management Systems
means to mimic the reality in some way; simulation is a system that represents or emulates
the behavior of another system over time. More specifically, a simulation is the imitation of
the operation of a real-world process or system over time.
Simulation is a technique that enables us to define and launch an imitation of the behav-
ior of a certain real system in order to analyze its functionality and performance in detail.
For this purpose, real-life input data is required and collected for use in running, observ-
ing the system’s behavior over time, and conducting different experiments without dis-
turbing the functioning of the original system. One of the most important properties of
the simulation technique is to enable experts to carry out experiments on the behavior of
a system by generating various options of what-if questions. This characteristic of simula-
tion gives a possibility for exploring ideas and creating different scenarios that are based
on an understanding of the system’s operation and deep analysis of the simulation output
results. This actually represents simulation’s main advantage, which consequently has led
to the widespread use of the technique in various fields for both academic and practical
purposes.
Characteristics of simulation:
the system remains unchanged. Consequently, in a short period of time, a computer can
simulate a large number of events corresponding to a lengthy real-time span.
To illustrate the mechanics of a discrete-event simulation model, consider an informa-
tion desk with a single server. Assume that the objective of the simulation is to estimate the
average delay of a customer. The simulation then must have the following state variables:
As the simulation runs, two events can change the value of these state variables; these are:
• Arrival of a customer, which either changes the status of the server from idle to
busy or increases the number of customers in the queue
• Completion of service, which either changes the status of the server from busy to
idle or decreases the number of customers in the queue
A single-server queuing process can be represented with a timeline on which the time of
each event is marked (Figure 14.2).
Assuming the following notation:
FIGURE 14.2
Events timeline for a single server.
EPMS for Business Process Analysis 343
Figure 14.2 shows a graphical representation of the events in a single-server process. This
example has six events, starting with event 0 and finishing with event 5. Event 0, e0, is the
initialization of the simulation. Event 1, e1, is the arrival of the first job, with arrival time
equal to t1. The arrival of the second job occurs at time t2. Because c1 > t2, the second job
is going to experience a delay. The delay D2 is equal to the difference between c1 and t2
(D2 = c1 − t2).
Further in Figure 14.2, the completion time for job 1 is calculated as: c1 = t1 + S1, because
this jobdoes not experience any delay. The last event in this figure, labeled e5, is the comple-
tion time for job 2. In this case, the calculation of the completion time c2 includes the waiting
time D2(c2 = t2 + D2 + S2).
There are three main mathematical system formalisms distinguished by how they
treat time and data values; they are as follows:
Continuous systems have a large and powerful body of theory. Linear systems have
comprehensive analytical and numerical solution methods and an extensive theory of
estimation and control. Nonlinear systems are still incompletely understood, but many
numerical techniques are available, some analytical stability methods are known, and
practical control approaches are accessible. The very active field of dynamical systems
addresses nonlinear as well as control aspects of systems. Similar results are available
for sampled data systems. Computational frameworks exist for discrete-event systems
(based on state machines and Petri nets), but are less complete than those for differen-
tial or difference equation systems in their ability to determine stability and synthesize
control laws. A variety of simulation tools are available for all three types of systems.
Some tools attempt to integrate all three types into a single framework, though this is
difficult.
Many modern systems are a mixture of all three types. For example, consider a
computer-based temperature controller for a chemical process. The complete system may
include continuous plant dynamics, a sampled data system for control under normal con-
ditions, and discrete-event controller behavior associated with threshold crossings and
mode changes. A comprehensive and practical modern system theory should answer the
classic questions about such a mixed system—stability, closed-loop dynamics, and control
law synthesis. No such comprehensive theory exists, but constructing one is an objective
of current research.
344 Enterprise Process Management Systems
8. Experimental design: In the experimental design step, the simulation team pre-
pares different alternative scenarios for running the simulation process. These
scenarios are developed on the basis of a complete understanding of the behavior
of the system, generating different possible behavior possibilities of the system
by using what-if questions, and trying to implement ideas for achieving improve-
ments in the functioning of the system.
9. Runs and analysis: In this step, the simulation team deals with estimating and
analyzing the performance results of the simulation in the prepared scenarios of
the previous step. On the basis of the results of the previously completed simu-
lation runs, the team may determine the need for conducting more simulation
runs. New ideas may be considered in the context of making changes in the exist-
ing scenarios or new scenarios developed on the basis of the carefully analyzed
output data, leading to the performance of new simulation runs on the system
concerned.
T
CT =
1− r
T
r = 1−
CT
346 Enterprise Process Management Systems
Both CT and T can now be determined using the data of the event logs.
In some information systems, it might be easier to track repetition based on the assign-
ment of tasks to resources. One example is helpdesk ticketing systems that record which
resource is working on a case. Also, the logs of these systems offer insights into repetition.
A typical process supported with ticketing systems is incident resolution. For example, an
incident might be a call by a patient who complains that the online doctor’s appointment
booking system does not work. Such an incident is recorded by a dedicated participant—
for example, a call center agent. Then, it is forwarded to a first-level support team who tries
to solve the problem. In case the problem turns out to be too specific, it is forwarded to a
second-level support team with specialized knowledge in the problem domain.
In the best case, the problem is solved and the patient is notified accordingly. In the
undesirable case, the team identifies that the problem is within the competence area of
another team. This has the consequence of that the problem is rooted back to the first-level
team. Similar to the repetition of tasks, we now see that there is a repeated assignment of
the problem to the same team. Accordingly, log information can be used to determine how
likely it is that a problem is rooted back.
1. Dotted charts using the timestamp to plot an event: the dotted chart is a simple yet
powerful visualization tool for event logs. Each event is plotted on a two-dimensional
canvas, with the first axis representing its occurrence in time and the second axis rep-
resenting its association with a classifier such as a case ID. There are different options
to organize the first axis. Time can be represented either in a relative manner, such that
the first event is counted as zero, or in an absolute manner, such that later cases with
a later start event are further right in comparison to cases that began earlier. The sec-
ond axis can be sorted according to different criteria. For instance, cases can be shown
according to their historical order or their overall cycle time.
2. A timeline chart showing the duration of a task and its waiting time: the temporal
analysis of event logs can be enhanced with further details if a corresponding
process model is available and tasks can be related to a start and an end event. The
idea is to utilize the concept of token replay for identifying the point in time when
a task gets activated.
• For tasks in a sequence, the activation time is the point in time when the previ-
ous task was completed.
• For tasks after an AND-join, this is the point in time when all previous tasks
were completed.
• For XOR-joins and splits it is the point when one of the previous tasks completes.
EPMS for Business Process Analysis 347
Using this information, we can plot a task not as a dot but instead as a bar in a timeline
chart. A timeline chart shows a waiting time (from activation until starting) and a process-
ing time (from starting until completion) for each task. The timelines of each task can be
visualized in a similar way as a dot in the dotted chart. The timeline chart is more infor-
mative than the dotted chart, since it shows the duration of the tasks and also the waiting
times. Both pieces of information are a valuable input for quantitative process analysis.
When thousands of cases are available as a log, one can estimate the distribution of wait-
ing time and processing time of each task, and:
14.5 Summary
This chapter explained the rationale for modeling business processes with queuing theory.
In business processes, each activity of the process is performed by a resource (i.e., either a
human resource or machine resource); thus, if the resource is busy when the job arrives,
then the job will wait in a queue until the resource becomes available. The benefits of
applying queueing theory to analyze business processes is, first, they provide the analyst
348 Enterprise Process Management Systems
with insight into the performance of business processes and, second, the performance
analysis can be conducted rapidly, allowing for the fast generation of alternative process
designs. The second half of the chapter introduced simulation as a technique that enables
defining and experimenting in the context of the imitation of the behavior of a real system
in order to analyze its functionality and performance in greater detail. For this purpose,
real-life input data are required and collected for use in running and observing the sys-
tem’s behavior over time and conducting for different experiments without disturbing the
functioning of the original system.