Domain Testing
Domain Testing
This unit gives an indepth overview of domain testing and its implementation.
● INTRODUCTION:
o Domain:In mathematics, domain is a set of possible values of an
independant variable or the variables of a function.
o Programs as input data classifiers: domain testing attempts to determine
whether the classification is or is not correct.
o Domain testing can be based on specifications or equivalent
implementation information.
o If domain testing is based on specifications, it is a functional test
technique.
o If domain testing is based implementation details, it is a structural test
technique.
o For example, you're doing domain testing when you check extreme values
of an input variable.
All inputs to a program can be considered as if they are numbers. For example, a
character string can be treated as a number by concatenating bits and looking at
them as if they were a binary integer. This is the view in domain testing, which is
why this strategy has a mathematical flavor.
o Before doing whatever it does, a routine must classify the input and set it
moving on the right path.
o An invalid input (e.g., value too big) is just a special processing case
called 'reject'.
o The input then passses to a hypothetical subroutine rather than on
calculations.
o In domain testing, we focus on the classification aspect of the routine
rather than on the calculations.
o Structural knowledge is not needed for this model - only a consistent,
complete specification of input values for each case.
o We can infer that for each case there must be atleast one path to process
that case.
● A DOMAIN IS A SET:
o An input domain is a set.
o If the source language supports set definitions (E.g. PASCAL set types
and C enumerated types) less testing is needed because the compiler
does much of it for us.
o Domain testing does not work well with arbitrary discrete sets of data
objects.
o Domain for a loop-free program corresponds to a set of numbers defined
over the input vector.
● DOMAINS, PATHS AND PREDICATES:
o In domain testing, predicates are assumed to be interpreted in terms of
input vector variables.
o If domain testing is applied to structure, then predicate interpretation must
be based on actual paths through the routine - that is, based on the
implementation control flowgraph.
o Conversely, if domain testing is applied to specifications, interpretation is
based on a specified data flowgraph for the routine; but usually, as is the
nature of specifications, no interpretation is needed because the domains
are specified directly.
o For every domain, there is at least one path through the routine.
o There may be more than one path if the domain consists of disconnected
parts or if the domain is defined by the union of two or more domains.
o Domains are defined their boundaries. Domain boundaries are also where
most domain bugs occur.
o For every boundary there is at least one predicate that specifies what
numbers belong to the domain and what numbers don't.
For example, in the statement IF x>0 THEN ALPHA ELSE BETA we know
that numbers greater than zero belong to ALPHA processing domain(s)
while zero and smaller numbers belong to BETA domain(s).
For example, if the predicate is x2 + y2 < 16, the domain is the inside of a
circle of radius 4 about the origin. Similarly, we could define a spherical
domain with one boundary but in three variables.
● A DOMAIN CLOSURE:
o A domain boundary is closed with respect to a domain if the points on the
boundary belong to the domain.
o If the boundary points belong to some other domain, the boundary is said
to be open.
o Figure 4.2 shows three situations for a one-dimensional domain - i.e., a
domain defined over one input variable; call it x
o The importance of domain closure is that incorrect closure bugs are
frequent domain bugs. For example, x >= 0 when x > 0 was intended.
● DOMAIN DIMENSIONALITY:
o Every input variable adds one dimension to the domain.
o One variable defines domains on a number line.
o Two variables define planar domains.
o Three variables define solid domains.
o Every new predicate slices through previously defined domains and cuts
them in half.
o Every boundary slices through the input vector space with a
dimensionality which is less than the dimensionality of the space.
o Thus, planes are cut by lines and points, volumes by planes, lines and
points and n-spaces by hyperplanes.
● BUG ASSUMPTION:
o The bug assumption for the domain testing is that processing is okay but
the domain definition is wrong.
o An incorrectly implemented domain means that boundaries are wrong,
which may in turn mean that control flow predicates are wrong.
o Many different bugs can result in domain errors. Some of them are:
Domain Errors:
▪ Double Zero Representation :In computers or Languages that
have a distinct positive and negative zero, boundary errors for
negative zero are common.
▪ Floating point zero check:A floating point number can equal zero
only if the previous definition of that number set it to zero or if it is
subtracted from it self or multiplied by zero. So the floating point
zero check to be done against a epsilon value.
▪ Contradictory domains:An implemented domain can never be
ambiguous or contradictory, but a specified domain can. A
contradictory domain specification means that at least two
supposedly distinct domains overlap.
▪ Ambiguous domains:Ambiguous domains means that union of
the domains is incomplete. That is there are missing domains or
holes in the specified domains. Not specifying what happens to
points on the domain boundary is a common ambiguity.
▪ Overspecified Domains:he domain can be overloaded with so
many conditions that the result is a null domain. Another way to put
it is to say that the domain's path is unachievable.
▪ Boundary Errors:Errors caused in and around the boundary of a
domain. Example, boundary closure bug, shifted, tilted, missing,
extra boundary.
▪ Closure Reversal:A common bug. The predicate is defined in
terms of >=. The programmer chooses to implement the logical
complement and incorrectly uses <= for the new predicate; i.e., x
>= 0 is incorrectly negated as x <= 0, thereby shifting boundary
values to adjacent domains.
▪ Faulty Logic:Compound predicates (especially) are subject to
faulty logic transformations and improper simplification. If the
predicates define domain boundaries, all kinds of domain bugs can
result from faulty logic manipulations.
● NICE DOMAINS:
o Where does these domains come from?
Domains are and will be defined by an imperfect iterative process aimed
at achieving (user, buyer, voter) satisfaction.
o Implemented domains can't be incomplete or inconsistent. Every input will
be processed (rejection is a process), possibly forever. Inconsistent
domains will be made consistent.
o Conversely, specified domains can be incomplete and/or inconsistent.
Incomplete in this context means that there are input vectors for which no
path is specified, and inconsistent means that there are at least two
contradictory specifications over the same segment of the input space.
o Some important properties of nice domains are: Linear, Complete,
Systematic, Orthogonal, Consistently closed, Convex and Simply
connected.
o To the extent that domains have these properties domain testing is easy
as testing gets.
o The bug frequency is lesser for nice domain than for ugly domains.
● SYSTEMATIC BOUNDARIES:
o Systematic boundary means that boundary inequalities related by a simple
function such as a constant.
o In Figure 4.3 for example, the domain boundaries for u and v differ only by
a constant. We want relations such as
where fi is an arbitrary linear function, X is the input vector, ki and c are
constants, and g(i,c) is a decent function over i and c that yields a
constant, such as k + ic.
o The first example is a set of parallel lines, and the second example is a set
of systematically (e.g., equally) spaced parallel lines (such as the spokes
of a wheel, if equally spaced in angles, systematic).
o If the boundaries are systematic and if you have one tied down and
generate tests for it, the tests for the rest of the boundaries in that set can
be automatically generated.
● ORTHOGONAL BOUNDARIES:
o Two boundary sets U and V (See Figure 4.3) are said to be orthogonal if
every inequality in V is perpendicular to every inequality in U.
o If two boundary sets are orthogonal, then they can be tested
independently
o In Figure 4.3 we have six boundaries in U and four in V. We can confirm
the boundary properties in a number of tests proportional to 6 + 4 = 10
(O(n)). If we tilt the boundaries to get Figure 4.5, we must now test the
intersections. We've gone from a linear number of cases to a quadratic:
from O(n) to O(n2).
Actually, there are two different but related orthogonality conditions. Sets
o
of boundaries can be orthogonal to one another but not orthogonal to the
coordinate axes (condition 1), or boundaries can be orthogonal to the
coordinate axes (condition 2).
● CLOSURE CONSISTENCY:
o Figure 4.6 shows another desirable domain property: boundary closures
are consistent and systematic.
o The shaded areas on the boundary denote that the boundary belongs to
the domain in which the shading lies - e.g., the boundary lines belong to
the domains on the right.
o Consistent closure means that there is a simple pattern to the closures -
for example, using the same relational operator for all boundaries of a set
of parallel boundaries.
● CONVEX:
o A geometric figure (in any number of dimensions) is convex if you can
take two arbitrary points on any two different boundaries, join them by a
line and all points on that line lie within the figure.
o Nice domains are convex; dirty domains aren't.
o You can smell a suspected concavity when you see phrases such as: ". . .
except if . . .," "However . . .," ". . . but not. . . ." In programming, it's often
the buts in the specification that kill you.
● SIMPLY CONNECTED:
o Nice domains are simply connected; that is, they are in one piece rather
than pieces all over the place interspersed with other domains.
o Simple connectivity is a weaker requirement than convexity; if a domain is
convex it is simply connected, but not vice versa.
o Consider domain boundaries defined by a compound predicate of the
(boolean) form ABC. Say that the input space is divided into two domains,
one defined by ABC and, therefore, the other defined by its negation .
o For example, suppose we define valid numbers as those lying between 10
and 17 inclusive. The invalid numbers are the disconnected domain
consisting of numbers less than 10 and greater than 17.
o Simple connectivity, especially for default cases, may be impossible.
● UGLY DOMAINS:
o Some domains are born ugly and some are uglified by bad specifications.
o Every simplification of ugly domains by programmers can be either good
or bad.
o Programmers in search of nice solutions will "simplify" essential
complexity out of existence. Testers in search of brilliant insights will be
blind to essential complexity and therefore miss important cases.
o If the ugliness results from bad specifications and the programmer's
simplification is harmless, then the programmer has made ugly good.
o But if the domain's complexity is essential (e.g., the income tax code),
such "simplifications" constitute bugs.
o Nonlinear boundaries are so rare in ordinary programming that there's no
information on how programmers might "correct" such boundaries if
they're essential.
● AMBIGUITIES AND CONTRADICTIONS:
o Domain ambiguities are holes in the input space.
o The holes may lie with in the domains or in cracks between domains.
Figure 4.7: Domain Ambiguities and Contradictions.
DOMAIN TESTING:
o Figure 4.12 shows generic domain bugs: closure bug, shifted boundaries,
tilted boundaries, extra boundary, missing boundary.
Figure 4.12: Generic Domain Bugs.
o In Figure 4.13a we assumed that the boundary was to be open for A. The
bug we're looking for is a closure error, which converts > to >= or < to <=
(Figure 4.13b). One test (marked x) on the boundary point detects this bug
because processing for that point will go to domain A rather than B.
o In Figure 4.13c we've suffered a boundary shift to the left. The test point
we used for closure detects this bug because the bug forces the point
from the B domain, where it should be, to A processing. Note that we can't
distinguish between a shift and a closure error, but we do know that we
have a bug.
o Figure 4.13d shows a shift the other way. The on point doesn't tell us
anything because the boundary shift doesn't change the fact that the test
point will be processed in B. To detect this shift we need a point close to
the boundary but within A. The boundary is open, therefore by definition,
the off point is in A (Open Off Inside).
o The same open off point also suffices to detect a missing boundary
because what should have been processed in A is now processed in B.
o To detect an extra boundary we have to look at two domain boundaries. In
this context an extra boundary means that A has been split in two. The
two off points that we selected before (one for each boundary) does the
job. If point C had been a closed boundary, the on test point at C would do
it.
o For closed domains look at Figure 4.14. As for the open boundary, a test
point on the boundary detects the closure bug. The rest of the cases are
similar to the open boundary, except now the strategy requires off points
just outside the domain.
DOMAIN AND INTERFACE TESTING
● INTRODUCTION:
o Recall that we defined integration testing as testing the correctness of the
interface between two otherwise correct components.
o Components A and B have been demonstrated to satisfy their component
tests, and as part of the act of integrating them we want to investigate
possible inconsistencies across their interface.
o Interface between any two components is considered as a subroutine call.
o We're looking for bugs in that "call" when we do interface testing.
o Let's assume that the call sequence is correct and that there are no type
incompatibilities.
o For a single variable, the domain span is the set of numbers between (and
including) the smallest value and the largest value. For every input
variable we want (at least): compatible domain spans and compatible
closures (Compatible but need not be Equal).
● DOMAINS AND RANGE:
o The set of output values produced by a function is called the range of the
function, in contrast with the domain, which is the set of input values over
which the function is defined.
o For most testing, our aim has been to specify input values and to predict
and/or confirm output values that result from those inputs.
o Interface testing requires that we select the output values of the calling
routine i.e. caller's range must be compatible with the called routine's
domain.
o An interface test consists of exploring the correctness of the following
mappings:
● CLOSURE COMPATIBILITY:
o Assume that the caller's range and the called domain spans the same
numbers - for example, 0 to 17.
o Figure 4.16 shows the four ways in which the caller's range closure and
the called's domain closure can agree.
o The thick line means closed and the thin line means open. Figure 4.16
shows the four cases consisting of domains that are closed both on top
(17) and bottom (0), open top and closed bottom, closed top and open
bottom, and open top and bottom.
o Figure 4.17 shows the twelve different ways the caller and the called can
disagree about closure. Not all of them are necessarily bugs. The four
cases in which a caller boundary is open and the called is closed (marked
with a "?") are probably not buggy. It means that the caller will not supply
such values but the called can accept them.
● SPAN COMPATIBILITY:
o Figure 4.18 shows three possibly harmless span incompatibilities.
Figure 4.18: Harmless Range / Domain Span incompatibility bug (Caller Span is
smaller than Called).
o In all cases, the caller's range is a subset of the called's domain. That's
not necessarily a bug.
o The routine is used by many callers; some require values inside a range
and some don't. This kind of span incompatibility is a bug only if the caller
expects the called routine to validate the called number for the caller.
o Figure 4.19a shows the opposite situation, in which the called routine's
domain has a smaller span than the caller expects. All of these examples
are buggy.
Figure 4.19: Buggy Range / Domain Mismatches
In Figure 4.19b the ranges and domains don't line up; hence good values
o
are rejected, bad values are accepted, and if the called routine isn't robust
enough, we have crashes.
o Figure 4.19c combines these notions to show various ways we can have
holes in the domain: these are all probably buggy.
● INTERFACE RANGE / DOMAIN COMPATIBILITY TESTING:
o For interface testing, bugs are more likely to concern single variables
rather than peculiar combinations of two or more variables.
o Test every input variable independently of other input variables to confirm
compatibility of the caller's range and the called routine's domain span and
closure of every domain defined for that variable.
o There are two boundaries to test and it's a one-dimensional domain;
therefore, it requires one on and one off point per boundary or a total of
two on points and two off points for the domain - pick the off points
appropriate to the closure (COOOOI).
o Start with the called routine's domains and generate test points in
accordance to the domain-testing strategy used for that routine in
component testing.
o Unless you're a mathematical whiz you won't be able to do this without
tools for more than one variable at a time.
SUMMARY:
1. Programs can be viewed as doing two different things: (a) classifying input
vectors into domains, and (b) doing the processing appropriate to the domain.
Domain testing focuses on the classification aspect and explores domain
correctness.
2. Domains are specified by the intersections of inequalities obtained by interpreting
predicates in terms of input variables. If domain testing is based on structure, the
interpretation is specific to the control-flow path through the set of predicates that
define the domain. If domain testing is based on specifications, the interpretation
is specific to the path through a specification data flowgraph.
3. Every domain boundary has a closure that specifies whether boundary points are
or are not in the domain. Closure verification is a big part of domain testing.
4. Almost all domain boundaries found in practice are based on linear inequalities.
Those that aren't can often be converted to linear inequalities by a suitable
linearization transformation.
5. Nice domains have the following properties: linear boundaries, boundaries that
extend from plus to minus infinity in all variables, have systematic inequality sets,
form orthogonal sets, have consistent closures, are convex, and create domains
that are all in one piece. Nice domains are easy to test because the boundaries
can be tested one at a time, independently of the other boundaries. If domains
aren't nice, examine the specifications to see whether they can be changed to
make the boundaries nice; often what's difficult about a boundary or domain is
arbitrary rather than based on real requirements.
6. As designers, guard against incorrect simplifications and transformations that
make essentially ugly domains nice. As testers, look for such transformations.
7. The general domain strategy for arbitrary convex, simply connected, linear
domains is based on testing at most (n + 1)p test points per domain, where n is
the dimension of the interpreted input space and p is the number of boundaries in
the domain. Of these, n points are on points and one is an off point. Remember
the definition of off point - COOOOI.
8. Real domains, especially if they have nice boundaries, can be tested in far less
than (n + 1)p points: as little as O(n).
9. Domain testing is easy for one dimension, difficult for two, and tool-intensive for
more than two. Beg, borrow, or build the tools before you attempt to apply
domain testing to the general situation. Finding the test points is a linear
programming problem for the general case and trivial for the nicest domains.
10. Domain testing is only one of many related partition testing methods which
includes more points, such as n-on + n-off, or extreme points and off-extreme
points. Extreme points are good because bugs tend to congregate there.
11. The domain-testing outlook is a productive tactic for integration interface testing.
Test range/domain compatibility between caller and called routines and all other
forms of intercomponent communications.