A Sparse Matrix Approach To Reverse Mode Automatic Differentiation in Matlab

Shaun A. Forth
Applied Mathematics and Scientific Computing, Department of Engineering Systems and Management, Cranfield University, Shrivenham, Swindon, SN6 8LA, UK

Naveen Kr. Sharma
Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur 721302, West Bengal, India

Naveen Kr. Sharma gratefully acknowledges the support of a Summer Internship from the Department of Engineering Systems and Management, Cranfield University.
Bischof et al. [5] investigated forward mode Matlab AD by source transformation combined with storing and combining directional derivatives via an overloaded library. This hybrid approach gave a significant performance improvement over ADMAT. Source transformation permits the use of compile-time performance optimisations [6, 7]: forward substitution was found particularly effective. Kharche and Forth [8] investigated specialising their source transformation inlining of functions of MAD's fmad and derivvec classes in the cases of scalar and inactive variables. This was particularly beneficial for small problem sizes, for which overloading's run time requirements are dominated by the large relative cost of function call overheads and branching (required for code relevant to scalar and inactive variables) compared to arithmetic operations.
Our thesis is that automated program generation techniques, and specifically AD, should take advantage of the most efficient features of a target language; for example, MAD's derivvec class's exploitation [3] of the optimised sparse matrix features of Matlab [9]. As we recap in Sec. 2, it is well known that forward and reverse mode AD may be interpreted in terms of forward and back substitution on the sparse, so-called, extended Jacobian system [1, Chap. 9]. In this article we investigate whether we might use Matlab's sparse matrix features to effect reverse mode AD without recourse to the usual tape-based mechanisms [1, Chap. 6.1]. Our implementation, including optimised variants, is described in Sec. 3, and the performance testing of Sec. 4 demonstrates our approach's potential benefits. Conclusions and further work are presented in Sec. 5.
2 Matrix Interpretation of Automatic Differentiation
Following Griewank and Walther [1], we consider a function $f : \mathbb{R}^n \to \mathbb{R}^m$ of the form,
$$y = f(x), \qquad (1)$$
in which $x \in \mathbb{R}^n$ and $y \in \mathbb{R}^m$. The evaluation of $f(x)$ is assumed to comprise a sequence of assignments to internal variables $v_i$,
$$v_i = x_i, \qquad i = 1, \ldots, n, \qquad (2)$$
$$v_i = \varphi_i\big((v_j)_{j \prec i}\big), \qquad i = n+1, \ldots, n+p, \qquad (3)$$
$$y_{i-n-p} = v_i = \varphi_i\big((v_j)_{j \prec i}\big), \qquad i = n+p+1, \ldots, n+p+m. \qquad (4)$$
In (2) we copy the independent variables $x$ to internal variables $v_1, \ldots, v_n$. In (3) each $v_i$, $i = n+1, \ldots, n+p$ is obtained as a result of $p$ successive elementary operations or elementary functions $\varphi_i$ (e.g., additions, multiplications, square roots, cosines, etc.), acting on a small number of already calculated $v_j$, $j \prec i$. In (4) the dependent variables $y_i = v_{n+p+i}$, $i = 1, \ldots, m$, are assigned to complete the function evaluation.
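As a brief illustration (ours, not taken from [1]), for $f(x) = x_1 \sin(x_2)$, with $n = 2$, $p = 2$ and $m = 1$, one possible trace is,
$$
\begin{aligned}
v_1 &= x_1, & v_2 &= x_2,\\
v_3 &= \sin(v_2), & v_4 &= v_1 v_3,\\
y_1 &= v_5 = v_4.
\end{aligned}
$$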
We define the gradient operator $\nabla = \left(\partial/\partial x_1, \ldots, \partial/\partial x_n\right)^T$, differentiate (2)-(4) and arrange the resulting equations for the $\nabla v_i$ as a linear system,
$$
\begin{pmatrix} I & 0 & 0 \\ -B & I-L & 0 \\ -R & -T & I \end{pmatrix}
\begin{pmatrix} \nabla V_{1,\ldots,n} \\ \nabla V_{n+1,\ldots,n+p} \\ \nabla V_{n+p+1,\ldots,n+p+m} \end{pmatrix}
=
\begin{pmatrix} I \\ 0 \\ 0 \end{pmatrix}. \qquad (5)
$$
In (5): $I$ denotes an identity matrix of appropriate dimension and
$$\nabla V_{1,\ldots,n} = \begin{pmatrix} \nabla^T v_1 \\ \vdots \\ \nabla^T v_n \end{pmatrix},$$
with $\nabla V_{n+1,\ldots,n+p}$ and $\nabla V_{n+p+1,\ldots,n+p+m}$ defined similarly; from (3) the $p \times n$ matrix $B$ and $p \times p$ strictly lower triangular matrix $L$ are both sparse and contain partial derivatives of elementary operations and assignments; from (4) the $m \times n$ matrix $R$ and $m \times p$ matrix $T$ are such that $[R\ T]$ contains exactly one unit entry per row. The $(n+p+m) \times (n+p+m)$ coefficient matrix in (5) is known as the extended Jacobian, denoted $C \in \mathbb{R}^{(n+p+m)\times(n+p+m)}$, and has sub-diagonal entries $c_{ij}$.
By forward substitution on (5) we see that the system Jacobian $J = \nabla y = \nabla V_{n+p+1,\ldots,n+p+m}$ is given by,
$$J = R + T(I-L)^{-1}B, \qquad (6)$$
the Schur complement of $I-L$ in the coefficient matrix of (5). Two variants for calculating $J$ follow:

1. Forward variant: the intermediate gradients $\nabla V_{n+1,\ldots,n+p}$ and Jacobian $J$ are determined by,
$$(I-L)\,\nabla V_{n+1,\ldots,n+p} = B, \qquad (7)$$
$$J = R + T\,\nabla V_{n+1,\ldots,n+p}, \qquad (8)$$
i.e., forward substitution on the lower triangular system (7) followed by a matrix multiplication and addition (8).

2. Reverse variant: the $p \times m$ adjoint matrix $\bar{V}$ and Jacobian $J$ are determined by,
$$(I-L)^T\,\bar{V} = T^T, \qquad (9)$$
$$J = R + \bar{V}^T B, \qquad (10)$$
i.e., back-substitution on the upper triangular system (9) followed by matrix multiplication and addition (10).
The arithmetic cost of both variants is dominated by the solution of the linear systems (7) and (9), which share a common (though transposed in (9)) sparse coefficient matrix. These systems have $n$ and $m$ right-hand sides respectively, giving the rule of thumb that the forward variant is likely to be preferred for $n < m$ and the reverse for $m < n$ (see [1, p. 189] for a counter example). Since the matrices $B$, $R$ and $T$ in (7) to (10) are also sparse, further reductions in arithmetic cost might be obtained by storing and manipulating the $\nabla V_{n+1,\ldots,n+p}$ or $\bar{V}$ as sparse matrices [1, Chap. 7].
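To make the two variants concrete, the following minimal Matlab sketch (our own illustration; the function name and the triplet arrays ii, jj, cc are hypothetical) recovers J from stored extended Jacobian entries:

  % Minimal sketch: recover J from extended Jacobian entries, where ii, jj
  % index each entry and cc holds the elementary partial derivatives
  % dphi_i/dv_j, for a trace with n independents, p intermediates, m outputs.
  function J = extjac_jacobian(ii, jj, cc, n, p, m)
    N = n + p + m;
    P = sparse(ii, jj, cc, N, N);    % strictly lower triangular partials
    B = P(n+1:n+p, 1:n);   L = P(n+1:n+p, n+1:n+p);
    R = P(n+p+1:N, 1:n);   T = P(n+p+1:N, n+1:n+p);
    if n <= m                        % forward variant, eqs. (7)-(8)
      J = R + T * ((speye(p) - L) \ B);
    else                             % reverse variant, eqs. (9)-(10)
      J = R + ((speye(p) - L)' \ T')' * B;
    end
  end

Matlab's backslash operator detects the triangular structure of the sparse systems and applies forward or back substitution accordingly.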
Other approaches to reducing arithmetic cost are based on performing row operations to reduce the number of entries in (5) or, indeed, to entirely eliminate some rows [1, Chaps. 9-10]. Such approaches have been generalised by Naumann [10] and may be very efficient if performed at compilation time by a source transformation AD tool [11]. In Sec. 3.3 we will adapt Bischof's hoisting technique [12] to reduce the size and number of entries of the extended Jacobian, reducing both memory and runtime costs. Unlike Bischof's, ours is a runtime approach more akin to Christianson et al.'s [13] dirty vectors.
3 Three Implementations
In Secs. 3.2-3.4 we describe our three closely related overloaded classes designed to generate the extended Jacobian's entries as the user's function is executed. First, however, we introduce the MADExtJacStore class, objects of which are used by all three overloaded classes to store the extended Jacobian entries.
3.1 Extended Jacobian Storage
A MADExtJacStore object has components to store: the number of independent variables, the number of rows of the extended Jacobian for which entries have been determined, the number of entries determined, and a three-column matrix to store the row index $i$, column index $j$ and coefficient $c_{ij}$ of each entry.
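As an illustration, a minimal sketch of such a store follows; only the class name MADExtJacStore comes from the text, while the property and method names are our own assumptions and the actual implementation's interface may differ:

  % Illustrative sketch of an extended Jacobian entry store.
  classdef MADExtJacStore < handle
    properties
      n = 0                  % number of independent variables
      nrows = 0              % extended Jacobian rows determined so far
      nentries = 0           % number of entries determined so far
      entries = zeros(0, 3)  % one [i j c_ij] triple per entry
    end
    methods
      function obj = MADExtJacStore(n)
        obj.n = n;
        obj.nrows = n;       % rows 1..n form the identity block of (5)
      end
      function rows = newRows(obj, k)
        rows = obj.nrows + (1:k)';   % reserve k new rows
        obj.nrows = obj.nrows + k;
      end
      function addEntries(obj, i, j, c)
        obj.entries = [obj.entries; i(:) j(:) c(:)]; % no preallocation here
        obj.nentries = obj.nentries + numel(c);
      end
    end
  end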
$$
C \;=\;
\begin{pmatrix}
1 & & & & & & & & \\
& 1 & & & & & & & \\
& & 1 & & & & & & \\
& -2 & & 1 & & & & & \\
& & -2 & & 1 & & & & \\
-2 & & & -.5 & & 1 & & & \\
5 & & & & -.5 & & 1 & & \\
& & & & & -1 & & 1 & \\
& & & & & & -1 & & 1
\end{pmatrix}
\;\to\;
\begin{pmatrix}
1 & & & & & & & & \\
& 1 & & & & & & & \\
& & 1 & & & & & & \\
& -2 & & 1 & & & & & \\
& & -2 & & 1 & & & & \\
-2 & -1 & & & & 1 & & & \\
5 & & -1 & & & & 1 & & \\
& & & & & -1 & & 1 & \\
& & & & & & -1 & & 1
\end{pmatrix}
\qquad (15)
$$
In the extended Jacobian, rows 4 and 5 may now be removed from blocks $B$ and $L$, and columns 4 and 5 may be removed from blocks $L$ and $T$. This also eliminates two rows from the intermediate matrices $\nabla V_{n+1,\ldots,n+p}$ or $\bar{V}$, leaving the extended Jacobian as,
$$
C_H =
\begin{pmatrix}
1 & & & & & & \\
& 1 & & & & & \\
& & 1 & & & & \\
-2 & -1 & & 1 & & & \\
5 & & -1 & & 1 & & \\
& & & -1 & & 1 & \\
& & & & -1 & & 1
\end{pmatrix}. \qquad (16)
$$
We may now extract $B$, $L$, $R$ and $T$ from (16) and calculate the Jacobian $J$. Hoisting is an example of a safe pre-elimination which never increases the number of arithmetic operations [1, p. 212] but can drastically reduce both these and memory costs. In Matlab, hoisting may be applied to element-wise operations or functions with a single array argument (e.g., -x, sin(x), sqrt(x)) and element-wise binary operations or functions with one inactive argument (e.g., 2 + x, A .* x with A inactive). Hoisting is not applicable to matrix operations or functions (e.g., linear solve X \ Y or determinant det(X)).
We effect hoisting by a run-time mechanism, distinguishing our work from that of Bischof et al. [12]. We use a technique similar to that of Christianson et al. [13], who used it to reduce forward or back substitution costs on the full extended Jacobian: we instead reduce the size of the extended Jacobian.
Our hoisted class ExtJacMAD_H has one additional property compared to class ExtJacMAD: an accumulated extended Jacobian entry array Cij. When we initialise our ExtJacMAD_H object,
x = ExtJacMAD_H([0.5; 1; -2.5])
we assign x.Cij = 1, indicating that the derivatives of the elements of x are a multiple of one times those associated with x.row_index, i.e., rows 1 to 3. Step 1 of the overloaded operations of Sec. 3.2 is as before but with the additional copy tmp1.Cij = x.Cij. Step 2 differs more substantially: no additions are made to the extended Jacobian; we merely copy tmp2.row_index = tmp1.row_index and set tmp2.Cij = 2*tmp1.Cij. Step 3 is similar to the revised step 1. Step 4 is modified to account for the objects' accumulated extended Jacobian entries, so when dealing with the entries associated with tmp2 we have c = tmp2.Cij .* tmp3.value = 1 (expanded to [1 1]), and when dealing with tmp3 we have c = tmp3.Cij .* tmp2.value = 1 .* [2 -5] = [2 -5]. The assembled extended Jacobian then directly takes the form (16), and we see that the effects of hoisting have been mimicked at runtime. Note that the Cij component is maintained as a scalar whenever possible. Array values of Cij would be created if our example function's coding were y = sin(x(2:3)) .* x(1), as then tmp2.Cij = cos(tmp1) = [0.5403 -0.8011], though this would not prevent hoisting.
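For illustration, a minimal sketch (our own, not the package's actual source) of a hoisted overloaded elementary function for the ExtJacMAD_H class follows; as described above, no extended Jacobian rows are created:

  % In a file @ExtJacMAD_H/sin.m: hoisted element-wise unary function.
  function z = sin(x)
    z = x;                           % keep x.row_index unchanged
    z.value = sin(x.value);          % dispatches to the built-in sin
    z.Cij = cos(x.value) .* x.Cij;   % chain rule folded into Cij
  end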
3.4 Using Matlab's New Object Oriented Programming Style
Matlab release R2008a introduced substantially new object oriented programming styles and capabilities compared to those used by both our implementations of Secs. 3.2 and 3.3 and previous Matlab AD packages [2, 3]. Instead of the old style's definition of all an object's methods within separate source files in a common folder, in the new style all properties and methods are defined in a single source file.
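A minimal sketch of the new single-file style follows; the class name, properties and method body are illustrative assumptions based on the description above, not the package's actual source:

  % New (R2008a+) style: properties and all methods in one classdef file.
  classdef ExtJacMAD_NewStyle
    properties
      value      % function value
      row_index  % extended Jacobian rows of this object's elements
    end
    methods
      function obj = ExtJacMAD_NewStyle(val, rows)
        obj.value = val;
        obj.row_index = rows;
      end
      function z = uminus(x)
        % In the old style this overloading would live in its own file
        % @ExtJacMAD/uminus.m; here it sits beside the properties.
        z = ExtJacMAD_NewStyle(-x.value, x.row_index);
        % (recording of the extended Jacobian entries c_ij = -1 omitted)
      end
    end
  end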
4 Performance Testing
All tests involved calculating the gradient of a function from the MINPACK-2 test problem collection [14], with the original Fortran functions recoded into Matlab by replacing loops with array operations and array functions. We performed all tests for a range of problem sizes n; for each n, five different sets of independent variables x were used. For each derivative technique and each problem size the set of five derivative calculations was repeated sufficiently often that the cumulative cpu time exceeded 5 seconds. If a single set of five calculations exceeded 5 seconds cpu time then that technique was not used for larger n.
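This timing protocol may be sketched as follows, where grad_technique and the matrix X holding the five sets of independent variables are hypothetical stand-ins for each tested technique and data set:

  nsets = 5; total = 0; reps = 0;
  while total < 5                      % accumulate at least 5s of cpu time
    t0 = cputime;
    for k = 1:nsets
      g = grad_technique(X(:,k));      % one gradient evaluation per set
    end
    total = total + (cputime - t0);
    reps = reps + 1;
  end
  time_per_grad = total / (reps*nsets);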
All tests were performed using Matlab R2009b on a Windows XP SP3 PC with 2GB RAM. We first consider detailed results from one problem.
4.1 MINPACK-2 Optimal Design of Composites (ODC) test problem
Table 1 presents the ratio of gradient cpu time to function cpu time for the Optimal Design of Composites (ODC) problem for varying numbers of independent variables n, using the extended Jacobian techniques of Sec. 3. We also give run time ratios for a hand-coded adjoint, one-sided finite differences (FD) and sparse forward mode AD using MAD's fmad class. For these, and all other results presented, AD generated derivatives agreed with the hand-coded technique to within round-off; errors for FD were in line with the expected truncation error. Within our tables a dash indicates that memory or cpu time limits were exceeded.
Table 1: Gradient evaluation cpu time ratio cpu(∇f)/cpu(f) for the MINPACK-2 Optimal Design of Composites (ODC) test problem.
cpu(∇f)/cpu(f) for problem size n
Grad. Tech 25 100 2500 10000 40000
hand-coded 1.8 1.9 2.0 2.0 1.7
FD 26.2 102.8 2684.0 11147.2 -
sparse forward AD 66.0 55.8 134.7 - -
Extended Jacobian: Sec. 3.2
forward full 58.6 79.9 - - -
forward sparse 61.3 56.9 62.0 64.7 53.7
reverse full 57.9 51.4 42.1 42.0 35.4
reverse sparse 57.2 57.4 51.2 55.1 48.0
Extended Jacobian + Hoisting: Sec. 3.3
forward full 46.3 51.4 - - -
forward sparse 48.3 42.7 34.0 33.9 28.6
reverse full 47.0 41.0 24.3 23.1 19.9
reverse sparse 44.9 39.9 26.3 26.4 23.3
Extended Jacobian + Hoisting + New Object Orientation: Sec. 3.4
forward full 99.4 95.2 - - -
forward sparse 99.4 83.1 38.4 35.3 28.6
reverse full 100.8 88.6 31.0 26.4 21.0
reverse sparse 98.0 82.1 33.7 29.0 23.8
The hand-coded results show what might be achievable for a source transformation AD tool in Matlab. The FD cpu time ratio is in line with theory (approximately n + 1), but FD exceeded our maximum permitted run time for large n. Sparse forward mode AD outperformed FD with increasing n but exceeded our PC's 2 GB RAM for larger n.
The extended Jacobian approaches of Sec. 3.2, particularly the reverse variant with full storage, are seen to be competitive with, or to outperform, sparse forward AD, with the exception of the forward variant with full storage. Since m < n we expect the reverse variants to outperform the forward ones, and as m = 1 there is no point employing sparse storage. Employing the hoisting technique of Sec. 3.3 was always beneficial and for larger problem sizes halved the run time. This is because hoisting reduces the number of entries in the extended Jacobian by approximately 55% for all problem sizes tested.
Employing Matlab's new object oriented features, described in Sec. 3.4, had a strongly detrimental effect, doubling required cpu times compared to using the old features for small to moderate n. The two sets of run times converge for large n because the underlying Matlab built-in functions and operations are identical for both. Matlab's run time overheads must be significantly higher for the new approach and so dominate for small n.
4.2 MINPACK-2 mesh-based minimisation test problems
Table 2 presents selected results for the remaining mesh-based MINPACK-2 minimisation problems. We present only the most efficient, reverse full, variants of the extended Jacobian approach. Performance of the FD, hand-coded and new object oriented extended Jacobian approaches was in line with that for the ODC problem of Sec. 4.1.
From Table 2, only for the GL1 problem did sparse forward mode AD outperform the extended Jacobian approach. For all other problems the extended Jacobian approach with hoisting gave
Table 2: Gradient evaluation cpu time ratio cpu(∇f)/cpu(f) for the MINPACK-2 mesh-based minimisation test problems. Ext. Jac. indicates use of the reverse full variant of the extended Jacobian approach.
cpu(∇f)/cpu(f) for problem size n
Problem Grad. Tech 25 100 2500 10000 40000 160000
EPT sparse fwd. AD 102.1 99.6 341.3 - - -
Ext. Jac. 106.4 108.6 120.2 120.8 111.8 60.6
Ext. Jac. + Hoisting 93.5 98.3 91.2 91.9 86.0 46.1
GL1 sparse fwd. AD 138.0 93.1 13.4 7.5 6.6 7.3
Ext. Jac. 160.6 104.7 34.3 26.9 27.5 25.6
Ext. Jac. + Hoisting 115.5 85.6 23.2 18.0 17.9 16.6
MSA sparse fwd. AD 85.1 69.2 158.7 - - -
Ext. Jac. 82.4 71.3 54.0 60.3 66.0 -
Ext. Jac. + Hoisting 68.8 57.7 31.5 33.9 39.6 27.9
PJB sparse fwd. AD 74.7 79.4 - - - -
Ext. Jac. 61.5 69.7 131.8 71.6 68.5 -
Ext. Jac. + Hoisting 57.4 60.8 99.8 51.8 48.3 -
cpu(∇f)/cpu(f) for problem size n
Problem Grad. Tech 100 400 10000 40000 160000 640000
GL2 sparse fwd. AD 86.9 86.7 142.8 292.8 - -
Ext. Jac. 76.8 78.6 88.1 66.2 82.8 -
Ext. Jac. + Hoisting 58.5 59.1 53.3 39.4 50.1 -
equivalent, or substantially faster, performance and used less memory, allowing larger problem sizes to be addressed. Hoisting reduced the number of extended Jacobian entries by between 34% (EPT problem) and 49% (MSA problem), leading to significant performance benefits. In all cases, except perhaps the GL2 problem, the extended Jacobian with hoisting approach gave run time ratios decreasing with n for large n, in line with the cheap gradient principle [1, p. 88].
5 Conclusions and further work
Our extended Jacobian approach of Sec. 3 allowed us to use operator overloading to build a function's extended Jacobian before employing Matlab's sparse matrix operations to calculate the Jacobian itself. Bischof's hoisting technique [12] was adapted for run time use to reduce the size of the extended Jacobian. The performance testing of Sec. 4 shows that the reverse variant of our approach, with full storage of adjoints and hoisting, was substantially more efficient and able to cope with larger problem sizes than MAD's forward mode [3] for five of the six gradient test problems from the MINPACK-2 collection [14]. Performance is an order of magnitude worse than for a hand-coded adjoint due to the additional costs of overloaded function calls and of branching between multiple control flow paths in the overloaded functions. Additionally, the tailoring of the hand-coded adjoint's back-substitution to the sparsity of a particular source function's extended Jacobian will likely outperform our use of a general sparse solve.
Future work will address Jacobian problems. Equations (11)-(14) may be employed directly for this, together with compression [1, Chap. 8]. Compression requires Jacobian-matrix products, $JS = RS + T(I-L)^{-1}(BS)$, or matrix-Jacobian products, $WJ = WR + (WT)(I-L)^{-1}B$, where $S$ and $W$ are so-called seed matrices. Hessians can be computed by using the fmad class to differentiate the extended Jacobian computations: a forward-over-reverse strategy [1, Chap. 5].
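As a minimal sketch (our own code, not the paper's) of these compressed products, given the sparse blocks B, L, R and T of Sec. 2 and seed matrices S and W:

  JS = R*S + T*((speye(p) - L) \ (B*S));   % J*S, cf. (7)-(8)
  WJ = W*R + ((W*T) / (speye(p) - L)) * B; % W*J, cf. (9)-(10)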
The extended Jacobian approach is not suited to all problems. For example, the product of two $N \times N$ matrices would create some $2N^3$ extended Jacobian entries. Verma [2] noted that by working at the matrix, and not the element, level just the $2N^2$ elements of the two matrices are needed to enable reverse mode. An under-development tape-based AD implementation will shortly be compared with this article's extended Jacobian approach.
References
[1] A. Griewank, A. Walther, Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation, 2nd Edition, SIAM, Philadelphia, PA, 2008.
[2] A. Verma, ADMAT: Automatic differentiation in MATLAB using object oriented methods, in: M. E. Henderson, C. R. Anderson, S. L. Lyons (Eds.), Object Oriented Methods for Interoperable Scientific and Engineering Computing: Proceedings of the 1998 SIAM Workshop, SIAM, Philadelphia, 1999, pp. 174-183.
[3] S. A. Forth, An efficient overloaded implementation of forward mode automatic differentiation in MATLAB, ACM T. Math. Software 32 (2) (2006) 195-222. doi:10.1145/1141885.1141888.
[4] Cayuga Research Associates, LLC, ADMAT: Automatic Differentiation Toolbox for use with MATLAB. Version 2.0 (2008).
URL http://www.math.uwaterloo.ca/CandO Dept/securedDownloadsWhitelist/Manual.pdf
[5] C. H. Bischof, H. M. Bücker, B. Lang, A. Rasch, A. Vehreschild, Combining source transformation and operator overloading techniques to compute derivatives for MATLAB programs, in: Proceedings of the Second IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2002), IEEE Computer Society, Los Alamitos, CA, USA, 2002, pp. 65-72. doi:10.1109/SCAM.2002.1134106.
[6] C. H. Bischof, H. M. Bücker, A. Vehreschild, A macro language for derivative definition in ADiMat, in: H. M. Bücker, G. Corliss, P. Hovland, U. Naumann, B. Norris (Eds.), Automatic Differentiation: Applications, Theory, and Implementations, Lecture Notes in Computational Science and Engineering, Springer, 2005, pp. 181-188. doi:10.1007/3-540-28438-9_16.
[7] H. M. Bücker, M. Petera, A. Vehreschild, Code optimization techniques in source transformations for interpreted languages, in: C. H. Bischof, H. M. Bücker, P. D. Hovland, U. Naumann, J. Utke (Eds.), Advances in Automatic Differentiation, Springer, 2008, pp. 223-233. doi:10.1007/978-3-540-68942-3_20.
[8] R. V. Kharche, S. A. Forth, Source transformation for MATLAB automatic differentiation, in: V. N. Alexandrov, G. D. van Albada, P. M. A. Sloot, J. Dongarra (Eds.), Computational Science - ICCS 2006, Vol. 3994 of Lect. Notes Comput. Sc., Springer, Heidelberg, 2006, pp. 558-565. doi:10.1007/11758549_77.
[9] J. Gilbert, C. Moler, R. Schreiber, Sparse matrices in Matlab - design and implementation, SIAM J. Matrix Anal. Appl. 13 (1) (1992) 333-356.
[10] U. Naumann, Optimal accumulation of Jacobian matrices by elimination methods on the dual computational graph, Math. Program., Ser. A 99 (3) (2004) 399-421. doi:10.1007/s10107-003-0456-9.
[11] S. A. Forth, M. Tadjouddine, J. D. Pryce, J. K. Reid, Jacobian code generated by source transformation and vertex elimination can be as efficient as hand-coding, ACM T. Math. Software 30 (3) (2004) 266-299. doi:10.1145/1024074.1024076.
[12] C. H. Bischof, Issues in parallel automatic differentiation, in: A. Griewank, G. F. Corliss (Eds.), Automatic Differentiation of Algorithms: Theory, Implementation, and Application, SIAM, Philadelphia, PA, 1991, pp. 100-113.
[13] B. Christianson, L. C. W. Dixon, S. Brown, Sharing storage using dirty vectors, in: M. Berz, C. Bischof, G. Corliss, A. Griewank (Eds.), Computational Differentiation: Techniques, Applications, and Tools, SIAM, Philadelphia, PA, 1996, pp. 107-115.
[14] B. M. Averick, R. G. Carter, J. J. Moré, G.-L. Xue, The MINPACK-2 test problem collection, Preprint MCS-P153-0692, Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL (1992).