0% found this document useful (0 votes)
4 views102 pages

short_line_groups

This thesis by Pia Nandini Malaney explores the index number problem using differential geometry, proposing a new economic derivative that resolves inconsistencies in existing index number theories and aligns with the Divisia index. It also examines the welfare implications of the Divisia and Konus indices under changing preferences, demonstrating that the Divisia can serve as a revealed preference method for the Konus index. Additionally, the thesis discusses rural to urban migration, challenging traditional assumptions and suggesting that increased rural incomes may paradoxically lead to higher migration rates.

Uploaded by

Jackson
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views102 pages

short_line_groups

This thesis by Pia Nandini Malaney explores the index number problem using differential geometry, proposing a new economic derivative that resolves inconsistencies in existing index number theories and aligns with the Divisia index. It also examines the welfare implications of the Divisia and Konus indices under changing preferences, demonstrating that the Divisia can serve as a revealed preference method for the Konus index. Additionally, the thesis discusses rural to urban migration, challenging traditional assumptions and suggesting that increased rural incomes may paradoxically lead to higher migration rates.

Uploaded by

Jackson
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 102

The Index Number Problem: A

Differential Geometric Approach.

A thesis presented

by

Pia Nandini Malaney

to

The Department of Economics

in partial fulfillment of the requirements


for the degree of
Doctor of Philosophy
in the subject of

Economics

Harvard University
Cambridge, Massachusetts

December 1996
Copyright c 1996 by Pia Nandini Malaney
All rights reserved.
Abstract

The first part of this thesis looks at issues in index number theory. By
using techniques developed in differential geometry, it is shown that the so-
called index number problem can be resolved by the development of a special
economic derivative operator constructed for this purpose. This derivative
is shown to give rise to a unique differential geometric index number which
is then demonstrated to equal the Divisia index. It is then shown in the
second chapter (co-authored with Eric Weinstein) that when placed under
similar assumptions, this data based index equals the Konus index, previously
asserted to be incalculable in the absence of a knowledge of preference maps.
The third chapter deals with rural to urban migration. Based on the risk
neutral Todaro model, it has been suggested that raising rural incomes will
have the effect of controlling migration to urban labour markets. However, it
is shown in this chapter that it in the case of decreasing absolute risk aversion
it is possible for increasing income to have the perverse effect of increasing
the desirability of migration.
Contents

Acknowledgments 4

Introduction 6

1 The Index Number Problem: A Differential Geometric Ap-


proach 9
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.2 Adapted Derivatives and Differing Notions of Constancy . . . 14
1.3 The Differential Geometric Index . . . . . . . . . . . . . . . . 16
1.3.1 Notation and Introduction to the Index . . . . . . . . . 16
1.4 An Extended Example . . . . . . . . . . . . . . . . . . . . . . 23
1.5 Resolution Of the Index Number Problem . . . . . . . . . . . 27
1.5.1 Technical Proof of Equivalence of Bilateral Index Num-
bers . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
1.6 The Divisia Index . . . . . . . . . . . . . . . . . . . . . . . . . 37
1.6.1 Traditional Derivation of the Divisia . . . . . . . . . . 37
1.7 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
1.7.1 Extensions . . . . . . . . . . . . . . . . . . . . . . . . . 42
1.8 Appendix A: Differential Geometric Explanation: Notation,
Conventions and Tools from Gauge Theory . . . . . . . . . . . 43
1.8.1 Notation for Goods and Services . . . . . . . . . . . . . 43
1.8.2 Vector Bundles, Sections etc.... . . . . . . . . . . . . . 46
1.8.3 Discussion of Tangent vectors, Co-vectors and Covari-
ant Derivatives . . . . . . . . . . . . . . . . . . . . . . 47

2 Welfare Implications of Divisia Indices 49


2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
2.2 Welfare and Path Dependence . . . . . . . . . . . . . . . . . . 52

1
2.3 Implicit Assumptions Underlying a Cost-of-Living Index. . . . 53
2.3.1 Changing Preferences and Psychological Neu-
trality . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
2.4 The Konus Index under Changing Preferences . . . . . . . . . 63
2.5 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69

3 Household Migration Decisions Under Uncertainty 71


3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
3.2 ‘Income’ vs ‘Substitution’ Effects . . . . . . . . . . . . . . . . 74
3.3 A Risk Sensitive Household Migration Model. . . . . . . . . . 78
3.4 Perverse Migration Incentives . . . . . . . . . . . . . . . . . . 81
3.5 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
3.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
3.7 Appendix B: The Relationship Between the Rural Household
Labour Allocation Model and the Todaro Model . . . . . . . . 94

2
List of Figures

1.1 Vector subspaces in a two good economy. . . . . . . . . . . . . 19


1.2 Decomposition of a vector v into components. . . . . . . . . . 20

3.1 First Graphical Example of Perverse Migration . . . . . . . . 91


3.2 Second Graphical Example of Perverse Migration . . . . . . . 92

3
Acknowledgments

I would like to begin by thanking my advisor Eric Maskin for guidance and
assistance in the writing of this thesis. This dissertation has benefited from
his extraordinary breadth, precision and intellectual flexibility in guiding me
through such non-standard approaches as the application of geometry to
welfare considerations.
I would like also to thank Peter Timmer for his generous advice and
encouragement throughout this process. In particular, I took continuous
inspiration from his ability to focus my analysis on the core economic points,
even amidst the most seemingly arcane mathematics.
The author benefited from a research assistantship working with Prof.
Amartya Sen during the 1994-1995 academic year. Additionally, I have
found Sen’s discourse on index theory contained in the article “Welfare Basis
of Real Income Comparisons” (cited here as Sen [32]) to be invaluable as
a reference laying bare the otherwise implicit assumptions underlying the
theory of index numbers.
I am grateful to W. Erwin Diewert for his thoughtful input and for
freely sharing his comprehensive knowledge of index theory. Specifically,
I received invaluable information on the state of the literature concerning,
changing preferences, chaining arguments, and welfare discussions of the Di-
visia index, from a thorough letter which is cited here as Diewert [10]. Per-
haps the most helpful resource in understanding the background and history
of Index Number theory has proved to be his compilation of articles Essays
in Index Number Theory Vol. I (cited here as Diewert and Nakamura [11]).
Many of the ideas contained in this dissertation are the result of an ongo-
ing discussion, debate, and intellectual interchange with my co-author Eric
Weinstein.
Throughout the course of writing this thesis I received a great deal of
valuable input. In particular, I would like to thank Scott Axelrod, Dick

4
Goldman, Thomas Hellmann, Stephan Klasen, Thomas Sjostrom
and Isadore Singer, for helpful conversations. It is probable that this list
contains omissions; they are entirely unintentional.
To my parents Esther and Hira Malaney: You always expected the
best from me, and it is only in striving to give it to you that I have made
it this far. I am grateful to have had the opportunity to fulfill some of the
dreams which when placed out of your reach, you put within mine. Thank
you for all the love and support that have enabled me to achieve this.
I would like to dedicate this thesis to my intellectual inspiration and best
friend, Eric: I have learned so much more than differential geometry from
you along the way, and I know the journey continues to hold ever more.

5
Introduction

The theory of index numbers has been of interest to economists for over a cen-
tury. This interest has been rejuvenated in the 1990s as the U.S. government
grapples with the question of how to construct a true cost of living adjustment
(C.O.L.A.), or how to correct for the problems of the current consumer price
index (CPI). The inconsistencies between the hundreds of available algebraic
index numbers, and their uncertain effects on welfare (when implemented as
C.O.L.A) have led to extensive debate, and some pessimism about the ability
of economic theory to answer the fundamental questions lying at the core of
this issue.
The first chapter of this thesis takes a new approach to understanding
the question of what the “correct” index number is. We first observe that
all index numbers can be viewed as a combination of two components: an
algebraic formula and a notion of constancy. A general implicit choice has
been made using the ordinary calculus derivative as the notion of constancy.
By using techniques in differential geometry, we construct here an economic
derivative precisely to perform the separation of income and substitution
effects necessary for an economic index. It is shown that when the choice
of derivative is made explicit and the ordinary non-economic derivative is
replaced by the differential geometric ‘economic co-variant derivative’, the
discrepancy between all algebraic formulas for index numbers disappears,
thereby resolving the index number problem. The unique index resulting
from the use of the economic derivative in any of the previously developed
index number formulas is then shown to equal the Divisia index.
The second chapter looks at the welfare implications of the Divisia in-
dex based on the new understanding of its mathematical underpinnings. We
compare the welfare effects of the Divisia index with those of the theoretical
Konus index. While the Konus is referred to as the ‘true’ economic index, it is
asserted to be incalculable in the absence preference maps. The Divisia index

6
is, however, entirely data based, and therefore computable. In comparing the
two, however, it is found that there is a major difference in the assumptions
upon which these indexes have been developed. The Konus index makes the
implicit assumption of unchanging preferences. The Divisia makes no such
assumption. It is essential therefore, when comparing the two, to uniformise
the assumptions under which they are studied. It is shown in the first part
of this chapter that if an individual is compensated using the Divisia price
index as a cost of living adjustment, under constant preferences the Divisia
price index is in fact equal to the Konus price index. The Divisia therefore
provides us with a revealed preference method for finding the Konus index,
previously thought to be unknowable in the absence of knowledge on pref-
erence maps. This is in fact an indication that the path dependent nature
of the Divisia C.O.L.A. arises entirely from the failure of the assumption of
unchanging preferences in the real world. In the second part of the chapter
this unrealistic assumption is relaxed, and both indexes are analysed under
the assumption of changing preferences. In a situation of unchanging pref-
erences, two distinct Konus indexes are defined, the Laspeyres Konus or the
Paasche Konus, depending on whether the indifference curve picked is that
of the base time or the current time. Once preferences are allowed to change,
one must select not only between base and current time indifference curves,
but also base and current time preferences. We therefore define four Konus
price indexes, with the various combinations of base and current indifference
curves, and base and current preferences. It is proved that chaining these
indexes also gives equivalence with the Divisia.
It is therefore shown that when viewed in its natural mathematical set-
ting, the Divisia index not only resolves the consistency issues that have
heretofore plagued index number theory, but also answers the welfare con-
cerns that have spurred the deep and longlasting economic interest in the
issue.
The third chapter deals with issues of migration from rural to urban areas.
Since the seminal work of Todaro in the 1960s, there has been much interest
in the decision making process that leads to the migration of labour into
urban areas which often suffer from high unemployment. The conclusion of
the Todaro model was that the discrepancy in incomes between the low wage
rural labour market and the high wage urban market was sufficient that it
was possible for the present discounted value of the expected urban income to
outweigh that of rural income, even with a high probability of unemployment
in the urban market. The policy suggested by this conclusion has been to

7
reduce the discrepancy in incomes either by raising the level of rural wages or
by lowering the level of urban wages. In this chapter it is shown that these
two approaches may in fact not be equivalent. The Todaro model makes
two implicit assumptions which are unlikely to hold in reality. The first is
that an individual maximises expected income as opposed to expected utility.
This is equivalent to making the assumption of risk neutrality. It has been
shown that in general individuals are risk averse, especially at the low levels
of income being discussed here. The second assumption is that the decision
making unit is the individual. Here too much evidence has been accumulated
showing that migration decisions are often made at the level of the family
unit and not solely by the individual. In this chapter we relax these two
unrealistic assumptions. It is proved that when households facing migration
decisions are characterised by decreasing absolute risk aversion, it is in fact
possible for increasing rural wages to lead to increased migration to urban
areas. This ‘perverse’ migration arises because with the increased security of
higher wages for those family workers remaining in the rural labour market,
it is possible for the family to ‘gamble’ more of its members on the high wage-
low probability urban market. It is therefore essential that before using this
policy as tool for stemming rural to urban migration, one ensures that wages
are not within the range of perversity defined herein.

8
Chapter 1

The Index Number Problem: A


Differential Geometric
Approach

1.1 Introduction
“A man with one watch knows what time it is; a man with
two is never sure.” -Proverb

The primary diagnostic measures of an economy are price and quantity in-
dex numbers. The plethora of existing indexes can therefore leave economists
with an inability to evaluate accurately the most basic of our economic con-
cepts. The most widely used index numbers, the Paasche and Laspeyres, can
disagree not only in terms of the magnitude of change but also the direction.
These contradictions in qualitative results (with regards to growth versus
shrinkage or inflation versus deflation) caused by our index number problem,
puts us as economists in the embarrassing position of being doctors unable to
take a temperature without counseling the patient to get a second opinion.
In the words of W.E. Diewert:

“The relatively large divergence between the Paasche and


Laspeyres indexes when a fixed base year is used is a source for
some concern. In many national accounting systems (such as
Canada’s), the consumer price index is constructed using a fixed

9
base Laspeyres index, while the implicit deflator for the consump-
tion expenditures component of GNP is a fixed base Paasche In-
dex. Given the importance of consumer price indexes in indexing
wages and cost of living supplements, it is important that official
price indexes be consistent with each other.” -Diewert 1978 pp.
268

Or in the less academic record of Senate hearings on Social Security cost


of living adjustments,

“[Senator] Simpson stated ... Cost-of-living adjustments (CO-


LAs) create huge transfers of wealth and about 45 percent of Fed-
eral receipts are affected by the CPI. ... [Senator] Conrad stated
that what is at stake surrounding this issue is very significant –
$1.3 trillion over the next seven years have to be cut in order
to achieve a balanced budget. A 1 percent reduction in the CPI
would achieve $280 billion in savings, over 21 percent of what is
needed.” -Report of Hearing before the Senate Finance Commit-
tee, June 6, 1995.

This points out that in practical terms, the social welfare of those who
depend on cost of living adjustments is affected to a large extent by which of
several gauges is currently in favor. To put it simply, consistency issues are
pre-requisite to welfare analysis.
The fundamental nature of this problem has, over the last century, at-
tracted the attention of many economists. Starting in the late 1800s the quest
for the “true” index number led to the development of hundreds of different
formulas, each offering a different result. The inability to evaluate the rela-
tive validity of any of these formulae led to the development of what is known
as the test approach to index numbers. Since all index numbers agree in the
case of a single good, an attempt was made to formulate plausible properties
for index numbers in the one good case and test whether a proposed index
satisfies them for many goods. There were systematic attempts made by
Walsh, Fisher and other economists to develop tests to isolate the “true”
index number. However the people developing the tests, not coincidentally,
were often the people developing the indexes. Needless to say this lent an air
of relativism to the entire process. In the 1930’s, however, the test approach
showed up an even bigger problem: Frisch proved an impossibility theorem

10
showing that no bilateral index number could pass three basic tests1 . This
led to a pessimism as to the very existence of this “true” index that had been
quested.
An entirely different approach was developed by Konus in the 1920’s
which has come to be called the ‘economic’ approach to index numbers.
The Konus index, a somewhat theoretical concept, is based upon economic
optimisation under constraints. A Konus cost of living adjustment would
compensate (or penalise) a person by the amount it would cost him to con-
sume the least expensive basket of goods on his original indifference curve.
While this in some sense captures what we as economists would like to think
of as the “true” cost of living adjustment, there are several problems with
this index and its interpretation.
Undoubtedly, the most obvious practical shortcoming of the Konus in-
dex is the general unavailability of information on preference maps. While
this practical concern may limit its computation, there are even more chal-
lenging theoretical problems. One basic flaw is its unnatural assumption of
unchanging preferences, without which the answers it returns can be more
psychological than economic in nature2 . Finally we once again are left with
the decision of whether to use base or current year preferences, bringing us
back to the original problem of consistency.
As has been noted by Sen and others, it is of the utmost importance in
work on index theory to clearly identify the scope of inquiry and the implicit
assumptions present. The focus of this paper will be on those price and
quantity indices which are directly computable from time-dependent vectors
p(t), q(t) of price and quantity data for an individual or group between
two time periods t = 0, 1. We will refer to such indices as bilateral temporal
indices. Since the disagreement between these indices can reduce what should
be an objective measurement (viz. of inflation or growth) to a subjective
choice of measurement instrument, the most vexing problem in the field is the
consistency or so-called index number problem. The solution of this problem
may in turn be thought of as the first step in the application of data-based
indexes to the analysis of welfare. It is this problem to which this paper is
addressed. It should be stressed that this is a theoretical paper attempting
to further develop the logical foundations of index number theory.3
1
Frisch 1930
2
Sen (1979) addresses a broad range of issues underlying the assumption of unchanging
preferences.
3
For previous related results see: Richter 1966, Hulten 1973, Samuelson and Swamy

11
It is possible to look at any bilateral index number as a combination of two
ingredients: an algebraic formula and a notion of constancy. We claim that
given such a decomposition there is no inherent problem with the algebraic
formulas of any of the common bilateral index numbers. The problem, when
properly isolated, is seen to arise from a consistent implicit choice of an
incorrect notion of constancy.
As the field of economics shifted in the last century to incorporate devel-
opments in calculus, economists have adopted the standard notion of deriva-
tives: df = 0, as our notion of constancy. It was only later that mathe-
maticians themselves refined their concept of derivatives to allow for them
to be adapted to particular problems. The field of differential geometry was
developed around work with these “adapted derivatives” or “connections”.
By using differential geometry to adapt our notion of constancy to fit the
unique problem of index numbers, we are able to eradicate inconsistencies
between all common bilateral index numbers.
Updating the field of index numbers to incorporate these developments in
mathematics in this way enables us to cast age old issues into an entirely new
framework. Just as differential calculus was the framework within which to
think of optimisation, differential geometry is the correct framework within
which to think of index numbers. As with previous mathematical develop-
ments, this framework immediately helps clarify the questions we are asking.
Not only is it able to recover various scattered results from index theory in
a cohesive picture, it enables us to prove new and valuable results.
We give here a plan of this chapter. In section 2 we use a simple toy
example to show explicitly how a seemingly innocuous use of the ordinary
derivative must be altered in order to recover the economics of constant
purchasing power; this guiding analogy is intended to bring out the logical
necessity of the differential geometric approach. In section 3 we will de-
fine the differential geometric set-up and show how it produces a preferred
index which we refer to as the geometric index. A simplified 2-good econ-
omy is constructed in section 4 which exhibits the index number problem
for the Paasche and Laspeyres indexes. We show in detail how the some-
what abstract geometric set-up is used to carry out an explicit computation
producing agreement between the aforementioned offenders. In section 5 we
show that this framework actually leads to a resolution of the index number
problem for temporal bilateral indices. Section 6 contains a derivation and
1974, and Diewert and Nakamura 1993 and the references therein.

12
discussion of the Divisia index. In section 7 we summarize our results and
discuss the merits of path dependence in the computations of C.O.L.A.s;
we also discuss plans for future extensions. Finally, in Appendix A we give
some of the terminology from differential geometry underlying the previous
discussions.

13
1.2 Adapted Derivatives and Differing No-
tions of Constancy
Each index number can be viewed as a combination of two elements :

1. An algebraic formula

2. A notion of constancy.

What do we mean by a notion of constancy? Recall from differential calculus


df
that when a smooth function f : R −→ R satisfies dx = 0 we conclude that
the function is constant. This suggests that the derivative is at the heart
of our notion of constancy. Economists are comfortable with the use of
the ordinary derivative to express the notion of constancy. Since the days
of Newtonian calculus, however, mathematicians have discovered that this
is but one derivative amongst many. Each derivative represents a particular
notion of constancy. The use of any derivative therefore represents a choice of
the notion of constancy, and must be justified on the basis of the economic
question that is being asked. The incorrect choice can lead us to at best
wrong and at worst nonsensical economic answers.
Let us go back to the simple one good case to illustrate this. Consider a
person who buys only one good whose price at time t is P(t). Let us say he
has a salary S(t) which at time t = 0 is $100. Our standard derivative dS(t)dt
would tell us that his salary was constant if it stayed at $100 , even if the
price of the good were rising. Clearly as economists this is not the notion
of constancy we would want. It is a constant purchasing power salary that
we want to recognise as constant. That is, for any scalar m ∈ R we would
want a derivative which sees S(t) = mP (t) as ‘constant’. While in the one
good case it may be easy to find the answer we are looking for, we would like
to use this simple example to illustrate an important point: it is possible to
actually develop a derivative that sees the constant purchasing power salary
as being constant. Let us see how we would adapt our derivative in this
simple case to give us the correct notion of constancy.
Consider a derivative operator δ(·)
δt
that sends a salary with constant pur-
chasing power to zero.

δS(t) δmP (t)


= =0 (1.1)
δt δt

14
Let us posit:
δ(·) d(·)
= + g(t)(·). (1.2)
δt dt
With this ansatz let us solve for g(t):

δS(t) δP (t) dP (t)


0= =m = m( + g(t)P (t)) (1.3)
δt δt dt

1 dP (t) dln(P (t))


⇒ g(t) = − =− (1.4)
P (t) dt dt
δ d dln(P (t))
⇒ = − (1.5)
δt dt dt
This is our adapted derivative, i.e. The derivative which recognizes a constant
purchasing power salary as constant. We will call this adapted derivative a
covariant derivative or connection.
Thus we see that we have a choice between two different derivatives where
we may have believed that only one existed. It is plain to see that failure to
select the correct derivative leads to wrong economics. Fortunately, in the one
good case it is easy to recognise the problem and it can be corrected for in
several ways; this contrasts markedly with the more complicated situations.
Our claim in this paper is that the so-called “index number problem” arises
from a consistent improper choice made from an analogous pair of derivatives.
All bilateral index numbers implicitly choose to use the ordinary derivative as
their notion of constancy; what we propose to show in this paper, is that when
instead of the ordinary derivative, we use the derivative uniquely adapted to
our problem, the discrepancies between the various bilateral index numbers
disappear.

15
1.3 The Differential Geometric Index
1.3.1 Notation and Introduction to the Index
Let V n be a space of n goods and services under consideration. Let V ∗ be
the space of pricing systems of V n . (V and V ∗ can be thought of as column
and row vectors respectively.) Then if q ∈ V n and p ∈ V ∗ , we have

p(t) · q(t) = value of q in pricing system p at time t. (1.6)

Assume p(t) · q(t) 6= 0 ∀t, i.e our basket is never worthless. Any combi-
nation of goods and services w ∈ V can be uniquely described by a sum

w(t) = λ(t)q(t) + b(t) (1.7)


where p(t) · b(t) = 0. Here b(t) ∈ β(t) where β(t) is the space of barters at
time t, i.e. all baskets which our pricing system at time t sends to zero. Then
if w(t) = dv(t)
dt
= λ(t)q(t) + b(t) with λ(t) = 0 then v(t) is “changing without
growing” because v(t) changes only by barters, the values of which are 0 at
time t.
Exactly what we are looking for in an index is the freedom to allow change
without growth. Let us say we are trying to develop a price index to evaluate
inflation from biblical times, when people’s baskets included frankincense and
myrrh, to current times, when we are consuming computers and cars. Neither
the Laspeyres, which would evaluate frankincense in today’s prices, nor the
Paasche, which would attempt to evaluate computers in ancient prices, would
give us sensible answers.
What is needed is a derivative which correctly modernises or antiquates
the components of the basket of goods while holding its value constant. Thus
we seek a derivative which will answer the question ‘what would the basket
of frankincense and myrrh be if it were given as a gift in modern times’, or
put more crudely, ‘how many power rangers and smurfs would the wise men
have given to the infant Jesus?’
It is possible for us to directly translate the basket for one important
reason: at the point of time any new good is introduced, old ones are still in
existence. This enables us at any time to evaluate the price at which one will
be traded for the other, i.e. it gives us an ability to evaluate one basket in
terms of another. An approximation to this can be found through the process
of chaining indexes, but because of time gaps in chained baskets, the ability

16
to translate any particular basket is unavailable. For instance, the U.S. CPI
is a prominent representative example of a chained index whose links are
approximately 10 years in length. Rapid changes in fields such as computers
and the automotive industry mean that several generations of changes may
occur between recalibrations of the reference basket. Consider, for example,
the category referred to by the Bureau of Labor Statistics as “Information
Processing Equipment”; there has been only one recalibration (in 1982-84)
between the last slide rules and the modern lap tops of the 1990s. Obviously,
this kind of rough taxonomy is insufficient to deal with rapidly changing
components of the basket.
What the index should require is the ability to allow the composition of
the basket to change, while maintaining constant some notion of value. We
can maintain a “constant value” basket while allowing the composition of
the basket to change by allowing trade-offs in the basket at any time based
on the current market value of the trade-offs.
This can be accomplished mathematically using an adapted or covariant
derivative. The tools we use to develop this are:

1. Projections onto the space of goods [q(t)] and barters βp(t) i.e. v =
λ(t)q(t) + b(t)

2. Total differentials of vector valued functions

We will now develop the covariant derivative which will be uniquely


adapted to give us the desired price and quantity indexes. The following
discussion represents a translation of various differential geometric concepts
into the language of linear algebra and differential calculus. For a more
formal differential geometric exposition see Appendix A.
Let us start with some definitions:

Definition 1 We define [w] to be the subspace of a vector space V given by


all multiples of a vector w ∈ V .

Definition 2 Define C to be all points in V × V ∗ such that p · q = 0.

Definition 3 We define Ξ to be V × V ∗ − C.

17
For most realistic purposes, the baskets of goods under consideration will
have non-zero value. Thus confining ourselves to consideration of only those
baskets and pricing system represented by elements of Ξ ⊂ R2n will not
present a serious restriction.
Given this motivation we give the following definition.

Definition 4 An economic history is defined to be a curve α : I −→ Ξ


where I ⊂ R is a connected subset of the real line (such as [0, 1]).

Such a path is intended to represent the collection of points (p, q) ∈ V ∗ × V


which are traced out by an economy, enterprise or individual over the interval
I.
Given a point (q(t), p(t)) ∈ Ξ, we determine the following projection maps
Π : V −→ V and Π : V ∗ −→ V ∗

p(t) · v
Π[q(t)] (v) = ( )q(t) (1.8)
p(t) · q(t)

Πβp(t) (v) = v − Π[q(t)] (v) (1.9)

ω · q(t)
Π[p(t)] (ω) = · p(t) (1.10)
p(t) · q(t)

Πβ[v] (ω) = ω − Π[p(t)] (ω) (1.11)


where v ∈ V and ω ∈ V ∗ . So, for example, it is immediate that

v = Π[q(t)] (v) + Πβp(t) (v). (1.12)

In order to intuitively understand these projections and their function, we


will give an economic interpretation to the linear algebra. For the purposes
of visualisation and ease of exposition, we examine the case where n = 2.
Let q represent the original basket containing goods x and y, within the
two dimensional vector space of goods and services V . (see Figure 1) In
the dual vector space, V ∗ we can picture corresponding price vector p. (see
Figure 1) Once a price vector is specified we can define βp to be the subset
of V given by βp = {v ∈ V : p · v = 0}. This represents all baskets of x and
y corresponding to ‘barters’ in the pricing system p; these are, as the name

18
suggests, composites of debts and assets whose total value is identically zero.
The corresponding subspace βq can be defined within V ∗ .

Figure 1.1: Vector subspaces in a two good economy.

Let us now look at the subspace [q] spanned by the vector q in V . The
dimension of [q] will clearly be 1 so long as q 6= 0; likewise the dimension
of the subspace βp will be n − 1 so long as p 6= 0. Therefore we can, in the
generic situation, decompose any other basket v into its projections onto [q]
and βp (see Figure 2). The v[q] component represents the multiple of q whose
value is equal to that of v in the pricing system p. Given the equivalence in
value between v and v[q] , there must exist a barter in βp which transforms v
into v[q] ; this is the economic interpretation of vβp .
The subspaces q and βp will, however, not be linearly independent on
those occasions when p · q = 0, in which case we will not be able to span
the space V in this way. In order to avoid this degeneracy, we define C to
be all points in V × V ∗ such that p · q = 0, and remove these points from
consideration.

19
Figure 1.2: Decomposition of a vector v into components.

Let us assume that we have n real valued functions

fi : Ξ −→ R. (1.13)

We can then assemble these real valued functions into a single vector valued
function
σ : Ξ −→ V (1.14)
given by
n
X
σ(q1 , ..., qn , p1 , ..., pn ) = fi (q, p)ei . (1.15)
i=1

Such a vector valued function is called a section in differential Geometry


(see appendix A). This mathematical construct will enable us to keep track
of the best analog of our reference basket as we antiquate or modernise its
constituents. The real-valued functions fi which determine the vector-valued
function σ are given by the contributions of the various basket components
in their various units of measurement (kilograms, gallons, etc.).
The total differential, or Jacobian, of σ is defined to be

20
n
X
o
∇σ= dfi (q, p)ei (1.16)
i=1
n n n
X X ∂fi X ∂fi
= ( dqj + dpk )ei (1.17)
i=1 j=1
∂qj k=1
∂pk

Let α : R −→ Ξ be given by 2n functions (q1 (t), ..., qn (t), p1 (t), ..., pn (t))
with tangent vector α̇ = ( dα
dt
) then
n X n n
X ∂fi dqj X ∂fi dpk
∇oα̇ σ = ( + )ei (1.18)
i=1 j=1
∂qj dt k=1
∂pk dt

This is simply the Jacobian evaluated in the direction of the tangent field
α̇ = dα
dt
along α.

Let us now define the derivative uniquely adapted to achieve the ap-
propriate index number. The non trivial ‘economic’ covariant derivative of
σ : Ξ −→ V along α is:

∇aα̇ σ = Π[qt ] ∇oα̇ (Π[qt ] σ) + Πβpt ∇oα̇ (Πβpt σ) (1.19)

The interested reader may check that ∇a satisfies the axioms for a covariant
derivative found in Appendix A. 4
If ∇aα̇ σ = 0, we will say that σ is covariantly constant along α with
respect to the covariant derivative ∇a . If σ is covariantly constant along α
with respect to a given covariant derivative ∇ and σ(α(s)) = v we will refer
to σ(α(d)) as the parallel translation of v along α from the ‘source’ time
t = s to the ‘destination’ time t = d. We will denote this parallel translate
by τs (d); that is τs (d) is determined as a particular value of a vector valued
function σ, covariantly constant along a curve. It should be noted that it is
possible to parallel translate using any covariant derivative. If the parallel
4
It should be noted that when the vector valued function σ(x) is a multiple of the
quantity vector comprising x ∈ Ξ, the second summand is identically zero; this simplifies
matters considerably and is the major case of interest in this paper. For example, if we
are attempting to track the growth of a basket q(0) from time t = 0 to time t = 1, the
vector σ(0) at the source time t = 0 will lie entirely in the subspace cut out by q(0). The
projection of σ onto the barter space, Πβpt σ, will therefore be zero. In fact if ∇aα̇ σ = 0
then it will be the case that along α we will have Πβpt σ = 0 for all t.

21
translation is done using the ordinary derivative, it will simply give us the
original basket. This corresponds to the traditional notion of constancy. We
will denote the parallel translate of q(s) (or p(s)) along α with respect to the
ordinary derivative ∇o by τ o ; likewise, the parallel translate of q(s) (or p(s))
along α using the adapted covariant derivative ∇a will be written τ a .
At this point one might ask ‘what is the economic interpretation of the
equation ∇aα̇ σ = 0?’ In order to address this question, it will help to recall
our earlier interpretation of the various projection maps. Let us begin by
making the assumption that σ(α(0)) = q(0); while this is certainly not a
generic assumption, all σ’s considered in this paper will satisfy similar con-
ditions. In order to induce modernisation or antiquation of the σ(t) basket
components while disallowing ‘growth’, we would like to require that σ(t)
be changing at any given time strictly by barters. This allows the time de-
pendent basket σ(t) to maintain the ‘value’ of the original basket q(0) while
changing its composition as the composition of q(t) changes. This goal is
achieved by selecting the covariant derivative ∇aα̇ σ, for by construction, the
parallel translation equation requires that ∇aα̇ σ = Π[q] ∇oα̇ (Π[q] σ) = 0. In
prose, this means that the projection of the “change in σ(t)” (namely ∇oα̇ )
onto ‘basket space’ (namely [q]), must be zero, thereby confining all change
to lie in barter space. This parallel translation of the basket q(0) from time
t = 0 to time t = 1 using ∇aα̇ σ will thereby ensure that at time t = 1, σ(t)
will be some multiple of the basket q(t).
We are therefore going to seek a vector valued function σ : Ξ −→ V such
that for a source time s and a destination time d we have:

1. σ(α(s)) = q(s)

2. ∇aα̇ σ = 0

The first condition requires that our vector valued function σ agree with
the source-time basket q(s) at the point from which we are parallel trans-
lating. The second condition requires that we parallel translate this basket
using the adapted derivative.
We note that σ(α(t)) is uniquely determined by a theorem in differential
geometry.5
It is clear that depending on the covariant derivative used, such a solution
will have quite different properties. For the ordinary derivative ∇o , a solution
5
See Bishop and Goldberg pg.225-226.

22
will be of the form
σ(α(t)) = q(s). (1.20)
For the adapted derivative ∇a , however, we will have,

σ(α(t)) = Λ(t)q(t) (1.21)

As discussed above, the parallel translate of the basket from the source time
to the destination time using the adapted derivative is a multiple of the
basket at the destination time t. This multiple, Λ expresses the “size” of the
source time basket relative to the destination-time basket q(t). Therefore
the geometric index is simply the multiple Λ.
We should note at this point that there is a complementary version of
this picture where we consider vector valued functions

σ̃ : Ξ −→ V ∗ (1.22)

and an adapted derivative built from ∇0 and projections

Π : V ∗ −→ V ∗ . (1.23)

which give us the corresponding price index.

1.4 An Extended Example


In order to more clearly understand how the differential geometric approach
works, we will actually calculate this index for a simplified economy. The
example is set up to show a discrepancy between the standard Paasche and
Laspeyres indexes. In the following section we will proceed to show how the
differential geometric index resolves this discrepancy.
Imagine an island economy where there are only two goods: coconuts
(good 1) and lumber (good 2) (measured in kilotons, priced in dollars). Let
us assume that the islanders produce lumber with a saw mill which is grad-
ually rusting into disrepair, but that in partial compensation the islanders
become increasingly inventive in their ability to produce coconuts. We fur-
ther assume that the consumers on the island are developing a mild preference
for coconuts over lumber. Mild in this context means that the increased de-
mand is not strong enough to keep the coconut price from decreasing relative
to lumber. The toy economy given below will be seen to exhibit the index

23
number problem for the Laspeyres and Paasche quantity indices. While the
example has obviously been abstracted, it does exhibit the full complexity of
the paradox that we are trying to resolve.

Example 1 Consider an economy with two goods g1 , g2 examined over a


one year period 0 ≤ t ≤ 1. Let q : [0, 1] −→ V ∼ = R2 given by q(t) =
(t + 1)e1 + (−t + 2)e2 be the output function of the economy at time t. That is
q(t) gives the production of goods (g1 , g2 ) at time t. Let p : [0, 1] −→ V ∗ ∼
= R2
∗ ∗
given by p(t) = (−10t + 100)e1 + (9t + 92)e2 be the price function of the
economy. Likewise p(t) gives the prices per unit quantity of the goods (g1 , g2 )
at time t.

Let us first parallel translate the production q(0) at time t = 0 to time


t = 1. That is we want to find a function Λ : [0, 1] −→ R determining
σ : [0, 1] −→ V with
1. σ(t) = Λ(t)q(t)

2. Λ(0) = 1

3. ∇aα̇ (σ(t)) = Π[q(t)] (∇0α̇ Π[q(t)] σ) = 0


Inside of
[0, 1] × V (1.24)
(referred to as a trivial vector bundle in Appendix A) the section

q : [0, 1] −→ V ∼
= R2 (1.25)

given by
q(t) = ((t + 1)e1 + (−t + 2)e2 ) (1.26)
cuts out a changing 1-dimensional vector sub-space from V (a non-trivial
sub-bundle with 1-dimensional fiber in the geometric language of Appendix
A). Conversely, the price history

p(t) = ((−10t + 100)e∗1 + (9t + 92)e∗2 ) (1.27)

determines a complementary non-constant vector sub-space of ‘barters’ (also


with 1-dimensional fiber). What we want now is to find

Λ(t) such that ∇aα̇ Λ(t)q(t) = 0. (1.28)

24
Thus we calculate
d(Λ(t)q(t))
∇aα̇ Λ(t)q(t) = Π[q(t)] ( ) (1.29)
dt

= Π[q(t)] ( (t)q(t) + Λ(t)(e1 − e2 )) (1.30)
dt

= Π[q(t)] ( (t)((t + 1)e1 + (−t + 2)e2 ) + Λ(t)(e1 − e2 )) (1.31)
dt
dΛ dΛ
= Π[q(t)] ( (t)(t + 1) + Λ(t))e1 + ( (t)(−t + 2) − Λ(t))e2 ) (1.32)
dt dt
Recalling that
p(t) · v
Π[q(t)] (v) = ( )q(t) (1.33)
p(t) · q(t)
we have

( dΛ dΛ
dt (t)(t + 1) + Λ(t))(−10t + 100) + ( dt (t)(−t + 2) − Λ(t))(9t + 92)
∇aα̇ Λ(t)q(t) = q(t)
(t + 1)(−10t + 100) + (−t + 2)(9t + 92)
(1.34)

(−19t2 + 16t + 284) dΛ


dt
(t) + (−19t + 8)Λ(t)
= q(t) (1.35)
(t + 1)(−10t + 100) + (−t + 2)(9t + 92)
so we need to solve the equation

(−19t2 + 16t + 284) (t) + (−19t + 8)Λ(t) = 0 (1.36)
dt
with initial condition Λ(0) = 1 (which is an ordinary linear differential equa-
tion). We solve it as follows. Since
dΛ −(−19t + 8)
(t) = Λ(t) (1.37)
dt (−19t2 + 16t + 284)
we should expect that Λ(t) will have the form Λ(t) = eg(t) with
dg −(−19t + 8)
(t) = . (1.38)
dt (−19t2 + 16t + 284)
We next note that
d(−19t2 + 16t + 284)
= −38t + 16 = 2 · (−19t + 8) (1.39)
dt
25
so we are looking for a function g(t) with

dg 1 (−19t2 + 16t + 284)0


(t) = − (1.40)
dt 2 (−19t2 + 16t + 284)

which indicates that g(t) is of the form g(t) = − 12 · ln(−19t2 + 16t + 284) + c
where c is a constant to be determined from initial conditions. Thus we have
1 2 +16t+284)+c c
Λ(t) = e− 2 ·ln(−19t =√ (1.41)
−19t2 + 16t + 284
which when when
√ supplemented by our requirement that Λ(0) = 1 fixes the
value c = ln( 284) giving our final solution

284
Λ(t) = p . (1.42)
2
(−19t + 16t + 284)

Conversely, if we had wanted to parallel translate backwards from time


t = 1 to t = 0 we would have sought Λ̃ : [0, 1] −→ R determining σ̃ : [0, 1] −→
V with

1. σ̃(t) = Λ̃(t)q(t)

2. Λ̃(1) = 1

3. ∇aα̇ (σ̃(t)) = Π[q(t)] ∇oα̇ Π[q(t)] σ̃ = 0

It is easy to see that this function is given by



281
Λ̃(t) = p . (1.43)
2
(−19t + 16t + 284)

26
1.5 Resolution Of the Index Number Prob-
lem
As noted previously, the two most commonly used indexes, the Paasche and
Laspeyres, have an inherent discrepancy. Let us start by observing the effects
of updating our notion of constancy for these two indexes. We will later
extend these results to all bilateral indexes.
The Paasche and Laspeyres quantity indexes have the formulas
p(0) · q(1) p(1) · q(1)
Qold
L = Qold
P = (1.44)
p(0) · q(0) p(1) · q(0)
In both of these indices, three of the four factors which appear are from a
common time period (the period used to make the comparison) and the odd
factor is imported unchanged across the time separating t = 0 from t = 1.
It has been held constant in the usual sense of the term. This leads us to
suspect that the problem is arising from the use of a derivative, un-adapted
to our situation. We thus rewrite (1.44) in the form

p(0) · τ1 (0) p(1) · q(1)


Qnew
L = Qnew
P = (1.45)
p(0) · q(0) p(1) · τ0 (1)
where Qnew refers to an index whose previously implicit choice of derivative
(used in parallel translation) has been made explicit. τ1 (0) refers to the
parallel translate of the basket q(0) from time t = 0 to time t = 1, with
an as yet unspecified choice of derivative being used for parallel translation;
likewise, τ0 (1) refers to the parallel translate backwards of the basket q(1)
from time t = 1 to time t = 0.
Note that nothing has been lost in this reformulation as

Qold new
L = QL Qold new
P = QP (1.46)

when parallel translation is performed using the ordinary derivative ∇o (i.e.


τ1 (0) = τ1o (0) and τ0 (1) = τ0o (1).) However, when we use the covariant
derivative ∇a for parallel translation to import the data we find that the
adapted Paasche and Laspeyres (and hence the Fisher Ideal index) are in
fact identically equal
p(0) · τ1a (0) p(1) · q(1)
Qnew
L = = = Qnew
P (1.47)
p(0) · q(0) p(1) · τ0a (1)

27
thus resolving the so called index number problem.
Let us verify this for our example from the previous section:
In our example we see that the Laspeyres quantity index

p(0)q(1) 292
= (1.48)
p(0)q(0) 284

indicates positive growth, whereas the Paasche quantity index

p(1)q(1) 281
= (1.49)
p(1)q(0) 292

indicates negative growth.


Let us begin with the Paasche quantity index. We first observe that since
the quantity q(0) in the expression p(t)q(0) is being evaluated at p(t) prices,
it is really the parallel translate of q(0) from α(0) to α(t) using the ordinary
derivative. That is
p(1)q(1) p(1)q(1)
= (1.50)
p(1)q(0) p(1)τ0o (1)
where τ0o (1) denotes the parallel translation from time t = 0 to time t = 1
using the ordinary derivative.
Likewise for the Laspeyres quantity index we have

p(0)q(1) p(0)τ1o (0)


= (1.51)
p(0)q(0) p(0)q(0)

Let us first parallel translate the production q(0) at time t = 0 to time


t = 1. As in the previous example we want to find a function Λ : [0, 1] −→ R
determining σ : [0, 1] −→ V with

1. σ(t) = Λ(t)q(t)

2. Λ(0) = 1

3. ∇aα̇ (σ(t)) = Π[q(t)] (∇0α̇ Π[q(t)] σ) = 0

From our calculations in the last section we see that this function is given
by √
284
Λ(t) = p . (1.52)
(−19t2 + 16t + 284)

28
enabling us to calculate the corrected Paasche index:

p(1)q(1) 281
a
=√ (1.53)
p(1)τ0 (1) 284
Similarly, to calculate the Laspeyres index we need to parallel translate
back from t = 1 to time t = 0, which we do by finding the function Λ̃ :
[0, 1] −→ R determining σ̃ : [0, 1] −→ V with
1. σ̃(t) = Λ̃(t)q(t)

2. Λ̃(0) = 1

3. ∇aα̇ (σ̃(t)) = Π[q(t)] (∇0α̇ Π[q(t)] σ̃) = 0


which we see from (equation 43) gives us

281
Λ̃(t) = p . (1.54)
2
(−19t + 16t + 284)
enabling us to calculate the corrected Laspeyres index:

p(0)τ1a (0) 281
=√ (1.55)
p(0)q(0) 284
We thus see that:

p(0)τ1a (0) p(1)q(1) 281 ∼
= a
=√ = .9947 (1.56)
p(0)q(0) p(1)τ0 (1) 284
showing unambiguously that the economy actually shrank.
The Fisher ‘ideal’ Index, is defined as the geometric mean of the Paasche
and Laspeyres indexes:
s s
p(t)q(0) p(t)q(t) p(0)q(t) p(t)q(t)
· · (1.57)
p(0)q(0) p(0)q(t) p(0)q(0) p(t)q(0)

As the Laspeyres and Paasche are actually equal for all t, the agreement with
the Fisher index is immediate.
This result is not restricted to the Paasche, Laspeyres and Fisher in-
dexes. What follows is a formal proof of the equivalence of all bilateral index
numbers when the correct adapted derivative is used.

29
1.5.1 Technical Proof of Equivalence of Bilateral Index
Numbers
We take the opportunity below to set notation before going into the proof
of the equivalence of all bilateral adapted indices. Readers should consult
appendix A for terminology. We follow the proof of equivalence with explicit
formulas for the major indices and their differential geometric adaptations.

Preliminary Definitions
We begin by motivating the concept of bundle and sub-bundle. Intuitively
the idea is as follows.
As we have seen, the evolution of the economy we wish to investigate is
given by a path α in the space of all possible economies Ξ. It is convenient
in such situations to introduce a second copy of the space of possible prices
and quantities for use in tracking the ‘best analogues’ of the initial and final
states of the economy during the period under study. Thus we will work with
the space Ξ × V × V ∗ . The first factor Ξ will be referred to as the “base
space”. For every point x ∈ Ξ there exists a (different) copy of the vector
space V × V ∗ ; each copy (V × V ∗ )x = x × V × V ∗ is referred to as the “fiber”
above the point x. The totality of these spaces is referred to as a “vector
bundle”. The base space Ξ will function as home to the history α. The space
of fibers V × V ∗ will be used to keep track of the parallel translates of the
price and quantity vectors.
Let us now give a brief intuitive explanation of the following proof. We
first observe that all of the algebraic formulas used by the common indexes
satisfy the so called proportionality test; that is, if the original basket of
goods q(0) is a multiple λ1 of the current basket q(1) = λ · q(0), then the
quantity index is guaranteed to be λ.
As discussed in Section 2, all of the common bilateral index numbers
tacitly require a notion of parallel translation. To take but one example,
the Laspeyres quantity index purports to transport today’s goods back to
yesterday’s prices. Further, all such indices make an implicit choice of the
ordinary derivative for the purpose of this translation. By contrast, we will
insist that the choice of derivative be made explicit. In addition, we will
require that the data be expressed in a form which distinguishes the usual
price and quantity vectors (at the source and destination times) from their
parallel translates.

30
When one chooses the ordinary derivative, the parallel translation of the
source-time vector q(s) to the destination-time is merely the original vec-
tor q(s). However, when the adapted derivative is selected, we are ensured
that the parallel translate of the vector q(s) will be a multiple of the vector
q(d) at the destination time. Therefore, when this data is input into any
index number with an algebraic formula passing the proportionality test, it
is guaranteed to give us the same answer: namely λ.
Definition 5 We define sub-bundles of Ξ × V × V ∗ by
Σ0 = Ξ × V × V ∗ (1.58)
Σ1 = {(x, 0, v ∗ ) ∈ Σ0 s.t. v ∗ ∈ [px ] ⊂ V ∗ } (1.59)
∗ ∗ ∗
Σ2 = {(x, 0, v ) ∈ Σ0 s.t. v ∈ βqx ⊂ V } (1.60)
Σ3 = {(x, v, 0) ∈ Σ0 s.t. v ∈ [qx ] ⊂ V } (1.61)
Σ4 = {(x, v, 0) ∈ Σ0 s.t. v ∈ βpx ⊂ V } (1.62)

Definition 6
P (Ξ) = {α|α : [0, 1] −→ Ξ (with α piecewise differentiable)} (1.63)

Definition 7 Given an element σ(s) in the fiber Eα(s) of a vector bundle E


over Ξ with connection ∇, we will let τα∇ (σ(s), d) denote the parallel translate
of σ(s) at α(s) to α(d) along α relative to ∇.

Definition 8 Let ∇u be a(n) (unadapted) connection on Ξ × V . Then we


define the covariant derivative adapted from ∇u to be ∇a by
∇a σ = Π[q] ∇u (Π[q] σ) + Πβp ∇u (Πβp σ). (1.64)
Conversely, if ∇u is a connection on Ξ × V ∗ , then the covariant derivative
adapted from ∇u is defined to be ∇a given by
∇a σ = Π[p] ∇u (Π[p] σ) + Πβ[q] ∇u (Πβ[q] σ). (1.65)
Likewise, if ∇u is a connection on Σ0 , then the covariant derivative adapted
from ∇u is defined to be ∇a given by
∇a σ = Π[q] ∇u (Π[q] σ)+Πβp ∇u (Πβp σ)+Π[p] ∇u (Π[p] σ)+Πβq ∇u (Πβq σ). (1.66)
We will say that a covariant derivative ∇ is adapted if ∇a = ∇.

31
L4 L4
Definition 9 Let us define Θ := i=1 V i=1 V ∗ and κ ⊂ Θ by the rule

1 1
κ = {θ ∈ Θ| s.t. Θ = (A, y · A, B, · B, C, z · C, D, · D )} (1.67)
y z
with A, B ∈ V , C, D ∈ V ∗ and y, z ∈ R

Definition 10 Let A be the space of all smooth connections on Σ0 .

Definition 11 Let
T : P (Ξ) × A −→ Θ (1.68)
be given by
T (α, ∇) = (1.69)
( p(0), τα∇ (p(1), 0), p(1), τα∇ (p(0), 1),
q(0), τα∇ (q(1), 0), q(1), τα∇ (q(0), 1) )

Definition 12 We will say that a price (or quantity) index formula

F : Θ −→ R (1.70)

satisfies the proportionality hypothesis if when restricted to κ we have


1
F ( (A, y · A, B, · B, (1.71)
y
1
C, z · C, D, · D ) ) = y (or z).
z
for all non-degenerate choices of A,B,C and D with y, z ∈ R.

In actuality, we do not need the function F to be defined on all of Θ but


rather we require it to be well defined on a dense subset of non-degenerate
data (which is always the case in practice). Also, for simplicity we need only
consider the case of quantity indexes as the case of price indexes goes through
mutatis mutandis.

32
Proof of Equivalence
Theorem 2 Let I1,2 : P (Ξ) × A −→ R be a pair of quantity index formulas
where I1,2 = F1,2 ◦ T ,
T : P (Ξ) × A −→ Θ (1.72)
is as before and
F1,2 : Θ −→ R (1.73)
are two formulae which satisfy the proportionality test. Then if ∇a1 = ∇a2 =
1
∇a , we have I1 (·, ∇a ) = I2 (·, ∇a ) and Ii (α(t), ∇a ) = Ii (α(1−t),∇ a ) (time rever-

sal).
Proof: The proof proceeds in two parts. First we we will show that any
adapted connection on Σ0 is “reducible” in that it is the sum of 4 connec-
tions on the Σi sub-bundles of Σ0 . The implication of reducibility is that
if we parallel translate a vector v(α(s)) belonging to a sub-bundle Σi using
a reducible connection, then the parallel translate at the destination time d
will remain in the same sub-bundle as the original vector at source time s. As
both the sub-bundle of pricing systems Σ1 , and the sub-bundle of baskets Σ3
have 1-dimensional fibers, we are guaranteed to have our parallel translates
of p(s) and q(s) expressible as non-zero multiples of the price and quantity
data at the destination time d.
The second part of the proof will be to show that for any such reducible
connection ∇a , the 8 pieces of data comprising T (α, ∇a ) are guaranteed to
fit the pattern found in the proportionality hypothesis. Since all index num-
bers can be represented by formulas satisfying the proportionality hypothesis
above, the conclusion follows.
To begin, we must demonstrate that if ∇ is an adapted connection and
σ is a section of one of the four sub-bundles of Σi ⊂ Σ0 , then ∇aX σ is again
in Σi for any X ∈ T Ξ. For concreteness, let us look at σ ⊂ Σ3 . Now by
hypothesis we have
∇X σ = ∇aX σ = Π[q] ∇X (Π[q] σ) = Π[q] ∇X σ (1.74)
proving our claim. The claim that given any connection ∇ on Σ0 , ΠΣi ∇(ΠΣi ·)
is a connection on Σi can be verified by checking the axioms for a connection
in Appendix A, and is straightforward.
Because the 4 summands of an adapted connection ∇a are individually
connections on the sub-bundles Σi , we are assured that the parallel transla-
tion of a vector in one of the summands Σi will remain in that summand. In

33
particular, for the one dimensional [q] sub-bundle we have
a
τα∇ (q(s), d) = λq(d) · q(d) (1.75)
for λq(d) ∈ R. Now the equation for a section along α to be a parallel translate
of our source data
∇aα̇ σ(t) = 0 (1.76)
is a first order linear ordinary differential equation with initial data q(s).
This implies that there exists a unique solution to the initial value problem
which depends linearly on the initial conditions. Thus if σ satisfies (1.76)
with
σ(α(s)) = q(s) (1.77)
then by (1.75) we have
σ(α(d)) = λq(d) q(d). (1.78)
Conversely, the section
1
σ̃ = σ (1.79)
λq(d)
will also satisfy (1.76) but with the boundary condition
1
σ̃(d) = · λq(d) · q(d) = q(d). (1.80)
λq(d)
This proves:
Proposition 3 For an adapted derivative ∇a on Σ0 there exist numbers
y, z ∈ R such that
T (α, ∇a ) = (1.81)
1
(p(0), y · p(0), p(1), · p(1),
y
1
q(0), z · q(0), q(1), · q(1) ).
z
This means that in the case of an adapted covariant derivative ∇a , we
are guaranteed that T (α, ∇a ) ∈ κ. Thus by the hypothesis that our index
formulas satisfy the proportionality test we can see that
1 1
I1 (α(t), Ã) = I2 (α(t), Ã) = = (1.82)
I1 (α(1 − t), Ã) I2 (α(1 − t), Ã)
for all piecewise smooth α. QED.

34
Adapted Index Numbers
Let us now see how this adjustment in the notion of constancy corrects
some common bilateral index numbers. In the following price index number
formulas the σ’s refer to the parallel translates of prices:
Laspeyres
Original:
p1 · q 0
PL (p0 , p1 , q 0 , q 1 ) ≡ 0 0 (1.83)
p ·q
Adapted:
σ1 (0) · q 0
PL (p0 , p1 , q 0 , q 1 ) ≡ (1.84)
p0 · q 0
Paasche
Original:
p1 · q 1
PP (p0 , p1 , q 0 , q 1 ) ≡ 0 1 (1.85)
p ·q
Adapted:
p1 · q 1
PP (p0 , p1 , q 0 , q 1 ) ≡ (1.86)
σ0 (1) · q 1
Fisher
Original: s
p1 · q 0 p1 · q 1
PF (p0 , p1 , q 0 , q 1 ) ≡ · (1.87)
p0 · q 0 p0 · q 1
Adapted:
s
σ1 (0) · q 0 p1 · q 1
PF (p0 , p1 , q 0 , q 1 ) ≡ · (1.88)
p0 · q 0 σ0 (1) · q 1
Tornqvist
Original:
s
N  (p0n qn0 /p0 ·q0 )  1 (p1n qn1 /p1 ·q1 )
0 1 0 1
Y p1n pn
PT (p , p , q , q ) ≡ · (1.89)
n=1
p0n p0n
Adapted:
s
N  (p0n qn0 /p0 0·q0 )  1 (p1n qn1 /p1 ·q1 )
0 1 0 1
Y σ1 (0) pn
PT (p , p , q , q ) ≡ · (1.90)
n=1
p0n σ0 (1)

35
Walsh
Original:
PN 1/2 1
0 1 0 1i=1 (qi0 qi1 ) pi
PW (p , p , q , q ) ≡ PN (1.91)
0 1 1/2 0
j=1 (qj qj ) pj
Adapted:
PN 1/2 1 PN 0 1 1/2
0 1 0 1 i=1 (qi0 qi1 )
pi i=1 (qi qi ) σ1 (0)i
PW (p , p , q , q ) ≡ PN 1/2
= PN 1/2
(1.92)
0 1 0 1
j=1 (qj qj ) σ0 (1)j j=1 (qj qj ) p0j

Jevons
Original:
N
Y p1
PJ (p0 , p1 ) ≡ ( 0 )1/N (1.93)
i=1
p
Adapted:
N N
Y p1 1/N Y σ1 (0) 1/N
PJ (p0 , p1 ) ≡ ( ) = ( 0 ) (1.94)
i=1
σ0 (1) i=1
p

36
1.6 The Divisia Index
What is this new index? In this section we will prove that this index is
the same as the Divisia index. Divisia developed this index in the 1920’s
in a somewhat ad hoc fashion. In fact, Bennet [1920] writing a few years
before Divisia using essentially the same argument, came up with almost the
same index which differs from Divisia’s by its absence of Log functions. The
following is a description of the traditional development of the Divisia index6 :

1.6.1 Traditional Derivation of the Divisia


We seek a pair of functions

Q : R −→ R P : R −→ R (1.95)

such that n
X
p(t) · q(t) = pi (t)qi (t) = P (t)Q(t) ∀t. (1.96)
i=1

Of course, the only constraint this implies is

V (t)
P (t) = (1.97)
Q(t)

with Q(t) arbitrary. Divisia’s idea was to take the derivatives of the loga-
rithms
dln(p(t) · q(t)) dln(P (t)Q(t))
= (1.98)
dt dt
expand using the chain and product rules

p0 (t) · q(t) p(t) · q 0 (t) P 0 (t) Q0 (t)


+ = + (1.99)
v(t) v(t) P (t) Q(t)

and set the summands


p0 (t) · q(t) P 0 (t)
= (1.100)
v(t) P (t)
p(t) · q 0 (t) Q0 (t)
= (1.101)
v(t) Q(t)
6
See also Diewert and Nakamura , pp. 42

37
to be equal. We can then see that this does specify unique function P (t), Q(t)
by the formulas
1 n
p0i (t)
Z X p (t)qi (t)
P (t) = exp( Pn i ) (1.102)
0 i=1 j=1 pj (t)qj (t) pi (t)

1 n
qi0 (t)
Z X p (t)qi (t)
Q(t) = exp( Pn i ). (1.103)
0 i=1 j=1 pj (t)qj (t) qi (t)

Theorem 4 The differential geometric index is equal to the Bennett-Divisia


index .

Proof of Theorem: The Bennett-Divisia Price index can be expressed


concisely using the ‘differential form’ notation for line integrals as
R Pn
wi (α)d log(pi )
=e α i=1 (1.104)
i ·qi
where pp·q = wi is the budget share of the ith commodity 7 .
The differential geometric price index will be given in the general situation
by σp(1)
p
(1)
1
= Λ(t) where Λ is determined by
0

∇aα̇ σ = 0 σ(t) = Λ(t)p(t) σ(0) = p(0). (1.105)


Now if
0 = ∇aα̇ σ = Π[p] (∇oα̇ Π[p] (σ)) + Π[β] (∇oα̇ Π[β] (σ)) (1.106)

dΛ dp
= Π[p] (∇oα̇ Π[p] (σ)) = Πp ( (t) · p(t) + Λ(t) (t)) (1.107)
dt dt

(( dΛ
dt
(t) · p(t) + Λ(t) dp
dt
(t)) · q(t))
= p(t) (1.108)
p(t) · q(t)
dΛ dp
=( (t)p(t) + Λ(t) (t)) · q(t) (1.109)
dt dt
so
dp
dΛ (t) · q(t)
(t) = −( dt )Λ(t) (1.110)
dt p(t) · q(t)
7
Deaton and Muellbauer pp. 174-175

38
hence
t dp
(r) · q(r)
Z
dr
Λ(t) = exp(− ( )dr) (1.111)
0 p(r) · q(r)
n
Z tX dpi
dr
(r)· qi (r)
= exp(− ( )dr) (1.112)
0 i=1
p(r) · q(r)
Z tXn
pi (r) · qi (r) 1 dpi (r)
= exp(− ( ) dr) (1.113)
0 i=1 p(r) · q(r) pi (r) dr

n
pi · qi dpi
Z X
= exp(− ( ) ) (1.114)
α i=1 p · q pi
Thus
1 Pn R
Diff. Geom. Index = = e i=1 α wi (α)d log(pi )
= Divisia Index (1.115)
Λ(t)

QED.

39
1.7 Results
As we have shown in this paper, much of index number theory can be viewed
as a special topic in differential geometry. When the traditional index number
problem is formulated in the context of differential geometry, a new feature
emerges. Unlike the ordinary calculus, differential geometry points out that
there is not a unique derivative as is often assumed. There is however a
unique choice of derivative adapted from the familiar one to the problems
of index theory. Once that choice is made successfully, all index numbers
currently in favor become equivalent and are equal to the Divisia index. The
shift in emphasis from the algebraic formulas of various index numbers, and
towards the choice of derivative thereby enables us to resolve the so called
Index Number Problem. Thus by using these mathematical techniques, we
have a natural answer to the question “What is the correct index number?”
Casting the question in the right framework not only allows us to unify and
recover previous results, it also enables us to understand better the nature
of the question.
One of the primary debates one sees in the current literature on index
number theory centers around whether or not to chain bilateral indexes.
With the sophistication of the differential geometric framework, we can cast
this question in an entirely new light. As we have seen above, all index
numbers make a choice of derivative. The choice has till now been made
implicitly. When we view the issue differential geometrically we are forced
to make the choice explicit. What then becomes clear is that there are two
different derivatives being used. The ordinary derivative, which has no justifi-
cation whatsoever economically, and the adapted derivative, whose economic
justification has been presented in this paper. Once one is forced to justify
the choice of derivative, one must realise the arbitrary nature of all bilateral
indexes using the ordinary derivative. The Divisia is the only index which
uses an economically justified derivative and is therefore superior to all bi-
lateral indexes using the ordinary derivative. Any further approximation to
the Divisia is therefore an improvement. Chaining is simply one such ap-
proximation. The issue should no longer be whether or not to chain bilateral
indexes, but rather to find the best way of approximating the Divisia.
The one characteristic of the Divisia index most troubling to economists
is that it is given by a path dependent line integral. What we would claim is
that even with this path dependence (or perhaps because of it), the Divisia
is the fair and correct way to calculate cost of living adjustments in our real

40
world filled with changing preferences. Let us attempt to make an argument
for this in a particular setting in order to show that path dependence can be
more of a virtue than a vice. Let us assume that we have two individuals
starting and finishing by consuming exactly the same basket of goods, and
look at a situation where the path dependent Divisia might give us a different
index for each of these individuals. Let us say that the first, as a connoisseur
of the finer things in life, starts cultivating his taste in red wine, while this
might not be a particularly popular habit. As he is functioning under a
budget constraint he thus shifts consumption away from other goods, for
instance soda pop. If we assume there is then a faddish move toward the
consumption of red wine we should expect that wine prices will start rising.
The second consumer, following the fad, starts shifting his consumption only
after the prices have already risen. The Divisia would require compensating
the first consumer, but not the second, precisely because their paths were
different. The first started consuming wine while prices were low, and then
got hit with the price increase. The second chose to shift his consumption
only after prices had increased. What we would claim is that it is only the
path dependent Divisia index that can give a fair cost of living adjustment.
What we see in the mathematics is that a Divisia index C.O.L.A. is a
guarantee that that the recipient will be able to maintain his previous life-
style with the freedom to change to an equivalently priced life-style at any
time. Unlike the Konus C.O.L.A. discussed in Sen [1979], the person will
not be compensated if they become increasingly depressed nor penalized for
greater happiness. Unlike the Paasche and Laspeyres C.O.L.A., her adjust-
ment is based on a lifestyle which is allowed to change at any time. But there
is an important feature of Divisia compensation not found amongst the other
indices: the element of risk. Under a Divisia C.O.L.A., your lifestyle is an
investment. If you change your spending patterns, there is no guarantee that
you will be able to get back to your previous level of consumption. Far from
being exotic, these risks are well known to all those who must decide how
much currency to change when visiting nations with volatile exchange rates.
What the differential geometry emphasizes is that path dependence is prefer-
able and that the path dependence of the Divisia is no more counter-intuitive
than the path dependence underlying the theory of investment.
As previously noted, the differential geometric framework allows us to
see that this index is the natural one to emerge from both the mathematics
and the economics, and that it enables us to resolve consistency issues raised
by the plethora of existing indexes. What it also enables us to do is recover

41
previous results about the Divisia in a unified framework and to extend these
results in ways that can practically facilitate our use of the index.
For economists concerned with issues of path dependence, an important
result is path independence of the Divisia under unchanging homothetic pref-
erences [see Hulten 1973]. What does the mathematics have to say about
this?
Differential geometry allows us to calculate something called a curvature
tensor. Recall that with the ordinary notion of a derivative, mixed partials
commute. This is no longer true of general covariant derivatives. The cur-
vature tensor measures the failure of the covariant derivative to commute.
i.e. ∇X ∇Y 6= ∇Y ∇X . What differential geometry tells us is that when the
curvature tensor is equal to zero on a subspace, our index will be path in-
dependent. Homothetic preferences are a special case of a zero curvature
tensor.
Another important result is Diewert’s 1980 demonstration that when cer-
tain approximations are valid, the Paasche, Laspeyres and Tornqvist indexes
can be regarded as discrete approximations to the Divisia. In addition, it has
been noted that chaining usually improves the approximation and leads to
less variation among index numbers. As discussed previously, the differential
geometry shows us that chaining is a crude way of approximating our adapted
derivative. Given that the adapted derivative resolves the consistency issue,
greater agreement under chaining is therefore explained.
Thus we see that not only does our approach recover the above results,
but it also gives a unifying framework in which to understand them.

1.7.1 Extensions
Besides recovering previous results in a consistent framework, differential
geometry enables us to extend our knowledge about the Divisia in practically
applicable ways:
I.) Using the theory of connections, it is possible to do for the space of
barters what has been done for the space of baskets. In this paper we have
shown how to modernize or antiquate the components of the usual ‘fixed
basket’ to reflect changes in production and/or consumption.
Understanding how barters change is essential to understanding the econ-
omy as a whole. For example, if our basket q(t) is taken to be the U.S. GDP,
an individual’s basket will consist of a non-zero barter vector B added to
a scaled basket vector λ · q(t) (here B is asserted to be non-zero because

42
individuals never purchase multiples of the GDP). Further, the entire theory
of option pricing rests on the fact that barters today will not in general be so
tomorrow. It can be shown however that the adapted derivative gives us a
natural identification between the spaces of barters at two different instants
of time. This in turn results in an n-1 × n-1 matrix analog for barters of the
usual Divisia index for baskets. The significance of this matrix index is not
yet clear and and is the subject of work in progress with E. Weinstein.
II.) Differential geometry allows us to discuss path independence in the
absence of knowledge of homothetic preferences. Let us assume that we are
not given access to preferences, but that we are instead given some region
a
R ⊂ V × V ∗ such that when we restrict the curvature tensor F ∇ to R we get
zero. Then if α1 , α2 are two curves with the same end points and α1 can be
deformed to α2 within R while keeping the endpoints fixed, then the Divisia
of α1 equals α2 .
a
III.) When the curvature tensor F ∇ 6= 0 we can use the ‘size’ of the
curvature tensor to estimate how far we are from path independence. In
other words, we can use the size of the curvature tensor to estimate the local
contribution to the path dependence of the Divisia index. Thus, this allows us
to deviate from pure homothetic preferences and estimate data needs based
on the curvature intensity of a region.
These issues will be further explored in following papers.

1.8 Appendix A: Differential Geometric Ex-


planation: Notation, Conventions and Tools
from Gauge Theory
1.8.1 Notation for Goods and Services
Let us assume that we are interested in a sector of the economy comprised
of some n goods and services. We will assume that each of these goods and
services come equipped with a natural price independent unit of measure-
ment. Thus gasoline will be measured in gallons and labor in hours. We will
also assume that all prices are linear and are therefore measured in dollarsunit .
Let us order the goods and services together with their system of units and
denote the collection by {gi , ui }ni=1 . Given this initial data we will describe
the evolution of our sector by the collection of data Υ = (V n , α, ∇0 ) given

43
as a triple. We will postpone the definition of ∇0 until section 1.8.3.
Let V n be the vector space of all quantities of goods and services in the
sector of the economy under consideration. That is we let {ei }ni=1 ⊂ V n be
a basis for V indexed by the (ordered) set of n goods and services under
consideration in their natural units of measurement (kilograms, units, etc...).
We will interpret negative multiples of these goods as debts or obligations to
provide.
We will let V ∗ be the dual space of linear functionals φ : V −→ R. This
is a vector space of the same dimension as V . We note that given our basis
{ei }ni=1 ⊂ V n there is a corresponding basis {e∗i }ni=1 ⊂ V ∗ where e∗i is defined
by specifying that e∗i (ej ) is equal to 1 if i = j and 0 otherwise.
We wish to comment here that the reader may choose to represent the
elements of V as ‘column’ vectors and the elements of V ∗ as ‘row’ vectors.
In this notation the act of evaluating an element µ∗ of V ∗ on a vector v is
obtained by matrix multiplying the 1×n row vector against the n×1 column
vector to get a real number (i.e. a 1 × 1 matrix).
If we assume that we are interested in an initial state of the economy
at time τ0 contrasted against a later state of the economy at time τ1 then
we can define ρ : [τ0 , τ1 ] −→ [0, 1] by ρ(x) = ( τx−τ 0
1 −τ0
) in order to restrict our
formalism to the interval [0, 1]. From now on we will assume that the time
interval of interest will always be [0, 1].
Let Ξ = {(v, φ) v ∈ V, φ ∈ V ∗ s.t. φ(v) 6= 0} be the collection of pairs
of baskets of goods and possible pricing systems.
Then let α = (m, π) where α : [0, 1] −→ Ξ and π : [0, 1] −→ V ∗ gives
the prices of the goods and services under consideration. Conversely m :
[0, 1] −→ V denotes the (possibly) time dependent basket of goods and
services with which we are attempting to calibrate the economy.
Given any non-zero vector w in a vector space W , we will denote by [w]
the linear subspace spanned by w.
With this defined we argue that if we are given any element (v, φ) ∈ Ξ
we get a decomposition of our vector space V as follows. In the first place
φ determines a subspace β[φ] defined by β[φ] = {h ∈ V s.t. φ(h) = 0}. Our
bracket notation is justified by the fact that β[φ] is independent of whether
one chooses to work with φ or r · φ (r 6= 0). Now by our above definition of
Ξ we are guaranteed that

Proposition 5 For any point (v, φ) ∈ Ξ we get a canonical decomposition


V = [v] ⊕ β[φ] . That is, for any vector w ∈ V there exist unique vectors

44
w[v] ∈ [v] and wβ[φ] ∈ β[φ] such that w = w[v] + wβ[φ] .

Proof: By hypothesis we know that φ(v) 6= 0 which implies in particular


that φ 6= 0 and thus that φ : V n −→ R is a surjective linear map. This means
that the dimension of its kernel β[φ] is n − 1. Further, we know that v 6∈ β[φ]
and our conclusion follows from the dimension count (n-1)+1=n. QED
In the same vein, if we let β[v] 8 = {κ ∈ V ∗ s.t. κ(v) = 0} then we have

Proposition 6 For any point (v, φ) ∈ Ξ we get a canonical decomposition


V ∗ = [φ] ⊕ β[v] .

Proof: The argument above goes over word for word in this case with V ∗
and V interchanged. QED
The importance of the decompositions is that they determine projection
maps onto the various subspaces. Given vectors y ∈ V, ω ∈ V ∗ and our
point (v, φ) ∈ Ξ we get projection maps

Π[v] : V −→ [v] (1.116)

Πβ[φ] : V −→ β[φ] (1.117)


Π[φ] : V ∗ −→ [φ] (1.118)
Πβ[v] : V ∗ −→ β[v] (1.119)
defined as follows.
φ(y)
Π[v] (y) = ·v (1.120)
φ(v)
Πβ[φ] (y) = y − Π[v] (y) (1.121)
ω(v)
Π[φ] (ω) = ·φ (1.122)
φ(v)
Πβ[v] (ω) = ω − Π[φ] (ω) (1.123)
8
One might ask whether there exists a word corresponding to ‘barter’ for currencies
which view our basket m(t) as worthless. Unfortunately the word exists; the Oxford English
Dictionary defines Floccinaucinihilipilification as “The action or habit of estimating as
worthless” so we rise to the challenge and suggest to the brave reader that the noun
describing the elements of β[v] is probably ‘Floccinaucinihilipilificators.’

45
1.8.2 Vector Bundles, Sections etc....
We now turn to a discussion of vector bundles. We wish to say at the outset
that we will only be discussing a restricted class of vector bundles and will
not need the full formalism. Given a vector space W n of dimension n and a
(topological) space X (set, manifold etc...) we refer to X × W as the total
space of the trivial vector bundle with base space X and fiber W . For
most all of our discussion, X will be an open dense subset of Rn × Rn ∼ = R2n
so that X 2n = R2n − C on which we will use the standard coordinates for
R2n , agreeing at the outset simply to ignore the points of C. (Note: C is a
2n − 1 dimensional space with singularities).
Let Pm (W n ) denote the space of (oriented) m-dimensional planes in an
n-dimensional vector space W n . That is, the points of the space Pm (W n )
are in one to one correspondence with the oriented m-planes through the
origin 0 ∈ W n . Denote by Pp ⊂ W n the plane corresponding to a point
p ∈ Pm (W n ).
Now if the points x ∈ X parameterise a family of m-dimensional sub-
spaces PZ(x) ⊂ W by a map Z : X −→ Pm (W n ) we can define a new non-
trivial bundle; we refer to the space Σ = {(x, u) : (x, u) ∈ X × W n and u ∈
PZ(x) } as a (non-trivial) vector bundle with fiber Rm and base space X (the
idea being that a vector bundle is trivial if it is given as a cartesian product).
The bundle we have described is a sub-bundle of the trivial bundle X ×W n .
Note that a trivial bundle is always a sub-bundle of itself (ie. when n = m).
In this paper m will usually be either n − 1 or 1; we note that in these cases
we have Pm (W n ) = S n−1 .
The surjective map π : Σ −→ X given by π((x, u)) = x (defined for a
sub-bundle of a trivial bundle) is referred to as the ‘projection map onto
the base space’. The collection of points PZ(x) = π −1 (x) are called the fiber
of the vector bundle at x (or ‘above x’).
Now a map σ : X −→ Σ is called a section of Σ if for all x ∈ X we
have σ(x) = (x, u) for some u ∈ π −1 (x). This is equivalent to requiring that
π ◦ σ(x) = x. The space of all sections is denoted by Γ(X, Σ). The space of
smooth (that is infinitely differentiable) sections is denoted by Γ∞ (X, Σ).
It is important for our paper to know that given a curve α : [0, 1] −→ X,
then every vector bundle over our space X determines a vector bundle over
[0, 1] as follows. If E is a vector bundle over X with projection map π, define
the pull-back bundle α∗ (E) to be the set of pairs α∗ (E) = {(t, u) s.t. t ∈
[0, 1], u ∈ π −1 (α(t))}. Thus if E = X × W n then α∗ (E) is simply the trivial

46
bundle [0, 1] × W n . Likewise if we have a sub-bundle of X × W n given by a
map Z : X −→ Pm (W n ) then we get a sub-bundle of [0, 1] × W n given by the
map Ẑ : [0, 1] −→ Pm (W n )which sends t to the point Z(α(t)) ∈ Pm (W n ).

1.8.3 Discussion of Tangent vectors, Co-vectors and


Covariant Derivatives
How does one define the (intuitive notion) of the space of tangent vectors at
a point x ∈ X? If we are lucky enough to be in the situation where there
exist natural coordinates on our space X (in our case Ξ) then there exists a
straightforward description. This situation is considerably simpler than the
general case and it is this case to which we address ourselves.
Let us first review the total differential df of a function f : Rn −→
Pn In ∂fgeneral economics usage the total differential df is defined by df =
R.
i=1 ∂xi (x1 , · · · , xn )dxi where the symbols dxi are referred to as differentials
and are somewhat abstract. In differential geometry the ‘total differential’ of
a function is referred to as the ‘exterior derivative’ and the dxi ’s are defined
as ‘co-vectors’. The idea is the following. We identify the intuitive concept
of the ‘unit tangent vector in the xi direction’ with the partial derivative ∂x∂ i .
The span of the space of these symbols at a point x ∈ X is denoted by Tx X
and is referred to as the tangent space to X at the point x.
With this stated we return to the symbols dxi . The idea here is that these
symbols belong to the dual space Tx∗ X which is referred to as the cotangent
space at the point x. That is, these symbols eat vectors and spit out numbers
in a linear fashion. For example dxi ( ∂x∂ j ) is equal to 1 if i=j and 0 otherwise.
With this in mind we see that the total differential is really a convenient
bookkeeping device which keeps track of all directional derivatives.
A connection (or equivalently co-variant derivative) is really a kind of
‘total differential’ in the setting of vector bundles. The first thing to remark
is that it is a total differential of ‘vector valued functions’ which are the
sections we have met above.
Definition 13 A covariant derivative ∇ on a vector bundle Σ over a man-
ifold X is a map ∇ : Γ∞ (X, Σ) −→ Γ∞ (X, T ∗ (X) ⊗ Σ) such that we have
1. Pointwise linearity in the tangent vector argument:
∇(f Y +gZ) σ = f ∇Y σ + g∇Z σ
2. Additivity in the section argument: ∇Y (f σ + gµ) = ∇Y f σ + ∇Y gµ

47
3. Product Rule: ∇Y f σ = df (Y )σ + f ∇Y σ

where f, g ∈ C ∞ (X), Y, Z ∈ Γ∞ (X, T X) and σ, µ ∈ Γ∞ (X, E).

The simplest connection is just the familiar total differential of ordinary real
valued functions.

Example 7 Usual derivative of ordinary vector valued functions of several


variables, ∇0 .

Consider a trivial vector bundle Rn × W m with base space Rn and {ei }m


i=1 a
m n n n
basis for W . Then if s : R −→ R × W is a smooth section given by
m
X
s(x) = (x, fi (x)ei ) fi (x) ∈ C ∞ (Rn ) (1.124)
i=1

then we define the trivial connection ∇0 by the formula


m X
n
X ∂fi (x)
∇0 s(x) = (x, ei ⊗ dxj ). (1.125)
i=1 j=1
∂xj

If V = nj=1 aj ∂x∂ j ∈ Tx Rn is a tangent vector at the point x we define the


P

covariant derivative of s in the direction V to be given by


m X
n
X ∂fi (x)
∇0V s(x) = (x, aj ei ) ∈ Wx . (1.126)
i=1 j=1
∂xj

If we have a vector bundle Σ over a base space R1 (or any connected subset
thereof) and a (possibly non-trivial) connection ∇ on Σ, then we define the
parallel translations T (wp , ∇) of a vector wp in the fiber above p to be
given by the section σ : R1 −→ Σ which satisfies

∇σ = 0 and σ(p) = wp . (1.127)

This is guaranteed to exist and be unique by a theorem in ordinary linear


differential equations. We solve an explicit problem of this kind in section
1.4

48
Chapter 2

Welfare Implications of Divisia


Indices1

2.1 Introduction
In recent months the U.S. Senate has been actively debating the Consumer
Price Index (CPI) and its relation to the “true” cost of living index. The
importance of this question for the U.S. economy is undeniable. According
to recent Senate hearings on the CPI, 30% of federal outlays and 45% of
federal revenues are indexed to the CPI, and a 1% overstatement in the CPI
costs the U.S. government $280,000,000,000 over 7 years.
The current CPI issued by the U.S. Bureau of Labor Statistics is based
on a Laspeyres index PL = PP (1)·Q(0)
(0)·Q(0)
for a representative consumer whose
fixed basket is updated once every 10 years or so. It is widely accepted,
however, that this is not a cost of living index. There are several reasons for
this, both technical (eg. the way data on housing is handled by the BLS)
and theoretical. We will focus here on that part of the failure of the CPI to
evaluate the true cost of living adjustment that arises from the theoretical
issues such as substitution effects, the new goods bias, and changes in quality.
The goal of a ‘cost of living adjustment’ PE (t) is that a utility maximising
consumer purchasing a basket q(0) with expenditure E(0) at time 0, should
receive the same utility from an expenditure of

E(t) = PE (t)E(0) (2.1)


1
This chapter is co-authored with Eric Ross Weinstein in the MIT mathematics de-
partment.

49
at time t.2
The Konus index (often referred to as the ‘true cost of living index’ or
simply as ‘the economic index’), is believed by most economists to be a
complete theoretical (if incalculable) solution to this welfare problem. Pollak
(1983) defines the Konus as a ratio of expenditures in the following way:

a b E(P a , s, R)
PK (P , P , s, R) = (2.2)
E(P b , s, R)
where P a and P b are prices at t = a and t = b, R represents a preference
ordering, and s represents the choice of a base indifference curve from that
map.
While the Konus may seem the most natural and intuitively appealing
solution for the index PE given normal neoclassical assumptions, it does in
fact make severely restrictive demands and is therefore not the most realistic
measure of welfare in the real world. Perhaps the two most severe require-
ments are that the Konus requires access to an individual’s preference map,
and that no changes in taste occur during the period in question. In ad-
dition, using this index involves dealing with an “index number problem”.
Whereas the Paasche and Laspeyres indexes must fix a reference basket, the
Paasche-Konus and Laspeyres-Konus are fixed utility indexes which involve
the choice of a reference indifference curve, and thereby suffer from much of
the arbitrariness plaguing data based bilateral indices.
In this paper we will examine the welfare implications of the Divisia price
index Z 1Xn
pq dpi
PD = exp Pn i i (2.3)
0 i=1 j=1 pj qj pi

defined for a consumer consuming a basket q(t) under p(t) prices. While
this index is given by a path dependent line integral, as a data based index
it suffers from none of the preference restrictions under which the Konus
labours.
It is clear that the Laspeyres index, currently being used for the CPI
is an overestimate of the Konus index, PK . What is not clear, however, is
by how much. The economists who have been asked by the government for
2
This paper addresses itself to the benchmark case where marginal analysis is applica-
ble. Accordingly, all functions of time should be assumed once continuously differentiable
unless otherwise specified. If care is taken, this assumption can be weakened in some cases
to continuous or continuous and piecewise differentiable.

50
their estimations of the overstatement have delivered a range of suggested
adjustments to the CPI. This raises two important questions:

1. Without access to preference maps, how do we know if any of our


guesses are correct?

2. In a society in which changes in preferences are far from negligible is


there a more appropriate way to estimate the welfare of a representative
consumer than the standard fixed preference Konus Index?

It is primarily towards answering these two questions that this paper is


directed: i.e. which experimental index PE in Equation 2.1 ensures that the
consumer’s utility stays constant. We show that under the right assumptions,
the Divisia index provides a complete solution to these questions.
This paper should be viewed as a companion paper to [Malaney I]. In
[Malaney I] a new relationship between the traditional bilateral indexes (eg.
Laspeyres, Fisher) and the Divisia was explored. This relationship involved
no use of the chain principle but instead relied on the use of the calculus of
covariant derivatives used in differential geometry.
Unlike chaining, the differential geometric method immediately illumi-
nated welfare aspects of the Divisia. In that paper it was shown that there
exists a special “economic” derivative uniquely capable of separating income
and substitution effects. It was then shown that all bilateral index formu-
las made implicit and unjustified use of a non-economic derivative which
confuses the two effects; once a systematic replacement was carried out all
formulas were shown to produce indexes exactly equivalent to the Divisia.
Somewhat surprisingly, path dependence was seen to emerge from this
analysis as being advantageous with important welfare implications. As path
dependence has long been considered the major drawback of the the Divisia
index, the novelty of the assertion prompted the question of how it was
possible to derive welfare implications from an index which resulted from a
change in derivatives. As measurements of welfare and changes in standard
of living are at the heart of index number theory, the importance of this
question was clear. When viewed from a differential geometric standpoint,
the answer to the question also becomes apparent. In this paper we attempt
to translate these insights into a framework familiar to economists using
advanced calculus. The geometric details will appear elsewhere.

51
2.2 Welfare and Path Dependence
Let us start by looking at why the issue of changing preferences is of funda-
mental importance to a welfare index.
Standard neoclassical economic analysis generally makes the simplifying
assumption that preferences are static; as mentioned before, the definition of
the Konus index makes this assumption implicitly. In (Malaney I), however,
we discuss the example of a “fad” based upon the changes in taste one is
likely to see in any realistic model of an economy. The situation described
was that of two persons starting and ending with the same basket of goods
comprised of red wine and soda pop.
Let us assume at time t0 that Consumer A and Consumer B share not
only preferences but also a budget constraint I(0). For the sake of simplicity,
let us consider a toy economy in which the only goods are wine and cola.
Assume that at t0 the preferences of both consumers cause them each to
purchase 70% cola and 30% wine at p(0) prices. Assume that between t0 and
ta demand, supply and prices remain nearly constant but that consumer A
begins to cultivate a taste in red wine causing her to shift her consumption
towards 30% cola and 70% wine; consumer B’s preferences and purchases
remain static.
If one then assumes that between time ta and t1 a fad for red wine in-
creases society’s demand for wine, we should expect to see an increase in
price as well. If consumer B is carried along with the fad, she will increase
her consumption of wine despite the fact that the price of wine is increas-
ing. We note that this common behavior associated with fads and trends, is
associated only with the exotic Giffen goods under static preferences.
We may then assume that at time t1 both consumers are purchasing 30%
cola and 70% wine. If we compute their Divisia Indexes we will find that
Consumer A will have undergone considerable inflation while Consumer B
will not have. This despite the fact that their initial and final baskets are
the same.
The point here is that A was hit with a price increase in the basket she
was already consuming, while B increased her consumption into a good whose
price had already undergone much of the increase.
A continuous C.O.L.A. based on the Divisia price index would protect the
rights of both consumers to keep their life styles intact in the face of a price
increase but it would compensate both consumers differently. This led us in
[Malaney 1] to make the point that the path dependence of the Divisia index

52
C.O.L.A represents something directly analogous to the path dependence of
currency exchange or stock portfolio composition; with a Divisia C.O.L.A.
your lifestyle is an investment. You can always retain your way of life but
you cannot necessarily return to it if you change your preferences.

2.3 Implicit Assumptions Underlying a Cost-


of-Living Index.
Before considering any proposed experimental income adjustment PE (t) meant
to neutralise the effects of inflation, it is important to consider the implicit
assumptions underlying our notions of welfare. Let us look once again at
Pollak’s definition of the Konus index:

“The cost of living index is the ratio of the minimum expenditures


required to attain a particular indifference curve under two price
regimes. We denote the cost of living index by I(P a , P b , s, R):

E(P a , s, R)
I(P a , P b , s, R) = (2.4)
E(P b , s, R)

The notation emphasizes that the index depends not only on the
two sets of prices, P a and P b , but also on an initial choice of
an indifference map or preference ordering, R and the choice of
a base indifference curve s from that map. One set of prices is
called “reference prices” and the other, “comparison prices’. If
the comparison prices are twice the reference prices, the index is
2; if they are one-half, the index is one-half.” -The Theory of the
Cost of Living Index Robert Pollak Pg. 94

It is plain from the above definition that ‘the cost of living index’ (otherwise
known as the Konus price index PK ) is only defined under the hypothesis
of unchanging ordinal preferences. The assertion commonly made is that if
a consumer with a given cardinal utility function U maintains such a static
indifference map R, then PK = I(P a , P b , s, R) by its definition insures ‘con-
stant utility’. We do not take issue with this statement in this paper except
to point out that this assertion appeals to an implicit condition on U which
we wish to make explicit.

53
Let us assume for the moment that we were interested in a consumer
whose preference map Rt was derived from a dynamic utility function U :
RTime × V −→ R. It is quite possible that our consumer’s “efficiency as
a [cardinal] pleasure machine” might change even if her indifference map
Rt was constant. For example, consider a static cardinal utility function
U0 : V −→ R with an indifference map R0 . Then

Ua (t, v) = U0 (v) (2.5)

and
1
Ub (t, v) = ( Sin(2πt) + 1)U0 (v) (2.6)
2
are both time dependent utility functions with a common indifference map:
R0 .
Let us then assume that our utility maximising consumer possesses just
such a static indifference map R derived from the cardinal utility function
U (s, v) and at time t expends an adjusted income of I(t) = PK (t)I(0). In this
case, the assumption that her Konus compensation yields constant cardinal
utility uc .
U (t, q(t)) = uc ∀t (2.7)
implies that
dU ∂U ds ∂U dq
= + = 0. (2.8)
dt ∂s dt ∂q dt
We know however from the definition of the Konus that dq dt
lies tangent to
∂U
the indifference curve while ∂q = ∇U is orthogonal to the tangent planes of
U ’s level sets implying
∂U dq
= 0. (2.9)
∂q dt
Therefore, for U (t, q(t)) to be constant under Konus compensation, we must
make the single assumption

∂U (s, q(t))
=0 whenever t=s. (2.10)
∂s
Relative to the basket q(t), equation (2.10) refers to the constancy of
what what Sen calls the ‘change in man’s efficiency as a pleasure machine’
and what Balk refers to as the ‘cost of living effect of pure preference change’.

54
2.3.1 Changing Preferences and Psychological Neutral-
ity
The Konus index in its standard form is of no use in evaluating welfare
considerations in the context of variable preference maps. However, if we
restrict our attention to those utility functions which:

1. Possess a static indifference map R0 .

2. Satisfy equation (2.10).

then the Konus guarantees constant utility. This raises the question of
whether there is any way to guarantee constant cardinal utility when the
first assumption is lifted.
Happily this question can be answered in the affirmative; we will show
in this paper that the Divisia index PD can be seen to be an extension of
the Konus index into the realm of changing preferences where only the above
“condition 2” is retained. To this end we make the following precise definition
based on equation (2.10).

Definition 14 Let U : R×V −→ R be a (time dependent) utility function on


V . That is that at any instant of time s0 , the function U (s0 , ·) : V −→ R has
a system of differentiable convex indifference curves. Let us assume that we
are given a time dependent basket of goods q : R −→ V with q(t) representing
the goods purchased at time t.
We will say that U is psychologically neutral3 for the basket q if

dU (s, v)
|s=t,v=q(t) = 0 (2.11)
ds

for all t. Given a dynamic utility function Û , we will refer to any U which
shares the same indifference curves as Û while satisfying (2.11) as a psycho-
logical neutralisation of Û .

While a given Û will not in general satisfy (2.11), it is always possible to


replace such a function with a function U which is psychologically neutral
and shares the same indifference curves as the original Û .
3
The terminology is based on Sen’s discussion of the welfare considerations surrounding
a manic depressive whose utility function is akin to that of equation (2.6).

55
With this in mind, we can introduce the function
Z s
∂ Û (t, q(s))
Ũq (s, v) = Û (s, v) − dt (2.12)
0 ∂t

which we will refer to as the canonical neutralisation of Û relative to q. This


function has the same indifference curves as Û at every instant of time but is
psychologically neutral by its construction. Of course this is but one among
many satisfying the P.N. hypothesis.

56
Theorem 8 Consider a representative consumer with budget constraint I(0)
at time t = 0 and a differentiable time-dependent convex utility function
Û : R × V −→ R . Let us further assume the consumer’s expenditure at time
t is given by an experimental price index PE (t) according to the formula

I(t) = PE (t)I(0) (2.13)

Let p(t) be a differentiable 1-parameter family of price vectors with q(t) be


the basket of goods which maximises U (t, ·) subject to the income constraint
I(t). Then if U is any psychological neutralisation of Û and PK , PD , QD
are the Konus price, Divisia price and Divisia quantity indices we have:

1. Utility is constant if and only if the Divisia quantity index


is 1:
U (t, q(t)) = U (0, q(0)) ∀t if and only if QD (t) = 1 ∀t.
2. Within the realm of marginal analysis, the Divisia Index is
a perfect “cardinal indicator of ordinal utility”4 :
dU (t,q(t)) dQD (t)
(a) dt
< 0 if and only if dt
< 0.
dU (t,q(t)) dQD (t)
(b) dt
> 0 if and only if dt
> 0.
dU (t,q(t)) dQD (t)
(c) dt
= 0 if and only if dt
= 0.
3. If the hypotheses needed for the definition of the Konus price
index (i.e. preferences and utility are fixed) are placed on the
Divisia, then the Divisia is equal to the Konus:
QD (t) = 1 ∀t and ∂U∂s(s,v)
= 0 ∀s, v ⇒ PK (t) = PD (t) ∀t.

Proof: For part 1, we first show that if the consumer’s PN utility U (t, q(t))
is constant for all time t then his Divisia quantity index QD (t) is equal to
unity.

dU (s, q) ∂U (s, q) ds ∂U (s, q) dq


0= = + (2.14)
dt ∂s dt ∂q dt
but by the assumption of psychological neutrality the first summand vanishes
and we have
∂U (s, q) ds
= 0 at q(t) (2.15)
∂s dt
4
Samuelson & Swamy 1974, Page 568 and Sen 1979, Page 404

57
Now define βt to be the time dependent space of barters

βt = {v ∈ V s.t. pt · v = 0} (2.16)

i.e. all baskets which our pricing system at time t evaluates as representing
a ‘fair trade’.
Then by the assumption that the basket q(t) represents the purchases of
a utility maximiser, we know that any instant of time t there exists a strictly
positive constant λt such that (∇U )q(t) = λt pq(t)

∂U (s, q) dq dq dq dq
∴ = λt p t ( ) = 0 ⇒ p t ( ) = 0 ⇒ ∈ βt ⇒ QD = 1 (2.17)
∂q dt dt dt dt
To establish the converse, assume that the Divisia quantity index is equal
to unity for all t.
dq dq
QD = 1 ⇒ ∈ βt ∀t ⇒ pt ( ) = 0 (2.18)
dt dt

βp (t) = T U −1 (U (q)) ⊥ ∇U (2.19)

dq
· ∇U = 0 (2.20)
dt
∂U dq
=0 (2.21)
∂q dt
By PN:
∂U ds
= 0 at s = t, v = q(t) (2.22)
∂s dt
∂U ds ∂U dq dU
∴ + = (2.23)
∂s dt ∂q dt dt
The second point can be deduced from the above argument.
n n
pi (s) · dqds
i (s)
Z tX Z tX
p i · qi dqi
QD (t) = exp( Pn ) = exp( Pn ds)
0 i=1 j=1 pj · qj qi 0 i=1 j=1 pj (s) · qj (s)
(2.24)

58
but by P.N., maximisation and line (2.17) we have

1 ∂U (s, q) dq dq
= pt ( ) (2.25)
λt ∂q dt dt

pi (s) · dqds
i (s) dqi (s)
Z tX Z t
1 ∇U (s, ·) · ds
exp( n Pn ds) = exp( Pn ds)
0 i=1 j=1 pj (s) · qj (s) 0 λs j=1 pj (s) · qj (s)

Z t (2.26)
dqi (s)
= exp( µ(s)(∇U (s, ·) · ) ds) (2.27)
0 ds
The point here is that the function µ is strictly positive by the positivity of
λs and I(s) = p(s) · q(s). Thus the sign of the integrand is negative, positive
or zero according to whether utility is decreasing, increasing or stagnant.
Hence
Z t
dQD (t) d dqi (s)
= exp( µ(s)(∇U (s, ·) · ) ds (2.28)
dt dt 0 ds
Z t
dqi (s) dqi (t)
= exp( µ(s)(∇U (s, ·) · ) ds)µ(t)(∇U (t, ·) · ) (2.29)
0 ds dt
dqi (t)
= ν(t)(∇U (t, ·) · ) (2.30)
dt
where ν(t) is strictly positive; this proves the assertion.
In order to establish the third point let us introduce the notation v(t) =
p(t) · q(t) = I(t) for the value of a basket We then multiply the Divisia price
and quantity indices to obtain:
dp · q + p · dq
Z
PD (t)QD (t) = exp(ln(PD ) + ln(QD )) = exp( ) (2.31)
p·q
Z Z
dv
= exp( ) = exp( dln(v) ) = exp( ln( v(t) ) + c ) = v(t)ec (2.32)
v
but since PD (0) = QD (0) = 1 we must have c = −ln( v(0) ) yielding the
expression
v(t)
PD (t)QD (t) = . (2.33)
v(0)

59
Therefore, if QD (t) = 1 we must have

v(t) p(t) · q(t)


PD (t) = = = PK (t) (2.34)
v(0) p(0) · q(0)

since q(t) is the basket of minimal cost on the indifference curve C by the
maximisation hypothesis.

QED.

60
Corollary 9 Given the same assumptions and notation as in Theorem 2:

1. Utility is constant if and only if the experimental index is the Divisia.


That is:
dU (t, q(t))
= 0 if and only if PE (t) = PD (t). (2.35)
dt
2. There is no path dependence in the index PD (t) of a consumer with
C.O.L.A. expenditure I(t) = PD (t)I(0) under constant preferences:

∂U (s, v)
PD (t) Path dependent ⇒ ∃s̃ ∈ R, ṽ ∈ V s.t. |s̃,ṽ 6= 0. (2.36)
∂s

Proof: If dU (t,q(t))
dt
= 0 then by part 1 of the preceding theorem, we know
that QD (t) = 1 ∀t. But

p(t) · q(t) I(t) PE (t)I(0)


PD (t) = = = = PE (t) (2.37)
QD (t)I(0) I(0) I(0)

Conversely, if we know that PD (t) = PE (t) we deduce that

p(t) · q(t) PE (t)I(0)


QD (t) = = =1 (2.38)
PD (t)I(0) PD (t)I(0)

and the conclusion follows from part 1 of Theorem 2.


To see that path dependence implies changing preferences we reason as
follows. Assuming the consumer with expenditure PD (t)I(0) at time t has
constant preferences, we know from Part 3 of Theorem 2 that the PD (t) must
equal PK (t). However it is clear that PK (t) is path independent when it is
defined. This implies that if PD is path dependent, the only possibility is that
the consumer must have experienced an alteration of her preference map.

QED.

Part 1 of Theorem 2 together with the definition of psychological neutral-


ity, indicates that constant utility in the context of changing preferences is a
meaningful and observable concept. Part 3 of Theorem 2 together with the

61
first part of Corollary 3 shows that in this respect the Divisia index outper-
forms the more brittle Konus. That is, it agrees with the Konus when both
are defined under constant preferences and utility but continues to guaran-
tee constant utility in the absence of constant preferences. This suggests the
idea that the definition of the Konus fixed utility price index be extended
by defining it to equal PD when QD = 1 ∀t. In the next section we back
up this assertion by showing that even more is possible. First we define four
variable-preference Konus indexes and then show through an advanced cal-
culus argument (translated from differential geometry) utilising a traditional
chaining approach, that they are all directly equal to the Divisia even under
changing utility.

62
2.4 The Konus Index under Changing Pref-
erences
As discussed before, the Konus Price Index
E(P a , s, R)
PK = (2.39)
E(P b , s, R)
measures the change in the minimal cost of getting the consumer to the level
of utility ur associated with a reference basket. Depending on whether we
take the reference basket to be the base time basket or the current time
basket, therefore, we get different indexes. The Laspeyres Konus (PLK ) uses
the base time basket as the reference basket, whereas the Paasche Konus
(PP K ) uses the current time basket. In general these are not the same,
except in the case of homothetic preferences.
Let us define Q(Ua (q(b)), p(c)) ∈ V to be the basket of minimal cost in
the pricing at time c with utility equivalent to the consumer’s basket at time
b as measured by the consumer’s utility function at time a. With the above
notation, the traditional Laspeyres Konus and Paasche Konus price indices
are given by the formulas:
p(1) · Q(U (q(0)), p(1))
PLK (p(1), p(0); U (q0 )) = (2.40)
p(0) · Q(U (q(0)), p(0))
p(1) · Q(U (q(1)), p(1))
PP K (p(1), p(0); U (q1 )) = (2.41)
p(0) · Q(U (q(1)), p(0))
Here, the subscript on the utility function is suppressed because both
these indexes implicitly assume that the preferences of the consumer, i.e. the
utility function with which he is maximising, remain absolutely constant over
time. As pointed out before, the Divisia index does not make this unrealistic
assumption. If we relax this assumption for the Konus Price index, allowing
preferences to change, we find that as well as specifying which basket is the
reference basket, we need to specify which utility function we will use as the
reference function, base time preferences or current time preferences. We
thus define four Konus indexes:
Definition 15 The dynamic utility Konus price indexes are defined by the
formulas:
p(1) · Q(U0 (q(0)), p(1))
PLKL (p1 , p0 ; U0 (q0 )) = (2.42)
p(0) · Q(U0 (q(0)), p(0))

63
p(1) · Q(U1 (q(0)), p(1))
PLKP (p1 , p0 ; U1 (q0 )) = (2.43)
p(0) · Q(U1 (q(0)), p(0))
p(1) · Q(U0 (q(1)), p(1))
PP KL (p1 , p0 ; U0 (q1 )) = (2.44)
p(0) · Q(U0 (q(1)), p(0))
p(1) · Q(U1 (q(1)), p(1))
PP KP (p1 , p0 ; U1 (q1 )) = (2.45)
p(0) · Q(U1 (q(1)), p(0))
PLKL gives the change in the cost of achieving the utility associated with
the base time basket under base time preferences, whereas PLKP gives the
change in the cost of achieving the utility associated with the base time basket
under current preferences. Similarly, PP KL represents the change in the cost
of achieving the utility associated with the current basket under base time
preferences, while PP KP gives the change in the cost of achieving the utility
associated with the current basket under current preferences.

In this section we will prove the equality between the infinitely chained
versions of these indexes and the Divisia.

Theorem 10 Assume that as vector valued functions, q : [0, 1] −→ V and


p : [0, 1] −→ V ∗ are respectively continuous and once differentiable on the
closed unit interval. Assume further that Ut is a smooth 1-parameter (time
dependent) family of utility functions Ut : V −→ R, each possessing a system
of convex indifference curves. Then the infinitely chained Laspeyres-Konus-

Laspeyres price index PLKL exists and is equal to the Divisia price index PD .

The infinitely chained Laspeyres-Konus-Laspeyres price Index over the interval [t0 , t1 ] =
s−1

Y (a)(t1 − t0 ) (a + 1)(t1 − t0 )
PLKL (t0 , t1 ) = lim PLKL ( + t0 , + t0 ) (2.46)
s→∞
a=0
s s
For the sake of simplicity we shall assume that the intervals under consider-
ation have been shifted and scaled to coincide with [0, 1].
Our strategy to demonstrate the equivalence

PLKL (0, 1) = PD (0, 1)

will be to

64
1. Re-express the infinite limit of finite products as an infinite limit of
finite sums.

2. Construct a sequence of functions whose integrals over [0, 1] are equal


to these (finite) sums.

3. Show that these functions converge uniformly to the Divisia integrand.


This shows
Divisia integral
=Integral of Limit=Limit of Integrals
=Limit of Sums=Limit of Products of Laspeyres
= Infinitly Chained Laspeyres-Konus-Laspeyres

Let us look at the infinitely chained Laspeyres-Konus-Laspeyres price


index over the interval [0, 1].
s−1
a a+1
,
Y

PLKL [0, 1] = lim PLKL
s s
(2.47)
s→∞
a=0

This limit of products can be re-expressed as a limit of sums:


s−1 s−1 a , a+1 s−1
a a+1 a a+1
, s , s
Y Y s s )
X
ln(PLKL
lim PLKL = lim
s
e = lim exp( ln(PLKL
s
)) (2.48)
s→∞ s→∞ s→∞
a=0 a=0 a=0

s−1
a a+1
,
X
= exp( lim ln(PLKL
s s
)) (2.49)
s→∞
a=0

Consider the sequence of functions on the interval [0, 1]

a a+1 a (a + 1)
φLKL
s (x) = s ln(PLKL ( , )) <x≤ (2.50)
s s s s
where s represents the number of ‘links’ in our chain and a is a non-negative
integer less than s. These functions have been specifically constructed so
that their integrals
Z 1 s−1
X a a+1
φLKL
s (x)dx = ln(PLKL ( , )) (2.51)
0 a=0
s s

65
are exactly equal to the natural logarithms of the chained PLKL -indices.
Let us look at the limit as s → ∞ of values of these functions φLKL
s (x0 )
for a fixed point x0 ∈ [0, 1].
Let as (x0 ) be the sequence of integers such that

as (x0 ) (as (x0 ) + 1)


< x0 ≤ (2.52)
s s
for all s. Obviously from our definition,
as (x0 )
lim = x0 . (2.53)
s→∞ s
We now examine the infinite limit:
as (x0 ) as (x0 ) + 1
lim φLKL
s (x0 ) = lim s ln(PLKL ( , )) (2.54)
s→∞ s→∞ s s
Setting
1
r= (2.55)
s
we get
ln(PLKL ( as (x 0 ) as (x0 )
, s + r))
lim φLKL
s (x0 ) = lim s
(2.56)
s→∞ r→0 r

ln(PLKL ( as (x
s
0 ) as (x0 )
, s + r)) − ln(PLKL ( as (x
s
0 ) as (x0 )
, s ))
= lim (2.57)
r→0 r

as (x0 ) a (x ) a (x ) as (x0 ) a (x ) a (x )
p( +r)·Q(U a (x ) (q( s 0 )),p( s 0 +r)) p( )·Q(U a (x ) (q( s 0 )),p( s 0 ))
s s 0 s s s s 0 s s
ln( s ) − ln( a (x ) s )
a (x ) a (x ) a (x ) a (x )
p( s 0 )·q( s 0 ) p( s 0 )·Q(U a (x ) (q( s 0 )),p( s 0 ))
s s s s 0 s s
s
= lim
r→0 r
(2.58)

∂ln(PLKL (t0 , t1 ))
= |t0 =t1 =x0 (2.59)
∂t1

1 1 dp(t1 ) ∂Q(Ut0 (q(t0 )), pt1 )


= · ·( · q(t0 ) + p(t1 ) · ) |t0 =t1 =x0
PLKL (t0 , t1 ) p(t0 ) · q(t0 ) dt1 ∂t1
(2.60)

66
1 dp(t1 ) ∂Q(Ux0 (q(x0 )), pt1 )
= ·( · q(x0 ) + p(t1 ) · )|t1 =x0 (2.61)
p(x0 ) · q(x0 ) dt1 ∂t1

Since we have a utility maximising consumer at an optimum where


∂Ut0
λ = p(t1 ) (2.62)
∂Q
when evaluated at t0 = t1 = x0 we must have

∂Q(Ut0 (q(t0 )), pt1 )


p(t1 ) · (2.63)
∂t1
∂Ut0 (Q(Ut0 (q(t0 )), pt1 ))) ∂Q(Ut0 (q(t0 )), pt1 )
=λ · (2.64)
∂Q ∂t1
∂Ut0 (Q(Ut0 (q(t0 )), pt1 )))
=λ (2.65)
∂t1
∂U (q(t0 ))
=λ (2.66)
∂t1
=0 (2.67)
by the definition of Q(Ut0 (q(t0 )), pt1 ) implying

1 dp(t1 ) ∂Q(Ux0 (q(x0 )), pt1 )


·( · q(x0 ) + p(t1 ) · )|t1 =x0 (2.68)
p(x0 ) · q(x0 ) dt1 ∂t1

ṗ(x0 ) · q(x0 )
= (2.69)
p(x0 ) · q(x0 )

= Divisia Integrand (2.70)


Now, by the assumption that q(t) and p(t) are continuous on the closed
unit interval, we know that they must be uniformly continuous. This in
turn guarantees that the φLKL
s converge uniformly to the Divisia integrand.
Finally this implies that the limit of the integrals is integral of the limit by
a theorem in advanced calculus. Thus, with the definitions and arguments
above we have:

67
s−1
a a+1
,
Y

PLKL [0, 1] = lim PLKL
s s
(2.71)
s→∞
a=0

s−1
a a+1
,
X
= exp( lim ln(PLKL
s s
)) (2.72)
s→∞
a=0

Z 1 Z 1
= exp( lim φLKL
s (t) dt) = exp( lim φLKL
s (t) dt) (2.73)
s→∞ 0 0 s→∞

Z 1
= exp( φLKL
∞ (t) dt) (2.74)
0

1
ṗ(t) · q(t)
Z
= exp( dt) (2.75)
0 p(t) · q(t)
1 n
p ·q
Z X dpi
= exp( Pn i i ) = PD [0, 1] (2.76)
0 i=1 j=1 pj · qj pi

QED

68
2.5 Conclusions
A comparatively small amount of work seems to have been done on the
analysis of the welfare implications of the “little understood”5 Divisia index.
This is especially curious as in this paper we have seen that for an idealised
representative consumer with static or dynamic preferences, the Divisia price
index is peerless in its ability to maintain constant utility.
In retrospect this is perhaps not so surprising for two main reasons:

1. The traditional chaining argument (connecting the Divisia to the more


common bilateral indexes) obscures the advantages and function of
the path dependent index. The differential geometric approach from
which this paper sprang is the first vantage point which views the path
dependence in its natural mathematical setting. From this perspective
the Divisia index is the unique index which correctly separates income
from substitution effects.

2. The issue of changing preferences, while sometimes discussed in the


literature, is too often treated as overly obscure or complicated. We
are fortunate in this paper to be able to reach strong welfare conclusions
without having to correct for more than ‘psychological neutrality’. This
assumption is no more than the implicit assumption needed to assure
that the Konus index guarantees constant cardinal utility.

While common simplifying assumptions like static preferences may be


non-distortionary in the case of short term economic models, index numbers
exist primarily to make evaluations over long periods of time. The assump-
tion that preferences do not change over these periods is clearly invalid. As
the Divisia does not make this assumption, but the Konus index, which rep-
resents our intuitive notion of welfare does, it requires adjusting our basic
assumptions in order to evaluate the welfare implications of the Divisia. We
prove in this paper that when we restrict the Divisia to unchanging utility,
we have equivalence between the Divisia and the Konus price indexes. More-
over, when we lift the simplifying assumption of unchanging preferences, and
adjust the Konus to accommodate changes in taste, we prove that in this
more realistic case the Divisia price index is once again equal to the Konus
price index. The relationship between the two is also shown to provide a
5
Samuelson and Swamy, pg. 578

69
means for evaluating estimates of the CPI, which is of particular relevance
given the current Congressional debate on this issue.

70
Chapter 3

Household Migration Decisions


Under Uncertainty

3.1 Introduction
The problems involved with the control of rural migration into overtaxed
urban areas presents one of the great challenges in development economics.
Thirty years of intensive study by economists have spawned numerous pol-
icy initiatives intended to combat the phenomenon. Nevertheless, despite
scattered success, many LDCs continue to report an increasing demographic
shift towards urban areas.
The pioneering work of Lewis (1954) and Todaro (1969) approached the
question of rural-to-urban labor migration in the face of high urban unem-
ployment and a high urban to rural wage ratio as an individual decision
making problem. The basic behavioral equation1 developed by Todaro can
be expressed by
Z τ
V (0) = [P (t)Yu (t) − Yr (t)]e−rt dt − C(0) (3.1)
t=0

where V (0) represents the present discounted value of the move, P represents
the probability of getting a job in the urban market, Yu represents urban in-
come, and Yr represents rural income. C is the one time cost of moving. This
model was then extended by Harris and Todaro (1970) to discuss productivity
and policy implications.
1
Cf. for example to Cole and Sanders 1985

71
While this straightforward approach has enabled economists to under-
stand some of the decision making involved in migration, it has been pointed
out that there are pervasive phenomena that are unexplained by this model.
One such issue is that of remittances. It has been found that 10%-30% of a
migrant’s income may be transferred in remittances (Lucas and Stark 1988).
There are various theories based on pure self interest or pure altruism to
explain these remittances. Lucas and Stark also discuss migration as a con-
tractual risk sharing proposition: as there are basically non-correlated risks
involved in rural production and urban job search, the two parties can co-
insure, thereby diversifying the risk. The extent to which such co-operative
behavior is found however, suggests that it might be useful to think of the
decision making process not at the individual level, but instead at the house-
hold level. As Lauby and Stark (1988) contend

“the decision to migrate and the choice of migration patterns are


often not made by the individual alone but rather are determined
by family resources and needs.”2

In fact there were already indications from the authors of the the original
individual migrant models that the ultimate goal should be to move the level
of analysis from laborer to household. According to Harris and Todaro:

“...this notion that migrants retain their ties to the rural sec-
tor is quite common and manifested by the phenomenon of the
extended family system and the flow of remittances to rural rela-
tives of large proportions of urban earnings. However, the reverse
flow, i.e. rural-urban monetary transfers is also quite common in
cases where the migrant is temporarily unemployed and therefore,
must be supported by rural relatives.”3

In this paper we will model rural-to-urban migration as a household de-


cision. Once this shift is made, it is possible to view a family’s migration
pattern as a portfolio allocation problem. The low wage rural market, where
it has traditionally been assumed that employment is full, is a safe, low return
asset. The high wage urban market is associated with uncertain employment,
and represents a risky asset. The wealth to be ‘invested’ in these assets is
taken to be the number of working family members.
2
Lauby J. and Stark O. 1988.
3
Harris and Todaro 1970 pg. 127.

72
Policy Considerations
In the face of high urban unemployment there has been an ongoing discussion
of the best policies by which to control rural to urban migration. This dis-
cussion has focused mainly on whether the migration incentive is best viewed
as a “push” from the low wage rural sector, or a “pull” from the high wage
urban sector. Throughout the development literature it is often stated as
self-evident that

“...an increase in agricultural income will induce reverse [urban-


to-rural] migration...”-Harris and Todaro 4

without supporting argument. As recently pointed out by Stark:

“[a] widely held view is that, since expected utility maximisation


under substantial intersectoral income differentials induces rural-
to-urban migration, policy measures designed to dampen ur-
ban incomes (a freeze on urban real wages) or to increase ru-
ral incomes (farm price supports) are essential... [Emphasis
added].” 5

When one views the increase in agricultural incomes simply in light of


their direct effect, it is clear why one would arrive at the conclusion that an
increase in the rural wage level would lead people to want to invest more
workers in this asset that is now more valuable. This, however, does not
take into account the possible ‘income’ effects on the investment decision
associated with the increased renumeration the household will experience if
it maintains at least some workers in the rural labour market. Given the
focus of migration literature on the expected income rather than expected
utility approach, it is perhaps not surprising that these effects have been
overlooked:

“Originating largely in Todaro’s pioneering article (Todaro, 1969),


the dominance of the expected income motive soon became vir-
tually exclusive. This is somewhat surprising, especially since
during the very same period both risk and (especially) risk avoid-
ance have assumed major significance in mainstream economics.
4
Harris and Todaro 1970, pp. 132.
5
Stark, O., 1991, pg. 16.

73
Yet the expected income hypothesis, even in its revised formu-
lations, is devoid of any explicit decisional risk content (Todaro,
1976, 1980); the hypothesis does not incorporate a random vari-
able (multiplicative or other), and the implied utility function is
linear.”6

It is only once one incorporates the effects of risk aversion into the fam-
ily’s decision making that one realises that there may be an income effect
associated with the rise in rural incomes that may have adverse effects.
It is shown in this paper that if household utility can be characterised by
Decreasing Absolute Risk Averse functions, then it is possible for increases
in rural wages to in fact lead to ‘perverse migration’; i.e. depending on which
point of their utility function the family is at, an increase in rural wages may
increase the incentive to send workers into the city.

3.2 ‘Income’ vs ‘Substitution’ Effects


The standard policy recommendation of raising rural wages to stem the flow
of rural-to-urban migration relies on what can be see as a substitution effect.
In the case of a consumption good X, a decrease in the price of X makes
the good more attractive, and the consumer will substitute away from other
goods towards good X. Similarly, when the return to assets invested in
the rural labour market increases, the family is expected to substitute its
investment towards the more attractive rural labour market. However, if the
family is risk averse, the change in income may have other effects.
It is has been empirically shown that the utility functions of economic
agents with respect to risk generally display the property of Decreasing Ab-
solute Risk Aversion (DARA). It can be shown (as it is below in Proposition
11) that when an agent with a DARA utility function is confronted by an
investment decision between a safe and a risky asset, increases in wealth will
lead the agent to invest more in the risky asset. This is suggestive of a pos-
sible analogy with the migration decision to send more family members to
look for urban employment.
The following proposition is well known, and the notation and discussion
here is based closely on the treatment in Green, Whinston and Mas-Collel.
6
Stark O. 1991 Pg 18.

74
Proposition 11 Assume that u(x) is the Bernoulli utility function of an
investor displaying decreasing absolute risk aversion (DARA). Assume fur-
ther that the investor is given the opportunity to invest amounts of wealth
w = w1 , w2 in shares of a safe asset returning r dollars for every r dollars
invested and a risky asset returning z dollars with cumulative distribution
function F . Then if w1 < w2 , the optimal investments in the risky asset will
obey α1∗ < α2∗ .

Proof:
We will show that if the utility function displays the property of Decreas-
ing Absolute Risk Aversion, then at higher levels of wealth the individual
will invest more in the risky asset.
Consider the two levels of initial wealth w1 < w2 . Denote the increments
or decrements to wealth by z. Then the individual evaluates risk at wi by
the Bernoulli utility functions ui (z) = u(wi + z).
Because u is posited to exhibit DARA we are assured that whenever
w1 < w2 , u1 (z) = u(w1 + z) is a concave transformation of u2 (z) = u(w2 + z).
The expected utility maximising investor with wealth wi seeks to max-
imise the expression
Z Z
u(wi − αi + αi z)dF (z) = ui (−αi + αi z)dF (z) = 0. (3.2)

Assuming an interior solution, the first order conditions determining the


optimal level of investment αi∗ in the risky asset is:
Z
φi (αi ) = (z − 1)u0i (αi∗ [z − 1])dF (z) = 0

(3.3)

As we know, the concavity of u(·) implies that the functions φi (·) are
decreasing. Therefore if we show φ1 (α2∗ ) < 0, it must follow that α1∗ < α2∗ ,
which is the result we are seeking. Now by the hypothesis that u exhibits
DARA there must exist an increasing concave function ψ(x) such that the
concave transformation, u1 (x) = ψ(u2 (x)) holds for all x. ψ 0 (x) is therefore
positive and decreasing. This allows us to assert that

Z
φ1 (α2∗ ) = (z − 1)ψ 0 (u2 (α2∗ [z − 1]))u02 (α2∗ [z − 1])dF (z) < 0. (3.4)

75
The reasoning is that integral
Z
φ2 (α2 ) = (z − 1)u02 (α2∗ [z − 1])dF (z)

(3.5)

Z 1 Z ∞
= (z − 1)u02 (α2∗ [z − 1])dF (z) + (z − 1)u02 (α2∗ [z − 1])dF (z) = 0. (3.6)
−∞ 1

is closely related to (3.4) with the first summand in line (3.6) exhibiting
a negative integrand and the second summand containing a positive one.
The integral in line (3.5) differs only from the integral in equation (3.4) by
the presence of the ‘weighting function’ ψ 0 (u2 (α2∗ [z − 1])) which due to its
decreasing nature will weight the negative integral more heavily. This gives
the desired result.
QED
Thus we see that increases in the wealth variable w lead to greater in-
vestment in the risky asset. Under such a shift in the budget line we see that
the risky asset behaves as a normal good.
The migration model can be regarded as just such a portfolio allocation
decision but with an important twist: the ‘wealth’ of the household is rep-
resented not by an amount of money but by the labor force within the unit.
In such a situation, direct application of the proposition above applies to an
increase in the family size n leading to an increase in the number of ‘assets’
(namely workers) being sent to the urban market. While this would give
rise to interesting population policy considerations, it is not the case being
discussed here.
It is reasonably clear that an increase in the rural wage cannot be for-
mulated to fit the structure of the above proposition. Were this possible, we
would be left with the uncomfortable conclusion that rural wage increases
could lead only to greater rural-to-urban migration and migration models
would be universally perverse. That this cannot be the case can be seen by
considering an increase in the rural wage beyond urban levels.
In the case under consideration, the potential for an increase in expected
return on investment is coming from an increase in the return to the ‘safe
asset’. Therefore, while we would expect something like the income effect
discussed above, there is also a substitution effect: the increase in the rural
wage naturally increases the relative attractiveness of the rural market. In
fact migration theory has focused on this substitution effect to the exclusion
of the ‘income’ effect, thereby positing the sort of policy discussed above.

76
What we see here is that with DARA utility functions, and the fact that the
increase in income is coming precisely from an increase in the return to the
safe asset, income and substitution effects move in opposite directions.
In the case of consumer goods it is assumed that even when these ef-
fects conflict, it is highly unlikely that the income effect will outweigh the
substitution effect. This is generally true because in order for the income
effect of a price rise in any good to be large, the good must constitute a large
proportion of the consumption basket. While this is unlikely to be true for
any good in a consumption situation, it is almost certainly true in the case of
employment. As there are only two assets between which a consumer must
choose, it is very likely that income effects will be substantial. The omission
of any discussion of this when forming policy can therefore lead to counter
productive efforts.

77
3.3 A Risk Sensitive Household Migration Model.
Our aim in this section is to construct household migration models which
incorporate risk sensitivity into labour allocation decisions. While this seems
straightforward enough, a literature search indicates that such models are
either quite obscure or have not been introduced at all.7
There is much debate in the current development literature about the
appropriate methods of modeling household decision making. The problem
is rich in intricacies which include principal-agent problems, gender specific
labour8 , migrant network structures and various game theoretic considera-
tions. Given the wide variety of phenomena which are being uncovered, it is
unlikely that a ‘universal’ model will be settled upon any time soon. For the
sake of simplicity we will thus work with a household whose decision making
is well approximated by the maximisation of a household utility function that
views the decisions as being made by a single entity. While such a choice
does not pretend to universality, the situation is known to arise from a variety
of situations (e.g. a benevolent dictatorship, a nuclear family whose inter-
ests are similar enough to work well in aggregation or an individual decision
maker) and may lend itself to most easily to varied adaptation.
We define below the category of rural household models used in this paper.

Definition 16 We define the data specifying a Rural Household Labour-


Market Allocation Model H to be

H = (ν, n, wr , wu , πr , πu , c) (3.7)

where

1. The sub-utility function of the household is given by a twice continu-


ously differentiable function ν : R −→ R such that

dν(x)
>0 ∀x ≥ 0 (3.8)
dx
guaranteeing that the function is monotonically increasing.
7
In his 1991 book, Stark encountered a related lacuna writing “To the best of our
knowledge, the argument that aversion to risk is a major cause of rural-to-urban migration
has appeared only in Stark (1978,1981)”.
8
Gender issues in particular may play a role in migration decision making according to
the suggestion of Lauby and Stark 1988.

78
2. The number of workers in the household is given by a positive integer
n ∈ Z+ .
3. The rural wage is given as 0 < wr .
4. The urban wage wu is given by a value satisfying wr ≤ wu .
5. Rural employment is full, i.e. πr = 1.
6. The probability of urban employment is 0 < πu ≤ πr .
7. A Utility function UH (m) giving the expected utility of allocating m of
the household’s n workers for rural labour while sending the remaining
n − m workers to search for urban employment.
We can then calculate the expected utility of a family allocating m of
its n workers for rural labour. In this paper we shall assume for simplic-
ity that both labour markets make all employment offers at a single time
(e.g. spot labour markets in which all work opportunities are offered before
the beginning of the work day so that the chance of later employment is
negligible.)
Proposition 12 The expected utility of allocating m of the family’s n work-
ers to the rural labour market is given by
U(ν,n,wr ,wu ,πr ,πu ,c) (m) = U H (m) (3.9)
m n−m
X  m  n − m  i m−i j n−m−j
X
= (πr ) (1 − πr ) (πu ) (1 − πu ) ν(i(wr ) + j(wu ) − (n − m)c).
i j
i=0 j=0

Proof:
Let us assume that we have an ordered set of r identical though unrelated
binary events {Xi }ri=1 (e.g. urban job searches) which all carry the probabil-
ity πA of outcome A (urban employment) and 1 − πA of outcome B (urban
unemployment). Then the probability π that only the first s events yield the
outcome A is given by
π = πAs (1 − πA )r−s . (3.10)
If one wishes to instead calculate the probability π̃ that exactly s of the
possible outcomes result in A (without respect to which events return the
result) the probability is given by the discrete binomial distribution:
 
r! s r−s r
π̃ = πA (1 − πA ) = πAs (1 − πA )r−s (3.11)
s!(r − s)! s

79
 
r
where the binomial coefficient is the number of distinct ways of getting
s
the outcome A from exactly s of the r events {Xi }ri=1 .
The above analysis tells us that if either all workers are allocated to a
single labour market (m = n or m = 0) or both rural and urban probabilities
are identical (πr = πu ), the resulting probability structure will be given by a
discrete binomial distribution. In our situation however we have two markets
which are governed by non-conditional probabilities which are in general not
equal. Therefore, since such probabilities are multiplicative, the expected
utility of a household which allocates m of its n workers for rural labour is
given by the following ‘hybrid’ distribution:

U (m, n − m) = U H (m) (3.12)

m n−m
X X m! i m−i (n − m)! j n−m−j
= (πr ) (1 − πr ) (πu ) (1 − πu ) ν(iwr + jwu − (n − m)c) (3.13)
i=0 j=0 i!(m − i)! j!(n − m − j)!

m n−m
X X  m  n − m 
= (πr )i (1−πr )m−i (πu )j (1−πu )n−m−j ν(iwr +jwu −(n−m)c)
i j
i=0 j=0

QED

80
3.4 Perverse Migration Incentives
We will show below that the migration model introduced in the previous
section may behave in a fashion exactly opposite to that expected from the
migration literature. In such situations, attempts to boost the attractiveness
of rural labour will serve to increase the appeal of migration to the urban
labour market.
While such phenomena can be shown to occur for families with any num-
ber of workers n exceeding 2, the higher cases do not contribute any quali-
tatively new phenomena; we will thus restrict our attentions to a 2 worker
household for ease of illustration.
In order to clarify our exposition, it will be helpful to define a condition
somewhat weaker than perverse migration which refers only to the effect
on incentive structure. The following definition is given for the two worker
household although the concept can be defined more generally:

Definition 17 We will define a rural wage increase to have a perverse effect


on incentives (perverse utility for short) if the marginal increase of U (1, 1)
is greater than U (2, 0). Thus perverse migration implies perverse utility.

In what follows it is also helpful to set concrete representatives for the


functions whose coefficients of risk aversion are equal to unity:

Definition 18 The functions representing CARA and CRRA equal to unity


are taken to be −exp(−x) and ln(x) respectively.

With these definitions set, let us look at the case of a two worker house-
hold. In this particular case we have the explicit formulas for the expected
utility of the 3 possible labour allocations:

Uwr (2, 0) = ν(2wr ) (3.14)

Uwr (1, 1) = (1 − Πu )ν(wr ) + (Πu )ν(wr + wu ) (3.15)


Uwr (0, 2) = (1 − Πu )2 ν(0) + 2(Πu )(1 − Πu )ν(wu ) + (Πu )2 ν(2wu ) (3.16)
so these utilities change at the marginal rates

∂Uwr (2, 0)
= 2ν̇(2wr ) (3.17)
∂wr

81
∂Uwr (1, 1)
= (1 − Πu )ν̇(wr ) + (Πu )ν̇(wr + wu ) (3.18)
∂wr
∂Uwr (0, 2)
=0 (3.19)
∂wr
relative to changes in the rural wage.
As can be seen from the formula in line (3.16), the strategy of sending both
workers to the urban labour market carries a significant risk of a complete
loss of income. It is thus important to make sure that whatever functional
form is posited for ν(x) above wr , the value for ν(0) must be low enough
to prevent ‘gambling’ with the (0, 2) strategy in all but the most extreme
circumstances as the migrants under consideration are taken to belong to
risk averse subsistence level households.9
If we assume that the family was initially indifferent between having 1 or
2 workers in the rural market
Uwr (2, 0) − Uwr (1, 1) = 0 (3.20)
then asking whether rural wage increases will lead to perverse migration is
equivalent to asking whether
∂Uwr (2r , 0u ) ∂Uwr (1r , 1u )
− = 2ν̇(2wr ) − (1 − Πu )ν̇(wr ) − (Πu )ν̇(wr + wu )
∂wr ∂wr
(3.21)
is positive or negative.
In order for the increase in rural wages to cause an increase in the utility
of sending workers to the urban labour market,
2ν̇(2wr ) < (1 − Πu )ν̇(wr ) + (Πu )ν̇(wr + wu ) (3.22)
On the pages that follow it is shown that (for a two worker household)
any unbounded DARA utility function ν which is a concave transformation
of the CRRA function ln(x) is capable of producing perverse migration at
every rural wage level. Intuitively this indicates that functions which are
‘intermediate’ between CARA and CRRA will exhibit perversity for a two
worker household.
9
See for example Sen (1984) Pg.260
“In matters of ‘life and death’ such as these decisions involve, affecting one’s
entire economic existence, the assumption of expected utility maximization
is recognized to be quite restrictive.”

82
First, we establish that for any concave transformation of the CRRA
function ln(x), there are always urban employment probabilities leading to
perverse incentives (without restriction on the urban wage).

Theorem 13 Assume that a 2 worker household is governed by a DARA


sub-utility function
ν(x) = ψ(ln(x)) (3.23)
which can be represented by an increasing transformation ψ of the constant
relative risk averse utility function ln(x) such that

ψ 0 (ln(2wr ))
c> 0 ∀wr ∈ [a, b] (3.24)
ψ (ln(wr ))

for some constant c ∈ [0, 1]. Then for any probability of urban employment
Πu satisfying
1 − c ≥ Πu (3.25)
all rural wage increases will lead to perverse incentives by increasing the
relative expected utility of the urban labour market.

Proof:
By hypothesis we have
1 − Πu ≥ c (3.26)
implying the inequality

ψ 0 (ln(2wr ))
(1 − Πu ) > 0 (3.27)
ψ (ln(wr ))

for any choice wr ∈ [a, b]. This may be rewritten successively as

ψ 0 (ln(2wr )) < (1 − Πu )ψ 0 (ln(wr )) (3.28)

1 1
2ψ 0 (ln(2wr )) < (1 − Πu )ψ 0 (ln(wr )) (3.29)
2wr wr
dψ(ln(x)) |2wr dψ(ln(x)) |wr
2 < (1 − Πu ) (3.30)
dx dx

2ν̇(2wr ) < (1 − Πu )ν̇(wr ) (3.31)

83
implying

2ν̇(2wr ) < (1 − Πu )ν̇(wr ) + (Πu )ν̇(wr + wu ) (3.32)

∂Uwr (2, 0) ∂Uwr (1, 1)


< (3.33)
∂wr ∂wr
This shows that for ν = ψ(ln(x)), the possibility of sending 1 of the two
workers to the urban labour market becomes comparatively more attractive
for marginal increases in the rural wage.
QED

From this theorem we can see that while the standard CARA function
−exp(−x) is not everywhere a concave transformation of the CRRA ln(x),
it nevertheless leads to perverse incentives.

Corollary 14 The function

ν(x) = −exp(−x) (3.34)

with constant absolute risk aversion leads to perverse incentives. Specifically,


let
ψ = −exp(−exp(x)) (3.35)
Then if wr > 1 and
ψ 0 (ln(2wr ))
Πu < 1 − (3.36)
ψ 0 (ln(wr ))
then increases in the rural wage lead to perverse utility.

Proof: The function


ψ = −exp(−exp(x)) (3.37)
is a smooth increasing function which for x > 0 is always concave. Therefore
for wr > 1 we have the function of constant absolute risk aversion as a
concave transformation of the function of constant relative risk aversion

ν(x) = −exp(−x) = −exp(−exp(ln(x))) = ψ(ln(x)) (3.38)

84
as ln(1) = 0. By the fact that ψ(x) is an increasing concave function above
x = ln(1) = 0, we are assured that for wr large

ψ 0 (ln(2wr ))
<1 (3.39)
ψ 0 (ln(wr ))

and there will thus exist probabilities 1 > Πu > 0 such that

ψ 0 (ln(2wr ))
< 1 − Πu (3.40)
ψ 0 (ln(wr ))

and the corollary follows.


QED

While perverse incentives indicate that raises in the rural wage are ineffec-
tive for combating rural to urban migration, they do not necessarily indicate
that effort is counterproductive to the point of inducing migration. The fol-
lowing theorem shows that if the concave transformation ψ is unbounded,
then for every choice of wr , it will always be possible to find an urban wage
wu which will lead to perverse migration for a marginal increase in the rural
wage.

Theorem 15 Let the notation be as in the above theorem with ψ a concave


transformation and Πu a probability leading to perverse utility for wr ∈ [a, b].
If ψ has
lim ψ(x) = ∞ (3.41)
x→+∞

then
∀wr ∈ [a, b] ∃wu > wr (3.42)
such that the family is indifferent between the pure rural strategy and the
mixed rural/urban strategy

Uwr (2, 0) = Uwr (1, 1) (3.43)

with
∂Uwr (2, 0) ∂Uwr (1, 1)
< (3.44)
∂wr ∂wr
leading to perverse migration.

85
Proof:
First, we recall that the expected utility for the ‘mixed strategy’ is given
by:
Uwr (1, 1) = (1 − Πu )ν(wr ) + (Πu )ν(wr + wu ). (3.45)
Now by hypothesis we know that
ν(x) = ψ(ln(x)) (3.46)
with ψ(x) unbounded from above. However, since ln(x) is also unbounded,
we see from (3.45) that
lim Uwr ,wu (1, 1) = ∞. (3.47)
wu →+∞

However, since the family is assumed to be risk averse we are assured that if
wu were set equal to wr we would have
Uwr ,wu (2, 0) > Uwr ,wu (1, 1). (3.48)
Thus by the intermediate value theorem we are assured that

∃wu > wr s.t. Uwr ,wu (2, 0) = Uwr ,wu (1, 1) (3.49)
as Uwr ,wu (2, 0) has no dependence on wu . Thus if wu is so chosen for a par-
ticular rural wage wr ∈ [a, b], the household will exhibit perverse migration
for marginal increases in the rural wage wr .
QED

In the following corollary, the previous theorem is applied to a DARA


function specified by its coefficient of absolute risk aversion. This produces an
infinite number of models whose values for Πu and wu are given as functions
of wr and which lead to perverse migration.
Corollary 16 Let κ > 1. Consider the DARA function ν(x) whose coeffi-
cient of absolute risk aversion for x > κ is given by
v 00 (x) 1 1
− 0
= (1 + ) (3.50)
v (x) x ln(x)
Then if wr is any value greater than κ, the choices
ln(wr )
Πu = 1 − (3.51)
ln(2wr )

86
ln(wr )
ln(ln(2wr )) − ln(2wr )
ln(ln(wr ))
wu = exp(exp( ln(wr )
)) − wr (3.52)
1− ln(2wr )

lead to a 2 worker household being indifferent between the pure rural strategy
and a mixed urban/rural strategy with marginal increases in the rural wage
leading to perverse migration.

Proof: First, it can be readily checked that (3.50) is decreasing for x > 1
and can be solved by
ν(x) = ln(ln(x)) + c (3.53)
where for simplicity we take c = 0.
Next we see that if we take the difference in utilities between the two
strategies under consideration we have

Uwr (2, 0) − Uwr (1, 1) (3.54)


= ν(2wr ) − (1 − Πu )ν(wr ) − (Πu )ν(wr + wu ) (3.55)
which for the choices above lead to:
ln(wr )
= ln(ln(2wr )) − ln(ln(wr ))− (3.56)
ln(2wr )
ln(wr )
ln(wr ) ln(ln(2wr )) − ln(2wr )
ln(ln(wr ))
(1 − ) ln(wr )
=0 (3.57)
ln(2wr ) 1− ln(2wr )

Lastly we see by design that

2ν̇(2wr ) = (1 − Πu )ν̇(wr ) < (1 − Πu )ν̇(wr ) + (Πu )ν̇(wr + wu ) (3.58)

So perverse migration is assured.


QED

Relations to Income Stabilisation


While this paper has focused on raising a certain rural wage, the problem
of increasing effective rural incomes may take a slightly different form. We

87
discuss below a proposed method of relating the models of the current paper
with models which deal with uncertainty in the agricultural incomes.
The policies proposed for raising the attractiveness of rural labour may
attempt to do so by raising rural wages directly, or by attempting to reduce
the fluctuations associated with agricultural incomes. In the latter case it is
possible to model rural incomes as lotteries in which the certainty of rural
employment is preserved but the possible incomes fluctuate with agricultural
prices. The importance of this change in perspective is that it could be used
to incorporate the effects of agricultural price stabilisation policy into the
rural-to-urban migration decision.
In this situation we would imagine that we would have two cumulative
distribution functions with G1 (x) representing the natural variation in rural
incomes and G2 (x) giving the variation which would result from government
stabilisation efforts. If the government is successful in doing no more that sta-
bilising rural incomes we could expect G1 (x) to determine a mean preserving
spread of G2 (x). In either case, if we call Z01,2 the certainty equivalents
Z
i −1
Z0 = ν ( ν(x) dGi (x)) (3.59)

of the two lotteries G1 (x), G2 (x), we know that for a household governed by
a risk averse utility function ν(x) that

Z01 < Z02 (3.60)

as G2 second-order stochastically dominates G1


Thus if one is able to impose a policy so that the original rural distribu-
tion of incomes forms a mean preserving spread of the distribution resulting
from the intervention, a situation will result which is in large part analogous
to the raising of the rural wage (where wr is analogous to Z0 ) considered in
the current paper.

88
3.5 Examples
Once one is made aware of the possibility of perverse migration, examples of
perverse models appear to be ubiquitous. It is thus not too difficult to find
examples of models where:

1. The utility function ν(x) exhibits decreasing absolute risk aversion.

2. The values for probabilities, and wages are not terribly different from
those found in actual labour markets.

While such models are not intended to mirror any particular labour mar-
ket, they do indicate that such models exist without recourse to unrealistic
hypotheses.
Let us then look at examples in which we expect perverse migration. We
will illustrate the phenomena by graphing the utility differences of the two
main allocation strategies as a function of the rural wage.
It has been shown that any unbounded DARA utility function ν(x) which
is a concave transformation of the CRRA function ln(x) is capable of pro-
ducing perverse migration. Here we look at a specific function and evaluate
ranges of rural and urban wages and urban probabilities that cause perversity.
Let us assume the household’s utility function is equal to
x
ν(x) = ln(ln ) (3.61)
10
in the range of interest (e.g. Rs. 11 and up).
We are able to plot a graph of the Rural Advantage Function (RAF) for
the household with the utility function ν. This looks at how the difference
between the utility of keeping two workers in the country and the utility of
keeping one worker in the city and one in the country changes with rises in
the rural income.

RAF (wr ) = Uwr (2, 0) − Uwr (1, 1) = ν(2wr ) − (1 − Π)ν(wr ) − (Π)ν(wr + wu )


(3.62)
Let us look at this for particular values of urban employment probabilities
Π, and urban wage rates wu . On the X axis we plot rural wage rates, and on
the Y axis is measured the difference in utilities. When rural wages increase
into the realm where the RAF becomes negative, the wage increase is seen

89
to cause perverse migration as it becomes preferable for the family to send
another worker into the city.
We will look at two specific cases; in the first case we set the urban prob-
ability equal to .5 and observe the range of rural wages wr which will cause
perverse migration with an urban wage of Rs. 200 (It will be noted in this
example that a rural wage increase would have to more than double the rural
earnings before it brought the migrant out of the range of perversity). In the
second case we look at a situation with an urban probability of employment
of .4 and urban wage rate set at Rs. 500.
We assume in these examples that the negative utility of a zero wage
is taken to be large enough that the family will consider it prohibitive to
gamble both workers in the city (as this strategy necessarily entails a high
risk of earning nothing).

90
Case 1
We set:
Πu = .5
wu = Rs.200
Perverse migration is caused whenever rural wages wr rise or fall to any
wage within the range of Rs. 15 to Rs. 44. (See Graph 1)

Graph 1:

Probability of Urban Employment = .5


Urban Wage = Rs.200

Figure 3.1: First Graphical Example of Perverse Migration

91
Case 2
We set:
Πu = .4
wu = Rs.500
Perverse migration is caused whenever rural wages wr rise or
fall to any wage within the range of Rs. 20 to Rs. 56. (See Graph
2)

Graph 2:

Probability of Urban Employment = .4


Urban Wage = Rs.500

Figure 3.2: Second Graphical Example of Perverse Migration

92
3.6 Conclusion
When developing migration control policy, understanding the deci-
sion making processes behind rural to urban migration is essential.
It has in general been assumed in the literature that there is a
theoretical equivalence between raising rural wages to dampen the
“push” factors of the low wage rural sector, and putting in place
urban wage controls, to deal with the “pull” factor of a high wage
urban sector. It is shown here, however, that once attitudes to risk
are incorporated, it is is possible to find that raising rural wages
may in fact have a counterproductive effect.
The migration literature has, until very recently, been based
closely on the Todaro model. While this model captures the basic
incentive structure for migration, it makes two simplifying assump-
tions that are generally acknowledged to be unrealistic, i.e.

1. The atomic hypothesis: migration decisions are made by the


migrants alone (i.e. workers are unremitted and unremitting).

2. Pure risk neutrality: Expected income rather than expected


utility is maximised.

Once these assumptions are lifted, it is found that one of the basic
conclusions of the Todaro model can no longer be relied upon.
Perverse incentives have been shown in this paper to afflict the
category of Decreasing Absolute Risk Averse utility functions. As
this is precisely the utility structure one expects to find in the gen-
eral situation we are left with the question of how to evaluate the
likely migration effects of a generic rural wage support program.
This paper would argue that without specific information on the
particulars of the potential migrants being targeted (e.g. family
sizes, objective functions, wage rates and probabilities) the usual
conclusion drawn in the literature is not assured.

93
3.7 Appendix B: The Relationship Between
the Rural Household Labour Allocation
Model and the Todaro Model
The original Todaro equation serves something of a dual role in
the development literature; while it is in some sense one of the
simplest migration models, it also functions as the progenitor for
the more sophisticated models which followed it. It may there-
fore be worthwhile to see how the data specifying a rural labour
allocation model (RHLMAM) compares with the data specifying
Todaro’s equation.
In Todaro’s model (3.1), the basic behavioral equation gives a
single number V (0) whose sign is the answer to the discrete op-
timisation problem; if and only if the number is positive will the
rational individual choose to migrate. In the case of a household,
2
the number of options is enlarged so that there are n 2+n decisions to
evaluate. We thus posit that the natural analog of Todaro’s equa-
tion for a remitting household is given by the following ‘behavioral
matrix’ M H .
Definition 19 The (n+1)×(n+1) skew-symmetric behavioral matrix
M H associated to a RHLMAM is given by the entries {MijH }ni,j=0
where MijH = U H (i) − U H (j)
The number of rural workers m which will optimise the expected
utility of the household H is then the number of the row whose
entries are all non-negative. The greatest number row with this
property will be called O(H). If we denote the collection of house-
hold models by H:
Definition 20 Let O : H −→ Z be the integer valued function which
gives the number of workers who should be allocated to rural
labour in order to maximise the household’s expected utility.
With these conventions established we are now in a position to
explain how these RHLMAMs relate to the established literature.
In the following proposition it is shown that these models gener-
alise the (static) Todaro model to families or networks of labourers
governed by risk-sensitive household welfare functions.

94
Proposition 17 Let us assume that the following hypotheses are
made:
1. The household work force consists of an individual: n = 1.
2. The individual is risk neutral: ν(x) = x.
3. The interest rate is negligible: r = 0.
4. The probabilities of urban and rural employment are given by
constants as are the urban and rural wages: πu = P (t), πr =
1, Yu (t) = wu and Yr (t) = wr .
5. The time period τ in Todaro’s equation is set equal to unity:
τ = 1.
Then the RHLMAM is equivalent to the Todaro equation (3.1)
Proof: In order to evaluate the migration decision presented by a
single worker RHLMAM, one need only examine the M01 compo-
nent of the 2 × 2 behavioral matrix.
H
M01 = UH (0) − UH (1) (3.63)
which when expanded from equation (3.13) yields
= πu ν(wu − c) + (1 − πu )ν(0 − c) − πr ν(wr ) − (1 − πr )ν(0) (3.64)
= πu wu − πr wr + (−c)(πu + (1 − πu )) (3.65)
= πu wu − wr − c (3.66)
Z τ
= [P (t)Yu (t) − Yr (t)]e−rt dt − C(0) = V (0) (3.67)
t=0
giving equation (3.1) (the Todaro behavioral equation).
QED.

This shows that in a well defined sense the category of RHLMAMs


can be considered as an expansion of the seminal Todaro model gen-
eralised to include risk preferences and household objective func-
tions. The following corollary is then immediate.
Corollary 18 Every RHLMAM H with n = 1 and νH (x) = x speci-
fies a Todaro model.

95
Bibliography

[1] Afriat S., The Price Index, Cambridge: Cambridge University


Press

[2] Allen, R.G.D., Index Numbers in Theory and Practice, Lon-


don: Macmillan 1975

[3] Banerjee B. and Kanbur, S.M. “On the Specification and Es-
timation of Macro Rural-Urban Migration functions: With an
Application to Indian Data” Oxford Bulletin of Economics
and Statistics 43, 7-29, 1981

[4] Bennet,T.L. “The Theory of Measurement of Changes in Cost


of Living,” Journal of the Royal Statistical Society 83, 455-
462, 1920.

[5] Besse, A.L. Einstein Manifolds, Ergebnisse der Mathematik


und ihrer Grenzgebiete: 3. Folge, Band 10, Springer-Verlag,
Berlin 1987.

[6] Bishop, R.L., and Goldberg, S.I. Tensor Analysis on Mani-


folds, New York: Dover Publications 1980

[7] Chenery, H. and Srinivasan, T.N., Eds. Handbook of Devel-


opment Economics, Vol 1, Elsevier Science Publishers 1988

[8] Cole, W. and Sanders, R. “Internal Migration and Urban Em-


ployment in the Third World” American Economic Review
75, 481-494, 1985

[9] Deaton, A. and J. Muellbauer Economics and Consumer Be-


havior, Cambridge : Cambridge University Press 1980.

96
[10] Diewert, W.E., Pesonal Communication to the Author,
Spring, 1996

[11] Diewert, W.E., and Nakamura, A.O., Eds. Contributions to


Economic Analysis Essays in Index Number Theory Vol 1.,
North-Holland 1993

[12] Dubrovin, B.A., Fomenko, A.T. and Novikov, S.P. Modern


Geometry - Methods and Applications. Part II: The Geom-
etry and Topology of Manifolds., Graduate Texts in Mathe-
matics, Vol.124, New York: Springer-Verlag 1985

[13] Friedman, M.. and Savage, L.J. “The Utility Analysis of


Choices Involving Risk” Journal of Political Economy 56,
279-304,1948

[14] Frisch, R. “Necessary and Sufficient Conditions Regarding


the Form of an Index Number Which Shall Meet Certain of
Fisher’s Tests” American Statistical Association Journal, 25
, 397-406, 1930

[15] Garrison, H. “Internal Migration in Mexico: A Test of the


Todaro Model” Food Research Institute Studies 18, 197-214,
1982

[16] Goss J. and Lindquist B. “Conceptualizing International


Labour Migration: A Structuarian Perspective” International
Migration Review 29, 317-351, 1995

[17] Harris J, and Todaro, M. “Migration, Unemployment and De-


velopment: A Two-Sector Analysis.” American Economic Re-
view 60, 126-142, 1970

[18] Hulten, Charles R., “Divisia Index Numbers” Econometrica,


41, No. 6, 1973

[19] Jorgenson, D.W., and Z.Griliches, “The Explanation of Pro-


ductivity Change,”Review of Economic Studies, 34, 249-283,
1967

97
[20] Jorgenson, D.W., and Z.Griliches, “Divisia Index Numbers
and Productivity Measurement,” Review of Income and
Wealth, 17, 227-29, 1971

[21] Katz, E. and O. Stark, “ Migration and Risk Aversion in Less


Developed Countries” Journal of Labour Economics 4, 134-
149, 1986.

[22] Lauby, J. and Stark, O. “Individual Migration as a Family


Strategy: Young Women in the Philippines” Population Stud-
ies 42, 473-486, 1988

[23] Lewis W.A. “Economic Development with Unlimited Supplies


of Labour” Manchester Sch. Econ. Soc. Stud 22, 139-91, 1954

[24] Lucas, R. and Stark, O. “Migration, Remittances and the Fam-


ily” Economic Development and Cultural Change 93, 465-481,
1988

[25] Mas-Collel, A., Whinston, M. and Green J. Microeconomic


Theory Oxford University Press 1995

[26] Niehans, J. A History of Economic Theory The Johns


Hopkins University Press 1990

[27] The MIT Dictionary of Modern Economics, Fourth Edition,


D. W. Pearce and R. Shaw Eds., MIT Press, Cambridge Mas-
sachusettes, 1994

[28] Pollak, Robert A. “The Theory of the Cost of Living In-


dex” Price Level Measurement, Diewert, W.E., and Mont-
marquette, C., Eds., 1983

[29] Richter, M.K., “Invariance Axioms and Economic Indexes,”


Econometrica 34, 739-755,1966.

[30] Samuelson, P.A. and S. Swamy, “Invariant Economic Index


Numbers and Canonical Duality: Survey and Synthesis,”
American Economic Review 64, 566-593,1974.

98
[31] Salvatore, D. “A Theoretical and Empirical Extension of the
Todaro Migration Model”Regional Science and Urban Eco-
nomics 11, 499-508 1981

[32] Sen, A. K. “Welfare Basis of Real Income Comparisons” Jour-


nal of Economic Literature 17, 1-45, 1979

[33] Sen, A.K. “Employment, Institutions and Technology: Some


Policy Issues” Reprinted in Resources Values and Develop-
ment Harvard University Press 1984

[34] Stark, O. The Migration of Labor, Basil Blackwell 1991

[35] Todaro, M. “Urban Job expansion, Induced Migration and


Rising Unemployment in Less Developed Countries” Ameri-
can Economic Review 59, 138-148, 1969

[36] Weinstein, E., 1992. Extension of Self-Dual Yang-Mills Equa-


tions Across the 8th Dimension, PhD. Dissertation, Mathe-
matics Department, Harvard University.

99

You might also like