Mendelian Randomization Mendelian Randomization
Mendelian Randomization Mendelian Randomization
MENDELIAN RANDOMIZATION
MENDELIAN
Mendelian Randomization: Methods for Using Genetic Variants in Causal Es-
timation provides thorough coverage of the methods and practical elements of
Mendelian randomization analysis. It brings together diverse aspects of Mendelian
randomization spanning epidemiology, statistics, genetics, and econometrics.
RANDOMIZATION
Through several examples, the first part of the book shows how to perform simple
applied Mendelian randomization analyses and interpret their results. The second
part addresses specific methodological issues, such as weak instruments, multiple
instruments, power calculations, and meta-analysis, relevant to practical applica-
tions of Mendelian randomization. In this part, the authors draw on data from the
C-reactive protein Coronary heart disease Genetics Collaboration (CCGC) to illus-
trate the analyses. They present the mathematics in an easy-to-understand way by
using nontechnical language and reinforcing key points at the end of each chapter.
Methods for Using
The last part of the book examines the potential of Mendelian randomization in the
future, exploring both methodological and applied developments. Genetic Variants
in Causal Estimation
Features
• Offers first-hand, in-depth guidance on Mendelian randomization from
leaders in the field
• Makes the diverse aspects of Mendelian randomization understandable to
newcomers
• Illustrates the technical details using data from a large collaborative study
• Includes other real-world examples that show how Mendelian randomization
is used in studies involving inflammation, heart disease, and more
• Discusses possible future directions for research involving Mendelian
randomization
This book gives you the foundation to understand issues concerning the use of Burgess • Thompson Stephen Burgess
genetic variants as instrumental variables. It will get you up to speed in undertak-
ing and interpreting Mendelian randomization analyses. Chapter summaries, paper
summaries, web-based applications, and software code for implementing the sta- Simon G. Thompson
tistical techniques are available on a supplementary website.
K16638
w w w. c rc p r e s s . c o m
MENDELIAN
RANDOMIZATION
Methods for Using
Genetic Variants
in Causal Estimation
Stephen Burgess
Department of Public Health and Primary Care
University of Cambridge, UK
Simon G. Thompson
Department of Public Health and Primary Care
University of Cambridge, UK
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
© 2015 by Taylor & Francis Group, LLC
CRC Press is an imprint of Taylor & Francis Group, an Informa business
This book contains information obtained from authentic and highly regarded sources. Reasonable
efforts have been made to publish reliable data and information, but the author and publisher cannot
assume responsibility for the validity of all materials or the consequences of their use. The authors and
publishers have attempted to trace the copyright holders of all material reproduced in this publication
and apologize to copyright holders if permission to publish in this form has not been obtained. If any
copyright material has not been acknowledged please write and let us know so we may rectify in any
future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced,
transmitted, or utilized in any form by any electronic, mechanical, or other means, now known or
hereafter invented, including photocopying, microfilming, and recording, or in any information stor-
age or retrieval system, without written permission from the publishers.
For permission to photocopy or use material electronically from this work, please access www.copy-
right.com (http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222
Rosewood Drive, Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organization that pro-
vides licenses and registration for a variety of users. For organizations that have been granted a photo-
copy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are
used only for identification and explanation without intent to infringe.
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
and the CRC Press Web site at
http://www.crcpress.com
Contents
Preface xi
Abbreviations xiii
Notation xiv
vii
viii Contents
7.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
7.2 Demonstrating the bias of IV estimates . . . . . . . . . . . . 100
7.3 Explaining the bias of IV estimates . . . . . . . . . . . . . . 102
7.4 Properties of IV estimates with weak instruments . . . . . . 106
7.5 Bias of IV estimates with different choices of IV . . . . . . . 109
7.6 Minimizing the bias of IV estimates . . . . . . . . . . . . . . 112
7.7 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
7.8 Key points from chapter . . . . . . . . . . . . . . . . . . . . 121
Bibliography 185
Index 207
Preface
The quantity of research into the genetics of common diseases has exploded
over the last 20 years. While many genetic variants related to various diseases
have been identified, their usefulness may lie more in what they offer to our
understanding of the biological mechanisms leading to disease rather than
to, for example, predicting disease risk. To understand mechanisms, we need
to separate the relationships of risk factors with diseases into those that are
causal and those that are not. This is where Mendelian randomization can
play an important role.
The technique of Mendelian randomization itself has undergone rapid de-
velopment, mostly in the last 10 years, and applications now abound in current
medical and epidemiological journals. Its basis is that of instrumental vari-
able analysis, which has a much longer history in statistics and particularly
in econometrics. Relevant papers on Mendelian randomization are therefore
dispersed across the multiple fields of genetics, epidemiology, statistics and
econometrics. The intention of this book is to bring together this literature
on the methods and practicalities of Mendelian randomization, especially to
help those who are relatively new to this area.
In writing this book, we envisage the target audience comprising two main
groups, Epidemiologists and Medical Statisticians, who want to perform ap-
plied Mendelian randomization analyses or understand how to interpret their
results. We therefore assume a familiarity with basic epidemiological terminol-
ogy, such as prospective and case-control studies, and basic statistical meth-
ods, such as ordinary least squares and logistic regression. Meanwhile, we have
tried to make the perhaps alien terminology of econometrics accessible to our
intended readership.
While we hope that this book will be accessible to a wide audience, a
geneticist may baulk at the simplistic explanations of Mendelian inheritance,
a statistician may yearn for a deeper level of technical exposition, and an
epidemiologist may wonder why we don’t just cut to the chase of how to
perform the analyses. Our hope is that enough detail is given for those who
need it, references are available for those who want more, and a section can
simply be glossed over by those for whom it is redundant.
While we have included relevant statistical methodology available up to
the publication date of the book, our focus has been on methods and issues
which are of practical relevance for applied Mendelian randomization analy-
ses, rather than those which are of more theoretical interest, or ‘cutting-edge’
developments which may not stand the test of time. As such, to a research
xi
xii Mendelian Randomization
Abbreviations for the various studies in the CCGC are given in Table 10.1.
xiii
xiv Mendelian Randomization
Notation
This book concerns making inferences about causal effects based on ob-
servational data using genetic instrumental variables, a concept known as
Mendelian randomization. In this chapter, we introduce the basic idea of
Mendelian randomization, giving examples of when the approach can be used
and why it may be useful. We aim in this chapter only to give a flavour of
the approach; details about its conditions and requirements are reserved for
later chapters. Although the examples given in this book are mainly in the
context of epidemiology, Mendelian randomization can address questions in
a variety of fields of study, and the majority of the material in this book is
equally relevant to problems in different research areas.
3
4 Mendelian Randomization
ment’ (no intervention), and the average outcomes in each of the arms of the
trial are contrasted. Here the risk factor (which we will often refer to as the
“exposure” variable) is a putative causal risk factor. We seek to assess whether
the risk factor is a cause of the outcome, and estimate (if appropriate) the
magnitude of the causal effect.
While randomized trials are in principle the best way of determining the
causal status of a particular risk factor, they have some limitations. Random-
ized trials are expensive and time-consuming, especially when the outcome
is rare or requires a long follow-up period to be observed. Additionally, in
some cases, a targeted treatment which has an effect only on the risk factor
of interest may not be available. Moreover, many risk factors cannot be ran-
domly allocated for practical or ethical reasons. For example, in assessing the
impact of drinking red wine on the risk of coronary heart disease, it would
not be feasible to recruit participants to be randomly assigned to either drink
or abstain from red wine over, say, a 20-year period. Alternative approaches
for judging causal relationships are required.
Scientific hypotheses are often assessed using observational data. Rather
than by intervening on the risk factor, individuals with high and low levels of
the risk factor are compared. In many cases, differences between the average
outcomes in the two groups have been interpreted as evidence for the causal
role of the risk factor. However, such a conclusion confuses correlation with
causation. There are many reasons why individuals with elevated levels of the
risk factor may have greater average outcome levels, without the risk factor
being a causal agent.
Interpreting an association between an exposure and a disease outcome in
observational data as a causal relationship relies on untestable and usually
implausible assumptions, such as the absence of unmeasured confounding (see
Chapter 2) and of reverse causation. This has led to several high-profile cases
where a risk factor has been widely promoted as an important factor in disease
prevention based on observational data, only to be later discredited when evi-
dence from randomized trials did not support a causal interpretation [Taubes
and Mann, 1995]. For example, observational studies reported a strong inverse
association between vitamin C and risk of coronary heart disease, which did
not attenuate on adjustment for a variety of risk factors [Khaw et al., 2001].
However, results of experimental data obtained from randomized trials showed
a non-significant association in the opposite direction [Collins et al., 2002].
The confidence interval for the observational association did not include the
randomized trial estimate [Davey Smith and Ebrahim, 2003]. Similar stories
apply to the observational and experimental associations between β-carotene
and smoking-related cancers [Peto et al., 1981; Hennekens et al., 1996], and
between vitamin E and coronary heart disease [Hooper et al., 2001]. More
worrying is the history of hormone-replacement therapy, which was previously
advocated as being beneficial for the reduction of breast cancer and cardio-
vascular mortality on the basis of observational data, but was subsequently
shown to increase mortality in randomized trials [Rossouw et al., 2002; Beral
Introduction and motivation 5
et al., 2003]. More robust approaches are therefore needed for assessing causal
relationships using observational data. Mendelian randomization is one such
approach.
first line of defence against infection and injury. There has been particular in-
terest in one of these, C-reactive protein (CRP), and the role of elevated levels
of CRP in the risk of coronary heart disease (CHD). It is known that CRP is
observationally associated with the risk of CHD [Kaptoge et al., 2010], but,
prior to robust Mendelian randomization studies, it was not known whether
this association was causal [Danesh and Pepys, 2009]. The specific question in
this example (a small part of the wider inflammation hypothesis) is whether
long-term elevated levels of CRP lead to greater risk of CHD.
in the population, from which causal inferences can be obtained. Other plausi-
ble non-genetic instrumental variables include government policy changes (for
example, the introduction of a smoking ban in public places, or an increase
in cigarette tax, which might decrease cigarette smoking prevalence without
changing other variables) and physician prescribing preference (for example,
the treatment a doctor chose to prescribe to the previous patient, which will be
representative of the doctor’s preferred treatment, but should not be affected
by the current patient’s personal characteristics or case history).
1.5.1 Structure
This book is divided into three parts. The first part, comprising Chapters 1
to 6, is entitled “Using genetic variants as instrumental variables to assess
causal relationships”. This part contains the essential information for a prac-
titioner interested in Mendelian randomization (Chapters 1 and 2), including
definitions of causal relationships and instrumental variables (Chapter 3), and
methods for the estimation of causal effects (Chapter 4). With the exception
of some of the technical details about statistical methods marked as ‘starred’,
these sections should be fully accessible to most epidemiologists. Issues sur-
rounding the application of Mendelian randomization in practice are explored
by presenting examples of Mendelian randomization investigations from the
literature (Chapter 5). Also addressed is the question of how to interpret a
Mendelian randomization estimate, and how it may compare to the effect of
an intervention on the exposure of interest in practice (Chapter 6).
The second part, comprising Chapters 7 to 10, is entitled “Statistical is-
sues with instrumental variable analysis in Mendelian randomization”. This
Introduction and motivation 11
TABLE 1.1
Examples of causal relationships assessed by Mendelian randomization in ap-
plied research.
Abbreviations:
CRP = C-reactive protein, CIMT = carotid intima-media thickness, CHD = coronary
heart disease, SHBG = sex-hormone binging globulin, HDL-C = high-density lipoprotein
cholesterol, BMI = body mass index, ADHD = attention deficit hyperactivity disorder.
References:
1. Trompet et al., 2009, 11. Kivimäki et al., 2007,
2. Timpson et al., 2005, 12. Mumby et al., 2011,
3. Kivimäki et al., 2008, 13. Norton and Han, 2008,
4. Allin et al., 2010, 14. Von Hinke et al., 2010,
5. Thompson et al., 2005, 15. Chen et al., 2008,
6. Voight et al., 2012, 16. Bech et al., 2006,
7. Casas et al., 2005, 17. Almon et al., 2010,
8. Kamstrup et al., 2009, 18. Irons et al., 2007,
9. Clarke et al., 2009, 19. Ding et al., 2009b,
10. Ding et al., 2009a, 20. Ebrahim and Davey Smith, 2008
12 Mendelian Randomization
1.6 Summary
Distinguishing between a factor which is merely associated with an outcome
and one which has a causal effect on the outcome is problematic outside of
the context of a randomized controlled trial. Instrumental variables provide
a way of assessing causal relationships in observational data, and Mendelian
randomization is the use of genetic variants as instrumental variables.
In the next chapter, we provide more detail of what Mendelian random-
ization is, and when and why it may be useful.
2
What is Mendelian randomization?
2.1.1 Motivation
A foundational aim of epidemiological research is the estimation of the effect
of changing an exposure on an outcome. This is known as the causal effect
of the exposure on the outcome, and typically differs from the observational
association between the exposure and outcome, for example due to confound-
ing. Correlation between the exposure and the outcome cannot be reliably
interpreted as evidence of a causal relationship. For example, those who drink
red wine regularly have a lower incidence of heart disease. But socio-economic
status is a common predictor of both wine consumption and better coronary
health, and so it may be that socio-economic status rather than wine con-
sumption underlies the risk of heart disease. Observational associations may
also arise as a result of reverse causation. For example, those who regularly
take headache tablets are likely to have more headaches than those who do
13
14 Mendelian Randomization
not, but taking headache tablets is unlikely to be a cause of the increased inci-
dence of headaches. Another example is vitamin D levels, which may decrease
in individuals who are ill and therefore do not go outside, rather than vitamin
D being a cause of illness.
The idea of Mendelian randomization is to find a genetic variant (or vari-
ants) associated with the exposure, but not associated with any other risk fac-
tor which affects the outcome, and not directly associated with the outcome.
This means that any association of the genetic variant with the outcome must
come via the variant’s association with the exposure, and therefore implies a
causal effect of the exposure on the outcome. Such a genetic variant would
satisfy the assumptions of an instrumental variable (IV) [Greenland, 2000a;
Sussman and Hayward, 2010]. As the theory of IVs was initially developed
in the field of econometrics, a number of terms commonly used in the IV lit-
erature derive from this field and are not always well understood by medical
statisticians or epidemiologists. Table 2.1 is a glossary of terms which are used
in each field.
TABLE 2.1
A summary of instrumental variable terms used in the fields of econometrics
and epidemiology.
16 Mendelian Randomization
FIGURE 2.1
Comparison of a randomized controlled trial and Mendelian randomization.
• Alleles are the variant forms of a single nucleotide polymorphism (SNP). For
a diallelic SNP where there are two possible alleles, the more common allele is
called the major allele or wildtype allele, and the less common allele is the minor
allele or variant allele.
• Canalization (also known as developmental compensation) is the process by
which potentially disruptive influences on normal development from genetic (and
environmental) variation are damped or buffered by compensatory processes.
• A chromosome carries a collection of genes located on a long string of DNA.
Humans have 22 pairs of autosomal (non-sex) chromosomes and 1 pair of sex
chromosomes.
• A copy number variant (or variation) is a (possibly) repeating section of DNA
where the number of copies of the section varies between individuals.
• DNA (deoxyribonucleic acid) is a molecule that contains the genetic instruc-
tions used in the development and functioning of all living organisms. The main
role of DNA is the long-term storage of information. It contains the instructions
needed to construct other components of cells, including proteins and ribonucleic
acid (RNA) molecules. DNA has four nucleotide bases labelled A, T, C and G.
• A gene is a section of a chromosome comprising DNA which encodes infor-
mation relevant to the function of an organism.
• The genotype of an individual at a particular locus refers to the two alleles at
that locus. If the alleles are the same, the genotype is homozygous; if different,
heterozygous.
• A haplotype describes a particular combination of alleles from linked loci found
on a single chromosome.
• Linkage disequilibrium (LD) is the correlation between allelic states at dif-
ferent loci within the population. The term LD describes a state that represents
a departure from the hypothetical situation in which all loci exhibit complete
independence (linkage equilibrium).
• A locus (plural: loci ) is the position in a DNA sequence and can be a SNP, a
region of DNA sequence, or a whole gene.
• Meiosis is the process of cell division leading to gametes (sex cells) which
contain half of the genetic material from the original cell.
• Pleiotropy is the potential for genes or genetic variants to have more than one
independent phenotypic effect.
• Polymorphism is the existence of two or more variants at a locus. The term
polymorphism is usually restricted to moderately common genetic variants, with
at least two alleles having frequencies of greater than 1% in the population. A
less common variant allele is called a mutation.
• Single nucleotide polymorphisms (SNPs) are genetic variations in which one
base in the DNA is altered, for example a T instead of an A.
TABLE 2.2
A glossary of genetic terminology, adapted from Lawlor et al., 2008.
22 Mendelian Randomization
...ATTACGCTTCCGAGCTTCCGCAG...
and that same locus on the paired chromosome reads:
...ATTACGCCTCCGAGCTTCCGCAG...
The underlined nucleotide represents a nucleotide at a particular locus that is
polymorphic: it exists in various forms. All individuals contain many genetic
mutations, where the DNA code has changed from that generally seen in the
population. A single nucleotide polymorphism (SNP) is a mutation where a
single nucleotide base at a particular locus has been replaced with a differ-
ent nucleotide. The different possible nucleotides which may appear at each
locus are known as alleles. For example, at the highlighted locus above, one
chromosome has the letter T, and the other has the letter C: so T and C are
alleles of this particular SNP. If these are the only two possibilities, this is a
diallelic SNP; triallelic and quadrallelic SNPs are far less common, but have
also been observed.
For a diallelic SNP, it is conventional to denote the more common allele,
known as the wildtype or major allele, by an upper case letter (for example,
A) and the less common allele, the variant or minor allele, by a lower case
letter (for example, a). The choice of letter is arbitrary; there is no connection
between the letter A commonly used for the first variant considered, and the
nucleotide base adenine represented by letter A. The proportion of minor
alleles in a population for a given SNP is called the ‘minor allele frequency’.
Although some genetic mutations seem to be specific to particular individuals,
others are more widespread, showing up in a substantial proportion of the
population. SNPs occur on average about once in every 300 nucleotides along
the genome, and extensive catalogues of SNPs have been compiled.
As people have two copies of each chromosome (one from each parent),
individuals can be categorized for each diallelic SNP into three possible sub-
groups corresponding to their combination of alleles (their genotype). These
subgroups are the major homozygotes (AA), heterozygotes (Aa) and minor
homozygotes (aa). We shall denote these subgroups as 0, 1 and 2, correspond-
ing to the number of minor alleles for that SNP. For a more complicated
genetic variant, such as a triallelic SNP where there are three possible alleles
at one locus, there is no natural ordering of the six possible subgroups given
by the SNP.
When multiple SNPs on a single chromosome are considered, the combi-
nation of alleles on each of the chromosomes is known as a haplotype. For
example, if an individual has one chromosome reading:
...GCACCTTAC...GTAGAATC...TCAACTGTCAT
and the other reading:
...GCACCGTAC...GTAAAATC...TCAACTGTCAT
then the individual is a heterozygote for the first two SNPs, and a homozygote
for the final SNP. The haplotypes are TGT and GAT. One of these haplotypes
What is Mendelian randomization? 23
TABLE 2.3
Example: alcohol intake and the ALDH2 polymorphism in the Japanese pop-
ulation.
effect on oesophageal cancer risk as well as a strong association with the ge-
netic variant to provide an alternative explanation for these results.
These associations are summarized in Table 2.3. The genetic mutation
provides a fair test to compare three populations who differ systematically
only in their consumption of alcohol and exposure to acetaldehyde, and who
have vastly differing risks of the outcome. The evidence for a causal link be-
tween alcohol consumption, exposure to acetaldehyde and oesophageal cancer
is compelling [Schatzkin et al., 2009]. However, in other cases, particularly if
the genetic variant(s) do not explain much of the variation in the exposure, the
power to detect a causal effect may be insufficient to provide such a convincing
conclusion.
2.4 Summary
Mendelian randomization has the potential to be a useful tool in a range
of scientific contexts to investigate claims of causal relationships. It must be
applied with care, as its causal claims come at the price of assumptions which
are not empirically testable. Its methods must be refined, as often data on
multiple genetic variants or data taken from several study populations are
required to achieve meaningful findings. But, when properly used, it gives an
insight into the underlying causal relationships between variables which few
other approaches can rival.
3
Assumptions for causal inference
In the previous chapters, we repeatedly used the word ‘causal’ to describe the
inferences obtained by Mendelian randomization. In this chapter, we clarify
what is meant by the causal effect of an exposure on an outcome. We give
a more detailed explanation of the theory of instrumental variables, and ex-
plain in biological terms various situations that may lead to violations of the
instrumental variable assumptions and thus misleading causal inferences. We
conclude by discussing the difference between testing for the presence of a
causal relationship and estimating a causal effect, and the additional assump-
tions necessary for causal effect estimation.
25
26 Mendelian Randomization
variables correlated with the exposure. With the causal effect, setting the value
of the exposure only alters the exposure and variables on the causal pathway
downstream of the exposure, not variables on alternative causal pathways.
The outcome variable Y for different observed values x of the exposure X
is written as Y |X = x, read as Y conditional on X equalling x. Causal effects
cannot be expressed in terms of probability distributions and so additional
notation is required [Pearl, 2010]. The outcome variable Y when the exposure
X is set to a given value x is written as Y |do(X = x), where the do operator
indicates that the variable is manipulated to be set to the given value.
FIGURE 3.1
Directed acyclic graph illustrating instrumental variable (IV) assumptions.
that the same distribution of outcomes would be expected if each of the sub-
groups were exposed to the treatment or the control regime. Although an in-
dividual can only be exposed to one of the two treatment regimes (and so only
observed in one universe), by exposing each subgroup to a different treatment
regime, in effect we observe the population in each of the two counterfactual
parallel universes, and the average outcomes in each of the universes (sub-
groups) can be compared [Greenland and Robins, 1986]. A causal effect can
be consistently estimated which represents the average effect of being assigned
to the treatment group as opposed to the control group. This means that an
RCT can estimate an average causal effect for the population as the contrast
between the average levels of the outcome in the randomized subgroups of
the population (which will have the same characteristics on average as the
overall population due to the random assignment into subgroups). An indi-
vidual causal effect cannot be estimated, as an individual cannot in general
be subjected to both the treatment and control regimes [Rubin, 1974].
For Mendelian randomization, the similar key property of an IV is that the
division of the population into genetic subgroups is independent of competing
risk factors, and so genetic subgroups defined by the IV are exchangeable. For
a genetic variant to be an IV, it is necessary that same distribution of out-
comes would be observed if individuals with no copies of the genetic variant
instead had one copy of the genetic variant (and the exposure distributions
were unchanged), and vice versa. However, empirical testing of the exchange-
ability criterion is not possible.
We return to the question of assessing the validity of genetic variants as
IVs later in this section; firstly we consider reasons why a genetic variant may
not be a valid IV. These include issues of biological mechanism, genetic co-
inheritance, and population effects. Invalid IVs lead to unreliable inferences
for the causal effect of an exposure. The situations discussed here represent
potential lack of internal validity of estimates; the question of the external
validity of an IV estimate as an estimate of the effect of a clinical intervention
is discussed in Chapter 6.
purposes. For example, the FTO gene is a determinant of satiety (how full of
food a person feels) [Wardle et al., 2008]. If satiety affects body mass index
(BMI), then a variant in the FTO gene can be used as an IV for BMI if the
two variables are on the same causal pathway, and if there is no alternative
causal pathway from the genetic variant to the outcome not via BMI. How-
ever, if the FTO gene was also associated with (say) blood pressure, and this
association was not completely mediated by the association of the gene with
BMI, then it would be misleading to use a variant in the FTO gene to make
specific inferences about the causal effect of BMI on an outcome.
Concerns about pleiotropy can be alleviated by using genetic variants lo-
cated in genes, the biological function of which are well-understood. For ex-
ample, for C-reactive protein (CRP), we can use genetic variants in the CRP
gene which are known to have functional relevance in the regulation of CRP
levels. Associations of a variant with measured covariates can be assessed to
investigate potential pleiotropy, although such associations may also reflect
mediation, particularly if the associations are consistent across independent
variants.
Canalization: Canalization, or developmental compensation, is the phe-
nomenon by which an individual adapts in response to genetic change in such
a way that the expected effect of the change is reduced or absent [Debat and
David, 2001]. It is most evident in knockout studies, where a gene is rendered
completely inactive in an organism, typically a mouse. Often the organism de-
velops a compensatory mechanism to allow for the missing gene such that the
functionality of the gene is expressed via a different biological pathway. This
buffering of the genetic effect may have downstream effects on other variables.
Canalization may be a problem in Mendelian randomization if groups with dif-
ferent levels of the genetic variants differ with respect not only to the exposure
of interest, but also to other risk factors via a canalization mechanism.
In a sense, canalization is not a violation of the IV assumptions, but merely
an (often unwanted) consequence. Canalization is the same process as that
assessed by Mendelian randomization, as any change in other risk factors
from canalization occurs as a causal effect of the genetic variant. However,
the aim of Mendelian randomization is not simply to describe the effects of
genetic change, but to assess the causal effect of the (non-genetic) exposure.
If there is substantial canalization, Mendelian randomization estimates may
be unrepresentative of clinical interventions on the exposure performed in a
mature cohort.
Confounder
Measured variant
FIGURE 3.2
Graph of instrumental variable assumptions where a variant in linkage dise-
quilibrium with the causal variant has been measured. Such a variant would
still be a valid instrumental variable. The dashed line connecting the genetic
variants indicates correlation without a causal interpretation.
subgroups will differ and not be the same as those in the general population.
This may introduce bias in the estimation of causal effects, as there is a path-
way opened up from the genetic variant to the outcome by conditioning on
a common cause of the variant and the outcome (sometimes called collider
bias).
This would also be a problem in studies looking at genetic associations
in populations of diseased individuals, such as clinical trials of secondary dis-
ease prevention. Individuals with greater genetically determined disease risk
are less likely to survive to study recruitment, and so the randomization of
individuals into genetic subgroups at conception would not hold in the study
population, leading to biased genetic associations.
TABLE 3.1
Bradford Hill criteria applied to Mendelian randomization for judging the
biological plausibility of a genetic variant as an instrumental variable.
Assumptions for causal inference 37
which corresponds to the directed acyclic graph (DAG) in Figure 3.3 [Dawid,
2002; Didelez and Sheehan, 2007].
It is a common mistake to think that the third IV assumption should read
not G ⊥ ⊥ Y |X, U , but G ⊥
⊥ Y |X, that is conditioning on U is not necessary.
As X is a common descendent of G and U , conditioning on X induces an
association between G and U , and therefore between G and Y . For example,
if X and U are positively correlated and both have positive causal effects on
Y , then conditional on X taking a value around the middle of its distribution,
a large value of Y is associated with a low value of G. This is because the large
value of Y is associated with a large value of U , and so G is more likely to be
G X Y
FIGURE 3.3
Directed acyclic graph of Mendelian randomization assumptions as random
variables.
38 Mendelian Randomization
low so that the value of X is moderate and not large. The lack of independence
(G 6⊥
⊥ Y |X) means that, in the regression of Y on X and G, the coefficient
for G will generally be close to, but not equal to zero in a large sample (and
especially if X is measured with error).
In order to interpret the unconfounded estimates produced by IV analysis
as causal estimates, we require the additional structural assumption:
where 1(.) is the indicator function. This ensures that intervening on X does
not affect the distributions of any other variables except the conditional dis-
tribution of Y [Didelez et al., 2010].
Genetic variant
Exposure Outcome
FIGURE 3.4
Triangle of associations: an ‘expected’ association estimate between the ex-
posure and the outcome can be calculated by dividing the coefficient for the
association between the genetic variant and the outcome by the coefficient for
the association between the exposure and the variant. The dashed line is the
association which is estimated.
Assumptions for causal inference 41
3.5 Summary
The instrumental variable assumptions make assessment of causation in an ob-
servational setting possible without complete knowledge of all the confounders
of the exposure–outcome association. Genetic variants have good theoretical
and empirical plausibility for use as instrumental variables in general, but the
instrumental variable assumptions may be violated for a number of reasons.
We continue in the next chapter to consider methods for estimating the
magnitude of a causal effect using instrumental variables.
4
Methods for instrumental variable analysis
In this chapter, we discuss methods for the estimation of causal effects using
instrumental variables (IVs) with both continuous and binary outcomes. We
focus attention on the case of a single continuous exposure variable, as this
is the usual situation in Mendelian randomization studies; although the same
methods could be used in the case of a single binary exposure. We explain for
each method how to estimate a causal effect, and describe specific properties
of the estimator. In turn, we consider the ratio of coefficients method, two-
stage methods, likelihood-based methods, and semi-parametric methods. This
order corresponds roughly to the complexity of the methods, with the simplest
ones first. These methods are contrasted in terms of bias, coverage, efficiency,
power, robustness to misspecification, and existence of finite moments. We
have included a simple explanation of each method at first, and then further
details for more technical readers. Also discussed are implementations of the
methods using standard statistical software packages.
45
46 Mendelian Randomization
∆Y 0.4
= = 0.4
Outcome
∆X 1.0
∆Y = 0.4
0.2
0.1
0.0
∆X= 1.0
Exposure
FIGURE 4.1
Points representing mean exposure and outcome in two genetic subgroups
with IV ratio estimate.
Methods for instrumental variable analysis 47
Outcome
Outcome
Exposure Exposure
Outcome
Outcome
Exposure Exposure
FIGURE 4.2
Illustration of ratio method for polytomous IV taking three values with a
continuous outcome in a fictitious dataset: (top-left) exposure and outcome
for all individuals, observational estimate with 95% confidence interval; (top-
right) individuals divided into genetic subgroups by plot symbol; (bottom-left)
mean exposure and outcome in each genetic subgroup (lines represent 95%
confidence intervals); (bottom-right) ratio IV estimate with 95% confidence
interval.
50 Mendelian Randomization
similar to a consistency assumption, which states that the outcome for an indi-
vidual would be the same if the value of the exposure were observed naturally
or set due to an intervention [VanderWeele, 2009]. Although confounding is
represented by a single variable U , this is simply for presentation; U represents
the combined effect of all confounding variables.
We note that the ratio estimate can be calculated simply from the coef-
ficients β̂X|G and β̂Y |G , and as such only requires the availability of summa-
rized data, not individual-level data. Methods for obtaining IV estimates using
summarized data are discussed further in Section 9.4. The two coefficients can
also be estimated in different groups of individuals. Common examples include
where the IV–outcome association is measured on the whole sample and the
IV–exposure association on a subsample (subsample Mendelian randomiza-
tion, see Section 8.5.2), or the associations are estimated on non-overlapping
datasets (two-sample Mendelian randomization, see Section 9.8.2).
This approximation assumes that the numerator and denominator of the ratio
estimator are uncorrelated; such correlation could be accounted for by includ-
ing a third term of the delta expansion [Thomas et al., 2007], but is unlikely
to have a considerable impact on the estimate of the standard error.
However, asymptotic (large sample) normal approximations may result in
overly narrow confidence intervals, especially if the sample size is not large or
the IV is ‘weak’. This is because IV estimates are not normally distributed.
Fieller’s theorem: If the regression coefficients in the ratio method β̂Y |G
and β̂X|G are assumed to be normally distributed, critical values and confi-
dence intervals for the ratio estimator may be calculated using Fieller’s theo-
rem [Fieller, 1954; Lawlor et al., 2008]. We assume that the correlation between
β̂Y |G and β̂X|G is zero; other values can be used, but the impact on the confi-
dence interval is usually small [Minelli et al., 2004]. If the standard errors are
se(β̂Y |G ) and se(β̂X|G ) and the sample size is N , then we define:
0.4
0.3
0.2
Outcome
0.1
0.0
−0.1
−0.2
−0.3
Exposure
FIGURE 4.3
Points representing mean exposure and outcome (lines are 95% confidence
intervals) in two genetic subgroups where the confidence interval from Fieller’s
theorem for the IV ratio estimate is unbounded.
to plus infinity.
√ All possible values
√ are included in the interval except those
between (f2 + D)/f1 and (f2 − D)/f1 . An example where Fieller’s theorem
would give such a confidence interval including infinity but excluding zero, is
displayed in Figure 4.4. This suggests that the differences in the outcome are
not caused solely by differences in the exposure, and so the IV assumptions
are violated.
To summarize, Fieller’s theorem gives confidence intervals that have one
of three possible forms [Buonaccorsi, 2005]:
i. The interval may be a closed interval [a, b],
ii. The interval may be the complement of a closed interval (−∞, b] ∪ [a, ∞),
iii. The interval may be unbounded.
√ √
where a = (f2 − D)/f1 , b = (f2 + D)/f1 . Confidence intervals from Fieller’s
theorem are preferred to those from an asymptotic normal approximation
when the IV is weak. A tool to calculate confidence intervals from Fieller’s
theorem based on the gene–exposure and gene–outcome associations is avail-
able online (http://spark.rstudio.com/sb452/fieller/).
54 Mendelian Randomization
0.8
0.6
Outcome
0.4
0.2
0.0
−0.2
Exposure
FIGURE 4.4
Points representing mean exposure and outcome (lines are 95% confidence
intervals) in two genetic subgroups where the confidence interval from Fieller’s
theorem for the IV ratio estimate is compatible with an infinite (vertical)
association, but not a null (horizontal) association.
biased for the causal effect, whereas the IV estimate is unbiased, but typically
imprecisely estimated. The loss of precision in the IV estimate is the cost of
unbiased estimation. When making causal assessments, we would argue that
no appreciable amount of bias should be introduced in order to reduce the
variance of the estimate [Zohoori and Savitz, 1997].
However, the sample size required to obtain precise enough causal esti-
mates to be clinically relevant can be very large [Ebrahim and Davey Smith,
2008]. A rule of thumb for power is that the sample size for a conventional
analysis should be divided by the coefficient of determination (R2 ) of the IV
on the exposure (Section 8.3) [Wooldridge, 2009]. For example, if the sample
size for an observational regression analysis of the outcome on the exposure
to detect a given effect size requires a sample size of 400, and the IV explains
2% of the variation in the exposure, then the sample size required for an IV
analysis is approximately 400/0.02 = 20000. For this reason, while for some
researchers the ratio method may be sufficient for the analysis in question, we
are motivated to consider methods which can incorporate data on more than
one IV, and hence give more precise estimates of causal effects.
4.2.3* Non-collapsibility
Several measures of association, including odds ratios, differ depending on
whether they are considered conditional or marginal on a covariate. For ex-
ample in the left half of Table 4.1, the odds ratio of an outcome for exposed
versus unexposed individuals is equal to 2 for men and 2 for women. Even
under the assumption of no confounding (that the proportion of exposed and
non-exposed individuals is the same in both men and women), the odds ratio
for a population with equal numbers of men and women is not 2. In contrast,
as the example in the right half of Table 4.1 shows, a relative risk is the same
whether considered conditional or marginal on sex.
A measure of association, such as an odds ratio or relative risk, would be
termed collapsible if, when it is constant across the strata of the covariate, this
constant value equals the value obtained from the overall (marginal) analysis.
Non-collapsibility is the violation of this property [Greenland et al., 1999]. The
relative risk and absolute risk difference are collapsible measures of association.
Odds ratios are generally non-collapsible [Ducharme and LePage, 1986]. This
means that the conditional model:
TABLE 4.1
Illustrative examples of collapsing an effect estimate over a covariate: non-
equality of conditional and marginal odds ratios and equality of relative risks.
Methods for instrumental variable analysis 59
This model is similar to that in the FIML and LIML methods, except that
the causal parameter β1 represents the causal effect between the true means ξi
and ηi rather than the measured values of outcome and exposure. The model
can be estimated in a Markov chain Monte Carlo (MCMC) framework, such
as that implemented in WinBUGS [Spiegelhalter et al., 2003]. The output
is a posterior distribution, from which the posterior mean or median can
be interpreted as a point estimate, and the 2.5th and 97.5th percentiles as
a ‘95% confidence interval’. Under certain choices of prior, estimates based
on the Bayesian posterior distribution are similar to those from the 2SLS
or LIML methods [Kleibergen and Zivot, 2003]. With vague priors, the joint
posterior distribution is similar to the frequentist likelihood function [Burgess
and Thompson, 2012].
An advantage of the Bayesian approach is that no distributional assump-
tion is made for the posterior distribution of the causal parameter. Inference is
therefore more robust using weak instruments [Burgess and Thompson, 2012].
following model:
2
xi ∼ N (ξi , σX ) (4.19)
yi ∼ Bernoulli(πi )
K
X
ξi = α0 + αk gik
k=1
logit(πi ) = β0 + β1 xi
then the joint likelihood L is given by:
Y y 1 1
1−yi 2
L= πi (1 − πi )
i
√ {exp(− 2 (xi − ξi ) )} . (4.20)
2πσX σX
i=1,...,N
may be different in finite samples, although they each assume the same struc-
tural model between Y and X.
When there is more than one instrument, gi becomes gik and we have a
separate estimating equation for each instrument k = 1, . . . , K. The orthog-
onality conditions for each instrument cannot generally be simultaneously
satisfied. The estimate is taken as the minimizer of the objective function
1
P
where ḡk = N i gik and the summation is across i, which indexes study
participants.
Where the model for the expected outcomes is non-linear, this is known
as a generalized structural mean model. With a binary outcome, it is natural
to use a log-linear (or multiplicative) SMM:
estimated in the same way, as the expectation logit E(Y (x)) depends on the
distribution of the IV [Robins, 1999]. This problem can be addressed by es-
timating Y (x) assuming an observational model [Vansteelandt and Goetghe-
beur, 2003]:
logit E(Y (x)) = β0a + β1a x (4.28)
where the subscripts a indicate associational, as well as a structural model:
The commands ivregress 2sls y (x=g) and ivreg y (x=g) give the
same estimates as ivreg2 y (x=g), but a more limited output. The command
ivhettest performs a test of heteroscedasticity of the errors in the second-
stage regression. If heteroscedasticity is present, a GMM analysis with robust
standard errors is preferred to a 2SLS analysis. The command overid gives
more information about overidentification tests. The command ivendog gives
more information about endogeneity tests. The command qvf has been written
to implement a fast bootstrap estimation of standard errors for IV analysis
[Hardin et al., 2003]. Each of these commands can be used with multiple
instruments, for example ivreg2 y (x=g1 g2 g3).
reg x g
predict xhat
logit y xhat, robust
where robust standard errors are calculated in the second-stage regression.
Generic estimating equations for GMM or SMM analyses can be solved in
Stata using the gmm command [Drukker, 2009]. For example, a linear GMM
estimate can be obtained using:
Useful notes are available for the estimation of SMMs with a binary out-
come [Clarke et al., 2011]. Each of these commands can be used with multiple
instruments: for example gmm (y - beta0 - x*beta1), instruments(g1
g2). The command qvf can also be used in non-linear cases [Hardin et al.,
2003], such as a two-stage analysis with a non-linear second-stage model, to
prevent the over-precision of estimates resulting from a sequential regression
method. A probit IV model (not considered in this book) can be estimated
using the command ivprobit.
4.6.4 IV analysis in R
The R command tsls in the library sem carries out a 2SLS procedure [Fox,
2006]. Care must be taken as the constant term usually used in regression
equations is not included by default. If the exposure is x, the outcome is y
and the IV is g, the syntax for a two-stage (2SLS) analysis is:
g0=g[y==0]
glm(y~predict(lm(x[y==0]~g0), newdata=list(g0=g)), family=binomial)
Generic estimating equations for GMM or SMM can be solved in R us-
ing the gmm package [Chaussé, 2010]; details and sample code are available
[Clarke et al., 2011].
Methods for instrumental variable analysis 73
model {
beta1 ~ dnorm(0, 1E-6)
beta0 ~ dnorm(0, 1E-6)
alpha0 ~ dnorm(0, 1E-6)
for (k in 1:K) { alpha1[k] ~ dnorm(0, 1E-6) }
xsd ~ dunif(0, 20)
ysd ~ dunif(0, 20)
rho ~ dunif(-1, 1)
# priors for the parameters
xtau <- pow(xsd, -2)
ytau <- pow(ysd, -2)
tauy <- ytau/(1-pow(rho,2))
# tauy is the precision of y conditional on x
for (i in 1:N) {
ksi[i] <- alpha0 + inprod(alpha1[1:K], g[i,1:K])
x[i] ~ dnorm(ksi[i], xtau)
eta[i] <- beta0 + beta1 * ksi[i]
muy[i] <- eta[i] + sqrt(xtau/ytau)*rho*(x[i]-ksi[i])
# muy[i] is the mean of y[i] conditional on x[i]
y[i] ~ dnorm(muy[i], tauy)
} }
In the above, the bivariate normal distribution of (X, Y )T from equation (4.18)
has been equivalently replaced by the marginal distribution of X and the
conditional distribution of Y |X = x [Burgess and Thompson, 2012].
In a case-control study with a binary outcome and a logistic model of
association, the following code can be used (the first P individuals in the
dataset are the controls):
model {
beta1 ~ dnorm(0, 1E-6)
beta0 ~ dnorm(0, 1E-6)
alpha0 ~ dnorm(0, 1E-6)
for (k in 1:K) { alpha1[k] ~ dnorm(0, 1E-6) }
74 Mendelian Randomization
4.7 Summary
Methods for IV analysis range from the very simple (calculate the difference
between two pairs of numbers and divide one by the other) to the more com-
plicated. The development of complex methods has been driven by the desire
to produce efficient estimates, for example by integrating data on multiple
IVs, to allow for more flexible modelling assumptions, or to provide robust-
ness against misspecification of modelling assumptions. Each method has its
own advantages and disadvantages. The properties of many of these estima-
tors will be discussed in the chapters to come in the specific contexts of weak
instruments, binary outcomes, and evidence synthesis.
In the next chapter, we consider examples of the use of Mendelian ran-
domization, focusing particularly on practical aspects of the analyses, such as
study design, and their impact on methods.
5
Examples of Mendelian randomization
analysis
75
76 Mendelian Randomization
risk with the variant is estimated in the whole study population using lo-
gistic regression. In the meta-analysis, summary-level data from each study
on the number of cases and controls in each genetic subgroup are used to
estimate the association of the variant with the risk of CHD in each study.
The study-specific estimates are then combined using a fixed-effect inverse-
variance weighted meta-analysis (Chapter 9). A per allele genetic model is
used, as this is best supported by the data on fibrinogen levels. In both cases,
the result is cited as the risk ratio of CHD per additional variant allele.
5.1.4 Results
The analyses show a null association of the variant with CHD risk, with a
narrow confidence interval (CI) for the genetic association with disease risk:
risk ratios of 1.06 (95% CI 0.96 to 1.16) per fibrinogen-increasing allele in ISIS
alone and of 1.00 (95% CI 0.95 to 1.04) in the meta-analysis. On the basis
of this, the authors conclude that “these genetic results provide strong evi-
dence that long-term differences in fibrinogen concentrations are not a major
determinant of coronary disease risk”.
5.1.5 Commentary
A weakness of the presentation of the results is that a causal estimate of the
effect of fibrinogen on CHD risk is not presented. Although the risk ratio
estimate of 1.00 (95% CI, 0.95 to 1.04) per additional allele appears to be a
small effect, each additional allele is only associated with a small increase in
fibrinogen levels (0.14 (standard error 0.024) g/l), meaning that a standard
deviation increase in fibrinogen (0.81 g/l, estimated in control participants)
could still lead to an approximate 25% increase in CHD risk based on the upper
bound of the 95% CI (assuming a log-linear relationship between fibrinogen
and the risk of CHD).
5.2.4 Results
The IV analysis shows a positive causal effect of BMI on blood pressure and
hypertension of similar magnitude to the observational association. For exam-
ple, the estimate for the increase in systolic blood pressure associated with a
10% increase in BMI is 2.75 mmHg (95% CI 2.62 to 2.88) from the observa-
tional analysis with adjustment for age, sex and height, and 2.54 mmHg (95%
CI 2.39 to 2.69) with further adjustment for socio-behavioural factors. The
corresponding estimate of the increase in systolic blood pressure caused by
a 10% increase in BMI from the IV analysis is 3.85 mmHg (95% CI 1.88 to
5.83). The FTO SNP has statistically robust associations with BMI (1.18%
[95% CI 0.96 to 1.41] increase in BMI on a multiplicative scale per additional
allele) and with blood pressure (0.63 mmHg [95% CI 0.33 to 0.93] increase in
systolic blood pressure per additional allele), whereas the MC4R SNP has a
smaller magnitude of association with BMI (0.78%, 95% CI 0.53 to 1.04), and
an association with blood pressure compatible with the null (0.20 mmHg, 95%
CI -0.14 to 0.54). This may be due to the MC4R SNP’s reduced association
with BMI and the statistical uncertainty in the association estimates, but it
may reflect heterogeneity of the causal effects identified by the two variants.
The association of the FTO SNP with severe hypertension does not fully
attenuate on adjustment for BMI: attenuation from an odds ratio of 1.07 (95%
1.04 to 1.11) on adjustment for age and sex, to 1.07 (95% 1.03 to 1.11) on
additional adjustment for socio-behavioural factors, and to 1.04 (95% 1.01 to
1.08) on additional adjustment for log(BMI). Although a complete attenuation
is not expected, the limited attenuation suggests that the causal effect of
adiposity on hypertension may not simply be explained as a function of BMI.
The Durbin–Wu–Hausmann tests for each variant are not significant, indi-
cating no difference between the observational and IV estimates beyond that
compatible with chance.
80 Mendelian Randomization
5.2.5 Commentary
Although the Mendelian randomization analysis suggests that adiposity is
causally associated with blood pressure, the unknown function of the genetic
variants limits the certainty of the conclusions that can be drawn.
5.3.4 Results
The analyses show a positive causal effect of lp(a) on MI risk. The odds
ratios of MI in the quartiles of the IV (fourth quartile is reference group)
were 1.3 (95% CI, 1.1 to 1.5) in the first quartile, 1.1 (95% CI, 0.9 to 1.3)
in the second quartile, and 0.9 (95% CI, 0.8 to 1.1) in the third quartile in
the Copenhagen General Population Study (p = 0.005 for trend), and 1.4
(95% CI, 1.1 to 1.7), 1.2 (95% CI, 1.0 to 1.6), and 1.3 (95% CI, 1.0 to 1.6)
in the Copenhagen Ischemic Heart Disease Study (p = .01 for trend). In the
Copenhagen City Heart Study, the IV estimate for the hazard ratio (HR)
of MI per doubling of lp(a) (HR 1.22, 95% CI 1.09 to 1.37) is considerably
larger than the observational estimate (HR 1.08, 95% CI 1.03 to 1.12). This
finding, which was replicated in a similar study [Clarke et al., 2009], may
reflect the increased effect of lifelong differences in lp(a) levels, similar to that
observed for low-density lipoprotein cholesterol (LDL-C) (Section 6.2.1). It
also may result from the association of the KIV-2 polymorphism with both
the concentration of lp(a) and the lp(a) particle size, which is also implicated
82 Mendelian Randomization
5.3.5 Commentary
A limitation of the interpretation of the IV estimate is the non-linear associa-
tion of the number of kringle repeats with lp(a) levels. The IV estimate should
be interpreted as a population-averaged effect, comparing genetic subgroups
which have different average levels of the exposure. With non-linear relation-
ships, the IV estimate does not necessary represent the effect of intervening
on lp(a) for an individual (Section 11.1.2).
5.4.4 Results
From observational epidemiology, the expected odds ratio (OR) for each vari-
ant allele in the endothelial lipase gene is 0.87 (95% CI 0.84 to 0.91). This
is obtained by triangulation of the observed estimate of the association of
HDL-C on MI risk from multivariable adjusted logistic regression with the
observed genetic association of the variant with HDL-C (Section 3.3.3). How-
ever, the variant is not associated with risk of myocardial infarction (OR 0.99,
95% CI 0.88 to 1.11). With the allele score, the expected OR for a 1 standard
deviation increase in HDL-C from observational epidemiology (OR 0.62, 95%
CI 0.58 to 0.66) is not compatible with the estimated OR for a 1 SD increase in
HDL cholesterol from Mendelian randomization (OR 0.93, 95% CI 0.68-1.26).
For a 1 standard deviation increase in LDL-C, the observational epidemiology
84 Mendelian Randomization
(OR 1.54, 95% CI 1.45 to 1.63) and Mendelian randomization (OR 2.13, 95%
CI 1.69 to 2.69) estimates are directionally concordant.
The authors conclude that “some genetic mechanisms that raise plasma
HDL-C do not seem to lower risk of myocardial infarction”. This tentative
conclusion reflects the limited power of the single SNP analysis, where the
CIs for the causal effect and the observational estimate substantially overlap,
and the limited knowledge of the specific function of the allele score, which
may contain variants not exclusively or not directly associated with HDL-C.
5.4.5 Commentary
Aside from the limitations of the conclusions stated above, this paper demon-
strates the statistical difficulty of applied Mendelian randomization analysis
where the studies under analysis are heterogeneous. Although Mendelian ran-
domization investigations can be undertaken in a number of study designs,
differences between studies and specific features of each study may make in-
tegrated analysis of the entirety of the data available challenging.
The authors choose a pragmatic approach, combining a more conservative
analysis using a single genetic variant with a more speculative analysis using
an allele score. This is contrasted with a parallel analysis of LDL-C, which
provides plausibility of the allele score approach, as a positive causal effect of
LDL-C on MI risk is estimated.
5.5 Discussion
A question of interpretation relating to each of these analyses, and to
Mendelian randomization more widely, is how much weight of evidence to
attach to the result of a Mendelian randomization investigation. In a hierar-
chy of evidence, Mendelian randomization has been advocated as providing
“critical evidence” on exposure–outcome relationships [Gidding et al., 2012].
However, the true weight of evidence in each case depends strongly on the
plausibility of the instrumental variable assumptions for the genetic variants.
If the function of the genetic variants is poorly understood, then a causal con-
clusion is in doubt, particularly if there are multiple genetic variants and there
is little consistency in the causal effect estimates using each of the variants.
Equally, if the genetic variants explain a small proportion of the variance in the
exposure, then a Mendelian randomization investigation using those variants
will be inconclusive unless the sample size is very large.
For exposures which are biomarkers, variants can be employed as IVs which
are located in the gene coding the biomarker (such as for fibrinogen in the
example above). For exposures which are complex multifactorial traits, such
as body mass index and blood pressure, the association between the genetic
Examples of Mendelian randomization analysis 85
variants and exposure is less proximal, giving more opportunities for viola-
tions of the IV assumptions. A non-null Mendelian randomization estimate
is indicative that genetic predictors of the exposure are also associated with
the outcome, but there may be an alternative causal pathway other than that
through the exposure of interest. An analogous situation is inferring a causal
effect of a specific biomarker based on a pharmaceutical intervention with
multiple effects, such as those of statins on lipid fractions and inflammation
markers. The existence of pleiotropic associations of variants is particularly
likely if a large number of variants is included in the analysis.
In conclusion, the reliability of the findings from a Mendelian randomiza-
tion study depends heavily on both the validity of the variant(s) used as an
IV, and the power of the analysis to detect a clinically relevant causal effect.
Before further considering the statistical properties of IV estimators, in the
next chapter we consider a more fundamental question: what does a Mendelian
randomization estimate represent?
6
Generalizability of estimates from Mendelian
randomization
In the previous chapters, we have discussed the meaning of causation and pre-
sented methods and examples of estimating causal effects using instrumental
variables (IVs). In this chapter, we consider the interpretation of causal effects
assessed and estimated in Mendelian randomization, and address the question
of under what circumstances a Mendelian randomization estimate may be a
reliable guide to the effect of an intervention on the exposure of interest in
practice.
87
88 Mendelian Randomization
causal effect due to the change in the exposure as a result of the presence of
the genetic variant is similar to the causal effect due to the proposed inter-
vention on the exposure. There are several reasons why these effects may be
unequal, as we now discuss.
TABLE 6.1
Association of five SNPs with log-transformed low-density lipoprotein choles-
terol (LDL-C) and coronary heart disease (CHD) risk. Causal estimates of
odds ratio for 30% reduction in LDL-C on coronary heart disease from
Mendelian randomization using each SNP in turn.
1.00
Per allele odds ratio of CHD
0.95
0.90
0.85
−4% −2% 0
Per allele percentage change in LDL−C
FIGURE 6.1
Estimates of percentage change in low-density lipoprotein cholesterol (LDL-C)
and odds ratio of coronary heart disease (CHD) per LDL-C decreasing allele
for five SNPs (point estimates with 95% confidence intervals), plus estimate of
causal effect of LDL-C on CHD risk from Mendelian randomization using all
5 SNPs (solid line) with 95% confidence interval (dotted lines). Proportionate
effect from meta-analysis of statin use on CHD risk in RCTs (dashed line with
dotted lines for 95% confidence interval) is displayed for comparison.
Generalizability of estimates from Mendelian randomization 93
6.3 Discussion
External validity in epidemiology is often thought of in terms of generaliz-
ability to a population other than the one considered in the original study
[Dekkers et al., 2010]. Although variation in populations may cause some dif-
ficulties, the differences between the change in exposure levels associated with
natural genetic variation and with any proposed clinical intervention on the
exposure lead to inescapable problems in generalizing Mendelian randomiza-
tion estimates to clinical questions of interest.
Mendelian randomization is a useful tool for exploring causal relationships
between modifiable exposures and outcomes of interest. It is one of the few
methodologies that can aid the selection of targets for therapeutic interven-
tion. However, it would be misleading to assume that the estimate from a
Mendelian randomization study gave the definitive answer to every question
of causal relevance of an exposure. Mendelian randomization estimates are es-
pecially relevant when the effect of interest is that of a long-term population-
based intervention; otherwise, although a Mendelian randomization approach
94 Mendelian Randomization
higher triglyceride levels [Beer et al., 2009]. This observation may suggest that
additional monitoring of triglyceride levels would be advisable in clinical trials
of glucokinase activators.
of the causal null hypothesis if there is no causal pathway from the genetic
variant(s) to the outcome not via adiposity, in spite of the IV assumptions for
BMI being violated.
For these reasons, some authors have questioned whether causal effect esti-
mates should ever be considered as part of a Mendelian randomization analysis
[VanderWeele et al., 2014]. Although there is a danger of estimates being over-
interpreted, there are several reasons why causal estimates are useful. First,
in epidemiology generally, estimates with confidence intervals are preferred
to hypothesis tests with p-values, as they are more informative [Sterne and
Davey Smith, 2001]. For instance, if a p-value does not achieve conventional
levels of statistical significance, a point estimate with a confidence interval al-
lows the reader to judge in a quantitative way whether the null result reflects
a lack of evidence or a genuine negative finding in comparison with either
the observational association, or with a minimal clinically relevant effect. Sec-
ondly, if several genetic variants are valid instrumental variables for the same
exposure, greater power to detect a clinically relevant causal effect can be ob-
tained using information on all of the variants simultaneously rather than that
using the variants individually. Causal estimates from multiple variants also
enable the quantitative comparison of the consistency of genetic associations,
using a heterogeneity or overidentification test, as a statistical assessment of
pleiotropy (Section 4.5.3). Finally, although the causal estimate in a Mendelian
randomization analysis may not be equal to the effect of an intervention in
the exposure, it does have a well-defined interpretation as the effect of an
intervention in the genetic code at conception. Hence, although assessment
of causation should be the primary outcome of a Mendelian randomization
investigation, the estimate of a causal effect also has considerable utility.
6.4 Summary
In Mendelian randomization, differences in the exposure distribution due to
genetic variation are materially distinct from the change due to any proposed
therapeutic intervention on the exposure, and so may affect the outcome dif-
ferently. Consequently, it may be misleading to generalize the magnitude of a
Mendelian randomization association to the effect of a potential intervention
on the exposure in practice. Awareness of this is important for the use of
Mendelian randomization in target-based drug development.
In this chapter, we have considered qualitative and quantitative issues
relating to the interpretation of causal effects using Mendelian randomization
and their relationship to the effects of interventions. In the next part of this
book, we consider statistical aspects of Mendelian randomization analyses
relating to the topics discussed in this and previous chapters.
Part II
Statistical issues in
instrumental variable
analysis and Mendelian
randomization
7
Weak instruments and finite-sample bias
7.1 Introduction
Although IV techniques can be used to give asymptotically unbiased estimates
of causal effects in the presence of confounding, these estimates suffer from bias
when evaluated in finite samples [Nelson and Startz, 1990]. A weak instrument
(or a weak IV) is still a valid IV, in that it satisfies the IV assumptions, and es-
timates using the IV with an infinite sample size will be unbiased; but for any
finite sample size, the average value of the IV estimator will be biased. This
bias, known as weak instrument bias, is towards the observational confounded
estimate. Its magnitude depends on the strength of association between the
IV and the exposure, which is measured by the F statistic in the regression
of the exposure on the IV [Bound et al., 1995]. In this chapter, we assume
the context of ‘one-sample’ Mendelian randomization, in which evidence on
the genetic variant, exposure, and outcome are taken on the same set of indi-
viduals, rather than subsample (Section 8.5.2) or two-sample (Section 9.8.2)
Mendelian randomization, in which genetic associations with the exposure
and outcome are estimated in different sets of individuals (overlapping sets in
subsample, non-overlapping sets in two-sample Mendelian randomization).
We illustrate this chapter using data from the CRP CHD Genetics Col-
laboration (CCGC) to estimate the causal effect of blood concentrations of C-
reactive protein (CRP) on plasma fibrinogen concentrations (Section 1.3). As
the distribution of CRP is positively skewed, we take its logarithm and assume
a linear relationship between log(CRP) and fibrinogen. Although log(CRP)
99
100 Mendelian Randomization
and fibrinogen are highly positively correlated (r = 0.45 to 0.55 in the ex-
amples below), it is thought that long-term elevated levels of CRP are not
causally associated with an increase in fibrinogen.
We first demonstrate the direction and magnitude of weak instrument bias
for IV estimates from real and simulated data (Section 7.2). We explain why
this bias comes about, why it acts in the direction of the confounded observa-
tional association, and why it is related to instrument strength (Section 7.3).
We discuss simulated results that quantify the size of this bias for different
strengths of instruments and different analysis methods (Section 7.4). When
multiple IVs are available, we show how the choice of IV affects the variance
and bias of IV estimators (Section 7.5). We propose ways of designing and
analysing Mendelian randomization studies to minimize bias (Section 7.6).
We conclude with a discussion of this bias from both theoretical and practi-
cal viewpoints, ending with a summary of recommendations aimed at applied
researchers on how to design and analyse a Mendelian randomization study
to minimize bias from weak instruments (Section 7.7).
TABLE 7.1
Estimates of effect (standard error) of log(CRP) on fibrinogen (µmol/l) from
the Copenhagen General Population Study (N = 35 679) divided randomly
into substudies of equal size and combined using fixed-effect meta-analysis:
observational estimates using unadjusted linear regression, IV estimates using
2SLS. Mean F statistics averaged across substudies from linear regression of
log(CRP) on three genetic variants.
same time, the standard error of the pooled IV estimates decreases. We can
see that even where the number of substudies is 16 and the average F statistic
is around 10, there is a serious bias. The causal estimate with 16 substudies
is positive (p = 0.09) despite the causal estimate with the data analysed as
one study being near to zero.
The analyses in Table 7.1 and simulations in Figure 7.1 show that IV
estimates can be biased. This bias has two notable features: it is larger when
the F statistic for the IV–exposure relationship is smaller, and it is in the
direction of the confounded observational estimate.
X = α1 G + α2 U + εX (7.1)
Y = β1 X + β2 U + ε Y
2 2
U ∼ N (0, σU ); εX ∼ N (0, σX ); εY ∼ N (0, σY2 ) independently.
2
We initially assume that σX = σY2 = 0 for ease of explanation.
If ūj is the average confounder level for the subgroup with G = j (where
j = 0, 1), an expression for the causal effect from the ratio method is:
∆Y β1 ∆X + β2 ∆U β2 ∆U
β1R = = = β1 + (7.2)
∆X ∆X α1 + α2 ∆U
where ∆U = ū1 − ū0 is normally distributed with expectation zero; ∆X and
∆Y are defined similarly. When the instrument is strong, α1 is large compared
to α2 ∆U . Then the expression β1R will be close to β1 . When the instrument is
weak, α1 may be small compared to β2 ∆U and α2 ∆U . Then the bias β1R − β1
is close to αβ22 , which is approximately the bias of the confounded observational
association (it is exactly this if α1 is zero). This is true whether ∆U is positive
or negative. Figure 7.2 (top panel) shows how the IV estimate bias varies with
∆U . Although for any non-zero α1 the IV estimator will be an asymptotically
consistent estimator as sample size increases and ∆U tends towards zero, a
bias in the direction of the confounded association will be present in finite
samples. From Figure 7.2 (top panel), the median bias will be positive, as the
Weak instruments and finite-sample bias 103
0.4
Density
Density
0.2
0.2
0.0
0.0
−2 −1 0 1 2 −2 −1 0 1 2
Density
0.4
0.3
0.0
0.0
−2 −1 0 1 2 −2 −1 0 1 2
Density
0.6
0.4
0.0
0.0
−2 −1 0 1 2 −2 −1 0 1 2
FIGURE 7.1
Histograms of IV estimates of a null causal effect using weak instruments
from simulated data for six strengths of the IV–exposure association. Average
F statistics and median IV estimates for each scenario are shown.
104 Mendelian Randomization
α1
estimate is greater than β1 when ∆U > 0 or ∆U < − α 2
, which happens with
probability greater than 0.5.
This also explains the heavier negative tail in the histograms in Figure 7.1.
The estimator takes extreme values when the denominator α1 +α2 ∆U is close
to zero. Taking parameters α1 , α2 and β2 as positive, as in the example of Sec-
tion 7.2.2, this is associated with a negative value of ∆U , whence the numer-
ator β2 ∆U will be negative. As ∆U has expectation zero, the denominator
is more likely to be small and positive than small and negative, giving more
negative extreme values of β1R than positive ones.
2
If there is independent error in X and Y (that is, σX and σY2 in equation
(7.1) are non-zero), then the picture is similar, but more noisy, as seen in
Figure 7.2 (bottom panel). The expression for the IV estimator is:
β2 ∆U + ∆εY
β1R = β1 +
α1 + α2 ∆U + ∆εX
where ∆εX = ε̄X1 − ε̄X0 and ∆εY = ε̄Y 1 − ε̄Y 0 are defined analogously to ∆U
above.
8
6
1 − β1
4
2
Bias: βR
0
−2
−4
−6
(G = 0, 1, or 2):
xi = α1 gi + ui + εXi (7.3)
yi = β1 xi + ui + εY i
2 2
ui ∼ N (0, σU ); εXi ∼ N (0, σX ); εY i ∼ N (0, σY2 ) independently.
2
We set β1 = −0.4, σU = 12 , σX2
= 0.22 , and σY2 = 0.22 , and take four val-
ues for the strength of the IV (α1 = 0.5, 0.2, 0.1, and 0.05) corresponding
to expected F statistics of 100, 16, 4.7, and 2.0. The mean levels of exposure
and outcome for each genetic subgroup from each simulated dataset are plot-
ted (Figure 7.3), representing joint density functions for each subgroup. To
examine the sampling distribution of the IV estimate, we draw one point at
random from each of these distributions; the gradient of the line through these
three points is the 2SLS IV estimate. When the instrument is strong, the large
differences in exposure between the subgroups due to variation in the IV will
generally lead to estimating a negative effect of exposure on outcome. When
the instrument is weak, the differences in exposure between the subgroups due
to the IV are small and the positively confounded observational association is
more likely to be recovered.
x
xxx
x xx
x xxxxxxxxxxx xxxx x
x x xx xxx xx x x
x x xxxx xx
x x
x xx x x xx xx xxxxxxxxxxx x
x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx x
xxx xxxxxxxxxxx oo
Outcome
Outcome
xxxxx xx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx x o
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxx x x
x o o oo ooo o
x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x o oo o o
xxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x
x x ooooooooo o o
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x xx xxx x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxx o ooooo o o o o
xxxxxxxxxxxxxxxxxxxxxxxxxxx x x x x x x x x x xxx x x o o
xxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxx xxx xxxxxxxxxxxxxxx xxxxxxxxxxx xxxxx x o ooo
ooooooooo ooooo
ooooo
oo
xxxxxxxxxxxxxxxxxxxxxxxxxxx x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx o ooooooo ooooo ooooo
o
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x x xxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x o o oo
o
oooo
o o o
oo o
o
o o o
oo
o o o o
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x xx xxxxxxxxxxxxxxxxxxxxxxxx x xx x x
x x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxx
oooooo o oo
oooooooooo oo
o
ooo
ooooooooooooo
oo
oo ooooooooo
ooo ooo
x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx o x x xxxxxxxxxx xx oooooo
ooo oo
oooooooooo ooooooooooo oooooooooo oo
ooo
x xx xxxxxxxxxxxxxxxx o xx xxxxxxxxxxxxxxxxxxxxxxx xx ooooo ooooo o
o o
oooo ooo oooo
oo
oo
ooo
oo
o oooo oo o
xx xxxxxxxxxxxx x ooo o ooooo xx x xxxxxxxx xxxx
x oo
oooooo
oo ooo
oo oo
ooo ooooo
o o
ooooo oo
x o oooo oooooooooooo x xxxxxxxxxxxxxxxxx xx
oooooo oo
ooo o
oo
oooo oooo
oo
o
o oo oo
oooo oo ooooooo
xx ooooo ooooooooo oooooooo ooooooo oooo oooooo oooooooooo
ooooooo ooo
ooo
oooooooo oooo
o
oo xxx xxxxx x o o o ooo o o oo
ooo
ooo
o ooo
oo o
o
o o o
x x
ooo oooo oo
o o
o ooo oo
o
ooooo
oo
oo
ooo
o
oooo
oooooooooo o xx x o ooooo ooooooo
oo
o oooo
oooo
oo
oooo
oo
oooooo
o
ooo
oo oo
oooo
oooooo
xx x o
ooo o
o
oooo o
ooo
o
oo
oo oo o
oo
o
ooooooooo o o o x x oooooo
ooooo
ooo oo ooooooo
o
oooo
o
oo o
oo oo
oooooo o
oo oo oooo
oo oooo oo
oooo oo
oo
oo oooo oooooo ooo x ooooo o
oooo
o ooooo ooooooo o
o oo
o
oooo o
o oo oooooo
o oooo
oo
oo ooo
oooooo
o
oo o
oo
oooooooooooo oooo o ooo oo
o o
oo oooooo
ooooooooooo
o oo
oooo
o
oooo
ooooo o
ooo
o o
oo o
ooooo
oo oo
o o
oooo
oooo oo
oooooo oo ooo
o
ooooooooooo oo
oo oooooooo
oo oooooo ooooo ooooo
o oooo o
oooooooooooooo
oo o oo
ooo ooo oo o oo o
oooo o
oo
oooo
ooo
oo oo
oooooo
o oo oooo
ooo oooooooo o ooooo ooooo ooo o o
o o o oo
oo o oo o oo o
ooooooooooooooo
o oo o o oooo
oo o o o
ooooooo o o
ooo o oooo
o
Exposure Exposure
x
x x
x x o xx
x x o x x o
x x x x
xxx xxxx xx x xx
x xx x x x o x x xx xxxxxx xxo o o o
x x x x x x x x x x xx ox ox
xxxx xxxxx xxx x o o x
xxxx xx x xxx xxxxx o o ooo ooo
x x xxxx xxxxxxxxxxxxxxxxxxxxx o oo x xxxx xxxxxxo xx xxx x oxoo x oo x
ox o o o
x x x x x x x
x x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx x ooooo oooooooo ooo oo
x x x x o x x xxxxxxxxxxxxo xxxxooxoxxoxxooooo ooooooo oo o
xoo
Outcome
Outcome
xx xxxxoxxxxxo
xx xxxx xx xxxxxxxxxxxxx xxx oo oo oo oo oo xxxxxxxxxxxxxxxxxxxxxxo x xxxxooo xxoox oo xoooo
o
xooooooooooo oo o
o
x xxxxxxxxxxxxxxxxx xxxxxxxxxxx x xx oxooo o oooooooo o oo ooooo o x x x x xxxxxxo
x
x xx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx oooooo oooo xx x x x
xxxxxxxxxxxxxxxxxxxxoxo
x x xxxxxxxxo xxxxoxxxoxxoxxo
x xxx x x o x ooo oxo
oooo o o
oooo ooooooo ooooooo
o oo
o oo x xox o xox xooxoooo o o o o
xx xxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxx x xx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx oooo oooooooo
ooo
ooo ooo
oooo o ooo
oo ooo
o oooo
x
x x x xxxxxxxxxxxxxxxxxxoxxxxxxxxo x x x x
xxxxxoxxo x x
o o o
xooooo o
xooooooo oo ooo
oo ooooooo o o
x ooooooooooooooo oo oooooo ooo ooo x xxxxxxxxxxxxxxxxxxxxxxxxo xxoxoxxxoxxo xxxxxoooo xooxoooooxoooooo o ooooooo
x x xx
x xxx xxxxxxxxxxxxxxxxxxxxxxxxxxx xxxx ooooooooo x ooo
ooo oooooooooooo o o xxxx xx xxxxxxxxxxx xxxxoo xooxooooooo
xo ooooo
oo oooooo
oooooo ooo o
x x x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx oxooooo o oooooo oooo o o xxx xxxxxoxxxoxo xooxxo xooxoooo oooooooooooo
o ooo oooo o o
oooooo
oooo
ooo oo o
ooooo o
ooooo o o x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxoxxxxxxxxxxxxxxooxo ooxoo xoxoo xooo ooo ooo
oooo oo o
xx xx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx x
x x x x xxx ooooooooooooo o
ooo ooooooo
oo ooooooooooo
o o o ooooo o x xxxxxxxxxxxxxxxxxxxoxxxxxxxxxxoxxoo xooxooxxo oxooxo ooxo oo
oooo oo
oxoooooo
o ooo
o
ooooooo
ooooooo oo o
x xxxxxxxxxxxxxxxxxxxxxxxxxx x x o oo o oooooo oooooo oo o oooooo
ooo
o oo
oooo oooooooo oo
ooo o x xxxxxxx xxoxxxoxxxxxxxoxxoxoxxxxoxoooxooxoxxoxxo oxxooooooooooo
o oooooxoooooooooooooooooooo
xx xxx xxxxxxxxxx xxxxxxxxxxxxxxxxxxooxooooooooooo oooooo
o oo oooooo
ooo oooooooo
o x x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxo
xxxxxoxoxoxxooxoxo x x x
o o ooo o o o oo
o o
x xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx oooooooooo ooooooo oooooooo oooo oooooooo oooo o oo
x x o
oxxoxooooxo o
x
ooxo
o
xoxoooooo o
oo
ooooo
o o
oooooooooooo oo oo o o o
x xx xxx xxxxxxxxxxxxxxx x xoo
x o
oooo oooooo
ooo ooooo
ooo oooooo ooo
oo ooooooo
o ooooo oooo o x xxxxxxxxxxxxxxxxxxxxxxxxoxxxxxxxoxxoxxoxoxo xoxo xxxoooxo
ooxooooo oooooooo ooooo
o o
ooo o
x x xxxxxxxxxxxxxxxxxxxxxxxxxxxx x ox xo
ooooo o
o ooo o oo ooo x xxxoxxo xo oo xxo xxoxxooxoxo oxoo
ooo oo
xoooooo
ooooooo
o oo o o
o o ooo o oo o o
ooooooo oo
o ooooooo oo
o oo o x x xxxxxxx xxxxxxxxxxxxxoxxxo
xxxxoxoxxoooxooo
o
xoo ooooooo oooo
o
oooooooo
oooo oooooo oo
o
x xxxxxx xx xx x x o oooooooo o ooooooooo
oooooo
o o o o
oooooooooooo oo oo o x xxxxxxxxxxxxxoxxxxo xxoooxoooxoxo xo o
o
ooooo oo
o oooo o o
xx xxxxx xxx xx x xo o ooooooo o ooooooooo ooo
ooooo oo oo xx x xxx xxxxxxxxxxxxxxxxxxxxxxxxo
o xo ooo
xoooooooooooooo
ooo oooo oooo ooooo ooooooo ooo x xx xxxxx xxoxxoxoxoo ox xo ooo
oxxoxo oooooo
oooooooo
x xxxxxx x xx x o o ooo oo
oo ooooooo oo x xx x oooo xxoooxooo o
ooo
oo o o
o o o
xx o ooooo o o xxxxxoxxxox xoo oxo
o
oooooooooo ooooooo
o
x o oooo ooooooooo oo o oo x x xxx ooxox oooo ooxooo oooooooo o
o
o oo oo o oo x x x o o
xx xx
x x
oo oo ooo
o o ooo ooooo
o xx x x o o ooox oo oo o ooo
x x o oo o o
o oo x o o oo o
o o x
x o o
o o
Exposure Exposure
FIGURE 7.3
Distribution of mean outcome and mean exposure levels in three genetic
subgroups (indicated by different symbols and shades of grey) for various
strengths of the instrument, with expected values of the F statistic. One point
of each colour comes from each of 1000 simulated datasets. The IV estimate
in each simulation is the gradient of the line through the three points.
108 Mendelian Randomization
10% of the bias in the observational association estimate. However, weak in-
strument bias depends in a graded way on the F statistic, and such cut-offs are
not always helpful or sensible. Biases less than this, corresponding to greater
F statistics, can be important in practice. Moreover, as explained later in this
chapter, there is an important distinction between the expected F statistic,
on which the magnitude of the bias depends, and the F statistic observed in
a particular dataset.
With a single IV, the expected value of the 2SLS estimate, and hence the
bias, is undefined (Section 4.1.6). Simulations have shown that the median
bias of the 2SLS method with a single IV (or equivalently the ratio or LIML
method) is close to zero even for IVs with expected F statistics around 5, where
the median bias is defined as the difference between the median estimate and
the true value [Burgess and Thompson, 2011].
Other methods, such as likelihood-based methods, are less susceptible to
bias. Although the mean bias of the LIML estimate is undefined (Section
4.3.2), the median bias is close to zero [Angrist and Pischke, 2009]. Simulations
for Bayesian methods for IVs with expected F statistics around 5 have shown
mean and median bias close to zero [Burgess and Thompson, 2012].
εY i :
6
X
xi = α1k gik + α2 ui + εXi (7.5)
k=1
yi = β1 xi + β2 ui + εY i
ui , εXi , εY i ∼ N (0, 12 ) independently.
We set β1 = 0, α2 = 1, β2 = 1 so that X is observationally strongly posi-
tively associated with Y , but the causal effect is null. We take parameters for
the genetic association α1k = 0.4 for each genetic instrument k, corresponding
to a mean F statistic of 10.2. We used a sample size of 512 divided equally
between the 26 = 64 genetic subgroups. The IVs are uncorrelated, so that the
variation in X explained by each IV is independent, and the mean F statistics
do not depend greatly on the number of IVs (mean 10.2 using 1 IV, 11.3 using
6 IVs).
Table 7.2 shows the median and 95% range of the estimates from the
2SLS and LIML methods and the mean estimate for the 2SLS method using
all combinations of all numbers of IVs as the instrument, with the mean
across simulations of the F statistic for all the instruments used. We also give
results using the IV with the greatest and lowest observed F statistics in each
simulation, as well as using all IVs with an F statistic greater than 10 in
univariate regressions of exposure on each IV.
Using 2SLS, as the number of IVs increases, the bias increases, despite the
mean F statistic remaining fairly constant. This is because there is a greater
risk of imbalances in confounders between the greater number of genetic sub-
groups defined by the instruments. The data are being subdivided in more
different ways, and so there is more chance of these divisions giving genetic
subgroups with different average levels of confounders. However, the variabil-
ity of the IV estimator decreases. This is because a greater proportion of the
variance in the exposure is modelled. The greatest increase in median bias is
from one IV to two IVs, and coincides with the greatest increase in precision.
With the 2SLS method, we therefore have a bias–variance trade-off in deciding
how many IVs to use [Zohoori and Savitz, 1997].
While LIML provides estimates which are slightly more variable than
2SLS, a similar increase in precision with the number of IVs is observed, but
no increase in bias. For 2SLS, the mean estimates are slightly smaller than the
median estimates presented. In the case of a single IV, the theoretical mean
is infinite (Section 4.1.6). For LIML, the mean bias is infinite for all numbers
of IVs (Section 4.3.2).
Using the single IV with the greatest F statistic gives markedly biased
results, despite a mean F statistic of 23.9. There is a similar bias only using
IVs with F > 10. In the simulation, each IV in truth explains the same
amount of variation in the exposure. If the IVs are chosen to be included in
an analysis because they explain a large proportion of the variation in the
exposure in the data under analysis, then the estimate using these IVs is
Weak instruments and finite-sample bias 111
additionally biased. This is because the IVs explaining the most variation will
be overestimating the proportion of true variation explained, due to chance
correlation with confounders. In the notation of Section 7.3.1, ∆U is large
and, having the same sign as α1 , leads to an estimate biased in the direction
of αβ22 . Conversely, if the IV with the least F statistic is used as an instrument,
the IV estimator will be biased in the opposite direction to the observational
association, as shown in Table 7.2.
So we see that if the F statistic is used either to choose between instru-
ments, or via a rule such as only including an IV in the analysis if F > 10, this
procedure itself introduces a selection bias which can be greater in magnitude
than the bias from weak instruments [Hall et al., 1996]. In a more realistic
example, IVs would not all have the same true strength. However, the large
sampling variation in F statistics means that choosing between IVs on the
basis of a single measured F statistic is unreliable. One solution to this in
practice is to use the strength of the IVs in an independent dataset to de-
termine the IVs to include in an applied analysis, or to use an allele score to
summarize multiple variants as a single IV (see Chapter 8).
TABLE 7.2
Evaluation of bias: Median and 95% range of estimates of β1 = 0 using 2SLS
and LIML methods, mean estimate using 2SLS method and mean F statistic
across 100 000 simulations using combinations of six uncorrelated instruments,
using the instrument with the greatest/least F statistic, and using all instru-
ments with univariate F statistics greater than 10.
1 2 3 4 5
Number of instruments
FIGURE 7.4
2SLS IV estimates for causal effect in the Framingham Heart Study of
log(CRP) on fibrinogen (µmol/l) using all combinations of varying numbers
of SNPs as IVs. Point estimates, associated box plots (median, inter-quartile
range, range) and mean F statistics across combinations are displayed.
114 Mendelian Randomization
bias can be reduced by increasing the sample size. Similarly, if there are in-
struments that are not contributing much to explaining the variation in the
exposure, then excluding these instruments will increase the F statistic. In
general, employing fewer degrees of freedom to model the genetic association,
that is using parsimonious models, will increase the F statistic and reduce
weak instrument bias, provided that the model does not misrepresent the
data [Pierce et al., 2011; Palmer et al., 2011a]. Simulations have shown that,
even when the true model is only approximately linear in the IV, a per allele
genetic model reduces bias [Burgess and Thompson, 2011].
However, it is not enough to simply rely on an F statistic measured from
data to inform us about bias [Hall et al., 1996]. Returning to the example from
Section 7.2.1 where we divided the Copenhagen General Population Study into
16 equally sized substudies with mean F statistic 10.8, Figure 7.5 shows the
estimates of these 16 substudies using the 2SLS method with their correspond-
ing F statistics. We see that the substudies which have greater estimates are
the ones with larger F statistics; the correlation between F statistics and point
estimates is 0.83. The substudies with higher F statistics also have tighter CIs
and so receive more weight in the meta-analysis. If we exclude from the meta-
analysis substudies with an F statistic less than 10, then the pooled estimate
increases from 0.23 (SE 0.14, p = 0.09) to 0.43 (SE 0.16, p = 0.006). Equally,
if we only use as instruments in each substudy the IVs with an F statistic
greater than 10 when regressed in a univariate regression on the exposure,
then the pooled estimate increases to 0.28 (SE 0.15, p = 0.06). So neither of
these approaches are useful in reducing bias.
Although the expectation of the F statistic is a good indicator of bias,
the observed F statistic shows considerable variation. In the 16 substudies of
Figure 7.5, the measured F statistic ranges from 3.4 to 22.6. In more realistic
examples, assuming similar instruments in each study, larger studies would
have higher expected F statistics which would correspond to truly stronger
instruments and less bias. However, the sampling variation of causal effects
and observed F statistics in each study would still tend to follow the pattern
of Figure 7.5, with larger observed F statistics corresponding to more biased
causal estimates.
So while it is desirable to use strong instruments, the measured strength
of instruments in data is not a good guide to the true instrument strength.
Echoing the comments of Section 7.5 regarding the inclusion of IVs in a model,
any guidance that relies on providing a threshold (such as F > 10) as an
inclusion criterion is flawed and may introduce more bias than it prevents.
Weak instruments and finite-sample bias 115
FIGURE 7.5
Forest plot of causal estimates of log(CRP) on fibrinogen (µmol/l) using data
from the Copenhagen General Population Study divided randomly into 16
equally sized substudies (each N ≃ 2230). Studies ordered by causal esti-
mate. F statistic from regression of exposure on three IVs. Size of markers is
proportional to weight in a fixed-effect meta-analysis.
116 Mendelian Randomization
TABLE 7.3
2SLS estimates and standard errors (SE) of the causal effect of log(CRP)
on fibrinogen, and F statistic for regression of log(CRP) on IVs, calculated
using each SNP separately and all SNPs together in per allele additive model,
without and with adjustment for log(IL6) in the Cardiovascular Health Study.
variant k (k = 1, 2, . . . Km ) is:
Km
X
xim = α0m + αkm gikm + εXim (7.6)
k=1
yim = β0m + β1 x̂im + εY im
2
εXim ∼ N (0, σX ); εY im ∼ N (0, σY2 ) independently.
The exposure levels are regressed on the IVs using a per allele additive linear
model separately in each study, and then the outcome levels are regressed on
the fitted values of exposure (x̂im ). The terms α0m and β0m are study-specific
intercept terms. Here we assume homogeneity of variances across studies; we
can use Bayesian methods to allow for possible heterogeneity (see Section 9.6).
If the same genetic variants are measured in each study and are assumed
to have the same effect on the exposure, we can use common genetic effects
(i.e. αkm = αk ) across studies by replacing the first line in equation (7.6) with:
K
X
xim = α0m + αk gikm + εXim (7.7)
k=1
If the assumption of common genetic effects is correct, this will improve the
precision of the fitted values (x̂im ) and reduce weak instrument bias.
To illustrate this, we consider the Copenhagen City Heart Study (CCHS),
Edinburgh Artery Study (EAS), Health Professionals Follow-up Study
(HPFS), Nurses Health Study (NHS), and Stockholm Heart Epidemiology
Program (SHEEP), which are cohort studies or case-control studies measuring
CRP and fibrinogen levels at baseline [CCGC, 2008]. In case-control studies,
we use the data from controls alone since these better represent cross-sectional
population studies. These five studies measured the same three SNPs on the
CRP gene: rs1205, rs1130864 and rs3093077 (or rs3093064, which is in com-
plete linkage disequilibrium with rs3093077). We estimate the causal effect
118 Mendelian Randomization
Causal Observational
Study N F df estimate (SE) estimate (SE)
CCHS 7999 29.6 (3, 7995) −0.286 (0.373) 1.998 (0.030)
EAS 650 6.9 (3, 646) 0.754 (0.327) 1.115 (0.056)
HPFS 405 5.3 (3, 401) 0.758 (0.423) 1.048 (0.081)
NHS 385 6.1 (3, 381) −0.906 (0.636) 0.562 (0.114)
SHEEP 1044 10.5 (3, 1040) 0.088 (0.345) 1.078 (0.051)
Different genetic effects 14.4 (15, 10463) 0.021 (0.195)
Common genetic effects 56.6 ( 3, 10475) −0.093 (0.225)
Study-level estimates 0.234 (0.174)
TABLE 7.4
Estimates of effect of log(CRP) on fibrinogen (µmol/l) from each of five stud-
ies separately and from meta-analysis of studies: number of participants (N ),
F statistic (F ) with degrees of freedom (df) from per allele additive regression
of exposure on three SNPs used as IVs, causal estimate using 2SLS with stan-
dard error (SE), observational estimate with SE. Fixed-effect meta-analyses
conducted using individual-level data with different study-level genetic ef-
fects, common pooled genetic effects, and combining study-level estimates
with inverse-variance weighting.
using the 2SLS method with different genetic effects (model 7.6), common ge-
netic effects (model 7.7) and by a fixed-effect meta-analysis of estimates from
each study.
Table 7.4 shows that the studies analysed separately have apparently dis-
parate causal estimates with large SEs. The meta-analysis estimate assuming
common genetic effects across studies is further from the confounded observa-
tional estimates and closer to the IV estimate from the largest study with the
strongest instruments (CCHS) than the model with different genetic effects,
suggesting that the latter suffers bias from weak instruments.
The pooled estimate from the study-level meta-analysis is greater than
those from the individual-level meta-analyses. Although the CCHS study has
about 8 times the number of participants as SHEEP and 12 times as many as
EAS, its causal estimate has a larger standard error. The standard errors in
the 2SLS method are known to be underestimated when the correlation due to
confounding is strong, especially with weak instruments (Section 7.4.2) [Stock
and Yogo, 2002]. Also, Figure 7.5 showed that causal estimates nearer to the
observational association have lower variance. So a study-level meta-analysis
may be biased due to overestimated weights in the studies with more biased
estimates.
Returning to the example of data from the Copenhagen General Popu-
lation Study considered in Section 7.2.1, if we use the IPD (model 7.6) to
Weak instruments and finite-sample bias 119
Different Common
genetic genetic
Substudies Meta-analysis p-value effects p-value effects p-value
1 −0.05 (0.15) 0.76
5 −0.01 (0.15) 0.95 −0.03 (0.15) 0.85 −0.05 (0.15) 0.75
10 0.09 (0.14) 0.54 0.04 (0.14) 0.80 −0.05 (0.15) 0.76
16 0.23 (0.14) 0.09 0.15 (0.14) 0.26 −0.05 (0.15) 0.75
40 0.46 (0.13) < 0.001 0.30 (0.13) 0.02 −0.04 (0.15) 0.77
100 0.83 (0.11) < 0.001 0.68 (0.11) < 0.001 −0.04 (0.15) 0.77
250 1.27 (0.08) < 0.001 1.15 (0.08) < 0.001 −0.04 (0.15) 0.78
TABLE 7.5
2SLS estimates of causal effect (standard error) of log(CRP) on fibrinogen
from the Copenhagen General Population Study divided randomly into sub-
studies and combined: using fixed-effect meta-analysis of substudy estimates,
and using individual patient data (IPD) with different or common genetic
effects across substudies.
7.7 Discussion
This chapter has demonstrated the effect of weak instrument bias on causal
estimates in real and simulated data. The magnitude of this bias depends on
the statistical strength of the association between instrument and exposure.
Weak instrument bias can reintroduce the problem that IVs were developed
to solve. It is misleading not solely because it biases estimates, but because
estimates suffering from the bias do not provide a valid test of the null hy-
pothesis. Weak instruments may convince a researcher that an observational
association that they have estimated is in fact causal. The reason for the bias
is that the variation in the exposure explained by the IV is not large enough to
dominate the variation in the exposure caused by chance correlation between
the IV and confounders.
While the magnitude of the bias depends on the instrument strength
through the expected or mean F statistic, for a study of fixed size and under-
lying instrument strength, an observed F statistic greater than its expected
value corresponds to an estimate closer to the observational association with
120 Mendelian Randomization
observed F statistic than expected, then the causal estimate will be more
biased.
• Coverage levels with weak instruments can be poorly estimated by methods
which rely on assumptions of asymptotic normality.
• Data-driven choice of instruments or analysis can exacerbate bias. In partic-
ular, any threshold guideline such as ensuring that an observed F statistic
is greater than 10 is misleading. Methods, instruments, and data to be
used should be specified prior to data analysis. Meta-analyses based on
study-specific estimates of causal effect are susceptible to bias.
• Bias can be alleviated by use of measured covariates and parsimonious
modelling of the genetic association (such as a per allele additive SNP
model rather than one coefficient per genotype). This should be accom-
panied by sensitivity analyses to assess potential bias, for example from
model misspecification.
• Bias can be reduced substantially by using LIML, Bayesian and allele score
(see next chapter) methods rather than 2SLS, and bias in practice with a
single IV should be minimal. Nominal coverage levels can be maintained
by the use of Fieller’s theorem with a single IV, and confidence intervals
from the Anderson–Rubin test statistic or Bayesian MCMC methods with
multiple IVs.
8
Multiple instruments and power
8.1 Introduction
Although instrumental variable (IV) methods give estimates which are con-
sistent for the causal effect, their variance is typically much larger than the
variance of the estimate from an observational analysis [Davey Smith and
Ebrahim, 2004]. This is because the variation in the exposure explained by
the IV is usually small. If there are multiple IVs available, a more precise
causal effect estimate can be obtained by incorporating data on all the IVs
simultaneously to estimate a single causal effect [Palmer et al., 2011a]. How-
ever, two problems arising from including multiple IVs in an analysis are weak
instruments and missing data.
When there are large numbers of genetic variants, several IV methods
give estimates which are biased in the direction of the observational estimate
with incorrectly sized confidence intervals (see Chapter 7). Allele scores are a
convenient way of summarizing a large number of genetic variants associated
with an exposure. Using a univariate allele score as a single IV rather than
each genetic variant as a separate IV helps resolve problems in IV estimation
resulting from weak instruments.
Sporadically missing genetic data typically arise due to difficulty in in-
terpreting the output of genotyping platforms. If the output is not clear, a
“missing” result is recorded. Hence, although efficiency will be gained from
using multiple instruments, this may be offset in a complete-case analysis due
to more participants with missing data being omitted. Rather than omitting
participants, methods for incorporating participants with partially missing
data can be employed.
123
124 Mendelian Randomization
several parametric assumptions are made in specifying the allele score, such
as additivity in the genetic model with no interactions between variants.
var(RYIV )
var(β̂1 ) = (8.1)
N var(X) ρ2GX
from the delta method expansion (equation 4.9), and ignores uncertainty in
the IV–exposure association; the subsequent calculations therefore represent
the power to detect an association between the IV and the outcome.
The asymptotic variance of the conventional regression (ordinary least
squares, OLS) estimate of the association between the exposure X and the
outcome Y is given by the formula:
var(RYOLS )
var(β̂OLS ) = (8.2)
N var(X)
squared correlation ρ2GX = 0.005 to 0.03 and the sample size as before. In
each of the figures, the power to detect a positive causal effect is displayed;
this tends to 0.025 as the sample size tends to zero. Sample sizes of several
thousands are required to achieve adequate power in settings typical for many
Mendelian randomization studies (modest causal effects, low correlation of
genetic variants with exposure). Code to customize these calculations for a
given scenario is available [Brion et al., 2013] together with an online calcula-
tor (http://glimmer.rstudio.com/kn3in/mRnd/).
100
β1s=0.30
β1s=0.25
β1s=0.20
80
β1s=0.15
β1s=0.10
β1s=0.05
60
Power (%)
40
20
0
FIGURE 8.1
Power curves with two-sided significance level α = 0.05 varying the sample
size for a fixed value of the IV strength (ρ2GX = 0.02) and different values of
the size of the standardized causal effect (β1s = 0.05 to 0.3) with a single IV.
Multiple instruments and power 129
100
ρ2GX=0.030
=0.005
ρ2GX=0.025
=0.010
ρ2GX=0.020
=0.015
80
ρ2GX=0.015
=0.020
ρ2GX=0.010
=0.025
ρ2GX=0.005
=0.030
60
Power (%)
40
20
0
FIGURE 8.2
Power curves with two-sided significance level α = 0.05 varying the sample
size for a fixed size of standardized causal effect (β1s = 0.2) and varying the
value of the IV strength (ρ2GX = 0.005 to 0.3) with a single IV.
130 Mendelian Randomization
where β̂Y |G and β̂X|G are the genetic association estimates with the outcome
and exposure respectively.
The sample size for an IV analysis can therefore be approximated by con-
sidering the variance of the coefficient β̂Y |G . Assuming the outcome is binary
(Y = 0 or 1) and using a logistic regression model to obtain β̂Y |G , the variance
of the IV estimate is approximately:
1
var(β̂1 ) = (8.7)
N var(X) ρ2GX P(Y = 1) P(Y = 0)
where P(Y = 1) and P(Y = 0) are the probabilities of the two outcomes for
Y in the sample population (so the proportions of cases and controls in a
case-control study).
The sample size required to detect a standardized causal effect of size β1s
(the log odds ratio per standard deviation increase in X) with 80% power and
a two-sided significance level of α = 0.05 is therefore:
7.848
N= 2 ρ2 . (8.8)
β1s GX P(Y = 1) P(Y = 0)
100K
Number of cases required for 80% power
ρ2GX=0.005
ρ2GX=0.010
80K
ρ2GX=0.015
ρ2GX=0.020
ρ2GX=0.025
60K
40K
20K
5K ρ2GX=0.030
ρ2GX=0.01
ρ2GX=0.02
40K
ρ2GX=0.03
ρ2GX=0.05
ρ2GX=0.08
30K
20K
10K
2K
FIGURE 8.3
Number of cases required (assuming an equal number of controls) in a
Mendelian randomization analysis with a binary outcome and a single in-
strumental variable for 80% power with a 5% significance level varying the
size of the standardized causal effect (odds ratio per standard deviation in-
crease in exposure) for different values of IV strength (ρ2GX : 0.005 to 0.03 in
top panel, 0.01 to 0.08 in bottom panel).
Multiple instruments and power 133
TABLE 8.1
Estimate and standard error of causal effect of unit increase in log(CRP) on
fibrinogen (µmol/l) using various SNPs as IVs: analyses for participants (N )
with complete data on SNP used as IV in analysis, and for participants with
complete data on all SNPs.
of posterior probabilities of the number of variant alleles for each SNP in indi-
viduals, or as imputed datasets randomly drawn from the same posterior dis-
tributions. Either multiple imputed datasets (multiple imputations method),
or the posterior probabilities (SNP imputation method) can be used as inputs
in an analysis model. Both approaches acknowledge uncertainty in the impu-
tation process; however, as the imputation and analysis models are performed
separately, there is no feedback between the two stages. Alternatively, impu-
tation can be performed as part of the analysis model using a latent variable
approach, modelling the haplotypes using a multivariate normal distribution
(latent variable method) [Lunn et al., 2006], or by modelling the probability of
an individual having a given set of haplotypes directly using knowledge about
the structure of the data and the prevalence of known haplotype patterns
(haplotype imputation method).
8.5 Discussion
In this chapter, we have considered using multiple instruments in IV analyses.
Using multiple instruments has the potential to reduce the standard error
of causal estimates, but if there are sporadically missing genetic data, this
increase is offset by a decrease in sample size in a complete-case analysis.
136 Mendelian Randomization
TABLE 8.2
Estimate, standard error (SE) and 95% confidence interval of the causal effect
for a unit increase in log(CRP) on fibrinogen (µmol/l) in a complete-case
analysis (N = 3188) and in the entire study population (N = 3693) using
different imputation methods for missing genetic data in the British Women’s
Heart and Health Study.
9.1 Introduction
In general, the variation in the exposure of interest explained by genetic vari-
ants in Mendelian randomization is small, and so adequately powered inves-
tigations typically require large sample sizes. This often demands synthesis of
evidence from multiple, possibly heterogeneous studies.
In this chapter, we first consider assessment of the causal relationship us-
ing data from multiple studies (Section 9.2) before proceeding to methods for
estimating a pooled causal effect. Methods are presented in order of the homo-
geneity and detail of data required from each constituent study. A study-level
meta-analysis requires the least detail, combining the causal effect estimates
obtained in each study (Section 9.3). However, in order to estimate such a
pooled causal effect, each study needs to measure data on genetic variants,
the exposure and the outcome. A summary-level meta-analysis requires more
detailed information from studies, including information which may not be
routinely reported in a published paper but is increasingly being made avail-
able by large consortia (Section 9.4). An individual-level meta-analysis re-
quires individual participant data (IPD) from studies (Section 9.5). However,
individual-level models are the most flexible for addressing the heterogeneity
of data available in each study. The methods are illustrated and compared us-
ing real data (Section 9.6). We discuss extensions to the meta-analysis model
for binary outcomes (Section 9.7), and conclude with a discussion of applica-
tion of the methods presented in practice (Section 9.8).
139
140 Mendelian Randomization
such as estimates of the associations between the genetic variant(s) and each
of the exposure and outcome, or the mean level of the exposure and outcome
in each genetic subgroup. A pooled estimate can be evaluated by combining
data in a hierarchical model, which we now describe.
ηk = β1 ξk . (9.2)
The causal effect β1 is assumed to be the same for all variants (Section 8.5.1).
This and subsequent models in this chapter can be estimated either by nu-
merical maximization of the log-likelihood function or by Bayesian methods
[Thompson et al., 2005], the latter for example using WinBUGS [Spiegelhalter
et al., 2003] or MLwiN [Rasbash et al., 2009].
The correlation ρ can be estimated as part of the analysis, but there is
likely to be little information on the parameter in the data [Riley et al., 2007].
We recommend that the value of the parameter be specified as part of the
model, and a sensitivity analysis performed to assess the effect of varying this
parameter value on estimates; its value should be similar to the observational
correlation between the exposure and outcome.
By combining the estimates of association from multiple variants into a
single estimate of the causal effect, an assumption is made that the variants
provide independent information on the causal effect. If the association es-
timates are derived from the same data, then they will not be independent.
However, if the variants are independently distributed (that is, they are not
in linkage disequilibrium, LD), correlation between these estimates should be
low unless the sample size is particularly small. Simulations using indepen-
dently distributed variants have shown that estimates from the summary-level
142 Mendelian Randomization
data model (9.1) and (9.2) are well-behaved even in the presence of statisti-
cal interactions between the effects of the genetic variants (gene–gene inter-
actions). Similar weak instrument bias was observed to estimates from the
two-stage least squares (2SLS) method, and confidence intervals were appro-
priately sized with correct coverage rates. The efficiency of estimates based on
summary-level data was similar to that of estimates based on individual-level
data [Burgess et al., 2013].
If the genetic variants are correlated in their distributions (that is, they
are in LD), then the association estimates β̂Xk (k = 1, . . . , K) will be corre-
lated, as will β̂Y k (k = 1, . . . , K). This can be accounted for in the likelihood
model by a multivariate normal distribution for the genetic association es-
timates from each variant using estimates of the correlations between the
variants (which will be the same as the correlations between the association
estimates) [Burgess et al., 2014e]. This extension is not discussed further here.
If the variants are correlated, then estimates from equation (9.1) will overstate
precision.
For a single study, a further method has been developed for combining
summary-level data on multiple genetic variants not in LD [Johnson, 2011].
This method combines the ratio estimates β̂β̂Y k from each variant in an inverse-
Xk
variance weighted meta-analysis using asymptotic variances calculated from
the delta method for the ratio of two random variables [Dastani et al., 2012].
This variance is:
σY2 k
. (9.3)
2
β̂Xk
This differs from the formula for the standard error of the ratio estimate
given in equation (4.9) as the uncertainty in the genetic association with the
exposure is assumed to be zero.
The combined inverse-variance weighted (IVW) estimate β̂IV W is:
P −2
k β̂Xk β̂Y k σY k
β̂IV W = −2
. (9.4)
2
P
k β̂Xk σY k
As the method assumes the ratio estimates are normally distributed, and as
the uncertainty in the genetic associations with the exposure is not accounted
for, the precision of IVW estimates is overstated. However, simulations have
shown that the underestimation of confidence intervals may be slight, with an
average 93% coverage probability for a nominal 95% confidence interval in a
plausibly realistic scenario [Burgess et al., 2013]. Therefore the IVW method
may be a reasonable simpler alternative to a likelihood-based model when
Multiple studies and evidence synthesis 143
the genetic associations with the exposure are estimated precisely. However,
likelihood-based models should be preferred for use in practice where possi-
ble, particularly if the genetic associations with the exposure are estimated
imprecisely.
This model is appropriate even if the genetic variants are in LD, as the
genetic subgroups are defined using all the variants, and so the mean values of
the exposure and outcome in the genetic subgroups are independent. However,
unlike estimates of genetic association, data on the mean values of the exposure
and outcome in genetic subgroups are unlikely to be routinely reported in
publications.
Multiple studies and evidence synthesis 145
between the studies, then the value of the heterogeneity parameter will be
close to zero, and the random-effects analysis will approximate a fixed-effect
analysis. If there is considerable heterogeneity, then this provides evidence
against the fixed-effect model and in favour of the random-effects model. A
fixed-effect model may be used if there is a strong argument why a parameter
may be similar across studies (for example, if the separate studies were in
fact centres in a clustered investigation using the same protocol and sampling
individuals from the same population). If there are few studies then it may be
difficult to obtain a precise estimate of heterogeneity, and either a fixed-effect
model or an informative prior on the heterogeneity parameter in a Bayesian
analysis may be employed. A fixed-effect model may also be useful in compar-
ing between meta-analysis coefficients from different analyses to ensure that
the differences were not simply due to changes in the heterogeneity estimate.
The hierarchical nature of the above models is now clear: at the first
(within-study) level, a causal parameter (β1m ) is specified in each study, and
at the second (between-study) level, these causal parameters are pooled to
provide a single causal estimate (β1 ). By evaluating the estimates using a
likelihood-based method in a single model, the meta-analysis is performed in
a single step. This is in contrast with a two-step meta-analysis, as described
in Section 9.3, in which the causal estimates are first estimated in each study,
and then the estimates are combined. By performing the analysis in a sin-
gle step, uncertainty in the model is correctly acknowledged and feedback is
allowed between the two levels of the model.
the models such as equation (9.1), as the correlation between the genetic as-
sociation estimates with the exposure and with the outcome (ρ) would be
zero.
It may be that gene–exposure and gene–outcome association estimates
taken from the literature are not estimated from a single study, but themselves
represent pooled estimates from meta-analyses. These can be combined across
variants using the methods of Section 9.4.1. However, the heterogeneity across
studies will not be modelled as faithfully as in a hierarchical model using the
study-specific association estimates.
dropping the subscript j and the division into genetic subgroups. In model
(9.14), there are up to 3K genetic subgroups and an equal number of ξj pa-
rameters in the model of genetic association with the exposure (assuming each
variant is a biallelic SNP and so takes three values). In model (9.15), there
are K + 1 αk parameters (one parameter for each genetic variant and an in-
tercept parameter), meaning that the genetic association with the exposure is
modelled more parsimoniously.
When the same set of genetic variants has been used in several studies, we
can combine the estimates of genetic association αkm across studies, in the
Multiple studies and evidence synthesis 149
same way as the parameter ξm was combined in equation (9.11). This should
give a more precise model of association in smaller studies and should reduce
weak instrument bias, as instrument strength will be combined across the
studies (Section 7.6.3). Due to possible heterogeneity between populations,
we propose a random-effects model, where we impose a multivariate normal
distribution on the study level parameters αm = (αkm , k = 1, . . . , Km ) with
mean vector µα and variance-covariance matrix Ψ. Note that the intercept
parameters α0m are not pooled, as these depend on the characteristics of each
study population and would not necessarily be similar across studies.
K
X
ξim = α0m + αkm gikm (9.17)
k=1
αm ∼ NK (µα , Ψ)
CCHS EAS
9.4
8.5
9.0
8.0
8.6
7.5
8.2
0.6 0.7 0.8 0.9 1.0 0.4 0.6 0.8 1.0 1.2
HPFS NHS
11
12.5
10
fibrinogen
12.0
9
11.5
8
11.0
0.0 0.5 1.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4
SHEEP
11.0
10.6
10.2
9.8
log−transformed CRP
FIGURE 9.1
Summary plot of mean fibrinogen (µmol/l) against mean log(CRP) (lines are
95% confidence intervals) in each genetic subgroup for five studies. Subgroups
with less than 5 subjects have been omitted; the size of the shaded squares is
proportional to the number of subjects in each subgroup.
152 Mendelian Randomization
TABLE 9.1
Estimates of causal effect (95% confidence intervals, CI) of log(CRP) on fib-
rinogen (µmol/l) from meta-analysis of five studies using study-level (data
on study-specific causal effects combined by inverse-variance weighting),
summary-level (data on genetic subgroups), and individual-level (without and
with pooling of parameters of genetic association): hierarchical models with
deviance information criterion (DIC) for individual-level models.
A log-linear regression model for the genetic associations with the outcome,
or a log-linear model for relating the outcome and exposure could also be
considered; in this case a causal log relative risk parameter would be estimated.
with previous history of disease (prevalent cases) and controls as all non-
diseased individuals. A prospective analysis excludes all prevalent cases and
considers new incident events within the reporting period. An individual who
is censored at the end of the follow-up period is taken as a control in both
the retrospective and prospective analyses. However, while we do not want to
include the individual’s exposure measurement twice, we want to ensure that
the same odds ratio parameter is estimated in both analyses.
In the corresponding model (9.20), we consider genetic subgroup j, con-
taining N1j individuals, n1j of whom are prevalent cases, and N2j (= N1j −n1j )
non-prevalent individuals, n2j of whom have incident events.
Xij ∼ N (ξj , σ 2 ) for i = 1, . . . N2j non-prevalent individuals (9.20)
n1j ∼ Binomial(N1j , π1j )
n2j ∼ Binomial(N2j , π2j )
logit(π1j ) = η1j = β01 + β1 ξj
logit(π2j ) = η2j = β02 + β1 ξj
This model ensures that the same fitted values of the exposure are used in
both logistic regressions without including individuals twice in the regression
of the exposure on the genetic variants. The causal log odds ratio parameter
is β1 , which is assumed to be the same in the retrospective and prospective
analyses.
9.8 Discussion
In this chapter, we have presented a flexible set of models for meta-analysis of
multiple studies in a hierarchical framework. This allows for the efficient syn-
thesis of summary-level and/or individual-level data from different sources.
Although study-level causal estimates can be combined in a conventional
inverse-variance weighted meta-analysis, such an analysis has a number of
technical deficiencies, and does not allow for the inclusion of extra information
from studies where a causal estimate cannot be obtained. More detailed data
present a number of advantages to the researcher, including the incorporation
of information from studies where the exposure or outcome is not measured,
and the efficient estimation of the genetic model of association where the same
genetic variants are measured in multiple studies.
An advantage of the hierarchical structure is that the whole meta-analysis
can be performed in one step. This keeps each study distinct within the hier-
archical model, only combining information from studies at the top level. This
is more effective at dealing with heterogeneity, both statistical and in study
design, than performing separate meta-analyses on each of the gene–exposure
and gene–outcome associations [Thompson et al., 2005].
Multiple studies and evidence synthesis 155
Much of this book has been motivated and illustrated by data collected by the
CRP CHD Genetics Collaboration [CCGC, 2008]. In this chapter, we anal-
yse the entirety of the CCGC data to estimate the causal effect of C-reactive
protein (CRP) on coronary heart disease (CHD) risk as an illustration of the
Mendelian randomization approach, as well as several of the methodological
issues highlighted in this book. We first give an overview of the complete
dataset and address the validity of the genetic variants as instrumental vari-
ables (IVs) (Section 10.1). We then analyse a single study, exemplifying some
features of the data (Section 10.2), before continuing to present an analysis
of the full dataset (Section 10.3). We conclude this chapter with a discussion,
including the interpretation of the results of this analysis (Section 10.4). A
more detailed analysis of these data is available in a published paper [Burgess
et al., 2012].
157
158 Mendelian Randomization
AGES The Reykjavik Study of Healthy Aging for the New Millennium
ARIC Atherosclerosis Risk in Communities Study
BHF-FHS British Heart Foundation Family Heart Study
BRHS British Regional Heart Study
BWHHS British Women’s Heart and Health Study
CAPS Caerphilly Study
CCHS Copenhagen City Heart Study
CGPS Copenhagen General Population Study
CHAOS Cambridge Heart Antioxidant Study
CHS Cardiovascular Health Study
CIHDS Copenhagen Ischaemic Heart Disease Study
CUDAS Carotid Ultrasound Disease Assessment Study
CUPID Carotid Ultrasound in Patients with Ischaemic Heart Disease
DDDD Die Deutsche Diabetes Dialyse (4D) Trial
EAS Edinburgh Artery Study
ELSA English Longitudinal Aging Study
EPICNL European Prospective Investigation in Cancer and Nutrition,
Netherlands Centre
EPICNOR European Prospective Investigation in Cancer and Nutrition, Nor-
folk Centre
FRAMOFF Framingham Offspring Study
GISSI Gruppo Italiano per lo Studio della Sopravvivenza nell’Infarto Mio-
cardico
HEALTHABC Health Aging and Body Composition Study
HIFMECH The Hypercoagulability and Impaired Fibrinolytic Function Mech-
anisms Study
HIMS Health in Men Study
HPFS Health Professionals Follow Up Study
HVHS Heart and Vascular Health Study
INTHEART INTERHEART Study
ISIS International Study of Infarct Survival
LURIC The Ludwigshafen Risk and Cardiovascular Health Study
MALMO Malmo Diet and Cancer Study
MONICA/ Monitoring of Trends and Determinants in Cardiovascular Disease/
KORA Cooperative Health Research in the Region of Augsburg Study
NHS Nurses Health Study
NPHSII Northwick Park Heart Study II
NSC Northern Swedish Cohort Study
PENNCATH University of Pennsylvania Catheterization Study
PROCARDIS Precocious Coronary Artery Disease Study
PROSPER Prospective Study of Pravastatin in the Elderly at Risk
ROTT Rotterdam Study
SHEEP Stockholm Heart Epidemiology Program
SPEED Speedwell Study
UCP Utrecht Cardiovascular Pharmacogenetics Study
WHIOS Women’s Health Initiative Observational Study
WHITE2 Whitehall II Study
WOSCOPS West of Scotland Coronary Prevention Study
TABLE 10.1
Abbreviations for the 43 studies with subjects of European descent in the
CCGC.
Example: The CRP CHD Genetics Collaboration 159
Further details on the individual studies can be found in the main published
paper from the collaboration [CCGC, 2011]. We discuss below issues relating
to the study design for studies in the collaboration, as well as relevant details
for the analysis about the exposure, genetic variants, outcome, and various
covariates.
Mendelian Randomization
HVHS Unmatched case-control (no CRP data) 4407 1040 0 X P X X
INTHEART Unmatched case-control (no CRP data) 4188 1883 0 X X X X
UCP Unmatched case-control (no CRP data) 2011 922 0 X X X P
AGES Tabular data 3219 800 0 X X X X
HEALTHABC Tabular data 1660 584 0 X X X X
MONICA/KORA Tabular data 1675 272 0 X X X X
PENNCATH Tabular data 1509 1022 0 X X X X
Total 162 416 8392 28 089 103 039
TABLE 10.2
Summary of studies in the CRP CHD Genetics Collaboration with subjects of European descent.
polymorphisms (SNPs) in the CRP gene region. Only SNPs located in this
region were considered as potential IVs to ensure maximal plausibility of the
IV assumptions. The region is on chromosome 1 and is responsible for the pro-
duction of CRP and its regulation. The number of relevant SNPs measured
in each study varied from 1 to 13. Over 20 SNPs in total were measured in at
least one study. Four SNPs were pre-specified in the study protocol as the in-
strumental variables to be used in the analysis: rs1205, rs1130864, rs1800947,
and rs3093077 [CCGC, 2008]. These four SNPs show varying degrees of cor-
relation and give rise to five haplotypes which comprise at least 99% of the
genetic variation exhibited in European descent populations. Indeed, over 99%
of individuals in the CCGC had a genotype which was compatible with these
haplotypes. Only 11 studies measured all four of the pre-specified SNPs. Some
studies measured SNPs which are in complete linkage disequilibrium (LD)
with one of the pre-specified SNPs (r2 > 0.97 in European populations in the
HapMap database), and which are used as proxies for these SNPs. 20 mea-
sured all four SNPs or proxies thereof and an additional 17 measured some
three out of these four. Five of the remaining studies measured fewer than
this, and the final study ISIS measured no SNPs which correspond to any of
these four (in ISIS a single SNP rs2808628, also in the CRP gene region, was
used as an IV).
Proxy SNPs are treated as if they are the SNP of interest. We denote rs1205
(or proxies thereof) as g1, rs1130864 (or proxies thereof) as g2, rs1800947 (or
proxies thereof) as g3, and rs3093077 (or proxies thereof) as g4. Overall minor
allele frequencies were 0.34 for g1, 0.30 for g2, 0.06 for g3, and 0.06 for g4.
There was some sporadic missingness in the genetic data in most of the
studies, although this was rarely greater than 10% per SNP and usually much
less. Table 10.2 lists the pre-specified SNPs measured in each study. We found
that an additive per allele model of association was the most appropriate,
with similar coefficients for the per allele increase in the exposure in each
study [Burgess et al., 2012].
FIGURE 10.1
Pooled estimates of standard deviation change in covariate per CRP-increasing
allele change in SNP for a range of covariates and the SNPs used for IV anal-
ysis. Estimates and 95% confidence intervals presented are based on random-
effects meta-analyses of study-specific associations.
164 Mendelian Randomization
10.2.1 Results
We analyse the data separately retrospectively and prospectively using some
of the methods of Chapter 4: two-stage, Bayesian, generalized method of mo-
ments (GMM), and structural mean model (SMM) methods. Results are given
in Table 10.3 using each SNP individually as an IV (analyses using an addi-
tional SNP rs2808630, labelled g5, are also presented here, although this SNP
is not used in the overall meta-analysis). We see that the results from differ-
ent methods are similar throughout, with differences between estimates small
compared to their uncertainty. CHS suggests a significantly positive causal
effect of CRP in some of the prospective analyses; this is not representative
of the totality of the data (Section 10.3).
TABLE 10.3
Causal log odds ratios (standard errors) of CHD per unit increase in log(CRP)
in prospective and retrospective analyses of the Cardiovascular Health Study
(N = sample size, n = number of events) using two-stage, Bayesian, general-
ized method of moments (GMM), and structural mean model (SMM) meth-
ods.
1 In the Bayesian, GMM, and SMM analyses, the estimates using g5 as an IV failed to
converge.
g1
1.0
g2
g3
Prior
0.8
Density
0.6
0.4
0.2
0.0
−4 −2 0 2 4
β1
FIGURE 10.2
Prior and posterior distributions of causal log odds ratio parameter (β1 ) for
retrospective logistic IV analyses of the Cardiovascular Health Study using
SNPs rs1205 (g1), rs1417938 (g2) and rs1800947 (g3). On this horizontal scale,
the prior appears as a flat line at close to zero density.
Example: The CRP CHD Genetics Collaboration 167
g5
0.008
Prior
Density
0.004
0.000
β1
FIGURE 10.3
Prior and posterior distributions of causal log odds ratio parameter (β1 ) for
retrospective logistic IV analysis of the Cardiovascular Health Study using
SNP rs2808630 (g5).
TABLE 10.4
Pooled estimates from univariate inverse-variance weighted random-effects
meta-analysis of per allele effect on log(CRP) (G–X association) and log odds
of CHD (G–Y association) in regression on each SNP in turn, and hetero-
geneity (I 2 represents the percentage of the variability in effect estimates that
is due to heterogeneity rather than sampling error [Higgins et al., 2003]);
causal estimates (X–Y association) for odds ratio of CHD per unit increase in
log(CRP) from meta-analysis using each SNP as the sole IV from the method
of equation (9.11).
TABLE 10.5
Causal estimates of odds ratio of CHD per unit increase in log(CRP) using all
available pre-specified SNPs as IVs in random-effects meta-analyses: number
of studies and CHD events included in analysis, estimate of causal effect (95%
confidence/credible interval), heterogeneity estimate (τ̂ , the between-study
standard deviation of the causal log odds ratios); pooling refers to pooling of
the genetic associations with log(CRP) across studies.
170 Mendelian Randomization
10.4 Discussion
This chapter has illustrated methods for the synthesis of Mendelian random-
ization data comprising a variety of study designs and measuring a variety
of genetic variants. Studies with differing design can be analysed separately
and then combined in a study-level meta-analysis, or alternatively analysed
together in an individual-level meta-analysis using a hierarchical model.
175
176 Mendelian Randomization
estimate using this variant, expressed as the causal effect on the outcome of a
0.4 kg/m2 change in BMI, would approximate the average effect of increasing
the BMI of every individual in the population by 0.4 kg/m2 .
If the exposure–outcome relationship is not monotone (for example, it is
J- or U-shaped), then the true change in the outcome for a given change in
the exposure may be in different directions for various members of the pop-
ulation; but the IV estimate is of the average change in the outcome across
the population [Angrist et al., 2000]. Hence, standard IV methods can still be
used to test for the presence of a causal effect even if the exposure–outcome
relationship is non-linear, and the estimated parameter has a natural inter-
pretation, but any single effect estimate will not tell the whole story of the
causal relationship.
If the shape of the exposure–outcome causal relationship is of interest,
local IV estimates can be obtained within strata of the exposure, such as
deciles or quintiles. By plotting these estimates against the average level of
the exposure in the strata, the shape of the causal relationship can be assessed
graphically. However, if the exposure is stratified on directly, misleading re-
sults may be obtained. This is because the exposure lies on the causal pathway
between the IV and the outcome, and so conditioning on the exposure induces
an association between the IV and confounders. This can be circumvented by
initially subtracting the effect of the IV on the exposure from the exposure
measurement, to obtain the ‘IV-free exposure’. This quantity, representing
the expected value of the exposure for an individual if their IV took the value
zero, can then be safely conditioned on. For the approach to be valid, it is
necessary for the average genetic association with the exposure in the popu-
lation to remain constant at different levels of the exposure [Burgess et al.,
2014b]. Further methodological work is required to assess the robustness of
this approach to violation of this assumption, as well as to stratifying directly
on the exposure in situations where calculating the IV-free exposure may be
problematic, such as if the exposure takes discrete values or has a natural
maximum or minimum value.
2008]. More generally, mediation analysis can suggest the pathway by which
a genetic variant is associated with the outcome, and hence be informative
about causal mechanisms linking an exposure measure to the outcome.
There has been considerable recent research on mediation analysis, in-
cluding technical definitions of direct and indirect effects [Pearl, 2001], and
investigations into the assumptions necessary for valid estimation of these
effects, in particular relating to unmeasured confounding [VanderWeele and
Vansteelandt, 2009]. When the genetic variant can be assumed to be ran-
domly assigned, as in Mendelian randomization for a valid genetic instrumen-
tal variable, the “no unmeasured confounding” assumptions relating to the
associations between the genetic variant and the exposure and between the
genetic variant and the outcome are automatically satisfied; however addi-
tional assumptions such as no unmeasured confounding between the mediator
and exposure and no post-treatment confounding are still required [Emsley
et al., 2010].
11.3 Conclusion
In conclusion, there are still areas of ongoing methodological research in
Mendelian randomization, and work is needed to translate existing and future
methodological developments into the context of Mendelian randomization for
applied researchers. This is fueled to a large extent by increasing data avail-
ability: new exposure variables, increasing detail of genetic measurements, and
publicly-available data resources. These are likely to provide further insights
into causal mechanisms, and further scope for methodological and applied
developments in the future.
Bibliography
Allin, K., Nordestgaard, B., Zacho, J., Tybjærg-Hansen, A., and Bojesen, S.
2010. C-reactive protein and the risk of cancer: a Mendelian randomization
study. Journal of the National Cancer Institute, 102(3):202–206. (Cited on
page 11.)
Almon, R., Álvarez-Leon, E., Engfeldt, P., Serra-Majem, L., Magnuson, A.,
and Nilsson, T. 2010. Associations between lactase persistence and the
metabolic syndrome in a cross-sectional study in the Canary Islands. Eu-
ropean Journal of Nutrition, 49(3):141–146. (Cited on page 11.)
Anderson, T. and Rubin, H. 1949. Estimators of the parameters of a single
equation in a complete set of stochastic equations. Annals of Mathematical
Statistics, 21(1):570–582. (Cited on pages 54 and 108.)
Angrist, J., Graddy, K., and Imbens, G. 2000. The interpretation of instru-
mental variables estimators in simultaneous equations models with an appli-
cation to the demand for fish. Review of Economic Studies, 67(3):499–527.
(Cited on pages 48, 56, and 177.)
Angrist, J., Imbens, G., and Rubin, D. 1996. Identification of causal effects us-
ing instrumental variables. Journal of the American Statistical Association,
91(434):444–455. (Cited on pages 38, 41, and 176.)
Angrist, J. and Krueger, A. 1992. The effect of age at school entry on ed-
ucational attainment: An application of instrumental variables with mo-
ments from two samples. Journal of the American Statistical Association,
87(418):328–336. (Cited on page 146.)
Angrist, J. and Pischke, J. 2009. Mostly harmless econometrics: an empiri-
cist’s companion. Chapter 4: Instrumental variables in action: sometimes
you get what you need. Princeton University Press. (Cited on pages 56, 57,
58, 61, 67, and 108.)
Basmann, R. 1960. On finite sample distributions of generalized classical linear
identifiability test statistics. Journal of the American Statistical Associa-
tion, 55(292):650–659. (Cited on page 68.)
Baum, C., Schaffer, M., and Stillman, S. 2003. Instrumental variables and
GMM: Estimation and testing. Stata Journal, 3(1):1–31. (Cited on pages 67,
68, and 69.)
185
186 Mendelian Randomization
Baum, C., Schaffer, M., and Stillman, S. 2007. Enhanced routines for instru-
mental variables/generalized method of moments estimation and testing.
Stata Journal, 7(4):465–506. (Cited on page 67.)
Bech, B., Autrup, H., Nohr, E., Henriksen, T., and Olsen, J. 2006. Stillbirth
and slow metabolizers of caffeine: comparison by genotypes. International
Journal of Epidemiology, 35(4):948–953. (Cited on page 11.)
Beer, N., Tribble, N., McCulloch, L., et al. 2009. The P446L variant in GCKR
associated with fasting plasma glucose and triglyceride levels exerts its effect
through increased glucokinase activity in liver. Human Molecular Genetics,
18(21):4081–4088. (Cited on page 95.)
Bekker, P. 1994. Alternative approximations to the distributions of instrumen-
tal variable estimators. Econometrica: Journal of the Econometric Society,
62(3):657–681. (Cited on pages 61 and 108.)
Beral, V., Banks, E., Bull D., Reeves, G. (Million Women Study Collabora-
tors) 2003. Breast cancer and hormone-replacement therapy in the Million
Women Study. The Lancet, 362(9382):419–427. (Cited on page 4.)
Bochud, M., Chiolero, A., Elston, R., and Paccaud, F. 2008. A cautionary note
on the use of Mendelian randomization to infer causation in observational
epidemiology. International Journal of Epidemiology, 37(2):414–416. (Cited
on page 32.)
Bochud, M. and Rousson, V. 2010. Usefulness of Mendelian randomization
in observational epidemiology. International Journal of Environmental Re-
search and Public Health, 7(3):711–728. (Cited on page 9.)
Borenstein, M., Hedges, L., Higgins, J., and Rothstein, H. 2009. Introduc-
tion to meta-analysis. Chapter 34: Generality of the basic inverse-variance
method. Wiley. (Cited on page 140.)
Bound, J., Jaeger, D., and Baker, R. 1995. Problems with instrumental vari-
ables estimation when the correlation between the instruments and the en-
dogenous explanatory variable is weak. Journal of the American Statistical
Association, 90(430):443–450. (Cited on pages 99 and 104.)
Bowden, J. and Vansteelandt, S. 2011. Mendelian randomisation analysis
of case-control data using structural mean models. Statistics in Medicine,
30(6):678–694. (Cited on pages 51 and 159.)
Brennan, P., McKay, J., Moore, L., et al. 2009. Obesity and cancer: Mendelian
randomization approach utilizing the FTO genotype. International Journal
of Epidemiology, 38(4):971–975. (Cited on page 89.)
Brion, M.-J., Shakhbazov, K., and Visscher, P. 2013. Calculating statistical
power in Mendelian randomization studies. International Journal of Epi-
demiology, 42(5):1497–1501. (Cited on page 128.)
Bibliography 187
Casas, J., Bautista, L., Smeeth, L., Sharma, P., and Hingorani, A. 2005. Ho-
mocysteine and stroke: evidence on a causal link from Mendelian randomi-
sation. The Lancet, 365(9455):224–232. (Cited on page 11.)
CCGC (CRP CHD Genetics Collaboration) 2008. Collaborative pooled analy-
sis of data on C-reactive protein gene variants and coronary disease: judging
causality by Mendelian randomisation. European Journal of Epidemiology,
23(8):531–540. (Cited on pages 9, 117, 157, and 161.)
CCGC (CRP CHD Genetics Collaboration) 2011. Association between C reac-
tive protein and coronary heart disease: Mendelian randomisation analysis
based on individual participant data. British Medical Journal, 342:d548.
(Cited on pages 88, 116, 159, and 162.)
Chaussé, P. 2010. Computing generalized method of moments and generalized
empirical likelihood with R. Journal of Statistical Software, 34(11):1–35.
(Cited on page 72.)
Chen, L., Davey Smith, G., Harbord, R., and Lewis, S. 2008. Alcohol in-
take and blood pressure: a systematic review implementing a Mendelian
randomization approach. PLoS Medicine, 5(3):e52. (Cited on page 11.)
Cheung, B., Lauder, I., Lau, C., and Kumana, C. 2004. Meta-analysis of large
randomized controlled trials to evaluate the impact of statins on cardiovas-
cular outcomes. British Journal of Clinical Pharmacology, 57(5):640–651.
(Cited on page 91.)
Christenfeld, N., Sloan, R., Carroll, D., and Greenland, S. 2004. Risk factors,
confounding, and the illusion of statistical control. Psychosomatic Medicine,
66(6):868–875. (Cited on page 16.)
Clarke, P., Palmer, T., and Windmeijer, F. 2011. Estimating structural mean
models with multiple instrumental variables using the generalised method of
moments. The Centre for Market and Public Organisation 11/266, Centre
for Market and Public Organisation, University of Bristol, UK. (Cited on
page 72.)
Clarke, P. and Windmeijer, F. 2010. Instrumental variable estimators for bi-
nary outcomes. The Centre for Market and Public Organisation 10/239,
Centre for Market and Public Organisation, University of Bristol, UK.
(Cited on pages 38, 63, 65, and 66.)
Clarke, R., Peden, J., Hopewell, J., et al. 2009. Genetic variants associated
with Lp(a) lipoprotein level and coronary disease. New England Journal of
Medicine, 361(26):2518–2528. (Cited on pages 11 and 81.)
Cohen, J., Boerwinkle, E., Mosley Jr, T., and Hobbs, H. 2006. Sequence vari-
ations in PCSK9, low LDL, and protection against coronary heart disease.
New England Journal of Medicine, 354(12):1264–1272. (Cited on page 95.)
190 Mendelian Randomization
Davey Smith, G. 2011. Use of genetic markers and gene-diet interactions for
interrogating population-level causal influences of diet on health. Genes &
Nutrition, 6(1):27–43. (Cited on pages 17, 35, and 89.)
Davey Smith, G. and Ebrahim, S. 2003. ‘Mendelian randomization’: can ge-
netic epidemiology contribute to understanding environmental determinants
of disease? International Journal of Epidemiology, 32(1):1–22. (Cited on
pages 4, 5, and 20.)
Davey Smith, G. and Ebrahim, S. 2004. Mendelian randomization:
prospects, potentials, and limitations. International Journal of Epidemi-
ology, 33(1):30–42. (Cited on pages 23 and 123.)
Davey Smith, G., Lawlor, D., Harbord, R., Timpson, N., Day, I., and Ebrahim,
S. 2007. Clustered environments and randomized genes: a fundamental
distinction between conventional and genetic epidemiology. PLoS Medicine,
4(12):e352. (Cited on page 18.)
Davidson, R. and MacKinnon, J. 1993. Estimation and inference in economet-
rics. Chapter 18: Simultaneous equation models. Oxford University Press.
(Cited on page 61.)
Bibliography 191
Fox, J. 2006. Teacher’s Corner: Structural Equation Modeling with the sem
Package in R. Structural Equation Modeling: A Multidisciplinary Journal,
13(3):465–486. (Cited on page 72.)
Freeman, G., Cowling, B., and Schooling, M. 2013. Power and sample size
calculations for Mendelian randomization studies. International Journal of
Epidemiology, 42(4):1157–1163. (Cited on pages 127 and 131.)
Fried, L., Borhani, N., Enright, P., et al. 1991. The Cardiovascular Health
Study: Design and rationale. Annals of Epidemiology, 1(3):263–276. (Cited
on page 164.)
Frost, C. and Thompson, S. 2000. Correcting for regression dilution bias:
comparison of methods for a single predictor variable. Journal of the Royal
Statistical Society: Series A (Statistics in Society), 163(2):173–189. (Cited
on page 19.)
Geiger, D., Verma, T., and Pearl, J. 1990. Identifying independence in
Bayesian networks. Networks, 20(5):507–534. (Cited on page 27.)
Gidding, S., Daniels, S., Kavey, R., and Expert Panel on Cardiovascular Health
and Risk Reduction in Youth 2012. Developing the 2011 integrated pediatric
guidelines for cardiovascular risk reduction. Pediatrics, 129(5):e1311–e1319.
(Cited on page 84.)
Glymour, M., Tchetgen Tchetgen, E., and Robins, J. 2012. Credible Mendelian
randomization studies: approaches for evaluating the instrumental variable
assumptions. American Journal of Epidemiology, 175(4):332–339. (Cited
on pages 34, 35, 68, and 95.)
Grassi, M., Assanelli, D., and Pezzini, A. 2007. Direct, reverse or reciprocal
causation in the relation between homocysteine and ischemic heart disease.
Thrombosis Research, 120(1):61–69. (Cited on page 179.)
Greenland, S. 1987. Interpretation and choice of effect measures in epidemio-
logic analyses. American Journal of Epidemiology, 125(5):761–768. (Cited
on page 42.)
Greenland, S. 2000a. An introduction to instrumental variables for epidemi-
ologists. International Journal of Epidemiology, 29(4):722–729. (Cited on
page 14.)
Greenland, S. 2000b. Causal analysis in the health sciences. Journal of the
American Statistical Association, 95(449):286–289. (Cited on page 26.)
Greenland, S., Lanes, S., and Jara, M. 2008. Estimating effects from random-
ized trials with discontinuations: the need for intent-to-treat design and
G-estimation. Clinical Trials, 5(1):5–13. (Cited on page 65.)
194 Mendelian Randomization
Lawlor, D., Harbord, R., Sterne, J., Timpson, N., and Davey Smith, G. 2008.
Mendelian randomization: using genes as instruments for making causal
inferences in epidemiology. Statistics in Medicine, 27(8):1133–1163. (Cited
on pages 20, 21, 52, and 67.)
Le Marchand, L., Derby, K., Murphy, S., et al. 2008. Smokers with the
CHRNA lung cancer–associated variants are exposed to higher levels of
nicotine equivalents and a carcinogenic tobacco-specific nitrosamine. Can-
cer Research, 68(22):9137–9140. (Cited on page 179.)
Lewis, S. and Davey Smith, G. 2005. Alcohol, ALDH2, and esophageal can-
cer: a meta-analysis which illustrates the potentials and limitations of a
Mendelian randomization approach. Cancer Epidemiology Biomarkers &
Prevention, 14(8):1967–1971. (Cited on page 23.)
Little, R. and Rubin, D. 2002. Statistical analysis with missing data (2nd
edition). Wiley. (Cited on page 134.)
Lunn, D., Whittaker, J., and Best, N. 2006. A Bayesian toolkit for genetic asso-
ciation studies. Genetic Epidemiology, 30(3):231–247. (Cited on page 135.)
Maldonado, G. and Greenland, S. 2002. Estimating causal effects. Interna-
tional Journal of Epidemiology, 31(2):422–429. (Cited on page 26.)
Manolio, T. 2010. Genomewide association studies and assessment of the risk
of disease. New England Journal of Medicine, 363(2):166–176. (Cited on
page 181.)
Martens, E., Pestman, W., de Boer, A., Belitser, S., and Klungel, O. 2006. In-
strumental variables: application and limitations. Epidemiology, 17(3):260–
267. (Cited on pages 16 and 93.)
McPherson, J., et al. (The International Human Genome Mapping Consor-
tium) 2001. A physical map of the human genome. Nature, 409(6822):934–
941. (Cited on page 5.)
Mendel, G. 1866. Versuche über Pflanzen-hybriden. Verhandlungen des natur-
forschenden Vereines in Brünn [Proceedings of the Natural History Society
of Brünn], 4:3–47. (Cited on page 5.)
Mikusheva, A. 2010. Robust confidence sets in the presence of weak in-
struments. Journal of Econometrics, 157(2):236–247. (Cited on pages 54
and 108.)
Mikusheva, A. and Poi, B. 2006. Tests and confidence sets with correct size
when instruments are potentially weak. Stata Journal, 6(3):335–347. (Cited
on pages 54 and 108.)
198 Mendelian Randomization
Minelli, C., Thompson, J., Tobin, M., and Abrams, K. 2004. An integrated
approach to the meta-analysis of genetic association studies using Mendelian
randomization. American Journal of Epidemiology, 160(5):445–452. (Cited
on pages 51, 52, and 140.)
Mogstad, M. and Wiswall, M. 2010. Linearity in instrumental variables esti-
mation: Problems and solutions. Technical report, Forschungsinstitut zur
Zukunft der Arbeit. Bonn, Germany. (Cited on pages 42 and 176.)
Moreira, M. 2003. A conditional likelihood ratio test for structural models.
Econometrica, 71(4):1027–1048. (Cited on pages 54 and 108.)
Moreira, M., Porter, J., and Suarez, G. 2009. Bootstrap validity for the score
test when instruments may be weak. Journal of Econometrics, 149(1):52–
64. (Cited on page 54.)
Morris, A., Voight, B., Teslovich, T., et al. 2012. Large-scale association
analysis provides insights into the genetic architecture and pathophysiology
of type 2 diabetes. Nature Genetics, 44(9):981–990. (Cited on page 146.)
Mumby, H., Elks, C., Li, S., et al. 2011. Mendelian randomisation study of
childhood BMI and early menarche. Journal of Obesity, available online
before print. (Cited on page 11.)
Nagelkerke, N., Fidler, V., Bernsen, R., and Borgdorff, M. 2000. Estimat-
ing treatment effects in randomized clinical trials in the presence of non-
compliance. Statistics in Medicine, 19(14):1849–1864. (Cited on page 60.)
Nelson, C. and Startz, R. 1990. The distribution of the instrumental variables
estimator and its t-ratio when the instrument is a poor one. Journal of
Business, 63(1):125–140. (Cited on pages 99 and 126.)
Nitsch, D., Molokhia, M., Smeeth, L., DeStavola, B., Whittaker, J., and Leon,
D. 2006. Limits to causal inference based on Mendelian randomization: a
comparison with randomized controlled trials. American Journal of Epi-
demiology, 163(5):397–403. (Cited on page 16.)
Norton, E. and Han, E. 2008. Genetic information, obesity, and labor market
outcomes. Health Economics, 17(9):1089–1104. (Cited on page 11.)
Ogbuanu, I., Zhang, H., and Karmaus, W. 2009. Can we apply the Mendelian
randomization methodology without considering epigenetic effects? Emerg-
ing Themes in Epidemiology, 6(1):3. (Cited on page 32.)
Palmer, T., Lawlor, D., Harbord, R., et al. 2011a. Using multiple genetic
variants as instrumental variables for modifiable risk factors. Statistical
Methods in Medical Research, 21(3):223–242. (Cited on pages 114 and 123.)
Bibliography 199
Palmer, T., Sterne, J., Harbord, R., et al. 2011b. Instrumental variable estima-
tion of causal risk ratios and causal odds ratios in Mendelian randomization
analyses. American Journal of Epidemiology, 173(12):1392–1403. (Cited on
pages 64 and 71.)
Palmer, T., Thompson, J., Tobin, M., Sheehan, N., and Burton, P. 2008.
Adjusting for bias and unmeasured confounding in Mendelian randomiza-
tion studies with binary responses. International Journal of Epidemiology,
37(5):1161–1168. (Cited on pages 37 and 60.)
Pauling, L., Itano, H., Singer, S., and Wells, I. 1949. Sickle cell anemia, a
molecular disease. Science, 110(2865):543–548. (Cited on page 5.)
Pearl, J. 2000a. Causality: models, reasoning, and inference. Cambridge Uni-
versity Press. (Cited on page 25.)
Pearl, J. 2000b. Causality: models, reasoning, and inference. Chapter 3, Sec-
tion 3.1: The back-door criterion. Cambridge University Press. (Cited on
page 28.)
Pearl, J. 2001. Direct and indirect effects. In Proceedings of the Seventeenth
Conference on Uncertainty in Artificial Intelligence, pages 411–420. (Cited
on page 180.)
Pearl, J. 2010. An introduction to causal inference. The International Journal
of Biostatistics, 6(2):1–60. (Cited on page 26.)
Peto, R., Doll, R., Buckley, J., and Sporn, M. 1981. Can dietary beta-carotene
materially reduce human cancer rates? Nature, 290:201–208. (Cited on
page 4.)
Pierce, B., Ahsan, H., and VanderWeele, T. 2011. Power and instrument
strength requirements for Mendelian randomization studies using multi-
ple genetic variants. International Journal of Epidemiology, 40(3):740–752.
(Cited on pages 114 and 131.)
Pierce, B. and Burgess, S. 2013. Efficient design for Mendelian randomiza-
tion studies: subsample and two-sample instrumental variable estimators.
American Journal of Epidemiology, 178(7):1177–1184. (Cited on page 137.)
Pierce, B. and VanderWeele, T. 2012. The effect of non-differential mea-
surement error on bias, precision and power in Mendelian randomization
studies. International Journal of Epidemiology, 41(5):1383–1393. (Cited on
page 19.)
Plenge, R., Scolnick, E., and Altshuler, D. 2013. Validating therapeutic targets
through human genetics. Nature Reviews Drug Discovery, 12(8):581–594.
(Cited on page 94.)
200 Mendelian Randomization
Robinson, J., Nedergaard, B., Rogers, W., et al. 2014. Effect of evolocumab
or ezetimibe added to moderate-or high-intensity statin therapy on LDL-
C lowering in patients with hypercholesterolemia: the LAPLACE-2 ran-
domized clinical trial. Journal of the American Medical Association,
311(18):1870–1882. (Cited on page 95.)
Rossouw, J., et al. (Writing Group for the Women’s Health Initiative Investi-
gators) 2002. Risks and benefits of estrogen plus progestin in healthy post-
menopausal women: principal results from the Women’s Health Initiative
randomized controlled trial. Journal of the American Medical Association,
288(3):321–333. (Cited on page 4.)
Rothwell, P. 2010. Commentary: External validity of results of randomized
trials: disentangling a complex concept. International Journal of Epidemi-
ology, 39(1):94–96. (Cited on page 87.)
Rubin, D. 1974. Estimating causal effects of treatments in randomized and
nonrandomized studies. Journal of Educational Psychology, 66(5):688–701.
(Cited on page 30.)
Sargan, J. 1958. The estimation of economic relationships using instrumental
variables. Econometrica, 26(3):393–415. (Cited on page 68.)
Sarwar, N., Butterworth, A., Freitag, D., et al. (IL6R Genetics Consortium
and Emerging Risk Factors Collaboration) 2012. Interleukin-6 receptor
pathways in coronary heart disease: a collaborative meta-analysis of 82 stud-
ies. Lancet, 379(9822):1205–1213. (Cited on page 20.)
SAS (SAS Institute and SAS Publishing Staff) 2004. SAS/STAT 9.1 User’s
Guide. SAS Institute Inc, Cary, NC. (Cited on page 69.)
Schatzkin, A., Abnet, C., Cross, A., et al. 2009. Mendelian randomization:
how it can – and cannot – help confirm causal relations between nutrition
and cancer. Cancer Prevention Research, 2(2):104–113. (Cited on pages 24
and 89.)
Scheet, P. and Stephens, M. 2006. A fast and flexible statistical model
for large-scale population genotype data: applications to inferring missing
genotypes and haplotypic phase. American Journal of Human Genetics,
78(4):629–644. (Cited on page 134.)
Schunkert, H., König, I., Kathiresan, S., et al. 2011. Large-scale association
analysis identifies 13 new susceptibility loci for coronary artery disease.
Nature Genetics, 43(4):333–338. (Cited on page 146.)
Sheehan, N., Didelez, V., Burton, P., and Tobin, M. 2008. Mendelian randomi-
sation and causal inference in observational epidemiology. PLoS Medicine,
5(8):e177. (Cited on page 20.)
202 Mendelian Randomization
Timpson, N., Harbord, R., Davey Smith, G., Zacho, J., Tybjærg-Hansen, A.,
and Nordestgaard, B. 2009. Does greater adiposity increase blood pressure
and hypertension risk? Mendelian randomization using the FTO /MC4R
genotype. Hypertension, 54(1):84–90. (Cited on page 77.)
204 Mendelian Randomization
Timpson, N., Lawlor, D., Harbord, R., et al. 2005. C-reactive protein and its
role in metabolic syndrome: mendelian randomisation study. The Lancet,
366(9501):1954–1959. (Cited on page 11.)
Timpson, N., Nordestgaard, B., Harbord, R., et al. 2011. C-reactive protein
levels and body mass index: elucidating direction of causation through recip-
rocal Mendelian randomization. International Journal of Obesity, 35:300–
308. (Cited on page 179.)
Tobin, M., Minelli, C., Burton, P., and Thompson, J. 2004. Commentary: De-
velopment of Mendelian randomization: from hypothesis test to ‘Mendelian
deconfounding’. International Journal of Epidemiology, 33(1):26–29. (Cited
on pages 16 and 39.)
Trompet, S., Jukema, J., Katan, M., et al. 2009. Apolipoprotein E genotype,
plasma cholesterol, and cancer: a Mendelian randomization study. American
Journal of Epidemiology, 170(11):1415–1421. (Cited on page 11.)
Welsh, P., Polisecki, E., Robertson, M., et al. 2010. Unraveling the direc-
tional link between adiposity and inflammation: a bidirectional Mendelian
randomization approach. Journal of Clinical Endocrinology & Metabolism,
95(1):93–99. (Cited on page 179.)
Wooldridge, J. 2009. Introductory econometrics: A modern approach. Chap-
ter 5: Multiple regression analysis – OLS asymptotics. South-Western,
Nashville, TN. (Cited on pages 56, 67, and 127.)
Wright, P. 1928. The tariff on animal and vegetable oils. Appendix B. Macmil-
lan, New York. (Cited on page 10.)
Yang, J., Lee, S., Goddard, M., and Visscher, P. 2011. GCTA: a tool for
genome-wide complex trait analysis. The American Journal of Human Ge-
netics, 88(1):76–82. (Cited on page 182.)
Youngman, L., Keavney, B., Palmer, A., et al. 2000. Plasma fibrinogen and
fibrinogen genotypes in 4685 cases of myocardial infarction and in 6002 con-
trols: test of causality by ‘Mendelian randomization’. Circulation, 102(Suppl
II):31–32. (Cited on page 9.)
206 Mendelian Randomization
Zacho, J., Tybjaerg-Hansen, A., Jensen, J., Grande, P., Sillesen, H., and
Nordestgaard, B. 2008. Genetically elevated C-reactive protein and ischemic
vascular disease. New England Journal of Medicine, 359(18):1897–1908.
(Cited on page 100.)
Zohoori, N. and Savitz, D. 1997. Econometric approaches to epidemiologic
data: Relating endogeneity and unobserved heterogeneity to confounding.
Annals of Epidemiology, 7(4):251–257. (Cited on pages 56 and 110.)
Statistics
MENDELIAN RANDOMIZATION
MENDELIAN
Mendelian Randomization: Methods for Using Genetic Variants in Causal Es-
timation provides thorough coverage of the methods and practical elements of
Mendelian randomization analysis. It brings together diverse aspects of Mendelian
randomization spanning epidemiology, statistics, genetics, and econometrics.
RANDOMIZATION
Through several examples, the first part of the book shows how to perform simple
applied Mendelian randomization analyses and interpret their results. The second
part addresses specific methodological issues, such as weak instruments, multiple
instruments, power calculations, and meta-analysis, relevant to practical applica-
tions of Mendelian randomization. In this part, the authors draw on data from the
C-reactive protein Coronary heart disease Genetics Collaboration (CCGC) to illus-
trate the analyses. They present the mathematics in an easy-to-understand way by
using nontechnical language and reinforcing key points at the end of each chapter.
Methods for Using
The last part of the book examines the potential of Mendelian randomization in the
future, exploring both methodological and applied developments. Genetic Variants
in Causal Estimation
Features
• Offers first-hand, in-depth guidance on Mendelian randomization from
leaders in the field
• Makes the diverse aspects of Mendelian randomization understandable to
newcomers
• Illustrates the technical details using data from a large collaborative study
• Includes other real-world examples that show how Mendelian randomization
is used in studies involving inflammation, heart disease, and more
• Discusses possible future directions for research involving Mendelian
randomization
This book gives you the foundation to understand issues concerning the use of Burgess • Thompson Stephen Burgess
genetic variants as instrumental variables. It will get you up to speed in undertak-
ing and interpreting Mendelian randomization analyses. Chapter summaries, paper
summaries, web-based applications, and software code for implementing the sta- Simon G. Thompson
tistical techniques are available on a supplementary website.
K16638
w w w. c rc p r e s s . c o m