0% found this document useful (0 votes)
45 views7 pages

Survival Analysis and Interpretation Of.32

Uploaded by

vidhujoshy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views7 pages

Survival Analysis and Interpretation Of.32

Uploaded by

vidhujoshy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

General Articles

Section Editor: Thomas R. Vetter

E SPECIAL ARTICLE

Survival Analysis and Interpretation of Time-to-Event


Data: The Tortoise and the Hare
Patrick Schober, MD, PhD, MMedStat,* and Thomas R. Vetter, MD, MPH†
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI
Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

Survival analysis, or more generally, time-to-event analysis, refers to a set of methods for ana-
lyzing the length of time until the occurrence of a well-defined end point of interest. A unique
feature of survival data is that typically not all patients experience the event (eg, death) by the
end of the observation period, so the actual survival times for some patients are unknown. This
phenomenon, referred to as censoring, must be accounted for in the analysis to allow for valid
inferences. Moreover, survival times are usually skewed, limiting the usefulness of analysis
methods that assume a normal data distribution. As part of the ongoing series in Anesthesia &
Analgesia, this tutorial reviews statistical methods for the appropriate analysis of time-to-event
data, including nonparametric and semiparametric methods—specifically the Kaplan-Meier esti-
mator, log-rank test, and Cox proportional hazards model. These methods are by far the most
commonly used techniques for such data in medical literature. Illustrative examples from stud-
ies published in Anesthesia & Analgesia demonstrate how these techniques are used in prac-
tice. Full parametric models and models to deal with special circumstances, such as recurrent
events models, competing risks models, and frailty models, are briefly discussed. (Anesth
Analg 2018;127:792–8)

The surprising thing about young fools is how many survive to also often interested in whether survival times are related
become old fools. to covariates, and estimating the effect size of a specific
—Doug Larson (1926–2017), American journalist, covariate (eg, magnitude of the treatment effect) when it is
columnist, and editor
adjusted for potential confounders.

T
he occurrence of a well-defined event such as patient Furthermore, it may initially appear that such a research
mortality is often a primary outcome in medical question about the length of a time interval, which is essen-
research. This is essentially a binary outcome (the tially a continuous outcome variable, can be addressed by
event has occurred versus it has not occurred). In a previ- linear regression or related techniques like a t test or analy-
ous tutorial in this series, we described how such binary sis of variance.1,2 However, a key distinction between sur-
outcome data can be analyzed with logistic regression.1 For vival times and other continuous data is that the event of
example, one can estimate the relationship between one or interest (eg, death) will usually have occurred only in some
more covariates, also referred to as independent variables but not in all patients by the time the study ends.
or predictor variables (eg, treatments or prognostic factors) For patients who survive until the end of the study period,
and the odds of experiencing the outcome within a specific or who are lost to follow-up before the end of the observa-
time frame (eg, mortality within 30 days postoperatively). tion period, full survival times are unknown. Instead all that
However, logistic regression analysis is not appropriate is known is that the survival time is greater than the observa-
when the research question involves the length of time until tion time. This unique feature of survival data is referred to
the end point occurs—for example, estimating median sur-
as right censoring, which is described in more detail below.3
vival times, plotting survival over time after treatment, or
Ignoring censored patients in the analysis, or simply equat-
estimating the probability of surviving beyond a prespeci-
ing their observed survival time (follow-up time) with the
fied time interval (eg, 5-year survival rate). Researchers are
unobserved total survival time, would bias the results. Even if
there was no censoring in the data set, survival times usually
From the *Department of Anesthesiology, VU University Medical Center,
Amsterdam, the Netherlands; and †Department of Surgery and Perioperative have a heavily skewed distribution, limiting the usefulness of
Care, Dell Medical School at the University of Texas at Austin, Austin, Texas. statistical tests that assume a normal data distribution.3
Accepted for publication June 8, 2018. Analyzing survival data is unique in that the research
Funding: None. interest is typically a combination of whether the event has
The authors declare no conflicts of interest. occurred (binary outcome) and when it has occurred (contin-
Reprints will not be available from the authors. uous outcome). Appropriate analysis of survival data requires
Address correspondence to Patrick Schober, MD, PhD, MMedStat, Depart- specific statistical methods that can deal with censored data.
ment of Anesthesiology, VU University Medical Center, De Boelelaan 1117,
1081 HV, Amsterdam, the Netherlands. Address e-mail to [email protected]. As the assessed outcome is frequently mortality, these tech-
Copyright © 2018 The Author(s). Published by Wolters Kluwer Health, Inc. niques are subsumed under the term survival analysis.
on behalf of the International Anesthesia Research Society. This is an open- More generally, however, these techniques can be used
access article distributed under the terms of the Creative Commons Attribu-
tion-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND), where it for the analysis of the time until any event of interest occurs
is permissible to download and share the work provided it is properly cited. (eg, recurrence of a disease; initial, breakthrough postopera-
The work cannot be changed in any way or used commercially without per-
mission from the journal. tive pain; or failure of an implanted medical device), and
DOI: 10.1213/ANE.0000000000003653 such data can thus also be called time-to-event or failure

792 www.anesthesia-analgesia.org September 2018 • Volume 127 • Number 3


Survival Analysis

time data.4,5 In this tutorial, we use the terms survival time, disadvantage that they do not occur instantaneously, making
time-to-event, and failure time synonymously. it difficult to specify the exact time point of occurrence. In such
As part of the ongoing series in Anesthesia & Analgesia, settings, the clearest description of the outcome is often “time-
this basic tutorial reviews statistical methods that are appro- to-detection” rather than “time-to-event.” Whatever the event
priate for survival data. We focus on the most common tech- of interest, a clear and unambiguous definition is essential.
niques, which are the Kaplan-Meier estimator, log-rank test, The total length of follow-up and follow-up intervals should
and the Cox proportional hazards (PH) model. The Table be sensibly chosen to ensure that a sufficient number of events
provides a summary of key-related terms. An extensive dis- are observed (see Power and Sample Size Considerations sec-
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI

cussion of full parametric techniques and the special circum- tion) and that the timing of occurrence can be determined,
Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

stances that call for other techniques is beyond the scope of to avoid interval censoring (as detailed below). Conversely,
this basic tutorial, and we thus intentionally provide only long-term observational studies carry the risk that factors that
precursory coverage of these more advanced aspects. influence survival time, other than the treatment or factor
under investigation, may also change during the study period.
CONCEPTS AND TERMINOLOGY IN SURVIVAL Patients recruited to the study early should ideally have the
ANALYSIS same risk of event occurrence as patients recruited late.3
General Considerations As the failure time is the time between some starting point
The event of interest should be clinically relevant, well (origin) and the event, not only the event but also the time of
defined, unambiguous, and preferably easily observable. origin needs to be clearly specified. Ideally, the time of ori-
While the patient death seems to be such an unambiguous end gin should also be sensibly chosen, so all individuals are as
point, misclassification is possible when specific-cause mor- much as possible on a par.6 In a study comparing therapeu-
tality, rather than all-cause mortality, is the outcome of inter- tic interventions on a survival outcome, the starting point
est.3 Some end points, such as recurrence of cancer, have the is typically the time when the intervention is administered.

Table. Basic Terminology and Definitions


Term Definition
Censoring Incomplete observation of survival times; subjects with an incomplete observation are referred to as censored.
Left censoring The patient is known to have experienced the event before the start of the observation period, so the actual time-to-
event is shorter than the interval between the origin and start of observation, but it is unknown by how much.
Right censoring Observation of the patient is terminated before the event occurs, so the actual time-to-event, if it were to occur, is
longer than the observation time, but it is unknown by how much.
Cox regression A semiparametric technique that is commonly used for survival data analysis that allows one to simultaneously assess
the association between multiple covariates and survival, also termed Cox proportional hazards model.
Delayed entry Synonym for left truncation.
Failure rate Synonym for hazard rate.
Failure time Time until an event occurs (interval between origin and event); synonym for survival time or time-to-event.
Hazard function Describes the instantaneous rate of occurrence of the event over time.
Hazard rate Rate of occurrence of the event during some time interval; often used to denote the instantaneous rate in an
infinitesimally small time interval.
Hazard ratio Ratio of 2 hazard rates (eg, for 2 different treatment groups).
Kaplan-Meier curve Graphical representation of the survival function with probabilities estimated by the Kaplan-Meier method.
Kaplan-Meier method Nonparametric method to estimate survival probabilities over time.
Lead time The time between the early diagnosis of a specific disease with screening and the time in which diagnosis would have
been made without screening.
Lead-time bias Lead time increases the observed survival time even if there was no effect of screening on absolute survival time;
hence, it biases estimates of survival time.
Log-rank test Nonparametric hypothesis test to compare 2 or more survival functions.
Nonparametric method Statistical method that neither imposes assumptions on the distribution of the outcome variable (a specific shape of
the survival function or hazard function) nor assumes a functional form of the covariates.
Origin Starting point in the timeline of a survival time, which is often the time point when a disease or condition is diagnosed,
or the time point of a specific event (eg, birth or operation).
Parametric method Statistical method that assumes a specific distribution of the outcome variable, including the relationship between
covariates and survival.
Proportional hazards Used to model the hazard function, assuming that the effect of a covariate is the same at all time points; such models
model can be either semiparametric (Cox regression model) or parametric.
Semiparametric method Statistical method that does not assume a specific distribution of the outcome variable (the survival function or hazard
function), but does assume a specific relationship between covariates and the outcome.
Survival data Umbrella term covering data that describe the time until any well-defined event (not necessarily death) occurs.
Survival (survivor) function Describes the probability over time that the event of interest has not yet occurred.
Survival time Time until an event occurs (interval between origin and event, which does not necessarily have to be the death of a
subject), synonym for failure time, or time-to-event.
Time-to-event Time until an event occurs (interval between origin and event), synonym for survival time or failure time.
Truncation Subject selection depending on whether or not the event has occurred.
Left truncation Selective inclusion of patients in whom the event has not occurred early; patients who have already experienced the
event before the time point of patient identification are not identified and often may not even be known to exist.
Right truncation Selective inclusion of patients in whom the event has occurred (eg, due to selection of patients from a death registry).

September 2018 • Volume 127 • Number 3 www.anesthesia-analgesia.org 793


EE SPECIAL ARTICLE

In epidemiologic or screening studies, the origin is often Left censoring occurs when a subject is known to have
when a condition or disease is diagnosed. However, this can had the event before the start of the observation, but the
lead to biased estimates of survival times, especially when exact time of the event is unknown. This contrasts with
the study intervention does not only presumably affect the left truncation, where the patient is often not even known
event but also the time of origin. This so-called lead-time to exist.12 Similarly, interval censoring is where it is only
bias is common in studying whether a screening program known that the event occurred between 2 time points, but
for a specific disease increases survival time. Observed pro- again, the exact time is unknown. Left and interval censored
longed survival in screened patients could be merely due data are less common and usually do not exist when death
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI

to the disease being diagnosed earlier, and not necessarily is the outcome of interest. Statistical techniques to deal with
Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

reflect a benefit of screening on absolute survival time.7 left and interval censored data are available; however, they
are infrequently used and will not be covered in this basic
Truncation tutorial.
As in any clinical study, the target population, as well as
inclusion and exclusion criteria, must be clearly defined. A Survival (Survivor) Function, Hazard Rate,
unique feature of survival data is truncation, which results Hazard Function, and Hazard Ratio
from selection bias and refers to subject selection depend- The survival (or survivor) function and the hazard func-
ing on whether or not the event has occurred.8 Subjects may tion are fundamental to survival analysis. The survival
only be identified for observation at some time point after function describes the probability of surviving past a spec-
their respective time of origin.9 ified time point, or more generally, the probability that the
Patients who have already experienced the event before event of interest has not yet occurred by this time point
the time point of patient identification may not be identi- (Figure 1).13
fied—for example, because they have already died, these A hazard rate (or failure rate) is the rate of occurrence of
patients may not be known to exist.8 In this situation, only the event during a given time interval.10 The hazard func-
those patients who have not experienced the event will tion describes the instantaneous rate of occurrence over
selectively enter the study, which is referred to as left trun- time, which can conceptually be viewed as the hazard rate
cation or delayed entry. during an infinitesimally small time interval. The hazard
Alternatively, right truncation occurs when patients who and survival functions are closely related and can easily be
experience the event are selectively included6—for exam- converted to each other.3 When the hazard rate is high, sur-
ple, when patients are obtained from a death registry, and vival declines rapidly and vice versa.
hence, survivors are not selected for the study. While the While it is not necessary to understand the hazard
bias caused by truncation can be partially addressed during function in detail, it is the basis of PH models, which are
the analysis, it is often preferable be prevent it at the design extensively used to model survival data. Importantly, the
phase of the study. exponentiated parameter estimates of these models can be
interpreted as a hazard ratio (HR), which is an estimate of
Censoring the ratio of the hazard rates between 2 groups (eg, treatment
Censoring refers to incompletely observed survival times and versus control).14
is inherent with most survival data. The situation described The HR is similar to the risk ratio (relative risk), with a
above in which not all the patients experience the event until value higher or lower than 1 indicating a higher or lower
completion of the study is referred to as right censoring. hazard rate, respectively, than the comparison group. While
Visualizing the timeline of a patient’s observed survival time, the HR is technically not the same as the risk ratio, it is often
the unobserved event, if it were to occur, would lie beyond conveniently interpreted as such in the literature.15
the right side of the time point at which the patient is cen-
sored. Right censoring is the most common type of censor- General Overview of Methods to Analyze
ing in survival studies, and the statistical methods described Survival Data
below are well suited to deal with this type of censoring. In analyzing survival data, 3 common classes of methods
Basically, censored patients are: (1) included in estimates of are broadly distinguished:
survival probabilities at time points preceding their censor-
ing time point; and (2) excluded from the analysis thereafter.10 1. Nonparametric methods, which neither impose
Unbiased inferences require the censoring to be nonin- assumptions on the distribution of survival times
formative, with the time of censoring absolutely not related (a specific shape of the survival function or hazard
to the event time.3 Informative censoring would occur when function) nor assume a specific relationship between
patients are censored due to a medical condition that is covariates and the survival time. This class includes
related to the future risk of the event (eg, inability to show the Kaplan-Meier estimator and log-rank test.
up for a clinic visit due to severe illness and thus loss to 2. Semiparametric methods also make no assumptions
follow-up). Unfortunately, this problem is neither easy to regarding the distribution of survival times but do
detect nor is there an ideal solution—other than conducting assume a specific relationship between covariates
the study in a way that promotes complete follow-up and and the hazard function—and hence, the survival
avoids informative censoring.11 If possible, data on the rea- time. The widely used Cox PH model is a semipara-
sons for loss-to-follow-up should be collected because such metric method.
information can be used in sensitivity analyses to assess for 3. Parametric methods assume a distribution of the sur-
potential bias. vival times and a functional form of the covariates.

794   
www.anesthesia-analgesia.org ANESTHESIA & ANALGESIA
Survival Analysis

Figure 1. Survival (survivor) function estimated


by the Kaplan-Meier method, including 95% con-
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI

fidence bands. Censoring is indicated by vertical


Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

marks (at 5 and 21 d). The number of patients at


risk at different time points is displayed on the
graph. The point on the x-axis where the horizon-
tal dashed line at a survival probability of .5 inter-
sects the curve represents the estimated median
survival time (17 d).

KAPLAN-MEIER ESTIMATOR When reporting a log-rank P value comparing Kaplan-


The Kaplan-Meier method estimates the unadjusted Meier curves, researchers should make clear that the entire
probability of surviving beyond a certain time point.16 A distribution is being tested, and not a particular time such
Kaplan-Meier curve shows the estimated survival function as 5-year survival. The log-rank test is based on the same
by plotting estimated survival probabilities against time assumptions as the Kaplan-Meier survival curve, and
(Figure 1).3 The estimated survival probability is constant makes no explicit assumptions about the distribution of the
between the events. Therefore, the curve is a step-function survival curves.
in which each vertical drop indicates the occurrence of one However, when the survival curves of different groups
or more events.17 Right censoring of patients is typically cross—indicating that 1 group has a more favorable sur-
indicated by a vertical mark at the censoring time or other vival in a certain time interval and less favorable survival in
symbols like an asterisk.6 another time interval—the power to detect such differences
CIs for the survival probabilities can be readily calcu- is very low.19 Moreover, the log-rank test cannot adjust for
lated, and confidence bands can be plotted around the sur- other covariates that might affect survival time. While it
vival function (Figure 1).3 These CIs provide an estimate can determine whether observed differences are significant,
of the range of plausible values of the survival probability it cannot provide an estimate of the difference between
in the population from which the patients were sampled.18 groups.14 Other techniques, described below, can be used to
Often, several Kaplan-Meier curves of different groups address these issues.
(eg, treatment groups or prognostic factors) are plotted
together in 1 graph, allowing for a visual comparison of the COX PH MODEL
survival probabilities (Figure 2). The Cox PH model is the most commonly used survival
The median survival time, which is the time by when data analysis technique that simultaneously allows one
the event has occurred in 50% of the patients or study sub- to include and to assess the effect of multiple covariates.14
jects, is a commonly reported summary statistic for survival These model covariates can include the variables of spe-
time data. This median survival time can be conveniently cific research interest (treatment groups), as well as poten-
estimated from the Kaplan-Meier curve as the x-axis (time) tial confounders for which the researcher wants to control
value at the point where an (imaginary) horizontal line at (demographic and other clinical factors). Multiple strategies
the 50% survival probability on the y-axis crosses the sur- for covariate selection have been described, and the aim of
vival curve (Figure 1).10 Additionally or alternatively, the the study—most often to determine the effect of a covari-
survival probability at appropriate time points (eg, at 1 and ate while controlling for confounding versus prediction of
5 years) can be reported. survival using a set of predictor variables—should be con-
sidered in choosing a strategy.11,20,21
LOG-RANK TEST Cox PH regression actually does not directly model sur-
Analogous to comparing groups of continuous data using vival probabilities or survival times, but the hazard func-
a t test or analysis of variance, the survival curves for 2 or tion.22 Herein, it is assumed that all patients have a common
more different groups (eg, treatments or prognostic fac- baseline hazard function that only depends on time. Each
tors) can also be compared with hypothesis testing. Most subject’s individual hazard function is a multiple of this
commonly, the log-rank test is applied, which tests the null common baseline hazard, and the individual multiplicator
hypothesis that there is no difference in the probability of an is a constant, determined by a time-independent function of
event at any time point.19 a patient’s individual covariate values.22 This implies that

September 2018 • Volume 127 • Number 3 www.anesthesia-analgesia.org 795


EE SPECIAL ARTICLE

methods to check this and other assumptions have been


suggested, and we refer to previous literature on the topic
for a detailed overview.21,25
The Cox PH model assumes the covariates to be time-
independent—in other words, the values of the variable of
each patient (eg, gender and age at time of diagnosis) do not
change over time. Extensions of the Cox model are avail-
able that allow for covariates that vary over time (eg, blood
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI

pressure recordings at follow-up time points).11 However,


Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

such models should be used carefully, as they are difficult to


interpret, prone to misspecifications, and markedly increase
the potential for erroneous inferences.28

PUBLISHED REPRESENTATIVE USE OF


NONPARAMETRIC AND SEMIPARAMETRIC
TECHNIQUES
Figure 2. Kaplan-Meier curves displaying the estimated survival Nonparametric and semiparametric methods are com-
probability for 4 different groups of patients after lung cancer sur- monly used to analyze survival data in anesthesia, critical
gery. Patients either did or did not perioperatively receive flurbipro-
fen axetil (FA) and dexamethasone (DXM) (reprinted with permission care, perioperative, and pain research.29–35 We illustrate the
from Huang et al29). Each vertical step in the curve indicates one or practical use of these techniques in 3 different types of stud-
more events (ie, deaths), and right-censored patients are indicated ies recently published in Anesthesia & Analgesia.29–31
by a vertical mark in the curve at the censoring time. A visual inspec- In their retrospective cohort study, Huang et al29 sought
tion suggests that survival seems to be more favorable for patients
who received FA and DXM, compared with patients who received to identify predictors of long-term survival in patients after
none of these 2 drugs. The log-rank test indicates a significant dif- lung cancer surgery. The authors initially used multiple log-
ference between the survival curves. rank tests to identify covariates that are potentially related
to survival. Those covariates that were either considered
the ratio of the hazard rates between different patients (the clinically important or displayed a P value <.2 in their log-
HR) is assumed to be constant over time—in other words, rank test were entered in a multivariable Cox PH model.
the effect of a covariate is assumed to be the same at all time With this model, the authors identified 6 factors associated
points. This is the PH assumption of the Cox PH model,6 with either longer or shorter overall survival. For example,
which is discussed in more detail below. limited resection was associated with a higher hazard rate
Under this assumption, exponentiated regression coeffi- and hence shorter survival (HR, 1.46; 95% CI, 1.08–1.98;
cients for each covariate can be interpreted in terms of the P = .013), whereas perioperative use of dexamethasone
HR for a 1-unit change in the respective covariate value. was associated with prolonged survival (HR, 0.70; 95% CI,
This is akin to the interpretation of exponentiated regres- 0.54–0.90; P = .006).29 The authors also specifically compared
sion coefficients as odds ratios in logistic regression.23 patients who received dexamethasone and flurbiprofen
While the Cox PH model estimates regression coeffi- axetil with patients who received only one of either drugs
cients without making assumptions about the shape of the or none of the drugs. Kaplan-Meier curves were presented
hazard function, it is possible to work backward and use for each of the 4 possible combinations (Figure 2), and a
the parameter estimates to estimate the adjusted hazard log-rank test was used for an unadjusted comparison of the
or adjusted survival function. This allows the plotting of survival curves. A multivariable model adjusting for con-
adjusted curves for different groups, which are very simi- founders suggested that administration of both flurbiprofen
lar to Kaplan-Meier curves, but instead show or predict the axetil and dexamethasone was associated with prolonged
probability of survival in each group, while keeping the overall survival when compared to no use of both, with an
other covariates fixed at their mean values.24 Survival pro- adjusted HR of 0.57 (95% CI, 0.38–0.84; P = .005).29
portions can also be predicted for each arbitrary combina- In their randomized controlled trial, Wilson et al31 stud-
tion of covariate values.14 ied whether dural puncture epidural (DPE)—a technique
where the dura is punctured but medication is not admin-
Model Assumptions istered in the subarachnoid space—expedites analgesia in
Assumptions of the Cox PH model—other than assump- laboring patients compared to the conventional lumbar epi-
tions that apply to all survival analyses, such as nonin- dural (LE) technique. Adequate analgesia was defined as a
formative censoring described above—include a linear score of ≤10 mm on a 100-mm Visual Analog Scale during
relationship between the covariates and the log-hazard, as active contractions. The authors applied the Kaplan-Meier
well as the PH assumption. method to estimate median time to achieve adequate anal-
While the PH assumption is central to the Cox model, its gesia in each treatment group. A Cox PH regression model
actual importance is debated. While some authors stress the with treatment group as a sole independent variable was
importance,25,26 others de-emphasize it and take the view used to estimate the treatment effect. Median time to ade-
that the HR can still be viewed as an average effect during quate analgesia was 8 minutes (95% CI, 6–10 minutes) in
the observation period when the assumption is violated.27 the DPE group and 10 minutes (95% CI, 8–14 minutes) in
We recommend that the PH assumption should be assessed LE group. The estimated HR was 1.67 (95% CI, 1.02–2.64;
because it is sometimes possible to fix violations. Several P = .042).31 To make these results more understandable, the

796   
www.anesthesia-analgesia.org ANESTHESIA & ANALGESIA
Survival Analysis

authors followed the common (albeit not statistically cor- may not want to simply consider all-cause mortality as
rect) practice of interpreting the HR as a relative risk: “The the failure event, but would like to study the relationship
relative risk of achieving pain control at any collected time between covariates and specific-cause mortality. Or, com-
point in parturients receiving DPE was 1.7 times greater monly, researchers are interested in an event such as cancer
than in those receiving LE.”31 Of note, in the above first recurrence, but death that occurs before the event of interest
example, observational data were analyzed using the Cox is a competing risk. In this setting, the researcher can either
PH model to adjust for confounding. Here, in this random- model the time to the earliest of death or cancer recurrence
ized controlled trial, the purpose of the Cox PH model was or use special methods to model both events.
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI

to obtain an estimate of the treatment effect. Frailty models account for nonindependence of obser-
Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

Using data on patients who participated in 2 trials across vations in clustered data (for correlated failure times), by
4 clinical sites for a follow-up analysis, Podolyak et al30 stud- incorporating random effects.39 Such data may arise when
ied effects of supplemental perioperative oxygen on long- the survival times of individuals within a cluster (eg, fam-
term mortality in patients undergoing colorectal surgery. In ily or hospital) tend to be more similar to each other than
the 2 original trials, patients had randomly received either survival times of patients who belong to different clusters.
30% or 80% inspired oxygen perioperatively. The authors These models are analogous to mixed effect models for
present survival curves using Kaplan-Meier estimates and uncensored longitudinal and correlated data, as described
use a Cox PH model, stratified by study and site to allow in a recent tutorial in this series.40
for separate baseline hazards for each study and site. This
approach was (presumably) chosen as it allows for the POWER AND SAMPLE SIZE CONSIDERATIONS
estimation of an overall HR estimate and significance test The power of a method to analyze survival time data
across all study sites. No effect of 80% vs 30% inspired oxy- depends on the number of events rather the total sample
gen was observed on mortality, with an overall estimated size.21 Therefore, calculation of total sample size is a 2-step
HR of 0.93 (95% CI, 0.72–1.20; P = .57).30 process.41
First, the number of events needed to detect a minimum
PARAMETRIC MODELS clinically important effect size, like a prespecified HR, with a
Parametric models assume a specific distribution of the preselected power and alpha level is computed. Depending
survival times. Advantages of a parametric model include on the planned data analysis method, different approaches
a higher efficiency (ie, greater power),14 which can be par- for estimating the number of events have been proposed,
ticularly useful with smaller sample sizes. Furthermore, a including the Schoenfeld method for log-rank tests or PH
variety of parametric techniques can model survival times models.42
when the PH assumption is not met. Second, to calculate the total sample size, the propor-
However, it can be quite challenging to identify the most tion of patients who are expected to experience the event
appropriate data distribution, and parametric models have needs to be estimated.41 Of note, for multivariable models
the drawback of providing misleading inferences if the dis- like the Cox PH model, it has been suggested that at least
tributional assumptions are not met. In contrast, the semi- 10 events need to be observed per covariate to be included
parametric Cox model is a safe and proven method without in the model.43
the need to specify a specific data distribution,36 which is
why this model is most common in analyzing survival data. CONCLUSIONS
For a more detailed discussion on parametric models, we Survival data are unique in that the research questions
refer to previously published literature on the topic.14,36 essentially involve a combination of whether the event has
occurred in the observation period and when it has occurred.
RECURRENT EVENTS, COMPETING RISKS, AND Censoring, or the incomplete observation of failure times, is
FRAILTY MODELS common in these data, such that specific statistical methods
The previously described techniques are useful for study- are required for an appropriate analysis.
ing time until occurrence of a specific event that occurs only The Kaplan-Meier method estimates the unadjusted
once, terminates the observation of a patient, and occurs probability of surviving beyond a certain time point, and a
independently between the patients. While this situation is Kaplan-Meier curve is a useful graphical tool to display the
common in many time-to-event study designs, researchers estimated survival function. The log-rank test is commonly
may be interested in: (1) events that can occur more than used to compare survival curves between different groups,
once or in a series of events in which each event has its own but can only be used for a crude, unadjusted comparison.
failure time; (2) situations in which follow-up may be termi- The Cox PH model is the most commonly used tech-
nated by >1 event; or (3) clusters of patients for whom the nique to assess the effect of factors, such as treatments, that
event does not occur independently. simultaneously allows one to control for the effects of other
Recurrent event models are capable of modeling the covariates. The exponentiated regression coefficients can be
sequential occurrence of events over time.37 This can be the interpreted in terms of an HR. This semiparametric tech-
same event transpiring several times (eg, occurrence of myo- nique makes no assumptions about the distribution of the
cardial infarction) or a series of different ordered events (eg, survival times.
different, progressive stages of a disease until death occurs). If the distribution can be appropriately identified and
Competing risk models can accommodate multiple modeled, parametric techniques can alternatively be used.
(competing) types of failure events, each of which terminate For special circumstances in which the standard techniques
the observation of an individual.38 For example, a researcher cannot be validly used, a variety of methods including

September 2018 • Volume 127 • Number 3 www.anesthesia-analgesia.org 797


EE SPECIAL ARTICLE

recurrent events models, competing risks models, and 22. Cox DR. Regression models and life-tables. J R Stat Soc Series B.
frailty models are available. E 1972;34:187–220.
23. Hosmer DW, Lemeshow S, May S. Interpretation of a fitted
proportional hazards regression model. In: Applied Survival
DISCLOSURES Analysis. 2nd ed. Hoboken, NJ: John Wiley & Sons, 2008:92–131.
Name: Patrick Schober, MD, PhD, MMedStat. 24. Nieto FJ, Coresh J. Adjusting survival curves for confounders: a
Contribution: This author helped write and revise the manuscript. review and a new method. Am J Epidemiol. 1996;143:1059–1068.
Name: Thomas R. Vetter, MD, MPH. 25. Hosmer DW, Lemeshow S, May S. Assessment of model ade-
Contribution: This author helped write and revise the manuscript. quacy. In: Applied Survival Analysis. 2nd ed. Hoboken, NJ: John
This manuscript was handled by: Jean-Francois Pittet, MD. Wiley & Sons, 2008:169–206.
Downloaded from http://journals.lww.com/anesthesia-analgesia by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsI
Ho4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC4/OAVpDDa8K2+Ya6H515kE= on 08/29/2024

26. Uno H, Claggett B, Tian L, et al. Moving beyond the hazard


REFERENCES ratio in quantifying the between-group difference in survival
1. Vetter TR, Schober P. Regression: the apple does not fall far analysis. J Clin Oncol. 2014;32:2380–2385.
from the tree. Anesth Analg. 2018;127:277–283. 27. Allison PD. Survival analysis. In: Hancock GR, Mueller RO,
2. Vetter TR, Mascha EJ. Unadjusted bivariate two-group compar- eds. The Reviewer’s Guide to Quantitative Methods in the Social
isons: when simpler is better. Anesth Analg. 2018;126:338–342. Sciences. New York, NY: Routledge, Taylor and Francis Group,
3. Clark TG, Bradburn MJ, Love SB, Altman DG. Survival 2010:413–424.
analysis part I: basic concepts and first analyses. Br J Cancer. 28. Fisher LD, Lin DY. Time-dependent covariates in the Cox pro-
2003;89:232–238. portional-hazards regression model. Annu Rev Public Health.
4. Kasza J, Wraith D, Lamb K, Wolfe R. Survival analysis of time- 1999;20:145–157.
to-event data in respiratory health research studies. Respirology. 29. Huang WW, Zhu WZ, Mu DL, et al. Perioperative manage-
2014;19:483–492. ment may improve long-term survival in patients after lung
5. Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure cancer surgery: a retrospective cohort study. Anesth Analg.
Time Data. 2nd ed. Hoboken, NJ: John Wiley & Sons; 2002. 2018;126:1666–1674.
6. Kartsonaki C. Survival analysis. Diagn Histopathol. 30. Podolyak A, Sessler DI, Reiterer C, et al. Perioperative supple-
2016;22:263–270. mental oxygen does not worsen long-term mortality of colorec-
7. Baum M. Breast cancer screening comes full circle. J Natl Cancer tal surgery patients. Anesth Analg. 2016;122:1907–1911.
Inst. 2004;96:1490–1491. 31. Wilson SH, Wolf BJ, Bingham K, et al. Labor analgesia onset
8. Dai H, Wang H. Introduction. In: Analysis for Time-to-Event Data with dural puncture epidural versus traditional epidural using
Under Censoring and Truncation. London, UK: Academic Press a 26-Gauge Whitacre needle and 0.125% bupivacaine bolus: a
Elsevier Ltd, 2017:1–13. randomized clinical trial. Anesth Analg. 2018;126:545–551.
9. Tsai WY, Jewell NP, Wang MC. A note on the product-limit 32. Kim M, Li G. Postoperative complications affecting survival
estimator under right censoring and left truncation. Biometrika. after cardiac arrest in general surgery patients. Anesth Analg.
1987;74:883–886. 2018;126:858–864.
10. Blagoev KB, Wilkerson J, Fojo T. Hazard ratios in cancer clinical 33. Chau A, Bibbo C, Huang CC, et al. Dural puncture epidural tech-
trials–a primer. Nat Rev Clin Oncol. 2012;9:178–183. nique improves labor analgesia quality with fewer side effects
11. Clark TG, Bradburn MJ, Love SB, Altman DG. Survival analysis compared with epidural and combined spinal epidural tech-
part IV: further concepts and methods in survival analysis. Br J niques: a randomized clinical trial. Anesth Analg. 2017;124:560–569.
Cancer. 2003;89:781–786. 34. Lee EK, Ahn HJ, Zo JI, Kim K, Jung DM, Park JH. Paravertebral
12. Cain KC, Harlow SD, Little RJ, et al. Bias due to left truncation block does not reduce cancer recurrence, but is related to higher
and left censoring in longitudinal studies of developmental and overall survival in lung cancer surgery: a retrospective cohort
disease processes. Am J Epidemiol. 2011;173:1078–1084. study. Anesth Analg. 2017;125:1322–1328.
13. Kleinbaum DG, Klein M. Introduction to survival analy- 35. Windpassinger M, Plattner O, Gemeiner J, et al. Pharyngeal oxygen
sis. In: Survival Analysis. 3rd ed. New York, NY: Springer, insufflation during AirTraq Laryngoscopy slows arterial desatura-
2012:1–54. tion in infants and small children. Anesth Analg. 2016;122:1153–1157.
14. Bradburn MJ, Clark TG, Love SB, Altman DG. Survival analysis 36. Hosmer DW, Lemeshow S, May S. Parametric regression mod-
part II: multivariate data analysis–an introduction to concepts els. In: Applied Survival Analysis. 2nd ed. Hoboken, NJ: John
and methods. Br J Cancer. 2003;89:431–436. Wiley & Sons, 2008:244–285.
15. Kalbfleisch JD, Prentice RL. Relative risk (Cox) regression 37. Amorim LD, Cai J. Modelling recurrent events: a tutorial for
models. In: The Statistical Analysis of Failure Time Data. 2nd ed. analysis in epidemiology. Int J Epidemiol. 2015;44:324–333.
Hoboken, NJ: John Wiley & Sons, 2002:95–147. 38. Dignam JJ, Zhang Q, Kocherginsky M. The use and interpre-
16. Hosmer DW, Lemeshow S, May S. Descriptive methods for sur- tation of competing risks regression models. Clin Cancer Res.
vival data. In: Applied Survival Analysis. 2nd ed. Hoboken, NJ: 2012;18:2301–2308.
John Wiley & Sons, 2008:16–66. 39. Austin PC. A tutorial on multilevel survival analysis: methods,
17. Bewick V, Cheek L, Ball J. Statistics review 12: survival analysis. models and applications. Int Stat Rev. 2017;85:185–203.
Crit Care. 2004;8:389–394. 40. Schober P, Vetter TR. Repeated measures designs and analy-
18. Schober P, Bossers SM, Schwarte LA. Statistical significance sis of longitudinal data: if at first you do not succeed—try, try
versus clinical importance of observed effect sizes: what do P again. Anesth Analg. 2018;127:569–573.
values and confidence intervals really represent? Anesth Analg. 41. Hosmer DW, Lemeshow S, May S. Other models and topics. In:
2018;126:1068–1072. Applied Survival Analysis. 2nd ed. Hoboken, NJ: John Wiley &
19. Bland JM, Altman DG. The logrank test. BMJ. 2004;328:1073. Sons, 2008:286–354.
20. Hosmer DW, Lemeshow S, May S. Model development. In: 42. Schoenfeld DA. Sample-size formula for the proportional-haz-
Applied Survival Analysis. 2nd ed. Hoboken, NJ: John Wiley & ards regression model. Biometrics. 1983;39:499–503.
Sons, 2008:132–168. 43. Peduzzi P, Concato J, Feinstein AR, Holford TR. Importance
21. Bradburn MJ, Clark TG, Love SB, Altman DG. Survival analy- of events per independent variable in proportional hazards
sis part III: multivariate data analysis—choosing a model and regression analysis. II. Accuracy and precision of regression
assessing its adequacy and fit. Br J Cancer. 2003;89:605–611. estimates. J Clin Epidemiol. 1995;48:1503–1510.

798   
www.anesthesia-analgesia.org ANESTHESIA & ANALGESIA

You might also like