Bias in Censored Median Regression

This document evaluates the performance of censored median regression estimators. It finds that approaches that treat censored observations differently based on information like conditional probabilities introduce less bias than methods that treat all censored data the same. Specifically, an "inequality" loss model that handles censoring with inequality constraints is biased, while a weighted loss model that accounts for conditional probabilities of censoring using weights introduces only small bias. Simulation results support that the weighted approach has better mean squared error properties.

Uploaded by

Antonio Eleuteri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views3 pages

Bias in Censored Median Regression

Uploaded by

Antonio Eleuteri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 3

Bias in censored median regression

A. Eleuteri

In this short report we evaluate the performance of a censored median regression
estimator, as presented by two different groups of authors [1], [2]. In both cases the
algorithm can be reduced to the same form, which we will define as based on an
“inequality” loss (since censoring is dealt with by a set of inequality constraints). In
the following we consider the case of right censoring.

Source of bias

Suppose we have a random sample of pairs, $\{(T_i, C_i) : i = 1, \ldots, n\}$, with
$T_i \sim F$ and with $T_i$ and $C_i$ conditionally independent (though in the following
we will assume the one-sample case). Let us consider the case of right censoring, so
$Y_i = \min\{T_i, C_i\}$ and $\delta_i = I(T_i < C_i)$, where $I(\cdot)$ is the set
indicator function. The median loss function is
$\rho(r) = r \{ 1/2 - I(r < 0) \}$.
In ordinary quantile regression the contribution of each point to the subgradient
condition only depends on the sign of the residuals [3] $r_i = T_i - \xi$.
For uncensored data we observe both $Y_i = T_i < C_i$ and $I(r_i < 0)$ (note the
residuals can be either negative or positive).
The sign is likewise observed for censored data in the case $\xi < Y_i = C_i$ (note in
this case $T_i > C_i$, hence $I(r_i < 0) = 0$). However, if $\xi > Y_i = C_i$ we cannot
observe the sign of the residual, since we can have either $\xi > T_i$ or
$\xi \le T_i$ (i.e. the residual can be negative or positive).
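We can however evaluate the following conditional expectation (w.r.t. the measure $F$):
\[
E[I(r_i < 0) \mid T_i > C_i]
  = \frac{\Pr\{C_i < T_i < \xi\}}{\Pr\{C_i < T_i\}}
  = \frac{F(\xi) - F(C_i)}{1 - F(C_i)}
  = \frac{1/2 - F(C_i)}{1 - F(C_i)},
\]
where the last equality uses $F(\xi) = 1/2$, with $\xi$ the median of $F$.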

The above quantity (calculated for $F(C_i) < 1/2$) gives a measure of the “weight”
attached to ambiguous observations. This suggests a weighting scheme originally
proposed by Efron [4] and adapted by Portnoy [5] to quantile regression.
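
The weight formula can be checked numerically. The following is a minimal sketch, not part of the original report: it assumes standard lognormal event times (the distribution used in the experiment further on), a hypothetical fixed censoring time `c = 0.5`, and a convention of returning weight 0 when $F(C_i) \ge 1/2$. It compares a Monte Carlo estimate of $\Pr\{T < \xi \mid T > c\}$ with the closed-form weight.

```python
import numpy as np
from math import erf, log, sqrt

def lognormal_cdf(t):
    """CDF of the standard lognormal distribution at a scalar t > 0."""
    return 0.5 * (1.0 + erf(log(t) / sqrt(2.0)))

def censoring_weight(F_c, tau=0.5):
    """Weight (tau - F(C)) / (1 - F(C)) when F(C) < tau.

    Returning 0 otherwise is a convention assumed for this sketch.
    """
    return (tau - F_c) / (1.0 - F_c) if F_c < tau else 0.0

rng = np.random.default_rng(0)
xi = 1.0   # true median of the standard lognormal
c = 0.5    # hypothetical censoring time below the median

t = rng.lognormal(mean=0.0, sigma=1.0, size=1_000_000)
tail = t[t > c]                    # event times known only to exceed c
mc_estimate = np.mean(tail < xi)   # Monte Carlo Pr{T < xi | T > c}
formula = censoring_weight(lognormal_cdf(c))

print(mc_estimate, formula)
```

The two printed numbers should agree up to Monte Carlo error, which illustrates why a censored observation with $C_i$ below the median carries only a fractional "vote" for a negative residual.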

In contrast, the censored observations are all dealt with in the same way in the
“inequality” loss model; this fact introduces a bias in the estimates.

In the following graph we plot different expressions of the empirical loss against
the true (unfeasible) empirical loss. The “naïve” loss simply ignores
the censoring information, introducing a large bias. The “inequality” loss also
introduces some bias. The weighted loss introduces only a small bias, though it
requires knowledge of the distribution of the events, which is normally not available;
it seems plausible that a nonparametric estimator, e.g. the Kaplan-Meier estimator,
might be used instead. It is not clear how the presence of covariates affects the estimate.
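[Figure: empirical loss curves for the naïve loss, the weighted loss, the “inequality” loss and the true (unfeasible) loss. Horizontal axis ticks run from 0.2 to 1.8; vertical axis ticks visible at 80 and 100.]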
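
A comparison of this kind can be sketched in a few lines. The code below is an illustration under stated assumptions, not the author's implementation: the one-sided form of the “inequality” loss (a censored point is penalised only while $\xi < C_i$), the Portnoy-style split of each censored point into mass $w_i$ at $C_i$ and mass $1 - w_i$ at a large value `y_inf`, the use of the true $F$ for the weights, the sample size and the seed are all choices made for this example.

```python
import numpy as np
from math import erf, sqrt

def rho(r):
    """Median (check) loss rho(r) = r * (1/2 - I(r < 0)), i.e. |r| / 2."""
    return r * (0.5 - (r < 0))

def lognormal_cdf(t):
    """CDF of the standard lognormal, evaluated elementwise."""
    return np.array([0.5 * (1.0 + erf(np.log(v) / sqrt(2.0))) for v in t])

rng = np.random.default_rng(1)
n = 2000
t = rng.lognormal(0.0, 1.0, n)           # event times (hidden when censored)
c = rng.exponential(scale=4.0, size=n)   # censoring times, rate 0.25
y = np.minimum(t, c)
delta = t < c                            # True where the event is observed

# Weights for censored points, computed from the true F (assumed known here).
F_c = lognormal_cdf(c[~delta])
w = np.where(F_c < 0.5, (0.5 - F_c) / (1.0 - F_c), 0.0)
y_inf = y.max() + 1.0                    # stand-in for +infinity, beyond the grid

def true_loss(xi):
    """Unfeasible loss: uses the unobservable event times."""
    return rho(t - xi).sum()

def naive_loss(xi):
    """Treats every observed time as if it were an event time."""
    return rho(y - xi).sum()

def inequality_loss(xi):
    """Censored points contribute only while xi < C_i (assumed one-sided form)."""
    return rho(y[delta] - xi).sum() + rho(np.maximum(y[~delta] - xi, 0.0)).sum()

def weighted_loss(xi):
    """Each censored point is split: mass w at C_i, mass 1 - w at y_inf."""
    cens = y[~delta]
    return (rho(y[delta] - xi).sum()
            + (w * rho(cens - xi)).sum()
            + ((1.0 - w) * rho(y_inf - xi)).sum())

grid = np.linspace(0.2, 1.8, 321)
minimisers = {f.__name__: grid[np.argmin([f(x) for x in grid])]
              for f in (true_loss, naive_loss, inequality_loss, weighted_loss)}
print(minimisers)
```

Comparing the grid minimisers with the true median (1 for the standard lognormal) gives a quick indication of the bias each loss introduces in a single sample.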

One-sample asymptotics experiment

The following tables report the results of a simulation experiment comparing the
finite-sample performance of some estimates of the median in a censored one-sample
setting. We take the distribution of events to be standard lognormal, and the censoring
distribution to be exponential with rate 0.25. We follow the experimental setup in [6].
For each problem instance the estimate was calculated 1000 times and the results
averaged.
Note that we also report the performance of the (unfeasible) sample median.
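
The setup just described can be sketched for the two simplest estimators. This is a minimal sketch under assumptions, not the code used for the report: it covers only the (unfeasible) sample median and the Kaplan-Meier median, the Kaplan-Meier routine assumes no tied observation times, and the function names, seed and default number of replications are illustrative.

```python
import numpy as np

def km_median(y, delta):
    """Smallest observed time at which the Kaplan-Meier survival estimate is <= 1/2.

    Assumes no ties; returns NaN if the estimate never reaches 1/2.
    """
    order = np.argsort(y)
    y, delta = y[order], delta[order]
    n = len(y)
    at_risk = n - np.arange(n)                        # n, n-1, ..., 1
    factors = np.where(delta, 1.0 - 1.0 / at_risk, 1.0)
    surv = np.cumprod(factors)                        # survival estimate at each y
    below = np.nonzero(surv <= 0.5)[0]
    return y[below[0]] if below.size else np.nan

def scaled_mse(n, reps=1000, seed=0):
    """Return n * MSE for the sample median and the Kaplan-Meier median."""
    rng = np.random.default_rng(seed)
    true_median = 1.0                                 # median of the standard lognormal
    err_sample, err_km = [], []
    for _ in range(reps):
        t = rng.lognormal(0.0, 1.0, n)                # event times
        c = rng.exponential(scale=4.0, size=n)        # censoring times, rate 0.25
        y, delta = np.minimum(t, c), t < c
        err_sample.append(np.median(t) - true_median)
        err_km.append(km_median(y, delta) - true_median)
    mse_sample = np.nanmean(np.square(err_sample))
    mse_km = np.nanmean(np.square(err_km))
    return n * mse_sample, n * mse_km

print(scaled_mse(200))
```

The printed pair can be set against the N=200 row of the table; exact agreement is not expected, since the random draws differ.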

TABLE 2: Scaled MSE

          Sample    Kaplan-Meier   Censored median regression   Censored median regression
          median                   (“inequality” loss)          (weighted loss)
N=50      1.674     1.756          1.528                        1.686
N=200     1.780     2.023          2.265                        1.826
N=500     1.565     1.902          3.694                        1.774
N=1000    1.445     1.716          5.613                        1.547
N=∞       1.571     1.839          ?                            ?

MSE for some estimators of the median. The estimates are scaled by sample size to
conform to asymptotic variance calculations.

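It is evident that the “inequality” loss approach produces biased results: its scaled MSE grows steadily with the sample size (from 1.528 at N=50 to 5.613 at N=1000), whereas the scaled MSE of the other estimators remains roughly stable.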
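
The link between a growing scaled MSE and bias can be made explicit with a short worked calculation. This is a sketch added for illustration; the notation $\hat{\xi}_n$ for an estimate computed from $n$ observations and $b$ for a limiting bias are assumptions of this note, not symbols from the report.

\[
n \cdot \mathrm{MSE}(\hat{\xi}_n)
  = n \cdot \mathrm{Var}(\hat{\xi}_n) + n \cdot \bigl(E[\hat{\xi}_n] - \xi\bigr)^2 .
\]

For a root-$n$ consistent estimator the first term settles to the asymptotic variance and the second vanishes. If instead the bias tends to some $b \neq 0$, the second term grows like $n b^2$, which matches the roughly linear growth of the “inequality” loss column. As a rough back-of-envelope reading of that column, $(5.613 - 3.694)/500 \approx 0.004$, suggesting a bias of the order of $0.06$, though simulation noise makes this only indicative.

The value 1.571 reported for the sample median in the last row of the table agrees with the standard asymptotic variance of a sample median, $1 / (4 f(\xi)^2)$: for the standard lognormal, $\xi = 1$ and $f(1) = 1/\sqrt{2\pi}$, so

\[
\frac{1}{4 f(1)^2} = \frac{2\pi}{4} = \frac{\pi}{2} \approx 1.571 .
\]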

References

[1] K. Pelckmans, J. De Brabanter, J. A. K. Suykens, B. De Moor. Risk Scores,
Empirical Z-estimators and its application to Censored Regression. Technical Report
kp06-105 (2006).

[2] P. Shivaswamy, W. Chu, M. Jansche. A Support Vector Approach to Censored
Targets. Proceedings of the 2007 Seventh IEEE International Conference on Data
Mining (2007).

[3] R. Koenker. Quantile Regression. Cambridge University Press (2005).

[4] B. Efron. The Two Sample Problem with Censored Data. Proceedings of the
5th Berkeley Symposium on Mathematical Statistics and Probability, Prentice-Hall,
New York (1967).

[5] S. Portnoy. Censored Quantile Regression. Journal of the American Statistical
Association, 98, 1001-1012 (2003).

[6] R. Koenker. Censored Quantile Regression Redux. Journal of Statistical Software,
27-6 (2008).
