Testing the prediction error difference between two predictors

M.A. van de Wiel; J. Berkhof; W.N. van Wieringen

doi:https://doi.org/10.1093/biostatistics/kxp011

Testing the prediction error difference between two predictors

M.A. van de Wiel, J. Berkhof, W.N. van Wieringen

Epidemiology and Data Science (VUmc)

Research output: Contribution to journal › Article › Academic › peer-review

46 Citations (Scopus)

Abstract

We develop an inference framework for the difference in errors between 2 prediction procedures. The 2 procedures may differ in any aspect and possibly utilize different sets of covariates. We apply training and testing on the same data set, which is accommodated by sample splitting. For each split, both procedures predict the response of the same samples, which results in paired residuals to which a signed-rank test is applied. Multiple splits result in multiple p-values. The median p-value and the mean inverse normal transformed p-value are proposed as summary (test) statistics, for which bounds on the overall type I error rate under a variety of assumptions are proven. A simulation study is performed to check type I error control of the least conservative bound. Moreover, it confirms superior power of our method with respect to a one-split approach. Our inference framework is applied to genomic survival data sets to study 2 issues: compare lasso and ridge regression and decide upon use of both methylation and gene expression markers or the latter only. The framework easily accommodates any prediction paradigm and allows comparing any 2, possibly nonmodel-based, prediction procedures.

Original language	English
Pages (from-to)	550-560
Journal	Biostatistics
Volume	10
DOIs	https://doi.org/10.1093/biostatistics/kxp011
Publication status	Published - 2009

Access to Document

https://doi.org/10.1093/biostatistics/kxp011

Cite this

@article{ee4d0f0850b74a6c8a56235ea2b4ddd8,

title = "Testing the prediction error difference between two predictors",

abstract = "We develop an inference framework for the difference in errors between 2 prediction procedures. The 2 procedures may differ in any aspect and possibly utilize different sets of covariates. We apply training and testing on the same data set, which is accommodated by sample splitting. For each split, both procedures predict the response of the same samples, which results in paired residuals to which a signed-rank test is applied. Multiple splits result in multiple p-values. The median p-value and the mean inverse normal transformed p-value are proposed as summary (test) statistics, for which bounds on the overall type I error rate under a variety of assumptions are proven. A simulation study is performed to check type I error control of the least conservative bound. Moreover, it confirms superior power of our method with respect to a one-split approach. Our inference framework is applied to genomic survival data sets to study 2 issues: compare lasso and ridge regression and decide upon use of both methylation and gene expression markers or the latter only. The framework easily accommodates any prediction paradigm and allows comparing any 2, possibly nonmodel-based, prediction procedures.",

author = "{van de Wiel}, M.A. and J. Berkhof and {van Wieringen}, W.N.",

year = "2009",

doi = "https://doi.org/10.1093/biostatistics/kxp011",

language = "English",

volume = "10",

pages = "550--560",

journal = "Biostatistics",

issn = "1465-4644",

publisher = "Oxford University Press",

}

TY - JOUR

T1 - Testing the prediction error difference between two predictors

AU - van de Wiel, M.A.

AU - Berkhof, J.

AU - van Wieringen, W.N.

PY - 2009

Y1 - 2009

N2 - We develop an inference framework for the difference in errors between 2 prediction procedures. The 2 procedures may differ in any aspect and possibly utilize different sets of covariates. We apply training and testing on the same data set, which is accommodated by sample splitting. For each split, both procedures predict the response of the same samples, which results in paired residuals to which a signed-rank test is applied. Multiple splits result in multiple p-values. The median p-value and the mean inverse normal transformed p-value are proposed as summary (test) statistics, for which bounds on the overall type I error rate under a variety of assumptions are proven. A simulation study is performed to check type I error control of the least conservative bound. Moreover, it confirms superior power of our method with respect to a one-split approach. Our inference framework is applied to genomic survival data sets to study 2 issues: compare lasso and ridge regression and decide upon use of both methylation and gene expression markers or the latter only. The framework easily accommodates any prediction paradigm and allows comparing any 2, possibly nonmodel-based, prediction procedures.

AB - We develop an inference framework for the difference in errors between 2 prediction procedures. The 2 procedures may differ in any aspect and possibly utilize different sets of covariates. We apply training and testing on the same data set, which is accommodated by sample splitting. For each split, both procedures predict the response of the same samples, which results in paired residuals to which a signed-rank test is applied. Multiple splits result in multiple p-values. The median p-value and the mean inverse normal transformed p-value are proposed as summary (test) statistics, for which bounds on the overall type I error rate under a variety of assumptions are proven. A simulation study is performed to check type I error control of the least conservative bound. Moreover, it confirms superior power of our method with respect to a one-split approach. Our inference framework is applied to genomic survival data sets to study 2 issues: compare lasso and ridge regression and decide upon use of both methylation and gene expression markers or the latter only. The framework easily accommodates any prediction paradigm and allows comparing any 2, possibly nonmodel-based, prediction procedures.

U2 - https://doi.org/10.1093/biostatistics/kxp011

DO - https://doi.org/10.1093/biostatistics/kxp011

M3 - Article

C2 - 19380517

SN - 1465-4644

VL - 10

SP - 550

EP - 560

JO - Biostatistics

JF - Biostatistics

ER -