Sparse classification with paired covariates

Armin Rauschenberger; Iuliana Ciocănea-Teodorescu; Marianne A. Jonker; Renée X. Menezes; Mark A. van de Wiel

doi:https://doi.org/10.1007/s11634-019-00375-6

Research output: Contribution to journal › Article › Academic › peer-review

5 Citations (Scopus)

Abstract

This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from CRAN.
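
The idea of weighting paired covariates inside a lasso penalty can be illustrated with a small sketch. This is not the authors' algorithm and not the palasso API: it is a generic weighted lasso fitted by coordinate descent, written under several assumptions made only for illustration. The weights are absolute marginal correlations with the response, the response is continuous with squared-error loss (the paper targets classification), the second covariate set is a signed square-root transformation of the first, and all function names (`standardise`, `weighted_lasso`, `simulate`, `marginal_weights`) are hypothetical.

```python
import math
import random


def standardise(values):
    """Centre to mean 0 and scale to unit (population) variance."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / sd for v in values]


def soft_threshold(value, threshold):
    """Shrink value towards zero; return exactly zero inside the band."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def weighted_lasso(columns, y, penalty, weights, sweeps=200):
    """Coordinate descent for
        (1 / 2n) * ||y - X b||^2 + penalty * sum_j |b_j| / weights[j],
    assuming standardised columns. A larger weight means a smaller penalty;
    a zero weight excludes the covariate altogether."""
    n = len(y)
    beta = [0.0] * len(columns)
    residual = list(y)
    for _ in range(sweeps):
        for j, column in enumerate(columns):
            if weights[j] <= 0.0:
                continue
            # Correlation of column j with the partial residual.
            rho = sum(c * r for c, r in zip(column, residual)) / n + beta[j]
            updated = soft_threshold(rho, penalty / weights[j])
            if updated != beta[j]:
                step = updated - beta[j]
                residual = [r - step * c for r, c in zip(residual, column)]
                beta[j] = updated
    return beta


def simulate(n=60, pairs=5, seed=1):
    """Toy paired data: set x is Gaussian, set z is a transformation of x."""
    rng = random.Random(seed)
    raw = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(pairs)]
    # Assumed transformation for illustration: signed square root.
    transformed = [[math.copysign(math.sqrt(abs(v)), v) for v in col]
                   for col in raw]
    # Only the first pair carries signal.
    y = [raw[0][i] + 0.3 * rng.gauss(0.0, 1.0) for i in range(n)]
    columns = [standardise(col) for col in raw + transformed]
    return columns, standardise(y)


def marginal_weights(columns, y):
    """Assumed weighting rule: absolute marginal correlation with y."""
    n = len(y)
    return [abs(sum(c * v for c, v in zip(col, y)) / n) for col in columns]


columns, y = simulate()
weights = marginal_weights(columns, y)
beta = weighted_lasso(columns, y, penalty=0.1, weights=weights)
pairs = len(columns) // 2
for j in range(pairs):
    # Covariate j of set x is paired with covariate j of set z.
    print(f"pair {j}: x={beta[j]:+.3f}  z={beta[j + pairs]:+.3f}")
```

Covariates with a weak marginal association receive a heavier penalty and are therefore less likely to be selected, which is one simple way in which weights can steer selection between the two sets and within each pair. The weighting schemes actually used by the paired lasso are described in the article itself.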

Original language	English
Pages (from-to)	571-588
Number of pages	18
Journal	Advances in Data Analysis and Classification
Volume	14
Issue number	3
DOIs	https://doi.org/10.1007/s11634-019-00375-6
Publication status	Published - 1 Sept 2020

Keywords

Lasso regression
Paired data
Prediction
Sparsity

Access to Document

https://doi.org/10.1007/s11634-019-00375-6

Cite this

@article{330dd6383a0a449aa43eb68038101b53,
title = "Sparse classification with paired covariates",
abstract = "This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from CRAN.",
keywords = "Lasso regression, Paired data, Prediction, Sparsity",
author = "Rauschenberger, Armin and Cioc{\u a}nea-Teodorescu, Iuliana and Jonker, {Marianne A.} and Menezes, {Ren{\'e}e X.} and {van de Wiel}, {Mark A.}",
year = "2020",
month = sep,
day = "1",
doi = "10.1007/s11634-019-00375-6",
language = "English",
volume = "14",
number = "3",
pages = "571--588",
journal = "Advances in Data Analysis and Classification",
issn = "1862-5347",
publisher = "Springer Verlag",
}

TY  - JOUR
T1  - Sparse classification with paired covariates
AU  - Rauschenberger, Armin
AU  - Ciocănea-Teodorescu, Iuliana
AU  - Jonker, Marianne A.
AU  - Menezes, Renée X.
AU  - van de Wiel, Mark A.
PY  - 2020/9/1
Y1  - 2020/9/1
N2  - This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from CRAN.
KW  - Lasso regression
KW  - Paired data
KW  - Prediction
KW  - Sparsity
UR  - http://www.scopus.com/inward/record.url?scp=85075439602&partnerID=8YFLogxK
U2  - https://doi.org/10.1007/s11634-019-00375-6
DO  - https://doi.org/10.1007/s11634-019-00375-6
M3  - Article
SN  - 1862-5347
VL  - 14
SP  - 571
EP  - 588
JO  - Advances in Data Analysis and Classification
JF  - Advances in Data Analysis and Classification
IS  - 3
ER  -
