Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data

Tommaso Andreani; Linda M. Slot; Samuel Gabillard; Carsten Strübing; Claus Reimertz; Veeranagouda Yaligara; Aleida M. Bakker; Reza Olfati-Saber; Rene E. M. Toes; Hans U. Scherer; Franck Auge; Deimantė Šimaitė

doi:https://doi.org/10.1093/nargab/lqac049

Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data

Tommaso Andreani, Linda M. Slot, Samuel Gabillard, Carsten Strübing, Claus Reimertz, Veeranagouda Yaligara, Aleida M. Bakker, Reza Olfati-Saber, Rene E. M. Toes, Hans U. Scherer, Franck Auge, Deimantė Šimaitė

Doctoral School

Research output: Contribution to journal › Article › Academic › peer-review

6 Citations (Scopus)

Abstract

Multiple methods have recently been developed to reconstruct full-length B-cell receptors (BCRs) from single-cell RNA sequencing (scRNA-seq) data. This need emerged from the expansion of scRNA-seq techniques, the increasing interest in antibody-based drug development and the importance of BCR repertoire changes in cancer and autoimmune disease progression. However, a comprehensive assessment of performance-influencing factors such as the sequencing depth, read length or number of somatic hypermutations (SHMs) as well as guidance regarding the choice of methodology is still lacking. In this work, we evaluated the ability of six available methods to reconstruct full-length BCRs using one simulated and three experimental SMART-seq datasets. In addition, we validated that the BCRs assembled in silico recognize their intended targets when expressed as monoclonal antibodies. We observed that methods such as BALDR, BASIC and BRACER showed the best overall performance across the tested datasets and conditions, whereas only BASIC demonstrated acceptable results on very short read libraries. Furthermore, the de novo assembly-based methods BRACER and BALDR were the most accurate in reconstructing BCRs harboring different degrees of SHMs in the variable domain, while TRUST4, MiXCR and BASIC were the fastest. Finally, we propose guidelines to select the best method based on the given data characteristics.

Original language	English
Article number	lqac049
Journal	NAR Genomics and Bioinformatics
Volume	4
Issue number	3
DOIs	https://doi.org/10.1093/nargab/lqac049
Publication status	Published - 1 Sept 2022

Access to Document

https://doi.org/10.1093/nargab/lqac049

Cite this

Andreani, T., Slot, L. M., Gabillard, S., Strübing, C., Reimertz, C., Yaligara, V., Bakker, A. M., Olfati-Saber, R., Toes, R. E. M., Scherer, H. U., Auge, F., & Šimaitė, D. (2022). Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data. NAR Genomics and Bioinformatics, 4(3), Article lqac049. https://doi.org/10.1093/nargab/lqac049

@article{0ab21c383d3e47e8b843014aa65502ae,

title = "Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data",

abstract = "Multiple methods have recently been developed to reconstruct full-length B-cell receptors (BCRs) from single-cell RNA sequencing (scRNA-seq) data. This need emerged from the expansion of scRNA-seq techniques, the increasing interest in antibody-based drug development and the importance of BCR repertoire changes in cancer and autoimmune disease progression. However, a comprehensive assessment of performance-influencing factors such as the sequencing depth, read length or number of somatic hypermutations (SHMs) as well as guidance regarding the choice of methodology is still lacking. In this work, we evaluated the ability of six available methods to reconstruct full-length BCRs using one simulated and three experimental SMART-seq datasets. In addition, we validated that the BCRs assembled in silico recognize their intended targets when expressed as monoclonal antibodies. We observed that methods such as BALDR, BASIC and BRACER showed the best overall performance across the tested datasets and conditions, whereas only BASIC demonstrated acceptable results on very short read libraries. Furthermore, the de novo assembly-based methods BRACER and BALDR were the most accurate in reconstructing BCRs harboring different degrees of SHMs in the variable domain, while TRUST4, MiXCR and BASIC were the fastest. Finally, we propose guidelines to select the best method based on the given data characteristics.",

author = "Tommaso Andreani and Slot, {Linda M.} and Samuel Gabillard and Carsten Str{\"u}bing and Claus Reimertz and Veeranagouda Yaligara and Bakker, {Aleida M.} and Reza Olfati-Saber and Toes, {Rene E. M.} and Scherer, {Hans U.} and Franck Auge and Deimantė {\v S}imaitė",

note = "Publisher Copyright: {\textcopyright} 2022 The Author(s). Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.",

year = "2022",

month = sep,

day = "1",

doi = "https://doi.org/10.1093/nargab/lqac049",

language = "English",

volume = "4",

journal = "NAR Genomics and Bioinformatics",

issn = "2631-9268",

publisher = "Oxford University Press",

number = "3",

}

TY - JOUR

T1 - Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data

AU - Andreani, Tommaso

AU - Slot, Linda M.

AU - Gabillard, Samuel

AU - Strübing, Carsten

AU - Reimertz, Claus

AU - Yaligara, Veeranagouda

AU - Bakker, Aleida M.

AU - Olfati-Saber, Reza

AU - Toes, Rene E. M.

AU - Scherer, Hans U.

AU - Auge, Franck

AU - Šimaitė, Deimantė

PY - 2022/9/1

Y1 - 2022/9/1

N2 - Multiple methods have recently been developed to reconstruct full-length B-cell receptors (BCRs) from single-cell RNA sequencing (scRNA-seq) data. This need emerged from the expansion of scRNA-seq techniques, the increasing interest in antibody-based drug development and the importance of BCR repertoire changes in cancer and autoimmune disease progression. However, a comprehensive assessment of performance-influencing factors such as the sequencing depth, read length or number of somatic hypermutations (SHMs) as well as guidance regarding the choice of methodology is still lacking. In this work, we evaluated the ability of six available methods to reconstruct full-length BCRs using one simulated and three experimental SMART-seq datasets. In addition, we validated that the BCRs assembled in silico recognize their intended targets when expressed as monoclonal antibodies. We observed that methods such as BALDR, BASIC and BRACER showed the best overall performance across the tested datasets and conditions, whereas only BASIC demonstrated acceptable results on very short read libraries. Furthermore, the de novo assembly-based methods BRACER and BALDR were the most accurate in reconstructing BCRs harboring different degrees of SHMs in the variable domain, while TRUST4, MiXCR and BASIC were the fastest. Finally, we propose guidelines to select the best method based on the given data characteristics.

AB - Multiple methods have recently been developed to reconstruct full-length B-cell receptors (BCRs) from single-cell RNA sequencing (scRNA-seq) data. This need emerged from the expansion of scRNA-seq techniques, the increasing interest in antibody-based drug development and the importance of BCR repertoire changes in cancer and autoimmune disease progression. However, a comprehensive assessment of performance-influencing factors such as the sequencing depth, read length or number of somatic hypermutations (SHMs) as well as guidance regarding the choice of methodology is still lacking. In this work, we evaluated the ability of six available methods to reconstruct full-length BCRs using one simulated and three experimental SMART-seq datasets. In addition, we validated that the BCRs assembled in silico recognize their intended targets when expressed as monoclonal antibodies. We observed that methods such as BALDR, BASIC and BRACER showed the best overall performance across the tested datasets and conditions, whereas only BASIC demonstrated acceptable results on very short read libraries. Furthermore, the de novo assembly-based methods BRACER and BALDR were the most accurate in reconstructing BCRs harboring different degrees of SHMs in the variable domain, while TRUST4, MiXCR and BASIC were the fastest. Finally, we propose guidelines to select the best method based on the given data characteristics.

UR - http://www.scopus.com/inward/record.url?scp=85134588202&partnerID=8YFLogxK

U2 - https://doi.org/10.1093/nargab/lqac049

DO - https://doi.org/10.1093/nargab/lqac049

M3 - Article

C2 - 35855325

SN - 2631-9268

VL - 4

JO - NAR Genomics and Bioinformatics

JF - NAR Genomics and Bioinformatics

IS - 3

M1 - lqac049

ER -

Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data

Abstract

Access to Document

Other files and links

Cite this