A transformer architecture for retention time prediction in liquid chromatography mass spectrometry-based proteomics

Thang V. Pham; Vinh V. Nguyen; Duong Vu; Alex A. Henneman; Robin A. Richardson; Sander R. Piersma; Connie R. Jimenez

doi:10.1002/pmic.202200041

Research output: Contribution to journal › Article › Academic › peer-review

2 Citations (Scopus)

Abstract

Accurate retention time (RT) prediction is important for spectral library-based analysis in data-independent acquisition mass spectrometry-based proteomics. The deep learning approach has demonstrated superior performance over traditional machine learning methods for this purpose. The transformer architecture is a recent development in deep learning that delivers state-of-the-art performance in many fields such as natural language processing, computer vision, and biology. We assess the performance of the transformer architecture for RT prediction using datasets from five deep learning models Prosit, DeepDIA, AutoRT, DeepPhospho, and AlphaPeptDeep. The experimental results on holdout datasets and independent datasets exhibit state-of-the-art performance of the transformer architecture. The software and evaluation datasets are publicly available for future development in the field.

Original language	English
Article number	2200041
Journal	PROTEOMICS
Volume	23
Issue number	7-8
Early online date	2023
DOIs	https://doi.org/10.1002/pmic.202200041
Publication status	Published - Apr 2023

Keywords

DIA-MS
deep learning
retention time prediction
spectral library
transformer architecture

Access to Document

https://doi.org/10.1002/pmic.202200041

Cite this

@article{c557ce1924db4f52af0e8b2cb723adf1,
  title = "A transformer architecture for retention time prediction in liquid chromatography mass spectrometry-based proteomics",
  abstract = "Accurate retention time (RT) prediction is important for spectral library-based analysis in data-independent acquisition mass spectrometry-based proteomics. The deep learning approach has demonstrated superior performance over traditional machine learning methods for this purpose. The transformer architecture is a recent development in deep learning that delivers state-of-the-art performance in many fields such as natural language processing, computer vision, and biology. We assess the performance of the transformer architecture for RT prediction using datasets from five deep learning models Prosit, DeepDIA, AutoRT, DeepPhospho, and AlphaPeptDeep. The experimental results on holdout datasets and independent datasets exhibit state-of-the-art performance of the transformer architecture. The software and evaluation datasets are publicly available for future development in the field.",
  keywords = "DIA-MS, deep learning, retention time prediction, spectral library, transformer architecture",
  author = "Pham, {Thang V.} and Nguyen, {Vinh V.} and Vu, Duong and Henneman, {Alex A.} and Richardson, {Robin A.} and Piersma, {Sander R.} and Jimenez, {Connie R.}",
  note = "Funding Information: This work was supported by the Netherlands eScience Center, grant no. ASDI.2020.014. This work made use of the Dutch national e‐infrastructure with the support of the SURF Cooperative using grant no. EINF‐3550. Publisher Copyright: {\textcopyright} 2023 The Authors. Proteomics published by Wiley-VCH GmbH.",
  year = "2023",
  month = apr,
  doi = "10.1002/pmic.202200041",
  language = "English",
  volume = "23",
  journal = "PROTEOMICS",
  issn = "1615-9853",
  publisher = "Wiley-VCH Verlag",
  number = "7-8",
}

TY  - JOUR
T1  - A transformer architecture for retention time prediction in liquid chromatography mass spectrometry-based proteomics
AU  - Pham, Thang V.
AU  - Nguyen, Vinh V.
AU  - Vu, Duong
AU  - Henneman, Alex A.
AU  - Richardson, Robin A.
AU  - Piersma, Sander R.
AU  - Jimenez, Connie R.
N1  - Funding Information: This work was supported by the Netherlands eScience Center, grant no. ASDI.2020.014. This work made use of the Dutch national e‐infrastructure with the support of the SURF Cooperative using grant no. EINF‐3550. Publisher Copyright: © 2023 The Authors. Proteomics published by Wiley-VCH GmbH.
PY  - 2023/4
Y1  - 2023/4
N2  - Accurate retention time (RT) prediction is important for spectral library-based analysis in data-independent acquisition mass spectrometry-based proteomics. The deep learning approach has demonstrated superior performance over traditional machine learning methods for this purpose. The transformer architecture is a recent development in deep learning that delivers state-of-the-art performance in many fields such as natural language processing, computer vision, and biology. We assess the performance of the transformer architecture for RT prediction using datasets from five deep learning models Prosit, DeepDIA, AutoRT, DeepPhospho, and AlphaPeptDeep. The experimental results on holdout datasets and independent datasets exhibit state-of-the-art performance of the transformer architecture. The software and evaluation datasets are publicly available for future development in the field.
AB  - Accurate retention time (RT) prediction is important for spectral library-based analysis in data-independent acquisition mass spectrometry-based proteomics. The deep learning approach has demonstrated superior performance over traditional machine learning methods for this purpose. The transformer architecture is a recent development in deep learning that delivers state-of-the-art performance in many fields such as natural language processing, computer vision, and biology. We assess the performance of the transformer architecture for RT prediction using datasets from five deep learning models Prosit, DeepDIA, AutoRT, DeepPhospho, and AlphaPeptDeep. The experimental results on holdout datasets and independent datasets exhibit state-of-the-art performance of the transformer architecture. The software and evaluation datasets are publicly available for future development in the field.
KW  - DIA-MS
KW  - deep learning
KW  - retention time prediction
KW  - spectral library
KW  - transformer architecture
UR  - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85150781802&origin=inward
UR  - https://www.ncbi.nlm.nih.gov/pubmed/36906835
U2  - https://doi.org/10.1002/pmic.202200041
DO  - https://doi.org/10.1002/pmic.202200041
M3  - Article
C2  - 36906835
SN  - 1615-9853
VL  - 23
JO  - PROTEOMICS
JF  - PROTEOMICS
IS  - 7-8
M1  - 2200041
ER  -
