Multi-class glioma segmentation on real-world data with missing MRI sequences: comparison of three deep learning algorithms

Hugh G. Pemberton; Jiaming Wu; Ivar Kommers; Domenique M. J. Müller; Yipeng Hu; Olivia Goodkin; Sjoerd B. Vos; Sotirios Bisdas; Pierre A. Robe; Hilko Ardon; Lorenzo Bello; Marco Rossi; Tommaso Sciortino; Marco Conti Nibali; Mitchel S. Berger; Shawn L. Hervey-Jumper; Wim Bouwknegt; Wimar A. van den Brink; Julia Furtner; Seunggu J. Han; Albert J. S. Idema; Barbara Kiesel; Georg Widhalm; Alfred Kloet; Michiel Wagemakers; Aeilko H. Zwinderman; Sandro M. Krieg; Emmanuel Mandonnet; Ferran Prados; Philip de Witt Hamer; Frederik Barkhof; Roelant S. Eijgelaar

doi:https://doi.org/10.1038/s41598-023-44794-0

Research output: Contribution to journal › Article › Academic › peer-review

2 Citations (Scopus)

Abstract

This study tests the generalisability of three Brain Tumor Segmentation (BraTS) challenge models using a multi-center dataset of varying image quality and incomplete MRI datasets. In this retrospective study, DeepMedic, no-new-Unet (nn-Unet), and NVIDIA-net (nv-Net) were trained and tested using manual segmentations from preoperative MRI of glioblastoma (GBM) and low-grade gliomas (LGG) from the BraTS 2021 dataset (1251 in total), in addition to 275 GBM and 205 LGG acquired clinically across 12 hospitals worldwide. Data was split into 80% training, 5% validation, and 15% internal test data. An additional external test-set of 158 GBM and 69 LGG was used to assess generalisability to other hospitals’ data. All models’ median Dice similarity coefficient (DSC) for both test sets were within, or higher than, previously reported human inter-rater agreement (range of 0.74–0.85). For both test sets, nn-Unet achieved the highest DSC (internal = 0.86, external = 0.93) and the lowest Hausdorff distances (10.07, 13.87 mm, respectively) for all tumor classes (p < 0.001). By applying Sparsified training, missing MRI sequences did not statistically affect the performance. nn-Unet achieves accurate segmentations in clinical settings even in the presence of incomplete MRI datasets. This facilitates future clinical adoption of automated glioma segmentation, which could help inform treatment planning and glioma monitoring.
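The abstract reports agreement as a Dice similarity coefficient (DSC) per tumor class. For readers unfamiliar with the metric, the following is a minimal sketch of a per-label DSC over two flattened label maps. It is an illustration only, not the evaluation code used in the study: the function name, the toy label values, and the convention of returning 1.0 when a label is absent from both maps are all assumptions.

```python
from typing import Sequence


def dice_coefficient(pred: Sequence[int], truth: Sequence[int], label: int) -> float:
    """Dice similarity coefficient for one label: 2|A n B| / (|A| + |B|)."""
    if len(pred) != len(truth):
        raise ValueError("label maps must have the same number of voxels")
    # Count voxels carrying this label in each map, and in both at once.
    pred_hits = sum(1 for p in pred if p == label)
    truth_hits = sum(1 for t in truth if t == label)
    overlap = sum(1 for p, t in zip(pred, truth) if p == label and t == label)
    if pred_hits + truth_hits == 0:
        # Assumed convention: a label absent from both maps counts as agreement.
        return 1.0
    return 2.0 * overlap / (pred_hits + truth_hits)


# Toy flattened label maps; the values are hypothetical class labels, not BraTS data.
truth = [0, 1, 1, 2, 2, 2, 0, 0]
pred = [0, 1, 2, 2, 2, 0, 0, 0]

for label in (1, 2):
    print(label, round(dice_coefficient(pred, truth, label), 3))
```

A DSC of 1.0 means the two segmentations of that label coincide exactly, and 0.0 means they share no voxels. The Hausdorff distance reported alongside it measures boundary error in millimetres and is not covered by this sketch.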

Original language	English
Article number	18911
Journal	Scientific Reports
Volume	13
Issue number	1
DOIs	https://doi.org/10.1038/s41598-023-44794-0
Publication status	Published - 1 Dec 2023

Access to Document

https://doi.org/10.1038/s41598-023-44794-0

Cite this

Pemberton, H. G., Wu, J., Kommers, I., Müller, D. M. J., Hu, Y., Goodkin, O., Vos, S. B., Bisdas, S., Robe, P. A., Ardon, H., Bello, L., Rossi, M., Sciortino, T., Nibali, M. C., Berger, M. S., Hervey-Jumper, S. L., Bouwknegt, W., van den Brink, W. A., Furtner, J., ... Eijgelaar, R. S. (2023). Multi-class glioma segmentation on real-world data with missing MRI sequences: comparison of three deep learning algorithms. Scientific Reports, 13(1), Article 18911. https://doi.org/10.1038/s41598-023-44794-0

@article{1ec2ac1b765f4485acd2b8e99b3e6348,

title = "Multi-class glioma segmentation on real-world data with missing MRI sequences: comparison of three deep learning algorithms",

abstract = "This study tests the generalisability of three Brain Tumor Segmentation (BraTS) challenge models using a multi-center dataset of varying image quality and incomplete MRI datasets. In this retrospective study, DeepMedic, no-new-Unet (nn-Unet), and NVIDIA-net (nv-Net) were trained and tested using manual segmentations from preoperative MRI of glioblastoma (GBM) and low-grade gliomas (LGG) from the BraTS 2021 dataset (1251 in total), in addition to 275 GBM and 205 LGG acquired clinically across 12 hospitals worldwide. Data was split into 80% training, 5% validation, and 15% internal test data. An additional external test-set of 158 GBM and 69 LGG was used to assess generalisability to other hospitals{\textquoteright} data. All models{\textquoteright} median Dice similarity coefficient (DSC) for both test sets were within, or higher than, previously reported human inter-rater agreement (range of 0.74–0.85). For both test sets, nn-Unet achieved the highest DSC (internal = 0.86, external = 0.93) and the lowest Hausdorff distances (10.07, 13.87 mm, respectively) for all tumor classes (p < 0.001). By applying Sparsified training, missing MRI sequences did not statistically affect the performance. nn-Unet achieves accurate segmentations in clinical settings even in the presence of incomplete MRI datasets. This facilitates future clinical adoption of automated glioma segmentation, which could help inform treatment planning and glioma monitoring.",

author = "Pemberton, {Hugh G.} and Jiaming Wu and Ivar Kommers and M{\"u}ller, {Domenique M. J.} and Yipeng Hu and Olivia Goodkin and Vos, {Sjoerd B.} and Sotirios Bisdas and Robe, {Pierre A.} and Hilko Ardon and Lorenzo Bello and Marco Rossi and Tommaso Sciortino and Nibali, {Marco Conti} and Berger, {Mitchel S.} and Hervey-Jumper, {Shawn L.} and Wim Bouwknegt and {van den Brink}, {Wimar A.} and Julia Furtner and Han, {Seunggu J.} and Idema, {Albert J. S.} and Barbara Kiesel and Georg Widhalm and Alfred Kloet and Michiel Wagemakers and Zwinderman, {Aeilko H.} and Krieg, {Sandro M.} and Emmanuel Mandonnet and Ferran Prados and {de Witt Hamer}, Philip and Frederik Barkhof and Eijgelaar, {Roelant S.}",

note = "Funding Information: The authors would like to thank all patients whose data was used in this study. HP is a full-time employee of Deloitte. FB, SB, FP and JW are supported by the National Institute for Health Research (NIHR) biomedical research centre at UCLH. FP received a Guarantors of Brain fellowship 2017–2020 and is also supported by the Biomedical Research Centre initiative at University College London Hospitals (UCLH). The PICTURE project is sponsored by an unrestricted grant of Stichting Hanarth fonds, “Machine learning for better neurosurgical decisions in patients with glioblastoma”; a grant for public-private partnerships (Amsterdam UMC PPP-grant) sponsored by the Dutch government (Ministry of Economic Affairs) through the Rijksdienst voor Ondernemend Nederland (RVO) and Topsector Life Sciences and Health (LSH), “Picturing predictions for patients with brain tumors”; a grant from the Innovative Medical Devices Initiative program, project number 10-10400-96-14003; The Netherlands Organisation for Scientific Research (NWO), 2020.027; a grant from the Dutch Cancer Society, VU2014-7113 and the Anita Veldman foundation, CCA2018-2-17. Publisher Copyright: {\textcopyright} 2023, The Author(s).",

year = "2023",

month = dec,

day = "1",

doi = "10.1038/s41598-023-44794-0",

language = "English",

volume = "13",

journal = "Scientific Reports",

issn = "2045-2322",

publisher = "Springer Nature",

number = "1",

}

Pemberton, HG, Wu, J, Kommers, I, Müller, DMJ, Hu, Y, Goodkin, O, Vos, SB, Bisdas, S, Robe, PA, Ardon, H, Bello, L, Rossi, M, Sciortino, T, Nibali, MC, Berger, MS, Hervey-Jumper, SL, Bouwknegt, W, van den Brink, WA, Furtner, J, Han, SJ, Idema, AJS, Kiesel, B, Widhalm, G, Kloet, A, Wagemakers, M, Zwinderman, AH, Krieg, SM, Mandonnet, E, Prados, F, de Witt Hamer, P, Barkhof, F & Eijgelaar, RS 2023, 'Multi-class glioma segmentation on real-world data with missing MRI sequences: comparison of three deep learning algorithms', Scientific Reports, vol. 13, no. 1, 18911. https://doi.org/10.1038/s41598-023-44794-0

TY - JOUR

T1 - Multi-class glioma segmentation on real-world data with missing MRI sequences

T2 - comparison of three deep learning algorithms

AU - Pemberton, Hugh G.

AU - Wu, Jiaming

AU - Kommers, Ivar

AU - Müller, Domenique M. J.

AU - Hu, Yipeng

AU - Goodkin, Olivia

AU - Vos, Sjoerd B.

AU - Bisdas, Sotirios

AU - Robe, Pierre A.

AU - Ardon, Hilko

AU - Bello, Lorenzo

AU - Rossi, Marco

AU - Sciortino, Tommaso

AU - Nibali, Marco Conti

AU - Berger, Mitchel S.

AU - Hervey-Jumper, Shawn L.

AU - Bouwknegt, Wim

AU - van den Brink, Wimar A.

AU - Furtner, Julia

AU - Han, Seunggu J.

AU - Idema, Albert J. S.

AU - Kiesel, Barbara

AU - Widhalm, Georg

AU - Kloet, Alfred

AU - Wagemakers, Michiel

AU - Zwinderman, Aeilko H.

AU - Krieg, Sandro M.

AU - Mandonnet, Emmanuel

AU - Prados, Ferran

AU - de Witt Hamer, Philip

AU - Barkhof, Frederik

AU - Eijgelaar, Roelant S.

N1 - Funding Information: The authors would like to thank all patients whose data was used in this study. HP is a full-time employee of Deloitte. FB, SB, FP and JW are supported by the National Institute for Health Research (NIHR) biomedical research centre at UCLH. FP received a Guarantors of Brain fellowship 2017–2020 and is also supported by the Biomedical Research Centre initiative at University College London Hospitals (UCLH). The PICTURE project is sponsored by an unrestricted grant of Stichting Hanarth fonds, “Machine learning for better neurosurgical decisions in patients with glioblastoma”; a grant for public-private partnerships (Amsterdam UMC PPP-grant) sponsored by the Dutch government (Ministry of Economic Affairs) through the Rijksdienst voor Ondernemend Nederland (RVO) and Topsector Life Sciences and Health (LSH), “Picturing predictions for patients with brain tumors”; a grant from the Innovative Medical Devices Initiative program, project number 10-10400-96-14003; The Netherlands Organisation for Scientific Research (NWO), 2020.027; a grant from the Dutch Cancer Society, VU2014-7113 and the Anita Veldman foundation, CCA2018-2-17. Publisher Copyright: © 2023, The Author(s).

PY - 2023/12/1

Y1 - 2023/12/1

N2 - This study tests the generalisability of three Brain Tumor Segmentation (BraTS) challenge models using a multi-center dataset of varying image quality and incomplete MRI datasets. In this retrospective study, DeepMedic, no-new-Unet (nn-Unet), and NVIDIA-net (nv-Net) were trained and tested using manual segmentations from preoperative MRI of glioblastoma (GBM) and low-grade gliomas (LGG) from the BraTS 2021 dataset (1251 in total), in addition to 275 GBM and 205 LGG acquired clinically across 12 hospitals worldwide. Data was split into 80% training, 5% validation, and 15% internal test data. An additional external test-set of 158 GBM and 69 LGG was used to assess generalisability to other hospitals’ data. All models’ median Dice similarity coefficient (DSC) for both test sets were within, or higher than, previously reported human inter-rater agreement (range of 0.74–0.85). For both test sets, nn-Unet achieved the highest DSC (internal = 0.86, external = 0.93) and the lowest Hausdorff distances (10.07, 13.87 mm, respectively) for all tumor classes (p < 0.001). By applying Sparsified training, missing MRI sequences did not statistically affect the performance. nn-Unet achieves accurate segmentations in clinical settings even in the presence of incomplete MRI datasets. This facilitates future clinical adoption of automated glioma segmentation, which could help inform treatment planning and glioma monitoring.

AB - This study tests the generalisability of three Brain Tumor Segmentation (BraTS) challenge models using a multi-center dataset of varying image quality and incomplete MRI datasets. In this retrospective study, DeepMedic, no-new-Unet (nn-Unet), and NVIDIA-net (nv-Net) were trained and tested using manual segmentations from preoperative MRI of glioblastoma (GBM) and low-grade gliomas (LGG) from the BraTS 2021 dataset (1251 in total), in addition to 275 GBM and 205 LGG acquired clinically across 12 hospitals worldwide. Data was split into 80% training, 5% validation, and 15% internal test data. An additional external test-set of 158 GBM and 69 LGG was used to assess generalisability to other hospitals’ data. All models’ median Dice similarity coefficient (DSC) for both test sets were within, or higher than, previously reported human inter-rater agreement (range of 0.74–0.85). For both test sets, nn-Unet achieved the highest DSC (internal = 0.86, external = 0.93) and the lowest Hausdorff distances (10.07, 13.87 mm, respectively) for all tumor classes (p < 0.001). By applying Sparsified training, missing MRI sequences did not statistically affect the performance. nn-Unet achieves accurate segmentations in clinical settings even in the presence of incomplete MRI datasets. This facilitates future clinical adoption of automated glioma segmentation, which could help inform treatment planning and glioma monitoring.

UR - http://www.scopus.com/inward/record.url?scp=85175837438&partnerID=8YFLogxK

U2 - https://doi.org/10.1038/s41598-023-44794-0

DO - 10.1038/s41598-023-44794-0

M3 - Article

C2 - 37919354

SN - 2045-2322

VL - 13

JO - Scientific Reports

JF - Scientific Reports

IS - 1

M1 - 18911

ER -
