Robustness of radiomics to variations in segmentation methods in multimodal brain MRI

M. G. Poirot; M. W. A. Caan; H. G. Ruhe; A. Bjørnerud; I. Groote; L. Reneman; H. A. Marquering

doi:https://doi.org/10.1038/s41598-022-20703-9

Robustness of radiomics to variations in segmentation methods in multimodal brain MRI

M. G. Poirot, M. W. A. Caan, H. G. Ruhe, A. Bjørnerud, I. Groote, L. Reneman, H. A. Marquering

Research output: Contribution to journal › Article › Academic › peer-review

2 Citations (Scopus)

Abstract

Radiomics in neuroimaging uses fully automatic segmentation to delineate the anatomical areas for which radiomic features are computed. However, differences among these segmentation methods affect radiomic features to an unknown extent. A scan-rescan dataset (n = 46) of T1-weighted and diffusion tensor images was used. Subjects were split into a sleep-deprivation and a control group. Scans were segmented using four segmentation methods from which radiomic features were computed. First, we measured segmentation agreement using the Dice-coefficient. Second, robustness and reproducibility of radiomic features were measured using the intraclass correlation coefficient (ICC). Last, difference in predictive power was assessed using the Friedman-test on performance in a radiomics-based sleep deprivation classification application. Segmentation agreement was generally high (interquartile range = 0.77–0.90) and median feature robustness to segmentation method variation was higher (ICC > 0.7) than scan-rescan reproducibility (ICC 0.3–0.8). However, classification performance differed significantly among segmentation methods (p < 0.001) ranging from 77 to 84%. Accuracy was higher for more recent deep learning-based segmentation methods. Despite high agreement among segmentation methods, subtle differences significantly affected radiomic features and their predictive power. Consequently, the effect of differences in segmentation methods should be taken into account when designing and evaluating radiomics-based research methods.

Original language	English
Article number	16712
Pages (from-to)	16712
Journal	Scientific reports
Volume	12
Issue number	1
DOIs	https://doi.org/10.1038/s41598-022-20703-9
Publication status	Published - Dec 2022

Access to Document

https://doi.org/10.1038/s41598-022-20703-9

Cite this

@article{351aaf459dc44514976498dcdf81249c,

title = "Robustness of radiomics to variations in segmentation methods in multimodal brain MRI",

abstract = "Radiomics in neuroimaging uses fully automatic segmentation to delineate the anatomical areas for which radiomic features are computed. However, differences among these segmentation methods affect radiomic features to an unknown extent. A scan-rescan dataset (n = 46) of T1-weighted and diffusion tensor images was used. Subjects were split into a sleep-deprivation and a control group. Scans were segmented using four segmentation methods from which radiomic features were computed. First, we measured segmentation agreement using the Dice-coefficient. Second, robustness and reproducibility of radiomic features were measured using the intraclass correlation coefficient (ICC). Last, difference in predictive power was assessed using the Friedman-test on performance in a radiomics-based sleep deprivation classification application. Segmentation agreement was generally high (interquartile range = 0.77–0.90) and median feature robustness to segmentation method variation was higher (ICC > 0.7) than scan-rescan reproducibility (ICC 0.3–0.8). However, classification performance differed significantly among segmentation methods (p < 0.001) ranging from 77 to 84%. Accuracy was higher for more recent deep learning-based segmentation methods. Despite high agreement among segmentation methods, subtle differences significantly affected radiomic features and their predictive power. Consequently, the effect of differences in segmentation methods should be taken into account when designing and evaluating radiomics-based research methods.",

author = "Poirot, {M. G.} and Caan, {M. W. A.} and Ruhe, {H. G.} and A. Bj{\o}rnerud and I. Groote and L. Reneman and Marquering, {H. A.}",

note = "Funding Information: This work was supported by the Eurostars funding program (Reference number 113351) and research grants from the Norwegian South-East Health Authorities (reference numbers 2018077 and 2017090). Publisher Copyright: {\textcopyright} 2022, The Author(s).",

year = "2022",

month = dec,

doi = "https://doi.org/10.1038/s41598-022-20703-9",

language = "English",

volume = "12",

pages = "16712",

journal = "Scientific reports",

issn = "2045-2322",

publisher = "Springer Nature",

number = "1",

}

TY - JOUR

T1 - Robustness of radiomics to variations in segmentation methods in multimodal brain MRI

AU - Poirot, M. G.

AU - Caan, M. W. A.

AU - Ruhe, H. G.

AU - Bjørnerud, A.

AU - Groote, I.

AU - Reneman, L.

AU - Marquering, H. A.

N1 - Funding Information: This work was supported by the Eurostars funding program (Reference number 113351) and research grants from the Norwegian South-East Health Authorities (reference numbers 2018077 and 2017090). Publisher Copyright: © 2022, The Author(s).

PY - 2022/12

Y1 - 2022/12

N2 - Radiomics in neuroimaging uses fully automatic segmentation to delineate the anatomical areas for which radiomic features are computed. However, differences among these segmentation methods affect radiomic features to an unknown extent. A scan-rescan dataset (n = 46) of T1-weighted and diffusion tensor images was used. Subjects were split into a sleep-deprivation and a control group. Scans were segmented using four segmentation methods from which radiomic features were computed. First, we measured segmentation agreement using the Dice-coefficient. Second, robustness and reproducibility of radiomic features were measured using the intraclass correlation coefficient (ICC). Last, difference in predictive power was assessed using the Friedman-test on performance in a radiomics-based sleep deprivation classification application. Segmentation agreement was generally high (interquartile range = 0.77–0.90) and median feature robustness to segmentation method variation was higher (ICC > 0.7) than scan-rescan reproducibility (ICC 0.3–0.8). However, classification performance differed significantly among segmentation methods (p < 0.001) ranging from 77 to 84%. Accuracy was higher for more recent deep learning-based segmentation methods. Despite high agreement among segmentation methods, subtle differences significantly affected radiomic features and their predictive power. Consequently, the effect of differences in segmentation methods should be taken into account when designing and evaluating radiomics-based research methods.

AB - Radiomics in neuroimaging uses fully automatic segmentation to delineate the anatomical areas for which radiomic features are computed. However, differences among these segmentation methods affect radiomic features to an unknown extent. A scan-rescan dataset (n = 46) of T1-weighted and diffusion tensor images was used. Subjects were split into a sleep-deprivation and a control group. Scans were segmented using four segmentation methods from which radiomic features were computed. First, we measured segmentation agreement using the Dice-coefficient. Second, robustness and reproducibility of radiomic features were measured using the intraclass correlation coefficient (ICC). Last, difference in predictive power was assessed using the Friedman-test on performance in a radiomics-based sleep deprivation classification application. Segmentation agreement was generally high (interquartile range = 0.77–0.90) and median feature robustness to segmentation method variation was higher (ICC > 0.7) than scan-rescan reproducibility (ICC 0.3–0.8). However, classification performance differed significantly among segmentation methods (p < 0.001) ranging from 77 to 84%. Accuracy was higher for more recent deep learning-based segmentation methods. Despite high agreement among segmentation methods, subtle differences significantly affected radiomic features and their predictive power. Consequently, the effect of differences in segmentation methods should be taken into account when designing and evaluating radiomics-based research methods.

UR - http://www.scopus.com/inward/record.url?scp=85139306284&partnerID=8YFLogxK

U2 - https://doi.org/10.1038/s41598-022-20703-9

DO - https://doi.org/10.1038/s41598-022-20703-9

M3 - Article

C2 - 36202934

SN - 2045-2322

VL - 12

SP - 16712

JO - Scientific reports

JF - Scientific reports

IS - 1

M1 - 16712

ER -

Robustness of radiomics to variations in segmentation methods in multimodal brain MRI

Abstract

Access to Document

Other files and links

Cite this