Impact of imperfection in medical imaging data on deep learning-based segmentation performance: An experimental study using synthesized data

Ayetullah Mehdi Güneş; Ward van Rooij; Sadaf Gulshad; Ben Slotman; Max Dahele; Wilko Verbakel

doi:https://doi.org/10.1002/mp.16437

Impact of imperfection in medical imaging data on deep learning-based segmentation performance: An experimental study using synthesized data

Ayetullah Mehdi Güneş, Ward van Rooij, Sadaf Gulshad, Ben Slotman, Max Dahele, Wilko Verbakel

Research output: Contribution to journal › Article › Academic › peer-review

1 Citation (Scopus)

Abstract

Background: Clinical data used to train deep learning models are often not clean data. They can contain imperfections in both the imaging data and the corresponding segmentations. Purpose: This study investigates the influence of data imperfections on the performance of deep learning models for parotid gland segmentation. This was done in a controlled manner by using synthesized data. The insights this study provides may be used to make deep learning models better and more reliable. Methods: The data were synthesized by using the clinical segmentations, creating a pseudo ground-truth in the process. Three kinds of imperfections were simulated: incorrect segmentations, low image contrast, and artifacts in the imaging data. The severity of each imperfection was varied in five levels. Models resulting from training sets from each of the five levels were cross-evaluated with test sets from each of the five levels. Results: Using synthesized data led to almost perfect parotid gland segmentation when no error was added. Lowering the quality of the parotid gland segmentations used for training substantially lowered the model performance. Additionally, lowering the image quality of the training data by decreasing the contrast or introducing artifacts made the resulting models more robust to data containing those respective kinds of data imperfection. Conclusion: This study demonstrated the importance of good-quality segmentations for deep learning training and it shows that using low-quality imaging data for training can enhance the robustness of the resulting models.

Original language	English
Pages (from-to)	6421-6432
Number of pages	12
Journal	Medical physics
Volume	50
Issue number	10
Early online date	2023
DOIs	https://doi.org/10.1002/mp.16437
Publication status	Published - Oct 2023

Keywords

data imperfection
deep learning
parotid gland
segmentation
synthesized data

Access to Document

https://doi.org/10.1002/mp.16437

Cite this

@article{0bd34150363f43b78f603c7dae0035fe,

title = "Impact of imperfection in medical imaging data on deep learning-based segmentation performance: An experimental study using synthesized data",

abstract = "Background: Clinical data used to train deep learning models are often not clean data. They can contain imperfections in both the imaging data and the corresponding segmentations. Purpose: This study investigates the influence of data imperfections on the performance of deep learning models for parotid gland segmentation. This was done in a controlled manner by using synthesized data. The insights this study provides may be used to make deep learning models better and more reliable. Methods: The data were synthesized by using the clinical segmentations, creating a pseudo ground-truth in the process. Three kinds of imperfections were simulated: incorrect segmentations, low image contrast, and artifacts in the imaging data. The severity of each imperfection was varied in five levels. Models resulting from training sets from each of the five levels were cross-evaluated with test sets from each of the five levels. Results: Using synthesized data led to almost perfect parotid gland segmentation when no error was added. Lowering the quality of the parotid gland segmentations used for training substantially lowered the model performance. Additionally, lowering the image quality of the training data by decreasing the contrast or introducing artifacts made the resulting models more robust to data containing those respective kinds of data imperfection. Conclusion: This study demonstrated the importance of good-quality segmentations for deep learning training and it shows that using low-quality imaging data for training can enhance the robustness of the resulting models.",

keywords = "data imperfection, deep learning, parotid gland, segmentation, synthesized data",

author = "G{\"u}ne{\c s}, {Ayetullah Mehdi} and {van Rooij}, Ward and Sadaf Gulshad and Ben Slotman and Max Dahele and Wilko Verbakel",

note = "Funding Information: The authors thank Varian Medical Systems for providing a research grant for this work. This work was supported by Varian Medical Systems, Palo Alto, CA, USA. Publisher Copyright: {\textcopyright} 2023 The Authors. Medical Physics published by Wiley Periodicals LLC on behalf of American Association of Physicists in Medicine.",

year = "2023",

month = oct,

doi = "https://doi.org/10.1002/mp.16437",

language = "English",

volume = "50",

pages = "6421--6432",

journal = "Medical physics",

issn = "0094-2405",

publisher = "AAPM - American Association of Physicists in Medicine",

number = "10",

}

TY - JOUR

T1 - Impact of imperfection in medical imaging data on deep learning-based segmentation performance

T2 - An experimental study using synthesized data

AU - Güneş, Ayetullah Mehdi

AU - van Rooij, Ward

AU - Gulshad, Sadaf

AU - Slotman, Ben

AU - Dahele, Max

AU - Verbakel, Wilko

N1 - Funding Information: The authors thank Varian Medical Systems for providing a research grant for this work. This work was supported by Varian Medical Systems, Palo Alto, CA, USA. Publisher Copyright: © 2023 The Authors. Medical Physics published by Wiley Periodicals LLC on behalf of American Association of Physicists in Medicine.

PY - 2023/10

Y1 - 2023/10

N2 - Background: Clinical data used to train deep learning models are often not clean data. They can contain imperfections in both the imaging data and the corresponding segmentations. Purpose: This study investigates the influence of data imperfections on the performance of deep learning models for parotid gland segmentation. This was done in a controlled manner by using synthesized data. The insights this study provides may be used to make deep learning models better and more reliable. Methods: The data were synthesized by using the clinical segmentations, creating a pseudo ground-truth in the process. Three kinds of imperfections were simulated: incorrect segmentations, low image contrast, and artifacts in the imaging data. The severity of each imperfection was varied in five levels. Models resulting from training sets from each of the five levels were cross-evaluated with test sets from each of the five levels. Results: Using synthesized data led to almost perfect parotid gland segmentation when no error was added. Lowering the quality of the parotid gland segmentations used for training substantially lowered the model performance. Additionally, lowering the image quality of the training data by decreasing the contrast or introducing artifacts made the resulting models more robust to data containing those respective kinds of data imperfection. Conclusion: This study demonstrated the importance of good-quality segmentations for deep learning training and it shows that using low-quality imaging data for training can enhance the robustness of the resulting models.

AB - Background: Clinical data used to train deep learning models are often not clean data. They can contain imperfections in both the imaging data and the corresponding segmentations. Purpose: This study investigates the influence of data imperfections on the performance of deep learning models for parotid gland segmentation. This was done in a controlled manner by using synthesized data. The insights this study provides may be used to make deep learning models better and more reliable. Methods: The data were synthesized by using the clinical segmentations, creating a pseudo ground-truth in the process. Three kinds of imperfections were simulated: incorrect segmentations, low image contrast, and artifacts in the imaging data. The severity of each imperfection was varied in five levels. Models resulting from training sets from each of the five levels were cross-evaluated with test sets from each of the five levels. Results: Using synthesized data led to almost perfect parotid gland segmentation when no error was added. Lowering the quality of the parotid gland segmentations used for training substantially lowered the model performance. Additionally, lowering the image quality of the training data by decreasing the contrast or introducing artifacts made the resulting models more robust to data containing those respective kinds of data imperfection. Conclusion: This study demonstrated the importance of good-quality segmentations for deep learning training and it shows that using low-quality imaging data for training can enhance the robustness of the resulting models.

KW - data imperfection

KW - deep learning

KW - parotid gland

KW - segmentation

KW - synthesized data

UR - http://www.scopus.com/inward/record.url?scp=85158031976&partnerID=8YFLogxK

U2 - https://doi.org/10.1002/mp.16437

DO - https://doi.org/10.1002/mp.16437

M3 - Article

C2 - 37118976

SN - 0094-2405

VL - 50

SP - 6421

EP - 6432

JO - Medical physics

JF - Medical physics

IS - 10

ER -

Impact of imperfection in medical imaging data on deep learning-based segmentation performance: An experimental study using synthesized data

Abstract

Keywords

Access to Document

Other files and links

Cite this