TY - JOUR
T1 - A deep learning system for detection of early Barrett's neoplasia
T2 - a model development and validation study
AU - Fockens, K. N.
AU - Jong, M. R.
AU - Jukema, J. B.
AU - Boers, T. G. W.
AU - Kusters, C. H. J.
AU - van der Putten, J. A.
AU - Pouw, R. E.
AU - Duits, L. C.
AU - Montazeri, N. S. M.
AU - van Munster, S. N.
AU - Weusten, B. L. A. M.
AU - Alvarez Herrero, L.
AU - Houben, M. H. M. G.
AU - Nagengast, W. B.
AU - Westerhof, J.
AU - Alkhalaf, A.
AU - Mallant-Hent, R. C.
AU - Scholten, P.
AU - Ragunath, K.
AU - Seewald, S.
AU - Elbe, P.
AU - Baldaque-Silva, F.
AU - Barret, M.
AU - Ortiz Fernández-Sordo, J.
AU - Villarejo, G. Moral
AU - Pech, O.
AU - Barrett's Oesophagus Imaging for Artificial Intelligence (BONS-AI) consortium
AU - Beyna, T.
AU - van der Sommen, F.
AU - de With, P. H.
AU - de Groof, A. J.
AU - Bergman, J. J.
N1 - Publisher Copyright: © 2023 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Background: Computer-aided detection (CADe) systems could assist endoscopists in detecting early neoplasia in Barrett's oesophagus, which could be difficult to detect in endoscopic images. The aim of this study was to develop, test, and benchmark a CADe system for early neoplasia in Barrett's oesophagus. Methods: The CADe system was first pretrained with ImageNet followed by domain-specific pretraining with GastroNet. We trained the CADe system on a dataset of 14 046 images (2506 patients) of confirmed Barrett's oesophagus neoplasia and non-dysplastic Barrett's oesophagus from 15 centres. Neoplasia was delineated by 14 Barrett's oesophagus experts for all datasets. We tested the performance of the CADe system on two independent test sets. The all-comers test set comprised 327 (73 patients) non-dysplastic Barrett's oesophagus images, 82 (46 patients) neoplastic images, 180 (66 of the same patients) non-dysplastic Barrett's oesophagus videos, and 71 (45 of the same patients) neoplastic videos. The benchmarking test set comprised 100 (50 patients) neoplastic images, 300 (125 patients) non-dysplastic images, 47 (47 of the same patients) neoplastic videos, and 141 (82 of the same patients) non-dysplastic videos, and was enriched with subtle neoplasia cases. The benchmarking test set was evaluated by 112 endoscopists from six countries (first without CADe and, after 6 weeks, with CADe) and by 28 external international Barrett's oesophagus experts. The primary outcome was the sensitivity of Barrett's neoplasia detection by general endoscopists without CADe assistance versus with CADe assistance on the benchmarking test set. We compared sensitivity using a mixed-effects logistic regression model with conditional odds ratios (ORs; likelihood profile 95% CIs). Findings: Sensitivity for neoplasia detection among endoscopists increased from 74% to 88% with CADe assistance (OR 2·04; 95% CI 1·73–2·42; p<0·0001 for images and from 67% to 79% [2·35; 1·90–2·94; p<0·0001] for video) without compromising specificity (from 89% to 90% [1·07; 0·96–1·19; p=0·20] for images and from 96% to 94% [0·94; 0·79–1·11; ] for video; p=0·46). In the all-comers test set, CADe detected neoplastic lesions in 95% (88–98) of images and 97% (90–99) of videos. In the benchmarking test set, the CADe system was superior to endoscopists in detecting neoplasia (90% vs 74% [OR 3·75; 95% CI 1·93–8·05; p=0·0002] for images and 91% vs 67% [11·68; 3·85–47·53; p<0·0001] for video) and non-inferior to Barrett's oesophagus experts (90% vs 87% [OR 1·74; 95% CI 0·83–3·65] for images and 91% vs 86% [2·94; 0·99–11·40] for video). Interpretation: CADe outperformed endoscopists in detecting Barrett's oesophagus neoplasia and, when used as an assistive tool, it improved their detection rate. CADe detected virtually all neoplasia in a test set of consecutive cases. Funding: Olympus.
AB - Background: Computer-aided detection (CADe) systems could assist endoscopists in detecting early neoplasia in Barrett's oesophagus, which could be difficult to detect in endoscopic images. The aim of this study was to develop, test, and benchmark a CADe system for early neoplasia in Barrett's oesophagus. Methods: The CADe system was first pretrained with ImageNet followed by domain-specific pretraining with GastroNet. We trained the CADe system on a dataset of 14 046 images (2506 patients) of confirmed Barrett's oesophagus neoplasia and non-dysplastic Barrett's oesophagus from 15 centres. Neoplasia was delineated by 14 Barrett's oesophagus experts for all datasets. We tested the performance of the CADe system on two independent test sets. The all-comers test set comprised 327 (73 patients) non-dysplastic Barrett's oesophagus images, 82 (46 patients) neoplastic images, 180 (66 of the same patients) non-dysplastic Barrett's oesophagus videos, and 71 (45 of the same patients) neoplastic videos. The benchmarking test set comprised 100 (50 patients) neoplastic images, 300 (125 patients) non-dysplastic images, 47 (47 of the same patients) neoplastic videos, and 141 (82 of the same patients) non-dysplastic videos, and was enriched with subtle neoplasia cases. The benchmarking test set was evaluated by 112 endoscopists from six countries (first without CADe and, after 6 weeks, with CADe) and by 28 external international Barrett's oesophagus experts. The primary outcome was the sensitivity of Barrett's neoplasia detection by general endoscopists without CADe assistance versus with CADe assistance on the benchmarking test set. We compared sensitivity using a mixed-effects logistic regression model with conditional odds ratios (ORs; likelihood profile 95% CIs). Findings: Sensitivity for neoplasia detection among endoscopists increased from 74% to 88% with CADe assistance (OR 2·04; 95% CI 1·73–2·42; p<0·0001 for images and from 67% to 79% [2·35; 1·90–2·94; p<0·0001] for video) without compromising specificity (from 89% to 90% [1·07; 0·96–1·19; p=0·20] for images and from 96% to 94% [0·94; 0·79–1·11; ] for video; p=0·46). In the all-comers test set, CADe detected neoplastic lesions in 95% (88–98) of images and 97% (90–99) of videos. In the benchmarking test set, the CADe system was superior to endoscopists in detecting neoplasia (90% vs 74% [OR 3·75; 95% CI 1·93–8·05; p=0·0002] for images and 91% vs 67% [11·68; 3·85–47·53; p<0·0001] for video) and non-inferior to Barrett's oesophagus experts (90% vs 87% [OR 1·74; 95% CI 0·83–3·65] for images and 91% vs 86% [2·94; 0·99–11·40] for video). Interpretation: CADe outperformed endoscopists in detecting Barrett's oesophagus neoplasia and, when used as an assistive tool, it improved their detection rate. CADe detected virtually all neoplasia in a test set of consecutive cases. Funding: Olympus.
UR - http://www.scopus.com/inward/record.url?scp=85177799871&partnerID=8YFLogxK
U2 - https://doi.org/10.1016/S2589-7500(23)00199-1
DO - https://doi.org/10.1016/S2589-7500(23)00199-1
M3 - Article
C2 - 38000874
SN - 2589-7500
VL - 5
SP - e905-e916
JO - The Lancet Digital Health
JF - The Lancet Digital Health
IS - 12
ER -