Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies

Leonardo Frazzoni; Julia Arribas; Giulio Antonelli; Diogo Libanio; Alanna Ebigbo; Fons van der Sommen; Albert Jeroen de Groof; Hiromu Fukuda; Masayasu Ohmori; Ryu Ishihara; Lianlian Wu; Honggang Yu; Yuichi Mori; Alessandro Repici; Jacques J. G. H. M. Bergman; Prateek Sharma; Helmut Messmann; Cesare Hassan; Lorenzo Fuccio; M. rio Dinis-Ribeiro

doi:https://doi.org/10.1055/a-1500-3730

Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies

Leonardo Frazzoni, Julia Arribas, Giulio Antonelli, Diogo Libanio, Alanna Ebigbo, Fons van der Sommen, Albert Jeroen de Groof, Hiromu Fukuda, Masayasu Ohmori, Ryu Ishihara, Lianlian Wu, Honggang Yu, Yuichi Mori, Alessandro Repici, Jacques J. G. H. M. Bergman, Prateek Sharma, Helmut Messmann, Cesare Hassan, Lorenzo Fuccio, M. rio Dinis-Ribeiro

Gastroenterology and Hepatology (AMC)

Research output: Contribution to journal › Article › Academic › peer-review

16 Citations (Scopus)

Abstract

Background ?Estimates on miss rates for upper gastrointestinal neoplasia (UGIN) rely on registry data or old studies. Quality assurance programs for upper GI endoscopy are not fully established owing to the lack of infrastructure to measure endoscopists' competence. We aimed to assess endoscopists' accuracy for the recognition of UGIN exploiting the framework of artificial intelligence (AI) validation studies. Methods ?Literature searches of databases (PubMed/MEDLINE, EMBASE, Scopus) up to August 2020 were performed to identify articles evaluating the accuracy of individual endoscopists for the recognition of UGIN within studies validating AI against a histologically verified expert-annotated ground-truth. The main outcomes were endoscopists' pooled sensitivity, specificity, positive and negative predictive value (PPV/NPV), and area under the curve (AUC) for all UGIN, for esophageal squamous cell neoplasia (ESCN), Barrett esophagus-related neoplasia (BERN), and gastric adenocarcinoma (GAC). Results ?Seven studies (2 ESCN, 3 BERN, 1 GAC, 1 UGIN overall) with 122 endoscopists were included. The pooled endoscopists' sensitivity and specificity for UGIN were 82?% (95?% confidence interval [CI] 80?%-84?%) and 79?% (95?%CI 76?%-81?%), respectively. Endoscopists' accuracy was higher for GAC detection (AUC 0.95 [95?%CI 0.93-0.98]) than for ESCN (AUC 0.90 [95?%CI 0.88-0.92]) and BERN detection (AUC 0.86 [95?%CI 0.84-0.88]). Sensitivity was higher for Eastern vs. Western endoscopists (87?% [95?%CI 84?%-89?%] vs. 75?% [95?%CI 72?%-78?%]), and for expert vs. non-expert endoscopists (85?% [95?%CI 83?%-87?%] vs. 71?% [95?%CI 67?%-75?%]). Conclusion ?We show suboptimal accuracy of endoscopists for the recognition of UGIN even within a framework that included a higher prevalence and disease awareness. Future AI validation studies represent a framework to assess endoscopist competence.

Original language	English
Journal	Endoscopy
Early online date	2021
DOIs	https://doi.org/10.1055/a-1500-3730
Publication status	E-pub ahead of print - 2021

Access to Document

https://doi.org/10.1055/a-1500-3730

Cite this

Frazzoni, L., Arribas, J., Antonelli, G., Libanio, D., Ebigbo, A., van der Sommen, F., de Groof, A. J., Fukuda, H., Ohmori, M., Ishihara, R., Wu, L., Yu, H., Mori, Y., Repici, A., Bergman, J. J. G. H. M., Sharma, P., Messmann, H., Hassan, C., Fuccio, L., & Dinis-Ribeiro, M. R. (2021). Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies. Endoscopy. Advance online publication. https://doi.org/10.1055/a-1500-3730

@article{c370c42d7f7e4070884ad0e5c5981b5b,

title = "Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies",

abstract = "Background ?Estimates on miss rates for upper gastrointestinal neoplasia (UGIN) rely on registry data or old studies. Quality assurance programs for upper GI endoscopy are not fully established owing to the lack of infrastructure to measure endoscopists' competence. We aimed to assess endoscopists' accuracy for the recognition of UGIN exploiting the framework of artificial intelligence (AI) validation studies. Methods ?Literature searches of databases (PubMed/MEDLINE, EMBASE, Scopus) up to August 2020 were performed to identify articles evaluating the accuracy of individual endoscopists for the recognition of UGIN within studies validating AI against a histologically verified expert-annotated ground-truth. The main outcomes were endoscopists' pooled sensitivity, specificity, positive and negative predictive value (PPV/NPV), and area under the curve (AUC) for all UGIN, for esophageal squamous cell neoplasia (ESCN), Barrett esophagus-related neoplasia (BERN), and gastric adenocarcinoma (GAC). Results ?Seven studies (2 ESCN, 3 BERN, 1 GAC, 1 UGIN overall) with 122 endoscopists were included. The pooled endoscopists' sensitivity and specificity for UGIN were 82?% (95?% confidence interval [CI] 80?%-84?%) and 79?% (95?%CI 76?%-81?%), respectively. Endoscopists' accuracy was higher for GAC detection (AUC 0.95 [95?%CI 0.93-0.98]) than for ESCN (AUC 0.90 [95?%CI 0.88-0.92]) and BERN detection (AUC 0.86 [95?%CI 0.84-0.88]). Sensitivity was higher for Eastern vs. Western endoscopists (87?% [95?%CI 84?%-89?%] vs. 75?% [95?%CI 72?%-78?%]), and for expert vs. non-expert endoscopists (85?% [95?%CI 83?%-87?%] vs. 71?% [95?%CI 67?%-75?%]). Conclusion ?We show suboptimal accuracy of endoscopists for the recognition of UGIN even within a framework that included a higher prevalence and disease awareness. Future AI validation studies represent a framework to assess endoscopist competence.",

author = "Leonardo Frazzoni and Julia Arribas and Giulio Antonelli and Diogo Libanio and Alanna Ebigbo and {van der Sommen}, Fons and {de Groof}, {Albert Jeroen} and Hiromu Fukuda and Masayasu Ohmori and Ryu Ishihara and Lianlian Wu and Honggang Yu and Yuichi Mori and Alessandro Repici and Bergman, {Jacques J. G. H. M.} and Prateek Sharma and Helmut Messmann and Cesare Hassan and Lorenzo Fuccio and Dinis-Ribeiro, {M. rio}",

year = "2021",

doi = "https://doi.org/10.1055/a-1500-3730",

language = "English",

journal = "Endoscopy",

issn = "0013-726X",

publisher = "Georg Thieme Verlag",

}

Frazzoni, L, Arribas, J, Antonelli, G, Libanio, D, Ebigbo, A, van der Sommen, F, de Groof, AJ, Fukuda, H, Ohmori, M, Ishihara, R, Wu, L, Yu, H, Mori, Y, Repici, A, Bergman, JJGHM, Sharma, P, Messmann, H, Hassan, C, Fuccio, L & Dinis-Ribeiro, MR 2021, 'Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies', Endoscopy. https://doi.org/10.1055/a-1500-3730

TY - JOUR

T1 - Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies

AU - Frazzoni, Leonardo

AU - Arribas, Julia

AU - Antonelli, Giulio

AU - Libanio, Diogo

AU - Ebigbo, Alanna

AU - van der Sommen, Fons

AU - de Groof, Albert Jeroen

AU - Fukuda, Hiromu

AU - Ohmori, Masayasu

AU - Ishihara, Ryu

AU - Wu, Lianlian

AU - Yu, Honggang

AU - Mori, Yuichi

AU - Repici, Alessandro

AU - Bergman, Jacques J. G. H. M.

AU - Sharma, Prateek

AU - Messmann, Helmut

AU - Hassan, Cesare

AU - Fuccio, Lorenzo

AU - Dinis-Ribeiro, M. rio

PY - 2021

Y1 - 2021

N2 - Background ?Estimates on miss rates for upper gastrointestinal neoplasia (UGIN) rely on registry data or old studies. Quality assurance programs for upper GI endoscopy are not fully established owing to the lack of infrastructure to measure endoscopists' competence. We aimed to assess endoscopists' accuracy for the recognition of UGIN exploiting the framework of artificial intelligence (AI) validation studies. Methods ?Literature searches of databases (PubMed/MEDLINE, EMBASE, Scopus) up to August 2020 were performed to identify articles evaluating the accuracy of individual endoscopists for the recognition of UGIN within studies validating AI against a histologically verified expert-annotated ground-truth. The main outcomes were endoscopists' pooled sensitivity, specificity, positive and negative predictive value (PPV/NPV), and area under the curve (AUC) for all UGIN, for esophageal squamous cell neoplasia (ESCN), Barrett esophagus-related neoplasia (BERN), and gastric adenocarcinoma (GAC). Results ?Seven studies (2 ESCN, 3 BERN, 1 GAC, 1 UGIN overall) with 122 endoscopists were included. The pooled endoscopists' sensitivity and specificity for UGIN were 82?% (95?% confidence interval [CI] 80?%-84?%) and 79?% (95?%CI 76?%-81?%), respectively. Endoscopists' accuracy was higher for GAC detection (AUC 0.95 [95?%CI 0.93-0.98]) than for ESCN (AUC 0.90 [95?%CI 0.88-0.92]) and BERN detection (AUC 0.86 [95?%CI 0.84-0.88]). Sensitivity was higher for Eastern vs. Western endoscopists (87?% [95?%CI 84?%-89?%] vs. 75?% [95?%CI 72?%-78?%]), and for expert vs. non-expert endoscopists (85?% [95?%CI 83?%-87?%] vs. 71?% [95?%CI 67?%-75?%]). Conclusion ?We show suboptimal accuracy of endoscopists for the recognition of UGIN even within a framework that included a higher prevalence and disease awareness. Future AI validation studies represent a framework to assess endoscopist competence.

AB - Background ?Estimates on miss rates for upper gastrointestinal neoplasia (UGIN) rely on registry data or old studies. Quality assurance programs for upper GI endoscopy are not fully established owing to the lack of infrastructure to measure endoscopists' competence. We aimed to assess endoscopists' accuracy for the recognition of UGIN exploiting the framework of artificial intelligence (AI) validation studies. Methods ?Literature searches of databases (PubMed/MEDLINE, EMBASE, Scopus) up to August 2020 were performed to identify articles evaluating the accuracy of individual endoscopists for the recognition of UGIN within studies validating AI against a histologically verified expert-annotated ground-truth. The main outcomes were endoscopists' pooled sensitivity, specificity, positive and negative predictive value (PPV/NPV), and area under the curve (AUC) for all UGIN, for esophageal squamous cell neoplasia (ESCN), Barrett esophagus-related neoplasia (BERN), and gastric adenocarcinoma (GAC). Results ?Seven studies (2 ESCN, 3 BERN, 1 GAC, 1 UGIN overall) with 122 endoscopists were included. The pooled endoscopists' sensitivity and specificity for UGIN were 82?% (95?% confidence interval [CI] 80?%-84?%) and 79?% (95?%CI 76?%-81?%), respectively. Endoscopists' accuracy was higher for GAC detection (AUC 0.95 [95?%CI 0.93-0.98]) than for ESCN (AUC 0.90 [95?%CI 0.88-0.92]) and BERN detection (AUC 0.86 [95?%CI 0.84-0.88]). Sensitivity was higher for Eastern vs. Western endoscopists (87?% [95?%CI 84?%-89?%] vs. 75?% [95?%CI 72?%-78?%]), and for expert vs. non-expert endoscopists (85?% [95?%CI 83?%-87?%] vs. 71?% [95?%CI 67?%-75?%]). Conclusion ?We show suboptimal accuracy of endoscopists for the recognition of UGIN even within a framework that included a higher prevalence and disease awareness. Future AI validation studies represent a framework to assess endoscopist competence.

UR - http://www.scopus.com/inward/record.url?scp=85108512744&partnerID=8YFLogxK

U2 - https://doi.org/10.1055/a-1500-3730

DO - https://doi.org/10.1055/a-1500-3730

M3 - Article

C2 - 33951743

SN - 0013-726X

JO - Endoscopy

JF - Endoscopy

ER -

Endoscopists' diagnostic accuracy in detecting upper gastrointestinal neoplasia in the framework of artificial intelligence studies

Abstract

Access to Document

Other files and links

Cite this