Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models

Joost van der Putten; Jeroen de Groof; Fons van der Sommen; Maarten Struyvenberg; Svitlana Zinger; Wouter Curvers; Erik Schoon; Jacques Bergman; Peter H. N. de With

doi:https://doi.org/10.1109/ICIP.2019.8802947

Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models

Joost van der Putten, Jeroen de Groof, Fons van der Sommen, Maarten Struyvenberg, Svitlana Zinger, Wouter Curvers, Erik Schoon, Jacques Bergman, Peter H. N. de With

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

12 Citations (Scopus)

Abstract

The goal of endoscopic analysis is to find abnormal lesions and determine further therapy from the obtained information. For example, in case of Barrett's esophagus, the objective of endoscopy is to timely detect dysplastic lesions, before endoscopic resection is no longer possible. However, the procedure produces a variety of non-informative frames and lesions can be missed due to poor video quality. Especially when analyzing entire endoscopic videos made by non-expert endoscopists, informative frame classification is crucial to e.g. video quality grading. This analysis involves classification problems such as polyp detection or dysplasia detection in Barrett's Esophagus. This work concentrates on the design of an automated indication of informativeness of video frames. We propose an algorithm consisting of state-of-the-art deep learning techniques, to initialize frame-based classification, followed by a hidden Markov model to incorporate temporal information and control consistent decision making. Results from the performed experiments show that the proposed model improves on the state-of-the-art with an F1-score of 91%, and a substantial increase in sensitivity of 10%, thereby indicating improved labeling consistency. Additionally, the algorithm is capable of processing 261 frames per second, which is multiple times faster compared to other informative frame classification algorithms, thus enabling real-time computation.

Original language	English
Title of host publication	2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings
Publisher	IEEE Computer Society
Pages	380-384
Volume	2019-September
ISBN (Electronic)	9781538662496
DOIs	https://doi.org/10.1109/ICIP.2019.8802947
Publication status	Published - 2019
Event	26th IEEE International Conference on Image Processing, ICIP 2019 - Taipei, Taiwan, Province of China Duration: 22 Sept 2019 → 25 Sept 2019

Publication series

Name	Proceedings - International Conference on Image Processing, ICIP

Conference

Conference	26th IEEE International Conference on Image Processing, ICIP 2019
Country/Territory	Taiwan, Province of China
City	Taipei
Period	22/09/2019 → 25/09/2019

Access to Document

https://doi.org/10.1109/ICIP.2019.8802947

Cite this

van der Putten, J., de Groof, J., van der Sommen, F., Struyvenberg, M., Zinger, S., Curvers, W., Schoon, E., Bergman, J., & de With, P. H. N. (2019). Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models. In 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings (Vol. 2019-September, pp. 380-384). (Proceedings - International Conference on Image Processing, ICIP). IEEE Computer Society. https://doi.org/10.1109/ICIP.2019.8802947

van der Putten, Joost ; de Groof, Jeroen ; van der Sommen, Fons et al. / Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models. 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings. Vol. 2019-September IEEE Computer Society, 2019. pp. 380-384 (Proceedings - International Conference on Image Processing, ICIP).

@inproceedings{c2224b442f8249d4b544d88d16d58ff5,

title = "Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models",

abstract = "The goal of endoscopic analysis is to find abnormal lesions and determine further therapy from the obtained information. For example, in case of Barrett's esophagus, the objective of endoscopy is to timely detect dysplastic lesions, before endoscopic resection is no longer possible. However, the procedure produces a variety of non-informative frames and lesions can be missed due to poor video quality. Especially when analyzing entire endoscopic videos made by non-expert endoscopists, informative frame classification is crucial to e.g. video quality grading. This analysis involves classification problems such as polyp detection or dysplasia detection in Barrett's Esophagus. This work concentrates on the design of an automated indication of informativeness of video frames. We propose an algorithm consisting of state-of-the-art deep learning techniques, to initialize frame-based classification, followed by a hidden Markov model to incorporate temporal information and control consistent decision making. Results from the performed experiments show that the proposed model improves on the state-of-the-art with an F1-score of 91%, and a substantial increase in sensitivity of 10%, thereby indicating improved labeling consistency. Additionally, the algorithm is capable of processing 261 frames per second, which is multiple times faster compared to other informative frame classification algorithms, thus enabling real-time computation.",

author = "{van der Putten}, Joost and {de Groof}, Jeroen and {van der Sommen}, Fons and Maarten Struyvenberg and Svitlana Zinger and Wouter Curvers and Erik Schoon and Jacques Bergman and {de With}, {Peter H. N.}",

year = "2019",

doi = "https://doi.org/10.1109/ICIP.2019.8802947",

language = "English",

volume = "2019-September",

series = "Proceedings - International Conference on Image Processing, ICIP",

publisher = "IEEE Computer Society",

pages = "380--384",

booktitle = "2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings",

note = "26th IEEE International Conference on Image Processing, ICIP 2019 ; Conference date: 22-09-2019 Through 25-09-2019",

}

van der Putten, J, de Groof, J, van der Sommen, F, Struyvenberg, M, Zinger, S, Curvers, W, Schoon, E, Bergman, J & de With, PHN 2019, Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models. in 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings. vol. 2019-September, Proceedings - International Conference on Image Processing, ICIP, IEEE Computer Society, pp. 380-384, 26th IEEE International Conference on Image Processing, ICIP 2019, Taipei, Taiwan, Province of China, 22/09/2019. https://doi.org/10.1109/ICIP.2019.8802947

Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models. / van der Putten, Joost; de Groof, Jeroen; van der Sommen, Fons et al.
2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings. Vol. 2019-September IEEE Computer Society, 2019. p. 380-384 (Proceedings - International Conference on Image Processing, ICIP).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

TY - GEN

T1 - Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models

AU - van der Putten, Joost

AU - de Groof, Jeroen

AU - van der Sommen, Fons

AU - Struyvenberg, Maarten

AU - Zinger, Svitlana

AU - Curvers, Wouter

AU - Schoon, Erik

AU - Bergman, Jacques

AU - de With, Peter H. N.

PY - 2019

Y1 - 2019

N2 - The goal of endoscopic analysis is to find abnormal lesions and determine further therapy from the obtained information. For example, in case of Barrett's esophagus, the objective of endoscopy is to timely detect dysplastic lesions, before endoscopic resection is no longer possible. However, the procedure produces a variety of non-informative frames and lesions can be missed due to poor video quality. Especially when analyzing entire endoscopic videos made by non-expert endoscopists, informative frame classification is crucial to e.g. video quality grading. This analysis involves classification problems such as polyp detection or dysplasia detection in Barrett's Esophagus. This work concentrates on the design of an automated indication of informativeness of video frames. We propose an algorithm consisting of state-of-the-art deep learning techniques, to initialize frame-based classification, followed by a hidden Markov model to incorporate temporal information and control consistent decision making. Results from the performed experiments show that the proposed model improves on the state-of-the-art with an F1-score of 91%, and a substantial increase in sensitivity of 10%, thereby indicating improved labeling consistency. Additionally, the algorithm is capable of processing 261 frames per second, which is multiple times faster compared to other informative frame classification algorithms, thus enabling real-time computation.

AB - The goal of endoscopic analysis is to find abnormal lesions and determine further therapy from the obtained information. For example, in case of Barrett's esophagus, the objective of endoscopy is to timely detect dysplastic lesions, before endoscopic resection is no longer possible. However, the procedure produces a variety of non-informative frames and lesions can be missed due to poor video quality. Especially when analyzing entire endoscopic videos made by non-expert endoscopists, informative frame classification is crucial to e.g. video quality grading. This analysis involves classification problems such as polyp detection or dysplasia detection in Barrett's Esophagus. This work concentrates on the design of an automated indication of informativeness of video frames. We propose an algorithm consisting of state-of-the-art deep learning techniques, to initialize frame-based classification, followed by a hidden Markov model to incorporate temporal information and control consistent decision making. Results from the performed experiments show that the proposed model improves on the state-of-the-art with an F1-score of 91%, and a substantial increase in sensitivity of 10%, thereby indicating improved labeling consistency. Additionally, the algorithm is capable of processing 261 frames per second, which is multiple times faster compared to other informative frame classification algorithms, thus enabling real-time computation.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85076799862&origin=inward

U2 - https://doi.org/10.1109/ICIP.2019.8802947

DO - https://doi.org/10.1109/ICIP.2019.8802947

M3 - Conference contribution

VL - 2019-September

T3 - Proceedings - International Conference on Image Processing, ICIP

SP - 380

EP - 384

BT - 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings

PB - IEEE Computer Society

T2 - 26th IEEE International Conference on Image Processing, ICIP 2019

Y2 - 22 September 2019 through 25 September 2019

ER -

van der Putten J, de Groof J, van der Sommen F, Struyvenberg M, Zinger S, Curvers W et al. Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models. In 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings. Vol. 2019-September. IEEE Computer Society. 2019. p. 380-384. (Proceedings - International Conference on Image Processing, ICIP). doi: https://doi.org/10.1109/ICIP.2019.8802947

Informative Frame Classification of Endoscopic Videos Using Convolutional Neural Networks and Hidden Markov Models

Abstract

Publication series

Conference

Access to Document

Other files and links

Cite this