Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment

Martijn Otten; Ameet R Jagesar; Tariq A Dam; Laurens A Biesheuvel; Floris den Hengst; Kirsten A Ziesemer; Patrick J Thoral; Harm-Jan de Grooth; Armand R J Girbes; Vincent François-Lavet; Mark Hoogendoorn; Paul W G Elbers

doi:10.1097/CCM.0000000000006100

Research output: Contribution to journal › Article › Academic › peer-review

1 Citation (Scopus)

Abstract

OBJECTIVE: Reinforcement learning (RL) is a machine learning technique uniquely effective at sequential decision-making, which makes it potentially relevant to ICU treatment challenges. We set out to systematically review, assess level-of-readiness and meta-analyze the effect of RL on outcomes for critically ill patients.

DATA SOURCES: A systematic search was performed in PubMed, Embase.com, Clarivate Analytics/Web of Science Core Collection, Elsevier/SCOPUS and the Institute of Electrical and Electronics Engineers Xplore Digital Library from inception to March 25, 2022, with subsequent citation tracking.

DATA EXTRACTION: Journal articles that used an RL technique in an ICU population and reported on patient health-related outcomes were included for full analysis. Conference papers were included for level-of-readiness assessment only. Descriptive statistics, characteristics of the models, outcome compared with clinician's policy and level-of-readiness were collected. RL-health risk of bias and applicability assessment was performed.

DATA SYNTHESIS: A total of 1,033 articles were screened, of which 18 journal articles and 18 conference papers were included. Thirty of those were prototyping or modeling articles and six were validation articles. All articles reported RL algorithms to outperform clinical decision-making by ICU professionals, but only in retrospective data. The modeling techniques for the state-space, action-space, reward function, RL model training, and evaluation varied widely. The risk of bias was high in all articles, mainly due to the evaluation procedure.

CONCLUSION: In this first systematic review on the application of RL in intensive care medicine we found no studies that demonstrated improved patient outcomes from RL-based technologies. All studies reported that RL-agent policies outperformed clinician policies, but such assessments were all based on retrospective off-policy evaluation.

Original language	English
Pages (from-to)	E79-E88
Journal	Critical Care Medicine
Volume	52
Issue number	2
Early online date	8 Nov 2023
DOIs	https://doi.org/10.1097/CCM.0000000000006100
Publication status	Published - 1 Feb 2024

Keywords

artificial intelligence
intensive care medicine
machine learning
reinforcement learning
sequential decision-making
systematic review

Access to Document

10.1097/CCM.0000000000006100

https://journals.lww.com/ccmjournal/abstract/2024/02000/does_reinforcement_learning_improve_outcomes_for.29.aspx

Cite this

Otten, M., Jagesar, A. R., Dam, T. A., Biesheuvel, L. A., den Hengst, F., Ziesemer, K. A., Thoral, P. J., de Grooth, H.-J., Girbes, A. R. J., François-Lavet, V., Hoogendoorn, M., & Elbers, P. W. G. (2024). Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment. Critical Care Medicine, 52(2), E79-E88. https://doi.org/10.1097/CCM.0000000000006100

@article{d3a23d6599ec42b9950eedc6a17575b6,

title = "Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment",

abstract = "OBJECTIVE: Reinforcement learning (RL) is a machine learning technique uniquely effective at sequential decision-making, which makes it potentially relevant to ICU treatment challenges. We set out to systematically review, assess level-of-readiness and meta-analyze the effect of RL on outcomes for critically ill patients. DATA SOURCES: A systematic search was performed in PubMed, Embase.com, Clarivate Analytics/Web of Science Core Collection, Elsevier/SCOPUS and the Institute of Electrical and Electronics Engineers Xplore Digital Library from inception to March 25, 2022, with subsequent citation tracking. DATA EXTRACTION: Journal articles that used an RL technique in an ICU population and reported on patient health-related outcomes were included for full analysis. Conference papers were included for level-of-readiness assessment only. Descriptive statistics, characteristics of the models, outcome compared with clinician's policy and level-of-readiness were collected. RL-health risk of bias and applicability assessment was performed. DATA SYNTHESIS: A total of 1,033 articles were screened, of which 18 journal articles and 18 conference papers were included. Thirty of those were prototyping or modeling articles and six were validation articles. All articles reported RL algorithms to outperform clinical decision-making by ICU professionals, but only in retrospective data. The modeling techniques for the state-space, action-space, reward function, RL model training, and evaluation varied widely. The risk of bias was high in all articles, mainly due to the evaluation procedure. CONCLUSION: In this first systematic review on the application of RL in intensive care medicine we found no studies that demonstrated improved patient outcomes from RL-based technologies. All studies reported that RL-agent policies outperformed clinician policies, but such assessments were all based on retrospective off-policy evaluation.",

keywords = "artificial intelligence, intensive care medicine, machine learning, reinforcement learning, sequential decision-making, systematic review",

author = "Martijn Otten and Jagesar, {Ameet R} and Dam, {Tariq A} and Biesheuvel, {Laurens A} and {den Hengst}, Floris and Ziesemer, {Kirsten A} and Thoral, {Patrick J} and {de Grooth}, Harm-Jan and Girbes, {Armand R J} and Vincent Fran{\c c}ois-Lavet and Mark Hoogendoorn and Elbers, {Paul W G}",

year = "2024",

month = feb,

day = "1",

doi = "10.1097/CCM.0000000000006100",

language = "English",

volume = "52",

pages = "E79--E88",

journal = "Critical Care Medicine",

issn = "0090-3493",

publisher = "Lippincott Williams and Wilkins",

number = "2",

}

Otten, M, Jagesar, AR, Dam, TA, Biesheuvel, LA, den Hengst, F, Ziesemer, KA, Thoral, PJ, de Grooth, H-J, Girbes, ARJ, François-Lavet, V, Hoogendoorn, M & Elbers, PWG 2024, 'Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment', Critical Care Medicine, vol. 52, no. 2, pp. E79-E88. https://doi.org/10.1097/CCM.0000000000006100

TY - JOUR

T1 - Does Reinforcement Learning Improve Outcomes for Critically Ill Patients?

T2 - A Systematic Review and Level-of-Readiness Assessment

AU - Otten, Martijn

AU - Jagesar, Ameet R

AU - Dam, Tariq A

AU - Biesheuvel, Laurens A

AU - den Hengst, Floris

AU - Ziesemer, Kirsten A

AU - Thoral, Patrick J

AU - de Grooth, Harm-Jan

AU - Girbes, Armand R J

AU - François-Lavet, Vincent

AU - Hoogendoorn, Mark

AU - Elbers, Paul W G

PY - 2024/2/1

Y1 - 2024/2/1

AB - OBJECTIVE: Reinforcement learning (RL) is a machine learning technique uniquely effective at sequential decision-making, which makes it potentially relevant to ICU treatment challenges. We set out to systematically review, assess level-of-readiness and meta-analyze the effect of RL on outcomes for critically ill patients. DATA SOURCES: A systematic search was performed in PubMed, Embase.com, Clarivate Analytics/Web of Science Core Collection, Elsevier/SCOPUS and the Institute of Electrical and Electronics Engineers Xplore Digital Library from inception to March 25, 2022, with subsequent citation tracking. DATA EXTRACTION: Journal articles that used an RL technique in an ICU population and reported on patient health-related outcomes were included for full analysis. Conference papers were included for level-of-readiness assessment only. Descriptive statistics, characteristics of the models, outcome compared with clinician's policy and level-of-readiness were collected. RL-health risk of bias and applicability assessment was performed. DATA SYNTHESIS: A total of 1,033 articles were screened, of which 18 journal articles and 18 conference papers were included. Thirty of those were prototyping or modeling articles and six were validation articles. All articles reported RL algorithms to outperform clinical decision-making by ICU professionals, but only in retrospective data. The modeling techniques for the state-space, action-space, reward function, RL model training, and evaluation varied widely. The risk of bias was high in all articles, mainly due to the evaluation procedure. CONCLUSION: In this first systematic review on the application of RL in intensive care medicine we found no studies that demonstrated improved patient outcomes from RL-based technologies. All studies reported that RL-agent policies outperformed clinician policies, but such assessments were all based on retrospective off-policy evaluation.

KW - artificial intelligence

KW - intensive care medicine

KW - machine learning

KW - reinforcement learning

KW - sequential decision-making

KW - systematic review

UR - http://www.scopus.com/inward/record.url?scp=85182028895&partnerID=8YFLogxK

U2 - 10.1097/CCM.0000000000006100

DO - 10.1097/CCM.0000000000006100

M3 - Article

C2 - 37938042

SN - 0090-3493

VL - 52

SP - E79

EP - E88

JO - Critical Care Medicine

JF - Critical Care Medicine

IS - 2

ER -