Statistical robustness of randomized controlled trials in high-impact journals has improved but was low across medical specialties

Jasper M. Kampman; Oren Turgman; Nicolaas H. Sperna Weiland; Markus W. Hollmann; Sjoerd Repping; Jeroen Hermanides

doi:10.1016/j.jclinepi.2022.07.001

Research output: Contribution to journal › Article › Academic › peer-review

5 Citations (Scopus)

Abstract

Objectives: To determine whether the statistical fragility of randomized controlled trials (RCTs) in high-impact journals has improved in the last decade and to perform an umbrella review of all published data on the Fragility Index (FI) across medical specialties.

Study Design and Setting: The FI was calculated for all eligible RCTs published from 2014 to 2021 in the New England Journal of Medicine, The Lancet, the Journal of the American Medical Association, the British Medical Journal, and the Annals of Internal Medicine. Trials reporting dichotomous, statistically significant, superiority results were eligible. All previously published systematic reviews on the FI were included in the umbrella review and analyzed by medical (sub)specialty.

Results: Of 2,544 screened RCTs, 643 were eligible for the FI analysis. These had a median sample size of 625 (interquartile range [IQR]: 265–2,056), a median FI of 12 (IQR: 3–28), and a median Fragility Quotient of 0.015 (IQR: 0.004–0.045). This is an improvement compared with the median FI of 8 (IQR: 3–18) of RCTs published a decade earlier in the same five journals (P < 0.001). The umbrella review included 57 publications across 15 different medical specialties, with between 10 and 692 RCTs per specialty. The median FI ranged between two and four for all disciplines.

Conclusion: In the last decade, the median statistical robustness of RCTs published in high-impact journals has improved, yet the unchanged lower bound of the interquartile range reveals that statistical significance in 25% of trials still depends on three or fewer events. The umbrella review revealed that statistical fragility is prevalent across all medical specialties. The FI is an easy-to-understand metric that can be used to supplement reported P values and help readers look beyond merely reaching statistical significance.
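The abstract does not spell out how the FI is computed. The sketch below assumes the commonly used definition: the smallest number of participants in the arm with fewer events whose outcome must switch from non-event to event before a two-sided Fisher exact test is no longer significant at 0.05. The Fragility Quotient is assumed to be the FI divided by the total sample size. The function names, the 0.05 threshold, and the example counts are illustrative assumptions and are not taken from the paper.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x events in the first row.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more likely than the observed one.
    total = sum(p for p in (prob(x) for x in range(lo, hi + 1))
                if p <= p_obs * (1 + 1e-7))
    return min(1.0, total)


def fragility_index(events_a, n_a, events_b, n_b, alpha=0.05):
    """Number of non-event to event flips needed to lose significance.

    Returns None when the starting result is not significant, or when
    significance is never lost.
    """
    # Assumption: flips are applied to the arm with fewer events.
    if events_a > events_b:
        events_a, n_a, events_b, n_b = events_b, n_b, events_a, n_a

    def p_value(e):
        return fisher_exact_two_sided(e, n_a - e, events_b, n_b - events_b)

    if p_value(events_a) >= alpha:
        return None
    for flips in range(1, n_a - events_a + 1):
        if p_value(events_a + flips) >= alpha:
            return flips
    return None


def fragility_quotient(fi, n_a, n_b):
    """FI divided by the total sample size (assumed definition)."""
    return fi / (n_a + n_b)


# Hypothetical trial: 0/10 events in one arm, 10/10 in the other.
fi = fragility_index(0, 10, 10, 10)
print(fi, fragility_quotient(fi, 10, 10))  # prints: 6 0.3
```

Read this way, a median FI of 12 means that in half of the analyzed trials, changing the outcome of 12 or fewer participants would have removed statistical significance.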

Original language	English
Pages (from-to)	165-170
Number of pages	6
Journal	Journal of Clinical Epidemiology
Volume	150
DOIs	https://doi.org/10.1016/j.jclinepi.2022.07.001
Publication status	Published - 1 Oct 2022

Keywords

Fragility index
P value
Randomized controlled trials
Research design
Research reporting
Statistical significance

Access to Document

https://doi.org/10.1016/j.jclinepi.2022.07.001

Cite this

@article{1ef1bfe19067455f99b1e27d408b5654,

title = "Statistical robustness of randomized controlled trials in high-impact journals has improved but was low across medical specialties",

abstract = "Objectives: To determine whether the statistical fragility of randomized controlled trials (RCTs) in high-impact journals has improved in the last decade and to perform an umbrella review of all published data on the Fragility Index (FI) across medical specialties. Study Design and Setting: The FI was calculated for all eligible RCTs published from 2014–2021 in the New England Journal of Medicine, The Lancet, the Journal of the American Medical Association, the British Medical Journal, and the Annals of Internal Medicine. Trials reporting dichotomous, statistically significant, superiority results were eligible. All previously published systematic reviews on the FI were included in the umbrella review and analyzed by medical (sub) specialty. Results: Of 2,544 screened RCTs, 643 were eligible for the FI analysis. These had a median sample size of 625 (interquartile range [IQR]: 265–2,056), a median FI of 12 (IQR: 3–28), and a median Fragility Quotient of 0.015 (IQR: 0.004–0.045). This is an improvement compared with the median FI of 8 (IQR: 3–18) of RCTs published a decade earlier in the same five journals (P < 0.001). The umbrella review included 57 publications across 15 different medical specialties, with a total of between 10 and 692 RCTs for each specialty. The median FI ranged between two and four for all disciplines. Conclusion: In the last decade, the median statistical robustness of RCTs published in high-impact journals has improved, yet the unchanged lower bound of the interquartile range reveals that statistical significance in 25\% of trials is still dependent on three or less events. The umbrella review revealed that statistical fragility is prevalent across all medical specialties. The FI is an easy-to-understand metric that can be used to supplement reported P values and help readers look beyond merely reaching statistical significance.",

keywords = "Fragility index, P value, Randomized controlled trials, Research design, Research reporting, Statistical significance",

author = "Kampman, {Jasper M.} and Oren Turgman and {Sperna Weiland}, {Nicolaas H.} and Hollmann, {Markus W.} and Sjoerd Repping and Jeroen Hermanides",

note = "Funding Information: The lead author, J.M.K., is funded for his working hours by the Amsterdam University Fund (reference \#3030). Publisher Copyright: {\textcopyright} 2022 The Authors",

year = "2022",

month = oct,

day = "1",

doi = "10.1016/j.jclinepi.2022.07.001",

language = "English",

volume = "150",

pages = "165--170",

journal = "Journal of Clinical Epidemiology",

issn = "0895-4356",

publisher = "Elsevier USA",

}

TY - JOUR

T1 - Statistical robustness of randomized controlled trials in high-impact journals has improved but was low across medical specialties

AU - Kampman, Jasper M.

AU - Turgman, Oren

AU - Sperna Weiland, Nicolaas H.

AU - Hollmann, Markus W.

AU - Repping, Sjoerd

AU - Hermanides, Jeroen

PY - 2022/10/1

Y1 - 2022/10/1

N2 - Objectives: To determine whether the statistical fragility of randomized controlled trials (RCTs) in high-impact journals has improved in the last decade and to perform an umbrella review of all published data on the Fragility Index (FI) across medical specialties. Study Design and Setting: The FI was calculated for all eligible RCTs published from 2014–2021 in the New England Journal of Medicine, The Lancet, the Journal of the American Medical Association, the British Medical Journal, and the Annals of Internal Medicine. Trials reporting dichotomous, statistically significant, superiority results were eligible. All previously published systematic reviews on the FI were included in the umbrella review and analyzed by medical (sub) specialty. Results: Of 2,544 screened RCTs, 643 were eligible for the FI analysis. These had a median sample size of 625 (interquartile range [IQR]: 265–2,056), a median FI of 12 (IQR: 3–28), and a median Fragility Quotient of 0.015 (IQR: 0.004–0.045). This is an improvement compared with the median FI of 8 (IQR: 3–18) of RCTs published a decade earlier in the same five journals (P < 0.001). The umbrella review included 57 publications across 15 different medical specialties, with a total of between 10 and 692 RCTs for each specialty. The median FI ranged between two and four for all disciplines. Conclusion: In the last decade, the median statistical robustness of RCTs published in high-impact journals has improved, yet the unchanged lower bound of the interquartile range reveals that statistical significance in 25% of trials is still dependent on three or less events. The umbrella review revealed that statistical fragility is prevalent across all medical specialties. The FI is an easy-to-understand metric that can be used to supplement reported P values and help readers look beyond merely reaching statistical significance.

KW - Fragility index

KW - P value

KW - Randomized controlled trials

KW - Research design

KW - Research reporting

KW - Statistical significance

UR - http://www.scopus.com/inward/record.url?scp=85135886374&partnerID=8YFLogxK

U2 - 10.1016/j.jclinepi.2022.07.001

DO - 10.1016/j.jclinepi.2022.07.001

M3 - Article

C2 - 35820586

SN - 0895-4356

VL - 150

SP - 165

EP - 170

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

ER -
