TY - JOUR
T1 - Reporting diagnostic accuracy studies: some improvements after 10 years of STARD
AU - Korevaar, Daniël A.
AU - Wang, Junfeng
AU - van Enst, W. Annefloor
AU - Leeflang, Mariska M.
AU - Hooft, Lotty
AU - Smidt, Nynke
AU - Bossuyt, Patrick M. M.
PY - 2015
Y1 - 2015
N2 - To evaluate how diagnostic accuracy study reports published in 2012 adhered to the Standards for Reporting of Diagnostic Accuracy (STARD) statement and whether there were any differences in reporting compared with 2000 and 2004. PubMed was searched for studies published in 12 high-impact-factor journals in 2012 that evaluated the accuracy of one or more diagnostic tests against a clinical reference standard. Two independent reviewers scored reporting completeness of each article with the 25-item STARD checklist. Mixed-effects modeling was used to analyze differences in reporting with previous evaluations from articles published in 2000 and 2004. Included were 112 articles. The overall mean number of STARD items reported in 2012 was 15.3 ± 3.9 (standard deviation; range, 6.0-23.5). There was an improvement of 3.4 items (95% confidence interval: 2.6, 4.3) compared with studies published in 2000, and an improvement of 1.7 items (95% confidence interval: 0.9, 2.5) compared with studies published in 2004. Significantly more items were reported for single-gate studies compared with multiple-gate studies (16.8 vs 12.1, respectively; P < .001) and for studies that evaluated imaging tests compared with laboratory tests and other types of tests (17.0 vs 14.0 vs 14.5, respectively; P < .001). Completeness of reporting improved in the 10 years after the launch of STARD, but it remains suboptimal for many articles. Reporting of inclusion criteria and sampling methods for recruiting patients, information about blinding, and confidence intervals for accuracy estimates are in need of further improvement
AB - To evaluate how diagnostic accuracy study reports published in 2012 adhered to the Standards for Reporting of Diagnostic Accuracy (STARD) statement and whether there were any differences in reporting compared with 2000 and 2004. PubMed was searched for studies published in 12 high-impact-factor journals in 2012 that evaluated the accuracy of one or more diagnostic tests against a clinical reference standard. Two independent reviewers scored reporting completeness of each article with the 25-item STARD checklist. Mixed-effects modeling was used to analyze differences in reporting with previous evaluations from articles published in 2000 and 2004. Included were 112 articles. The overall mean number of STARD items reported in 2012 was 15.3 ± 3.9 (standard deviation; range, 6.0-23.5). There was an improvement of 3.4 items (95% confidence interval: 2.6, 4.3) compared with studies published in 2000, and an improvement of 1.7 items (95% confidence interval: 0.9, 2.5) compared with studies published in 2004. Significantly more items were reported for single-gate studies compared with multiple-gate studies (16.8 vs 12.1, respectively; P < .001) and for studies that evaluated imaging tests compared with laboratory tests and other types of tests (17.0 vs 14.0 vs 14.5, respectively; P < .001). Completeness of reporting improved in the 10 years after the launch of STARD, but it remains suboptimal for many articles. Reporting of inclusion criteria and sampling methods for recruiting patients, information about blinding, and confidence intervals for accuracy estimates are in need of further improvement
U2 - https://doi.org/10.1148/radiol.14141160
DO - https://doi.org/10.1148/radiol.14141160
M3 - Article
C2 - 25350641
SN - 0033-8419
VL - 274
SP - 781
EP - 789
JO - Radiology
JF - Radiology
IS - 3
ER -