TY - JOUR
T1 - Reproducibility and Prognostic Performance of the 1973 and 2004 World Health Organization Classifications for Grade in Non-muscle-invasive Bladder Cancer: A Multicenter Study in 328 Bladder Tumors.
AU - Bosschieter, J
AU - Hentschel, AE
AU - Savci - Heijink, CD
AU - van der Voorn, Patrick J.J.P.
AU - Rozendaal, L
AU - Vis, AN
AU - Rhijn, B.W.G.
AU - Lissenberg-Witte, BI
AU - van de Putte, E. E.Fransen
AU - van Moorselaar, RJA
AU - Nieuwenhuijzen, JA
PY - 2018/10
Y1 - 2018/10
N2 - Two classifications for bladder cancer grade are widely used; the World Health Organization (WHO) 1973 and the WHO 2004. We evaluated inter-observer variability of both classifications and investigated which histologic criteria cause this variability. We found that reproducibility of both classifications is poor, as well as scoring of the individual histologic criteria. This suggests that descriptions of these criteria for grade are not specific enough. Background: Histologic grade is an important prognosticator in patients with non–muscle-invasive bladder cancer (NMIBC). Currently, 2 classifications for grade are widely used; the World Health Organization (WHO) 1973 and the WHO 2004. We compare inter-observer variability of both classifications and investigate which histologic criteria cause this variability. Furthermore, the prognostic value of both classifications was assessed. Patients and Methods: Three pathologists reviewed 328 bladder tissue samples of 232 patients with NMIBC in a blinded manner. WHO 1973 grade, WHO 2004 grade, histologic criteria of both classifications, and T-category were evaluated. Reproducibility was analyzed using the weighted Fleiss κ, association between criteria scores and grade with the χ2 test, and time-to-recurrence and time-to-progression with the log-rank test and Cox regression. Results: Reproducibility of both classifications was poor. The WHO 2004 showed better reproducibility (κ = 0.35; 95% confidence interval (CI), 0.29-0.42) compared with the WHO 1973 as a 3-tiered (κ = 0.24; 95% CI, 0.19-0.28), but not as a 2-tiered (G1 + G2 vs. G3) classification (κ = 0.36; 95% CI, 0.29-0.42). Reproducibility of individual criteria was poor (κ range, −0.05 to 0.25). All criteria were associated with grade (P <.05). After a median follow-up of 60 months, 33 of 232 and 112 of 232 patients developed progression and recurrence, respectively. In 1 out of the 3 pathologists, progression was predicted by both the WHO 1973 grade and the WHO 2004 grade in multivariable analysis. Recurrence was not predicted by grade (multivariable). Conclusions: Reproducibility of the WHO 2004 and WHO 1973 classification for grade are poor. Scoring of individual criteria is poorly reproducible, suggesting that descriptions of these criteria for grade are not specific. The prognostic value of both the WHO 1973 and the WHO 2004 differ per pathologist.
AB - Two classifications for bladder cancer grade are widely used; the World Health Organization (WHO) 1973 and the WHO 2004. We evaluated inter-observer variability of both classifications and investigated which histologic criteria cause this variability. We found that reproducibility of both classifications is poor, as well as scoring of the individual histologic criteria. This suggests that descriptions of these criteria for grade are not specific enough. Background: Histologic grade is an important prognosticator in patients with non–muscle-invasive bladder cancer (NMIBC). Currently, 2 classifications for grade are widely used; the World Health Organization (WHO) 1973 and the WHO 2004. We compare inter-observer variability of both classifications and investigate which histologic criteria cause this variability. Furthermore, the prognostic value of both classifications was assessed. Patients and Methods: Three pathologists reviewed 328 bladder tissue samples of 232 patients with NMIBC in a blinded manner. WHO 1973 grade, WHO 2004 grade, histologic criteria of both classifications, and T-category were evaluated. Reproducibility was analyzed using the weighted Fleiss κ, association between criteria scores and grade with the χ2 test, and time-to-recurrence and time-to-progression with the log-rank test and Cox regression. Results: Reproducibility of both classifications was poor. The WHO 2004 showed better reproducibility (κ = 0.35; 95% confidence interval (CI), 0.29-0.42) compared with the WHO 1973 as a 3-tiered (κ = 0.24; 95% CI, 0.19-0.28), but not as a 2-tiered (G1 + G2 vs. G3) classification (κ = 0.36; 95% CI, 0.29-0.42). Reproducibility of individual criteria was poor (κ range, −0.05 to 0.25). All criteria were associated with grade (P <.05). After a median follow-up of 60 months, 33 of 232 and 112 of 232 patients developed progression and recurrence, respectively. In 1 out of the 3 pathologists, progression was predicted by both the WHO 1973 grade and the WHO 2004 grade in multivariable analysis. Recurrence was not predicted by grade (multivariable). Conclusions: Reproducibility of the WHO 2004 and WHO 1973 classification for grade are poor. Scoring of individual criteria is poorly reproducible, suggesting that descriptions of these criteria for grade are not specific. The prognostic value of both the WHO 1973 and the WHO 2004 differ per pathologist.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85048583100&origin=inward
UR - https://www.ncbi.nlm.nih.gov/pubmed/29884516
U2 - https://doi.org/10.1016/j.clgc.2018.05.002
DO - https://doi.org/10.1016/j.clgc.2018.05.002
M3 - Article
C2 - 29884516
SN - 1558-7673
VL - 16
SP - e985-e992
JO - Clinical genitourinary cancer
JF - Clinical genitourinary cancer
IS - 5
ER -