TY - JOUR
T1 - Reproducibility of histological assessments of disease activity in UC
AU - Mosli, Mahmoud H.
AU - Feagan, Brian G.
AU - Zou, Guangyong
AU - Sandborn, William J.
AU - D'Haens, Geert
AU - Khanna, Reena
AU - Behling, Cynthia
AU - Kaplan, Keith
AU - Driman, David K.
AU - Shackelton, Lisa M.
AU - Baker, Kenneth A.
AU - Macdonald, John K.
AU - Vandervoort, Margaret K.
AU - Samaan, Mark A.
AU - Geboes, Karel
AU - Valasek, Mark A.
AU - Pai, Rish
AU - Langner, Cord
AU - Riddell, Robert
AU - Harpaz, Noam
AU - Sewitch, Maida
AU - Peterson, Michael
AU - Stitt, Larry W.
AU - Levesque, Barrett G.
PY - 2015
Y1 - 2015
N2 - Histopathology is potentially an important outcome measure in UC. Multiple histological disease activity (HA) indices, including the Geboes score (GS) and modified Riley score (MRS), have been developed; however, the operating properties of these instruments are not clearly defined. We assessed the reproducibility of existing measures of HA. Five experienced pathologists with GI pathology fellowship training and expertise in IBD evaluated, on three separate occasions at least two weeks apart, 49 UC colon biopsies and scored the GS, MRS and a global rating of histological severity using a 100 mm visual analogue scale (VAS). The reproducibility of each grading system and for individual instrument items was quantified by estimates of intraclass correlation coefficients (ICCs) based on two-way random effects models. Uncertainty of estimates was quantified by 95% two-sided CIs obtained using the non-parametric cluster bootstrap method. Biopsies responsible for the greatest disagreement based on the ICC estimates were identified. A consensus process was used to determine the most common sources of measurement disagreement. Recommendations for minimising disagreement were subsequently generated. Intrarater ICCs (95% CIs) for the total GS, MRS and VAS scores were 0.82 (0.73 to 0.88), 0.71 (0.63 to 0.80) and 0.79 (0.72 to 0.85), respectively. Corresponding inter-rater ICCs were substantially lower: 0.56 (0.39 to 0.67), 0.48 (0.35 to 0.66) and 0.61 (0.47 to 0.72). Correlation between the GS and VAS was 0.62 and between the MRS and VAS was 0.61. Although 'substantial' to 'almost perfect' ICCs for intrarater agreement were found in the assessment of HA in UC, ICCs for inter-rater agreement were considerably lower. According to the consensus process results, standardisation of item definitions and modification of the existing indices is required to create an optimal UC histological instrument
AB - Histopathology is potentially an important outcome measure in UC. Multiple histological disease activity (HA) indices, including the Geboes score (GS) and modified Riley score (MRS), have been developed; however, the operating properties of these instruments are not clearly defined. We assessed the reproducibility of existing measures of HA. Five experienced pathologists with GI pathology fellowship training and expertise in IBD evaluated, on three separate occasions at least two weeks apart, 49 UC colon biopsies and scored the GS, MRS and a global rating of histological severity using a 100 mm visual analogue scale (VAS). The reproducibility of each grading system and for individual instrument items was quantified by estimates of intraclass correlation coefficients (ICCs) based on two-way random effects models. Uncertainty of estimates was quantified by 95% two-sided CIs obtained using the non-parametric cluster bootstrap method. Biopsies responsible for the greatest disagreement based on the ICC estimates were identified. A consensus process was used to determine the most common sources of measurement disagreement. Recommendations for minimising disagreement were subsequently generated. Intrarater ICCs (95% CIs) for the total GS, MRS and VAS scores were 0.82 (0.73 to 0.88), 0.71 (0.63 to 0.80) and 0.79 (0.72 to 0.85), respectively. Corresponding inter-rater ICCs were substantially lower: 0.56 (0.39 to 0.67), 0.48 (0.35 to 0.66) and 0.61 (0.47 to 0.72). Correlation between the GS and VAS was 0.62 and between the MRS and VAS was 0.61. Although 'substantial' to 'almost perfect' ICCs for intrarater agreement were found in the assessment of HA in UC, ICCs for inter-rater agreement were considerably lower. According to the consensus process results, standardisation of item definitions and modification of the existing indices is required to create an optimal UC histological instrument
U2 - https://doi.org/10.1136/gutjnl-2014-307536
DO - https://doi.org/10.1136/gutjnl-2014-307536
M3 - Article
C2 - 25360036
SN - 0017-5749
VL - 64
SP - 1765
EP - 1773
JO - Gut
JF - Gut
IS - 11
ER -