Estimating the sample mean and standard deviation from commonly reported quantiles in meta-analysis

the DEPRESsion Screening Data (DEPRESSD) Collaboration

Research output: Contribution to journalArticleAcademicpeer-review

374 Citations (Scopus)

Abstract

Researchers increasingly use meta-analysis to synthesize the results of several studies in order to estimate a common effect. When the outcome variable is continuous, standard meta-analytic approaches assume that the primary studies report the sample mean and standard deviation of the outcome. However, when the outcome is skewed, authors sometimes summarize the data by reporting the sample median and one or both of (i) the minimum and maximum values and (ii) the first and third quartiles, but do not report the mean or standard deviation. To include these studies in meta-analysis, several methods have been developed to estimate the sample mean and standard deviation from the reported summary data. A major limitation of these widely used methods is that they assume that the outcome distribution is normal, which is unlikely to be tenable for studies reporting medians. We propose two novel approaches to estimate the sample mean and standard deviation when data are suspected to be non-normal. Our simulation results and empirical assessments show that the proposed methods often perform better than the existing methods when applied to non-normal data.

Original languageEnglish
Pages (from-to)2520-2537
Number of pages18
JournalStatistical methods in medical research
Volume29
Issue number9
DOIs
Publication statusPublished - 1 Sept 2020

Keywords

  • Meta-analysis
  • first quartile
  • maximum value
  • median
  • minimum value
  • third quartile

Cite this