TY - JOUR
T1 - Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database
AU - Norman, Christopher R.
AU - Gargon, Elizabeth
AU - Leeflang, Mariska M. G.
AU - Névéol, Aurélie
AU - Williamson, Paula R.
PY - 2019
Y1 - 2019
N2 - Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.
AB - Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85074624230&origin=inward
UR - https://www.ncbi.nlm.nih.gov/pubmed/31697361
U2 - https://doi.org/10.1093/database/baz109
DO - https://doi.org/10.1093/database/baz109
M3 - Article
C2 - 31697361
SN - 1758-0463
VL - 2019
JO - Database
JF - Database
ER -