Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database

Christopher R. Norman; Elizabeth Gargon; Mariska M. G. Leeflang; Aurélie Névéol; Paula R. Williamson

doi:https://doi.org/10.1093/database/baz109

Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database

Christopher R. Norman, Elizabeth Gargon, Mariska M. G. Leeflang, Aurélie Névéol, Paula R. Williamson

Research output: Contribution to journal › Article › Academic › peer-review

6 Citations (Scopus)

Abstract

Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.

Original language	English
Journal	Database
Volume	2019
DOIs	https://doi.org/10.1093/database/baz109
Publication status	Published - 2019

Access to Document

https://doi.org/10.1093/database/baz109

Cite this

@article{f78f826d4104470fa282193e0a88f253,

title = "Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database",

abstract = "Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.",

author = "Norman, {Christopher R.} and Elizabeth Gargon and Leeflang, {Mariska M. G.} and Aur{\'e}lie N{\'e}v{\'e}ol and Williamson, {Paula R.}",

year = "2019",

doi = "https://doi.org/10.1093/database/baz109",

language = "English",

volume = "2019",

journal = "Database",

issn = "1758-0463",

publisher = "Oxford University Press",

}

TY - JOUR

T1 - Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database

AU - Norman, Christopher R.

AU - Gargon, Elizabeth

AU - Leeflang, Mariska M. G.

AU - Névéol, Aurélie

AU - Williamson, Paula R.

PY - 2019

Y1 - 2019

N2 - Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.

AB - Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85074624230&origin=inward

UR - https://www.ncbi.nlm.nih.gov/pubmed/31697361

U2 - https://doi.org/10.1093/database/baz109

DO - https://doi.org/10.1093/database/baz109

M3 - Article

C2 - 31697361

SN - 1758-0463

VL - 2019

JO - Database

JF - Database

ER -

Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database

Abstract

Access to Document

Other files and links

Cite this