Improving data sharing in research with context-free encoded missing data

Marieke P. Hoevenaar-Blom; Juliette Guillemont; Tiia Ngandu; Cathrien R. L. Beishuizen; Nicola Coley; Eric P. Moll van Charante; Sandrine Andrieu; Miia Kivipelto; Hilkka Soininen; Carol Brayne; Yannick Meiller; Edo Richard

doi:https://doi.org/10.1371/journal.pone.0182362

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Lack of attention to missing data in research may result in biased results, loss of power and reduced generalizability. Registering reasons for missing values at the time of data collection, or (in the case of sharing existing data) before making data available to other teams, can save time and effort, improve scientific value and help to prevent erroneous assumptions and biased results. To ensure that encoding of missing data is sufficient to understand the reason why data are missing, it should ideally be context-free. Therefore, 11 context-free codes of missing data were carefully designed based on three completed randomized controlled clinical trials and tested in a new randomized controlled clinical trial by an international team consisting of clinical researchers and epidemiologists with extensive experience in designing and conducting trials and an Information System expert. These codes can be divided into missing due to participant and/or participation characteristics (n = 6), missing by design (n = 4), and missing due to a procedural error (n = 1). Broad implementation of context-free missing data encoding may enhance the possibilities of data sharing and pooling, thus allowing more powerful analyses using existing data.
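As an illustration of the idea in the abstract, the following is a minimal Python sketch of how reason-bearing missing-data codes might be stored and summarized in a shared dataset. Only the three category names come from the abstract. The numeric codes (-91, -92, -93), the `HYPOTHETICAL_CODES` mapping, and the function names are assumptions made for this example; they are not the 11 codes defined in the article.

```python
from collections import Counter
from enum import Enum


class MissingCategory(Enum):
    """The three groups of missing-data reasons named in the abstract."""
    PARTICIPANT = "participant and/or participation characteristics"
    BY_DESIGN = "missing by design"
    PROCEDURAL_ERROR = "procedural error"


# Hypothetical placeholder codes: these values and their mapping are invented
# for illustration and are NOT the 11 codes defined in the article.
HYPOTHETICAL_CODES = {
    -91: MissingCategory.PARTICIPANT,
    -92: MissingCategory.BY_DESIGN,
    -93: MissingCategory.PROCEDURAL_ERROR,
}


def missing_reason(value):
    """Return the category if value is a missing-data code, else None."""
    return HYPOTHETICAL_CODES.get(value)


def summarize_missing(values):
    """Count how many entries fall into each missing-data category."""
    reasons = (missing_reason(v) for v in values)
    return Counter(r for r in reasons if r is not None)


def observed_only(values):
    """Drop coded missing entries and keep the observed measurements."""
    return [v for v in values if missing_reason(v) is None]


if __name__ == "__main__":
    # Made-up measurements with coded missing entries mixed in.
    systolic_bp = [128, -91, 135, -92, -92, 141, -93]
    for category, count in summarize_missing(systolic_bp).items():
        print(f"{category.value}: {count}")
    print("observed:", observed_only(systolic_bp))
```

Because each code carries its own reason, a team receiving the data could in principle separate values that are missing by design from values missing for participant-related reasons without consulting the original study documentation, which is the sense of "context-free" used in the abstract.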

Original language	English
Pages (from-to)	e0182362
Journal	PLOS ONE
Volume	12
Issue number	9
DOIs	https://doi.org/10.1371/journal.pone.0182362
Publication status	Published - 2017

Cite this

@article{d16ca1ec0dd34255a451453255654e45,

title = "Improving data sharing in research with context-free encoded missing data",

abstract = "Lack of attention to missing data in research may result in biased results, loss of power and reduced generalizability. Registering reasons for missing values at the time of data collection, or (in the case of sharing existing data) before making data available to other teams, can save time and effort, improve scientific value and help to prevent erroneous assumptions and biased results. To ensure that encoding of missing data is sufficient to understand the reason why data are missing, it should ideally be context-free. Therefore, 11 context-free codes of missing data were carefully designed based on three completed randomized controlled clinical trials and tested in a new randomized controlled clinical trial by an international team consisting of clinical researchers and epidemiologists with extensive experience in designing and conducting trials and an Information System expert. These codes can be divided into missing due to participant and/or participation characteristics (n = 6), missing by design (n = 4), and missing due to a procedural error (n = 1). Broad implementation of context-free missing data encoding may enhance the possibilities of data sharing and pooling, thus allowing more powerful analyses using existing data.",

author = "Hoevenaar-Blom, {Marieke P.} and Juliette Guillemont and Tiia Ngandu and Beishuizen, {Cathrien R. L.} and Nicola Coley and {Moll van Charante}, {Eric P.} and Sandrine Andrieu and Miia Kivipelto and Hilkka Soininen and Carol Brayne and Yannick Meiller and Edo Richard",

year = "2017",

doi = "10.1371/journal.pone.0182362",

language = "English",

volume = "12",

pages = "e0182362",

journal = "PLOS ONE",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "9",

}

TY - JOUR

T1 - Improving data sharing in research with context-free encoded missing data

AU - Hoevenaar-Blom, Marieke P.

AU - Guillemont, Juliette

AU - Ngandu, Tiia

AU - Beishuizen, Cathrien R. L.

AU - Coley, Nicola

AU - Moll van Charante, Eric P.

AU - Andrieu, Sandrine

AU - Kivipelto, Miia

AU - Soininen, Hilkka

AU - Brayne, Carol

AU - Meiller, Yannick

AU - Richard, Edo

PY - 2017

Y1 - 2017

N2 - Lack of attention to missing data in research may result in biased results, loss of power and reduced generalizability. Registering reasons for missing values at the time of data collection, or (in the case of sharing existing data) before making data available to other teams, can save time and effort, improve scientific value and help to prevent erroneous assumptions and biased results. To ensure that encoding of missing data is sufficient to understand the reason why data are missing, it should ideally be context-free. Therefore, 11 context-free codes of missing data were carefully designed based on three completed randomized controlled clinical trials and tested in a new randomized controlled clinical trial by an international team consisting of clinical researchers and epidemiologists with extensive experience in designing and conducting trials and an Information System expert. These codes can be divided into missing due to participant and/or participation characteristics (n = 6), missing by design (n = 4), and missing due to a procedural error (n = 1). Broad implementation of context-free missing data encoding may enhance the possibilities of data sharing and pooling, thus allowing more powerful analyses using existing data.

AB - Lack of attention to missing data in research may result in biased results, loss of power and reduced generalizability. Registering reasons for missing values at the time of data collection, or (in the case of sharing existing data) before making data available to other teams, can save time and effort, improve scientific value and help to prevent erroneous assumptions and biased results. To ensure that encoding of missing data is sufficient to understand the reason why data are missing, it should ideally be context-free. Therefore, 11 context-free codes of missing data were carefully designed based on three completed randomized controlled clinical trials and tested in a new randomized controlled clinical trial by an international team consisting of clinical researchers and epidemiologists with extensive experience in designing and conducting trials and an Information System expert. These codes can be divided into missing due to participant and/or participation characteristics (n = 6), missing by design (n = 4), and missing due to a procedural error (n = 1). Broad implementation of context-free missing data encoding may enhance the possibilities of data sharing and pooling, thus allowing more powerful analyses using existing data.

U2 - https://doi.org/10.1371/journal.pone.0182362

DO - 10.1371/journal.pone.0182362

M3 - Article

C2 - 28898245

SN - 1932-6203

VL - 12

SP - e0182362

JO - PLOS ONE

JF - PLOS ONE

IS - 9

ER -