Multiple imputation for handling missing outcome data in randomized trials involving a mixture of independent and paired data

Sullivan, T.R.; Yelland, L.N.; Moreno-Betancur, M.; Lee, K.J.

doi:10.1002/sim.9166

Multiple imputation for handling missing outcome data in randomized trials involving a mixture of independent and paired data

dc.contributor.author	Sullivan, T.R.
dc.contributor.author	Yelland, L.N.
dc.contributor.author	Moreno-Betancur, M.
dc.contributor.author	Lee, K.J.
dc.date.issued	2021
dc.description	Accepted: 31 July 2021
dc.description.abstract	Randomized trials involving independent and paired observations occur in many areas of health research, for example in paediatrics, where studies can include infants from both single and twin births. Multiple imputation (MI) is often used to address missing outcome data in randomized trials, yet its performance in trials with independent and paired observations, where design effects can be less than or greater than one, remains to be explored. Using simulated data and through application to a trial dataset, we investigated the performance of different methods of MI for a continuous or binary outcome when followed by analysis using generalized estimating equations to account for clustering due to the pairs. We found that imputing data separately for independent and paired data, with paired data imputed in wide format, was the best performing MI method, producing unbiased point and standard error estimates for the treatment effect throughout. Ignoring clustering in the imputation model performed well in settings where the design effect due to the inclusion of paired data was close to one, but otherwise led to moderately biased variance estimates. Including a random cluster effect in the imputation model led to slightly biased point estimates for binary outcome data and variance estimates that were too small in some settings. Based on these results, we recommend researchers impute independent and paired data separately where feasible to do so. The exception is if the design effect due to the inclusion of paired data is close to one, where ignoring clustering may be appropriate.
dc.description.statementofresponsibility	Thomas R. Sullivan, Lisa N. Yelland, Margarita Moreno-Betancur, Katherine J. Lee
dc.identifier.citation	Statistics in Medicine, 2021; 40(27):6008-6020
dc.identifier.doi	10.1002/sim.9166
dc.identifier.issn	0277-6715
dc.identifier.issn	1097-0258
dc.identifier.orcid	Sullivan, T.R. [0000-0002-6930-5406]
dc.identifier.orcid	Yelland, L.N. [0000-0003-3803-8728]
dc.identifier.uri	https://hdl.handle.net/2440/132404
dc.language.iso	en
dc.publisher	Wiley
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1166023
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1171422
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1173576
dc.rights	© 2021 John Wiley & Sons Ltd.
dc.source.uri	https://doi.org/10.1002/sim.9166
dc.subject	Clinical trials; clustered data; missing outcome data; multiple imputation
dc.title	Multiple imputation for handling missing outcome data in randomized trials involving a mixture of independent and paired data
dc.type	Journal article
pubs.publication-status	Published

Collections

Public Health publications

Multiple imputation for handling missing outcome data in randomized trials involving a mixture of independent and paired data

Files

Collections