Multiple imputation for handling missing outcome data in randomized trials involving a mixture of independent and paired data

dc.contributor.authorSullivan, T.R.
dc.contributor.authorYelland, L.N.
dc.contributor.authorMoreno-Betancur, M.
dc.contributor.authorLee, K.J.
dc.date.issued2021
dc.descriptionAccepted: 31 July 2021
dc.description.abstractRandomized trials involving independent and paired observations occur in many areas of health research, for example in paediatrics, where studies can include infants from both single and twin births. Multiple imputation (MI) is often used to address missing outcome data in randomized trials, yet its performance in trials with independent and paired observations, where design effects can be less than or greater than one, remains to be explored. Using simulated data and through application to a trial dataset, we investigated the performance of different methods of MI for a continuous or binary outcome when followed by analysis using generalized estimating equations to account for clustering due to the pairs. We found that imputing data separately for independent and paired data, with paired data imputed in wide format, was the best performing MI method, producing unbiased point and standard error estimates for the treatment effect throughout. Ignoring clustering in the imputation model performed well in settings where the design effect due to the inclusion of paired data was close to one, but otherwise led to moderately biased variance estimates. Including a random cluster effect in the imputation model led to slightly biased point estimates for binary outcome data and variance estimates that were too small in some settings. Based on these results, we recommend researchers impute independent and paired data separately where feasible to do so. The exception is if the design effect due to the inclusion of paired data is close to one, where ignoring clustering may be appropriate.
dc.description.statementofresponsibilityThomas R. Sullivan, Lisa N. Yelland, Margarita Moreno-Betancur, Katherine J. Lee
dc.identifier.citationStatistics in Medicine, 2021; 40(27):6008-6020
dc.identifier.doi10.1002/sim.9166
dc.identifier.issn0277-6715
dc.identifier.issn1097-0258
dc.identifier.orcidSullivan, T.R. [0000-0002-6930-5406]
dc.identifier.orcidYelland, L.N. [0000-0003-3803-8728]
dc.identifier.urihttps://hdl.handle.net/2440/132404
dc.language.isoen
dc.publisherWiley
dc.relation.granthttp://purl.org/au-research/grants/nhmrc/1166023
dc.relation.granthttp://purl.org/au-research/grants/nhmrc/1171422
dc.relation.granthttp://purl.org/au-research/grants/nhmrc/1173576
dc.rights© 2021 John Wiley & Sons Ltd.
dc.source.urihttps://doi.org/10.1002/sim.9166
dc.subjectClinical trials; clustered data; missing outcome data; multiple imputation
dc.titleMultiple imputation for handling missing outcome data in randomized trials involving a mixture of independent and paired data
dc.typeJournal article
pubs.publication-statusPublished

Files