Multiple imputation in data that grow over time: a comparison of three strategies

X. M. Kavelaars, J. R. van Ginkel, S. van Buuren

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Multiple imputation is a recommended technique to deal with missing data. We study the problem where the investigator has already created imputations before the arrival of the next wave of data. The newly arriving data contain missing values that need to be imputed. The standard method (RE-IMPUTE) is to combine the new and old data before imputation, and re-impute all missing values in the combined data. We study the properties of two methods that impute the missing data in the new part only, thus preserving the historic imputations. Method NEST multiply imputes the new data conditional on each filled-in old data m2>1
times. Method APPEND is the special case of NEST with m2=1,
thus appending each filled-in data by single imputation. We found that NEST and APPEND have the same validity as RE-IMPUTE for monotone missing data-patterns. NEST and APPEND also work well when relations within waves are stronger than between waves and for moderate percentages of missing data. We do not recommend the use of NEST or APPEND when relations within time points are weak and when associations between time points are strong.
Original languageEnglish
Pages (from-to)513-523
Number of pages11
JournalMultivariate Behavioral Research
Volume57
Issue number2-3
DOIs
Publication statusPublished - 2022
Externally publishedYes

Fingerprint

Dive into the research topics of 'Multiple imputation in data that grow over time: a comparison of three strategies'. Together they form a unique fingerprint.

Cite this