Evaluation of software for multiple imputation of semi-continuous data

Yu, L.-M.; Burton, Andrea; Rivero-Arias, Oliver
June 2007
Statistical Methods in Medical Research;Jun2007, Vol. 16 Issue 3, p243
Academic Journal
journal article
It is now widely accepted that multiple imputation (MI) methods properly handle the uncertainty of missing data over single imputation methods. Several standard statistical software packages, such as SAS, R and STATA, have standard procedures or user-written programs to perform MI. The performance of these packages is generally acceptable for most types of data. However, it is unclear whether these applications are appropriate for imputing data with a large proportion of zero values resulting in a semi-continuous distribution. In addition, it is not clear whether the use of these applications is suitable when the distribution of the data needs to be preserved for subsequent analysis. This article reports the findings of a simulation study carried out to evaluate the performance of the MI procedures for handling semi-continuous data within these statistical packages. Complete resource use data on 1060 participants from a large randomized clinical trial were used as the simulation population from which 500 bootstrap samples were obtained and missing data imposed. The findings of this study showed differences in the performance of the MI programs when imputing semi-continuous data. Caution should be exercised when deciding which program should perform MI on this type of data.


Related Articles

  • Sensitivity analysis after multiple imputation under missing at random: a weighting approach. Carpenter, James R.; Kenward, Michael G.; White, Ian R. // Statistical Methods in Medical Research;Jun2007, Vol. 16 Issue 3, p259 

    Multiple imputation (MI) is now well established as a flexible, general, method for the analysis of data sets with missing values. Most implementations assume the missing data are ;missing at random' (MAR), that is, given the observed data, the reason for the missing data does not depend on the...

  • Alternative analyses for handling incomplete follow-up in the intention-to-treat analysis: the randomized controlled trial of balloon kyphoplasty versus non-surgical care for vertebral compression fracture (FREE).  // BMC Medical Research Methodology;2012, Vol. 12 Issue 1, p35 

    The article presents a randomized controlled trial of balloon kyphoplasty (BKP) versus non-surgical care for vertebral compression fracture (FREE). As stated, during the study percutaneous BKP was performed with introducer tools, inflatable bone tamps, and polymethylmethacrylate bone cement on...

  • Missing covariate data in clinical research: when and when not to use the missing-indicator method for analysis. Groenwold, Rolf H. H.; White, Ian R.; Donders, A. Rogier T.; Carpenter, James R.; Altman, Douglas G.; Moons, Karel G. M. // CMAJ: Canadian Medical Association Journal;8/7/2012, Vol. 184 Issue 11, p1265 

    The article presents a review of the missing-indicator method and illustrate its validity with the use of real data with incomplete covariates from randomized and nonrandomized studies. It states that the easiest way to handle missing covariate data is to omit from participants with any missing...

  • PERFORMANCE EVALUATION OF IMPUTATION METHODS FOR INCOMPLETE DATASETS. YENDURI, SUMANTH; IYENGAR, S. S. // International Journal of Software Engineering & Knowledge Engine;Feb2007, Vol. 17 Issue 1, p127 

    In this study, we compare the performance of four different imputation strategies ranging from the commonly used Listwise Deletion to model based approaches such as the Maximum Likelihood on enhancing completeness in incomplete software project data sets. We evaluate the impact of each of these...

  • Missing Data. Newgard, Craig D.; Lewis, Roger J. // JAMA: Journal of the American Medical Association;9/1/2015, Vol. 314 Issue 9, p940 

    The article looks at the possible imputation methodologies for handling missing data in a clinical study so as to determine the most valid results and to avoid introducing bias, focusing on an article by George L. Bakris and colleagues in the current issue reporting their randomized clinical...

  • Missing Data in Randomized Clinical Trials for Weight Loss: Scope of the Problem, State of the Field, and Performance of Statistical Methods. Elobeid, Mai A.; Padilla, Miguel A.; McVie, Theresa; Thomas, Olivia; Brock, David W.; Musser, Bret; Kaifeng Lu; Coffey, Christopher S.; Desmond, Renee A.; St-Onge, Marie-Pierre; Gadde, Kishore M.; Heymsfield, Steven B.; Allison, David B. // PLoS ONE;2009, Vol. 4 Issue 8, p1 

    Background: Dropouts and missing data are nearly-ubiquitous in obesity randomized controlled trails, threatening validity and generalizability of conclusions. Herein, we meta-analytically evaluate the extent of missing data, the frequency with which various analytic methods are employed to...

  • MISSING DATA IMPUTATION IN CARDIAC DATA SET (SURVIVAL PROGNOSIS). Kumar, R.Kavitha; Chadrasekar, R. M. // International Journal on Computer Science & Engineering;2010, Vol. 2 Issue 5, p1836 

    Treating missing value is very big task in the data preprocessing methods. Missing data are a potential source of bias when analyzing clinical trials. In this paper we analyze the performance of different data imputation methods in a task where the aim is to predict the probability of survival...

  • Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index? Simons, Claire; Rivero-Arias, Oliver; Yu, Ly-Mee; Simon, Judit // Quality of Life Research;Apr2015, Vol. 24 Issue 4, p805 

    Purpose: Missing data are a well-known and widely documented problem in cost-effectiveness analyses alongside clinical trials using individual patient-level data. Current methodological research recommends multiple imputation (MI) to deal with missing health outcome data, but there is little...

  • Can we afford to ignore missing data in cost-effectiveness analyses? Marshall, Andrea; Billingham, Lucinda; Bryan, Stirling // European Journal of Health Economics;Mar2009, Vol. 10 Issue 1, p1 

    The author reflects on the missing data in cost-effectiveness analyses (CEA). According to the author, failure to account in the CEA missing data may lead to a biased result for patients. The author suggests ways on handling missing data, such as imputation and case analysis. Lastly, the author...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics