Missing value imputation for gene expression data: computational techniques to recover missing data from available information

A. W.-C. Liew, N.-F. Law, H. Yan
2010 Briefings in Bioinformatics  
Microarray gene expression data generally suffers from missing value problem due to a variety of experimental reasons. Since the missing data points can adversely affect downstream analysis, many algorithms have been proposed to impute missing values. In this survey, we provide a comprehensive review of existing missing value imputation algorithms, focusing on their underlying algorithmic techniques and how they utilize local or global information from within the data, or their use of domain
more » ... ir use of domain knowledge during imputation. In addition, we describe how the imputation results can be validated and the different ways to assess the performance of different imputation algorithms, as well as a discussion on some possible future research directions. It is hoped that this review will give the readers a good understanding of the current development in this field and inspire them to come up with the next generation of imputation algorithms.
doi:10.1093/bib/bbq080 pmid:21156727 fatcat:rhpy27by3rdgzld5qg553jbfqu