UDC 004.423, DOI: 10.2298/csis0902165H

Microarray Missing Values Imputation Methods: Critical Analysis Review

Mou'ath Hourani1 and Ibrahiem M. M. El Emary2

  1. Faculty of Information Technology, Al Ahliyya Amman University
    Amman, Jordan
    Mouath.hourani@gmail.com
  2. Mouath.hourani@gmail.com
    Amman, Jordan
    omary57@hotmail.com

Abstract

Gene expression data often contain missing expression values. For the purpose of conducting an effective clustering analysis and since many algorithms for gene expression data analysis require a complete matrix of gene array values, choosing the most effective missing value estimation method is necessary. In this paper, the most commonly used imputation methods from literature are critically reviewed and analyzed to explain the proper use, weakness and point the observations on each published method. From the conducted analysis, we conclude that the Local Least Square (LLS) and Support Vector Regression (SVR) algorithms have achieved the best performances. SVR can be considered as a complement algorithm for LLS especially when applied to noisy data. However, both algorithms suffer from some deficiencies presented in choosing the value of Number of Selected Genes (K) and the appropriate kernel function. To overcome these drawbacks, the need for new method that automatically chooses the parameters of the function and it also has an appropriate computational complexity is imperative.

Key words

Completely at random (MCAR), Missing At Random (MAR), Sequential K-Nearest Neighbors (SKNN), Gene Ontology (GO), Singular Value Decomposition (SVD), Least Squares Imputation (LSI), Local Least Square Imputation (LLSI), Bayesian Principal Component Analysis (BPCA) and Fixed Rank Approximation Method (FRAA)

Digital Object Identifier (DOI)

https://doi.org/10.2298/csis0902165H

Publication information

Volume 6, Issue 2 (December 2009)
Year of Publication: 2009
ISSN: 1820-0214 (Print) 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Hourani, M., Emary, I. M. M. E.: Microarray Missing Values Imputation Methods: Critical Analysis Review. Computer Science and Information Systems, Vol. 6, No. 2, 165-190. (2009), https://doi.org/10.2298/csis0902165H