A Weighted Mutual Information Biclustering Algorithm for Gene Expression Data

Yidong Li1, Wenhua Liu1, Yankun Jia1 and Hairong Dong2

  1. School of Computer and Information Technology
    Beijing Jiaotong University
  2. State Key Laboratory of Rail Traffic Control and Safety
    Beijing Jiaotong University

Abstract

Microarrays are one of the latest breakthroughs in experimental molecular biology, which have already provided huge amount of high dimensional genetic data. Traditional clustering methods are difficult to deal with this high dimensional data, whose a subset of genes are co-regulated under a subset of conditions. Biclustering algorithms are introduced to discover local characteristics of gene expression data. In this paper, we present a novel biclustering algorithm, which called Weighted Mutual Information Biclustering algorithm(WMIB) to discover this local characteristics of gene expression data. In our algorithm, we use the weighted mutual information as new similarity measure which can be simultaneously detect complex linear and nonlinear relationships between genes, and our algorithm proposes a new objective function to update weights of each bicluster, which can simultaneously select the conditions set of each bicluster using some rules.We have evaluated our algorithm on yeast gene expression data, the experimental results show that our algorithm can generate larger biclusters with lower mean square residues simultaneously.

Key words

biclustering, mutual information, gene expression data

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS170301021Y

Publication information

Volume 14, Issue 3 (September 2017)
Advances in Information Technology, Distributed and Model Driven Systems
Year of Publication: 2017
ISSN: 1820-0214 (Print) 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Li, Y., Liu, W., Jia, Y., Dong, H.: A Weighted Mutual Information Biclustering Algorithm for Gene Expression Data. Computer Science and Information Systems, Vol. 14, No. 3, 643–660. (2017)