A general framework for biclustering gene expression data.

J Bioinform Comput Biol

Center of Excellence in Genomic Science, University of Southern California, Los Angeles, CA 90089, USA.

Published: August 2006

A large number of biclustering methods have been proposed to detect patterns in gene expression data. All these methods try to find some type of biclusters but no one can discover all the types of patterns in the data. Furthermore, researchers have to design new algorithms in order to find new types of biclusters/patterns that interest biologists. In this paper, we propose a novel approach for biclustering that, in general, can be used to discover all computable patterns in gene expression data. The method is based on the theory of Kolmogorov complexity. More precisely, we use Kolmogorov complexity to measure the randomness of submatrices as the merit of biclusters because randomness naturally consists in a lack of regularity, which is a common property of all types of patterns. On the basis of algorithmic probability measure, we develop a Markov Chain Monte Carlo algorithm to search for biclusters. Our method can also be easily extended to solve the problems of conventional clustering and checkerboard type biclustering. The preliminary experiments on simulated as well as real data show that our approach is very versatile and promising.

Download full-text PDF

Source
http://dx.doi.org/10.1142/s021972000600217xDOI Listing

Publication Analysis

Top Keywords

gene expression
12
expression data
12
patterns gene
8
types patterns
8
kolmogorov complexity
8
data
5
general framework
4
biclustering
4
framework biclustering
4
biclustering gene
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!