GeneMCL in microarray analysis.

Comput Biol Chem

School of Animal and Microbial Sciences, University of Reading, Whiteknights, Reading RG6 6AJ, UK.

Published: October 2005

Accurately and reliably identifying the actual number of clusters present with a dataset of gene expression profiles, when no additional information on cluster structure is available, is a problem addressed by few algorithms. GeneMCL transforms microarray analysis data into a graph consisting of nodes connected by edges, where the nodes represent genes, and the edges represent the similarity in expression of those genes, as given by a proximity measurement. This measurement is taken to be the Pearson correlation coefficient combined with a local non-linear rescaling step. The resulting graph is input to the Markov Cluster (MCL) algorithm, which is an elegant, deterministic, non-specific and scalable method, which models stochastic flow through the graph. The algorithm is inherently affected by any cluster structure present, and rapidly decomposes a graph into cohesive clusters. The potential of the GeneMCL algorithm is demonstrated with a 5,730 gene subset (IGS) of the Van't Veer breast cancer database, for which the clusterings are shown to reflect underlying biological mechanisms.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2005.07.002DOI Listing

Publication Analysis

Top Keywords

microarray analysis
8
cluster structure
8
genemcl microarray
4
analysis accurately
4
accurately reliably
4
reliably identifying
4
identifying actual
4
actual number
4
number clusters
4
clusters dataset
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!