Background: The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneously. It discovers subsets of genes that are co-expressed in certain samples. Recent studies showed that biclustering has a great potential in detecting marker genes that are associated with certain tissues or diseases. Several biclustering algorithms have been proposed. However, it is still a challenge to find biclusters that are significant based on biological validation measures. Besides that, there is a need for a biclustering algorithm that is capable of analyzing very large datasets in reasonable time.

Results: Here we present a fast biclustering algorithm called DeBi (Differentially Expressed BIclusters). The algorithm is based on a well known data mining approach called frequent itemset. It discovers maximum size homogeneous biclusters in which each gene is strongly associated with a subset of samples. We evaluate the performance of DeBi on a yeast dataset, on synthetic datasets and on human datasets.

Conclusions: We demonstrate that the DeBi algorithm provides functionally more coherent gene sets compared to standard clustering or biclustering algorithms using biological validation measures such as Gene Ontology term and Transcription Factor Binding Site enrichment. We show that DeBi is a computationally efficient and powerful tool in analyzing large datasets. The method is also applicable on multiple gene expression datasets coming from different labs or platforms.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152888PMC
http://dx.doi.org/10.1186/1748-7188-6-18DOI Listing

Publication Analysis

Top Keywords

differentially expressed
8
expressed biclusters
8
frequent itemset
8
biclustering algorithms
8
biological validation
8
validation measures
8
biclustering algorithm
8
analyzing large
8
large datasets
8
biclustering
6

Similar Publications

Identification and characterization of a novel QTL for barley yellow mosaic disease resistance from bulbous barley.

Plant Genome

March 2025

Key Laboratory of Plant Functional Genomics of the Ministry of Education/Jiangsu Key Laboratory of Crop Genomics and Molecular Breeding/Jiangsu Co-Innovation Center for Modern Production Technology of Grain Crops/Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou, China.

Winter barley (Hordeum vulgare) production areas in the middle and lower reaches of the Yangtze River are severely threatened by barley yellow mosaic disease, which is caused by Barley yellow mosaic virus and Barley mild mosaic virus. Improving barley disease resistance in breeding programs requires knowledge of genetic loci in germplasm resources. In this study, bulked segregant analysis (BSA) identified a novel major quantitative trait loci (QTL) QRym.

View Article and Find Full Text PDF

Regulation of protein production in response to physiological signals is achieved through precise control of Eukaryotic Elongation Factor 2 (eEF2), whose distinct translocase function is crucial for cell survival. Phosphorylation of eEF2 at its Thr56 (T56) residue inactivates this function in translation. Using genetically modified paralogue of a colon cancer cell line, HCT116 which carries a point mutation at Ser595-to-Alanine in the eEF2 gene we were able to create a constitutively active form of eEF2.

View Article and Find Full Text PDF

Hypertension, a major cause of cardiomyopathy, is one of the most critical risk factors for heart failure and mortality worldwide. Loss of metabolic flexibility of cardiomyocytes is one of the major causes of heart failure. Although Catestatin (CST) treatment is known to be both hypotensive and cardioprotective, its effect on cardiac metabolism is unknown.

View Article and Find Full Text PDF

Unlabelled: -methyladenosine (m A) is the most prevalent cellular mRNA modification and plays a critical role in regulating RNA stability, localization, and gene expression. m A modification plays a vital role in modulating the expression of viral and cellular genes during HIV-1 infection. HIV-1 infection increases cellular RNA m A levels in many cell types, which facilitates HIV-1 replication and infectivity in target cells.

View Article and Find Full Text PDF

There is growing interest to investigate classic psychedelics as potential therapeutics for mental illnesses. Previous studies have demonstrated that one dose of psilocybin leads to persisting neural and behavioral changes. The durability of psilocybin's effects suggests that there are likely alterations of gene expression at the transcriptional level.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!