A graph-theoretic modeling on GO space for biological interpretation of gene clusters.

Bioinformatics

Bioinformatics Unit, ISTECH Inc., No 704, Hyundai Town Vill 848-1, Janghang-dong, Ilsan-gu, Goyang city, Gyunggido, 411-380, Republic of Korea.

Published: February 2004

Motivation: With the advent of DNA microarray technologies, the parallel quantification of genome-wide transcription has offered a great opportunity to understand complicated biological phenomena systematically. In the analysis of the resulting gene expression data, clustering methods have been useful tools for uncovering meaningful hidden patterns. These mathematical techniques, however, are based entirely on numerical expression data and do not provide biologically relevant information about the clustering results.

Results: We present a novel methodology for the biological interpretation of gene clusters. Our graph-theoretic algorithm extracts the common biological attributes of the genes within a cluster or a group of interest by means of a modified structure of the Gene Ontology (GO) called the GO tree. After genes are annotated with GO terms, the hierarchical nature of those terms is used to find the representative biological meanings of the gene clusters. In addition, the biological significance of gene clusters can be assessed quantitatively by defining a distance function on the GO tree. Our approach complements many statistical clustering techniques: by using a biological ontology, it views clustering problems from a different angle. We applied this algorithm to a well-known data set and obtained the biological features of the gene clusters, together with a quantitative biological assessment of clustering quality, through GO Biological Process.
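The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, under stated assumptions: the GO tree is a strict tree stored as a child-to-parent map, each gene carries a single GO term, the representative meaning of a cluster is taken to be the deepest term shared by all its annotations, and the distance between two terms is the number of edges on the path joining them. The term names, the `PARENT` map, and all function names are hypothetical and are not taken from the paper.

```python
from itertools import combinations

# Hypothetical toy GO tree: child -> parent (the root maps to None).
PARENT = {
    "biological_process": None,
    "metabolism": "biological_process",
    "cell_cycle": "biological_process",
    "dna_metabolism": "metabolism",
    "dna_replication": "dna_metabolism",
    "dna_repair": "dna_metabolism",
    "mitosis": "cell_cycle",
}


def path_to_root(term):
    """Return the list of terms from `term` up to the root, inclusive."""
    path = []
    while term is not None:
        path.append(term)
        term = PARENT[term]
    return path


def common_ancestor(terms):
    """Return the deepest term that lies on the root path of every input term."""
    paths = [path_to_root(t) for t in terms]
    shared = set(paths[0]).intersection(*paths[1:])
    # Walking upward from the first term, the first shared node is the deepest.
    return next(t for t in paths[0] if t in shared)


def tree_distance(a, b):
    """Number of edges between two terms, passing through their common ancestor."""
    lca = common_ancestor([a, b])
    return path_to_root(a).index(lca) + path_to_root(b).index(lca)


def mean_pairwise_distance(terms):
    """Average tree distance over all pairs; smaller values mean a tighter cluster."""
    pairs = list(combinations(terms, 2))
    return sum(tree_distance(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    cluster = ["dna_replication", "dna_repair", "mitosis"]
    print(common_ancestor(cluster))
    print(mean_pairwise_distance(cluster))
```

In this sketch a cluster whose genes all map near one another in the tree gets a specific common ancestor and a small mean distance, whereas a biologically mixed cluster collapses to a general term near the root. The real Gene Ontology is a directed acyclic graph in which terms can have several parents, which is presumably why the abstract speaks of a modified structure; the sketch sidesteps that by assuming a tree from the start.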

Source
http://dx.doi.org/10.1093/bioinformatics/btg420

Publication Analysis

Top Keywords

gene clusters: 20
biological: 10
biological interpretation: 8
interpretation gene: 8
expression data: 8
gene: 7
clusters: 5
clustering: 5
graph-theoretic modeling: 4
modeling space: 4

Similar Publications

The gastrointestinal (GI) microbiota plays a crucial role in host health and disease in dogs, but knowledge of the mucosa-associated microbiota along the canine GI tract is limited. Therefore, the objective of this study was to characterize the phylogeny and predicted functional capacity of microbiota residing on the gut mucosa across five GI regions of healthy young adult and geriatric dogs fed different diets. Twelve weanling (8 weeks old) and 12 senior (11.

Integrated proteogenomic characterization of ampullary adenocarcinoma.

Cell Discov

January 2025

Center for Cell and Gene Therapy, Clinical Research Center for Cell-based Immunotherapy, Shanghai Pudong Hospital, State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, 200433, China.

Ampullary adenocarcinoma (AMPAC) is a rare and heterogeneous malignancy. Here we performed a comprehensive proteogenomic analysis of 198 samples from Chinese AMPAC patients and duodenum patients. Genomic data illustrate that 4q loss causes fatty acid accumulation and cell proliferation.

Groups of orthologous genes are commonly found together on the same chromosome over vast evolutionary distances. This extensive physical gene linkage, known as macrosynteny, is seen between bilaterian phyla as divergent as Chordata, Echinodermata, Mollusca, and Nemertea. Here, we report a unique pattern of genome evolution in Bryozoa, an understudied phylum of colonial invertebrates.

Sugarcane Pan-Transcriptome Identifying a Master Gene Regulating Lignin and Sugar Traits.

J Agric Food Chem

January 2025

State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China.

Sugarcane has the most complex polyploid genome in the world, and sugar-related traits are one of the most important aims in sugarcane breeding. It is essential to construct a representative pan-transcriptome that contains all transcripts of a species for studies on genetic diversity, population expression, and omics analyses in sugarcane. In this study, we constructed the first comprehensive pan-transcriptome for sugarcane, and 8434 highly reliable open reading frames were found, which were not aligned with any published sugarcane genome.

Background: Muscle-invasive bladder cancer (MIBC) is a prevalent cancer characterized by molecular and clinical heterogeneity. Assessing the spatial heterogeneity of the MIBC microenvironment is crucial to understand its clinical significance.

Methods: In this study, we used imaging mass cytometry (IMC) to assess the spatial heterogeneity of MIBC microenvironment across 185 regions of interest in 40 tissue samples.
