Background: There is increasing evidence that a gene's location and its surrounding genes influence its function in the eukaryotic genome. The Gene Ontology Slim terms associated with a gene give insight into its function by describing how its gene product behaves in a cellular context under three ontologies: molecular function, biological process, and cellular component. In this study, we examined whether classification learning could assign a gene in Saccharomyces cerevisiae to its correct Gene Ontology Slim term using information about the gene's location in the genome and information from its nearest-neighbouring genes.

Results: We performed experiments to establish that the MultiBoostAB algorithm with the J48 classifier could correctly classify a gene's Gene Ontology Slim terms when trained on the gene's location and on information from its nearest-neighbouring genes. We examined different neighbourhood sizes to determine how many nearest neighbours should be included around each gene to improve the classification rules. Our results show that incorporating information from just each gene's two nearest neighbours brings the percentage of genes classified to their correct Gene Ontology Slim term above 80% for each ontology, and that the classification rules produced are highly accurate (F-measures over 0.80).
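
The feature construction and the F-measure mentioned above can be illustrated with a minimal sketch. It is not the authors' pipeline: the gene names, coordinates, and labels are invented for illustration, the `Gene` record, the `neighbour_features` and `f_measure` helpers, and the `per_side` parameter are hypothetical, and neighbours are taken by genomic order on the same chromosome, which is an assumption about how "nearest" is defined. The resulting rows could then be passed to any classifier (the study used MultiBoostAB with J48).

```python
from collections import defaultdict
from typing import NamedTuple


class Gene(NamedTuple):
    name: str        # hypothetical identifier
    chromosome: str
    start: int       # start coordinate on the chromosome
    go_slim: str     # GO Slim term for one ontology (illustrative label)


MISSING = "none"  # placeholder when no neighbour exists on that side


def neighbour_features(genes, per_side=1):
    """Map each gene name to (features, target).

    Features are the chromosome, the start coordinate, and the GO Slim
    terms of the `per_side` nearest genes on each side (by genomic order,
    an assumption). The gene's own term is kept only as the target.
    """
    by_chrom = defaultdict(list)
    for gene in genes:
        by_chrom[gene.chromosome].append(gene)

    rows = {}
    for chrom, members in by_chrom.items():
        members.sort(key=lambda g: g.start)
        for i, gene in enumerate(members):
            feats = [chrom, gene.start]
            for offset in range(1, per_side + 1):
                left = members[i - offset].go_slim if i - offset >= 0 else MISSING
                right = members[i + offset].go_slim if i + offset < len(members) else MISSING
                feats.extend([left, right])
            rows[gene.name] = (feats, gene.go_slim)
    return rows


def f_measure(true_labels, predicted, positive):
    """F-measure (harmonic mean of precision and recall) for one class."""
    pairs = list(zip(true_labels, predicted))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Invented example data, not taken from the study.
genes = [
    Gene("geneA", "chrI", 100, "transport"),
    Gene("geneB", "chrI", 500, "translation"),
    Gene("geneC", "chrI", 900, "transport"),
    Gene("geneD", "chrII", 200, "DNA repair"),
]
rows = neighbour_features(genes, per_side=1)
print(rows["geneB"])
```

With `per_side=1`, each gene gets one neighbour on each side, which is one possible reading of the two-nearest-neighbour setting; larger values widen the neighbourhood in the same way.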

Conclusions: We confirmed that including neighbour information is beneficial when classifying genes to their correct Gene Ontology Slim term. Knowing a gene's location and the Gene Ontology Slim information of its neighbouring genes gives insight into that gene's function. This benefit appears even when only a gene's two nearest neighbouring genes are included.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2890565
DOI: http://dx.doi.org/10.1186/1471-2164-11-340

Publication Analysis

Top Keywords

gene ontology: 28
ontology slim: 28
correct gene: 16
slim term: 16
gene: 14
genes correct: 12
neighbouring genes: 12
genes: 10
classifying genes: 8
ontology: 8

Similar Publications

Molecular and functional convergences associated with complex multicellularity in Eukarya.

Mol Biol Evol

January 2025

Laboratório de Algoritmos em Biologia, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Brazil.

A key trait of Eukarya is the independent evolution of complex multicellularity (CM) in animals, plants, fungi, brown algae and red algae. This phenotype is characterized by the initial exaptation of cell-cell adhesion genes followed by the emergence of mechanisms for cell-cell communication, together with the expansion of transcription factor gene families responsible for cell and tissue identity. The number of cell types (NCT) is commonly used as a quantitative proxy for biological complexity in comparative genomics studies.

Spondyloarthritis is a prevalent and persistent condition that significantly impacts the quality of life. Its intricate pathological mechanisms have led to a scarcity of animal models capable of replicating the disease progression in humans, making it a prominent area of research interest in the field. To delve into the pathological and physiological traits of spontaneous non-human primate spondyloarthritis, this study meticulously examined the disease features of this natural disease model through an array of techniques including X-ray imaging, MRI imaging, blood biochemistry, markers of bone metabolism, transcriptomics, proteomics, and metabolomics.

Screening of biomarkers for diagnosing chronic kidney disease and heart failure with preserved ejection fraction through bioinformatics analysis.

Biochem Biophys Rep

March 2025

Department of Cardiovascular Medicine, The First People's Hospital of Changzhou, The Third Affiliated Hospital of Soochow University, 213000, Changzhou, Jiangsu Province, China.

Background: Previous research has established that chronic kidney disease (CKD) and heart failure with preserved ejection fraction (HFpEF) often coexist. Although we have a preliminary understanding of the potential correlation between HFpEF and CKD, the underlying pathophysiological mechanisms remain unclear. This study aimed to elucidate the molecular mechanisms associated with CKD and HFpEF through bioinformatics analysis.

Problem: Oxidative stress (OS) plays a key role in the pathogenesis of gestational diabetes mellitus (GDM), but this role is not well understood. We aimed to investigate the biomarkers and underlying mechanisms of OS-related genes in GDM.

Method Of Study: The GSE103552 and GSE70493 datasets of GDM were acquired from the Gene Expression Omnibus (GEO) database.

Alzheimer's disease (AD), a progressive neurodegenerative disorder, is frequently associated with musculoskeletal complications, including sarcopenia and osteoporosis, which substantially impair patient quality of life. Despite these clinical observations, the molecular mechanisms linking AD to bone loss remain insufficiently explored. In this study, we examined the femoral bone microarchitecture and transcriptomic profiles of APP/PS1 transgenic mouse models of AD to elucidate the disease's impact on bone pathology and identify potential gene candidates associated with bone deterioration.
