Gene expression profiling has been widely used to study molecular signatures of many diseases and to develop molecular diagnostics for disease prediction. Gene selection, as an important step for improved diagnostics, screens tens of thousands of genes and identifies a small subset that discriminates between disease types. A two-step gene selection method is proposed to identify informative gene subsets for accurate classification of multiclass phenotypes. In the first step, individually discriminatory genes (IDGs) are identified by using one-dimensional weighted Fisher criterion (wFC). In the second step, jointly discriminatory genes (JDGs) are selected by sequential search methods, based on their joint class separability measured by multidimensional weighted Fisher criterion (wFC). The performance of the selected gene subsets for multiclass prediction is evaluated by artificial neural networks (ANNs) and/or support vector machines (SVMs). By applying the proposed IDG/JDG approach to two microarray studies, that is, small round blue cell tumors (SRBCTs) and muscular dystrophies (MDs), we successfully identified a much smaller yet efficient set of JDGs for diagnosing SRBCTs and MDs with high prediction accuracies (96.9% for SRBCTs and 92.3% for MDs, resp.). These experimental results demonstrated that the two-step gene selection method is able to identify a subset of highly discriminative genes for improved multiclass prediction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171347PMC
http://dx.doi.org/10.1155/2007/64628DOI Listing

Publication Analysis

Top Keywords

gene selection
16
multiclass prediction
12
weighted fisher
12
fisher criterion
12
two-step gene
8
selection method
8
gene subsets
8
discriminatory genes
8
criterion wfc
8
gene
7

Similar Publications

Cellular and gene therapy (CGT) products have emerged as a popular approach in regenerative medicine, showing promise in treating various pancreatic and liver diseases in numerous clinical trials. Before these therapies can be tested in human clinical trials, it is essential to evaluate their safety and efficacy in relevant animal models. Such preclinical testing is often required to obtain regulatory approval for investigational new drugs.

View Article and Find Full Text PDF

Cancer-associated fibroblasts (CAFs) significantly influence tumor progression and therapeutic resistance in colorectal cancer (CRC). However, the distributions and functions of CAF subpopulations vary across the four consensus molecular subtypes (CMSs) of CRC. This study performed single-cell RNA and bulk RNA sequencing and revealed that myofibroblast-like CAFs (myCAFs), tumor-like CAFs (tCAFs), inflammatory CAFs (iCAFs), CXCL14CAFs, and MTCAFs are notably enriched in CMS4 compared with other CMSs of CRC.

View Article and Find Full Text PDF

Hotspots of genetic change in Yersinia pestis.

Nat Commun

January 2025

State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing, China.

The relative contributions of mutation rate variation, selection, and recombination in shaping genomic variation in bacterial populations remain poorly understood. Here we analyze 3318 Yersinia pestis genomes, spanning nearly a century and including 2336 newly sequenced strains, to shed light on the patterns of genetic diversity and variation distribution at the population level. We identify 45 genomic regions ("hot regions", HRs) that, although comprising a minor fraction of the genome, are hotbeds of genetic variation.

View Article and Find Full Text PDF

Fowl typhoid (FT) poses a significant threat to the poultry industry and can cause substantial economic losses, especially in developing regions. Caused by Salmonella Gallinarum (SG), vaccination can prevent FT. However, existing vaccines, like the SG9R strain, have limitations, including residual virulence and potential reversion of pathogenicity.

View Article and Find Full Text PDF

Background: Messenger RNA 3' untranslated regions (3'UTRs) control many aspects of gene expression and determine where the transcript will terminate. The polyadenylation signal (PAS) AAUAAA (AATAAA in DNA) is a key regulator of transcript termination and this hexamer, or a similar sequence, is very frequently found within 30 bp of 3'UTR ends. Short interspersed element (SINE) retrotransposons are found throughout genomes in high copy numbers.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!