Self-Contained Statistical Analysis of Gene Sets.

PLoS One

College of Engineering, Northern New Mexico College, Española, New Mexico, United States of America.

Published: June 2017

Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistical probability of compiled gene sets rather than individual genes. The gene sets typically are organized around a biological theme or pathway. We compute correlations between different gene set tests and elect to use Fisher's self-contained method for gene set analysis. We improve Fisher's differential expression analysis of a gene set by limiting the p-value of an individual gene within the gene set to prevent a small percentage of genes from determining the statistical significance of the entire set. In addition, we also compute dependencies among genes within the set to determine which genes are statistically linked. The method is applied to T-ALL (T-lineage Acute Lymphoblastic Leukemia) to identify differentially expressed gene sets between T-ALL and normal patients and T-ALL and AML (Acute Myeloid Leukemia) patients.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5053608PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0163918PLOS

Publication Analysis

Top Keywords

gene sets
16
gene set
16
gene
10
analysis gene
8
differentially expressed
8
set
6
genes
5
self-contained statistical
4
statistical analysis
4
sets
4

Similar Publications

A cross-tissue transcriptome-wide association study identifies new susceptibility genes for benign prostatic hyperplasia.

Sci Rep

January 2025

Department of Urology, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou, 730030, People's Republic of China.

Benign prostatic hyperplasia (BPH) is a prevalent urinary system disorder. Despite evidence of a significant genetic component from previous studies, the specific pathogenic genes and biological mechanisms are still largely unknown. The study utilized the FinnGen R10 dataset, encompassing 177,901 individuals (36,601 cases and 141,300 controls), and the GTEx v8 EQTLs files to conduct single-tissue and cross-tissue transcriptome-wide association studies (TWAS).

View Article and Find Full Text PDF

Discovery of Novel Diagnostic Biomarkers for Common Pathogenic Through Pan-Genome and Comparative Genome Analysis, with Preliminary Validation.

Pathogens

January 2025

Department of Clinical Laboratory, Beijing Chest Hospital, Beijing Tuberculosis and Thoracic Tumor Institute, Capital Medical University, Beijing 101100, China.

The aim of this study was to reveal diagnostic biomarkers of considerable importance for common pathogenic , utilizing pan-genomic and comparative genome analysis to accurately characterize clinical infections. In this study, complete or assembled genome sequences of common pathogenic and closely related species were obtained from NCBI as discovery and validation sets, respectively. Genome annotation was performed using Prokka software, and pan-genomic analysis and extraction of core genes were performed using BPGA software.

View Article and Find Full Text PDF

Fatigue Life Prediction of FRP-Strengthened Reinforced Concrete Beams Based on Soft Computing Techniques.

Materials (Basel)

January 2025

Department of Civil Engineering, School of Mechanics and Engineering Science, Shanghai University, Shanghai 200444, China.

This paper establishes fatigue life prediction models using the soft computing method to address insufficient parameter consideration and limited computational accuracy in predicting the fatigue life of fiber-reinforced polymer (FRP) strengthened concrete beams. Five different input forms were proposed by collecting 117 sets of fatigue test data of FRP-strengthened concrete beams from the existing literature and integrating the outcomes from Pearson correlation analysis and significance testing. Using Gene Expression Programming (GEP), the effects of various input configurations on the accuracy of model predictions were examined.

View Article and Find Full Text PDF

Immunoglobulin G4-related disease (IgG4-RD) is an immune-mediated, fibroinflammatory, multiorgan disease with an obscure pathogenesis. Findings indicating excessive platelet activation have been reported in systemic sclerosis, which is another autoimmune, multisystemic fibrotic disorder. The immune-mediated, inflammatory, and fibrosing intersections of IgG4-RD and systemic sclerosis raised a question about platelets' role in IgG4-RD.

View Article and Find Full Text PDF

Interspecific hybridization between relative species (with a diploid genome designated as TT), (EE) and (NN) and the successive polyploidization with transitions from sexuality to asexuality experienced by triploid hybrids likely influence their chromosomal rearrangements, including rearrangements of ribosomal DNA (rDNA) distribution patterns. Previously, we documented distinct karyotypic differences: exhibited bi-armed chromosomes while showed uni-armed chromosomes with rDNA-positive hybridization signals, respectively. In this study, fluorescence in situ hybridization (FISH) with rDNA and rDNA probes was used to analyze and compare chromosomal distribution patterns of rDNAs in clonally reproduced triploid hybrids of different genomic constitutions ETT, ETN, EEN and EET (referred to using acronyms denoting the haploid genomes of their parent species), and their parental species.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!