CoGTEx: Unscaled system-level coexpression estimation from GTEx data forecast novel functional gene partners.

PLoS One

Tecnologico de Monterrey, Escuela de Medicina, Bioinformática, Monterrey, Nuevo León, México.

Published: October 2024

Motivation: Coexpression estimations are helpful for the analysis of pathways, cofactors, regulators, targets, and human health and disease. Ideally, coexpression estimations should consider as many diverse cell types as possible and account for the fact that available data are not uniform across tissues. Importantly, the coexpression estimations accessible today are performed at the "tissue level", which is based on formulations standardized by cell type. Little or no attention is paid to overall gene expression levels. The tissue-level estimation assumes that the variance in expression levels is more important than the mean expression levels. Here, we challenge this assumption by estimating coexpression at the "system level", that is, without standardization by tissue, and show that it provides valuable information. We have made available a resource to view, download, and analyze both tissue- and system-level coexpression estimations from GTEx human data.
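To make the distinction concrete, the following is a minimal sketch on synthetic data, not the authors' pipeline. It assumes two hypothetical genes that are both high in tissue "A" and low in tissue "B", with independent noise inside each tissue. The helper name `zscore_within_groups` and all numbers are illustrative assumptions.

```python
import numpy as np

def zscore_within_groups(values, groups):
    """Standardize each gene (column) separately inside every tissue group."""
    scaled = np.empty_like(values, dtype=float)
    for g in np.unique(groups):
        mask = groups == g
        block = values[mask]
        scaled[mask] = (block - block.mean(axis=0)) / block.std(axis=0)
    return scaled

rng = np.random.default_rng(0)
n = 200  # samples per tissue (hypothetical)
tissue = np.repeat(np.array(["A", "B"]), n)

# Hypothetical toy data: both genes share the same tissue mean
# (high in A, low in B) but their within-tissue noise is independent.
means = np.where(tissue == "A", 100.0, 10.0)
expr = np.column_stack([
    means + rng.normal(0, 5, 2 * n),
    means + rng.normal(0, 5, 2 * n),
])

# System level: correlate the unscaled values across all samples.
system_r = np.corrcoef(expr[:, 0], expr[:, 1])[0, 1]

# Tissue level: z-score within each tissue first, then correlate.
tissue_r = np.corrcoef(*zscore_within_groups(expr, tissue).T)[0, 1]

print(f"system-level r = {system_r:.2f}, tissue-level r = {tissue_r:.2f}")
```

In this toy setting the unscaled correlation is driven almost entirely by the shared difference in mean expression between tissues, while the within-tissue standardized correlation is close to zero. That shared mean signal is the kind of information a tissue-level estimate discards.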

Methods: GTEx v8 expression data were globally normalized, batch-processed, and filtered. Then, stringent PCA, clustering, and t-SNE procedures were applied to generate 42 distinct and curated tissue clusters. Coexpression was estimated from these 42 tissue clusters by computing the correlation of 33,445 genes, sampling 70 samples per tissue cluster to avoid tissue overrepresentation. This process was repeated 20 times, and the minimum value was taken as a robust estimate. Three metrics were calculated (Pearson, Spearman, and G-statistic) in two data processing modes: the system level (TPM scale) and the tissue level (z-score scale).
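The balanced resampling step can be sketched as follows. This is a simplified illustration under stated assumptions, not the published implementation. It uses only Pearson correlation, takes the elementwise minimum of the signed correlation across repetitions (the abstract does not say whether signed or absolute values were used), and runs on small synthetic data. The function name `robust_coexpression` and the toy arrays are hypothetical.

```python
import numpy as np

def robust_coexpression(expr, clusters, per_cluster=70, repeats=20, seed=0):
    """Elementwise minimum Pearson correlation over balanced subsamples.

    expr:     array of shape (samples, genes)
    clusters: array of shape (samples,) with one cluster label per sample
    Every cluster must contain at least `per_cluster` samples, otherwise
    sampling without replacement raises a ValueError.
    """
    rng = np.random.default_rng(seed)
    labels = np.unique(clusters)
    result = None
    for _ in range(repeats):
        # Draw the same number of samples from every tissue cluster.
        picked = np.concatenate([
            rng.choice(np.flatnonzero(clusters == c),
                       size=per_cluster, replace=False)
            for c in labels
        ])
        # Gene-by-gene correlation matrix for this balanced subsample.
        corr = np.corrcoef(expr[picked], rowvar=False)
        result = corr if result is None else np.minimum(result, corr)
    return result

# Hypothetical toy data: 3 unequal clusters, 5 genes, gene 1 tracks gene 0.
rng = np.random.default_rng(1)
toy_clusters = np.repeat(np.array([0, 1, 2]), [100, 300, 80])
toy_expr = rng.normal(size=(toy_clusters.size, 5))
toy_expr[:, 1] = toy_expr[:, 0] + 0.1 * rng.normal(size=toy_clusters.size)

coexp = robust_coexpression(toy_expr, toy_clusters)
print(coexp.shape)
```

Drawing a fixed number of samples per cluster keeps a heavily sampled tissue from dominating the estimate, and taking the minimum across repetitions is a conservative choice: a gene pair scores highly only if it does so in every subsample.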

Results: We first validate our tissue-level estimations by comparison with other databases. Then, through specific analyses of several examples and literature validation of predictions, we show that system-level coexpression estimation differs from tissue-level estimations and that both contain valuable information reflected in biological pathways. We also show that coexpression estimations are associated with transcriptional regulation. Finally, we present CoGTEx, a valuable web resource to list, view, and explore coexpressed genes in human adult tissues from GTEx v8 data.

Conclusion: We conclude that system-level coexpression is a novel and interesting coexpression metric capable of generating plausible predictions and biological hypotheses, and that CoGTEx is a valuable resource to view, compare, and download system- and tissue-level coexpression estimations from GTEx data.

Availability: The web resource is available at http://bioinformatics.mx/cogtex.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11451983
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0309961

Similar Publications

Polyploidization-driven transcriptomic dynamics in Medicago sativa neotetraploids: mRNA, smRNA and allele-specific gene expression.

BMC Plant Biol

January 2025

Department of Agricultural, Food and Environmental Sciences, University of Perugia, via Borgo XX giugno 74, Perugia, 06121, Italy.

Whole genome duplication (WGD) is a powerful evolutionary mechanism in plants. Autopolyploids have been comparatively less studied than allopolyploids, with sexual autopolyploidization receiving even less attention. In this work, we studied the transcriptomes of neotetraploids (2n = 4x = 32) obtained by crossing two diploid (2n = 2x = 16) plants of Medicago sativa that produce a significant percentage of either 2n eggs or pollen.

Exploration of metastasis-related signatures in osteosarcoma based on tumor microenvironment by integrated bioinformatic analysis.

Heliyon

January 2025

Center for Plastic & Reconstructive Surgery, Department of Orthopedics, Zhejiang Provincial People's Hospital (Affiliated People's Hospital, Hangzhou Medical College), Hangzhou, Zhejiang, 310014, China.

Background: The present study aims to explore metastasis-related signatures in connection with the tumor microenvironment (TME), revealing new molecular targets that are promising for improving the outcomes of osteosarcoma (OS) patients.

Methods: The high-throughput sequencing data were downloaded from the TARGET database and analyzed with the ESTIMATE algorithm. Metastasis-related information was obtained from the GSE21257 dataset.

Background: DNA methylation (DNAm) has been shown in multiple studies to be associated with the estimated glomerular filtration rate (eGFR). However, studies focusing on Chinese populations are lacking. We conducted an epigenome-wide association study to investigate the association between DNAm and eGFR in Chinese monozygotic twins.

Background: Triple-negative breast cancer (TNBC) is a heterogeneous disease with a worse prognosis. Despite ongoing efforts, existing therapeutic approaches show limited success in improving early recurrence and survival outcomes for TNBC patients. Therefore, there is an urgent need to discover novel and targeted therapeutic strategies, particularly those focusing on the immune infiltrate in TNBC, to enhance diagnosis and prognosis for affected individuals.

Background: Anti-citrullinated peptide antibodies (ACPA)-negative (ACPA-) rheumatoid arthritis (RA) presents significant diagnostic and therapeutic challenges due to the absence of specific biomarkers, underscoring the need to elucidate its distinctive cellular and metabolic profiles for more targeted interventions.

Methods: Single-cell RNA sequencing data from peripheral blood mononuclear cells (PBMCs) and synovial tissues of patients with ACPA- and ACPA+ RA, as well as healthy controls, were analyzed. Immune cell populations were classified based on clustering and marker gene expression, with pseudotime trajectory analysis, weighted gene co-expression network analysis (WGCNA), and transcription factor network inference providing further insights.
