PCGA: a comprehensive web server for phenotype-cell-gene association analysis.

Nucleic Acids Res

Program in Bioinformatics, Zhongshan School of Medicine and The Fifth Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, China.

Published: July 2022

Most complex disease-associated loci mapped by genome-wide association studies (GWAS) are located in non-coding regions. It remains elusive which genes the associated loci regulate and in which tissues/cell types the regulation occurs. Here, we present PCGA (https://pmglab.top/pcga), a comprehensive web server for jointly estimating both associated tissues/cell types and susceptibility genes for complex phenotypes by GWAS summary statistics. The web server is built on our published method, DESE, which represents an effective method to mutually estimate driver tissues and genes by integrating GWAS summary statistics and transcriptome data. By collecting and processing extensive bulk and single-cell RNA sequencing datasets, PCGA has included expression profiles of 54 human tissues, 2,214 human cell types and 4,384 mouse cell types, which provide the basis for estimating associated tissues/cell types and genes for complex phenotypes. We develop a framework to sequentially estimate associated tissues and cell types of a complex phenotype according to their hierarchical relationships we curated. Meanwhile, we construct a phenotype-cell-gene association landscape by estimating the associated tissues/cell types and genes of 1,871 public GWASs. The association landscape is generally consistent with biological knowledge and can be searched and browsed at the PCGA website.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252750PMC
http://dx.doi.org/10.1093/nar/gkac425DOI Listing

Publication Analysis

Top Keywords

tissues/cell types
16
web server
12
estimating associated
12
associated tissues/cell
12
cell types
12
comprehensive web
8
phenotype-cell-gene association
8
genes complex
8
complex phenotypes
8
gwas summary
8

Similar Publications

Dissecting the cellular architecture and genetic circuitry of the soybean seed.

Proc Natl Acad Sci U S A

January 2025

Department of Plant Biology, College of Biological Sciences, University of California, Davis, CA 95616.

Seeds are complex structures composed of three regions, embryo, endosperm, and seed coat, with each further divided into subregions that consist of tissues, cell layers, and cell types. Although the seed is well characterized anatomically, much less is known about the genetic circuitry that dictates its spatial complexity. To address this issue, we profiled mRNAs from anatomically distinct seed subregions at several developmental stages.

View Article and Find Full Text PDF
Article Synopsis
  • Gene expression biomarkers can help identify both genotoxic and non-genotoxic carcinogens, which could reduce the need for animal testing.
  • In August 2022, a workshop reviewed current methods for using transcriptomic profiling to detect genotoxic chemicals, examining 1341 papers to find reliable biomarkers.
  • The analysis identified two promising in vivo biomarkers and three in vitro biomarkers that show over 92% predictive accuracy and can be adapted for various testing conditions, with support from workshop participants for their regulatory adoption.
View Article and Find Full Text PDF

Effects of gene dosage on cognitive ability: A function-based association study across brain and non-brain processes.

Cell Genom

December 2024

Centre Hospitalier Universitaire Sainte-Justine Research Center, Montreal, QC, Canada; Department of Pediatrics, Université de Montréal, Montreal, QC, Canada. Electronic address:

Copy-number variants (CNVs) that increase the risk for neurodevelopmental disorders also affect cognitive ability. However, such CNVs remain challenging to study due to their scarcity, limiting our understanding of gene-dosage-sensitive biological processes linked to cognitive ability. We performed a genome-wide association study (GWAS) in 258,292 individuals, which identified-for the first time-a duplication at 2q12.

View Article and Find Full Text PDF

Plants make complex and potent therapeutic molecules, but difficulties in sourcing from natural producers or chemical synthesis can challenge their use in the clinic. A prominent example is the anti-cancer therapeutic paclitaxel (Taxol). Identification of the full paclitaxel biosynthetic pathway would enable heterologous drug production, but it has eluded discovery despite a half century of intensive research.

View Article and Find Full Text PDF

A natural language processing system for the efficient extraction of cell markers.

Sci Rep

September 2024

Marketing and Management Department, CapitalBio Technology, Beijing, 100176, China.

Single-cell RNA sequencing (scRNA-seq) has emerged as a pivotal tool for exploring cellular landscapes across diverse species and tissues. Precise annotation of cell types is essential for understanding these landscapes, relying heavily on empirical knowledge and curated cell marker databases. In this study, we introduce MarkerGeneBERT, a natural language processing (NLP) system designed to extract critical information from the literature regarding species, tissues, cell types, and cell marker genes in the context of single-cell sequencing studies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!