Genome-wide identification of dominant polyadenylation hexamers for use in variant classification.

Hum Mol Genet

Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, United States.

Published: November 2023

Polyadenylation is an essential process for the stabilization and export of mRNAs to the cytoplasm, and the polyadenylation signal hexamer (herein referred to as hexamer) plays a key role in this process. Yet only 14 Mendelian disorders have been associated with hexamer variants. This is likely an under-ascertainment, as hexamers are not well defined and are not routinely examined in molecular analysis. To facilitate the interrogation of putatively pathogenic hexamer variants, we set out to define functionally important hexamers genome-wide as a resource for research and clinical testing. We identified predominant polyA sites (herein referred to as pPAS) and putative predominant hexamers across protein-coding genes (PAS usage >50% per gene). As a measure of the validity of these sites, the population constraint of 4532 predominant hexamers was measured. The predominant hexamers had fewer observed variants than non-predominant hexamers and trimer controls, and CADD scores for variants in these hexamers were significantly higher than those for controls. Exome data for 1477 individuals were interrogated for hexamer variants, and transcriptome data were generated for 76 individuals with 65 variants in predominant hexamers. 3' RNA-seq data showed that these variants resulted in alternate polyadenylation events (38%) and in elongated mRNA transcripts (12%). Our list of pPAS and predominant hexamers is available in the UCSC genome browser and on GitHub. We suggest that this list of predominant hexamers can be used to interrogate exome and genome data. Variants in these predominant hexamers should be considered candidates for pathogenic variation in human disease, and to that end we suggest pathogenicity criteria for classifying hexamer variants.
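The two steps described in the abstract, selecting a predominant polyA site per gene (PAS usage >50%) and checking variants against predominant hexamer positions, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: usage is taken to be a site's share of a gene's 3' reads, the function names `predominant_sites` and `variants_in_hexamers` are hypothetical, and the gene names, read counts, and coordinates are toy values. It does not reproduce the authors' pipeline or the file format of their published list.

```python
from collections import defaultdict

def predominant_sites(site_usage, threshold=0.5):
    """Return {gene: site_id} for sites whose share of the gene's reads exceeds threshold.

    site_usage: iterable of (gene, site_id, read_count) tuples (toy format, assumed).
    """
    site_usage = list(site_usage)
    totals = defaultdict(int)
    for gene, _site, count in site_usage:
        totals[gene] += count
    result = {}
    for gene, site, count in site_usage:
        # Strictly greater than the threshold, mirroring "PAS usage >50% per gene".
        if totals[gene] > 0 and count / totals[gene] > threshold:
            result[gene] = site
    return result

def variants_in_hexamers(variants, hexamers):
    """Return (chrom, pos, gene) for each variant that falls inside a hexamer interval.

    variants: iterable of (chrom, pos), 1-based positions (assumed).
    hexamers: iterable of (chrom, start, end, gene), 1-based inclusive (assumed).
    """
    hexamers = list(hexamers)
    hits = []
    for chrom, pos in variants:
        for h_chrom, start, end, gene in hexamers:
            if chrom == h_chrom and start <= pos <= end:
                hits.append((chrom, pos, gene))
    return hits

# Toy data, invented for illustration only.
usage = [
    ("GENE_A", "site1", 80), ("GENE_A", "site2", 20),
    ("GENE_B", "site1", 50), ("GENE_B", "site2", 50),
]
hexamer_list = [("chr1", 100, 105, "GENE_A")]
variant_list = [("chr1", 105), ("chr1", 200), ("chr2", 103)]

print(predominant_sites(usage))
print(variants_in_hexamers(variant_list, hexamer_list))
```

In this toy input, GENE_B has no predominant site because neither of its sites exceeds 50% usage. The nested loop is adequate for a few thousand hexamers; an interval index would be a reasonable substitute for genome-scale variant sets.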


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656703
DOI: http://dx.doi.org/10.1093/hmg/ddad136


Similar Publications

CapG, an enzyme expressed by , catalyzes an epimerization reaction to synthesize -acetyl-L-fucosamine, a constituent of capsule involved in pathogenesis. This protein has two domains, exists as homohexamers in solution, and usually produces products at hundred-nanomolar concentrations. To determine the folding-unfolding mechanism and the oligomeric form of CapG, particularly at low concentrations, we investigated a recombinant CapG (rCapG) using different probes.


Oligomerization of protein arginine methyltransferase 1 and its functional impact on substrate arginine methylation.

J Biol Chem

December 2024

Department of Pharmaceutical and Biomedical Sciences, College of Pharmacy, University of Georgia, Athens, Georgia, United States.

Article Synopsis
  • Protein arginine methyltransferases (PRMTs) are crucial enzymes in eukaryotic cells that modify proteins and influence various biological processes like gene transcription and metabolism.
  • This study uncovered multiple higher-order structures of PRMT1, such as tetramers and octamers, using cryo-electron microscopy and linked these structures to enhanced enzyme activity.
  • Oligomerization was shown to increase PRMT1's efficiency in methylation and suggested that even a non-active mutant of PRMT1 could boost the function of the wild-type enzyme, indicating a new regulatory mechanism in enzyme activity.

Bacterial microcompartments (BMCs) are protein-bound organelles found in some bacteria that encapsulate enzymes for enhanced catalytic activity. These compartments spatially sequester enzymes within semipermeable shell proteins, analogous to many membrane-bound organelles. The shell proteins assemble into multimeric tiles (hexamers, trimers, and pentamers), and these tiles self-assemble into larger assemblies with icosahedral symmetry.


Alzheimer's disease (AD) is a neurological disorder associated with amyloid β-protein (Aβ) assembly into toxic oligomers. In addition to the two predominant alloforms, Aβ1-40 and Aβ1-42, other C-terminally truncated Aβ peptides, including Aβ1-38 and Aβ1-43, are produced in the brain. Here, we use discrete molecular dynamics (DMD) and a four-bead protein model with amino acid-specific hydropathic interactions, DMD4B-HYDRA, to examine oligomer formation of Aβ1-38, Aβ1-40, Aβ1-42, and Aβ1-43.


Structural biology of flavivirus NS1 protein and its antibody complexes.

Antiviral Res

July 2024

Lee Kong Chian School of Medicine, Nanyang Technological University, EMB 03-07, 59 Nanyang Drive, Singapore, 636921; NTU Institute of Structural Biology, Nanyang Technological University, EMB 06-01, 59 Nanyang Drive, Singapore, 636921; National Centre for Infectious Diseases, Singapore, 308442, Singapore.

The flavivirus genus includes many mosquito-borne human pathogens, such as Zika virus (ZIKV) and the four serotypes of dengue virus (DENV1-4), that affect billions of people, as evidenced by epidemics and endemicity in many countries and regions of the world. Among the 10 viral proteins encoded by the viral genome, the nonstructural protein 1 (NS1) is the only secreted protein and has been used as a diagnostic biomarker. NS1 has also been an attractive target for its biotherapeutic potential as a vaccine antigen.

