Known sequence features explain half of all human gene ends.

NAR Genom Bioinform

Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada.

Published: June 2023

Cleavage and polyadenylation (CPA) sites define eukaryotic gene ends. CPA sites are associated with five key sequence recognition elements: the upstream UGUA, the polyadenylation signal (PAS), and U-rich sequences; the CAUA dinucleotide where cleavage occurs; and GU-rich downstream elements (DSEs). Currently, it is not clear whether these sequences are sufficient to delineate CPA sites. Additionally, numerous other sequences and factors have been described, often in the context of promoting alternative CPA sites and preventing cryptic CPA site usage. Here, we dissect the contributions of individual sequence features to CPA using standard discriminative models. We show that models comprised only of the five primary CPA sequence features give highest probability scores to constitutive CPA sites at the ends of coding genes, relative to the entire pre-mRNA sequence, for 59% of all human genes. U1-hybridizing sequences provide a small boost in performance. The addition of all known RBP RNA binding motifs to the model increases this figure to only 61%, suggesting that additional factors beyond the core CPA machinery have a minimal role in delineating real from cryptic sites. To our knowledge, this high effectiveness of established features to predict human gene ends has not previously been documented.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10072996PMC
http://dx.doi.org/10.1093/nargab/lqad031DOI Listing

Publication Analysis

Top Keywords

cpa sites
20
sequence features
12
gene ends
12
cpa
9
human gene
8
sites
6
sequence
5
features explain
4
explain half
4
half human
4

Similar Publications

Comparison of urinary excretion patterns among exposures to cosmetic preservative, herbicide, and nootropic stimulant in anti-doping analysis.

J Chromatogr B Analyt Technol Biomed Life Sci

January 2025

School of Pharmacy, College of Pharmacy, Kaohsiung Medical University, Kaohsiung 807, Taiwan; Doctoral Degree Program in Toxicology, College of Pharmacy, Kaohsiung Medical University, Kaohsiung 807, Taiwan. Electronic address:

Doping with meclofenoxate, a nootropic stimulant prohibited in-competition by the World Anti-Doping Agency (WADA), is identified through the primary marker of urinary 4-chlorophenoxyacetic acid (4-CPA). However, the presence of 4-CPA can also arise from permissible sources. This study ventured into comparing urinary excretion patterns among exposures to permitted chemicals (chlorphenesin and 4-CPA) and the banned stimulant (meclofenoxate) and interpreting the analytical findings according to the reporting requirements.

View Article and Find Full Text PDF

Vesicular stomatitis virus (VSV), comprising vesicular stomatitis New Jersey virus (VSNJV) and vesicular stomatitis Indiana virus (VSIV), emerges from its focus of endemic transmission in Southern Mexico to cause sporadic livestock epizootics in the Western United States. A dearth of information on the role of potential arthropod vectors in the endemic region hampers efforts to identify factors that enable endemicity and predict outbreaks. In a two-year, longitudinal study at five cattle ranches in Chiapas, Mexico, insect taxa implicated as VSV vectors (blackflies, sandflies, biting midges, and mosquitoes) were collected and screened for VSV RNA, livestock vesicular stomatitis (VS) cases were monitored, and serum samples were screened for neutralizing antibodies.

View Article and Find Full Text PDF

The discovery and structural basis of two distinct state-dependent inhibitors of BamA.

Nat Commun

October 2024

Department of Structural Biology, Genentech Inc., South San Francisco, CA, USA.

Article Synopsis
  • BamA is a crucial part of the β-barrel assembly machine (BAM) that helps insert proteins into the outer membrane of Gram-negative bacteria like E. coli.
  • Researchers used in vitro selection techniques to find peptide macrocycles that can disrupt BamA's function by binding to different conformational states, specifically Peptide Targeting BamA-1 (PTB1) and Peptide Targeting BamA-2 (PTB2).
  • These findings are important for developing new antibiotics targeting BamA and could also be applied to discover modulators for other similar proteins in the future.
View Article and Find Full Text PDF

Maintenance methylation, of palindromic CpG dinucleotides at DNA replication forks, is crucial for the faithful mitotic inheritance of genomic 5-methylcytosine (5mC) methylation patterns. MBD proteins use two arginine residues to recognize symmetrically-positioned methyl groups in fully-methylated 5mCpG/5mCpG and 5mCpA/TpG dinucleotides. In contrast, C2H2 zinc finger (ZF) proteins recognize CpG and CpA, whether methylated or not, within longer specific sequences in a site- and strand-specific manner.

View Article and Find Full Text PDF

Transport of herbicides by PIN-FORMED auxin transporters.

bioRxiv

August 2024

Plant Systems Biology, School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany.

Auxins are a group of phytohormones that control plant growth and development . Their crucial role in plant physiology has inspired development of potent synthetic auxins that can be used as herbicides . Phenoxyacetic acid derivatives are a widely used group of auxin herbicides in agriculture and research.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!