Claims of intron-structure correlations have played a major role in debates surrounding split gene origins. In the formative (as opposed to disruptive or "insertional") model of split gene origins, introns represent the scars of chimaeric gene assembly. When analyzed retrospectively, formative introns should tend to fall between modular units, if such units exist, or at least to exhibit a preference for sites favorable to chimaera formation. However, there is another possible source of preferences: under a disruptive model of split gene origins, fortuitous intron-structure correlations may arise because the gain of introns is biased with respect to flanking nucleotide sequences. To investigate the extent to which a sequence-biased intron gain model may account for the present-day distribution of introns, data on over 10,000 introns in eukaryotic protein-coding genes were integrated with structural data from a set of 1,851 nonredundant protein chains. The positions of introns with respect to secondary structures, solvent accessibility, and so-called "modules" were evaluated relative to the expectations of a null model, a disruptive model based on amino acid frequencies at splice junctions, and a formative model defined relative to these. The null model can be excluded for most structural features and is highly improbable when intron sites are grouped by reading frame phase. Phase-dependent correlations with secondary structure and side-chain surface accessibility are particularly strong. However, these phase-dependent correlations are explained largely by the sequence-based disruptive model.

Download full-text PDF

Source
http://dx.doi.org/10.1093/molbev/msm151DOI Listing

Publication Analysis

Top Keywords

split gene
12
gene origins
12
disruptive model
12
structural features
8
intron-structure correlations
8
model
8
model split
8
null model
8
phase-dependent correlations
8
introns
6

Similar Publications

Alternative splicing impacts most multi-exonic human genes. Inaccuracies during this process may have an important role in ageing and disease. Here, we investigate splicing accuracy using RNA-sequencing data from >14k control samples and 40 human body sites, focusing on split reads partially mapping to known transcripts in annotation.

View Article and Find Full Text PDF

The flexibility and programmability of CRISPR-Cas technology have made it one of the most popular tools for biomarker diagnostics and gene regulation. Especially, the CRISPR-Cas12 system has shown exceptional clinical diagnosis and gene editing capabilities. Here, we discovered that although the top loop of the 5' handle of guide RNA can undergo central splitting, deactivating CRISPR-Cas12a, the segments can dramatically restore CRISPR function through nucleic acid self-assembly or interactions with small molecules and aptamers.

View Article and Find Full Text PDF

Light chain Split Luciferase assay implicates pathological NOTCH3 thiol reactivity in inherited cerebral small vessel disease.

J Biol Chem

January 2025

Departments of Neurology, University of Michigan, Ann Arbor, MI 48109; Departments of Molecular and Integrative Physiology, University of Michigan, Ann Arbor, MI 48109; Neurology Service, VA Ann Arbor Healthcare System, Department of Veterans Affairs, Ann Arbor, MI 48105. Electronic address:

Stereotyped mutations in NOTCH3 drive CADASIL, the leading inherited cause of stroke and vascular dementia. The vast majority of these mutations result in alterations in the number of cysteines in the gene product. However, non-cysteine altering pathogenic mutations have also been identified, making it challenging to discriminate pathogenic from benign NOTCH3 sequence variants.

View Article and Find Full Text PDF

Structures and mRNP remodeling mechanism of the TREX-2 complex.

Structure

January 2025

Department of Biochemistry, Vanderbilt University School of Medicine Basic Sciences, Nashville, TN 37232, USA; Center for Structural Biology, Vanderbilt University, Nashville, TN 37232, USA. Electronic address:

mRNAs are packaged with proteins into messenger ribonucleoprotein complexes (mRNPs) in the nucleus. mRNP assembly and export are of fundamental importance for all eukaryotic gene expression. Before export to the cytoplasm, mRNPs undergo dynamic remodeling governed by the DEAD-box helicase DDX39B (yeast Sub2).

View Article and Find Full Text PDF

Genomic sources from China are underrepresented in the population-specific reference database. We performed whole-genome sequencing or genome-wide genotyping on 1,207 individuals from four linguistically diverse groups (1,081 Sinitic, 56 Mongolic, 40 Turkic, and 30 Tibeto-Burman people) living in North China included in the 10K Chinese People Genomic Diversity Project (10K_CPGDP) to characterize the genetic architecture and adaptative history of ethnic groups in the Silk Road Region of China. We observed a population split between Northwest Chinese minorities (NWCMs) and Han Chinese since the Upper Paleolithic and later Neolithic genetic differentiation within NWCMs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!