Exon Elongation Added Intrinsically Disordered Regions to the Encoded Proteins and Facilitated the Emergence of the Last Eukaryotic Common Ancestor.

Mol Biol Evol

Program for Information Systems, Division of Informatics, Bioengineering and Bioscience, Maebashi Institute of Technology, Maebashi-shi, Japan.

Published: January 2023

Most prokaryotic proteins consist of a single structural domain (SD) with little intrinsically disordered regions (IDRs) that by themselves do not adopt stable structures, whereas the typical eukaryotic protein comprises multiple SDs and IDRs. How eukaryotic proteins evolved to differ from prokaryotic proteins has not been fully elucidated. Here, we found that the longer the internal exons are, the more frequently they encode IDRs in eight eukaryotes including vertebrates, invertebrates, a fungus, and plants. Based on this observation, we propose the "small bang" model from the proteomic viewpoint: the protoeukaryotic genes had no introns and mostly encoded one SD each, but a majority of them were subsequently divided into multiple exons (step 1). Many exons unconstrained by SDs elongated to encode IDRs (step 2). The elongated exons encoding IDRs frequently facilitated the acquisition of multiple SDs to make the last common ancestor of eukaryotes (step 3). One prediction of the model is that long internal exons are mostly unconstrained exons. Analytical results of the eight eukaryotes are consistent with this prediction. In support of the model, we identified cases of internal exons that elongated after the rat-mouse divergence and discovered that the expanded sections are mostly in unconstrained exons and preferentially encode IDRs. The model also predicts that SDs followed by long internal exons tend to have other SDs downstream. This prediction was also verified in all the eukaryotic species analyzed. Our model accounts for the dichotomy between prokaryotic and eukaryotic proteins and proposes a selective advantage conferred by IDRs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825244PMC
http://dx.doi.org/10.1093/molbev/msac272DOI Listing

Publication Analysis

Top Keywords

internal exons
16
encode idrs
12
exons
9
intrinsically disordered
8
disordered regions
8
common ancestor
8
prokaryotic proteins
8
multiple sds
8
eukaryotic proteins
8
exons unconstrained
8

Similar Publications

Alternative splicing is essential for the generation of various protein isoforms that are involved in cell differentiation and tissue development. In addition to internal coding exons, alternative splicing affects the exons with translation initiation codons; however, little is known about these exons. Here, we performed a systematic classification of human alternative exons using coding information.

View Article and Find Full Text PDF

In heme degradation, biliverdin reductase catalyzes the conversion of biliverdin to bilirubin. Defects in the biliverdin reductase A gene () causing biliverdinuria are extraordinarily rare in humans, and this inborn error of metabolism has not been reported in other mammals. The objective of this study was to diagnose biliverdinuria and identify the causal variants in two adult mixed-breed dogs with life-long green urine.

View Article and Find Full Text PDF

Background/purpose: Although metabolic dysfunction-associated steatotic liver disease (MASLD) has been proposed to replace the diagnosis of non-alcoholic fatty liver disease (NAFLD) with new diagnostic criteria since 2023, the genetic predisposition of MASLD remains to be explored.

Methods: Participants with data of genome-wide association studies (GWAS) in the Taiwan Biobank database were collected. Patients with missing data, positive for HBsAg, anti-HCV, and alcohol drinking history were excluded.

View Article and Find Full Text PDF

Exons within transcripts are traditionally classified as first, internal or last exons, each governed by different regulatory mechanisms. We recently described the widespread usage of 'hybrid' exons that serve as terminal or internal exons in different transcripts. Here, we employ an interpretable deep learning pipeline to dissect the sequence features governing the co-regulation of transcription initiation and splicing in hybrid exons.

View Article and Find Full Text PDF

Canine mast cell tumors (MCTs) are common skin neoplasms with varying biological behaviors. The proto-oncogene plays a key role in the development of these tumors, and internal tandem duplications on exon 11 are usually associated with more aggressive behavior, increased local recurrence, and decreased survival time. However, apart from exons 8-11 and 17, there is limited understanding of the overall mutational landscape in canine MCTs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!