Splicing of internal large exons is defined by novel cis-acting sequence elements.

Nucleic Acids Res

Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA.

Published: October 2012

Human internal exons have an average size of 147 nt, and most are <300 nt. This small size is thought to facilitate exon definition. A small number of large internal exons have been identified and shown to be alternatively spliced. We identified 1115 internal exons >1000 nt in the human genome; these were found in 5% of all protein-coding genes, and most were expressed and translated. Surprisingly, 40% of these were expressed at levels similar to the flanking exons, suggesting they were constitutively spliced. While all of the large exons had strong splice sites, the constitutively spliced large exons had a higher ratio of splicing enhancers/silencers and were more conserved across mammals than the alternatively spliced large exons. We asked if large exons contain specific sequences that promote splicing and identified 38 sequences enriched in the large exons relative to small exons. The consensus sequence is C-rich with a central invariant CA dinucleotide. Mutation of these sequences in a candidate large exon indicated that these are important for recognition of large exons by the splicing machinery. We propose that these sequences are large exon splicing enhancers (LESEs).
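
The selection step described above (internal exons longer than 1000 nt) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `large_internal_exons`, the `(start, end)` tuple representation, and the 0-based half-open coordinate convention are all assumptions made for illustration. Only the 1000 nt threshold and the restriction to internal exons come from the abstract.

```python
from typing import List, Tuple

# Hypothetical representation: one exon as (start, end), 0-based, half-open,
# so the exon length in nt is simply end - start.
Exon = Tuple[int, int]


def large_internal_exons(exons: List[Exon], min_len: int = 1000) -> List[Exon]:
    """Return the internal exons of one transcript that are longer than min_len nt.

    An exon is treated as internal if it is neither the first nor the last
    exon once the exons are ordered by genomic coordinate. That ordering
    identifies the two terminal exons on either strand.
    """
    ordered = sorted(exons)
    internal = ordered[1:-1]  # empty for transcripts with fewer than 3 exons
    return [(start, end) for start, end in internal if end - start > min_len]


# Illustrative transcript with four exons; only the second is internal and >1000 nt.
transcript = [(0, 100), (200, 1500), (2000, 2147), (3000, 3100)]
print(large_internal_exons(transcript))
```

Applied across an annotation, a filter of this kind would yield the candidate set that the abstract then compares against small exons. The downstream steps (expression comparison, enhancer/silencer ratios, motif enrichment) are not sketched here.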

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3467050 (PMC)
http://dx.doi.org/10.1093/nar/gks652 (DOI)

Publication Analysis

Top Keywords

large exons: 28
spliced large: 12
exons: 10
large: 9
constitutively spliced: 8
large exon: 8
splicing: 5
splicing internal: 4
internal large: 4
exons defined: 4

Similar Publications

The cellular concentrations of splicing factors (SFs) are critical for controlling alternative splicing. Most serine and arginine-enriched (SR) protein SFs regulate their own concentration via a homeostatic feedback mechanism that involves regulation of inclusion of non-coding 'poison exons' (PEs) that target transcripts for nonsense-mediated decay. The importance of SR protein PE splicing during animal development is largely unknown despite PE ultra-conservation across animal genomes.

Contribution of large genomic rearrangements in BRCA1/2 genes and CHEK2 1100delC allele variant to the development of breast/ovarian cancer in Argentinian population.

Breast Cancer Res Treat

December 2024

Centro Nacional de Genética Médica, ANLIS ''Dr Carlos G Malbrán'', Ministerio de Salud de La Nación, Buenos Aires, Argentina.

Purpose: Among women in Argentina, the most common cancer is breast cancer (BC) with 21,631 new cases and 6436 deaths per year. The ovarian cancer (OC) is fifteenth in frequency. The contribution of cancer-related large genomic rearrangements (LGRs) of the BRCA1/BRCA2 genes and the 1100delC allelic variant in the CHEK2 gene has not yet been widely studied in our population.

Breast cancer stem cells (BCSCs) are a rare cell population that is responsible for tumour initiation, metastasis and chemoresistance. Despite this, the mechanism by which BCSCs withstand genotoxic stress is largely unknown. Here, we uncover a pivotal role for the arginine methyltransferase PRMT5 in mediating BCSC chemoresistance by modulating DNA repair efficiency.

The NRXN1 locus is a hotspot for non-recurrent copy number variants and exon-disrupting NRXN1 deletions have been associated with increased risk of neurodevelopmental disorders in case-control studies. However, corresponding population-based estimates of prevalence and disease-associated risk are currently lacking. Also, most studies have not differentiated between deletions affecting exons of different NRXN1 splice variants nor considered intronic deletions.

Large and highly repetitive genomes are common. However, research interests usually lie within the non-repetitive parts of the genome, as they are more likely functional, and can be used to answer questions related to adaptation, selection and evolutionary history. Exome capture is a cost-effective method for providing sequencing data from protein-coding parts of the genes.
