Sequences derived from the Long INterspersed Element-1 (L1) family of retrotransposons occupy at least 17% of the human genome, with 67 distinct subfamilies representing successive waves of expansion and extinction in mammalian lineages. L1s contribute extensively to gene regulation, but their molecular history is difficult to trace, because most are present only as truncated and highly mutated fossils. Consequently, L1 entries in current databases of repeat sequences are composed mainly of short diagnostic subsequences, rather than full functional progenitor sequences for each subfamily. Here, we have coupled 2 levels of sequence reconstruction (at the level of whole genomes and L1 subfamilies) to reconstruct progenitor sequences for all human L1 subfamilies that are more functionally and phylogenetically plausible than existing models. Most of the reconstructed sequences are at or near the canonical length of L1s and encode uninterrupted ORFs with expected protein domains. We also show that the presence or absence of binding sites for KRAB-C2H2 Zinc Finger Proteins, even in ancient-reconstructed progenitor L1s, mirrors binding observed in human ChIP-exo experiments, thus extending the arms race and domestication model. RepeatMasker searches of the modern human genome suggest that the new models may be able to assign subfamily resolution identities to previously ambiguous L1 instances. The reconstructed L1 sequences will be useful for genome annotation and functional study of both L1 evolution and L1 contributions to host regulatory networks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252281PMC
http://dx.doi.org/10.1093/genetics/iyac074DOI Listing

Publication Analysis

Top Keywords

human genome
8
progenitor sequences
8
reconstructed sequences
8
sequences
6
reconstruction full-length
4
full-length line-1
4
line-1 progenitors
4
progenitors ancestral
4
ancestral genomes
4
genomes sequences
4

Similar Publications

Drug Development.

Alzheimers Dement

December 2024

GSK R&D, Stevenage, Hertfordshire, United Kingdom.

Background: Genetic variants in GRN, the gene encoding progranulin, are causal for or are associated with the risk of multiple neurodegenerative diseases. Modulating progranulin has been considered as a therapeutic strategy for neurodegenerative diseases including Frontotemporal Dementia (FTD) and Alzheimer's Disease (AD). Here, we integrated genetics with proteomic data to determine the causal human evidence for the therapeutic benefit of modulating progranulin in AD.

View Article and Find Full Text PDF

Background: Although investment in biomedical and pharmaceutical research has increased significantly over the past two decades, there are no oral disease-modifying treatments for Alzheimer's disease (AD).

Method: We performed comprehensive human genetic and multi-omics data analyses to test likely causal relationship between EPHX2 (encoding soluble epoxide hydrolase [sEH]) and risk of AD. Next, we tested the effect of the oral administration of EC5026 (a first-in-class, picomolar sEH inhibitor) in a transgenic mouse model of AD-5xFAD and mechanistic pathways of EC5026 in patient induced Pluripotent Stem Cells (iPSC) derived neurons.

View Article and Find Full Text PDF

Background: Genome-wide association studies (GWAS) have identified close to one hundred loci associated with Alzheimer's disease (AD) risk. However, for most of these loci we do not understand the underlying mechanism leading to disease. Crispr genome editing in human induced pluripotent stem cells (hiPSCs) provides a model system to study the effects of these genetic variants in a disease relevant cell type.

View Article and Find Full Text PDF

An IS element-driven antisense RNA attenuates the expression of serotype 2 fimbriae and the cytotoxicity of .

Emerg Microbes Infect

January 2025

Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, US 41 - UAR 2014 - PLBS, F-59000 Lille, France.

Insertion sequences (IS) represent mobile genetic elements that have been shown to be associated with bacterial evolution and adaptation due to their effects on genome plasticity. In , the causative agent of whooping cough, the numerous IS elements induce genomic rearrangements and contribute to the diversity of the global population. Previously, we have shown that the majority of IS-specific endogenous promoters induce the synthesis of alternative transcripts and thereby affect the transcriptional landscape of .

View Article and Find Full Text PDF

Background: Familial hemiplegic migraine (FHM) types 1-3 are associated with protein-altering genetic variants in , and , respectively. These genes have also been linked to epilepsy. Previous studies primarily focused on phenotypes, examining genetic variants in individuals with characteristic FHM symptoms.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!