AI Article Synopsis

  • Transposable elements (TEs) play a crucial role in the human genome, with potential negative effects on gene regulation and disease when they insert in certain locations.
  • xTea is a new tool developed for identifying TE insertions in whole-genome sequencing data, working effectively with both short-read and long-read data, and outperforming other existing methods.
  • Using long-read data, xTea has created a comprehensive catalog of polymorphic TE insertions and revealed that individual genomes contain multiple full-length L1 elements, particularly in highly repetitive regions like centromeres, indicating a previously overlooked source of active elements.

Article Abstract

Transposable elements (TEs) help shape the structure and function of the human genome. When inserted into some locations, TEs may disrupt gene regulation and cause diseases. Here, we present xTea (x-Transposable element analyzer), a tool for identifying TE insertions in whole-genome sequencing data. Whereas existing methods are mostly designed for short-read data, xTea can be applied to both short-read and long-read data. Our analysis shows that xTea outperforms other short read-based methods for both germline and somatic TE insertion discovery. With long-read data, we created a catalogue of polymorphic insertions with full assembly and annotation of insertional sequences for various types of retroelements, including pseudogenes and endogenous retroviruses. Notably, we find that individual genomes have an average of nine groups of full-length L1s in centromeres, suggesting that centromeres and other highly repetitive regions such as telomeres are a significant yet unexplored source of active L1s. xTea is available at https://github.com/parklab/xTea .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219666PMC
http://dx.doi.org/10.1038/s41467-021-24041-8DOI Listing

Publication Analysis

Top Keywords

long-read data
8
comprehensive identification
4
identification transposable
4
transposable element
4
element insertions
4
insertions multiple
4
multiple sequencing
4
sequencing technologies
4
technologies transposable
4
transposable elements
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!