Genetic changes in repetitive sequences are a hallmark of cancer and other diseases, but characterizing these has been challenging using standard sequencing approaches. We developed a de novo kmer finding approach, called ARTEMIS (Analysis of RepeaT EleMents in dISease), to identify repeat elements from whole-genome sequencing. Using this method, we analyzed 1.2 billion kmers in 2837 tissue and plasma samples from 1975 patients, including those with lung, breast, colorectal, ovarian, liver, gastric, head and neck, bladder, cervical, thyroid, or prostate cancer. We identified tumor-specific changes in these patients in 1280 repeat element types from the LINE, SINE, LTR, transposable element, and human satellite families. These included changes to known repeats and 820 elements that were not previously known to be altered in human cancer. Repeat elements were enriched in regions of driver genes, and their representation was altered by structural changes and epigenetic states. Machine learning analyses of genome-wide repeat landscapes and fragmentation profiles in cfDNA detected patients with early-stage lung or liver cancer in cross-validated and externally validated cohorts. In addition, these repeat landscapes could be used to noninvasively identify the tissue of origin of tumors. These analyses reveal widespread changes in repeat landscapes of human cancers and provide an approach for their detection and characterization that could benefit early detection and disease monitoring of patients with cancer.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11323656PMC
http://dx.doi.org/10.1126/scitranslmed.adj9283DOI Listing

Publication Analysis

Top Keywords

repeat landscapes
16
repeat elements
12
genome-wide repeat
8
repeat
7
cancer
6
changes
5
landscapes
4
landscapes cancer
4
cancer cell-free
4
cell-free dna
4

Similar Publications

Neurodegeneration: 2024 update.

Free Neuropathol

January 2024

Department of Pathology, Nash Family Department of Neuroscience, Department of Artificial Intelligence & Human Health, Neuropathology Brain Bank & Research CoRE, Ronald M. Loeb Center for Alzheimer's Disease, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

This review highlights a collection of both diverse and highly impactful studies published in the previous year selected by the author from the neurodegenerative neuropathology literature. As with previous reviews in this series, the focus is, to the best of my ability, to highlight human tissue-based experimentation most relevant to experimental and clinical neuropathologists. A concerted effort was made to balance the selected studies across neurodegenerative disease categories, approaches, and methodologies to capture the breadth of the research landscape.

View Article and Find Full Text PDF

The nucleolus is a major subnuclear compartment where ribosomal DNA (rDNA) is transcribed and ribosomes are assembled. In addition, recent studies have shown that the nucleolus is a dynamic organizer of chromatin architecture that modulates developmental gene expression. rDNA gene units are assembled into arrays located in the p-arms of five human acrocentric chromosomes.

View Article and Find Full Text PDF

The 5,000 to 8,000 monogenic diseases are inherited disorders leading to mutations in a single gene. These diseases usually appear in childhood and sometimes lead to morbidity or premature death. Although treatments for such diseases exist, gene therapy is considered an effective and targeted method and has been used in clinics for monogenic diseases since 1989.

View Article and Find Full Text PDF

Characterization of the complete plastid genome of (Amaryllidaceae).

Mitochondrial DNA B Resour

December 2024

Institute of Floriculture, Liaoning Academy of Agricultural Sciences, Shenyang, China.

Rourke 2002 is an evergreen herbaceous flower with high ornamental value. In this study, we sequenced the complete chloroplast (cp) genome of and reported it for the first time. The cp genome was 158,914 base pairs (bp) in total length, including two inverted repeats (IRs, 27,052 bp), separated by a large single-copy region (LSC, 86,519 bp) and a small single-copy region (SSC, 18,291 bp).

View Article and Find Full Text PDF

Fitness landscapes of human microsatellites.

PLoS Genet

December 2024

Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America.

Advances in DNA sequencing technology and computation now enable genome-wide scans for natural selection to be conducted on unprecedented scales. By examining patterns of sequence variation among individuals, biologists are identifying genes and variants that affect fitness. Despite this progress, most population genetic methods for characterizing selection assume that variants mutate in a simple manner and at a low rate.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!