AI Article Synopsis

  • Core promoters are DNA regions at gene beginnings that help recruit transcription initiation complexes and vary in structure and motif patterns.
  • Identifying specific motifs related to transcription start sites (TSS) can aid in classifying promoters into functional groups, although this is challenging due to overlapping architectures.
  • The study introduces seqArchR, a method using non-negative matrix factorization to cluster promoter sequences and identify TSS-directing motifs, providing insights into developmental gene expression changes over time.

Article Abstract

Core promoters are stretches of DNA at the beginning of genes that contain information that facilitates the binding of transcription initiation complexes. Different functional subsets of genes have core promoters with distinct architectures and characteristic motifs. Some of these motifs inform the selection of transcription start sites (TSS). By discovering motifs with fixed distances from known TSS positions, we could in principle classify promoters into different functional groups. Due to the variability and overlap of architectures, promoter classification is a difficult task that requires new approaches. In this study, we present a new method based on non-negative matrix factorisation (NMF) and the associated software called seqArchR that clusters promoter sequences based on their motifs at near-fixed distances from a reference point, such as TSS. When combined with experimental data from CAGE, seqArchR can efficiently identify TSS-directing motifs, including known ones like TATA, DPE, and nucleosome positioning signal, as well as novel lineage-specific motifs and the function of genes associated with them. By using seqArchR on developmental time courses, we reveal how relative use of promoter architectures changes over time with stage-specific expression. seqArchR is a powerful tool for initial genome-wide classification and functional characterisation of promoters. Its use cases are more general: it can also be used to discover any motifs at near-fixed distances from a reference point, even if they are present in only a small subset of sequences.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10695386PMC
http://dx.doi.org/10.1371/journal.pcbi.1011491DOI Listing

Publication Analysis

Top Keywords

non-negative matrix
8
matrix factorisation
8
core promoters
8
motifs near-fixed
8
near-fixed distances
8
distances reference
8
reference point
8
motifs
7
identifying promoter
4
promoter sequence
4

Similar Publications

Deciphering progressive lesion areas in breast cancer spatial transcriptomics via TGR-NMF.

Brief Bioinform

November 2024

School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240 Shanghai, China.

Identifying spatial domains is critical for understanding breast cancer tissue heterogeneity and providing insights into tumor progression. However, dropout events introduces computational challenges and the lack of transparency in methods such as graph neural networks limits their interpretability. This study aimed to decipher disease progression-related spatial domains in breast cancer spatial transcriptomics by developing the three graph regularized non-negative matrix factorization (TGR-NMF).

View Article and Find Full Text PDF

Multi-view clustering has garnered significant attention due to its capacity to utilize information from multiple perspectives. The concept of anchor graph-based techniques was introduced to manage large-scale data better. However, current methods rely on K-means or uniform sampling to select anchors in the original space.

View Article and Find Full Text PDF

Nanoparticulate electrocatalysts for the oxygen reduction reaction are structurally diverse materials. Scanning transmission electron microscopy (STEM) has long been the go-to tool to obtain high-quality information about their nanoscale structure. More recently, its four-dimensional modality has emerged as a tool for a comprehensive crystal structure analysis using large data sets of diffraction patterns.

View Article and Find Full Text PDF

Ankle sprains are a common injury among athletes and the general population, with chronic ankle instability (CAI) being a frequent complication. CAI patients often display altered neuromuscular control adaptations. This study analyzed muscle synergy patterns in 20 CAI patients during anticipated and unanticipated landing tasks to understand their neuromuscular adaptation strategies.

View Article and Find Full Text PDF

Purpose Pre-clinical studies have demonstrated direct influences of the autonomic nervous system (ANS) on the immune system. However, it remains unknown if connections between the peripheral ANS and immune system exist in humans and contribute to the development of chronic inflammatory disease. This study had three aims: 1.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!