Fast decoding cell type-specific transcription factor binding landscape at single-nucleotide resolution.

Genome Res

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA.

Published: April 2021

AI Article Synopsis

Article Abstract

Decoding the cell type-specific transcription factor (TF) binding landscape at single-nucleotide resolution is crucial for understanding the regulatory mechanisms underlying many fundamental biological processes and human diseases. However, limits on time and resources restrict the high-resolution experimental measurements of TF binding profiles of all possible TF-cell type combinations. Previous computational approaches either cannot distinguish the cell context-dependent TF binding profiles across diverse cell types or can only provide a relatively low-resolution prediction. Here we present a novel deep learning approach, Leopard, for predicting TF binding sites at single-nucleotide resolution, achieving the average area under receiver operating characteristic curve (AUROC) of 0.982 and the average area under precision recall curve (AUPRC) of 0.208. Our method substantially outperformed the state-of-the-art methods Anchor and FactorNet, improving the predictive AUPRC by 19% and 27%, respectively, when evaluated at 200-bp resolution. Meanwhile, by leveraging a many-to-many neural network architecture, Leopard features a hundredfold to thousandfold speedup compared with current many-to-one machine learning methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015851PMC
http://dx.doi.org/10.1101/gr.269613.120DOI Listing

Publication Analysis

Top Keywords

single-nucleotide resolution
12
decoding cell
8
cell type-specific
8
type-specific transcription
8
transcription factor
8
factor binding
8
binding landscape
8
landscape single-nucleotide
8
binding profiles
8
average area
8

Similar Publications

Topologically constrained DNA-mediated one-pot CRISPR assay for rapid detection of viral RNA with single nucleotide resolution.

EBioMedicine

January 2025

Department of Chemistry, Key Laboratory of Bioorganic Phosphorus Chemistry & Chemical Biology, Tsinghua University, New Cornerstone Science Foundation, Beijing, 100084, China. Electronic address:

Background: The widespread and evolution of RNA viruses, such as the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), highlights the importance of fast identification of virus subtypes, particularly in non-laboratory settings. Rapid and inexpensive at-home testing of viral nucleic acids with single-base resolution remains a challenge.

Methods: Topologically constrained DNA ring is engineered as substrates for the trans-cleavage of Cas13a to yield an accelerated post isothermal amplification.

View Article and Find Full Text PDF

Complete Chloroplast Genomes of 9 Species: Genome Structure, Comparative Analysis, and Phylogenetic Relationships.

Int J Mol Sci

January 2025

College of Landscape Architecture and Horticulture Sciences, Southwest Research Center for Engineering Technology of Landscape Architecture (State Forestry and Grassland Administration), Yunnan Engineering Research Center for Functional Flower Resources and Industrialization, Research and Development Center of Landscape Plants and Horticulture Flowers, Southwest Forestry University, Kunming 650224, China.

is a genus of functional herbaceous plants in the Balsaminaceae, which are not only of great ornamental value and one of the world's top three flower bedding plants but also have a wide range of medicinal and edible uses. Currently, the taxonomy and phylogenetic relationships of species are still controversial. In order to better understand their chloroplast properties and phylogenetic evolution, nine plants (, , , , , , , , ) were sequenced, and their complete chloroplast genomes were analysed.

View Article and Find Full Text PDF

Evaluating genome-wide and targeted forensic sequencing approaches to kinship determination.

Forensic Sci Int Genet

January 2025

Department of Genetics, Genomics & Cancer Sciences, University of Leicester, University Road, Leicester, UK. Electronic address:

Kinship determination is a valuable tool in forensic genetics, with applications including familial searching, disaster victim identification, and investigative genetic genealogy. Conventional typing of small numbers of autosomal short tandem repeats (STRs) confidently identifies only first-degree relatives. Massively parallel sequencing (MPS) can access more STRs and resolve alleles identical by length but differing in sequence (isoalleles), which may increase the power of kinship estimation, particularly when combined with additional sequenced single nucleotide polymorphism (SNP) loci, as in the ForenSeq DNA Signature Prep kit.

View Article and Find Full Text PDF

Carotenoids are a diverse group of pigments imparting red, orange, and yellow hues to many horticultural plants, also enhancing their nutritional properties and health benefits. In strawberry, the genetic and molecular mechanisms regulating the natural variation of fruit carotenoid composition remain largely unexplored. In this study, we use a population segregating in yellow/white flesh to detect a major quantitative trait locus (QTL), qYellow Flesh-4B, located on chromosome 4B and accounting for 82% of total phenotypic variation.

View Article and Find Full Text PDF

YHSeqY3000 panel captures all founding lineages in the Chinese paternal genomic diversity database.

BMC Biol

January 2025

Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China.

Background: The advancements in second-/third-generation sequencing technologies, alongside computational innovations, have significantly enhanced our understanding of the genomic structure of Y-chromosomes and their unique phylogenetic characteristics. These researches, despite the challenges posed by the lack of population-scale genomic databases, have the potential to revolutionize our approach to high-resolution, population-specific Y-chromosome panels and databases for anthropological and forensic applications.

Objectives: This study aimed to develop the highest-resolution Y-targeted sequencing panel, utilizing time-stamped, core phylogenetic informative mutations identified from high-coverage sequences in the YanHuang cohort.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!