Decoding the cell type-specific transcription factor (TF) binding landscape at single-nucleotide resolution is crucial for understanding the regulatory mechanisms underlying many fundamental biological processes and human diseases. However, limits on time and resources restrict the high-resolution experimental measurements of TF binding profiles of all possible TF-cell type combinations. Previous computational approaches either cannot distinguish the cell context-dependent TF binding profiles across diverse cell types or can only provide a relatively low-resolution prediction. Here we present a novel deep learning approach, Leopard, for predicting TF binding sites at single-nucleotide resolution, achieving the average area under receiver operating characteristic curve (AUROC) of 0.982 and the average area under precision recall curve (AUPRC) of 0.208. Our method substantially outperformed the state-of-the-art methods Anchor and FactorNet, improving the predictive AUPRC by 19% and 27%, respectively, when evaluated at 200-bp resolution. Meanwhile, by leveraging a many-to-many neural network architecture, Leopard features a hundredfold to thousandfold speedup compared with current many-to-one machine learning methods.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015851 | PMC |
http://dx.doi.org/10.1101/gr.269613.120 | DOI Listing |
EBioMedicine
January 2025
Department of Chemistry, Key Laboratory of Bioorganic Phosphorus Chemistry & Chemical Biology, Tsinghua University, New Cornerstone Science Foundation, Beijing, 100084, China. Electronic address:
Background: The widespread and evolution of RNA viruses, such as the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), highlights the importance of fast identification of virus subtypes, particularly in non-laboratory settings. Rapid and inexpensive at-home testing of viral nucleic acids with single-base resolution remains a challenge.
Methods: Topologically constrained DNA ring is engineered as substrates for the trans-cleavage of Cas13a to yield an accelerated post isothermal amplification.
Int J Mol Sci
January 2025
College of Landscape Architecture and Horticulture Sciences, Southwest Research Center for Engineering Technology of Landscape Architecture (State Forestry and Grassland Administration), Yunnan Engineering Research Center for Functional Flower Resources and Industrialization, Research and Development Center of Landscape Plants and Horticulture Flowers, Southwest Forestry University, Kunming 650224, China.
is a genus of functional herbaceous plants in the Balsaminaceae, which are not only of great ornamental value and one of the world's top three flower bedding plants but also have a wide range of medicinal and edible uses. Currently, the taxonomy and phylogenetic relationships of species are still controversial. In order to better understand their chloroplast properties and phylogenetic evolution, nine plants (, , , , , , , , ) were sequenced, and their complete chloroplast genomes were analysed.
View Article and Find Full Text PDFForensic Sci Int Genet
January 2025
Department of Genetics, Genomics & Cancer Sciences, University of Leicester, University Road, Leicester, UK. Electronic address:
Kinship determination is a valuable tool in forensic genetics, with applications including familial searching, disaster victim identification, and investigative genetic genealogy. Conventional typing of small numbers of autosomal short tandem repeats (STRs) confidently identifies only first-degree relatives. Massively parallel sequencing (MPS) can access more STRs and resolve alleles identical by length but differing in sequence (isoalleles), which may increase the power of kinship estimation, particularly when combined with additional sequenced single nucleotide polymorphism (SNP) loci, as in the ForenSeq DNA Signature Prep kit.
View Article and Find Full Text PDFPlant Biotechnol J
January 2025
Department of Plant Breeding and Biotechnology, Centro IFAPA de Málaga, Andalusian Institute of Agricultural and Fisheries Research and Training (IFAPA), Málaga, Spain.
Carotenoids are a diverse group of pigments imparting red, orange, and yellow hues to many horticultural plants, also enhancing their nutritional properties and health benefits. In strawberry, the genetic and molecular mechanisms regulating the natural variation of fruit carotenoid composition remain largely unexplored. In this study, we use a population segregating in yellow/white flesh to detect a major quantitative trait locus (QTL), qYellow Flesh-4B, located on chromosome 4B and accounting for 82% of total phenotypic variation.
View Article and Find Full Text PDFBMC Biol
January 2025
Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China.
Background: The advancements in second-/third-generation sequencing technologies, alongside computational innovations, have significantly enhanced our understanding of the genomic structure of Y-chromosomes and their unique phylogenetic characteristics. These researches, despite the challenges posed by the lack of population-scale genomic databases, have the potential to revolutionize our approach to high-resolution, population-specific Y-chromosome panels and databases for anthropological and forensic applications.
Objectives: This study aimed to develop the highest-resolution Y-targeted sequencing panel, utilizing time-stamped, core phylogenetic informative mutations identified from high-coverage sequences in the YanHuang cohort.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!