As pivotal markers of chromatin accessibility, DNase I hypersensitive sites (DHSs) intimately link to fundamental biological processes encompassing gene expression regulation and disease pathogenesis. Developing efficient and precise algorithms for DHSs identification holds paramount importance for unraveling genome functionality and elucidating disease mechanisms. This study innovatively presents iDHS-RGME, an Extremely Randomized Trees (Extra-Trees)-based algorithm that integrates unique feature extraction techniques for enhanced DHSs prediction. Specifically, iDHS-RGME utilizes two feature extraction approaches: Reverse Complementary Kmer (RCKmer) and Geary Spatial Autocorrelation (GSA), which comprehensively capture sequence attributes from diverse angles, bolstering information richness and accuracy. To address data imbalance, Borderline-SMOTE is employed, followed by Maximum Information Coefficient (MIC) for meticulous feature selection. Comparative evaluations underscored the superiority of the Extra-Trees classifier, which was subsequently adopted for model prediction. Through rigorous five-fold cross-validation, iDHS-RGME achieved remarkable accuracies of 94.71 % and 95.07 % on two independent datasets, outperforming previous models in terms of both precision and effectiveness.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.bbrc.2024.150618DOI Listing

Publication Analysis

Top Keywords

dnase hypersensitive
8
hypersensitive sites
8
feature extraction
8
idhs-rgme
4
idhs-rgme identification
4
identification dnase
4
sites integrating
4
integrating nucleotide
4
nucleotide composition
4
composition physicochemical
4

Similar Publications

Article Synopsis
  • Cis-regulatory elements play a key role in gene expression by connecting enhancers and promoters through 3D chromosomal structures, and changes in these systems may be linked to genetic diseases.
  • Klotho, an anti-aging protein important for kidney health, has unclear regulatory mechanisms in chronic kidney disease (CKD).
  • In this study, researchers used chromosome conformation capture to explore the chromatin structure near the Klotho gene in CKD tissues, finding that regulatory activities decreased and specific DNA sites lost their function compared to healthy tissues.
View Article and Find Full Text PDF

Maize, a vital crop globally, faces significant yield losses due to its sensitivity to cold stress, especially in temperate regions. Understanding the molecular mechanisms governing maize response to cold stress is crucial for developing strategies to enhance cold tolerance. However, the precise chromatin-level regulatory mechanisms involved remain largely unknown.

View Article and Find Full Text PDF

miRStart 2.0: enhancing miRNA regulatory insights through deep learning-based TSS identification.

Nucleic Acids Res

January 2025

School of Medicine, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Boulevard, Longgang District, Shenzhen, Guangdong 518172, P.R. China.

MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression by binding to the 3'-untranslated regions of target mRNAs, influencing various biological processes at the post-transcriptional level. Identifying miRNA transcription start sites (TSSs) and transcription factors' (TFs) regulatory roles is crucial for elucidating miRNA function and transcriptional regulation. miRStart 2.

View Article and Find Full Text PDF

Retinoic acid receptor beta () is a transcriptional regulator crucial for coordinating retinoic acid- (RA-) mediated morphogenic movements, cell growth, and differentiation during eye development. Loss- or gain-of-function coding variants have been associated with microphthalmia, coloboma, and anterior segment defects. We identified a variant c.

View Article and Find Full Text PDF

A chemiluminescent sensor based on CRISPR-HCR technology for the hypersensitive detection of .

Anal Methods

November 2024

Engineering Research Center for Molecular Diagnosis, Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming 650500, Yunnan, People's Republic of China.

Tuberculosis is a highly infectious bacterial disease caused by . The spread of this agent has caused serious health problems worldwide, and the rapid and accurate detection of is essential for controlling the spread of infection and for preventing the emergence of multidrug-resistant strains. In this study, the cleavage ability of CRISPR-Cas12a against single-stranded DNA was combined with hybridization chain reaction and chemiluminescent signal to establish an imaging sensor for the hypersensitive detection of DNA.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!