Promoters play an irreplaceable role in biological processes and genetics, which are responsible for stimulating the transcription and expression of specific genes. Promoter abnormalities have been found in some diseases, and the level of promoter-binding transcription factors can be used as a marker before a disease occurs. Hence, detecting promoters from DNA sequences has important biological significance, particular, distinguishing strong promoters can help to elucidate differences in gene expression and the mechanisms of specific diseases. With the introduction of third-generation sequencing, it is difficult to match the speed of sequencing to the speed of labeling promoters experimentally. Many computing models have been designed to fill this gap and identify unlabeled DNA. However, their feature representation methods are very singular, which cannot reflect the information contained in the original samples. With the aim of avoiding information loss, we propose a computational model based on multiple descriptors and feature selection to jointly express samples. It is worth mentioning that a new feature descriptor called K-mer word vector is defined. The promoter model of multiple feature descriptors dominated by K-mer word vector achieves similar performance to existing methods, the sensitivity of 85.72% can distinguish the promoter more effectively than other methods. Furthermore, the performance of the promoter strength has surpassed published methods, and accuracy of 77.00% greatly improves the ability to distinguish between strong and weak promoters.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ymeth.2022.01.001DOI Listing

Publication Analysis

Top Keywords

detecting promoters
8
multiple descriptors
8
descriptors feature
8
feature selection
8
k-mer word
8
word vector
8
promoters
6
feature
5
dpromoter-xgboost detecting
4
promoters strength
4

Similar Publications

This study investigated whether the neddylation inhibitor MLN4924 induces aberrant DNA methylation patterns in acute myeloid leukemia and contributes to the reactivation of tumor suppressor genes. DNA methylation profiles of Kasumi-1 and KU812 acute myeloid leukemia cell lines before and after MLN4924 treatment were generated using the 850K Methylation BeadChip. RNA sequencing was used to obtain transcriptomic profiles of Kasumi-1 cells.

View Article and Find Full Text PDF

SP1 activates AKT3 to facilitate the development of diabetic nephropathy.

J Endocrinol Invest

January 2025

Department of Endocrinology, Nanshi Hospital of Nanyang, No. 130, West Zhongzhou Road, Nanyang, 473065, China.

Background: Diabetic nephropathy (DN) is a severe complication of diabetes mellitus and has the complex pathogenesis. The previous study reported that protein kinase Bγ (AKT3) was involved in DN progression. Our aim was to explore the detailed mechanisms of AKT3 in DN development.

View Article and Find Full Text PDF

Aim: Microcalcification increases the vulnerability of plaques and has become an important driver of acute cardiovascular events in diabetic patients. However, the regulatory mechanisms remain unclear. DJ-1, a multifunctional protein, may play a potential role in the development of diabetic complications.

View Article and Find Full Text PDF

Background: It is now widely acknowledged that diet, lifestyle, and environmental exposures largely affect an individual's metabolic state in health and disease, including the brain. Metabolomics has demonstrated its potential to enable exciting discoveries in brain health, facilitated by advances in analytical and informatics techniques. Here, we highlighted the use of MS/MS-based untargeted metabolomics to study the diet and medication exposure of cognitively declined cohorts through the newly developed FoodMASST and DrugMASST tools.

View Article and Find Full Text PDF

Background: Whole genome methylation sequencing (WGMS) in blood identifies extensive differential DNA methylation between persons who are cognitively unimpaired (CU) and those with late-onset dementia due to Alzheimer's disease (AD). Here we investigate differentially methylated positions (DMPs) in persons with mild cognitive impairment (MCI) compared to persons with and without AD.

Method: WGMS data quantified DNA methylation levels at 25,406,945 CpG loci in 382 blood samples from 99 persons with MCI, 109 persons with AD and 174 cognitively unimpaired persons in the Wisconsin Alzheimer's Disease Research Center (WADRC) and the Wisconsin Registry for Alzheimer's Prevention (WRAP).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!