Background: Genomic variations are associated with the metabolism and the occurrence of adverse reactions of many therapeutic agents. The polymorphisms on over 2000 locations of cytochrome P450 enzymes (CYP) due to many factors such as ethnicity, mutations, and inheritance attribute to the diversity of response and side effects of various drugs. The associations of the single nucleotide polymorphisms (SNPs), the internal pharmacokinetic patterns and the vulnerability of specific adverse reactions become one of the research interests of pharmacogenomics. The conventional genomewide association studies (GWAS) mainly focuses on the relation of single or multiple SNPs to a specific risk factors which are a one-to-many relation. However, there are no robust methods to establish a many-to-many network which can combine the direct and indirect associations between multiple SNPs and a serial of events (e.g. adverse reactions, metabolic patterns, prognostic factors etc.). In this paper, we present a novel deep learning model based on generative stochastic networks and hidden Markov chain to classify the observed samples with SNPs on five loci of two genes (CYP2D6 and CYP1A2) respectively to the vulnerable population of 14 types of adverse reactions.
Methods: A supervised deep learning model is proposed in this study. The revised generative stochastic networks (GSN) model with transited by the hidden Markov chain is used. The data of the training set are collected from clinical observation. The training set is composed of 83 observations of blood samples with the genotypes respectively on CYP2D6*2, *10, *14 and CYP1A2*1C, *1 F. The samples are genotyped by the polymerase chain reaction (PCR) method. A hidden Markov chain is used as the transition operator to simulate the probabilistic distribution. The model can perform learning at lower cost compared to the conventional maximal likelihood method because the transition distribution is conditional on the previous state of the hidden Markov chain. A least square loss (LASSO) algorithm and a k-Nearest Neighbors (kNN) algorithm are used as the baselines for comparison and to evaluate the performance of our proposed deep learning model.
Results: There are 53 adverse reactions reported during the observation. They are assigned to 14 categories. In the comparison of classification accuracy, the deep learning model shows superiority over the LASSO and kNN model with a rate over 80 %. In the comparison of reliability, the deep learning model shows the best stability among the three models.
Conclusions: Machine learning provides a new method to explore the complex associations among genomic variations and multiple events in pharmacogenomics studies. The new deep learning algorithm is capable of classifying various SNPs to the corresponding adverse reactions. We expect that as more genomic variations are added as features and more observations are made, the deep learning model can improve its performance and can act as a black-box but reliable verifier for other GWAS studies.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4980789 | PMC |
http://dx.doi.org/10.1186/s12920-016-0207-4 | DOI Listing |
Sci Rep
January 2025
College of Big Data Statistics, Guizhou University of Finance and Economics, Guiyang, 550025, China.
Deep learning has achieved significant success in the field of defect detection; however, challenges remain in detecting small-sized, densely packed parts under complex working conditions, including occlusion and unstable lighting conditions. This paper introduces YOLOv8-n as the core network to propose VEE-YOLO, a robust and high-performance defect detection model. Firstly, GSConv was introduced to enhance feature extraction in depthwise separable convolution and establish the VOVGSCSP module, emphasizing feature reusability for more effective feature engineering.
View Article and Find Full Text PDFSci Rep
January 2025
Institute of Agricultural Information Technology, Henan Academy of Agricultural Sciences, Zhengzhou, 450002, China.
Identification and diagnosis of tobacco diseases are prerequisites for the scientific prevention and control of these ailments. To address the limitations of traditional methods, such as weak generalization and sensitivity to noise in segmenting tobacco leaf lesions, this study focused on four tobacco diseases: angular leaf spot, brown spot, wildfire disease, and frog eye disease. Building upon the Unet architecture, we developed the Multi-scale Residual Dilated Segmentation Model (MD-Unet) by enhancing the feature extraction module and integrating attention mechanisms.
View Article and Find Full Text PDFNature
January 2025
Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
Cis-regulatory elements (CREs) control gene expression and are dynamic in their structure and function, reflecting changes in the composition of diverse effector proteins over time. However, methods for measuring the organization of effector proteins at CREs across the genome are limited, hampering efforts to connect CRE structure to their function in cell fate and disease. Here we developed PRINT, a computational method that identifies footprints of DNA-protein interactions from bulk and single-cell chromatin accessibility data across multiple scales of protein size.
View Article and Find Full Text PDFJ Imaging Inform Med
January 2025
Fujian Medical University, 1 Xue Yuan Road, University Town, Fujian, 350122, China.
Breast cancer ranks as the most prevalent cancer among women globally. Histopathological image analysis stands as one of the most reliable methods for tumor detection. This study aims to utilize deep learning to extract histopathological features and automatically identify tumor information, thereby assisting doctors in high-precision pathological diagnosis.
View Article and Find Full Text PDFJ Imaging Inform Med
January 2025
Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA.
Deep neural networks (DNNs) have demonstrated exceptional performance across various image segmentation tasks. However, the process of preparing datasets for training segmentation DNNs is both labor-intensive and costly, as it typically requires pixel-level annotations for each object of interest. To mitigate this challenge, alternative approaches such as using weak labels (e.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!