Human leukocyte antigen (HLA) genes are associated with a variety of diseases, yet direct typing of HLA alleles is both time-consuming and costly. Consequently, various imputation methods that leverage sequential single nucleotide polymorphism (SNP) data have been proposed, employing either statistical or deep learning models, such as the convolutional neural network (CNN)-based model DEEP*HLA. However, these methods show limited imputation efficiency for infrequent alleles and require a large reference dataset. In this context, we have developed a Transformer-based model for HLA allele imputation, named "HLA Reliable IMputatioN by Transformer (HLARIMNT)", designed to exploit the sequential nature of SNP data. We evaluated HLARIMNT's performance using two distinct reference panels, the Pan-Asian reference panel (n = 530) and the Type 1 Diabetes Genetics Consortium (T1DGC) reference panel (n = 5225), alongside a combined panel (n = 1060). HLARIMNT demonstrated superior accuracy to DEEP*HLA across several indices, particularly for infrequent alleles. Furthermore, we explored the impact of varying training data sizes on imputation accuracy, finding that HLARIMNT consistently outperformed DEEP*HLA across all data sizes. These findings suggest that Transformer-based models can efficiently impute not only HLA types but potentially other gene types from sequential SNP data.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11422163 | PMC |
http://dx.doi.org/10.1038/s10038-024-01278-x | DOI Listing |
J Neurophysiol
January 2025
Department of Anesthesiology, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, China.
Despite a significant genetic component to insomnia (heritability: 22-25%), the genetic loci that modulate insomnia risk remain limited. We employed the Unified Test for Molecular Markers (UTMOST) for transcriptome-wide association studies (TWAS) across various tissues, integrating summary statistics from a Genome-Wide Association Study (GWAS) of 462,341 European participants with gene expression data from the Genotype-Tissue Expression (GTEx) project. Three validation methods (FUSION, FOCUS, and MAGMA) were used to confirm important genes.
J Clin Invest
January 2025
State Key Laboratory of Oncology in South China, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center - Zhongshan School of Medicine.
Nasopharyngeal carcinoma (NPC) presents a substantial clinical challenge due to the limited understanding of its genetic underpinnings. Here we conduct the largest scale whole-exome sequencing association study of NPC to date, encompassing 6,969 NPC cases and 7,100 controls. We unveil 3 germline genetic variants linked to NPC susceptibility: a common rs2276868 in RPL14, a rare rs5361 in SELE, and a common rs1050462 in HLA-B.
J Cancer
January 2025
Department of Otorhinolaryngology, Head and Neck Surgery, Changhua Christian Hospital, Changhua, Taiwan.
Oral squamous cell carcinoma (OSCC) affects a substantial proportion of the Asian population and is influenced by various genetic risk factors. The (), a regulator of the circadian rhythm, has been implicated in certain neoplasms. Accordingly, this study investigated the association between single-nucleotide polymorphisms and clinical manifestations of OSCC.
Front Public Health
January 2025
Department of Pediatric Metabolism and Nutrition, Gazi University, Ankara, Türkiye.
Introduction: Circadian Locomotor Output Cycles Kaput (CLOCK) is one of the transcription factors from the positive end of the molecular clock and regulates biological rhythm in mammals. Studies have shown that genetic variations in the CLOCK genes are associated with chronotype, sleep patterns, obesity, dietary energy, and nutrient consumption. Although interest in the field of chrononutrition continues to increase, investigations into the temporal aspects of dietary habits in adolescents are notably limited.
Int J Med Sci
January 2025
Department of hepatobiliary surgery, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China.
The aim of this study is to utilize two-sample Mendelian randomization (MR) to investigate the potential causal relationship among psoriasis, iridocyclitis, and non-alcoholic fatty liver disease (NAFLD), and to explore any potential mediation effects. Pooled data were derived from the public genome-wide association study (GWAS) in NAFLD (finn-b-NAFLD), iridocyclitis (finn-b-H7_IRIDOCYCLITIS) and psoriasis (finn-b-L12_PSORI_VULG). Univariable MR (UVMR) analysis was implemented to explore the causal relationship among psoriasis, iridocyclitis, and NAFLD, and inverse variance weighting (IVW) was used as the primary analytical method.