Genome-Wide Association Studies, or GWAS, aim at finding Single Nucleotide Polymorphisms (SNPs) that are associated with a phenotype of interest. GWAS are known to suffer from the large dimensionality of the data with respect to the number of available samples. Other limiting factors include the dependency between SNPs, due to linkage disequilibrium (LD), and the need to account for population structure, that is to say, confounding due to genetic ancestry.We propose an efficient approach for the multivariate analysis of multi-population GWAS data based on a multitask group Lasso formulation. Each task corresponds to a subpopulation of the data, and each group to an LD-block. This formulation alleviates the curse of dimensionality, and makes it possible to identify disease LD-blocks shared across populations/tasks, as well as some that are specific to one population/task. In addition, we use stability selection to increase the robustness of our approach. Finally, gap safe screening rules speed up computations enough that our method can run at a genome-wide scale.To our knowledge, this is the first framework for GWAS on diverse populations combining feature selection at the LD-groups level, a multitask approach to address population structure, stability selection, and safe screening rules. We show that our approach outperforms state-of-the-art methods on both a simulated and a real-world cancer datasets.
Download full-text PDF |
Source |
---|
Sci Rep
January 2025
Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing, 100081, China.
Speech-to-speech translation (S2ST) has evolved from cascade systems which integrate Automatic Speech Recognition (ASR), Machine Translation (MT), and Text-to-Speech (TTS), to end-to-end models. This evolution has been driven by advancements in model performance and the expansion of cross-lingual speech datasets. Despite the paucity of research on Tibetan speech translation, this paper endeavors to tackle the challenge of Tibetan-to-Chinese direct speech-to-speech translation within the multi-task learning framework, employing self-supervised learning (SSL) and sequence-to-sequence model training.
View Article and Find Full Text PDFNat Commun
January 2025
School of Life Sciences, Northwestern Polytechnical University, Xi'an, China.
Inferring appropriate synthesis reaction (i.e., retrosynthesis) routes for newly designed molecules is vital.
View Article and Find Full Text PDFJ Am Med Dir Assoc
January 2025
School of Public Health, Shunde Women and Children's Hospital, Guangdong Medical University, Dongguan, China; Precision Key Laboratory of Public Health, Guangdong Medical University, Dongguan, China. Electronic address:
Objectives: The 3 most frequently utilized frailty assessment measures are the Fried criteria, FRAIL scale, and Frailty Index (FI). This study aimed to compare predictive capabilities of these 3 measures regarding all-cause mortality in the United States and to identify the key predictive variables.
Design: Cross-sectional study.
Objectives: To determine the perception of female community health volunteers (FCHVs) in terms of their scope of work, impact of work on their professional experiences and their coping strategies and stakeholders' perception of FCHVs programme, their contribution to the health sector and its sustainability.
Design: A qualitative study involving in-depth interviews (IDIs) with FCHVs and key informant interviews (KIIs) with local stakeholders. All the interviews were conducted through telephone.
J Chem Inf Model
January 2025
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.
The accurate identification of protein-nucleotide binding residues is crucial for protein function annotation and drug discovery. Numerous computational methods have been proposed to predict these binding residues, achieving remarkable performance. However, due to the limited availability and high variability of nucleotides, predicting binding residues for diverse nucleotides remains a significant challenge.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!