Courses of SARS-CoV-2 infections are highly variable, ranging from asymptomatic to lethal COVID-19. Though research has shown that host genetic factors contribute to this variability, cohort-based joint analyses of variants from the entire allelic spectrum in individuals with confirmed SARS-CoV-2 infections are still lacking. Here, we present the results of whole genome sequencing in 1,220 mainly vaccine-naïve individuals with confirmed SARS-CoV-2 infection, including 827 hospitalized COVID-19 cases.
Introduction: Formal genetics studies show that smoking is influenced by genetic factors; exploring this on the molecular level can offer deeper insight into the etiology of smoking behaviours.
Methods: Summary statistics from the latest wave of the GWAS and Sequencing Consortium of Alcohol and Nicotine (GSCAN) were used to calculate polygenic risk scores (PRS) in a sample of ~2200 individuals who smoke/individuals who never smoked. The associations of smoking status with PRS for Smoking Initiation (i.
Dupuytren's disease (DD) is a highly heritable fibrotic disorder of the hand with incompletely understood etiology. A number of genetic loci, including Wnt signaling members, have been previously identified. Our overall aim was to identify novel genetic loci, to prioritize genes within the loci for functional studies, and to assess genetic correlation with associated disorders.
Motivation: We introduce SMapper, a novel web and software tool for visualizing spatial prevalence data of all types including those suffering from incomplete geographic coverage and insufficient sample sizes. We demonstrate the benefits of our tool in overcoming interpretational issues with existing tools caused by such data limitations. We exemplify the use of SMapper by applications to human genotype and phenotype data relevant in an epidemiological, anthropological and forensic context.
Motivation: Pathogenic copy-number variants (CNVs) can cause a heterogeneous spectrum of rare and severe disorders. However, most CNVs are benign and are part of natural variation in human genomes. CNV pathogenicity classification, genotype-phenotype analyses, and therapeutic target identification are challenging and time-consuming tasks that require the integration and analysis of information from multiple scattered sources by experts.
Clinically identified genetic variants in ion channels can be benign or cause disease by increasing or decreasing the protein function. As a consequence, therapeutic decision-making is challenging without molecular testing of each variant. Our biophysical knowledge of ion-channel structures and function is just emerging, and it is currently not well understood which amino acid residues cause disease when mutated.
Pleiotropy is a widespread phenomenon that may increase insight into the etiology of biological and disease traits. Since genome-wide association studies frequently provide information on a single trait only, only univariate pleiotropy detection methods are applicable, and their comparative performance is as yet unknown. Here, we compared five such methods with respect to their ability to detect pleiotropy: meta-analysis, ASSET, conditional false discovery rate (cFDR), cross-phenotype Bayes (CPBayes), and pleiotropic analysis under the composite null hypothesis (PLACO). To this end, we performed extended computer simulations that varied the underlying etiological model for pleiotropy for a pair of traits, including the number of causal variants, degree of traits' overlap, effect sizes, trait prevalence, and sample sizes.
Background: Analyses of a few gene-sets in epilepsy showed a potential to unravel key disease associations. We set out to investigate the burden of ultra-rare variants (URVs) in a comprehensive range of biologically informed gene-sets presumed to be implicated in epileptogenesis.
Methods: Using whole exome sequencing data collected by the Epi25 Collaborative, the burden of 12 URV types in 92 gene-sets was compared between individuals of European descent with developmental and epileptic encephalopathies (DEE, n = 1,003), genetic generalized epilepsy (GGE, n = 3,064), or non-acquired focal epilepsy (NAFE, n = 3,522) and 3,962 ancestry-matched controls.
We detail the development of the ancestry informative single nucleotide polymorphisms (SNPs) panel forming part of the VISAGE Basic Tool (BT), which combines 41 appearance predictive SNPs and 112 ancestry predictive SNPs (three SNPs shared between sets) in one massively parallel sequencing (MPS) multiplex, whereas blood-based age analysis using methylation markers is run in a parallel MPS analysis pipeline. The selection of SNPs for the BT ancestry panel focused on established forensic markers that already have a proven track record of good sequencing performance in MPS, and the overall SNP multiplex scale closely matched that of existing forensic MPS assays. SNPs were chosen to differentiate individuals from the five main continental population groups of Africa, Europe, East Asia, America, and Oceania, extended to include differentiation of individuals from South Asia.
Sarcoidosis and chronic beryllium disease (CBD) are phenocopies; however, the latter has a clear trigger factor, namely beryllium exposure. This study analyses single nucleotide polymorphisms (SNPs) in a large cohort of beryllium-exposed persons. SNPs were chosen for their relevance in sarcoidosis.
The prediction of human externally visible characteristics (EVCs) based solely on DNA information has become an established approach in forensic and anthropological genetics in recent years. While predictive models for a large set of EVCs have already been established using multinomial logistic regression (MLR), the prediction performance of other possible classification methods has not been thoroughly investigated thus far. Motivated by the question of whether another classifier can outperform these trait-specific models, we conducted a systematic comparison between the widely used MLR and three popular machine learning (ML) classifiers, namely support vector machines (SVM), random forest (RF) and artificial neural networks (ANN), that have shown good performance outside EVC prediction.
Hearing loss (HL) is one of the most common sensory defects, affecting more than 466 million individuals worldwide. It is clinically and genetically heterogeneous, with over 120 genes causing non-syndromic HL identified to date. Here, we performed exome sequencing (ES) on a cohort of Iranian families with no disease-causing variants in known deafness-associated genes after screening with a targeted gene panel.
The prediction of appearance traits solely from genetic information has become an established approach, and a number of statistical prediction models have already been developed for this purpose. However, given limited knowledge on appearance genetics, currently available models are incomplete and do not include all causal genetic variants as predictors. Therefore, such prediction models may benefit from the inclusion of additional information that acts as a proxy for this unknown genetic background.
Ecuador is a multiethnic and pluricultural country with a complex history defined by migration and admixture processes. The present study aims to increase our knowledge on the Ecuadorian Native Amerindian groups and the unique South American Y-chromosome haplogroup C3-MPB373 through the analysis of up to 23 Y-chromosome STRs (Y-STRs) and several Y-SNPs in a sample of 527 Ecuadorians from 7 distinct populations and geographic areas, including Kichwa and non-Kichwa Native Amerindians, Mestizos and Afro-Ecuadorians. Our results reveal the presence of C3-MPB373 both in the Amazonian lowland Kichwa with frequencies up to 28 % and, for the first time, in notable proportions in Kichwa populations from the Ecuadorian highlands.
Predicting appearance phenotypes from genotypes is relevant for various areas of human genetic research and its applications, such as genetic epidemiology, human history, anthropology, and particularly forensics. Many appearance phenotypes, and thus their underlying genotypes, are highly correlated, with pigmentation traits serving as primary examples. However, all available genetic prediction models, including those for pigmentation traits currently used in forensic DNA phenotyping, ignore phenotype correlations.
Region-based genome-wide scans are usually performed by use of a priori chosen analysis regions. Such an approach will likely miss the region comprising the strongest signal and, thus, may result in increased type II error rates and decreased power. Here, we propose a genomic exhaustive scan approach that analyzes all possible subsequences and does not rely on a prior definition of the analysis regions.
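The idea of scanning all possible subsequences can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the test statistic, so the sketch assumes, purely for illustration, that each variant carries a numeric score and that a region is scored by the sum of its variants' scores. The function name `exhaustive_scan` is hypothetical.

```python
from typing import List, Tuple


def exhaustive_scan(scores: List[float]) -> Tuple[float, int, int]:
    """Score every contiguous subsequence and return the best one.

    Assumption (not from the source): a region's statistic is the plain
    sum of hypothetical per-variant scores. Returns (best_sum, start, end)
    with 0-based, inclusive indices.
    """
    if not scores:
        raise ValueError("scores must be non-empty")

    # Prefix sums let each region sum be computed in constant time.
    prefix = [0.0]
    for s in scores:
        prefix.append(prefix[-1] + s)

    n = len(scores)
    best = (float("-inf"), 0, 0)
    # Enumerate all n * (n + 1) / 2 contiguous regions; no region is
    # defined in advance.
    for start in range(n):
        for end in range(start, n):
            total = prefix[end + 1] - prefix[start]
            if total > best[0]:
                best = (total, start, end)
    return best


if __name__ == "__main__":
    example = [-1.0, 2.0, 3.0, -4.0, 1.0]
    print(exhaustive_scan(example))
```

The quadratic enumeration makes the contrast with a priori regions explicit: the strongest-scoring window is found wherever it lies, at the cost of evaluating every start and end position.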
To analyze family-based whole-genome sequence (WGS) data for complex traits, we developed a rare variant (RV) non-parametric linkage (NPL) analysis method, which has advantages over association methods. The RV-NPL differs from the NPL in that RVs are analyzed, and allele sharing among affected relative-pairs is estimated only for minor alleles. Analyzing families can increase power because causal variants with familial aggregation usually have larger effect sizes than those underlying sporadic diseases.
Iran, despite its size, geographic location and past cultural influence, has largely been a blind spot for human population genetic studies. With only sparse genetic information on the Iranian population available, we pursued its genome-wide and geographic characterization based on 1021 samples from eleven ethnic groups. We show that Iranians, while close to neighboring populations, present distinct genetic variation consistent with long-standing genetic continuity, harbor high heterogeneity and different levels of consanguinity, separate into a cluster of similar groups and several admixed ones, and have experienced numerous language adoption events in the past.