Publications by authors named "Ulfarsson M"

Mutations that affect RNA splicing significantly impact human diversity and disease. Here we present a method using transformers, a type of machine learning model, to detect splicing from raw 45,000-nucleotide sequences. We generate embeddings with residual neural networks and apply hard attention to select splice site candidates, enabling efficient training on long sequences.

View Article and Find Full Text PDF

Mendelian Randomization studies indicate that BMI contributes to various diseases, but it's unclear if this is entirely mediated by BMI itself. This study examines whether disease risk from BMI-associated sequence variants is mediated through BMI or other mechanisms, using data from Iceland and the UK Biobank. The associations of BMI genetic risk score with diseases like fatty liver disease, knee replacement, and glucose intolerance were fully attenuated when conditioned on BMI, and largely for type 2 diabetes, heart failure, myocardial infarction, atrial fibrillation, and hip replacement.

View Article and Find Full Text PDF
Article Synopsis
  • The study aims to investigate the genetic factors associated with accessory atrioventricular pathways (APs) and related heart rhythm disorders using a genome-wide association study (GWAS).
  • It involved analyzing genetic data from over 1,200,000 control individuals and 2,310 individuals with APs from multiple countries and various health databases.
  • Key findings revealed three significant genetic variants linked to APs, particularly in specific genes (CCDC141 and SCN10A), with implications for understanding conditions like paroxysmal supraventricular tachycardia (PSVT).
View Article and Find Full Text PDF

We report a multi-ancestry genome-wide association study on liver cirrhosis and its associated endophenotypes, alanine aminotransferase (ALT) and γ-glutamyl transferase. Using data from 12 cohorts, including 18,265 cases with cirrhosis, 1,782,047 controls, up to 1 million individuals with liver function tests and a validation cohort of 21,689 cases and 617,729 controls, we identify and validate 14 risk associations for cirrhosis. Many variants are located near genes involved in hepatic lipid metabolism.

View Article and Find Full Text PDF

Clonal hematopoiesis (CH) arises when a substantial proportion of mature blood cells is derived from a single hematopoietic stem cell lineage. Using whole-genome sequencing of 45,510 Icelandic and 130,709 UK Biobank participants combined with a mutational barcode method, we identified 16,306 people with CH. Prevalence approaches 50% in elderly participants.

View Article and Find Full Text PDF

High-throughput proteomics platforms measuring thousands of proteins in plasma combined with genomic and phenotypic information have the power to bridge the gap between the genome and diseases. Here we performed association studies of Olink Explore 3072 data generated by the UK Biobank Pharma Proteomics Project on plasma samples from more than 50,000 UK Biobank participants with phenotypic and genotypic data, stratifying on British or Irish, African and South Asian ancestries. We compared the results with those of a SomaScan v4 study on plasma from 36,000 Icelandic people, for 1,514 of whom Olink data were also available.

View Article and Find Full Text PDF

Many sequence variants have additive effects on blood lipid levels and, through that, on the risk of coronary artery disease (CAD). We show that variants also have non-additive effects and interact to affect lipid levels as well as affecting variance and correlations. Variance and correlation effects are often signatures of epistasis or gene-environmental interactions.

View Article and Find Full Text PDF

Importance: Whether protein risk scores derived from a single plasma sample could be useful for risk assessment for atherosclerotic cardiovascular disease (ASCVD), in conjunction with clinical risk factors and polygenic risk scores, is uncertain.

Objective: To develop protein risk scores for ASCVD risk prediction and compare them to clinical risk factors and polygenic risk scores in primary and secondary event populations.

Design, Setting, And Participants: The primary analysis was a retrospective study of primary events among 13 540 individuals in Iceland (aged 40-75 years) with proteomics data and no history of major ASCVD events at recruitment (study duration, August 23, 2000 until October 26, 2006; follow-up through 2018).

View Article and Find Full Text PDF

Background: The TNM system is used to assess prognosis after colorectal cancer (CRC) diagnosis. Other prognostic factors reported include histopathological assessments of the tumour, tumour mutations and proteins in the blood. As some of these factors are strongly correlated, it is important to evaluate the independent effects they may have on survival.

View Article and Find Full Text PDF

Background: Persistent symptoms are common after SARS-CoV-2 infection but correlation with objective measures is unclear.

Methods: We invited all 3098 adults who tested SARS-CoV-2 positive in Iceland before October 2020 to the deCODE Health Study. We compared multiple symptoms and physical measures between 1706 Icelanders with confirmed prior infection (cases) who participated, and 619 contemporary and 13,779 historical controls.

View Article and Find Full Text PDF

The genetic basis of the human vocal system is largely unknown, as are the sequence variants that give rise to individual differences in voice and speech. Here, we couple data on diversity in the sequence of the genome with voice and vowel acoustics in speech recordings from 12,901 Icelanders. We show how voice pitch and vowel acoustics vary across the life span and correlate with anthropometric, physiological, and cognitive traits.

View Article and Find Full Text PDF
Article Synopsis
  • Intracranial volume (ICV), measured through MRI or head circumference, is hereditary and linked to cognitive abilities and neurological disorders.
  • A genome-wide association study identified 64 genetic variants associated with ICV, explaining about 5% of its variation and pointing to 12 key genes involved in cranial conditions.
  • The study found that ICV is connected to brain region sizes and various traits, with Parkinson's disease linked to larger ICV and attention deficit hyperactivity disorder (ADHD) associated with smaller ICV.
View Article and Find Full Text PDF

Nonalcoholic fatty liver (NAFL) and its sequelae are growing health problems. We performed a genome-wide association study of NAFL, cirrhosis and hepatocellular carcinoma, and integrated the findings with expression and proteomic data. For NAFL, we utilized 9,491 clinical cases and proton density fat fraction extracted from 36,116 liver magnetic resonance images.

View Article and Find Full Text PDF

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank.

View Article and Find Full Text PDF

Genomic copy number variants (CNVs) are associated with a high risk of neurodevelopmental disorders. A growing body of genetic studies suggests that these high-risk genetic variants converge in common molecular pathways and that common pathways also exist across clinically distinct disorders, such as schizophrenia and autism spectrum disorder. A key question is how common molecular mechanisms converge into similar clinical outcomes.

View Article and Find Full Text PDF

The plasma proteome can help bridge the gap between the genome and diseases. Here we describe genome-wide association studies (GWASs) of plasma protein levels measured with 4,907 aptamers in 35,559 Icelanders. We found 18,084 associations between sequence variants and levels of proteins in plasma (protein quantitative trait loci; pQTL), of which 19% were with rare variants (minor allele frequency (MAF) < 1%).

View Article and Find Full Text PDF

Predicting all-cause mortality risk is challenging and requires extensive medical data. Recently, large-scale proteomics datasets have proven useful for predicting health-related outcomes. Here, we use measurements of levels of 4,684 plasma proteins in 22,913 Icelanders to develop all-cause mortality predictors both for short- and long-term risk.

View Article and Find Full Text PDF

Low-frequency 1q21.1 distal deletion and duplication copy number variant (CNV) carriers are predisposed to multiple neurodevelopmental disorders, including schizophrenia, autism and intellectual disability. Human carriers display a high prevalence of micro- and macrocephaly in deletion and duplication carriers, respectively.

View Article and Find Full Text PDF

The Enhancing NeuroImaging Genetics through Meta-Analysis copy number variant (ENIGMA-CNV) and 22q11.2 Deletion Syndrome Working Groups (22q-ENIGMA WGs) were created to gain insight into the involvement of genetic factors in human brain development and related cognitive, psychiatric and behavioral manifestations. To that end, the ENIGMA-CNV WG has collated CNV and magnetic resonance imaging (MRI) data from ~49,000 individuals across 38 global research sites, yielding one of the largest studies to date on the effects of CNVs on brain structures in the general population.

View Article and Find Full Text PDF

Olfactory receptor (OR) genes in humans form a special class characterized by unusually high DNA sequence diversity, which should give rise to differences in perception and behavior. In the largest genome-wide association study to date based on olfactory testing, we investigated odor perception and naming with smell tasks performed by 9,122 Icelanders, with replication in a separate sample of 2,204 individuals. We discovered an association between a low-frequency missense variant in TAAR5 and reduced intensity rating of fish odor containing trimethylamine (p.

View Article and Find Full Text PDF

Machine learning algorithms can be trained to estimate age from brain structural MRI. The difference between an individual's predicted and chronological age, predicted age difference (PAD), is a phenotype of relevance to aging and brain disease. Here, we present a new deep learning approach to predict brain age from a T1-weighted MRI.

View Article and Find Full Text PDF

Importance: Recurrent microdeletions and duplications in the genomic region 15q11.2 between breakpoints 1 (BP1) and 2 (BP2) are associated with neurodevelopmental disorders. These structural variants are present in 0.

View Article and Find Full Text PDF