Publications by Anthony Marcketta

Publications by authors named "Anthony Marcketta"

Page 1 of 1

Author Correction: A deep catalogue of protein-coding variation in 983,578 individuals.

Kathie Y Sun Xiaodong Bai Siying Chen Suying Bao Chuanyi Zhang Anthony Marcketta

Nature

January 2025

View Article and Find Full Text PDF

Genetic inactivation of zinc transporter SLC39A5 improves liver function and hyperglycemia in obesogenic settings.

Shek Man Chim Kristen Howell John Dronzek Weizhen Wu Cristopher Van Hout Anthony Marcketta

Elife

December 2024

Recent studies have revealed a role for zinc in insulin secretion and glucose homeostasis. Randomized placebo-controlled zinc supplementation trials have demonstrated improved glycemic traits in patients with type II diabetes (T2D). Moreover, rare loss-of-function variants in the zinc efflux transporter reduce T2D risk.

View Article and Find Full Text PDF

Joint testing of rare variant burden scores using non-negative least squares.

Andrey Ziyatdinov Joelle Mbatchou Anthony Marcketta Joshua Backman Sheila Gaynor

Am J Hum Genet

October 2024

Gene-based burden tests are a popular and powerful approach for analysis of exome-wide association studies. These approaches combine sets of variants within a gene into a single burden score that is then tested for association. Typically, a range of burden scores are calculated and tested across a range of annotation classes and frequency bins.

View Article and Find Full Text PDF

Yield of genetic association signals from genomes, exomes and imputation in the UK Biobank.

Sheila M Gaynor Tyler Joseph Xiaodong Bai Yuxin Zou Boris Boutkov Anthony Marcketta

Nat Genet

November 2024

Whole-genome sequencing (WGS), whole-exome sequencing (WES) and array genotyping with imputation (IMP) are common strategies for assessing genetic variation and its association with medically relevant phenotypes. To date, there has been no systematic empirical assessment of the yield of these approaches when applied to hundreds of thousands of samples to enable the discovery of complex trait genetic signals. Using data for 100 complex traits from 149,195 individuals in the UK Biobank, we systematically compare the relative yield of these strategies in genetic association studies.

View Article and Find Full Text PDF

Genetic risk factors for COVID-19 and influenza are largely distinct.

Jack A Kosmicki Anthony Marcketta Deepika Sharma Silvio Alessandro Di Gioia Samantha Batista

Nat Genet

August 2024

Article Synopsis

COVID-19 and influenza are respiratory illnesses caused by different viruses but share some symptoms and clinical risk factors, yet their genetic connections remain poorly understood.
A study involving over 18,000 influenza cases and nearly 276,000 control subjects found no common genetic risk factors between COVID-19 and influenza, revealing specific gene variants linked only to influenza.
The research highlights the potential for targeting cell surface receptors involved in viral entry, showing that manipulating specific genes could lead to treatments that prevent both COVID-19 and influenza infections.

View Article and Find Full Text PDF

A deep catalogue of protein-coding variation in 983,578 individuals.

Kathie Y Sun Xiaodong Bai Siying Chen Suying Bao Chuanyi Zhang Anthony Marcketta

Nature

July 2024

Article Synopsis

Researchers analyzed genetic data from nearly 1 million individuals to create a comprehensive catalogue of human protein-coding variations, shedding light on gene function and the frequency of rare coding variants.
The study identified over 10 million missense and 1.1 million loss-of-function variants, discovering 1,751 novel genes with rare biallelic loss-of-function variants and 3,988 genes intolerant to these variants.
They estimate that 3% of people carry a clinically significant genetic variant and provide public access to their data to enhance genetic interpretation and support precision medicine.

View Article and Find Full Text PDF

A deep catalog of protein-coding variation in 985,830 individuals.

Kathie Y Sun Xiaodong Bai Siying Chen Suying Bao Manav Kapoor Anthony Marcketta

bioRxiv

November 2023

Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants.

View Article and Find Full Text PDF

Genome-wide analysis provides genetic evidence that ACE2 influences COVID-19 risk and yields risk scores associated with severe disease.

Julie E Horowitz Jack A Kosmicki Amy Damask Deepika Sharma Genevieve H L Roberts Anthony Marcketta

Nat Genet

April 2022

Article Synopsis

A genome-wide association study identified a genetic variant (rs190509934) that reduces ACE2 expression by 37% and lowers the risk of SARS-CoV-2 infection by 40%.
The study confirms six previously known genetic risk variants, with four linked to worse outcomes in COVID-19 infected individuals.
A risk score based on common variants was developed, which improves prediction of severe disease beyond just demographic and clinical factors.

View Article and Find Full Text PDF

Genome-wide survey of parent-of-origin-specific associations across clinical traits derived from electronic health records.

Hye In Kim Bin Ye Jeffrey Staples Anthony Marcketta Chuan Gao

HGG Adv

July 2021

Parent-of-origin (PoO) effects refer to the differential phenotypic impacts of genetic variants dependent on their parental inheritance due to imprinting. While PoO effects can influence complex traits, they may be poorly captured by models that do not differentiate the parental origin of the variant. The aim of this study was to conduct a genome-wide screen for PoO effects on a broad range of clinical traits derived from electronic health records (EHR) in the DiscovEHR study enriched with familial relationships.

View Article and Find Full Text PDF

Exome sequencing and analysis of 454,787 UK Biobank participants.

Joshua D Backman Alexander H Li Anthony Marcketta Dylan Sun Joelle Mbatchou

Nature

November 2021

A major goal in human genetics is to use natural variation to understand the phenotypic consequences of altering each protein-coding gene in the genome. Here we used exome sequencing to explore protein-altering variants and their consequences in 454,787 participants in the UK Biobank study. We identified 12 million coding variants, including around 1 million loss-of-function and around 1.

View Article and Find Full Text PDF

Genome-wide association analysis of serum alanine and aspartate aminotransferase, and the modifying effects of BMI in 388k European individuals.

Chuan Gao Anthony Marcketta Joshua D Backman Colm O'Dushlaine Jeffrey Staples

Genet Epidemiol

September 2021

Serum alanine aminotransferase (ALT) and aspartate aminotransferase (AST) are biomarkers for liver health. Here we report the largest genome-wide association analysis to date of serum ALT and AST levels in over 388k people of European ancestry from UK biobank and DiscovEHR. Eleven million imputed markers with a minor allele frequency (MAF) ≥ 0.

View Article and Find Full Text PDF

Pan-ancestry exome-wide association analyses of COVID-19 outcomes in 586,157 individuals.

Jack A Kosmicki Julie E Horowitz Nilanjana Banerjee Rouel Lanche Anthony Marcketta

Am J Hum Genet

July 2021

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) causes coronavirus disease 2019 (COVID-19), a respiratory illness that can result in hospitalization or death. We used exome sequence data to investigate associations between rare genetic variants and seven COVID-19 outcomes in 586,157 individuals, including 20,952 with COVID-19. After accounting for multiple testing, we did not identify any clear associations with rare variants either exome wide or when specifically focusing on (1) 13 interferon pathway genes in which rare deleterious variants have been reported in individuals with severe COVID-19, (2) 281 genes located in susceptibility loci identified by the COVID-19 Host Genetics Initiative, or (3) 32 additional genes of immunologic relevance and/or therapeutic potential.

View Article and Find Full Text PDF

Computationally efficient whole-genome regression for quantitative and binary traits.

Joelle Mbatchou Leland Barnard Joshua Backman Anthony Marcketta Jack A Kosmicki

Nat Genet

July 2021

Genome-wide association analysis of cohorts with thousands of phenotypes is computationally expensive, particularly when accounting for sample relatedness or population structure. Here we present a novel machine-learning method called REGENIE for fitting a whole-genome regression model for quantitative and binary phenotypes that is substantially faster than alternatives in multi-trait analyses while maintaining statistical efficiency. The method naturally accommodates parallel analysis of multiple phenotypes and requires only local segments of the genotype matrix to be loaded in memory, in contrast to existing alternatives, which must load genome-wide matrices into memory.

View Article and Find Full Text PDF

Exome sequencing and characterization of 49,960 individuals in the UK Biobank.

Cristopher V Van Hout Ioanna Tachmazidou Joshua D Backman Joshua D Hoffman Daren Liu Anthony Marcketta

Nature

October 2020

The UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world. Here we describe the release of exome-sequence data for the first 49,960 study participants, revealing approximately 4 million coding variants (of which around 98.6% have a frequency of less than 1%).

View Article and Find Full Text PDF

Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls.

Jason Flannick Josep M Mercader Christian Fuchsberger Miriam S Udler Anubha Mahajan Anthony Marcketta

Nature

June 2019

Protein-coding genetic variants that strongly affect disease risk can yield relevant clues to disease pathogenesis. Here we report exome-sequencing analyses of 20,791 individuals with type 2 diabetes (T2D) and 24,440 non-diabetic control participants from 5 ancestries. We identify gene-level associations of rare variants (with minor allele frequencies of less than 0.

View Article and Find Full Text PDF

Genetic inactivation of ANGPTL4 improves glucose homeostasis and is associated with reduced risk of diabetes.

Viktoria Gusarova Colm O'Dushlaine Tanya M Teslovich Peter N Benotti Tooraj Mirshahi Anthony Marcketta

Nat Commun

June 2018

Angiopoietin-like 4 (ANGPTL4) is an endogenous inhibitor of lipoprotein lipase that modulates lipid levels, coronary atherosclerosis risk, and nutrient partitioning. We hypothesize that loss of ANGPTL4 function might improve glucose homeostasis and decrease risk of type 2 diabetes (T2D). We investigate protein-altering variants in ANGPTL4 among 58,124 participants in the DiscovEHR human genetics study, with follow-up studies in 82,766 T2D cases and 498,761 controls.

View Article and Find Full Text PDF

Detecting, quantifying, and discriminating the mechanism of mosaic chromosomal aneuploidies using MAD-seq.

Yu Kong Esther R Berko Anthony Marcketta Shahina B Maqbool Claudia A Simões-Pires

Genome Res

July 2018

Current approaches to detect and characterize mosaic chromosomal aneuploidy are limited by sensitivity, efficiency, cost, or the need to culture cells. We describe the mosaic aneuploidy detection by massively parallel sequencing (MAD-seq) capture assay and the analytical approach that allow low (<10%) levels of mosaicism for chromosomal aneuploidy or regional loss of heterozygosity to be detected, assigned to a meiotic or mitotic origin, and quantified as a proportion of the cells in the sample. We show results from a multi-ethnic MAD-seq (meMAD-seq) capture design that works equally well in populations of diverse racial and ethnic origins and how the analytical approach can be applied to exome or whole-genome sequencing data, revealing previously unrecognized aneuploidy or copy number neutral loss of heterozygosity in samples studied by the 1000 Genomes Project, cell lines from public repositories, and one of the Illumina Platinum Genomes samples.

View Article and Find Full Text PDF

Genome-wide Study of Atrial Fibrillation Identifies Seven Risk Loci and Highlights Biological Pathways and Regulatory Elements Involved in Cardiac Development.

Jonas B Nielsen Lars G Fritsche Wei Zhou Tanya M Teslovich Oddgeir L Holmen Anthony Marcketta

Am J Hum Genet

January 2018

Atrial fibrillation (AF) is a common cardiac arrhythmia and a major risk factor for stroke, heart failure, and premature death. The pathogenesis of AF remains poorly understood, which contributes to the current lack of highly effective treatments. To understand the genetic variation and biology underlying AF, we undertook a genome-wide association study (GWAS) of 6,337 AF individuals and 61,607 AF-free individuals from Norway, including replication in an additional 30,679 AF individuals and 278,895 AF-free individuals.

View Article and Find Full Text PDF

Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study.

Frederick E Dewey Michael F Murray John D Overton Lukas Habegger Joseph B Leader Anthony Marcketta

Science

December 2016

The DiscovEHR collaboration between the Regeneron Genetics Center and Geisinger Health System couples high-throughput sequencing to an integrated health care system using longitudinal electronic health records (EHRs). We sequenced the exomes of 50,726 adult participants in the DiscovEHR study to identify ~4.2 million rare single-nucleotide variants and insertion/deletion events, of which ~176,000 are predicted to result in a loss of gene function.

View Article and Find Full Text PDF

Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences.

G David Poznik Yali Xue Fernando L Mendez Thomas F Willems Andrea Massaia Anthony Marcketta

Nat Genet

June 2016

We report the sequences of 1,244 human Y chromosomes randomly ascertained from 26 worldwide populations by the 1000 Genomes Project. We discovered more than 65,000 variants, including single-nucleotide variants, multiple-nucleotide variants, insertions and deletions, short tandem repeats, and copy number variants. Of these, copy number variants contribute the greatest predicted functional impact.

View Article and Find Full Text PDF