Haploinsufficiency predictions without study bias.

Nucleic Acids Res

MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3PT, UK

Published: September 2015

Any given human individual carries multiple genetic variants that disrupt protein-coding genes, through structural variation, as well as nucleotide variants and indels. Predicting the phenotypic consequences of a gene disruption remains a significant challenge. Current approaches employ information from a range of biological networks to predict which human genes are haploinsufficient (meaning two copies are required for normal function) or essential (meaning at least one copy is required for viability). Using recently available study gene sets, we show that these approaches are strongly biased towards providing accurate predictions for well-studied genes. By contrast, we derive a haploinsufficiency score from a combination of unbiased large-scale high-throughput datasets, including gene co-expression and genetic variation in over 6000 human exomes. Our approach provides a haploinsufficiency prediction for over twice as many genes currently unassociated with papers listed in Pubmed as three commonly-used approaches, and outperforms these approaches for predicting haploinsufficiency for less-studied genes. We also show that fine-tuning the predictor on a set of well-studied 'gold standard' haploinsufficient genes does not improve the prediction for less-studied genes. This new score can readily be used to prioritize gene disruptions resulting from any genetic variant, including copy number variants, indels and single-nucleotide variants.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551909PMC
http://dx.doi.org/10.1093/nar/gkv474DOI Listing

Publication Analysis

Top Keywords

variants indels
8
less-studied genes
8
genes
7
haploinsufficiency
4
haploinsufficiency predictions
4
predictions study
4
study bias
4
bias human
4
human individual
4
individual carries
4

Similar Publications

Somatic DNA Variants in Epilepsy Surgery Brain Samples from Patients with Lesional Epilepsy.

Int J Mol Sci

January 2025

Department of Neuropediatrics, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität Berlin, and Berlin Institute of Health, 13353 Berlin, Germany.

Epilepsy affects 50 million people worldwide and is drug-resistant in approximately one-third of cases. Even when a structural lesion is identified as the epileptogenic focus, understanding the underlying genetic causes is crucial to guide both counseling and treatment decisions. Both somatic and germline DNA variants may contribute to the lesion itself and/or influence the severity of symptoms.

View Article and Find Full Text PDF

Pymetrozine is currently one of the primary insecticides used to control the brown planthopper, Nilaparvata lugens Stål (Hemiptera: Delphacidae), but the long-term effectiveness of this chemical is threatened by growing issues of resistance. Previous studies in a laboratory selected strain of N. lugens, Pym-R, have shown that resistance to pymetrozine can evolve without target-site mutations.

View Article and Find Full Text PDF

Genetic determinants of COVID-19 severity and mortality: Alu 287 bp polymorphism and , , expression in hospitalized patients.

PeerJ

January 2025

Departamento de genética, ecologia e evolução, Laboratório de biologia integrativa, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.

Background: The angiotensin-converting enzyme 2 (ACE2) and the transmembrane serine protease 2 (TMPRSS2) are central human molecules in the SARS-CoV-2 virus-host interaction. Evidence indicates that may influence expression. This study aims to determine whether ACE1, ACE2, and TMPRSS2 mRNA expression levels, along with the ACE1 Alu 287 bp polymorphism (rs4646994), contribute to the severity and mortality of COVID-19.

View Article and Find Full Text PDF

Background: Peritumoral lidocaine infiltration prior to excision is associated with better survival in breast cancer (BC), which led us to hypothesize that innervation to the tumor affects its biology and patient survival. Activity-regulated cytoskeleton-associated protein (ARC) gene expression is known to be regulated by neuronal activity. Therefore, we studied the clinical relevance of ARC gene expression as a surrogate of neuronal activity in BC.

View Article and Find Full Text PDF

Goats typically have double coats, with the outermost coarse hairs providing protection against mechanical and radiation damage. While much attention has been paid to cashmere due to its status as a high-end textile material, there is limited information available on coarse hair. This study aimed to identify genomic variants, such as single nucleotide polymorphisms (SNPs) and insertion/deletions (indels), associated with coarse hair diameter using a genome-wide association study (GWAS).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!