Computational methods for assessing the likely impacts of mutations, known as variant effect predictors (VEPs), are widely used in the assessment and interpretation of human genetic variation, as well as in other applications like protein engineering. Many different VEPs have been released to date, and there is tremendous variability in their underlying algorithms and outputs, and in the ways in which the methodologies and predictions are shared. This leads to considerable challenges for end users in knowing which VEPs to use and how to use them.
Identifying causal mutations accelerates genetic disease diagnosis and therapeutic development. Missense variants present a bottleneck in genetic diagnoses because their effects are less straightforward than those of truncations or nonsense mutations. While computational prediction methods are increasingly successful at predicting the effects of variants in disease genes, they do not generalize well to other genes as the scores are not calibrated across the proteome.
The ability to alter genomes specifically by CRISPR-Cas gene editing has revolutionized biological research, biotechnology, and medicine. Broad therapeutic application of this technology, however, will require thorough preclinical assessment of off-target editing by homology-based prediction coupled with reliable methods for detecting off-target editing. Several off-target site nomination assays exist, but careful comparison is needed to ascertain their relative strengths and weaknesses.
Given the complex relationship between gene expression and phenotypic outcomes, computationally efficient approaches are needed to sift through large high-dimensional datasets in order to identify biologically relevant biomarkers. In this report, we describe a method of identifying the most salient biomarker genes in a dataset, which we call "candidate genes", by evaluating the ability of gene combinations to classify samples from a dataset, which we call "classification potential". Our algorithm, Gene Oracle, uses a neural network to test user-defined gene sets for polygenic classification potential and then uses a combinatorial approach to further decompose selected gene sets into candidate and non-candidate biomarker genes.
Ruminant animals have a symbiotic relationship with the microorganisms in their rumens. In this relationship, rumen microbes efficiently degrade complex plant-derived compounds into smaller digestible compounds, a process that is very likely associated with host animal feed efficiency. The resulting simpler metabolites can then be absorbed by the host and converted into other compounds by host enzymes.