Motivation: As sequencing technologies and analysis pipelines evolve, de novo mutation (DNM) calling tools must be adapted. Therefore, a flexible approach is needed that can accurately identify DNMs from genome or exome sequences from a variety of datasets and variant calling pipelines.

Results: Here, we describe SynthDNM, a random-forest based classifier that can be readily adapted to new sequencing or variant-calling pipelines by applying a flexible approach to constructing simulated training examples from real data. The optimized SynthDNM classifiers predict de novo SNPs and indels with robust accuracy across multiple methods of variant calling.

Availabilityand Implementation: SynthDNM is freely available on Github (https://github.com/james-guevara/synthdnm).

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545295PMC
http://dx.doi.org/10.1093/bioinformatics/btab225DOI Listing

Publication Analysis

Top Keywords

novo mutation
8
variant calling
8
flexible approach
8
customized novo
4
mutation detection
4
detection variant
4
calling pipeline
4
synthdnm
4
pipeline synthdnm
4
synthdnm motivation
4

Similar Publications

Lotus japonicus-ROOT HAIR LESS1-LIKE1 (LRL1) of Arabidopsis thaliana encodes a basic helix-loop-helix (bHLH) transcription factor (TF) involved in root hair development. Root hair development is regulated by an elaborate transcriptional network, in which GLABRA2 (GL2), a key negative regulator, directly represses bHLH TF genes, including LRL1 and ROOT HAIR DEFECTIVE6 (RHD6). Although RHD6 and its paralogous TFs have been shown to connect downstream to genes involved in cell morphological events such as endomembrane and cell wall modification, the network downstream of LRL1 remains elusive.

View Article and Find Full Text PDF

Drug discovery continues to face a staggering 90% failure rate, with many setbacks occurring during late-stage clinical trials. To address this challenge, there is an increasing focus on developing and evaluating new technologies to enhance the "design" and "test" phases of antibody-based drugs (e.g.

View Article and Find Full Text PDF

Intracellular protein production in bacteria is limited by the need for lysis and costly purification. A promising alternative is to engineer the host organism for protein secretion. While the serovar Typhimurium ( .

View Article and Find Full Text PDF

Case report: Multisystemic smooth muscle dysfunction syndrome: a rare genetic cause of infantile interstitial lung disease.

Front Pharmacol

January 2025

Respiratory Department II, National Clinical Research Center for Respiratory Diseases, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, Beijing, China.

Multisystemic smooth muscle dysfunction syndrome (MSMDS) is an autosomal dominant disorder caused by mutations in the gene, resulting in variable clinical manifestation and multi-organ dysfunction. Interstitial lung disease (ILD) is a rare phenotype of this condition. We describe a rare infant case of an 8-month-old boy who presented with progressively worsening dyspnea, along with intermittent episodes of respiratory distress and cyanosis since birth.

View Article and Find Full Text PDF

Van der Woude syndrome (VWS) is an autosomal dominant disorder characterized by lower lip pits and orofacial clefts (OFCs). With a prevalence of approximately 1 in 35,000 live births, it is the most common form of syndromic clefting and may account for ~2% of all OFCs. The majority of VWS is attributed to genetic variants in IRF6 (~70%) or GRHL3 (~5%), leaving up to 25% of individuals with VWS without a molecular diagnosis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!