Background: The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research.

Results: SeqAnt (Sequence Annotator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds.

Conclusion: SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2955049PMC
http://dx.doi.org/10.1186/1471-2105-11-471DOI Listing

Publication Analysis

Top Keywords

variant sites
20
web service
12
dna sequence
8
second-generation sequencing
8
sequencing platforms
8
clinical geneticists
8
annotate variant
8
open source
8
source web
8
service software
8

Similar Publications

Mitochondria derive the majority of their lipids from other organelles through contact sites. These lipids, primarily phosphoglycerolipids, are the main components of mitochondrial membranes. In the cell, neutral lipids like triacylglycerides (TAGs) are stored in lipid droplets, playing an important role in maintaining cellular health.

View Article and Find Full Text PDF

Collagen stroma interactions within the extracellular microenvironment of breast tissue play a significant role in breast cancer, including risk, progression, and outcomes. Hydroxylation of proline (HYP) is a common post-translational modification directly linked to breast cancer survival and progression. Changes in HYP status lead to alterations in epithelial cell signaling, extracellular matrix remodeling, and immune cell recruitment.

View Article and Find Full Text PDF

Microvirin is a lectin molecule known to have monovalent interaction with glycoprotein gp120. A previously reported high-resolution structural analysis defines the mannobiose-binding cavity of Microvirin. Nonetheless, structure does not directly define the energetics of binding contributions of protein contact residues.

View Article and Find Full Text PDF

Chronic venous insufficiency (CVI), a chronic vascular dysfunction, is a common health problem that causes serious complications such as painful varicose veins and even skin ulcers. Identifying the underlying genetic and epigenetic factors is important for improving the quality of life of individuals with CVI. In the literature, many genes, variants, and miRNAs associated with CVI have been identified through genomic and transcriptomic studies.

View Article and Find Full Text PDF

Background: An estimated 10-15% of all genetic diseases are attributable to variants in noncanonical splice sites, auxiliary splice sites and deep-intronic variants. Most of these unstudied variants are classified as variants of uncertain significance (VUS), which are not clinically actionable. This study investigated two novel splice-altering variants, NM_000390.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!