SNPAAMapper: An efficient genome-wide SNP variant analysis pipeline for next-generation sequencing data.

Bioinformation

Morgridge Institute for Research, University of Wisconsin-Madison, 330 N Orchard St, Madison, WI 53715, U.S.A.

Published: November 2013

Unlabelled: Many NGS analysis tools focusing on read alignment and variant calling functions for exome sequencing data have been developed in recent years. However, publicly available tools dealing with the downstream analysis of genome-wide variants are fewer and have limited functionality. We developed SNPAAMapper, a novel variant analysis pipeline that can effectively classify variants by region (e.g. CDS, UTRs, intron, upstream, downstream), predict amino acid change type (e.g. synonymous, non-synonymous mutation), and prioritize mutation effects (e.g. CDS versus UTRs). Additional functionality afforded by our pipeline includes: checking variants at exon/intron junctions, customized homozygosity and allele frequency cutoff parameters, and annotation of known variants with dbSNP information, listing original and mutated amino acid sequences containing variants. The final result is reported in a spreadsheet format table containing all variant associated information and prioritized amino acids effects for investigators to examine.

Availability: Perl scripts and required input files are available on the web at http://www.ccmb.med.umich.edu/ccdu /SNPAAMapper.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3819573PMC
http://dx.doi.org/10.6026/97320630009870DOI Listing

Publication Analysis

Top Keywords

variant analysis
8
analysis pipeline
8
sequencing data
8
amino acid
8
variants
5
snpaamapper efficient
4
efficient genome-wide
4
genome-wide snp
4
variant
4
snp variant
4

Similar Publications

Warfarin is the most widely used oral anticoagulant in clinical practice. The cytochrome P450 2C9 (CYP2C9), vitamin K epoxide reductase complex 1 (VKORC1), and cytochrome P450 4F2 (CYP4F2) genotypes are associated with warfarin dose requirements in China. Accurate genotyping is vital for obtaining reliable genotype-guided warfarin dosing information.

View Article and Find Full Text PDF

Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN.

Nat Commun

December 2024

State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking University, 100871, Beijing, China.

Deciphering how noncoding DNA determines gene expression is critical for decoding the functional genome. Understanding the transcription effects of noncoding genetic variants are still major unsolved problems, which is critical for downstream applications in human genetics and precision medicine. Here, we integrate regulatory-specific neural networks and tissue-specific gradient-boosting trees to build SVEN: a hybrid sequence-oriented architecture that can accurately predict tissue-specific gene expression level and quantify the tissue-specific transcriptomic impacts of structural variants across more than 350 tissues and cell lines.

View Article and Find Full Text PDF

The general control non-repressible 5 (GCN5)-related N-acetyltransferase (GNAT) SbzI, in the biosynthesis of the sulfonamide antibiotic altemicidin, catalyzes the transfer of the 2-sulfamoylacetyl (2-SA) moiety onto 6-azatetrahydroindane dinucleotide. While most GNAT superfamily utilize acyl-coenzyme A (acyl-CoA) as substrates, SbzI recognizes a carrier-protein (CP)-tethered 2-SA substrate. Moreover, SbzI is the only naturally occurring enzyme that catalyzes the direct incorporation of sulfonamide, a valuable pharmacophore in medicinal chemistry.

View Article and Find Full Text PDF

Limited whole genome sequencing (WGS) studies in Asian populations result in a lack of representative reference panels, thus hindering the discovery of ancestry-specific variants. Here, we present the South and East Asian reference Database (SEAD) panel ( https://imputationserver.westlake.

View Article and Find Full Text PDF

Highly mutable pathogens generate viral diversity that impacts virulence, transmissibility, treatment, and thwarts acquired immunity. We previously described C19-SPAR-Seq, a high-throughput, next-generation sequencing platform to detect SARS-CoV-2 that we here deployed to systematically profile variant dynamics of SARS-CoV-2 for over 3 years in a large, North American urban environment (Toronto, Canada). Sequencing of the ACE2 receptor binding motif and polybasic furin cleavage site of the Spike gene in over 70,000 patients revealed that population sweeps of canonical variants of concern (VOCs) occurred in repeating wavelets.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!