DNAskew: statistical analysis of base compositional asymmetry and prediction of replication boundaries in the genome sequences.

Acta Biochim Biophys Sin (Shanghai)

Laboratory of Animal Virology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China.

Published: January 2004

Sueoka and Lobry declared respectively that, in the absence of bias between the two DNA strands for mutation and selection, the base composition within each strand should be A=T and C=G (this state is called Parity Rule type 2, PR2). However, the genome sequences of many bacteria, vertebrates and viruses showed asymmetries in base composition and gene direction. To determine the relationship of base composition skews with replication orientation, gene function, codon usage biases and phylogenetic evolution, in this paper a program called DNAskew was developed for the statistical analysis of strand asymmetry and codon composition bias in the DNA sequence. In addition, the program can also be used to predict the replication boundaries of genome sequences. The method builds on the fact that there are compositional asymmetries between the leading and the lagging strand for replication. DNAskew was written in Perl script language and implemented on the LINUX operating system. It works quickly with annotated or unannotated sequences in GBFF (GenBank flatfile) or fasta format. The source code is freely available for academic use at http://www.epizooty.com/pub/stat/DNAskew.

Download full-text PDF

Source
http://dx.doi.org/10.1093/abbs/36.1.16DOI Listing

Publication Analysis

Top Keywords

genome sequences
12
base composition
12
statistical analysis
8
replication boundaries
8
boundaries genome
8
bias dna
8
dnaskew statistical
4
base
4
analysis base
4
base compositional
4

Similar Publications

Catalysis and specifically autocatalysis are the quintessential building blocks of life. Yet, although autocatalytic networks are necessary, they are not sufficient for the emergence of life-like properties, such as replication and adaptation. The ultimate and potentially fatal threat faced by molecular replicators is parasitism; if the polymerase error rate exceeds a critical threshold, even the fittest molecular species will disappear.

View Article and Find Full Text PDF

Diseases that affect the vascular system or the pith are of great economic impact since they can rapidly destroy the affected plants, leading to complete loss in production. Fast and precise identification is thus important to inform containment and management, but many identification methods are slow because they are culture-dependent and they do not reach strain resolution. Here we used culture-independent long-read metagenomic sequencing of DNA extracted directly from stems of two tomato samples that displayed wilt symptoms.

View Article and Find Full Text PDF

Epstein-Barr nuclear antigen 1 (EBNA1), a sequence-specific DNA binding protein of Epstein-Barr virus (EBV), is essential for viral genome replication and maintenance and is therefore an attractive target for the therapeutic intervention of EBV-associated cancers. Several EBNA1-specific inhibitors have demonstrated the ability to block EBNA1 function in vitro, but practical delivery strategies for these inhibitors in vivo are still lacking. Here, we report an intelligent hierarchical targeting theranostic nanosystem (denoted as mZGOCS@MnO-P5) that integrates an azide (N3) terminal dual-targeting peptide (N3-P5), a tumor microenvironment-responsive degradable MnO nanosheet, and a mesoporous ZnGaO:Cr, Sn near-infrared persistent luminescence (NIR-PL) nanosphere (mZGOCS).

View Article and Find Full Text PDF

Mitochondrial endonuclease G (EndoG) contributes to chromosomal degradation when it is released from mitochondria during apoptosis. It is presumed to also have a mitochondrial function because EndoG deficiency causes mitochondrial dysfunction. However, the mechanism by which EndoG regulates mitochondrial function is not known.

View Article and Find Full Text PDF

Many biological systems operate near the physical limits to their performance, suggesting that aspects of their behavior and underlying mechanisms could be derived from optimization principles. However, such principles have often been applied only in simplified models. Here, we explore a detailed mechanistic model of the gap gene network in the embryo, optimizing its 50+ parameters to maximize the information that gene expression levels provide about nuclear positions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!