Features of coding and noncoding sequences based on 3-tuple distributions.

Yi Chuan Xue Bao

National Laboratory of Protein Engineering & Plant Genetic Engineering, Peking University, Beijing 100871, China.

Published: October 2005

The origin of non-coding sequences, especially introns,is an outstanding issue that has been receiving continuous debate for the last two decades. In the current work we use a mathematical model to characterize DNA sequences and find that the 3-tuple distributions in different reading frames of a given coding sequence differ sharply from each other, while they are almost identical to each other in introns or other non-coding sequences. SREs (Symmetric relative entropies) decrease progressively from coding sequences of primitive prokaryotes to those of advanced eukaryotes and from non-coding sequences of low eukaryotes to those of high eukaryotes with a correlation coefficient of 0.86. In silico evolution experiments show that SREs typical of higher eukaryotic introns can be achieved from prokaryotic coding sequences as the mutation ratio reaches 2/100. The fact that (a total of 25 introns) from all three different genomes S. pombe, C. elegans and H. sapiens searched are found to share high sequence identity with coding regions indicates that at least some introns may have come directly from CDS (coding sequences). We suggest that SREs may be a useful feature for evolutionary study.

Download full-text PDF

Source

Publication Analysis

Top Keywords

non-coding sequences
12
coding sequences
12
sequences
8
3-tuple distributions
8
sequences sres
8
coding
5
features coding
4
coding noncoding
4
noncoding sequences
4
sequences based
4

Similar Publications

Long non-coding RNAs (lncRNAs) play crucial roles in numerous biological processes and are involved in complex human diseases through interactions with proteins. Accurate identification of lncRNA-protein interactions (LPI) can help elucidate the functional mechanisms of lncRNAs and provide scientific insights into the molecular mechanisms underlying related diseases. While many sequence-based methods have been developed to predict LPIs, efficiently extracting and effectively integrating potential feature information that reflects functional attributes from lncRNA and protein sequences remains a significant challenge.

View Article and Find Full Text PDF

Integrative taxonomy of the genus Pseudoacanthocephalus (Acanthocephala: Echinorhynchida) in China, with the description of two new species and the characterization of the mitochondrial genomes of Pseudoacanthocephalus sichuanensis sp. n. and Pseudoacanthocephalus nguyenthileae.

Parasit Vectors

December 2024

Hebei Collaborative Innovation Center for Eco-Environment, Hebei Key Laboratory of Animal Physiology, Biochemistry and Molecular Biology, College of Life Sciences, Hebei Normal University, Shijiazhuang, 050024, Hebei Province, People's Republic of China.

Background: Acanthocephalans (thorny headed worms) of the genus Pseudoacanthocephalus mainly parasitize amphibians and reptiles across the globe. Some species of the genus Pseudoacanthocephalus also can accidentally infect human and cause human acanthocephaliasis. Current knowledge of the species composition of the genus Pseudoacanthocephalus from amphibians and reptiles in China is incomplete.

View Article and Find Full Text PDF

Advances in A-to-I RNA editing in cancer.

Mol Cancer

December 2024

NHC Key Laboratory of Carcinogenesis and Hunan Key Laboratory of Cancer Metabolism, Hunan Cancer Hospital and the Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha, Hunan, 410078, China.

RNA modifications are widespread throughout the mammalian transcriptome and play pivotal roles in regulating various cellular processes. These modifications are strongly linked to the development of many cancers. One of the most prevalent forms of RNA modifications in humans is adenosine-to-inosine (A-to-I) editing, catalyzed by the enzyme adenosine deaminase acting on RNA (ADAR) in double-stranded RNA (dsRNA).

View Article and Find Full Text PDF

Osteoporosis is well noted to be a universal ailment that realization impaired bone mass and micro architectural deterioration thus enhancing the probability of fracture. Despite its high incidence, its management remains highly demanding because of the multifactorial pathophysiology of the disease. This review highlights recent findings in the management of osteoporosis particularly, gene expression and hormonal control.

View Article and Find Full Text PDF

Genomic Differences and Mutations in Epidemic Orf Virus and Vaccine Strains: Implications for Improving Orf Virus Vaccines.

Vet Sci

December 2024

Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, College of Animal Science and Technology, Foshan University, Foshan 528225, China.

Orf (ORF) is an acute disease caused by the Orf virus (ORFV), and poses a certain threat to animal and human health. Live attenuated vaccines play an important role in the prevention and control of ORF. The effectiveness of the live attenuated Orf virus vaccine is influenced by several factors, including the genomic match between the vaccine strain and circulating epidemic strains.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!