Mutations abound in all viral populations, which are thus rendered adaptable to changes in environmental conditions. Human cytomegalovirus (HCMV) is an important human pathogen for investigating nucleotide sequence variations because they can affect its potential to cause disease. We have determined part of the nucleotide sequence of the Toledo strain and compared it to the published sequences of the strains AD169, Toledo, and Towne and of three clinical isolates. Overall nucleotide sequence divergence between strains AD169 and Toledo amounts to roughly 2%, with considerable variations across the viral genome. In aligning the Toledo nucleotide sequences with those of the other strains and clinical isolates, numerous amino-terminal extensions of the known open reading frames (ORFs) have been noted. These extensions carry additional AUG or non-canonical CUG or GUG translational initiation codons. CUG and GUG have previously been shown to serve as translational start codons in prokaryotic and eukaryotic systems. Six of the more closely inspected extensions start with an AUG, 26 with a CUG, and 26 with a GUG. Some of these extended sequences might bestow altered biological properties upon HCMV proteins. These ORF extensions are common to the sequenced genomes of most of the HCMV strains or isolates. Supporting evidence for their functionality comes from studies on HCMV mRNAs that were isolated from HCMV-infected human cells. Several of these viral mRNA sequences carry the identified ORF extensions. Moreover, in the amino-terminal ORF extensions, codon usage in general resembles that in the main parts of several of the HCMV genes analyzed for this property.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00705-007-1026-xDOI Listing

Publication Analysis

Top Keywords

nucleotide sequence
16
cug gug
12
orf extensions
12
strains isolates
8
human cytomegalovirus
8
codon usage
8
sequences strains
8
strains ad169
8
ad169 toledo
8
clinical isolates
8

Similar Publications

Complete genome sequence of bacteriophage Godfather isolated from .

Microbiol Resour Announc

January 2025

Department of Biological Sciences, Tarleton State University, Stephenville, Texas, USA.

Microbacteriophage Godfather was collected from a soil sample in Stephenville, Texas. The 17,452-bp double-stranded genome contains 24 protein-coding genes. The genome shares >99% nucleotide sequence identity with cluster EE microbacteriophages Scamander, Danno, Kojax4, and Burgy.

View Article and Find Full Text PDF

Next-generation cancer phenomics by deployment of multiple molecular endophenotypes coupled with high-throughput analyses of gene expression offer veritable opportunities for triangulation of discovery findings in non-small cell lung cancer (NSCLC) research. This study reports differentially expressed genes in NSCLC using publicly available datasets (GSE18842 and GSE229253), uncovering 130 common genes that may potentially represent crucial molecular signatures of NSCLC. Additionally, network analyses by GeneMANIA and STRING revealed significant coexpression and interaction patterns among these genes, with four notable hub genes-, , and -identified as pivotal in NSCLC progression.

View Article and Find Full Text PDF

Modular organization of enhancer network provides transcriptional robustness in mammalian development.

Nucleic Acids Res

January 2025

State Key Laboratory of Cellular Stress Biology, Xiang'an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China.

Enhancer clusters, pivotal in mammalian development and diseases, can organize as enhancer networks to control cell identity and disease genes; however, the underlying mechanism remains largely unexplored. Here, we introduce eNet 2.0, a comprehensive tool for enhancer networks analysis during development and diseases based on single-cell chromatin accessibility data.

View Article and Find Full Text PDF

Recent advancements in genomics, propelled by artificial intelligence, have unlocked unprecedented capabilities in interpreting genomic sequences, mitigating the need for exhaustive experimental analysis of complex, intertwined molecular processes inherent in DNA function. A significant challenge, however, resides in accurately decoding genomic sequences, which inherently involves comprehending rich contextual information dispersed across thousands of nucleotides. To address this need, we introduce GENA language model (GENA-LM), a suite of transformer-based foundational DNA language models capable of handling input lengths up to 36 000 base pairs.

View Article and Find Full Text PDF

i-Motifs (iMs) are quadruplex nucleic acid conformations that form in cytosine-rich regions. Because of their acidic pH dependence, iMs were thought to form only in vitro. The recent development of an iM-selective antibody, iMab, has allowed iM detection in cells, which revealed their presence at gene promoters and their cell cycle dependence.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!