Mutations abound in all viral populations, which are thus rendered adaptable to changes in environmental conditions. Human cytomegalovirus (HCMV) is an important human pathogen for investigating nucleotide sequence variations because they can affect its potential to cause disease. We have determined part of the nucleotide sequence of the Toledo strain and compared it to the published sequences of the strains AD169, Toledo, and Towne and of three clinical isolates. Overall nucleotide sequence divergence between strains AD169 and Toledo amounts to roughly 2%, with considerable variations across the viral genome. In aligning the Toledo nucleotide sequences with those of the other strains and clinical isolates, numerous amino-terminal extensions of the known open reading frames (ORFs) have been noted. These extensions carry additional AUG or non-canonical CUG or GUG translational initiation codons. CUG and GUG have previously been shown to serve as translational start codons in prokaryotic and eukaryotic systems. Six of the more closely inspected extensions start with an AUG, 26 with a CUG, and 26 with a GUG. Some of these extended sequences might bestow altered biological properties upon HCMV proteins. These ORF extensions are common to the sequenced genomes of most of the HCMV strains or isolates. Supporting evidence for their functionality comes from studies on HCMV mRNAs that were isolated from HCMV-infected human cells. Several of these viral mRNA sequences carry the identified ORF extensions. Moreover, in the amino-terminal ORF extensions, codon usage in general resembles that in the main parts of several of the HCMV genes analyzed for this property.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s00705-007-1026-x | DOI Listing |
Microbiol Resour Announc
January 2025
Department of Biological Sciences, Tarleton State University, Stephenville, Texas, USA.
Microbacteriophage Godfather was collected from a soil sample in Stephenville, Texas. The 17,452-bp double-stranded genome contains 24 protein-coding genes. The genome shares >99% nucleotide sequence identity with cluster EE microbacteriophages Scamander, Danno, Kojax4, and Burgy.
View Article and Find Full Text PDFOMICS
January 2025
Department of Biotechnology, Brainware University, Barasat, West Bengal, India.
Next-generation cancer phenomics by deployment of multiple molecular endophenotypes coupled with high-throughput analyses of gene expression offer veritable opportunities for triangulation of discovery findings in non-small cell lung cancer (NSCLC) research. This study reports differentially expressed genes in NSCLC using publicly available datasets (GSE18842 and GSE229253), uncovering 130 common genes that may potentially represent crucial molecular signatures of NSCLC. Additionally, network analyses by GeneMANIA and STRING revealed significant coexpression and interaction patterns among these genes, with four notable hub genes-, , and -identified as pivotal in NSCLC progression.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
State Key Laboratory of Cellular Stress Biology, Xiang'an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China.
Enhancer clusters, pivotal in mammalian development and diseases, can organize as enhancer networks to control cell identity and disease genes; however, the underlying mechanism remains largely unexplored. Here, we introduce eNet 2.0, a comprehensive tool for enhancer networks analysis during development and diseases based on single-cell chromatin accessibility data.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
London Institute for Mathematical Sciences Royal Institution, 21 Albemarle St, London W1S 4BS, UK.
Recent advancements in genomics, propelled by artificial intelligence, have unlocked unprecedented capabilities in interpreting genomic sequences, mitigating the need for exhaustive experimental analysis of complex, intertwined molecular processes inherent in DNA function. A significant challenge, however, resides in accurately decoding genomic sequences, which inherently involves comprehending rich contextual information dispersed across thousands of nucleotides. To address this need, we introduce GENA language model (GENA-LM), a suite of transformer-based foundational DNA language models capable of handling input lengths up to 36 000 base pairs.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
Department of Molecular Medicine, University of Padua, via A. Gabelli 63, 35121 Padua, Italy.
i-Motifs (iMs) are quadruplex nucleic acid conformations that form in cytosine-rich regions. Because of their acidic pH dependence, iMs were thought to form only in vitro. The recent development of an iM-selective antibody, iMab, has allowed iM detection in cells, which revealed their presence at gene promoters and their cell cycle dependence.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!