We conducted an exhaustive search for three-dimensional structural homologs to the proteins of 20 key phylogenetically distinct nucleocytoplasmic DNA viruses (NCLDV). Structural matches covered 429 known protein domain superfamilies, with the most highly represented being ankyrin repeat, P-loop NTPase, F-box, protein kinase, and membrane occupation and recognition nexus (MORN) repeat. Domain superfamily diversity correlated with genome size, but a diversity of around 200 superfamilies appeared to correlate with an abrupt switch to paralogization. Extensive structural homology was found across the range of eukaryotic RNA polymerase II subunits and their associated basal transcription factors, with the coordinated gain and loss of clusters of subunits on a virus-by-virus basis. The total number of predicted endonucleases across the 20 NCLDV was nearly quadrupled from 36 to 132, covering much of the structural and functional diversity of endonucleases throughout the biosphere in DNA restriction, repair, and homing. Unexpected findings included capsid protein-transcription factor chimeras; endonuclease chimeras; enzymes for detoxification; antimicrobial peptides and toxin-antitoxin systems associated with symbiosis, immunity, and addiction; and novel proteins for membrane abscission and protein turnover. We extended the known annotation space for the NCLDV by 46%, revealing high-probability structural matches for fully 45% of the 9,671 query proteins and confirming up to 98% of existing annotations per virus. The most prevalent protein families included ankyrin repeat- and MORN repeat-containing proteins, many of which included an F-box, suggesting extensive host cell modulation among the NCLDV. Regression suggested a minimum requirement for around 36 protein structural superfamilies for a viable NCLDV, and beyond around 200 superfamilies, genome expansion by the acquisition of new functions was abruptly replaced by paralogization. We found homologs to herpesvirus surface glycoprotein gB in cytoplasmic viruses. This study provided the first prediction of an endonuclease in 10 of the 20 viruses examined; the first report in a virus of a phenolic acid decarboxylase, proteasomal subunit, or cysteine knot (defensin) protein; and the first report of a prokaryotic-type ribosomal protein in a eukaryotic virus.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7925184PMC
http://dx.doi.org/10.1128/JVI.00854-20DOI Listing

Publication Analysis

Top Keywords

annotation space
8
dna viruses
8
structural matches
8
200 superfamilies
8
protein
7
structural
6
ncldv
5
structure-based deep
4
deep mining
4
mining reveals
4

Similar Publications

Blink detection is considered a useful indicator both for clinical conditions and drowsiness state. In this work, we propose and compare deep learning architectures for the task of detecting blinks in video frame sequences. The first step is the training and application of an eye detector that extracts the eye regions from each video frame.

View Article and Find Full Text PDF

Potato (Solanum tuberosum) is the third-most important food crop in the world. Although the potato genome has been fully sequenced, functional genomics research of potato lags behind that of other major food crops, largely due to the lack of a model experimental potato line. Here, we present a diploid potato line, 'Jan,' which possesses all essential characteristics for facile functional genomics studies.

View Article and Find Full Text PDF

Semi-Supervised Medical Image Segmentation Based on Frequency Domain Aware Stable Consistency Regularization.

J Imaging Inform Med

January 2025

Key Laboratory of Knowledge Engineering with Big Data (the Ministry of Education of China), Hefei University of Technology, Hefei, 230009, Anhui, China.

With the advancement of deep learning models nowadays, they have successfully applied in the semi-supervised medical image segmentation where there are few annotated medical images and a large number of unlabeled ones. A representative approach in this regard is the semi-supervised method based on consistency regularization, which improves model training by imposing consistency constraints (perturbations) on unlabeled data. However, the perturbations in this kind of methods are often artificially designed, which may introduce biases unfavorable to the model learning in the handling of medical image segmentation.

View Article and Find Full Text PDF

Premature senescence has a significant impact on the yield and quality of wheat crops. The process is controlled by multiple and intricate genetic pathways and regulatory elements, whereby the discovery of additional mutants provides important insights into the molecular basis of this important trait. Here, we developed a premature senescence wheat mutant je0874, its leaves started to show yellow before heading stage; with plant growth and development, the degree of yellowing worsened rapidly, and chlorophyll content in flag leaf was reduced by 93.

View Article and Find Full Text PDF

Introduction: Bryophytes are non-vascular plants that appeared on Earth before vascular plants. More than 24,000 species are reported worldwide, and only a small proportion have been studied. However, part of their biosynthetic potential has been unveiled and more than 1,600 terpenoids have been detected and identified.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!