Hepatitis B viruses (HBVs) are compact viruses with circular genomes of ∼3.2 kb in length. Four genes () generating seven products are encoded on overlapping reading frames. Ten HBV genotypes have been characterised (A-J), which may account for differences in transmission, outcomes of infection, and treatment response. However, HBV genotyping is rarely undertaken, and sequencing remains inaccessible in many settings. We set out to assess which amino acid (aa) sites in the HBV genome are most informative for determining genotype, using a machine learning approach based on random forest algorithms (RFA). We downloaded 5,496 genome-length HBV sequences from a public database, excluding recombinant sequences, regions with conserved indels, and genotypes I and J. Each gene was separately translated into aa, and the proteins concatenated into a single sequence (length 1,614 aa). Using RFA, we searched for aa sites predictive of genotype and assessed covariation among the sites with a mutual information-based method. We were able to discriminate confidently between genotypes A-H using ten aa sites. Half of these sites (5/10) sites were identified in Polymerase (Pol), of which 4/5 were in the spacer domain and one in reverse transcriptase. A further 4/10 sites were located in Surface protein and a single site in HBx. There were no informative sites in Core. Properties of the aa were generally not conserved between genotypes at informative sites. Among the highest co-varying pairs of sites, there were fifty-five pairs that included one of these 'top ten' sites. Overall, we have shown that RFA analysis is a powerful tool for identifying aa sites that predict the HBV lineage, with an unexpectedly high number of such sites in the spacer domain, which has conventionally been viewed as unimportant for structure or function. Our results improve ease of genotype prediction from limited regions of HBV sequences and may have future applications in understanding HBV evolution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825179PMC
http://dx.doi.org/10.1093/ve/veac116DOI Listing

Publication Analysis

Top Keywords

sites
13
hbv sequences
8
spacer domain
8
informative sites
8
hbv
7
polymorphisms predicting
4
predicting phylogeny
4
phylogeny hepatitis
4
hepatitis virus
4
virus hepatitis
4

Similar Publications

Associations between bone mineral density and WOMAC scores in healthy individuals: Insights from the Qatar Biobank.

J Clin Densitom

November 2024

Department of Biomedical Sciences, College of Health Sciences, QU Health, Qatar University, Doha, 2713, Qatar. Electronic address:

Background: Bone mineral density (BMD) is an indicator of bone health that predicts future bone fractures. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) is used to assess the severity of symptoms related to pain, stiffness, and function in diseased hip and knee joints. Here we assessed whether BMD measured at specific sites predicts WOMAC scores in healthy individuals whilst controlling for sociodemographic variables.

View Article and Find Full Text PDF

The integration of membrane separation with heterogeneous advanced oxidation processes is a prospective strategy for the elimination of contaminants during wastewater treatment. Fe-based catalysts and the green oxidant peracetic acid (PAA) are desirable candidates for the development of catalytic membranes because they are environmentally friendly. However, the construction of catalytic ceramic membranes (CMs) modified with efficient Fe-based catalysts that generate increased amounts of high-valent Fe-O species during PAA activation for the degradation of specific pollutants, especially during instantaneous membrane filtration, remains challenging.

View Article and Find Full Text PDF

Clinical and genetic characteristics of RANBP2 mutations in children with acute necrotizing encephalopathy.

Neurol Sci

December 2024

Pediatric Intensive Care Unit, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, No. 56 Nan-Li-Shi Road, Beijing, 100045, China.

Background: This study investigated RANBP2 mutations in children with acute necrotizing encephalopathy (ANE) and conducted a systematic review of the differences in clinical characteristics between with or without RANBP2 mutations.

Methods: Whole-exome sequencing was performed on 19 pediatric ANE patients at Beijing Children's Hospital affiliated to Capital Medical University between 2017 and 2020. A systematic literature review was also conducted on the clinical characteristics and spectrum analysis of RANBP2 mutations.

View Article and Find Full Text PDF

Purpose: Endoscopic resection is appropriate for selected colorectal polyp cancers, but significant variation exists in treatment. This study aims to investigate variation in management of screen-detected polyp cancers (T1), factors predicting primary endoscopic polypectomy and threshold for subsequent surgical resection.

Method: Patients with polyp cancers (T1) diagnosed by the bowel cancer screening programme (BCSP) were investigated at two screening centres (5 individual sites and 4 MDTs, 2012-2022).

View Article and Find Full Text PDF

Community Assembly Mechanisms of nirK- and nirS-type Denitrifying Bacteria in Sediments of Eutrophic Lake Taihu, China.

Curr Microbiol

December 2024

Marine Synthetic Ecology Research Center, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), School of Marine Science, Guangdong Provincial Observation and Research Station for Marine Ranching in Lingdingyang Bay, China-ASEAN Belt and Road Joint Laboratory On Mariculture Technology, State Key Laboratory for Biocontrol, Sun Yat-sen University, Zhuhai, 519082, China.

Denitrifying bacteria, particularly nirK- and nirS-type, are functionally equivalent and could occupy different niches, but their community assembly mechanisms and responses to environmental heterogeneity are poorly understood in eutrophic lakes. In this study, we investigated the community assembly mechanisms of nirK- and nirS-type denitrifying bacteria and clarified their responses to sediments environmental factors in Lake Taihu, China. The quantitative real-time PCR and Illumina HiSeq-based sequencing revealed that the abundance and composition of two types of denitrifying bacterial communities varied among different sites in the sediments of Lake Taihu.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!