Background: To investigate differences in pathogenesis, diagnosis and resistance pathways between HIV-1 subtypes, an accurate subtyping tool for large datasets is needed. We aimed to evaluate the performance of automated subtyping tools to classify the different subtypes and circulating recombinant forms using pol, the most sequenced region in clinical practice. We also present the upgraded version 3 of the Rega HIV subtyping tool (REGAv3).
Methodology: HIV-1 pol sequences (PR+RT) for 4674 patients retrieved from the Portuguese HIV Drug Resistance Database, and 1872 pol sequences trimmed from full-length genomes retrieved from the Los Alamos database were classified with statistical-based tools such as COMET, jpHMM and STAR; similarity-based tools such as NCBI and Stanford; and phylogenetic-based tools such as REGA version 2 (REGAv2), REGAv3, and SCUEAL. The performance of these tools, for pol, and for PR and RT separately, was compared in terms of reproducibility, sensitivity and specificity with respect to the gold standard which was manual phylogenetic analysis of the pol region.
Results: The sensitivity and specificity for subtypes B and C was more than 96% for seven tools, but was variable for other subtypes such as A, D, F and G. With regard to the most common circulating recombinant forms (CRFs), the sensitivity and specificity for CRF01_AE was ~99% with statistical-based tools, with phylogenetic-based tools and with Stanford, one of the similarity based tools. CRF02_AG was correctly identified for more than 96% by COMET, REGAv3, Stanford and STAR. All the tools reached a specificity of more than 97% for most of the subtypes and the two main CRFs (CRF01_AE and CRF02_AG). Other CRFs were identified only by COMET, REGAv2, REGAv3, and SCUEAL and with variable sensitivity. When analyzing sequences for PR and RT separately, the performance for PR was generally lower and variable between the tools. Similarity and statistical-based tools were 100% reproducible, but this was lower for phylogenetic-based tools such as REGA (~99%) and SCUEAL (~96%).
Conclusions: REGAv3 had an improved performance for subtype B and CRF02_AG compared to REGAv2 and is now able to also identify all epidemiologically relevant CRFs. In general the best performing tools, in alphabetical order, were COMET, jpHMM, REGAv3, and SCUEAL when analyzing pure subtypes in the pol region, and COMET and REGAv3 when analyzing most of the CRFs. Based on this study, we recommend to confirm subtyping with 2 well performing tools, and be cautious with the interpretation of short sequences.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.meegid.2013.04.032 | DOI Listing |
Front Biosci (Schol Ed)
December 2024
Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA.
Background: Alternative cleavage and polyadenylation (APA) is a crucial post-transcriptional gene regulation mechanism that regulates gene expression in eukaryotes by increasing the diversity and complexity of both the transcriptome and proteome. Despite the development of more than a dozen experimental methods over the last decade to identify and quantify APA events, widespread adoption of these methods has been limited by technical, financial, and time constraints. Consequently, APA remains poorly understood in most eukaryotes.
View Article and Find Full Text PDFFront Biosci (Elite Ed)
December 2024
Polytechnic School, University of Vale do Itajaí (Univali), Itajaí, SC 88302-202, Brazil.
Background: Enhanced biological phosphorus removal (EBPR) systems utilize phosphorus-accumulating organisms (PAOs) to remove phosphorus from wastewater since excessive phosphorus in water bodies can lead to eutrophication. This study aimed to characterize a newly isolated PAO strain for its potential application in EBPR systems and to screen for additional biotechnological potential. Here, sequencing allowed for genomic analysis, identifying the genes and molecules involved, and exploring other potentials.
View Article and Find Full Text PDFFront Biosci (Landmark Ed)
December 2024
Graduate School of Information Science and Technology, Osaka University, 565-0871 Suita, Osaka, Japan.
Background: Fusion genes are important biomarkers in cancer research because their expression can produce abnormal proteins with oncogenic properties. Long-read RNA sequencing (long-read RNA-seq), which can sequence full-length mRNA transcripts, facilitates the detection of such fusion genes. Several tools have been proposed for detecting fusion genes in long-read RNA-seq datasets derived from cancer cells.
View Article and Find Full Text PDFFront Biosci (Landmark Ed)
November 2024
Department of Hematology, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, 317000 Taizhou, Zhejiang, China.
In this comprehensive review, we delve into the transformative role of artificial intelligence (AI) in refining the application of multi-omics and spatial multi-omics within the realm of diffuse large B-cell lymphoma (DLBCL) research. We scrutinized the current landscape of multi-omics and spatial multi-omics technologies, accentuating their combined potential with AI to provide unparalleled insights into the molecular intricacies and spatial heterogeneity inherent to DLBCL. Despite current progress, we acknowledge the hurdles that impede the full utilization of these technologies, such as the integration and sophisticated analysis of complex datasets, the necessity for standardized protocols, the reproducibility of findings, and the interpretation of their biological significance.
View Article and Find Full Text PDFJACS Au
December 2024
Center for Molecular Recognition and Biosensing, Shanghai Engineering Research Center of Organ Repair, Joint International Research Laboratory of Biomaterials and Biotechnology in Organ Repair (Ministry of Education), School of Life Sciences, Shanghai University, Shanghai 200444, China.
Electrochemical biosensors are gaining attention as powerful tools in cancer diagnosis, particularly in liquid biopsy, due to their high efficiency, rapid response, exceptional sensitivity, and specificity. However, the complexity of intra- and intertumor heterogeneity, with variations in genetic and protein expression profiles and epigenetic modifications, makes electrochemical biosensors susceptible to false-positive or false-negative diagnostic outcomes. To address this challenge, there is growing interest in simultaneously analyzing multiple biomarkers to reveal molecular characteristics of tumor heterogeneity for precise cancer diagnosis.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!