Background: Viral respiratory tract infections are frequently complicated by secondary bacterial infections. This study aimed to use machine learning to predict the risk of bacterial superinfection in SARS-CoV-2-positive individuals.

Methods: In this prospective, multicentre, observational cohort study done in nine centres in six countries (Australia, Indonesia, Singapore, Italy, Czechia, and France) blood samples and RNA sequencing were used to develop a robust model of predicting secondary bacterial infections in the respiratory tract of patients with COVID-19. Eligible participants were older than 18 years, had known or suspected COVID-19, and symptoms of a recent respiratory infection. A control cohort of participants without COVID-19 who were older than 18 years and with no infection symptoms was also recruited from one Australian centre. In the pre-analysis phase, data were filtered to include only individuals with complete blood transcriptomics and patient data (ie, age, sex, location, and WHO severity score at the time of sample collection). The dataset was then divided randomly (4:1) into a training set (80%) and a test set (20%). Gene expression data in the training set and control cohort were used for differential expression analysis. Differentially expressed genes, along with WHO severity score, location, age, and sex, were used for feature selection with least absolute shrinkage and selection operator (LASSO) in the training set. For LASSO analysis, samples were excluded if gene expression data were not obtained at study admission, no longitudinal clinical information was available, a bacterial infection at the time of study admission was present, or a fungal infection in the absence of a bacterial infection was detected. LASSO regression was performed using three subsets of predictor variables: patient data alone, gene expression data alone, or a combination of patient data and gene expression data. The accuracy of the resultant models was tested on data from the test set.

Findings: Between March, 2020, and October, 2021, we recruited 536 SARS-CoV-2-positive individuals and between June, 2013, and January, 2020, we recruited 74 participants into the control cohort. After prefiltering analysis and other exclusions, samples from 158 individuals were analysed in the training set and 47 in the test set. The expression of seven host genes (DAPP1, CST3, FGL2, GCH1, CIITA, UPP1, and RN7SL1) in the blood at the time of study admission was identified by LASSO as predictive of the risk of developing a secondary bacterial infection of the respiratory tract more than 24 h after study admission. Specifically, the expression of these genes in combination with a patient's WHO severity score at the time of study enrolment resulted in an area under the curve of 0·98 (95% CI 0·89-1·00), a true positive rate (sensitivity) of 1·00 (95% CI 1·00-1·00), and a true negative rate (specificity) of 0·94 (95% CI 0·89-1·00) in the test cohort. The combination of patient data and host transcriptomics at hospital admission identified all seven individuals in the training and test sets who developed a bacterial infection of the respiratory tract 5-9 days after hospital admission.

Interpretation: These data raise the possibility that host transcriptomics at the time of clinical presentation, together with machine learning, can forward predict the risk of secondary bacterial infections and allow for the more targeted use of antibiotics in viral infection.

Funding: Snow Medical Research Foundation, the National Health and Medical Research Council, the Jack Ma Foundation, the Helmholtz-Association, the A2 Milk Company, National Institute of Allergy and Infectious Disease, and the Fondazione AIRC Associazione Italiana per la Ricerca contro il Cancro.

Download full-text PDF

Source
http://dx.doi.org/10.1016/S2666-5247(23)00363-4DOI Listing

Publication Analysis

Top Keywords

secondary bacterial
20
bacterial infections
16
respiratory tract
16
patient data
16
training set
16
gene expression
16
expression data
16
study admission
16
bacterial infection
16
host transcriptomics
12

Similar Publications

Background: The consequences of septic arthritis of natural joints may be devastating. The purpose of this study was to evaluate the functional results of medical approach and surgical approach for septic arthritis of native joints, and to explore whether the number of drainage tubes after arthroscopic surgery will affect the knee function of patients.

Methods: A single-center retrospective study was conducted on patients diagnosed with septic arthritis from January 2018 to January 2023.

View Article and Find Full Text PDF

Background: The marine environment boasts distinctive physical, chemical, and biological characteristics. While numerous studies have delved into the microbial ecology and biological potential of the marine environment, exploration of genetically encoded, deep-sea sourced secondary metabolites remains scarce. This study endeavors to investigate marine bioproducts derived from deep-sea water samples at a depth of 1,000 m in the Java Trench, Indonesia, utilizing both culture-dependent and whole-genome sequencing methods.

View Article and Find Full Text PDF

Introduction: Bacterial Vaginosis (BV) has consistently been associated with adverse obstetric and gynaecological outcomes. It is a common vaginal condition. This study determined the prevalence and factors associated with BV among pregnant women.

View Article and Find Full Text PDF

Background: When coronavirus disease 2019 (COVID-19) mitigation efforts waned, viral respiratory infections (VRIs) surged, potentially increasing the risk of postviral invasive bacterial infections (IBIs). We sought to evaluate the change in epidemiology and relationships between specific VRIs and IBIs [complicated pneumonia, complicated sinusitis and invasive group A streptococcus (iGAS)] over time using the National COVID Cohort Collaborative (N3C) dataset.

Methods: We performed a secondary analysis of all prospectively collected pediatric (<19 years old) and adult encounters at 58 N3C institutions, stratified by era: pre-pandemic (January 1, 2018, to February 28, 2020) versus pandemic (March 1, 2020, to June 1, 2023).

View Article and Find Full Text PDF

Rhizosphere microorganisms are important factors affecting herb quality and secondary metabolite accumulation. In this study, we investigated the diversity of rhizosphere microbial communities (bacteria and fungi) and their correlations with soil physicochemical properties and active compounds of (baicalin, oroxindin, baicalein, wogonin, and oroxylin A) from cultivated with three different origins high-throughput sequencing and correlation analysis to further clarify the role of soil factors in the accumulation of the active compounds of . The results are summarized as follows.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!