Escherichia coli, a ubiquitous commensal and pathogenic member of the family Enterobacteriaceae, accounts for a high burden of infection, morbidity, and mortality throughout the world. With multidrug resistance (MDR) emerging on a massive scale, E. coli has been listed as one of the Global Antimicrobial Resistance and Use Surveillance System (GLASS) priority pathogens. Understanding the resistance mechanisms and underlying genomic features is essential to tackling the further spread of these multidrug-resistant superbugs. While a few globally prevalent sequence types (STs) of E. coli, such as ST131, ST69, ST405, and ST648, have previously been reported to be highly virulent and to harbor MDR, it remains unclear whether certain ST lineages have a greater propensity to acquire MDR. In this study, large-scale comparative genomics of 5,653 E. coli genomes from 19 ST lineages revealed ST-wide prevalence patterns of genomic features, such as antimicrobial resistance (AMR)-encoding genes/mutations, virulence genes, integrons, and transposons. Interpreting the importance of these features with a Random Forest Classifier trained on 11,988 genomic features from whole-genome sequence data identified ST-specific or phylogroup-specific signature proteins, mostly belonging to different protein superfamilies, including the toxin-antitoxin systems. Our study provides a comprehensive, genome-level view of a myriad of genomic features, ST-specific proteins, and resistance mechanisms across different lineages of E. coli; this could be of significant downstream importance in understanding the mechanisms of AMR, in clinical discovery, in epidemiology, and in devising control strategies.
With the leap in whole-genome data being generated, applying relevant methods to mine biologically significant information from microbial genomes is of utmost importance to public health genomics. Machine learning methods have been used not only to mine, curate, or classify the data but also to identify the relevant features that could be linked to a particular class or target. This is perhaps one of the first studies to attempt to classify a large repertoire of E. coli genome data sets (5,653 genomes) belonging to 19 different STs (including well-studied as well as understudied STs) using machine learning approaches. Important features identified by these approaches have revealed ST-specific signature proteins, which could be studied further to predict possible associations with phenotypic profiles, thereby providing a better understanding of virulence and resistance mechanisms among different clonal lineages of E. coli.
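
The idea of ranking genomic features by their importance to a Random Forest Classifier can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes scikit-learn and NumPy are available, uses a small synthetic presence/absence matrix in place of real genome annotations, and the feature names (`sig_ST131`, `sig_ST69`, `noise_*`) are hypothetical placeholders. Two "signature" features are planted so that each occurs in exactly one ST, and the remaining features are random noise.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Synthetic stand-in for a genome-by-feature presence/absence matrix.
# ST labels are taken from the abstract; all data here is made up.
sts = ["ST131", "ST69", "ST405"]
n_per_st = 60
n_noise = 18

labels = np.repeat(sts, n_per_st)

# Start with random 0/1 noise for every feature column.
X = rng.integers(0, 2, size=(len(labels), n_noise + 2))

# Plant two hypothetical signature features, each present in one ST only.
X[:, 0] = (labels == "ST131").astype(int)
X[:, 1] = (labels == "ST69").astype(int)

feature_names = ["sig_ST131", "sig_ST69"] + [f"noise_{i}" for i in range(n_noise)]

# Fit the classifier to predict ST from feature presence/absence.
clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X, labels)

# Rank features by impurity-based importance (highest first).
ranked = sorted(
    zip(feature_names, clf.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
top_features = [name for name, _ in ranked[:2]]

for name, score in ranked[:5]:
    print(f"{name}\t{score:.3f}")
```

In this toy setting the planted features should rank above the noise columns. With real data, highly ranked features would only be candidates for ST-specific signatures and would still need biological follow-up.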


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8844930
DOI: http://dx.doi.org/10.1128/mbio.03796-21

