Enhancing Bacterial Phenotype Classification Through the Integration of Autogating and Automated Machine Learning in Flow Cytometric Analysis.

In Jae Jeong Jin-Kyung Hong Young Jun Bae Tea Kwon Lee

Cytometry A

Department of Environmental and Energy Engineering, Yonsei University, Wonju, Republic of Korea.

Published: March 2025

Although flow cytometry produces reliable results, the data processing from gating to fingerprinting is prone to subjective bias. Here, we integrated autogating with Automated Machine Learning in flow cytometry to enhance the classification of bacterial phenotypes. We analyzed six bacterial strains prevalent in the soil and groundwater-Bacillus subtilis, Burkholderia thailandensis, Corynebacterium glutamicum, Escherichia coli, Pseudomonas putida, and Pseudomonas stutzeri. Using the H2O-AutoML framework, we applied gradient-boosting machine (GBM) models to classify bacteria across different metabolic phases. Our results demonstrated an overall classification accuracy of 82.34% for GBM. Notably, accuracy varied across metabolic phases, with the highest observed during the late log (88.06%), lag (88.43%), and early log phases (89.37%), whereas the stationary phase showed a slightly lower accuracy of 80.73%. P. stutzeri exhibited consistently high sensitivity and specificity across all the phases, which indicated that it was the most distinctly identifiable strain. In contrast, E. coli showed low sensitivity, particularly in the stationary phase, which indicated challenges in its classification. Overall, this study with incorporating autogating and the AutoML framework, substantially reduces subjective biases and enhances the reproducibility and accuracy of microbial classification. Our methodology offers a robust framework for microbial classification in flow cytometric analysis, paving the way for more precise and comprehensive analyses of microbial ecology.

Download full-text PDF	Source
http://dx.doi.org/10.1002/cyto.a.24923	DOI Listing

Publication Analysis

Top Keywords

autogating automated

automated machine

machine learning

learning flow

flow cytometric

cytometric analysis

flow cytometry

metabolic phases

stationary phase

microbial classification

Similar Publications

Enhancing Bacterial Phenotype Classification Through the Integration of Autogating and Automated Machine Learning in Flow Cytometric Analysis.

Cytometry A

March 2025

Department of Environmental and Energy Engineering, Yonsei University, Wonju, Republic of Korea.

In Jae Jeong Jin-Kyung Hong Young Jun Bae Tea Kwon Lee

View Article and Find Full Text PDF

Similar Publications

Automated Cytometric Gating with Human-Level Performance Using Bivariate Segmentation.

bioRxiv

May 2024

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA.

Jiong Chen Matei Ionita Yanbo Feng Yinfeng Lu Patryk Orzechowski

Recent advances in cytometry technology have enabled high-throughput data collection with multiple single-cell protein expression measurements. The significant biological and technical variance between samples in cytometry has long posed a formidable challenge during the gating process, especially for the initial gates which deal with unpredictable events, such as debris and technical artifacts. Even with the same experimental machine and protocol, the target population, as well as the cell population that needs to be excluded, may vary across different measurements.

View Article and Find Full Text PDF

Similar Publications

MetaCyto: A Tool for Automated Meta-analysis of Mass and Flow Cytometry Data.

Cell Rep

July 2018

Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA. Electronic address:

Zicheng Hu Chethan Jujjavarapu Jacob J Hughey Sandra Andorf Hao-Chih Lee

While meta-analysis has demonstrated increased statistical power and more robust estimations in studies, the application of this commonly accepted methodology to cytometry data has been challenging. Different cytometry studies often involve diverse sets of markers. Moreover, the detected values of the same marker are inconsistent between studies due to different experimental designs and cytometer configurations.

View Article and Find Full Text PDF

Similar Publications

DAFi: A directed recursive data filtering and clustering approach for improving and interpreting data clustering identification of cell populations from polychromatic flow cytometry data.

Cytometry A

June 2018

J. Craig Venter Institute, La Jolla, California.

Alexandra J Lee Ivan Chang Julie G Burel Cecilia S Lindestam Arlehamn Aishwarya Mandava

Computational methods for identification of cell populations from polychromatic flow cytometry data are changing the paradigm of cytometry bioinformatics. Data clustering is the most common computational approach to unsupervised identification of cell populations from multidimensional cytometry data. However, interpretation of the identified data clusters is labor-intensive.

View Article and Find Full Text PDF

Similar Publications

High Reproducibility of ELISPOT Counts from Nine Different Laboratories.

Cells

January 2015

Cellular Technology Limited, Shaker Hts. 44122, OH, USA.

Srividya Sundararaman Alexey Y Karulin Tameem Ansari Nadine BenHamouda Judith Gottwein

The primary goal of immune monitoring with ELISPOT is to measure the number of T cells, specific for any antigen, accurately and reproducibly between different laboratories. In ELISPOT assays, antigen-specific T cells secrete cytokines, forming spots of different sizes on a membrane with variable background intensities. Due to the subjective nature of judging maximal and minimal spot sizes, different investigators come up with different numbers.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!