Although flow cytometry produces reliable results, the data processing from gating to fingerprinting is prone to subjective bias. Here, we integrated autogating with Automated Machine Learning in flow cytometry to enhance the classification of bacterial phenotypes. We analyzed six bacterial strains prevalent in the soil and groundwater-Bacillus subtilis, Burkholderia thailandensis, Corynebacterium glutamicum, Escherichia coli, Pseudomonas putida, and Pseudomonas stutzeri. Using the H2O-AutoML framework, we applied gradient-boosting machine (GBM) models to classify bacteria across different metabolic phases. Our results demonstrated an overall classification accuracy of 82.34% for GBM. Notably, accuracy varied across metabolic phases, with the highest observed during the late log (88.06%), lag (88.43%), and early log phases (89.37%), whereas the stationary phase showed a slightly lower accuracy of 80.73%. P. stutzeri exhibited consistently high sensitivity and specificity across all the phases, which indicated that it was the most distinctly identifiable strain. In contrast, E. coli showed low sensitivity, particularly in the stationary phase, which indicated challenges in its classification. Overall, this study with incorporating autogating and the AutoML framework, substantially reduces subjective biases and enhances the reproducibility and accuracy of microbial classification. Our methodology offers a robust framework for microbial classification in flow cytometric analysis, paving the way for more precise and comprehensive analyses of microbial ecology.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/cyto.a.24923 | DOI Listing |
Cytometry A
March 2025
Department of Environmental and Energy Engineering, Yonsei University, Wonju, Republic of Korea.
Although flow cytometry produces reliable results, the data processing from gating to fingerprinting is prone to subjective bias. Here, we integrated autogating with Automated Machine Learning in flow cytometry to enhance the classification of bacterial phenotypes. We analyzed six bacterial strains prevalent in the soil and groundwater-Bacillus subtilis, Burkholderia thailandensis, Corynebacterium glutamicum, Escherichia coli, Pseudomonas putida, and Pseudomonas stutzeri.
View Article and Find Full Text PDFbioRxiv
May 2024
Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA.
Recent advances in cytometry technology have enabled high-throughput data collection with multiple single-cell protein expression measurements. The significant biological and technical variance between samples in cytometry has long posed a formidable challenge during the gating process, especially for the initial gates which deal with unpredictable events, such as debris and technical artifacts. Even with the same experimental machine and protocol, the target population, as well as the cell population that needs to be excluded, may vary across different measurements.
View Article and Find Full Text PDFCell Rep
July 2018
Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA. Electronic address:
While meta-analysis has demonstrated increased statistical power and more robust estimations in studies, the application of this commonly accepted methodology to cytometry data has been challenging. Different cytometry studies often involve diverse sets of markers. Moreover, the detected values of the same marker are inconsistent between studies due to different experimental designs and cytometer configurations.
View Article and Find Full Text PDFComputational methods for identification of cell populations from polychromatic flow cytometry data are changing the paradigm of cytometry bioinformatics. Data clustering is the most common computational approach to unsupervised identification of cell populations from multidimensional cytometry data. However, interpretation of the identified data clusters is labor-intensive.
View Article and Find Full Text PDFCells
January 2015
Cellular Technology Limited, Shaker Hts. 44122, OH, USA.
The primary goal of immune monitoring with ELISPOT is to measure the number of T cells, specific for any antigen, accurately and reproducibly between different laboratories. In ELISPOT assays, antigen-specific T cells secrete cytokines, forming spots of different sizes on a membrane with variable background intensities. Due to the subjective nature of judging maximal and minimal spot sizes, different investigators come up with different numbers.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!