A major informatic challenge in single cell RNA-sequencing analysis is the precise annotation of datasets where cells exhibit complex multilayered identities or transitory states. Here, we present devCellPy a highly accurate and precise machine learning-enabled tool that enables automated prediction of cell types across complex annotation hierarchies. To demonstrate the power of devCellPy, we construct a murine cardiac developmental atlas from published datasets encompassing 104,199 cells from E6.5-E16.5 and train devCellPy to generate a cardiac prediction algorithm. Using this algorithm, we observe a high prediction accuracy (>90%) across multiple layers of annotation and across de novo murine developmental data. Furthermore, we conduct a cross-species prediction of cardiomyocyte subtypes from in vitro-derived human induced pluripotent stem cells and unexpectedly uncover a predominance of left ventricular (LV) identity that we confirmed by an LV-specific TBX5 lineage tracing system. Together, our results show devCellPy to be a useful tool for automated cell prediction across complex cellular hierarchies, species, and experimental systems.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9452519PMC
http://dx.doi.org/10.1038/s41467-022-33045-xDOI Listing

Publication Analysis

Top Keywords

machine learning-enabled
8
complex multilayered
8
devcellpy
5
prediction
5
devcellpy machine
4
learning-enabled pipeline
4
pipeline automated
4
annotation
4
automated annotation
4
complex
4

Similar Publications

Advances in Surface-Enhanced Raman Spectroscopy for Therapeutic Drug Monitoring.

Molecules

December 2024

Jiangxi Province Key Laboratory of Pharmacology of Traditional Chinese Medicine, School of Pharmacy, Gannan Medical University, Ganzhou 341000, China.

Therapeutic drug monitoring (TDM) is pivotal for optimizing drug dosage regimens in individual patients, particularly for drugs with a narrow therapeutic index. Surface-enhanced Raman spectroscopy (SERS) has shown great potential in TDM due to high sensitivity, non-destructive analysis, specific fingerprint spectrum, low sample consumption, simple operation, and low ongoing costs. Due to the rapid development of SERS for TDM, a review focusing on the analytical method is presented to better understand the trends.

View Article and Find Full Text PDF

Drug resistance in Mycobacterium tuberculosis (Mtb) is a significant challenge in the control and treatment of tuberculosis, making efforts to combat the spread of this global health burden more difficult. To accelerate anti-tuberculosis drug discovery, repurposing clinically approved or investigational drugs for the treatment of tuberculosis by computational methods has become an attractive strategy. In this study, we developed a virtual screening workflow that combines multiple machine learning and deep learning models, and 11 576 compounds extracted from the DrugBank database were screened against Mtb.

View Article and Find Full Text PDF

Background: Percutaneous extracorporeal membrane oxygenation (ECMO) is administered to pediatric patients with cardiogenic shock or cardiac arrest. The traditional method uses focal echocardiography to complete the left ventricular measurement. However, echocardiographic determination of the ejection fraction (EF) by manual tracing of the endocardial borders is time consuming and operator dependent.

View Article and Find Full Text PDF

Background: Non-small-cell lung cancer (NSCLC) remains a global health challenge, driving morbidity and mortality. The emerging field of radiogenomics utilizes statistical methods to correlate radiographic tumor features with genomic characteristics from biopsy samples. Radiomic techniques automate the precise extraction of imaging features from tumor regions in radiographic scans, which are subjected to machine learning (ML) to predict genomic attributes.

View Article and Find Full Text PDF

Machine-Learning-Enabled Thermochemistry Estimator.

J Chem Inf Model

December 2024

Department of Chemical, Biological and Bioengineering, North Carolina A&T State University, Greensboro, North Carolina 27411, United States.

Modeling adsorbates on single-crystal metals is critical in rational catalyst design and other research that requires detailed thermochemistry. First-principles simulations via density functional theory (DFT) are among the prevalent tools to acquire such information about surface species. While they are highly dependable, DFT calculations often require intensive computational resources and runtime.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!