Advances in single-cell RNA sequencing (scRNA-Seq) have allowed for comprehensive analyses of single cell data. However, current analyses of scRNA-Seq data usually start from unsupervised clustering or visualization. These methods ignore prior knowledge of transcriptomes and the probable structures of the data. Moreover, cell identification heavily relies on subjective and possibly inaccurate human inspection afterwards. To address these analytical challenges, we developed SCINA (Semi-supervised Category Identification and Assignment), a semi-supervised model that exploits previously established gene signatures using an expectation-maximization (EM) algorithm. SCINA is applicable to scRNA-Seq and flow cytometry/CyTOF data, as well as other data of similar format. We applied SCINA to a wide range of datasets, and showed its accuracy, stability and efficiency, which exceeded most popular unsupervised approaches. SCINA discovered an intermediate stage of oligodendrocytes from mouse brain scRNA-Seq data. SCINA also detected immune cell population changes in cytometry data in a genetically-engineered mouse model. Furthermore, SCINA performed well with bulk gene expression data. Specifically, we identified a new kidney tumor clade with similarity to FH-deficient tumors (FHD), which we refer to as FHD-like tumors (FHDL). Overall, SCINA provides both methodological advances and biological insights from perspectives different from traditional analytical methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6678337PMC
http://dx.doi.org/10.3390/genes10070531DOI Listing

Publication Analysis

Top Keywords

scina
8
scina semi-supervised
8
data
8
scrna-seq data
8
semi-supervised subtyping
4
subtyping algorithm
4
algorithm single
4
single cells
4
cells bulk
4
bulk samples
4

Similar Publications

Survey on the presence of floating microplastics, trace metals and metalloids in seawater from Southern Italy to the United States of America.

Ecotoxicol Environ Saf

December 2024

Istituto Zooprofilattico Sperimentale della Sicilia "A. Mirri", Via Gino Marinuzzi 3, Palermo 90100, Italy.

The presence of microplastics (MPs), trace metals (TM) and metalloids (Ms) in surface seawater is a severe emerging issue of global concern. Information about the distribution of these pollutants is often lacking, and large-scale studies come with uncertainties because of difficult comparisons of results obtained using different methods to collect and process data. This study presents a comprehensive investigation of microplastics (MPs), trace metals (TM) and metalloids (Ms) in surface seawater during two transatlantic sampling campaigns, covering approximately 17,000 nautical miles.

View Article and Find Full Text PDF

scAnnoX: an R package integrating multiple public tools for single-cell annotation.

PeerJ

April 2024

Department of Hepatobiliary Surgery, the Affiliated Drum Tower Hospital, Medical School, Nanjing University, Nanjing, Jiangsu Province, China.

Background: Single-cell annotation plays a crucial role in the analysis of single-cell genomics data. Despite the existence of numerous single-cell annotation algorithms, a comprehensive tool for integrating and comparing these algorithms is also lacking.

Methods: This study meticulously investigated a plethora of widely adopted single-cell annotation algorithms.

View Article and Find Full Text PDF

Commencing in December 2019 with the emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), three years of the coronavirus disease 2019 (COVID-19) pandemic have transpired. The virus has consistently demonstrated a tendency for evolutionary adaptation, resulting in mutations that impact both immune evasion and transmissibility. This ongoing process has led to successive waves of infections.

View Article and Find Full Text PDF

PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes.

Comput Struct Biotechnol J

July 2023

Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China.

We propose PreCanCell, a novel algorithm for predicting malignant and non-malignant cells from single-cell transcriptomes. PreCanCell first identifies the differentially expressed genes (DEGs) between malignant and non-malignant cells commonly in five common cancer types-associated single-cell transcriptome datasets. The five common cancer types include renal cell carcinoma (RCC), head and neck squamous cell carcinoma (HNSCC), melanoma, lung adenocarcinoma (LUAD), and breast cancer (BC).

View Article and Find Full Text PDF

The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in December 2019 resulted in the coronavirus disease 2019 (COVID-19) pandemic, which has had devastating repercussions for public health. Over the course of this pandemic, the virus has continuously been evolving, resulting in new, more infectious variants that have frequently led to surges of new SARS-CoV-2 infections. In the present study, we performed detailed genetic, phylogenetic, phylodynamic and phylogeographic analyses to examine the SARS-CoV-2 epidemic in Cyprus using 2352 SARS-CoV-2 sequences from infected individuals in Cyprus during November 2020 to October 2021.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!