Single-cell RNA sequencing (scRNA-seq) has significantly accelerated the experimental characterization of distinct cell lineages and types in complex tissues and organisms. Cell-type annotation is of great importance in most of the scRNA-seq analysis pipelines. However, manual cell-type annotation heavily relies on the quality of scRNA-seq data and marker genes, and therefore can be laborious and time-consuming. Furthermore, the heterogeneity of scRNA-seq datasets poses another challenge for accurate cell-type annotation, such as the batch effect induced by different scRNA-seq protocols and samples. To overcome these limitations, here we propose a novel pipeline, termed TripletCell, for cross-species, cross-protocol and cross-sample cell-type annotation. We developed a cell embedding and dimension-reduction module for the feature extraction (FE) in TripletCell, namely TripletCell-FE, to leverage the deep metric learning-based algorithm for the relationships between the reference gene expression matrix and the query cells. Our experimental studies on 21 datasets (covering nine scRNA-seq protocols, two species and three tissues) demonstrate that TripletCell outperformed state-of-the-art approaches for cell-type annotation. More importantly, regardless of protocols or species, TripletCell can deliver outstanding and robust performance in annotating different types of cells. TripletCell is freely available at https://github.com/liuyan3056/TripletCell. We believe that TripletCell is a reliable computational tool for accurately annotating various cell types using scRNA-seq data and will be instrumental in assisting the generation of novel biological hypotheses in cell biology.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199768 | PMC |
http://dx.doi.org/10.1093/bib/bbad132 | DOI Listing |
Viruses
December 2024
Vaccine and Gene Therapy Institute, Oregon Health and Science University, Beaverton, OR 97006, USA.
Lymphocryptoviruses (LCVs) are ubiquitous gamma-herpesviruses that establish life-long infections in both humans and non-human primates (NHPs). In immunocompromised hosts, LCV infections are commonly associated with B cell disorders and malignancies such as lymphoma. In this study, we evaluated simian LCV-encoded small microRNAs (miRNAs) present in lymphoblastoid cell lines (LCLs) derived from a Mauritian cynomolgus macaque () with cyLCV-associated post-transplant lymphoproliferative disease (PTLD) as well as the viral miRNAs expressed in a baboon () LCL that harbors CeHV12.
View Article and Find Full Text PDFPharmaceuticals (Basel)
December 2024
Computational Biology Laboratory, Department of Genetic Engineering, School of Bioengineering, SRM Institute of Science and Technology, Kattankulathur, Chengalpattu 603203, Tamil Nadu, India.
Inflammation serves as a vital response to diverse harmful stimuli like infections, toxins, or tissue injuries, aiding in the elimination of pathogens and tissue repair. However, persistent inflammation can lead to chronic diseases. Peptide therapeutics have gained attention for their specificity in targeting cells, yet their development remains costly and time-consuming.
View Article and Find Full Text PDFInt J Mol Sci
December 2024
School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou 213164, China.
Long non-coding RNA (lncRNA) is a non-coding RNA longer than 200 nucleotides, crucial for functions like cell cycle regulation and gene transcription. Accurate localization prediction from sequence information is vital for understanding lncRNA's biological roles. Computational methods offer an effective alternative to traditional experimental methods for annotating lncRNA subcellular positions.
View Article and Find Full Text PDFInt J Mol Sci
December 2024
Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Russia.
The amino acid composition of proteins depends on many factors. It varies in organisms that are distant in taxonomic position. The amino acid composition of proteins depends on the localization of proteins in cells and tissues and the structure of proteins.
View Article and Find Full Text PDFInt J Mol Sci
December 2024
College of Chinese Materia Medica, Yunnan University of Chinese Medicine, Kunming 650500, China.
The genus is distributed in the eastern three rivers on the Yunnan-Guizhou Plateau and its adjacent regions, located to the southeast of the Qinghai-Tibet Plateau. Its origin and evolution are likely influenced by the uplift of the Qinghai-Tibet Plateau. However, the historical impact of geological events on the divergence and distribution of this fish group has not been fully elucidated.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!