The standard analysis pipeline for single-cell RNA-seq data consists of sequential steps initiated by clustering the cells. An innate limitation of this pipeline is that an imperfect clustering result can irreversibly affect the succeeding steps. For example, there can be cell types not well distinguished by clustering because they largely share the global structure, such as the anterior primitive streak and mid primitive streak cells. If one searches differentially expressed genes (DEGs) solely based on clustering, marker genes for distinguishing these types will be missed. Moreover, clustering depends on many parameters and can often be subjective to manual decisions. To overcome these limitations, we propose MarcoPolo, a method that identifies informative DEGs independently of prior clustering. MarcoPolo sorts out genes by evaluating if the distributions are bimodal, if similar expression patterns are observed in other genes, and if the expressing cells are proximal in a low-dimensional space. Using real datasets with FACS-purified cell labels, we demonstrate that MarcoPolo recovers marker genes better than competing methods. Notably, MarcoPolo finds key genes that can distinguish cell types that are not distinguishable by the standard clustering. MarcoPolo is built in a convenient software package that provides analysis results in an HTML file.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9262626 | PMC |
http://dx.doi.org/10.1093/nar/gkac216 | DOI Listing |
J Transl Med
July 2024
Department of Hematology, Tianjin Medical University Tianjin General Hospital, Tianjin, China.
Background: Myelodysplastic syndrome (MDS) is a complicated hematopoietic malignancy characterized by bone marrow (BM) dysplasia with symptoms like anemia, neutropenia, or thrombocytopenia. MDS exhibits considerable heterogeneity in prognosis, with approximately 30% of patients progressing to acute myeloid leukemia (AML). Single cell RNA-sequencing (scRNA-seq) is a new and powerful technique to profile disease landscapes.
View Article and Find Full Text PDFNucleic Acids Res
July 2022
Department of Biomedical Sciences, BK21 Plus Biomedical Science Project, Seoul National University College of Medicine, Seoul, Republic of Korea.
The standard analysis pipeline for single-cell RNA-seq data consists of sequential steps initiated by clustering the cells. An innate limitation of this pipeline is that an imperfect clustering result can irreversibly affect the succeeding steps. For example, there can be cell types not well distinguished by clustering because they largely share the global structure, such as the anterior primitive streak and mid primitive streak cells.
View Article and Find Full Text PDFAnn Dermatol
April 2016
Department of Dermatology and Cutaneous Biology Research Institute, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea.
Background: Ustekinumab is a fully human monoclonal antibody approved for the treatment of chronic moderate-to-severe plaque psoriasis in adults. However, factors including efficacy, tolerability, ease of use, and cost burden may affect ustekinumab utilization. Noncompliance may, in turn, affect treatment response.
View Article and Find Full Text PDFCad Saude Publica
March 2012
Hospital Universitário de Brasília, Universidade de Brasília, Brasília, Brasil.
This study aimed to examine the prognostic value of lipid parameters for incident hypertension in elderly living in a community. The study included 306 (81% from total) persons aged > 60 years who were free of hypertension and of cardiovascular diseases at the baseline survey of the Bambuí Cohort Study of Aging. The cumulative incidence of hypertension over three years was 37.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!