AI Article Synopsis

  • Single cell data integration methods help combine data from different sources and types, with mosaic integration being the most complex and under-researched.
  • We introduce scMoMaT, a new approach that effectively integrates single cell multi-omics data using matrix tri-factorization, identifying cluster-specific bio-markers in the process.
  • Our method outperforms existing techniques, providing better quality annotations of cell types by using integrated cell data and learned bio-markers.

Article Abstract

Single cell data integration methods aim to integrate cells across data batches and modalities, and data integration tasks can be categorized into horizontal, vertical, diagonal, and mosaic integration, where mosaic integration is the most general and challenging case with few methods developed. We propose scMoMaT, a method that is able to integrate single cell multi-omics data under the mosaic integration scenario using matrix tri-factorization. During integration, scMoMaT is also able to uncover the cluster specific bio-markers across modalities. These multi-modal bio-markers are used to interpret and annotate the clusters to cell types. Moreover, scMoMaT can integrate cell batches with unequal cell type compositions. Applying scMoMaT to multiple real and simulated datasets demonstrated these features of scMoMaT and showed that scMoMaT has superior performance compared to existing methods. Specifically, we show that integrated cell embedding combined with learned bio-markers lead to cell type annotations of higher quality or resolution compared to their original annotations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9873790PMC
http://dx.doi.org/10.1038/s41467-023-36066-2DOI Listing

Publication Analysis

Top Keywords

mosaic integration
16
single cell
12
cell
8
data integration
8
cell type
8
scmomat
7
integration
7
scmomat jointly
4
jointly performs
4
performs single
4

Similar Publications

Oxyntic gland neoplasms typically arise in Helicobacter pylori-naïve stomachs and are composed predominantly of chief cells, with a smaller component of parietal cells. The pathologic diagnosis can be challenging due to minimal cellular atypia. Especially in biopsy specimens with limited tumor volume or when pathologists have limited experience in diagnosing this neoplasm, distinguishing it from normal oxyntic glands can be difficult, and no reliable diagnostic markers are currently available.

View Article and Find Full Text PDF

Sexual mosaics in Simuliidae members have been detected at low frequencies in various locations worldwide. These phenotypic expressions are expected in very small amounts in any natural insect population and may result from inherent individual formation factors, such as hormonal or external elements, i.e.

View Article and Find Full Text PDF

CASTER: Direct species tree inference from whole-genome alignments.

Science

January 2025

Electrical and Computer Engineering Department, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA.

Genomes contain mosaics of discordant evolutionary histories, challenging the accurate inference of the tree of life. While genome-wide data are routinely used for discordance-aware phylogenomic analyses, due to modeling and scalability limitations, the current practice leaves out large chunks of genomes. As more high-quality genomes become available, we urgently need discordance-aware methods to infer the tree directly from a multiple genome alignment.

View Article and Find Full Text PDF

Hybridization between wild Musa species and subspecies from Southeast Asia is at the origin of cultivated bananas. The genomes of these cultivars are complex mosaics involving nine genetic groups, including two previously unknown contributors. This study provides continuous genome assemblies for six wild genetic groups, one of which represents one of the unknown ancestor, identified as M.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!