Schema: metric learning enables interpretable synthesis of heterogeneous single-cell modalities.

Genome Biol

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.

Published: May 2021

A complete understanding of biological processes requires synthesizing information across heterogeneous modalities, such as age, disease status, or gene expression. Technological advances in single-cell profiling have enabled researchers to assay multiple modalities simultaneously. We present Schema, which uses a principled metric learning strategy that identifies informative features in a modality to synthesize disparate modalities into a single coherent interpretation. We use Schema to infer cell types by integrating gene expression and chromatin accessibility data; demonstrate informative data visualizations that synthesize multiple modalities; perform differential gene expression analysis in the context of spatial variability; and estimate evolutionary pressure on peptide sequences.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8091541PMC
http://dx.doi.org/10.1186/s13059-021-02313-2DOI Listing

Publication Analysis

Top Keywords

gene expression
12
metric learning
8
multiple modalities
8
modalities
5
schema metric
4
learning enables
4
enables interpretable
4
interpretable synthesis
4
synthesis heterogeneous
4
heterogeneous single-cell
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!