Microsatellite instability (MSI) is a critical phenotype of cancer genomes and an FDA-recognized biomarker that can guide treatment with immune checkpoint inhibitors. Previous work has demonstrated that next-generation sequencing (NGS) data can be used to identify samples with an MSI-high phenotype. However, low tumor purity, which is frequently observed in routine clinical samples, limits the sensitivity of existing algorithms. To overcome this issue, we developed MiMSI, an MSI classifier based on deep neural networks and trained in a multiple instance learning framework on a dataset that included low tumor purity MSI cases. On a challenging yet representative set of cases, MiMSI showed higher sensitivity (0.895) and auROC (0.971) than MSISensor (sensitivity: 0.67; auROC: 0.907), an open-source tool previously validated for clinical use at our institution with MSK-IMPACT large-panel targeted NGS data. In a separate, prospective cohort, MiMSI again outperformed MSISensor in low-purity cases (P = 8.244e-07).
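
The abstract compares the two classifiers by sensitivity and auROC. As a rough illustration of how these two metrics are usually computed from per-sample scores, here is a minimal Python sketch. It is not the evaluation code used for MiMSI or MSISensor; the function names, the 0.5 threshold, and the toy labels and scores are all assumptions made for this example.

```python
def sensitivity(labels, scores, threshold=0.5):
    """Fraction of true MSI cases (label 1) whose score reaches the threshold."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    if not positives:
        raise ValueError("sensitivity is undefined without positive samples")
    return sum(s >= threshold for s in positives) / len(positives)


def auroc(labels, scores):
    """Probability that a random positive outscores a random negative.

    Ties count as half a win (the Mann-Whitney formulation of auROC).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("auROC needs at least one sample of each class")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = MSI, 0 = microsatellite stable.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.6, 0.4, 0.5, 0.2, 0.1]

print(f"sensitivity = {sensitivity(labels, scores):.3f}")
print(f"auROC       = {auroc(labels, scores):.3f}")
```

Sensitivity depends on the chosen decision threshold, whereas auROC summarizes ranking quality across all thresholds, so a classifier can improve on one without improving on the other.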

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11696176
DOI: http://dx.doi.org/10.1038/s41467-024-54970-z

Similar Publications

This paper introduces the Morphologically-Analyzed and Syntactically-Annotated Quran (MASAQ) dataset, a comprehensive resource designed to address the scarcity of annotated Quranic Arabic corpora and facilitate the development of advanced Natural Language Processing (NLP) models. The Quran, being a cornerstone of classical Arabic, presents unique challenges for NLP due to its sacred nature and complex linguistic features. MASAQ provides a detailed syntactic and morphological annotation of the entire Quranic text, utilizing a rigorously verified text from Tanzil.

Protein-ligand binding affinity prediction using multi-instance learning with docking structures.

Front Pharmacol

January 2025

Global Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, CA, United States.

Introduction: Recent advances in 3D structure-based deep learning approaches demonstrate improved accuracy in predicting protein-ligand binding affinity in drug discovery. These methods complement physics-based computational modeling such as molecular docking for virtual high-throughput screening. Despite recent advances and improved predictive performance, most methods in this category primarily rely on utilizing co-crystal complex structures and experimentally measured binding affinities as both input and output data for model training.

Background: Genomic data is essential for clinical decision-making in precision oncology. Bioinformatic algorithms are widely used to analyze next-generation sequencing (NGS) data, but they face two major challenges. First, these pipelines are highly complex, involving multiple steps and the integration of various tools.

Caenorhabditis elegans as a Model for Environmental Epigenetics.

Curr Environ Health Rep

January 2025

Institute for Society and Genetics, University of California, Boyer Hall, Room 332, 611 Charles E Young Dr E., UCLA, Los Angeles, CA, 90095, USA.

Purpose of Review: The burgeoning field of environmental epigenetics has revealed the malleability of the epigenome and uncovered numerous instances of its sensitivity to environmental influences; however, pinpointing specific mechanisms that tie together environmental triggers, epigenetic pathways, and organismal responses has proven difficult. This article describes how Caenorhabditis elegans can fill this gap, serving as a useful model for the discovery of molecular epigenetic mechanisms that are conserved in humans.

Recent Findings: Recent results show that environmental stressors such as methylmercury, arsenite, starvation, heat, bacterial infection, and mitochondrial inhibitors can all have profound effects on the epigenome, with some insults showing epigenetic and organismal effects for multiple generations.

The activity of miRNA varies across different cell populations and systems, as part of the mechanisms that distinguish cell types and roles in living organisms and in human health and disease. Typically, miRNA regulation drives changes in the composition and levels of protein-coding RNA and of lncRNA, with targets being down-regulated when miRNAs are active. The term "miRNA activity" is used to refer to this transcriptional effect of miRNAs.
