Motivation: Mixed molecular data combines continuous and categorical features of the same samples, such as OMICS profiles with genotypes, diagnoses, or patient sex. Like all high-dimensional molecular data, it is prone to incorrect values that can stem from various sources, for example technical limitations of the measurement devices, errors in sample preparation, or contamination. Most anomaly detection algorithms identify complete samples as outliers or anomalies. In most cases, however, not all measurements of those samples are erroneous; only a few one-dimensional features within the samples are incorrect. These one-dimensional data errors are continuous measurements that lie either outside or inside the normal range of their feature but, in both cases, show atypical values given all other continuous and categorical features of the sample. Additionally, categorical anomalies can occur, for example when a genotype or diagnosis was submitted incorrectly.

Results: We introduce ADMIRE (Anomaly Detection using MIxed gRaphical modEls), a novel approach for the detection and correction of anomalies in mixed high-dimensional data. We focus on the detection of single (one-dimensional) data errors in the categorical and continuous features of a sample. To this end, the joint distribution of continuous and categorical features is learned by mixed graphical models; anomalies are detected by the difference between measured values and model-based estimates and are corrected using imputation. We evaluated ADMIRE in simulations and by screening for anomalies in one of our own metabolic datasets. In the simulation experiments, ADMIRE outperformed the state-of-the-art methods Local Outlier Factor, stray, and Isolation Forest.
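
The detect-and-correct idea described above can be illustrated with a minimal sketch. It is not the ADMIRE algorithm and does not use the adadmire API: as a simplifying assumption it handles continuous features only and replaces the mixed graphical model with a plain ridge regression of each feature on all others. The function name `flag_cellwise_anomalies`, its parameters, and the simulated data are hypothetical and chosen for illustration only.

```python
import numpy as np


def flag_cellwise_anomalies(X, threshold=5.0, ridge=1e-2):
    """Score every cell by how far it lies from a prediction made
    from all other features of the same sample (assumes no constant columns)."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    mu, sd = X.mean(axis=0), X.std(axis=0)
    Z = (X - mu) / sd                      # standardize each feature
    Z_hat = np.empty_like(Z)
    for j in range(p):
        others = [k for k in range(p) if k != j]
        A = Z[:, others]
        # Ridge regression of feature j on the remaining features
        gram = A.T @ A + ridge * n * np.eye(p - 1)
        beta = np.linalg.solve(gram, A.T @ Z[:, j])
        Z_hat[:, j] = A @ beta
    resid = Z - Z_hat                      # measured minus model-based estimate
    center = np.median(resid, axis=0)
    # Robust per-feature scale (median absolute deviation)
    mad = 1.4826 * np.median(np.abs(resid - center), axis=0)
    scores = np.abs(resid - center) / mad
    mask = scores > threshold
    X_fixed = X.copy()
    # Correct flagged cells by imputing the model-based estimate
    X_fixed[mask] = (Z_hat * sd + mu)[mask]
    return scores, mask, X_fixed


# Simulated example: six features driven by one latent factor
rng = np.random.default_rng(0)
latent = rng.normal(size=200)
X = latent[:, None] + 0.1 * rng.normal(size=(200, 6))

# Corrupt a single cell of a sample near the center of the distribution,
# so the shifted value is inconsistent with the sample's other features
i = int(np.argmin(np.abs(latent)))
true_value = X[i, 1]
X[i, 1] += 1.5

scores, mask, X_fixed = flag_cellwise_anomalies(X)
```

In this sketch the corrupted cell is conspicuous only in the context of the other features of its sample, which is the kind of one-dimensional error the abstract describes. Categorical features and categorical anomalies would require a model of the joint distribution, such as the mixed graphical models used by ADMIRE.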

Availability and Implementation: All data and code are available at https://github.com/spang-lab/adadmire. ADMIRE is implemented in a Python package called adadmire, which can be found at https://pypi.org/project/adadmire.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457663
DOI: http://dx.doi.org/10.1093/bioinformatics/btad501

