Summary: Unsupervised machine learning provides tools for researchers to uncover latent patterns in large-scale data, based on calculated distances between observations. Methods to visualize high-dimensional data based on these distances can elucidate subtypes and interactions within multi-dimensional and high-throughput data. However, researchers can select from a vast number of distance metrics and visualizations, each with its own strengths and weaknesses. The Mercator R package facilitates selection of a biologically meaningful distance from 10 metrics, which together cover binary, categorical and continuous data, and supports visualization with 5 standard and high-dimensional graphics tools. Mercator provides a user-friendly pipeline for informaticians or biologists to perform unsupervised analyses, from exploratory pattern recognition to production of publication-quality graphics.

Availability and Implementation: Mercator is freely available at the Comprehensive R Archive Network (https://cran.r-project.org/web/packages/Mercator/index.html).
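
To illustrate why the choice of distance metric matters, the following is a minimal sketch, not Mercator's API (the package is written in R; Python is used here only for illustration). It compares two common distances for binary data, Jaccard and Hamming, on three made-up samples. The function names, the sample data, and the choice of these two metrics are all assumptions for the example; the summary does not list which 10 metrics the package offers.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """1 - |intersection| / |union|, ignoring positions where both are 0."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return 0.0 if either == 0 else 1.0 - both / either


def hamming_distance(a, b):
    """Fraction of positions at which the two vectors differ."""
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def distance_matrix(rows, metric):
    """Symmetric pairwise distance matrix as a list of lists."""
    n = len(rows)
    d = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        d[i][j] = d[j][i] = metric(rows[i], rows[j])
    return d


def nearest_neighbor(d, i):
    """Index of the observation closest to observation i (excluding i)."""
    return min((j for j in range(len(d)) if j != i), key=lambda j: d[i][j])


# Hypothetical binary feature vectors (e.g. presence/absence of 8 features).
samples = [
    [1, 1, 1, 1, 0, 0, 0, 0],  # sample 0
    [1, 1, 1, 1, 1, 1, 1, 0],  # sample 1: shares all of sample 0's features, adds 3
    [0, 0, 1, 1, 0, 0, 0, 0],  # sample 2: shares 2 of sample 0's features
]

jaccard = distance_matrix(samples, jaccard_distance)
hamming = distance_matrix(samples, hamming_distance)

print("Jaccard nearest to sample 0:", nearest_neighbor(jaccard, 0))
print("Hamming nearest to sample 0:", nearest_neighbor(hamming, 0))
```

In this toy data the two metrics disagree about which sample is closest to sample 0: Jaccard ignores shared absences and favors sample 1, while Hamming counts shared absences as agreement and favors sample 2. Any clustering or visualization built on those distances would differ accordingly.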


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8428582
DOI: http://dx.doi.org/10.1093/bioinformatics/btab037


Similar Publications

Stroke is one of the leading causes of death in developing countries, and China bears the largest global burden of stroke. This study aims to investigate the relationship between different dimensions of physical activity levels and stroke risk using a nationally representative database. We performed a cross-sectional analysis using data from the China Health and Retirement Longitudinal Study (CHARLS) 2020.


Analyzing microbial samples remains computationally challenging due to their diversity and complexity. The lack of robust de novo protein function prediction methods exacerbates the difficulty in deriving functional insights from these samples. Traditional prediction methods, dependent on homology and sequence similarity, often fail to predict functions for novel proteins and proteins without known homologs.


The goal of this study was to determine how radiologists' rating of image quality when using 0.5T Magnetic Resonance Imaging (MRI) compares to Computed Tomography (CT) for visualization of pathology and evaluation of specific anatomic regions within the paranasal sinuses. 42 patients with clinical CT scans opted to have a 0.


Online vibration state identification of multi-rigid-body system based on self-healing model.

Sci Rep

December 2024

School of Mechanical Engineering, Liaoning Engineering Vocational College, Tieling, 112008, Liaoning, People's Republic of China.

The paper proposes a multi-rigid-body system state identification method based on self-healing model in order to improve the accuracy and reliability of CNC machine tools. Firstly, considering the influence of the joint surface, the Lagrange method is used to establish the mechanical model of the multi-rigid-body system. We input acceleration information and use the second-order modulation function to complete the online real-time identification of the joint surface parameters, thereby establishing the self-healing mechanical model of the multi-rigid-body system.


The new submarine volcano Fani Maoré offshore Mayotte (Comoros archipelago) discovered in 2019 has raised the awareness of a possible future eruption in Petite-Terre island, located on the same 60 km-long volcanic chain. In this context of a renewal of the volcanic activity, we present here the first volcanic hazard assessment in Mayotte, focusing on the potential reactivation of the Petite-Terre eruptive centers. Using the 2-D tephra dispersal model HAZMAP and the 1979 - 2021 meteorological ERA-5 database, we first identify single eruptive scenarios of various impacts for the population of Mayotte.

