Nonlinear projection methods for visualizing Barcode data and application on two data sets.

Mol Ecol Resour

SAMM (Statistique, Analyse et Modélisation Multidisciplinaire), EA 4543, Université Paris 1 Panthéon Sorbonne, 90 rue de Tolbiac, Paris, 75013, France.

Published: November 2013

Developing tools for visualizing DNA sequences is an important issue in the Barcoding context. Visualizing Barcode data can be framed in purely statistical terms, as an unsupervised learning problem. Clustering methods combined with projection methods pursue two closely linked objectives: visualizing the data and finding structure in it. Multidimensional scaling (MDS) and self-organizing maps (SOM) are unsupervised statistical tools for data visualization. Both algorithms map data onto a lower-dimensional manifold: MDS looks for a projection that best preserves pairwise distances, while SOM preserves the topology of the data. Both algorithms were initially developed for Euclidean data, and the conditions required for their proper application are not satisfied by Barcode data. We developed a workflow consisting of four steps: (1) collapse the data into distinct sequences; (2) compute a dissimilarity matrix; (3) run a modified version of SOM for dissimilarity matrices to structure the data and reduce dimensionality; (4) project the results using MDS. This methodology was applied to Astraptes fulgerator and to Hylomyscus, an African rodent with debated taxonomy. We obtained very good results for both data sets. The results were robust to unbalanced representation of species. All the species in Astraptes were well displayed in very distinct groups in the various visualizations, except for LOHAMP and FABOV, which were mixed up. For Hylomyscus, our findings were consistent with known species, confirmed the existence of four unnamed taxa and suggested the existence of potentially new species.
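The four-step workflow described in the abstract can be illustrated with a minimal sketch. Several choices here are assumptions and are not taken from the paper: the dissimilarity is a simple p-distance (proportion of differing sites) between aligned sequences, the SOM step uses a batch median SOM on a one-dimensional grid as a stand-in for the authors' modified SOM, MDS is the classical (eigendecomposition) variant applied to the SOM prototypes, and the toy sequences and all function names are hypothetical.

```python
from collections import Counter

import numpy as np


def collapse(seqs):
    """Step 1: collapse the data into distinct sequences, keeping counts."""
    counts = Counter(seqs)
    uniq = sorted(counts)
    return uniq, np.array([counts[s] for s in uniq])


def p_distance(uniq):
    """Step 2 (assumed metric): proportion of differing sites.

    Sequences are assumed to be aligned and of equal length.
    """
    arr = np.array([list(s) for s in uniq])
    return (arr[:, None, :] != arr[None, :, :]).mean(axis=2)


def median_som(D, n_units=2, n_iter=10, seed=0):
    """Step 3 (stand-in): batch median SOM on a 1-D grid.

    Works directly on a dissimilarity matrix; each prototype is
    constrained to be one of the observations (an index into D).
    """
    rng = np.random.default_rng(seed)
    n = D.shape[0]
    protos = rng.choice(n, size=n_units, replace=False)
    grid = np.arange(n_units)
    for t in range(n_iter):
        # Neighbourhood radius shrinks over iterations.
        sigma = max(n_units / 2 * (1 - t / n_iter), 0.5)
        bmu = D[:, protos].argmin(axis=1)  # best matching unit per item
        # h[u, i]: neighbourhood weight between unit u and item i's unit.
        h = np.exp(-((grid[:, None] - bmu[None, :]) ** 2) / (2 * sigma**2))
        # New prototype of unit u: item j minimising sum_i h[u, i] * D[i, j].
        protos = (h @ D).argmin(axis=1)
    return protos, D[:, protos].argmin(axis=1)


def classical_mds(D, k=2):
    """Step 4: classical MDS; negative eigenvalues are clipped to zero."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n          # centering matrix
    B = -0.5 * J @ (D**2) @ J                    # double-centred Gram matrix
    vals, vecs = np.linalg.eigh(B)
    idx = np.argsort(vals)[::-1][:k]             # k largest eigenvalues
    return vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0, None))


# Toy, made-up aligned sequences (not Barcode data from the paper).
seqs = ["ACGT", "ACGT", "ACGA", "TTGA", "TTGA", "TTGT"]
uniq, counts = collapse(seqs)
D = p_distance(uniq)
protos, units = median_som(D, n_units=2)
coords = classical_mds(D[np.ix_(protos, protos)], k=2)
```

In this sketch `units` gives the map unit assigned to each distinct sequence and `coords` gives two-dimensional coordinates for the prototypes. Because a p-distance matrix is not guaranteed to be Euclidean, the clipping of negative eigenvalues in `classical_mds` is a pragmatic simplification rather than a principled correction.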

Source
http://dx.doi.org/10.1111/1755-0998.12047
