Assessment of hierarchical clustering methodologies for proteomic data mining.

J Proteome Res

UR 1213, Unité de Recherches sur les Herbivores, Equipe Croissance et Métabolisme du Muscle, INRA de Clermont-Ferrand/Theix, F-63122 [corrected] Saint-Genès Champanelle, France.

Published: January 2007

Hierarchical clustering methodology is a powerful data mining approach for a first exploration of proteomic data. It enables samples or proteins to be grouped blindly according to their expression profiles. Nevertheless, the clustering results depend on parameters such as data preprocessing, between-profile similarity measurement, and the dendrogram construction procedure. We assessed several clustering strategies by calculating the F-measure, a widely used quality metric. The combination, on logged matrix, of Pearson correlation and Ward's methods for data aggregation is among the best clustering strategies, at least with the data sets we studied. This study was carried out using PermutMatrix, a freely available software derived from transcriptomics.

Download full-text PDF

Source
http://dx.doi.org/10.1021/pr060343hDOI Listing

Publication Analysis

Top Keywords

hierarchical clustering
8
proteomic data
8
data mining
8
clustering strategies
8
data
6
clustering
5
assessment hierarchical
4
clustering methodologies
4
methodologies proteomic
4
mining hierarchical
4

Similar Publications

Role of riverbed sand mining on planform and cross-sectional morphology of Mayurakshi River, India.

Sci Total Environ

January 2025

Laboratorio de Geografía Física, Escuela de Geografía, Universidad de Costa Rica, Costa Rica.

Human interventions in the form of riverbed sand mining are escalating worldwide, especially in the humid tropics with excess population pressure exerting an elevated demand for sand as construction materials. Naturally, channel morphological alterations are observed for the tropical fluvial systems to a large extent. The present work examines the riverbed sand mining of the Mayurakshi River (India) during the last fifty years (1970-2020) using topographical maps, satellite images and field-based cross-sectional measurements.

View Article and Find Full Text PDF

Nanoparticles (NPs) exhibit high reactivity and mobility in the environment, and a significant capacity to penetrate living organisms, potentially leading to harmful effects. Volcanoes are the second major source of natural NPs emitted into the atmosphere, with an estimated flux of 342 Tg/year. Few studies have focused on their fate.

View Article and Find Full Text PDF

Flexible and modular latent transition analysis-A tutorial using R.

PLoS One

January 2025

National Institute of Public Health, University of Southern Denmark, Copenhagen K, Denmark.

Latent transition analysis (LTA) is a useful statistical modelling approach for describe transitions between latent classes over time. LTA may be characterized in terms of prevalence at each time point and through transition probabilities over time. Investigating predictors of these transitions is often of key interest.

View Article and Find Full Text PDF

Dry evergreen Afromontane forests are severely threatened due to the expansion of agriculture and overgrazing by livestock. The objective of this study was to investigate the composition of woody species, structure, regeneration status and plant communities in Seqela forest, as well as the relationship between plant community types and environmental variables. Systematic sampling was used to collect vegetation and environmental data from 52 (20 m x 20 m) (400 m2) plots.

View Article and Find Full Text PDF

The identification and typing of bacteria are very expensive and time-consuming due to their growth times, and the expertise needed. MALDI-TOF MS represents a fast technique, reproducible with molecular approaches. This technique is still poorly applied in Legionella surveillance with estimation occurring only at the genus level.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!