Increasing interest in new pattern recognition methods has been motivated by bioinformatics research. The analysis of gene expression data originating from microarrays constitutes an important application area for classification algorithms and illustrates the need for identifying important predictors. We show that the Goodman-Kruskal coefficient can be used for constructing minimal classifiers for tabular data, and we give an algorithm that constructs such classifiers.
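To make the idea of scoring predictors concrete, here is a minimal sketch in Python. It assumes the Goodman-Kruskal lambda variant (proportional reduction in prediction error); the abstract does not say which Goodman-Kruskal coefficient the authors use, and this is not their classifier-construction algorithm. The function name and the toy data are hypothetical.

```python
from collections import Counter, defaultdict


def goodman_kruskal_lambda(xs, ys):
    """Goodman-Kruskal lambda for predicting the class label y from x.

    Assumption: the lambda variant is used here for illustration only;
    the source abstract does not specify which coefficient it means.
    """
    if len(xs) != len(ys) or not xs:
        raise ValueError("xs and ys must be non-empty and of equal length")
    n = len(ys)
    # Correct guesses when always predicting the overall modal class.
    baseline_hits = max(Counter(ys).values())
    if baseline_hits == n:
        return 0.0  # a single class: nothing to improve (convention chosen here)
    # Correct guesses when predicting the modal class within each value of x.
    by_x = defaultdict(Counter)
    for x, y in zip(xs, ys):
        by_x[x][y] += 1
    conditional_hits = sum(max(c.values()) for c in by_x.values())
    return (conditional_hits - baseline_hits) / (n - baseline_hits)


# Hypothetical discretized expression levels for two genes and a class label.
label = ["tumor", "tumor", "normal", "normal"]
gene_a = ["hi", "hi", "lo", "lo"]   # separates the classes perfectly
gene_b = ["hi", "lo", "hi", "lo"]   # carries no information about the class

lambda_a = goodman_kruskal_lambda(gene_a, label)
lambda_b = goodman_kruskal_lambda(gene_b, label)
```

Under this assumption, a predictor with a higher lambda removes a larger share of the classification errors, so ranking attributes by such a score is one plausible way to pick a small set of important predictors.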

Source: http://dx.doi.org/10.1109/TBME.2004.827267

Similar Publications

In the context of the thriving real estate market in developing countries like Vietnam, understanding consumer preferences and effectively addressing them through a comprehensive multi-criteria decision-making (MCDM) framework is paramount for real estate providers. This study presents a two-stage MCDM model that integrates the Delphi technique and the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) based on Spherical Fuzzy Sets (SFSs). Initially, the SF-Delphi technique validates critical criteria influencing customers' apartment selection in Vietnam.

Correlations for untargeted GC × GC-HRTOF-MS metabolomics of colorectal cancer.

Metabolomics

September 2023

Department of Chemistry, Organic and Biological Analytical Chemistry Group, Quartier Agora, University of Liège, Allée du Six Août, B6c, B-4000, Liège, Sart Tilman, Belgium.

Introduction: Modern comprehensive instrumentation provides unprecedented coverage of complex matrices in the form of high-dimensional, information-rich data sets.

Objectives: In addition to the usual biomarker research that focuses on the detection of the studied condition, we aimed to define a proper strategy for conducting a correlation analysis on an untargeted colorectal cancer case study with a data set of 102 variables corresponding to metabolites obtained from serum samples analyzed with comprehensive two-dimensional gas chromatography coupled to high-resolution time-of-flight mass spectrometry (GC × GC-HRTOF-MS). Indeed, the strength of association between the metabolites contains potentially valuable information about the molecular mechanisms involved and the underlying metabolic network associated with a global perturbation, at no additional analytical effort.

Biological risk of Legionella pneumophila in irrigation systems.

Rev Salud Publica (Bogota)

July 2020

EG: Ph.D. in Epidemiology. Ph.D. in Public Health. M.Sc. in Preventive Medicine and Public Health. M.Sc. in Occupational Risk Management, Quality and Environment. M.Sc. in Occupational Risk Prevention in the Workplace. Pontificia Universidad Católica del Ecuador, Facultad de Medicina. Quito, Ecuador. Universidad de Málaga, Cátedra de Seguridad y Salud en el Trabajo. Málaga, Spain.

Objective: The goal of this study is to determine the risk of exposure to Legionella pneumophila in hotel golf courses located in the province of Malaga (Spain).

Method: Spray irrigation systems were analyzed as sources for spreading the Legionella bacterium. Spanish legislation requires that irrigation systems be monitored for their water quality as well as for reasons related to health and hygiene.

The product-moment correlation coefficient (PMC) is usually taken as a symmetric measure of association because it produces the same estimate irrespective of how the two variables in the analysis are declared. However, when one or both variables have non-continuous scales and the scales of the variables differ from each other, PMC is unambiguously a directional measure: the variable with the wider scale explains the order or response pattern in the variable with the narrower scale, and not in the opposite direction or symmetrically. If the scales of the variables differ from each other, PMC is also prone to radically underestimate the association; that is, the estimates are deflated.
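The deflation described above can be illustrated with a small Python sketch. The data are hypothetical and chosen only for illustration: a binary item is derived deterministically from a ten-point score, so the narrow-scale variable is completely explained by the wide-scale one, yet the product-moment correlation stays below 1.

```python
import math


def pearson(xs, ys):
    """Product-moment correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical wide-scale variable: scores 1..10.
score = list(range(1, 11))
# Hypothetical narrow-scale variable: an item answered correctly (1)
# exactly when the score exceeds 5, so it is fully determined by the score.
item = [1 if s > 5 else 0 for s in score]

r_same_scale = pearson(score, score)   # identical scales
r_mixed_scale = pearson(score, item)   # wide scale vs. binary scale
```

Here `r_mixed_scale` comes out near 0.87 even though the item is a deterministic function of the score, which is one simple instance of the underestimation the abstract refers to.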

Underestimation of reliability is discussed from the viewpoint of deflation in reliability estimates caused by artificial, systematic technical or mechanical error in the estimates of correlation (MEC). Most traditional estimators of reliability embed the product-moment correlation coefficient (PMC) in the form of an item-score correlation or a principal component or factor loading. PMC is known to be severely affected by several sources of deflation, such as the difficulty level of the item and discrepancy between the scales of the variables of interest; hence, estimates based on these quantities are always deflated in settings related to estimating reliability.
