Identification of risk factors in patients with a particular disease can be analyzed in clinical data sets by using feature selection procedures of pattern recognition and data mining methods. The applicability of the relaxed linear separability (RLS) method of feature subset selection was checked for high-dimensional and mixed type (genetic and phenotypic) clinical data of patients with end-stage renal disease. The RLS method allowed for substantial reduction of the dimensionality through omitting redundant features while maintaining the linear separability of data sets of patients with high and low levels of an inflammatory biomarker. The synergy between genetic and phenotypic features in differentiation between these two subgroups was demonstrated.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3904924PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0086630PLOS

Publication Analysis

Top Keywords

genetic phenotypic
12
linear separability
12
phenotypic features
8
relaxed linear
8
clinical data
8
data sets
8
rls method
8
selection genetic
4
features associated
4
associated inflammatory
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!