This work deals with the use of multiple correspondence analysis (MCA) and a weighted Euclidean distance (the tolerance distance) as an exploratory tool in developing predictive logistic models. The method was applied to a living-donor kidney transplant data set with 109 cases and 13 predictors. This approach, followed by backward and forward selection procedures, yielded two models, one with four and another with two predictors. These models were compared to two other models, ordinarily built by backward and forward stepwise selection, which yielded, respectively, five and two predictors. After internal validation, the models performance statistics showed similar results. Likelihood ratio tests suggested that backward approach achieved a better fit than the forward modelling in both methods and the Vuong's non-nested test between backward-built models suggested that these were undistinguishable. We conclude that the tolerance distance, in combination with MCA, could be a feasible method for variable selection in logistic modelling, when there are several categorical predictors.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.cmpb.2009.02.003 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!