Linear discriminant analysis (LDA) is a well-known technique for linear classification, feature extraction, and dimension reduction. To improve the accuracy of LDA under the high dimension low sample size (HDLSS) settings, shrunken estimators, such as Graphical Lasso, can be used to strike a balance between biases and variances. Although the estimator with induced sparsity obtains a faster convergence rate, however, the introduced bias may also degrade the performance. In this paper, we theoretically analyze how the sparsity and the convergence rate of the precision matrix (also known as inverse covariance matrix) estimator would affect the classification accuracy by proposing an analytic model on the upper bound of an LDA misclassification rate. Guided by the model, we propose a novel classifier, DBSDA , which improves classification accuracy through debiasing. Theoretical analysis shows that DBSDA possesses a reduced upper bound of misclassification rate and better asymptotic properties than sparse LDA (SDA). We conduct experiments on both synthetic datasets and real application datasets to confirm the correctness of our theoretical analysis and demonstrate the superiority of DBSDA over LDA, SDA, and other downstream competitors under HDLSS settings.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2018.2846783DOI Listing

Publication Analysis

Top Keywords

misclassification rate
12
bound misclassification
8
linear discriminant
8
discriminant analysis
8
hdlss settings
8
convergence rate
8
classification accuracy
8
upper bound
8
theoretical analysis
8
lda sda
8

Similar Publications

: Clavicle injuries are common and seem to be frequently subject to diagnostic misclassification. The accurate identification of clavicle fractures is essential, particularly for registry and Big Data analyses. This study aims to assess the frequency of diagnostic errors in clavicle injury classifications.

View Article and Find Full Text PDF

Background: In longitudinal studies of older persons, complete ascertainment of mortality is needed to minimize potential biases. To ascertain mortality in the National Health and Aging Trends Study (NHATS), investigators are advised to use its Sensitive files, which include month and year of death on most decedents who had not dropped out of the study. Because losses to follow-up are not insubstantial, ascertainment of mortality is likely incomplete.

View Article and Find Full Text PDF

CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.

PLoS One

January 2025

Observational Health Data Analytics, Janssen Research and Development, LLC, Titusville, NJ, United States of America.

Objective: This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.

Materials And Methods: The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date.

View Article and Find Full Text PDF

Background: A previously published study at Norrland University Hospital, Umeå, Sweden, found that in 29.5% of patients with urinary bladder cancer (UBC) who underwent cystectomy, incorrect cT-stage (clinical T-stage) was registered in the Swedish National Register of Urinary Bladder Cancer (SNRUBC). Tumor in bladder diverticulum (TIBD) and tumor-associated hydronephrosis (TAH) were common causes for misclassification.

View Article and Find Full Text PDF

Correct classification of type 1 (T1D) and type 2 diabetes (T2D) is challenging due to overlapping clinical features and the increasingly early onset of T2D, particularly in South Asians. Polygenic risk scores (PRSs) for T1D and T2D have been shown to work relatively well in South Asians, despite being derived from largely European-ancestry samples. Here we used PRSs to investigate the rate of potential misclassification of diabetes amongst British Bangladeshis and Pakistanis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!