Canonical Correlation Analysis and Partial Least Squares for Identifying Brain-Behavior Associations: A Tutorial and a Comparative Study.

Biol Psychiatry Cogn Neurosci Neuroimaging

Centre for Medical Image Computing, Department of Computer Science, University College London, London, United Kingdom; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom.

Published: November 2022

Canonical correlation analysis (CCA) and partial least squares (PLS) are powerful multivariate methods for capturing associations across 2 modalities of data (e.g., brain and behavior). However, when the sample size is similar to or smaller than the number of variables in the data, standard CCA and PLS models may overfit, i.e., find spurious associations that generalize poorly to new data. Dimensionality reduction and regularized extensions of CCA and PLS have been proposed to address this problem, yet most studies using these approaches have some limitations. This work gives a theoretical and practical introduction into the most common CCA/PLS models and their regularized variants. We examine the limitations of standard CCA and PLS when the sample size is similar to or smaller than the number of variables. We discuss how dimensionality reduction and regularization techniques address this problem and explain their main advantages and disadvantages. We highlight crucial aspects of the CCA/PLS analysis framework, including optimizing the hyperparameters of the model and testing the identified associations for statistical significance. We apply the described CCA/PLS models to simulated data and real data from the Human Connectome Project and Alzheimer's Disease Neuroimaging Initiative (both of n > 500). We use both low- and high-dimensionality versions of these data (i.e., ratios between sample size and variables in the range of ∼1-10 and ∼0.1-0.01, respectively) to demonstrate the impact of data dimensionality on the models. Finally, we summarize the key lessons of the tutorial.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.bpsc.2022.07.012DOI Listing

Publication Analysis

Top Keywords

sample size
12
cca pls
12
canonical correlation
8
correlation analysis
8
partial squares
8
size smaller
8
smaller number
8
number variables
8
standard cca
8
data dimensionality
8

Similar Publications

Background: The recent Movement Disorders Society (MDS)-progressive supranuclear palsy (PSP) diagnostic criteria conceptualized three clinical diagnostic certainty levels: "suggestive of PSP" for sensitive early diagnosis based on subtle clinical signs, "possible PSP" balancing sensitivity and specificity, and "probable PSP" highly specific for PSP pathology.

Objective: The aim of this study was to prospectively validate the criteria against long-term clinical follow-up and characterize the diagnostic certainty increase over time.

Methods: Patients with "possible PSP" or "suggestive of PSP" diagnosis and clinical follow-up were recruited in two German multicenter longitudinal observational studies (ProPSP and DescribePSP).

View Article and Find Full Text PDF

Cerebral palsy (CP) manifests with abnormal posture and impaired selective motor control, notably affecting trunk control and dynamic balance coordination, leading to inadequate postural control. Previous research has indicated the benefits of pulsed electromagnetic field (PEMF) therapy for various musculoskeletal and neurological conditions. Therefore, we conducted a randomized pilot study to assess the feasibility of our preliminary research design and examine the effect of the PEMF treatment among children with CP.

View Article and Find Full Text PDF

Aortic valve calcification results from degenerative processes associated with several pathologies. These processes are influenced by age, chronic inflammation, and high concentrations of phosphate ions in the plasma, which contribute to induce mineralization in the aortic valve and deterioration of cardiovascular health. Environmental factors, such as wood smoke that emits harmful and carcinogenic pollutants, carbon monoxide (CO), and nitrogen oxide (NO), as well as other reactive compounds may also be implicated.

View Article and Find Full Text PDF

: Sudden cardiac death (SCD) poses a significant burden on the modern-day public health system; however, while our understanding of the underlying pathophysiology is still evolving and may not be complete, many insights are known and applied every day. Targeted prevention methods are continually being developed and refined. We conducted a systemic review and meta-analysis to identify a blood nutritional biomarker that can predict and screen population groups at high risk for cardiovascular disease mortality (CVD mortality) or SCD.

View Article and Find Full Text PDF

: Temporomandibular disorders (TMDs) represent a significant public health issue, among which masticatory muscle pain is the most common. Current publications increasingly indicate surface electromyography (sEMG) as an effective diagnostic tool for muscle dysfunctions that may be employed in TMDs recognition. The objective of this study was to establish reference ranges for TMDs patients with masticatory muscle pain and healthy individuals in the electromyographic Functional Clenching Index (FCI) for the temporalis muscles (TAs) and masseter muscles (MMs).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!