Integrating Data Transformation in Principal Components Analysis.

J Comput Graph Stat

Department of Biostatistics, Division of Quantitative Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX.

Published: January 2015

Principal component analysis (PCA) is a popular dimension reduction method to reduce the complexity and obtain the informative aspects of high-dimensional datasets. When the data distribution is skewed, data transformation is commonly used prior to applying PCA. Such transformation is usually obtained from previous studies, prior knowledge, or trial-and-error. In this work, we develop a model-based method that integrates data transformation in PCA and finds an appropriate data transformation using the maximum profile likelihood. Extensions of the method to handle functional data and missing values are also developed. Several numerical algorithms are provided for efficient computation. The proposed method is illustrated using simulated and real-world data examples.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4408558PMC
http://dx.doi.org/10.1080/10618600.2014.891461DOI Listing

Publication Analysis

Top Keywords

data transformation
16
data
6
transformation
5
integrating data
4
transformation principal
4
principal components
4
components analysis
4
analysis principal
4
principal component
4
component analysis
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!