Processing Next-Generation Mass Spectrometry Imaging Data: Principal Component Analysis at Scale.

J Am Soc Mass Spectrom

The Maastricht MultiModal Molecular Imaging Institute (M4i), Division of Imaging Mass Spectrometry, Maastricht University, Maastricht 6229 ER, The Netherlands.

Published: December 2024

Mass spectrometry imaging (MSI) is constantly improving in spatial resolving power, throughput and mass resolution. Although beneficial, these improvements increase data set size and content. The larger data requires correspondingly fast computer-based analyses. However, these analyses often do not scale well with increased data size. Principal component analysis (PCA) is an important analytical tool commonly used with MSI data; however, most PCA algorithms load and process the entire data set within random access memory (RAM) which is most often insufficient for large data sets. PCA algorithms that use less RAM than the data set exist but are usually much slower or sacrifice precision and are rarely used for MSI data processing. Incremental PCA (IPCA) is an alternative algorithm that avoids large RAM allocations while also preserving speed and analytical precision. Here, we demonstrate and benchmark the use of differing implementations of IPCA, PCA, and commercial software on large and often complex MSI data sets. We show that using an already-published Python-based IPCA algorithm, IPCA can be successfully applied to MSI data sets too large to fit with RAM. Furthermore, our benchmarks demonstrate that, contrary to expectations, IPCA is faster than all other tested PCA implementations on all large data sets that can be directly compared.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622226PMC
http://dx.doi.org/10.1021/jasms.4c00314DOI Listing

Publication Analysis

Top Keywords

msi data
16
data sets
16
data
12
data set
12
mass spectrometry
8
spectrometry imaging
8
principal component
8
component analysis
8
pca algorithms
8
large data
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!