Background: Many methods for dimensionality reduction of large data sets such as those generated in microarray studies boil down to the Singular Value Decomposition (SVD). Although singular vectors associated with the largest singular values have strong optimality properties and can often be quite useful as a tool to summarize the data, they are linear combinations of up to all of the data points, and thus it is typically quite hard to interpret those vectors in terms of the application domain from which the data are drawn. Recently, an alternative dimensionality reduction paradigm, CUR matrix decompositions, has been proposed to address this problem and has been applied to genetic and internet data. CUR decompositions are low-rank matrix decompositions that are explicitly expressed in terms of a small number of actual columns and/or actual rows of the data matrix. Since they are constructed from actual data elements, CUR decompositions are interpretable by practitioners of the field from which the data are drawn.
Results: We present an implementation to perform CUR matrix decompositions, in the form of a freely available, open source R-package called rCUR. This package will help users to perform CUR-based analysis on large-scale data, such as those obtained from different high-throughput technologies, in an interactive and exploratory manner. We show two examples that illustrate how CUR-based techniques make it possible to reduce significantly the number of probes, while at the same time maintaining major trends in data and keeping the same classification accuracy.
Conclusions: The package rCUR provides functions for the users to perform CUR-based matrix decompositions in the R environment. In gene expression studies, it gives an additional way of analysis of differential expression and discriminant gene selection based on the use of statistical leverage scores. These scores, which have been used historically in diagnostic regression analysis to identify outliers, can be used by rCUR to identify the most informative data points with respect to which to express the remaining data points.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546429 | PMC |
http://dx.doi.org/10.1186/1471-2105-13-103 | DOI Listing |
J Phys Chem Lett
January 2025
Research Center for Materials Nanoarchitectonics (MANA), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan.
We fabricated Co-based catalysts by the low-temperature thermal decomposition of R-Co intermetallics (R = Y, La, or Ce) to reduce the temperature of ammonia cracking for hydrogen production. The catalysts synthesized are nanocomposites of Co/RO with a metal-rich composition. In the Co/LaO catalyst derived from LaCo, Co nanoparticles of 10-30 nm size are enclosed by the LaO matrix.
View Article and Find Full Text PDFMaize is a staple crop worldwide, essential for food security, livestock feed, and industrial uses. Its health directly impacts agricultural productivity and economic stability. Effective detection of maize crop health is crucial for preventing disease spread and ensuring high yields.
View Article and Find Full Text PDFArtif Intell Med
December 2024
Medical Image and Signal Processing Research Center, Isfahan University of Medical Sciences, Isfahan 81746-73461, Iran. Electronic address:
Modeling Optical Coherence Tomography (OCT) images is crucial for numerous image processing applications and aids ophthalmologists in the early detection of macular abnormalities. Sparse representation-based models, particularly dictionary learning (DL), play a pivotal role in image modeling. Traditional DL methods often transform higher-order tensors into vectors and then aggregate them into a matrix, which overlooks the inherent multi-dimensional structure of the data.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Laboratory of Target Microwave Properties, Deqing Academy of Satellite Applications, Deqing 313200, China.
Using microwave remote sensing to invert forest parameters requires clear canopy scattering characteristics, which can be intuitively investigated through scattering measurements. However, there are very few ground-based measurements on forest branches, needles, and canopies. In this study, a quantitative analysis of the canopy branches, needles, and ground contribution of Masson pine scenes in C-, X-, and Ku-bands was conducted based on a microwave anechoic chamber measurement platform.
View Article and Find Full Text PDFPolymers (Basel)
December 2024
Key Laboratory of Organosilicon Chemistry and Material Technology, College of Material, Chemistry and Chemical Engineering, Ministry of Education, Hangzhou Normal University, Hangzhou 311121, China.
A series of Si-H- or Si-Vi-terminated, branched and linear oligomers containing MeSiO segments were prepared by equilibrium polymerization or non-equilibrium polymerization initiated by living anions, respectively. These oligomers were used to improve the defects of concentrated crosslinking points and the high hardness of crosslinked products when using phenyltris(dimethylsiloxy)silane or 1,1,5,5-tetramethyl-3,3-diphenyl trisiloxane as crosslinking agents in the preparation of silicone gel. NMR, FT-IR, and GPC characterized the structure and molecular weight information of the prepared oligomers.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!