stepwiseCM: An R Package for Stepwise Classification of Cancer Samples Using Multiple Heterogeneous Data Sets.

Cancer Inform

Department of Epidemiology and Biostatistics, VU University Medical Center, Amsterdam, The Netherlands. ; Department of Mathematics, VU University, Amsterdam, The Netherlands.

Published: June 2014

This paper presents the R/Bioconductor package stepwiseCM, which classifies cancer samples using two heterogeneous data sets in an efficient way. The algorithm is able to capture the distinct classification power of two given data types without actually combining them. This package suits for classification problems where two different types of data sets on the same samples are available. One of these data types has measurements on all samples and the other one has measurements on some samples. One is easy to collect and/or relatively cheap (eg, clinical covariates) compared to the latter (high-dimensional data, eg, gene expression). One additional application for which stepwiseCM is proven to be useful as well is the combination of two high-dimensional data types, eg, DNA copy number and mRNA expression. The package includes functions to project the neighborhood information in one data space to the other to determine a potential group of samples that are likely to benefit most by measuring the second type of covariates. The two heterogeneous data spaces are connected by indirect mapping. The crucial difference between the stepwise classification strategy implemented in this package and the existing packages is that our approach aims to be cost-efficient by avoiding measuring additional covariates, which might be expensive or patient-unfriendly, for a potentially large subgroup of individuals. Moreover, in diagnosis for these individuals test, results would be quickly available, which may lead to reduced waiting times and hence lower the patients' distress. The improvement described remedies the key limitations of existing packages, and facilitates the use of the stepwiseCM package in diverse applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3885337PMC
http://dx.doi.org/10.4137/CIN.S13075DOI Listing

Publication Analysis

Top Keywords

heterogeneous data
12
data sets
12
data types
12
data
9
stepwisecm package
8
stepwise classification
8
cancer samples
8
measurements samples
8
high-dimensional data
8
existing packages
8

Similar Publications

Robust multi-source geographic entities matching by maximizing geometric and semantic similarity.

Sci Rep

December 2024

Department of Geographic Information System, Chinese Academy of Surveying and mapping, Beijing, 100036, China.

Geographic entity matching is an important means for multi-source spatial data fusion and information association and sharing. Corresponding matching methods have been designed by existing studies for different types of entity data characteristics, such as line and area. However, these approaches are often limited in the generalization ability for matching heterogeneous data from multiple sources and the accuracy for complex pattern matching.

View Article and Find Full Text PDF

Turning attention to tumor-host interface and focus on the peritumoral heterogeneity of glioblastoma.

Nat Commun

December 2024

Cancer Center, Department of Neurosurgery, Zhejiang Provincial People's Hospital,Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China.

Approximately 90% of glioblastoma recurrences occur in the peritumoral brain zone (PBZ), while the spatial heterogeneity of the PBZ is not well studied. In this study, two PBZ tissues and one tumor tissue sample are obtained from each patient via preoperative imaging. We assess the microenvironment and the characteristics of infiltrating immune/tumor cells using various techniques.

View Article and Find Full Text PDF

Probing regional glycogen metabolism in humans non-invasively has been challenging due to a lack of sensitive approaches. Here we studied human muscle glycogen dynamics post-exercise with a spatial resolution of millimeters and temporal resolution of minutes, using relayed nuclear Overhauser effect (glycoNOE) MRI. Data at 5T showed a homogeneous distribution of glycogen in resting muscle, with an average concentration of 99 ± 13 mM.

View Article and Find Full Text PDF

Background: Virtual surgical planning (VSP) is an emerging method in head and neck reconstruction with demonstrated benefits, however, its economic viability is supported with mixed evidence.

Methods: A structured search was performed in five electronic databases. Studies that performed an economic evaluation on VSP in head and neck reconstruction were included.

View Article and Find Full Text PDF

Metabolic Analysis of Tumor Cells Within Ameloblastoma at the Single-Cell Level.

Oral Dis

December 2024

State Key Laboratory of Oral & Maxillofacial Reconstruction and Regeneration, Key Laboratory of Oral Biomedicine Ministry of Education, Hubei Key Laboratory of Stomatology, School & Hospital of Stomatology, Wuhan University, Wuhan, China.

Background: To meet their high energy needs, tumor cells undergo aberrant metabolic reprogramming. A tumor cell may expertly modify its metabolic pathways and the differential expression of the genes for metabolic enzymes. The physiological requirements of the host tissue and the tumor cell of origin mostly dictate metabolic adaptation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!