Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery.

Bioinformatics

Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Campus E1.4, 66123 Saarbrücken and Saarbrücken Graduate School of Computer Science, Saarland University, 66123 Saarbrücken.

Published: June 2015

Motivation: Despite ongoing cancer research, available therapies are still limited in quantity and effectiveness, and making treatment decisions for individual patients remains a hard problem. Established subtypes, which help guide these decisions, are mainly based on individual data types. However, the analysis of multidimensional patient data involving the measurements of various molecular features could reveal intrinsic characteristics of the tumor. Large-scale projects accumulate this kind of data for various cancer types, but we still lack the computational methods to reliably integrate this information in a meaningful manner. Therefore, we apply and extend current multiple kernel learning for dimensionality reduction approaches. On the one hand, we add a regularization term to avoid overfitting during the optimization procedure, and on the other hand, we show that one can even use several kernels per data type and thereby alleviate the user from having to choose the best kernel functions and kernel parameters for each data type beforehand.

Results: We have identified biologically meaningful subgroups for five different cancer types. Survival analysis has revealed significant differences between the survival times of the identified subtypes, with P values comparable or even better than state-of-the-art methods. Moreover, our resulting subtypes reflect combined patterns from the different data sources, and we demonstrate that input kernel matrices with only little information have less impact on the integrated kernel matrix. Our subtypes show different responses to specific therapies, which could eventually assist in treatment decision making.

Availability And Implementation: An executable is available upon request.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765854PMC
http://dx.doi.org/10.1093/bioinformatics/btv244DOI Listing

Publication Analysis

Top Keywords

data types
8
multiple kernel
8
kernel learning
8
cancer types
8
data type
8
kernel
6
data
6
integrating data
4
types
4
types regularized
4

Similar Publications

Neovascular age-related macular degeneration and diabetic macular edema are leading causes of vision-loss evoked by retinal neovascularization and vascular leakage. The glycoprotein microfibrillar-associated protein 4 (MFAP4) is an integrin αβ ligand present in the extracellular matrix. Single-cell transcriptomics reveal MFAP4 expression in cell-types in close proximity to vascular endothelial cells including choroidal vascular mural cells and retinal astrocytes and Müller cells.

View Article and Find Full Text PDF

Objective: Understanding healthcare-seeking propensity is crucial for optimizing healthcare utilization, especially for patients with chronic conditions like hypertension or diabetes, given their substantial burden on healthcare systems globally. This study aims to evaluate hypertensive or diabetic patients' healthcare-seeking propensity based on the severity of symptoms, categorizing symptoms as either major or minor. It also explores factors influencing healthcare-seeking propensity and examines whether healthcare-seeking propensity affects healthcare utilization and preventable hospitalizations.

View Article and Find Full Text PDF

Streptococcus dysgalactiae (S. dysgalactiae ) is a common pathogen of humans and various animals. However, the phylogenetic position of animal S.

View Article and Find Full Text PDF

Estimating self-performance when making complex decisions.

Sci Rep

January 2025

Centre for Brain, Mind and Markets, Faculty of Business and Economics, The University of Melbourne, Melbourne, Australia.

Metacognition, the ability to monitor and reflect on our own mental states, enables us to assess our performance at different levels - from confidence in individual decisions to overall self-performance estimates (SPEs). It plays a particularly important part in computationally complex decisions that require a high level of cognitive resources, as the allocation of such limited resources presumably is based on metacognitive evaluations. However, little is known about metacognition in complex decisions, in particular, how people construct SPEs.

View Article and Find Full Text PDF

Breast cancer is one of the most aggressive types of cancer, and its early diagnosis is crucial for reducing mortality rates and ensuring timely treatment. Computer-aided diagnosis systems provide automated mammography image processing, interpretation, and grading. However, since the currently existing methods suffer from such issues as overfitting, lack of adaptability, and dependence on massive annotated datasets, the present work introduces a hybrid approach to enhance breast cancer classification accuracy.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!