Molecular profiling data (e.g., gene expression) have been used for clinical risk prediction and biomarker discovery. However, prior knowledge such as biological pathways or gene interaction networks needs to be integrated to improve the predictive ability and biological interpretability of biomarkers. Here, we first introduce a general regularized Logistic Regression (LR) framework with regularization term $\lambda\|w\|_1 + \eta w^T M w$, which reduces to different penalties, including Lasso, elastic net, and network-regularized terms, for different choices of $M$. This framework can be solved in a unified manner by a cyclic coordinate descent algorithm, which avoids matrix inversion and speeds up computation. However, if the estimated coefficients $w_i$ and $w_j$ of two connected genes have opposite signs, the traditional network-regularized penalty may not perform well. To address this, we introduce a novel network-regularized sparse LR model with a new penalty, $\lambda\|w\|_1 + \eta |w|^T M |w|$, that considers the difference between the absolute values of the coefficients. We develop two efficient algorithms to solve it. Finally, we test our methods and compare them with related ones on simulated and real data to show their efficiency.
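
The opposite-sign issue mentioned in the abstract can be illustrated with a small numerical sketch. It assumes the network term is a quadratic form in the coefficient vector, with the matrix taken to be the unnormalized Laplacian of a two-gene network; the function names, the toy matrix, and the parameter values are illustrative choices, not taken from the paper.

```python
def quad_form(w, M):
    """Return w^T M w for a vector w and a square matrix M (lists of floats)."""
    n = len(w)
    return sum(w[i] * M[i][j] * w[j] for i in range(n) for j in range(n))


def network_penalty(w, M, lam, eta):
    """Classical form (assumed): lam * ||w||_1 + eta * w^T M w."""
    return lam * sum(abs(x) for x in w) + eta * quad_form(w, M)


def abs_network_penalty(w, M, lam, eta):
    """Absolute-value variant (assumed): lam * ||w||_1 + eta * |w|^T M |w|."""
    a = [abs(x) for x in w]
    return lam * sum(a) + eta * quad_form(a, M)


# Unnormalized Laplacian of a two-gene network with a single edge (toy example).
L = [[1.0, -1.0],
     [-1.0, 1.0]]

w_same = [1.0, 1.0]       # connected genes, same sign
w_opposite = [1.0, -1.0]  # connected genes, equal magnitude, opposite signs

lam, eta = 0.0, 1.0  # drop the L1 part to isolate the network term

print(network_penalty(w_same, L, lam, eta))          # 0.0
print(network_penalty(w_opposite, L, lam, eta))      # 4.0
print(abs_network_penalty(w_opposite, L, lam, eta))  # 0.0
```

Under these assumptions, the classical term charges the opposite-sign pair a cost of 4 even though both genes have the same effect size, while the absolute-value variant treats the pair as equally smooth over the network.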

Source
http://dx.doi.org/10.1109/TCBB.2016.2640303

Publication Analysis

Top Keywords

network-regularized sparse: 8
logistic regression: 8
clinical risk: 8
risk prediction: 8
prediction biomarker: 8
biomarker discovery: 8
network-regularized: 4
sparse logistic: 4
regression models: 4
models clinical: 4

Similar Publications

scPAS: single-cell phenotype-associated subpopulation identifier.

Brief Bioinform

November 2024

College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 157 Baojian Road, Heilongjiang 150081, China.

Despite significant advancements in single-cell sequencing analysis for characterizing tissue sample heterogeneity, identifying the associations between cell subpopulations and disease phenotypes remains a challenging task. Here, we introduce scPAS, a new bioinformatics tool designed to integrate bulk data to identify phenotype-associated cell subpopulations within single-cell data. scPAS employs a network-regularized sparse regression model to quantify the association between each cell in single-cell data and a phenotype.

Article Synopsis
  • Splicing dysregulation due to spliceosomal mutations is linked to disease progression and treatment resistance primarily in blood cancers, while solid tumors show widespread splicing disorders that may promote tumor development.
  • A new computational tool called SMNPLS was created to analyze splicing dysregulation patterns in various solid tumors by examining the relationship between splicing factors and alternative splicing events.
  • The study identifies six distinct splicing dysregulation patterns affecting 40% of solid tumors and reveals similarities in dysregulation across certain cancer types, while highlighting unique patterns in brain tumors and discovering potential oncogenic splicing relationships.

With the development of high-throughput technologies, the accumulation of large amounts of multidimensional genomic data provides an excellent opportunity to study multilevel biological regulatory relationships in cancer. According to the competing endogenous RNA (ceRNA) network hypothesis, lncRNAs can relieve the inhibition that microRNAs (miRNAs) exert on their target genes by binding to intracellular miRNA sites, thereby raising the expression level of those target genes. However, previous studies of cancer expression mechanisms are mostly based on individual or two-dimensional data and lack integrated analysis of multiple types of RNA-seq data, making it difficult to verify the complex biological relationships involved.


Network-based cancer genomic data integration for pattern discovery.

BMC Genom Data

December 2021

School of Mathematics and Computer Science, Jiangxi Science and Technology Normal University, Nanchang, 330038, China.

Background: Since genes involved in the same biological modules usually present correlated expression profiles, many computational methods have been proposed to identify gene functional modules from expression profile data. Recently, the Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data and identify gene modules. However, this model handles only gene expression data and does not integrate gene interaction information.


Sparse Partial Least Squares Methods for Joint Modular Pattern Discovery.

Methods Mol Biol

December 2020

NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.

The underlying relationship between genomic factors and the response to diverse cancer drugs remains unclear. A number of studies have shown that patients' heterogeneous responses to anticancer treatments are partly associated with their specific changes in gene expression and somatic alterations. However, identifying the multiple-to-multiple relationships between genomic factors and drug response in pharmacogenomics data is still a challenging issue.
