Convex biclustering.

Biometrics

Department of Electrical and Computer Engineering, Rice University, 6100 Main St, Houston, Texas, U.S.A.

Published: March 2017

In the biclustering problem, we seek to simultaneously group observations and features. While biclustering has applications in a wide array of domains, ranging from text mining to collaborative filtering, the problem of identifying structure in high-dimensional genomic data motivates this work. In this context, biclustering enables us to identify subsets of genes that are co-expressed only within a subset of experimental conditions. We present a convex formulation of the biclustering problem that possesses a unique global minimizer and an iterative algorithm, COBRA, that is guaranteed to identify it. Our approach generates an entire solution path of possible biclusters as a single tuning parameter is varied. We also show how to reduce the problem of selecting this tuning parameter to solving a trivial modification of the convex biclustering problem. The key contributions of our work are its simplicity, interpretability, and algorithmic guarantees-features that arguably are lacking in the current alternative algorithms. We demonstrate the advantages of our approach, which includes stably and reproducibly identifying biclusterings, on simulated and real microarray data.

Download full-text PDF

Source
http://dx.doi.org/10.1111/biom.12540DOI Listing

Publication Analysis

Top Keywords

biclustering problem
12
convex biclustering
8
tuning parameter
8
biclustering
5
problem
5
biclustering biclustering
4
problem seek
4
seek simultaneously
4
simultaneously group
4
group observations
4

Similar Publications

Biclustering is the task of simultaneously clustering the samples and features of a data set. In doing so, subsets of samples that exhibit similar behaviors across subsets of features can be identified. Motivated by a longitudinal diffusion tensor imaging study of sport-related concussion (SRC), we present the problem of biclustering multivariate longitudinal data in which subjects and features are grouped simultaneously based on longitudinal patterns rather than magnitude.

View Article and Find Full Text PDF

The existing biclustering algorithms often depend on assumptions like monotonicity or linearity of feature relations for finding biclusters. Though a few algorithms overcome this problem using density-based methods, they tend to miss out many biclusters because they use global criteria for identifying dense regions. The proposed method, PF-RelDenBi, uses local variations in marginal and joint densities for each pair of features to find the subset of observations, forming the basis of the relation between them.

View Article and Find Full Text PDF

Enhancer-driven gene regulatory networks inference from single-cell RNA-seq and ATAC-seq data.

Brief Bioinform

July 2024

Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States.

Deciphering the intricate relationships between transcription factors (TFs), enhancers, and genes through the inference of enhancer-driven gene regulatory networks (eGRNs) is crucial in understanding gene regulatory programs in a complex biological system. This study introduces STREAM, a novel method that leverages a Steiner forest problem model, a hybrid biclustering pipeline, and submodular optimization to infer eGRNs from jointly profiled single-cell transcriptome and chromatin accessibility data. Compared to existing methods, STREAM demonstrates enhanced performance in terms of TF recovery, TF-enhancer linkage prediction, and enhancer-gene relation discovery.

View Article and Find Full Text PDF

Biclustering of Log Data: Insights from a Computer-Based Complex Problem Solving Assessment.

J Intell

January 2024

Collaborative Innovation Center of Assessment for Basic Education Quality, Beijing Normal University, Beijing 100875, China.

Computer-based assessments provide the opportunity to collect a new source of behavioral data related to the problem-solving process, known as log file data. To understand the behavioral patterns that can be uncovered from these process data, many studies have employed clustering methods. In contrast to one-mode clustering algorithms, this study utilized biclustering methods, enabling simultaneous classification of test takers and features extracted from log files.

View Article and Find Full Text PDF

Research hotspot and trend of chronic wounds: A bibliometric analysis from 2013 to 2022.

Wound Repair Regen

November 2023

Department of Burns and Plastic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan, China.

Chronic wounds have been confirmed as a vital health problem facing people in the global population aging process. While significant progress has been achieved in the study of chronic wounds, the treatment effect should be further improved. The number of publications regarding chronic wounds has been rising rapidly.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!